Commit Graph

67 Commits (b6ce4f8b2fa85304cc3b95299d82212e90c663d7)

Author SHA1 Message Date
zhaoyuchen2018 4a544762a2 Add Asypadding for conv fusion. (#21041)
6 years ago
hong 8c4573a3cb GradMaker for dygraph (#19706)
6 years ago
Yiqun Liu 03ba0fdae6 Move the codes of fused operators to operators/fused directory. (#20881)
6 years ago
Chen Weihang 26cc1fe508 Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
6 years ago
Zeng Jinle 4922eb6da5 make_conv_workspace_size_configurable, test=develop (#20662)
6 years ago
qingqing01 01eddc1a04 Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636)
6 years ago
Zeng Jinle 48029ab06c Remove some DefaultGradOpDescMaker (#20185)
6 years ago
Yiqun Liu 3cd985a669 Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
Yiqun Liu a65c728e5d Implement the GPU kernel of fc operator (#19687)
6 years ago
Tao Luo d6c85c96dc paddle::framework::vectorize() templatization (#19627)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
Leo Chen 80eab822c1 Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166)
6 years ago
石晓伟 ee2f296ef8 Fusion: seqpool_cvm_concat (#18471)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
Yiqun Liu 8fd39f3e99 Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
Zeng Jinle 0c335dcd2c Make conv cudnn workspace size configurable (#17036)
6 years ago
sneaxiy 2c836ff914 check default grad maker
6 years ago
Qiyang Min c7f1f3ed0c Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
luotao1 d9f0e7252a refine with comments
6 years ago
luotao1 721c2c00ef refine fc_infershape
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
luotao1 5d20954ac4 add runtime shape for fuse_emb_seq_pool_grad
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
tensor-tang 14a764c930 simplify the jitkernel templates and tests
6 years ago
tensor-tang 802f362ac4 unify the kernelfuncs cache and add unit test
6 years ago
tensor-tang 41a1270856 add vbroadcast jitkernel refer code and use it
6 years ago
luotao1 34404f9c31 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool
6 years ago
tensor-tang e1c707fe9c fix warnings (#15790)
6 years ago
tensor-tang a3a3d3d861 add embseqpool jitkernel mkl impl and use it
6 years ago
tensor-tang 18bff5298d extract fused_emb_seq_pool forward function
6 years ago
tensor-tang ba02ac4692 use mat attr and refine test (#15448)
6 years ago
chengduo f8f91fb4b3 Revert conv transpose cudnn (#15514)
6 years ago
chengduo 5a8bd82c0c Remove workspace_handle (#15376)
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
7 years ago
tensor-tang 38de1ff472 add fusion squared mat sub op
7 years ago
tensor-tang f347d6e4a1 add repeated fc relu unit test
7 years ago
tensor-tang 99010e6eae init repeated fc relu op
7 years ago
tensor-tang 8e086a8521 follow comment and fix typo
7 years ago
tensor-tang f702f8fd10 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
tensor-tang 316636404f add seqpool concat unit test
7 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
7 years ago
minqiyang 0f94c1ac14 Polish code
7 years ago
minqiyang c09a379015 remove const_cast
7 years ago
minqiyang db8eb9b688 Polish code
7 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
7 years ago