Commit Graph

157 Commits (8c19d7aa2f89a38b3a68e53c73d88af16a3de8ce)

Author SHA1 Message Date
jakpiase 5b4f8aac82
Added LSTM BF16 and fixed GRU BF16 (#31234)
4 years ago
Qi Li 84639b6193
[ROCM] update fluid operators for rocm (part3), test=develop (#31213)
4 years ago
Wojciech Uss 615d8a2264
Modify relu native implementation 2 (#30996)
4 years ago
jakpiase f8da5536ed
REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719)
4 years ago
Tao Luo 824a79d383
Revert "Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)" (#30708)
4 years ago
jakpiase d834f4e6e8
Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)
4 years ago
arlesniak 5bf25d1e8b
More precise mkldnn kernel rules in GetExpectedKernelType (#29840)
4 years ago
Jacek Czaja 173660be7b
[oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358)
4 years ago
石晓伟 a0ee09148e
enhance error msgs of fusion_seqpool_cvm_concat_op.cc, test=develop (#30240)
4 years ago
wangchaochaohu 8dcae0c55d
register OPMaker and Infer Shape Check for fused_elementwise_add (#30259)
4 years ago
AshburnLee 924aac2216
Add tf32 switch for cuDNN (#29192)
4 years ago
wangchaochaohu af80859dd6
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
4 years ago
石晓伟 53bb126510
fix a bug in op_version_registry, test=develop, test=op_version (#29994)
4 years ago
Jack Zhou 5a4e42ca9a
add gru op_register_version; test=op_version; (#29931)
4 years ago
Jacek Czaja f6cca62575
[oneDNN] Making ThreadID info in caching key optional (#29272)
4 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
4 years ago
arlesniak bc902044a4
Fixes mkldnn dygraph learning rate scheduler crashes (#28988)
4 years ago
gongweibao 1dad8ceaab
Fix gpu memory allocation bug. (#28703)
4 years ago
Wojciech Uss 04bcc13fac
Add multi_gru op and tests (#28591)
4 years ago
Shang Zhizhou 8699f38d08
Add TRT support for the pruned transformer model; fix the bug where TensorRT does not support DeletePass (#28517)
4 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
4 years ago
Chen Weihang 4ba977c720
Polish some error message in opeators (#27876)
4 years ago
Jacek Czaja 606611d351
[oneDNN] GRU BF16 kernel (#27731)
4 years ago
cc ec7d11a492
refine fused_elemwise_activation error message (#27734)
4 years ago
Yiqun Liu 65207b4560
Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692)
4 years ago
Wojciech Uss 966447e338
Added support for quantization of fusion_gru (#27518)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
Shang Zhizhou c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436)
4 years ago
Zhang Ting 906e7f921e
add fuse_bn_act op (#27230)
4 years ago
石晓伟 dd4c2d86a5
enhance error messages, test=develop (#27423)
4 years ago
Shang Zhizhou d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass (#27404)
4 years ago
huangxu96 02606d45ef
Quant op dev (#25932)
4 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
4 years ago
lidanqing 5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 (#27198)
4 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
4 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
Zhaolong Xing 50f149a48e
fix cudnn workspace size problem during inference. (#26021)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 76d78c6387
fix conv_fusion_op conflict,test=develop (#24020)
5 years ago
zhaoyuchen2018 a28a63a943
OP(fusion_gru) error message enhancement. test=develop (#23591)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Yiqun Liu 8d0b0cb4ae
Op(conv2d_fusion) error message enhancement. (#23596)
5 years ago
yiicy f5f76e610d
fusion_seqconv_eltadd_relu error message enhancement. (#23554)
5 years ago
zhaoyuchen2018 f0b08123b2
OP(fused_embedding_fc_lstm) error message enhancement. test=develop (#23527)
5 years ago
yiicy de3e299dbb
fusion_seqexpand_concat_fc error message enhancement, test=develop (#23558)
5 years ago