Commit Graph

131 Commits (3d5522146e34a44aeaa9916fb46f0877cb0894af)

Author SHA1 Message Date
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
Shang Zhizhou c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436)
5 years ago
Zhang Ting 906e7f921e
add fuse_bn_act op (#27230)
5 years ago
石晓伟 dd4c2d86a5
enhance error messages, test=develop (#27423)
5 years ago
Shang Zhizhou d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass (#27404)
5 years ago
huangxu96 02606d45ef
Quant op dev (#25932)
5 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
5 years ago
lidanqing 5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 (#27198)
5 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
5 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
Zhaolong Xing 50f149a48e
fix cudnn workspace size problem during inference. (#26021)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 76d78c6387
fix conv_fusion_op conflict,test=develop (#24020)
5 years ago
zhaoyuchen2018 a28a63a943
OP(fusion_gru) error message enhancement. test=develop (#23591)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Yiqun Liu 8d0b0cb4ae
Op(conv2d_fusion) error message enhancement. (#23596)
5 years ago
yiicy f5f76e610d
fusion_seqconv_eltadd_relu error message enhancement. (#23554)
5 years ago
zhaoyuchen2018 f0b08123b2
OP(fused_embedding_fc_lstm) error message enhancement. test=develop (#23527)
5 years ago
yiicy de3e299dbb
fusion_seqexpand_concat_fc error message enhancement, test=develop (#23558)
5 years ago
huzhiqiang 5fe3b63824
[error message enhancement] fused_elemwise_activation_op and fusion_conv_inception_op (#23686)
5 years ago
zhongpu b4b6763ab2
fix bug for exhaustive_search in conv_fusion_op, test=develop (#23727)
5 years ago
zhaoyuchen2018 7b5e23c034
OP(fusion_gru) error message enhancement. test=develop (#23599)
5 years ago
Wilber 1ac9db4354
error message enhancement for fusion_seqpool_concat_op. test=develop (#23563)
5 years ago
Wilber 286c2e0ede
error message enhancement for py_func op. (#23565)
5 years ago
Zhaolong Xing f345607115
Refine transpose flatten concat error message (#23625)
5 years ago
Wilber 5f22478a93
error message enhancement for repeated fc. test=develop (#23562)
5 years ago
zhangchunle 638d924d89
Op (FusionSquaredMatSub) error message enhancement. (#23498)
5 years ago
zhangchunle fd9b7bdb3d
Op (FusedEmbeddingSeqPool) error message enhancement. (#23454)
5 years ago
Chen Weihang 16315d3d9e
Delete Ref & VectorRef and add GetDataSafely (#22997)
5 years ago
zhongpu dbfbd7eac4
support Exhaustive search in dygraph (#23415)
5 years ago
zhongpu bfb07aafe8
Revert "Exhaustive search (#22821)", test=develop (#23401)
5 years ago
zhongpu 48144e4099
Exhaustive search (#22821)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
Zhaolong Xing 8c6fde9e69
fix align error (#23090)
5 years ago
wangchaochaohu 3757e0687c
Add Unittest for backward of fusion group (#22932)
5 years ago
wangchaochaohu f0d193a23c
Cast fusion for fusion group (#22876)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
Zeng Jinle d33c4343e1
Imperative tracer refactoring (#22457)
5 years ago
Zhaolong Xing 1a533ed2de
[BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago