Commit Graph

16926 Commits (25a233e46d42f6cb76220d58d89f882723a7a4fc)

Author SHA1 Message Date
hong bfb60efb1f
change compare forece_cpu default value; test=develop (#23888)
5 years ago
wangchaochaohu 2270864019
Fusion group optimize for cuda codegen(#23940)
5 years ago
ShenLiang 94dfb7d770
opt the postprocess, test=develop (#24155)
5 years ago
Chengmo 28a558e867
Update index sample (#24109)
5 years ago
Zhang Ting ab8f8fa70d
fix example code, test=develop, test=document_fix (#24139)
5 years ago
Feiyu Chan e01262e691
add kron op (#24105)
5 years ago
Jacek Czaja eb411613e9
[DNNL] refine activations Inplace support (#24145)
5 years ago
wangchaochaohu 6bf26ef156
fix warning mac compiler (#24138)
5 years ago
lijianshe02 a398464e77
API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118)
5 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
5 years ago
Jacek Czaja 461e6a01ec
[DNNL] activations Inplace support (#24123)
5 years ago
wangchaochaohu 6ba7c3ac92
Reduce the construction time of fuction about profiler (#24117)
5 years ago
Zhang Ting fb0729ee7f
avoid warnings in MAC compile (#24124)
5 years ago
qingqing01 b059fb955d
Add trainable_statistics in attr for batch_norm. (#24072)
5 years ago
hutuxian aa0330f451
Add some print log before running UT (#24071)
5 years ago
Tao Luo 34b4153e97
show statistics summary of ccache (#24125)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
石晓伟 46f3139c7f
supports loading model from memory, test=develop (#24098)
5 years ago
xiaoting 96ffebef55
fix bicubic, change int to floor (#24063)
5 years ago
Zeng Jinle a67eea9f00
polish code by adding final, test=develop, test=develop (#24114)
5 years ago
Zeng Jinle ab2e284235
fix compilation failure (#24091)
5 years ago
石晓伟 34d7d6aef0
declare the stream::Priority as enum class, test=develop (#24013)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago
0YuanZhang0 a164b10d05
API/OP error message enhancement (#23717)
5 years ago
Zeng Jinle acef55df04
fix isolated var fetch bug, test=develop (#24070)
5 years ago
Zeng Jinle 3ca700a987
code clean, test=develop (#24060)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
hutuxian 9ff558a46f
Optimize DataFeed (#23957)
5 years ago
Leo Chen 5cccc69f1a
update name generator, test=develop (#24048)
5 years ago
zhangchunle 735e9ccce0
modify paddle_build.sh (#24062)
5 years ago
tianshuo78520a 109ffffbb2
Update code format (#24000)
5 years ago
石晓伟 db6d867383
add boost dependency to cuda_stream (#24032)
5 years ago
wawltor 5c669ad1c2
Add the support dygraph out attribute for the op of mm in api2.0 (#23978)
5 years ago
石晓伟 d2584a7082
New feature: thread local allocator, test=develop (#23989)
5 years ago
Kaipeng Deng 80cf3c3c4d
Refine DataLoader support multi-processing (#23107)
5 years ago
Zhou Wei 76d78c6387
fix conv_fusion_op conflict,test=develop (#24020)
5 years ago
wangguanzhong 931cba2e64
add clamp api, test=develop (#23273)
5 years ago
zhaoyuchen2018 a28a63a943
OP(fusion_gru) error message enhancement. test=develop (#23591)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
silingtong123 f6dbf8e3a4
add 'seed' arguemnt of randint API (#23809)
5 years ago
Li Fuchen 87d8dc3dc0
add diag_embed op (#23385)
5 years ago
liu zhengxi 8e555ba650
OP(pad, pad2d, pad_constant_like) error message enhancement (#23882)
5 years ago
tianshuo78520a 19235e8525
add cpu ci job check_style (#23617)
5 years ago
zhangchunle 6bd200db66
remove high level api (#23854)
5 years ago
ShenLiang 30bd7e1c83
Add rank_attention_op attributes for GPU memory in contrib (#23915)
5 years ago
Yiqun Liu 8d0b0cb4ae
Op(conv2d_fusion) error message enhancement. (#23596)
5 years ago
ShenLiang 7f0b2c7407
fix memory leaking problem of dataset, test=develop (#23955)
5 years ago
mapingshuo 7d4002e06a
restrict block num of layer_norm_grad cuda block to 128 (#23878)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago