Commit Graph

9973 Commits (fd2947babfedc47106543dc7c2a2d26cf21a2d30)

Author SHA1 Message Date
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
Leo Chen 0a47387bd8
Use static local variable instead of global variable for safty (#26018)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
yaoxuefeng 224620071b
add new flatten op test=develop (#25393)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
Zhong Hui dca56f47f5
fix invalid read of pnorm gradient function
5 years ago
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Zhaolong Xing 5970871a64
add eltwise clip cuda impl. (#25689)
5 years ago
Zhen Wang 82374dc12f
Add some error messages for the op without double grads. (#25951)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
wawltor a697e94693
Update the code of the compare ops for the broadcast function
5 years ago
Chen Weihang 9b5a65b819
refine init signal handler meg dumper (#25911)
5 years ago
wangchaochaohu ff717d5158
Add support for tuple of concat Op test=develop (#25800)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
Pei Yang 9e9a569dae
add trt int8 support for elementwise_mul and scale (#25676)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago
Zhang Ting 6486fe8a94
improve GPU performance of transpose, test=develop (#25862)
5 years ago
Zhang Ting 2d24f56a7a
avoid data transfer, test=develop (#25810)
5 years ago
ShenLiang bca303165a
fix inverse bug (#25641)
5 years ago
Chen Weihang 48b9a56f1c
Polish framework error message - part 4 (#25807)
5 years ago
Aurelius84 e52dae6ef6
Using input.place() in GetExpectedKernel in slice_op (#25595)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
wangchaochaohu 32b9577b2a
refine the split op for API 2.0 test=develop (#25320)
5 years ago
lilong12 ce506930c3
Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
Zhaolong Xing bcddefef39
[Fix Ut]: fix inference ut which exist bug on windows. (#25814)
5 years ago
lilong12 5f30e57cdd
fix test_pipeline, test=develop (#25808)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
hong c2a21ca9c9
Fix dygraph grad bugs (#25781)
5 years ago
cc 42189be67b
[Quant] Remove the output for moving_average_abs_max_scale op (#25697)
5 years ago
Dong Daxiang a96d54ac19
Generate final strategy (#25782)
5 years ago
Chen Weihang 2469b578f5
Unified paddle error format when catch system signal (#25765)
5 years ago
Zhou Wei b484a59c39
fix copy file random fail on windows (#25731)
5 years ago
Chen Weihang 23d1228c4d
remove ProgramTranslator.save_inference_model (#25740)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Pei Yang 55b6205ddf
add set_mkldnn_cache_capacity python api(#25524)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Pei Yang eef98b7f86
add macro check for using TRT api dynamicRangeIsSet() (#25694)
5 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
5 years ago
Dong Daxiang 920d998f1e
add more settings for distributed strategy (#25685)
5 years ago
Sylwester Fraczek 1aaa26f102
add dnnl sigmoid (logistic) activation (#25745)
5 years ago
Chen Weihang c34c80d302
Polish framework error message part3 (#25701)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago