Commit Graph

29470 Commits (27bdbec7fc16f5d66d8a0458bb6cfb68898204d1)
 

Author SHA1 Message Date
Aurelius84 966aa0e387
Fix test_mobile_net random failed on windows GPU(#29480)
4 years ago
ShenLiang 2ef9e0e23c
Rebuild group automatically in dynamic graph distributed (#29255)
4 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
4 years ago
chalsliu ec26a26a46
support precision test for py3
4 years ago
Huihuang Zheng a1909affc6
Fix Unit Test: Add Sleep Time for CUDA Retry (#29442)
4 years ago
Leo Chen e5e522493d
make gelu fp16 computing more robust (#29484)
4 years ago
LoveAn 8094ac686e
Print ccache/clcache hit rate (#29341)
4 years ago
Aurelius84 5d530c9319
fix amp support fleet (#29491)
4 years ago
ShenLiang 311b3b44fc
Fix the bug where embedding can‘t be processed correctly in reducer (#29485)
4 years ago
Zhang Ting 560b432349
Revert "improve elementwise_add_grad perf (#29277)" (#29464)
4 years ago
jakpiase 57a4f16d9e
added internal and external reorders to profiler (#29443)
4 years ago
Pei Yang 2480bdef6c
change hard_swish from plugin to layer (#29177)
4 years ago
lilong12 b122d0bb76
Fix bug in gloo that gloo initialization hangs (#29447)
4 years ago
taixiurong ecca6585cd
1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
4 years ago
LoveAn 03b42d9fa7
fix unittest on windows, test=develop (#29365)
4 years ago
ShenLiang 22e6b9e373
Fix the ut of matmulv2 for broadcast case (#29461)
4 years ago
TTerror a5fcc4b545
update reduce_sum op on xpu (#29367)
4 years ago
Jack Zhou c7cada8571
Fix gru performace decline in 1.8.5 (#29455)
4 years ago
chentianyu03 acce962133
remove complex module direction (#29419)
4 years ago
Zhang Ting 6296f4ed09
revert cast eigen kernel (#29427)
4 years ago
Leo Chen a040c055a5
fix layer_norm accuracy (#29434)
4 years ago
Zhou Wei 24ba9ed436
fix that parameters'grad has grad var (#29408)
4 years ago
Leo Chen 4e19ce1df5
refine reshape grad and double grad kernel, use tensor copy async (#29128)
4 years ago
chalsliu f7b45fd694
Support precision test verification
4 years ago
Wilber ad01658e36
fix cmake error message. (#29421)
4 years ago
Shang Zhizhou 225a9c4ed8
Fix unittest (#29412)
4 years ago
Pei Yang f860de4af7
support clip op trt converter (#29411)
4 years ago
Jack Zhou 1dd7b97b66
fix rnn_op bug in cudnn_version>= 8 (#29406)
4 years ago
Bai Yifan 87bb726258
Add deform_conv2d,DeformConv2D (#29364)
4 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
4 years ago
tianshuo78520a 6f2bb20e0a
update docker nccl version 2.7.8 (#28575)
4 years ago
chentianyu03 64e4e17f0c
remove complexvariable (#29390)
4 years ago
Zhou Wei 5c9bd0bf7c
print whether has build cache (#29035)
4 years ago
chajchaj 79e6086743
change shape of output in cross_entropy, test=develop (#29220)
4 years ago
liuyuhui 2ee7a6b08c
[paddle v2.0.0rc1: API fixs] assign/conv2d/conv2d_transpose/cast/ParamAttr (#29171)
4 years ago
Wilber 6cb688865a
update lite tag. (#29392)
4 years ago
cc a623ce044f
Use different name_scope for different conv type, test=develop (#29355)
4 years ago
Guo Sheng 8fc7f1b66a
Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr (#29235)
4 years ago
Chen Long c940f842ca
remove rarfile from requirements (#29319)
4 years ago
Wilber cff93b52a7
update cmake for FT openbals version. (#29382)
4 years ago
yongqiangma 7c508d8668
update unbind norm add CUDAPlace api doc information (#29322)
4 years ago
chentianyu03 879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type (#29321)
4 years ago
Chen Long 66fd1c00a0
fix some docs test=develop;test=document_fix (#29374)
4 years ago
liym27 5f84d0b375
Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray (#29306)
4 years ago
Feiyu Chan f7cdcefa65
fix multiple documentation errors, test=document_fix (#29210)
4 years ago
卖鱼的哲学 074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu (#29280)
4 years ago
ShenLiang 4064354a01
support dp run single card (#29358)
4 years ago
lilong12 1decf4ada6
update, test=develop (#29331)
4 years ago
Qi Li 2712df42a3
fix go demo, test=develop (#29107)
4 years ago
gongweibao 8989053443
Fix bug of test_fleet_launch_async.sh (#29332)
4 years ago