Commit Graph

29408 Commits (f6cca6257597964659717cabffa81c40d2f174f5)
Author SHA1 Message Date
Jacek Czaja f6cca62575
[oneDNN] Making ThreadID info in caching key optional (#29272)
4 years ago
GeminiCarrie 08f24a3108
Fix precision problem (#29567)
4 years ago
Wilber 740c0d58c3
update for xpu ci. (#29568)
5 years ago
JZ-LIANG d33d468f02
[Sharding] add hybrid-dp feature (#29518)
5 years ago
Leo Chen 1e72e03217
remove duplicated macro (#29563)
5 years ago
Zhang Ting 6702040e94
improve dropout (#29465)
5 years ago
Zhang Ting 30d9589afe
add cast cuda kernel (#29352)
5 years ago
Chen Weihang c1a26e2a05
fix train eval set error in static mode (#29540)
5 years ago
LoveAn b5d4a1f33d
Add the strategy of skipping cc/cu test compilation and execution in CI (#29499)
5 years ago
Aurelius84 2a42250699
Polish hash function of executor cache key (#29556)
5 years ago
taixiurong 760d015c14
add xpu ops for training transformer in kunlun (#29539)
5 years ago
Leo Chen 0fdd365665
Add fast path for dropout when p == 0 (#29553)
5 years ago
Wojciech Uss 917a11495f
fix ininite scale values (#29386)
5 years ago
Jacek Czaja 83a693ee55
[oneDNN] Added Unit Test for Multiple instances prediction (#29501)
5 years ago
lijianshe02 bd29052e33
fix random seed in nll_loss unitest test=develop (#29538)
5 years ago
joanna.wozna.intel 0ce6d7fa77
Fix bf16 activations test for softmax and gelu (#29502)
5 years ago
Zhong Hui 60bfd308ab
fix p_norm with empty shape (#29500)
5 years ago
Zhou Wei b9e926b8e5
change the code format (#29550)
5 years ago
Leo Chen 9f926eb720
Layernorm opt (#29522)
5 years ago
huangxu96 4001979309
Add ReserveSpace in dygraph batch_norm. (#29221)
5 years ago
arlesniak b781953ef5
[oneDNN] Fix flags use test for #29080, assert condition more general (#29493)
5 years ago
tangwei12 ae3f7a7100
add ps table (#29463)
5 years ago
chalsliu 36ec9456cf
Make PADDLE_ROOT as an environment variable
5 years ago
ShenLiang d8391a1983
fix error message of gather nd (#29521)
5 years ago
chalsliu 98edef3c45
Optimize accurate testing
5 years ago
Zhen Wang 5ac71b36fb
Remove tensor copy in the update_loss_scaling op. (#29426)
5 years ago
Wilber 5fe1f8aff7
update lite tag (#29517)
5 years ago
Zhou Wei e74e1a226c
support deepcopy for Layer/Tensor/Paramerbase (#29387)
5 years ago
chalsliu 701c8e06a0
Support precision test for cuda new ut
5 years ago
joejiong 50d3117d30
Add random_split and Subset dataset (#29291)
5 years ago
joejiong 87e75a77c2
Add tangent operator (#29207)
5 years ago
zlsh80826 95e334810a
Softmax vectorization (#29404)
5 years ago
wanghuancoder a136c9cdb8
fix increamental coverage script bug, WITH_INCREMENTAL_COVERAGE to DWITH_INCREMENTAL_COVERAGE, test=develop (#29509)
5 years ago
LoveAn bec51b136b
Revert matching file modification status, test=document_fix (#29514)
5 years ago
Wei Shengyu dc8bb76c68
remove addcmul (#28937)
5 years ago
Zhong Hui f459dd9634
fix abs double grad unittest (#29478)
5 years ago
huangxu96 576d0d938b
add fp16 check into max and avg pool (#29479)
5 years ago
LoveAn 1779e99ff4
Add accuary test and optimize match relu of op name (#29486)
5 years ago
Aurelius84 966aa0e387
Fix test_mobile_net random failed on windows GPU(#29480)
5 years ago
ShenLiang 2ef9e0e23c
Rebuild group automatically in dynamic graph distributed (#29255)
5 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
5 years ago
chalsliu ec26a26a46
support precision test for py3
5 years ago
Huihuang Zheng a1909affc6
Fix Unit Test: Add Sleep Time for CUDA Retry (#29442)
5 years ago
Leo Chen e5e522493d
make gelu fp16 computing more robust (#29484)
5 years ago
LoveAn 8094ac686e
Print ccache/clcache hit rate (#29341)
5 years ago
Aurelius84 5d530c9319
fix amp support fleet (#29491)
5 years ago
ShenLiang 311b3b44fc
Fix the bug where embedding can't be processed correctly in reducer (#29485)
5 years ago
Zhang Ting 560b432349
Revert "improve elementwise_add_grad perf (#29277)" (#29464)
5 years ago
jakpiase 57a4f16d9e
added internal and external reorders to profiler (#29443)
5 years ago
Pei Yang 2480bdef6c
change hard_swish from plugin to layer (#29177)
5 years ago