Commit Graph

18034 Commits (0fdd3656654d7b326e7ac0c08893bac1ab10edde)

Author SHA1 Message Date
Jacek Czaja 83a693ee55
[oneDNN] Added Unit Test for Multiple instances prediction (#29501)
4 years ago
joanna.wozna.intel 0ce6d7fa77
Fix bf16 activations test for softmax and gelu (#29502)
4 years ago
Zhong Hui 60bfd308ab
fix p_norm with empty shape (#29500)
4 years ago
Zhou Wei b9e926b8e5
change the code format (#29550)
4 years ago
Leo Chen 9f926eb720
Layernorm opt (#29522)
4 years ago
arlesniak b781953ef5
[oneDNN] Fix flags use test for #29080, assert condition more general (#29493)
4 years ago
tangwei12 ae3f7a7100
add ps table (#29463)
4 years ago
chalsliu 36ec9456cf
Make PADDLE_ROOT as an environment variable
4 years ago
ShenLiang d8391a1983
fix error message of gather nd (#29521)
4 years ago
Zhen Wang 5ac71b36fb
Remove tensor copy in the update_loss_scaling op. (#29426)
4 years ago
Zhou Wei e74e1a226c
support deepcopy for Layer/Tensor/Paramerbase (#29387)
4 years ago
joejiong 87e75a77c2
Add tangent operator (#29207)
4 years ago
zlsh80826 95e334810a
Softmax vectorization (#29404)
4 years ago
wanghuancoder a136c9cdb8
fix increamental coverage script bug, WITH_INCREMENTAL_COVERAGE to DWITH_INCREMENTAL_COVERAGE, test=develop (#29509)
4 years ago
Aurelius84 966aa0e387
Fix test_mobile_net random failed on windows GPU(#29480)
4 years ago
ShenLiang 2ef9e0e23c
Rebuild group automatically in dynamic graph distributed (#29255)
4 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
4 years ago
Huihuang Zheng a1909affc6
Fix Unit Test: Add Sleep Time for CUDA Retry (#29442)
4 years ago
Leo Chen e5e522493d
make gelu fp16 computing more robust (#29484)
4 years ago
LoveAn 8094ac686e
Print ccache/clcache hit rate (#29341)
4 years ago
Zhang Ting 560b432349
Revert "improve elementwise_add_grad perf (#29277)" (#29464)
4 years ago
jakpiase 57a4f16d9e
added internal and external reorders to profiler (#29443)
4 years ago
Pei Yang 2480bdef6c
change hard_swish from plugin to layer (#29177)
4 years ago
taixiurong ecca6585cd
1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
4 years ago
LoveAn 03b42d9fa7
fix unittest on windows, test=develop (#29365)
4 years ago
TTerror a5fcc4b545
update reduce_sum op on xpu (#29367)
4 years ago
Jack Zhou c7cada8571
Fix gru performace decline in 1.8.5 (#29455)
4 years ago
Zhang Ting 6296f4ed09
revert cast eigen kernel (#29427)
4 years ago
Leo Chen a040c055a5
fix layer_norm accuracy (#29434)
4 years ago
Zhou Wei 24ba9ed436
fix that parameters'grad has grad var (#29408)
4 years ago
Leo Chen 4e19ce1df5
refine reshape grad and double grad kernel, use tensor copy async (#29128)
4 years ago
Shang Zhizhou 225a9c4ed8
Fix unittest (#29412)
4 years ago
Pei Yang f860de4af7
support clip op trt converter (#29411)
4 years ago
Jack Zhou 1dd7b97b66
fix rnn_op bug in cudnn_version>= 8 (#29406)
4 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
4 years ago
Zhou Wei 5c9bd0bf7c
print whether has build cache (#29035)
4 years ago
cc a623ce044f
Use different name_scope for different conv type, test=develop (#29355)
4 years ago
yongqiangma 7c508d8668
update unbind norm add CUDAPlace api doc information (#29322)
4 years ago
chentianyu03 879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type (#29321)
4 years ago
卖鱼的哲学 074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu (#29280)
4 years ago
lilong12 1decf4ada6
update, test=develop (#29331)
4 years ago
QingshuChen 74bf3bed36
support global pooling for kunlun (#29293)
4 years ago
liym27 b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)
4 years ago
Chen Weihang 9ad800ebb2
Support type promote for basic math ops (quantum required) (#29265)
4 years ago
tangwei12 8358791607
fix gpu outofrange (#29238)
4 years ago
Leo Chen b58cfff89d
use has_grad instead of train_mode (#29309)
4 years ago
Zhang Ting befd6d5338
improve elementwise_add_grad perf (#29277)
4 years ago
Shang Zhizhou ebf689197d
fix tensorrt output shape error (#29308)
4 years ago
Aurelius84 67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op (#28421)
4 years ago
ShenLiang 696dc4bb13
fix the warning of reducer (#29323)
4 years ago