Commit Graph

10809 Commits (42a6442a0800008c0d4fdcf70c3a922d4eb0512c)

Author SHA1 Message Date
liym27 3ce878f309
Check the rank of input in kernel of set_value op (#30147)
4 years ago
WeiXin 66dc4ac77b
modify error message based on comments (#30189)
4 years ago
wawltor fee424411a
just add the op error message for the matmul xpu (#30246)
4 years ago
GaoWei8 0a21924a8d
optimize softmax forward (#30217)
4 years ago
wangchaochaohu af80859dd6
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
4 years ago
zhang wenhui 5932fee60a
enhance error message, test=develop (#30220)
4 years ago
pangyoki da16b33f2e
add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913)
4 years ago
Jacek Czaja 4aba17b5db
[oneDNN] Added UT for testing elementwise_mul caching (#30203)
4 years ago
Zhen Wang 7f7dfccf20
Support pure fp16 training for AMP API. (#29544)
4 years ago
Leo Chen 789743e190
use cuda generator in bernoulli cuda kernel (#30199)
4 years ago
Leo Chen 8696335f86
Fix dtype of ungenerated grad var (#28511)
4 years ago
Wilber 609c022222
shape op support int8 and uint8 tensor (#30201)
4 years ago
Wilber 01a287bf0a
fix windows compile when WITH_PYTHON=ON and WITH_TENSORRT=ON (#30194)
4 years ago
ruri e42e1e80dc
Add version checking, test=op_version (#30129)
4 years ago
Leo Chen 1f97d61c68
Add callback after TensorCopy (#30123)
4 years ago
Chengmo 528e03fc08
【Paddle.Fleet】Fix tensor table (#30075)
4 years ago
Wilber ade244948c
disable mkldnn inplace pass on windows (#30164)
4 years ago
joanna.wozna.intel 907262ee15
Fix analysis predictor test (#30191)
4 years ago
lijianshe02 2dc7ee276b
enhance error message of nll_loss op test=develop (#30125)
4 years ago
Huihuang Zheng 54bf3f5a56
Refine PADDLE_ENFORCE Error Messages. test=develop (#30149)
4 years ago
Chen Weihang d0fb06b27f
[Complex] Simplify prepared op impl to improve performance (#30153)
4 years ago
123malin c5b415bfd9
Improve Index select cuda kernel (#30139)
4 years ago
wangchaochaohu 7dd551e08b
refine the paddle place support using str (#28769)
4 years ago
WeiXin 404c16763a
Add detailed error message for curandStatus_t, cublasStatus_t, cusolverStatus_t (#30161)
4 years ago
Wilber 91a8a25721
enhance error info for py_func (#30138)
4 years ago
weihaoji b8207af6bc
[XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes introduced precision diff. test=develop (#30122)
4 years ago
liuyuhui 15fac5e7fa
fix assign_op_xpu concat_op_xpu warining (#30120)
4 years ago
Jack Zhou f5428eca4f
fix enforce msg of sum xpu op (#30113)
4 years ago
123malin 198fbdfb60
Add Lookahead and ModelAverage Optimizer (#30004)
4 years ago
Leo Chen adac38c506
add dispenable input for core.ops.reshape2/expand/slice (#30072)
4 years ago
ShenLiang becf99d2e8
fix error message (#30135)
4 years ago
Zhou Wei 30888ca343
Polish and Optimize the print/repr information of Layer (#29998)
4 years ago
wangguanzhong 69839f8a9a
fix error message for distribute_fpn_proposals_op (#30116)
4 years ago
QingshuChen 8e1c3ddf15
add aarch64 and sunway kunlun lib (#30027)
4 years ago
Shang Zhizhou 05b27695f1
add inference api: DisableTensorRtOps (#30109)
4 years ago
石晓伟 53bb126510
fix a bug in op_version_registry, test=develop, test=op_version (#29994)
4 years ago
xiemoyuan 3e0c492910
Optimize the error message of framework. (#30134)
4 years ago
liym27 9922bd4125
Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003)
4 years ago
chentianyu03 666e665132
change the kron gradient when complex types (#29995)
4 years ago
chentianyu03 a5e422c85d
add trace op_register_version and fix version bug; test=op_version (#30000)
4 years ago
cc 9f34374b48
Fix the formate of raising error in randperm op (#30108)
4 years ago
liuyuhui 254ad61959
fix xpu pe sync, test=notest (#30095)
4 years ago
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
hong 297fff1a79
support dygraph in xpu place (#30051)
4 years ago
wangchaochaohu d0a5620575
fix the compiler error when gcc4 cuda9.0 (#29997)
4 years ago
WangXi ee16006b5d
Optimization grad merge performance (#29784)
4 years ago
yongqiangma e891f4da1b
Add p_norm op version info (#30042)
4 years ago
tangwei12 7d1c149e09
for inference checkpoint (#30081)
4 years ago
tangwei12 7d4bdff07d
fix large scale memory (#30035)
4 years ago
Shang Zhizhou 08dc5bc27e
fix op version checker of pass bug (#30028)
4 years ago