Commit Graph

18388 Commits (cced930b61ba246dffec68bbe09bd9e22a142d64)

Author SHA1 Message Date
Qi Li cced930b61
[ROCM] update fluid operators for rocm (part1), test=develop (#31077)
5 years ago
wangchaochaohu 364cfa2686
fix windows for optimization of elementwise_add Op (#31068)
5 years ago
joanna.wozna.intel 781df300d0
Unification of BF16 enablement process (#31034)
5 years ago
Zhong Hui 16fe11d71e
fix softmax cross entropy integer overflow (#30590)
5 years ago
Zhou Wei 44ee251fde
fix UNIX cmake problem (#31113)
5 years ago
Qi Li a60d93fb77
[ROCM] update fluid framework for rocm (part2), test=develop (#31010)
5 years ago
Thunderbrook 565354f676
support save multi sparse table in one path (#31108)
5 years ago
Qi Li 50967135a5
[ROCM] update fluid framework for rocm (part3), test=develop (#31011)
5 years ago
Qi Li 8fe09faf14
[ROCM] update fluid framework for rocm (part1), test=develop (#31009)
5 years ago
Qi Li 334296306c
[ROCM] update fluid platform for rocm39 (part4), test=develop (#30936)
5 years ago
Shang Zhizhou a5c56d83a1
update trt int8 calibrator to IEntropyCalibratorV2 (#31060)
5 years ago
Zhou Wei adaec0073d
[2.0Custom OP]Support New Custom OP on Windows (#31063)
5 years ago
Qi Li 1d996637e6
[ROCM] update fluid imperative for rocm (part1), test=develop (#31017)
5 years ago
JamesLim b95eb38b8a
fix the bug in backward OP of index_sample. (#31026)
5 years ago
Chengmo 6b3371e0c7
Remove PE special profiler (#30886)
5 years ago
Chen Weihang 6beeafe797
[CustomOp] Add more dispatch marco for users (#31058)
5 years ago
TTerror d5323dab41
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056)
5 years ago
123malin 16b4260b2f
test=develop, save/load, shrink (#30625)
5 years ago
Jiabin Yang 628451af06
hide useless headers and add complex support (#31074)
5 years ago
Wilber 463eae0383
update paddle_fluid.so to paddle_inference.so (#30850)
5 years ago
liym27 5b367dab44
[static setitem] Support the index is Tensor; step>1; step<0 .(#30949)
5 years ago
Qi Li eb3050fa9a
[ROCM] update fluid inference for rocm (part1), test=develop (#31018)
5 years ago
Jacek Czaja f7465641c3
Added reshape grad bf16 (#31035)
5 years ago
Wojciech Uss 615d8a2264
Modify relu native implementation 2 (#30996)
5 years ago
ShenLiang 9401173e3a
Remove scale loss before reduce in dygraph (#30807)
5 years ago
Wilber 0020d91506
fix python pass builder error. (#30946)
5 years ago
Wilber 39aeaa160e
fix jetson problem (#30939)
5 years ago
Wilber 01ccfbcde9
update trt error message when input height or width is -1 (#31019)
5 years ago
Wilber cf8b8f9c5e
resolve memory leak in cudnn8.0 (#31029)
5 years ago
Guanghua Yu 5b267474a9
add offset parameter in roi_align,generate_proposals.etc ops (#30864)
5 years ago
Chen Weihang 75f81233ae
fix regex error & simplify marco name (#31031)
5 years ago
Zhang Ting f0ee159280
enable exhaustive_search for forward and backward algos when dtype is float16 (#30959)
5 years ago
Pei Yang 9b54fe4154
add trt transpose and flatten converter (#31022)
5 years ago
joanna.wozna.intel caf9d39839
Add Conv Transpose BF16 (#30877)
5 years ago
Chen Weihang f649442ddd
New custom operator extension mechanism (#30690)
5 years ago
Zhou Wei 5c0332714f
fix bug of Linux UT parallel level (#30971)
5 years ago
wuhuanzhou 9b3c80c8ab
update eigen version on Windows (#30573)
5 years ago
ShenLiang dae3e1f337
Solve inconsistent order in each card in dynamic graph (#30931)
5 years ago
WangXi 14d039e4a1
Fix the problem that the number of ops executed by xpu is wrong (#30961)
5 years ago
Chen Weihang 010f2caa23
try to fix reader and signal test failed (#30960)
5 years ago
Adam Osewski 3ba69809bf
Fix LayerNorm tester for gcc4.8 (#30962)
5 years ago
Qi Li 93c1d9e761
[ROCM] update fluid platform for rocm39 (part3), test=develop (#30913)
5 years ago
QingshuChen 15297a065c
fix depends of kunlun bkcl (#30945)
5 years ago
liym27 97f7a70c01
Add error message for slice op(#30851)
5 years ago
liuyuhui 87197f8c2e
[kunlun]fix sync in multi kunlun xpu dygraph training. (#30943)
5 years ago
石晓伟 99bd16eb4e
bug fix of xpu lite engine, test=develop (#30918)
5 years ago
tianshuo78520a 2e93233899
Add WITH_XPU_BKCL in Kunlun-CI (#30919)
5 years ago
Qi Li 34f1628ce8
[ROCM] update fluid platform for rocm39 (part2), test=develop (#30774)
5 years ago
Jacek Czaja 9e527d9956
[oneDNN] Added basic changes for elementwise_add_grad bf16 (#30925)
5 years ago
Chengmo c98f144fbc
add truncated gaussian random (#30922)
5 years ago