Commit Graph

30185 Commits (b48841ba2e7335eaa435a54436ed580d4aef001c)
 

Author SHA1 Message Date
liu zhengxi ae2be49f40
Add cublas_handle() to expose cublas_handle to ops (#31157)
4 years ago
Aurelius84 406f4a7513
[CustomOp] Support to specific extra_cflags and exctra_cuda_flags independently (#31059)
4 years ago
qingqing01 572cc8bd0f
Update doc for 2.0 API and some callback (#31180)
4 years ago
Pei Yang 00b09e86ac
[Paddle-TRT] support group_norm (#31040)
4 years ago
Chen Weihang c209751c8d
change test_multiprocess_reader_exception cmake (#31174)
4 years ago
YUNSHEN XIE 153121457f
fix ut timeout (#31061)
4 years ago
Chen Weihang 1ce96fa118
[CustomOp] Add new paddle custom op so (#31141)
4 years ago
tangwei12 ebbdf52557
fix entry (#31079)
4 years ago
Qi Li ee76ea72de
[ROCM] update fluid collective op for rocm, test=develop (#31075)
4 years ago
yaoxuefeng d8fa65a3a8
fix heter compile (#30518)
4 years ago
Aurelius84 dce2db4857
[CustomOp] Split build directory for each setup.py (#31124)
4 years ago
Zhou Wei 4b220550ef
[Custom OP]Fix problem of custom op unitests on Windows CI (#31114)
4 years ago
chentianyu03 70131b475f
add warning message when dtypes of operator are not same (#31136)
4 years ago
Zhou Wei be61c2d06b
support build whl and inference library nightly,test=windows3 (#30616)
4 years ago
alncat 5d6a8c7b73
added support for fake_quantize_dequantize_abs_max op in quantization… (#30896)
4 years ago
Chen Weihang e60fd1f6a8
[CustomOp] Split test and add inference test (#31078)
4 years ago
Jacek Czaja d3f09ad702
Update of onednn to 2.2 (#31067)
4 years ago
Guanghua Yu 24ba5ee05c
merge develop conflict (#31122)
4 years ago
xiemoyuan edacb6293c
Optimization of Transformer API (#30957)
4 years ago
WeiXin ee1801c1ad
Save load/save pickle protocol (#31044)
4 years ago
Qi Li cced930b61
[ROCM] update fluid operators for rocm (part1), test=develop (#31077)
4 years ago
yukavio 99fd9815b6
fix flops api (#31081)
4 years ago
wangchaochaohu 364cfa2686
fix windows for optimization of elementwise_add Op (#31068)
4 years ago
joanna.wozna.intel 781df300d0
Unification of BF16 enablement process (#31034)
4 years ago
Zhong Hui 16fe11d71e
fix softmax cross entropy integer overflow (#30590)
4 years ago
Zhou Wei 44ee251fde
fix UNIX cmake problem (#31113)
4 years ago
Qi Li a60d93fb77
[ROCM] update fluid framework for rocm (part2), test=develop (#31010)
4 years ago
Thunderbrook 565354f676
support save multi sparse table in one path (#31108)
4 years ago
Qi Li 50967135a5
[ROCM] update fluid framework for rocm (part3), test=develop (#31011)
4 years ago
Huihuang Zheng cf43a321a8
[Dy2stat] Refactoring tensor_shape_transformer.py to Fix Change after Assign Bug (#31082)
4 years ago
tangwei12 0e4b154298
fix dist fleet ctr ut (#31087)
4 years ago
Qi Li 8fe09faf14
[ROCM] update fluid framework for rocm (part1), test=develop (#31009)
4 years ago
Qi Li 334296306c
[ROCM] update fluid platform for rocm39 (part4), test=develop (#30936)
4 years ago
Shang Zhizhou a5c56d83a1
update trt int8 calibrator to IEntropyCalibratorV2 (#31060)
4 years ago
Zhou Wei adaec0073d
[2.0Custom OP]Support New Custom OP on Windows (#31063)
4 years ago
Chen Weihang 2168f08ac8
add optional for param attr args, test=document_fix (#31105)
4 years ago
Qi Li 1d996637e6
[ROCM] update fluid imperative for rocm (part1), test=develop (#31017)
4 years ago
JamesLim b95eb38b8a
fix the bug in backward OP of index_sample. (#31026)
4 years ago
Chengmo 6b3371e0c7
Remove PE special profiler (#30886)
4 years ago
Chen Weihang 6beeafe797
[CustomOp] Add more dispatch marco for users (#31058)
4 years ago
TTerror d5323dab41
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056)
4 years ago
123malin 16b4260b2f
test=develop, save/load, shrink (#30625)
4 years ago
Shibo Tao 4424aac608
export paddle.static.normalize_program method. (#31072)
4 years ago
Jiabin Yang 628451af06
hide useless headers and add complex support (#31074)
4 years ago
Wilber 463eae0383
update paddle_fluid.so to paddle_inference.so (#30850)
4 years ago
tangwei12 a2170a0866
change fleet reviewer (#31069)
4 years ago
liym27 5b367dab44
[static setitem] Support the index is Tensor; step>1; step<0 .(#30949)
4 years ago
Qi Li eb3050fa9a
[ROCM] update fluid inference for rocm (part1), test=develop (#31018)
4 years ago
Jack Zhou 6df1ca54c8
add detail about states index in rnn result, test=document_fix (#31048)
4 years ago
Huihuang Zheng ef627ac5b9
Fix that convert_var_shape doesn't support slice like [0:], test=develop (#31051)
4 years ago