Commit Graph

955 Commits (39d5bb6dce23d4bd9c1ef47ad6f2c5e50dba516a)

Author SHA1 Message Date
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
4 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Zhen Wang f9066e6a6f
Update the demo code and the doc of varbase.backward. (#26506)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
joanna.wozna.intel 559e43eee4
Small change in conv2d and quantize pass (#26671)
5 years ago
Adam f3909020de
Add mechanism for blocking oneDNN cache clearing (#26502)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
GaoWei8 1fbee267d4
remove scope in cudnn lstm (#25188)
5 years ago
Leo Chen 672578a797
Print user-friendly error message in core.ops (#26261)
5 years ago
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 751305ecf0
Add flags to control call stack of error message (#25997)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Chen Weihang 9b5a65b819
refine init signal handler meg dumper (#25911)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
Chen Weihang 2469b578f5
Unified paddle error format when catch system signal (#25765)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
Chen Weihang a6abd92dfd
Polish install error hint message (#25531)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
GaoWei8 c10dcff12d
refine PADDLE_ENFORCE (#25456)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
Chen Weihang 7be285a66f
remove useless property, test=develop (#25461)
5 years ago
Jacek Czaja a5d1592f6c
Added missing oneDNN format (#25450)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
GaoWei8 ea7e532598
Refine PADDLE_ENFORCE (#25369)
5 years ago
GaoWei8 fb70682f00
fix PADDLE_ENFORCE (#25297)
5 years ago
Chen Weihang 5a959f6e6e
Refactor dynamic dso search functions (#25214)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
Chen Weihang 353ea9e8ad
Add default cudnn lib path (#25175)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Tao Luo 2996315fc9
fix profiler_test on win32 (#25073)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
wangchaochaohu feba131893
fix the sgement fault error of profiler in seqseq model test=develop (#24952)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago