Commit Graph

135 Commits (2fbe9b097a41bff2b8c73296bf52e387ec88842a)

Author SHA1 Message Date
Qi Li 334296306c
[ROCM] update fluid platform for rocm39 (part4), test=develop (#30936)
4 years ago
Zhou Wei adaec0073d
[2.0Custom OP]Support New Custom OP on Windows (#31063)
4 years ago
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
4 years ago
Qi Li f89da4ab45
[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639)
4 years ago
wanghuancoder 90773473a0
use nvtx push pop in timeline (#30567)
4 years ago
Zhou Wei 30888ca343
Polish and Optimize the print/repr information of Layer (#29998)
5 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
5 years ago
Y_Xuan 76738504ad
添加rocm平台支持代码 (#29342)
5 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
5 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
5 years ago
Pei Yang 994673bf4f
change avg pooling and global pooling to trt layer in dynamic shape mode (#28702)
5 years ago
lilong12 80d2024644
bug fix, test=develop (#28674)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Zhou Wei 5d7000215a
fix dynamic_loader more safe and error message on windows (#28117)
5 years ago
Pei Yang a0b2f93689
reduce trt warning message (#28011)
5 years ago
Zhang Ting d5cc144c60
tune backward filter algorithm for float16 (#27529)
5 years ago
lilong12 5218b7af6b
add ncclSend and ncclRecv (#27621)
5 years ago
Li Fuchen 1501a80f74
add support to float64 input of warpctc op. (#27399)
5 years ago
Shibo Tao 8f7bb52bd2
fix tensorrt 6 build error. test=develop (#27511)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
Shang Zhizhou c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436)
5 years ago
GaoWei8 1a7559718e
fix cudnn dyload (#27308)
5 years ago
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
5 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
GaoWei8 1fbee267d4
remove scope in cudnn lstm (#25188)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Chen Weihang a6abd92dfd
Polish install error hint message (#25531)
5 years ago
GaoWei8 c10dcff12d
refine PADDLE_ENFORCE (#25456)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
GaoWei8 ea7e532598
Refine PADDLE_ENFORCE (#25369)
5 years ago
GaoWei8 fb70682f00
fix PADDLE_ENFORCE (#25297)
5 years ago
Chen Weihang 5a959f6e6e
Refactor dynamic dso search functions (#25214)
5 years ago
Chen Weihang 353ea9e8ad
Add default cudnn lib path (#25175)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
Guo Sheng 4a5de14426
Remove cusolver potrfBatched support on Windows. (#24338)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
5 years ago
littletomatodonkey 1c08a2136e
test=develop, add addmm op (#23384)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
6 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
6 years ago
danleifeng 425279a57b Improve elementwise operators performance in same dimensions. (#19763)
6 years ago
qingqing01 1a3eef026c
Enable users to create custom cpp op outside framework. (#19256)
6 years ago
liym27 24010472d4 fix pool2d pool3d,support asymmetric padding and channel_last (#19739)
6 years ago