Commit Graph

18544 Commits (17030ff28b9a54bb57779e9b8448a6d222110ec5)

Author SHA1 Message Date
niuliling123 a71d72d921
relu forward and backward with vectortype (#31869)
4 years ago
tianshuo78520a 8829a309fe
Delete cudnn6 code (#31835)
4 years ago
liym27 525c32e33c
Fix bug of set_value op: Decrease axes to do right broadcast (#31875)
4 years ago
ronnywang 123949eb48
[ROCM] added a cudnn switch of conv2d for rocm platform (#31836)
4 years ago
Shang Zhizhou 61805d8f0a
fix cmake model path (#31866)
4 years ago
Jiabin Yang 51eb29de18
[CustomOP] Add shape related constructor for Tensor (#31681)
4 years ago
zlsh80826 e3a38d790a
[Paddle-TRT] roi_align_plugin (#31732)
4 years ago
zlsh80826 bfb5cf5567
[Paddle-TRT] trt affine channel converter (#31628)
4 years ago
cc b47478efc2
[dygraph qat] Use layer to calculate output scale (#31861)
4 years ago
lilong12 c3974d0e2a
[3D-parallel] Reformat pipeline parallel (#31786)
4 years ago
zlsh80826 01aa252624
[Paddle-TRT] multiclass nms (#31742)
4 years ago
Wilber 70b67f1029
fix go api bug. (#31857)
4 years ago
tianshuo78520a e804f08559
delete include framework.pb.h (#31859)
4 years ago
Chen Weihang 27f2d8df8e
Polish two error messages (#31852)
4 years ago
niuliling123 6472d62093
Revert "add relu forward kernel and backward kernel (#31613)" (#31853)
4 years ago
winter-wang e7f28d6c0d
fix runtime crash when rnn model inference, test=develop (#31833)
4 years ago
Wojciech Uss e5f7a834d4
fix cache key in concat oneDNN kernel (#31820)
4 years ago
ronnywang 270699e647
[ROCM] fix test_matmul_v2_op (#31802)
4 years ago
Zhou Wei 1eb927f935
Restore the third-party library cache for windows (#31811)
4 years ago
Chen Weihang 3f66e7deab
add cmath header for bfloat (#31792)
4 years ago
Zhou Wei a70de87d76
Update windows compiler and CI from VS2015 to VS2017 (#31652)
4 years ago
Wilber f4d9212de2
trt plugin upgrade to pluginv2ext (#31670)
4 years ago
niuliling123 372ac08a17
add relu forward kernel and backward kernel (#31613)
4 years ago
tianshuo78520a 513641e153
Delete fast_check_nan_inf (#31788)
4 years ago
Shang Zhizhou 9d04ef7369
fix tensorrt output variable reshape (#31733)
4 years ago
Qi Li 46dd1d4aad
[ROCM] fix reduce_sum nan in ROCM platform, test=develop (#31780)
4 years ago
zlsh80826 bfced39eb6
[Paddle-TRT] nearest_interp op (#31626)
4 years ago
arlesniak 7ccf6b6030
[oneDNN] Initial bf16 amp integration (#31093)
4 years ago
lilong12 a501a7b0ca
[3D-parallel] add 1f1b scheduler for pipeline (#31566)
4 years ago
ronnywang 8c19d7aa2f
[ROCM] fix test_conv2d_transpose_op (#31749)
4 years ago
Ouyang Chao a45c8ca69d
fix bug of DepthwiseConvTransposeGradKernel (#31762)
4 years ago
Jacek Czaja 25fc2a1fdb
[oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647)
4 years ago
Chen Weihang 878e117b6d
[CustomOp] Support float16 in custom op (#31725)
4 years ago
ronnywang c9e1d9dc31
[ROCM] fix test_rnn_op (#31735)
4 years ago
zlsh80826 1c67cf0c98
run radix sort of proposals layer on context stream (#31631)
4 years ago
Chen Weihang e429deb0c4
[CustomOp] Support attribute in infershape function (#31713)
4 years ago
Adam Osewski a4a2b77def
[oneDNN] lookup_table op with support for BF16 data type. (#31558)
4 years ago
zlsh80826 c86e771e94
NMS Performance Optimization (#31634)
4 years ago
zlsh80826 50cafa0b0c
remove redundant sync, set collect/dist kernel to context stream, sub_lod memcpy opt (#31641)
4 years ago
ronnywang 420527f0d9
[ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709)
4 years ago
Chen Weihang 87852616aa
[CustomOp] Support complex dtype in custom op (#31657)
4 years ago
zlsh80826 fe241fd02f
[Paddle-TRT] gather converter (#31640)
4 years ago
zlsh80826 4ea3427865
[Paddle-TRT] support batch axis concatenation when using dynamic shape (#31627)
4 years ago
Zhang Ting 7f50bb7ec1
support NHWC for temporal_shift op (#31642)
4 years ago
Chen Weihang 2fbe9b097a
[CustomOp] Remove Eigen dependencies of float16 (#31669)
4 years ago
Qi Li d9b50f664f
[ROCM] update ci scripts and dockerfile, test=develop (#31551)
4 years ago
YUNSHEN XIE 1a6e3b04cd
Second optimization of retry method (#31646)
4 years ago
yiak c1b1ccfbf5
Update tinyformat.h (#31612)
4 years ago
ronnywang da10c5cf8b
[ROCM] fix softmax_with_cross_entropy_op, test=develop (#31629)
4 years ago
Chen Weihang 027b574a0e
[CustomOp] Remove the dependence of the underlying data types on eigen (#31602)
4 years ago