30201 Commits (98e803e04f7057bd6bd1a6d3816b80054a7e354b)

Author SHA1 Message Date
Pei Yang 98e803e04f
map_matmul_to_mul_pass support 3dim (#31958)
4 years ago
wuhuanzhou a37a7f67e1
modify CI recommend information (#31395)
4 years ago
jakpiase 6dca7a1de7
Added int8 kernel for oneDNN LSTM op (#31894)
4 years ago
Pei Yang 14b7e3cf06
[Paddle-TRT] TRT inference support for BERT/Transformer in paddle 2.0 api (#31744)
4 years ago
Zhou Wei 245252b86e
fix bug when dtype of to_tensor is core.VarType (#31931)
4 years ago
Zhen Wang e1f931610e
Fix save/load error in imperative qat UT. (#31937)
4 years ago
Yiqun Liu e50bc2c2a6
Enhance cmake to support specifying CUDA_ARCH_NAME to Ampere. (#31923)
4 years ago
Zhou Wei 04a49b097e
[Custom OP]Remove old custom OP and reduce whl package volume (#31813)
4 years ago
wangguanzhong fe2848686b
add exclusive for test_conv2d_op, test=develop (#31936)
4 years ago
chajchaj 73a6fa3ed0
add deprecated for softmax_with_cross_entropy (#31722)
4 years ago
Shang Zhizhou 8084b7594b
fix batchnorm when inpu dims < 3 (#31933)
4 years ago
zlsh80826 64ee255ffd
[Paddle-TRT] yolobox (#31755)
4 years ago
Aurelius84 c4b60efabd
Fix segment Fault from set_value (#31891)
4 years ago
wuhuanzhou 17030ff28b
fix op benchmark ci error caused by missing test_pr branch, test=document_fix (#31920)
4 years ago
niuliling123 a71d72d921
relu forward and backward with vectortype (#31869)
4 years ago
tianshuo78520a 8829a309fe
Delete cudnn6 code (#31835)
4 years ago
wanghuancoder b48841ba2e
modify API nn.Bilinear's doc (#31889)
4 years ago
liym27 525c32e33c
Fix bug of set_value op:Decerease axes to do right broadcast (#31875)
4 years ago
ronnywang 123949eb48
[ROCM] added a cudnn switch of conv2d for rocm platform (#31836)
4 years ago
Shang Zhizhou 61805d8f0a
fix cmake model path (#31866)
4 years ago
Jiabin Yang 51eb29de18
[CustomOP] Add shape related constructor for Tensor (#31681)
4 years ago
zlsh80826 e3a38d790a
[Paddle-TRT] roi_align_plugin (#31732)
4 years ago
zlsh80826 bfb5cf5567
[Paddle-TRT] trt affine channel converter (#31628)
4 years ago
cc b47478efc2
[dygraph qat] Use layer to calculate output scale (#31861)
4 years ago
lilong12 c3974d0e2a
[3D-parallel] Reformat pipeline parallel (#31786)
4 years ago
zlsh80826 01aa252624
[Paddle-TRT] multiclass nms (#31742)
4 years ago
Wilber 70b67f1029
fix go api bug. (#31857)
4 years ago
tianshuo78520a e804f08559
delete include framework.pb.h (#31859)
4 years ago
Chengmo f58cb01864
【Paddle.Fleet】fix dataset zip py3 bug (#31441)
4 years ago
Kaipeng Deng bf09dcb346
add GPU tensor notice & update default_collate_fn/default_convert_fn. test=develop (#31763)
4 years ago
Chen Weihang 27f2d8df8e
Polish two error messages (#31852)
4 years ago
Zhou Wei 511e204e62
LRScheduler.get_lr should not update lr in LinearWarmup (#31843)
4 years ago
niuliling123 6472d62093
Revert "add relu forward kernel and backward kernel (#31613)" (#31853)
4 years ago
winter-wang e7f28d6c0d
fix runtime crash when rnn model inference, test=develop (#31833)
4 years ago
parap1uie-s 5d89ec36dc
Update pooling.py (#31829)
4 years ago
Huihuang Zheng 649868ffb2
[Dy2stat] Fix the bug that loop_body_func may return single element (#31806)
4 years ago
Wojciech Uss e5f7a834d4
fix cache key in concat oneDNN kernel (#31820)
4 years ago
Aurelius84 f2cfc0f46d
[CustomOp]Avoid raising warning while import paddle (#31804)
4 years ago
cc 84a551380e
[dygraph qat] Refine saving output scale to infer program (#31784)
4 years ago
Chen Weihang 68497e7b39
change trainable to stop_gradient in optimizer (#31823)
4 years ago
ronnywang 270699e647
[ROCM] fix test_matmul_v2_op (#31802)
4 years ago
Zhou Wei 1eb927f935
Restore the third-party library cache for windows (#31811)
4 years ago
Chen Weihang 3f66e7deab
add cmath header for bfloat (#31792)
4 years ago
Feiyu Chan 4046f1303a
add coalesce_tensor into white list when checking re-creation of parameters (#31800)
4 years ago
Zhou Wei a70de87d76
Update windows compiler and CI from VS2015 to VS2017 (#31652)
4 years ago
Wilber f4d9212de2
trt plugin upgrade to pluginv2ext (#31670)
4 years ago
niuliling123 372ac08a17
add relu forward kernel and backward kernel (#31613)
4 years ago
Wojciech Uss 814b38e30f
update scale collection and propagation algorithm (#31783)
4 years ago
tianshuo78520a 513641e153
Delete fast_check_nan_inf (#31788)
4 years ago
Shang Zhizhou 9d04ef7369
fix tensorrt output varible reshape (#31733)
4 years ago