Commit Graph

583 Commits (b5ebca47a352412b01692d01aff7b6f4f371b685)

Author SHA1 Message Date
tensor-tang 8117725852 add jit kernel hsum, hmax and softmax refer code
7 years ago
sneaxiy ba4f43fd62 fix compile error in distributed mode
7 years ago
Yiqun Liu 3008fa1261 Add the CUDA kernel for beam_search op (#15020)
7 years ago
Zeng Jinle 2480a3df7d Merge pull request #15496 from sneaxiy/lazy_allocator2
7 years ago
sneaxiy 9c360cc798 test=develop
7 years ago
Xin Pan 58cb18d9d9 Merge pull request #15322 from velconia/imperative_resnet
7 years ago
sneaxiy 51227bd447 lazy_allocator
7 years ago
tangwei12 8b50ad80ff checkpoint at distributed training (#14854)
7 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
7 years ago
minqiyang 315b133e67 Add single GPU support to imperative
7 years ago
tensor-tang 3759c1db8c Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
7 years ago
peizhilin eea75a1d93 fix issue when type is invalid
7 years ago
peizhilin 9adb158e5b Merge remote-tracking branch 'upstream/develop' into debug/support
7 years ago
chengduo 46d01d798e Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
7 years ago
Wojciech Uss cb2ba58458 Fix performance drop when with MKL-DNN
7 years ago
chengduozh c4eced9881 fix thread safe bug
7 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
7 years ago
wopeizl 5d9edb4124 Merge pull request #15156 from wopeizl/windows/fixgpuissue
7 years ago
chengduo 064512aa47 Remove workspace_handle in conv_cudnn (#15186)
7 years ago
xiaolil1 8f17c714de Conv int8 residual (#15145)
7 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
7 years ago
peizhilin 92da467c99 Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
peizhilin c1235c935f add the enable_debug flag
7 years ago
Zeng Jinle e29f10d315 Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
7 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
7 years ago
Zeng Jinle c562be20d9 Merge pull request #15193 from sneaxiy/fix_cudnn_compatible_check
7 years ago
peizhilin 1cd95d8a0b use thread local instance test=develop
7 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
7 years ago
peizhilin d54133ea85 not include the numeric under linux test=develop
7 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
7 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
7 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
7 years ago
sneaxiy 9793a0b6a6 fix_cudnn_compatible_check
7 years ago
Zeng Jinle ccb322d6a5 merge develop
7 years ago
Zeng Jinle f3a13512fc Merge pull request #15139 from sneaxiy/remove_op_handle_lock
7 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
7 years ago
peizhilin c919b2f31d Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
peizhilin fd4f4d0e5f fix build issue test=develop
7 years ago
Yan Xu a1e60ab19b Merge pull request #14791 from Yancey1989/parallel_graph_mode
7 years ago
peizhilin 9ae50dd07d fix gpu buils issue on windows test=develop
7 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
7 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
sneaxiy 6f06e6cdac Merge remote origin
7 years ago
Xin Pan 9186451f60 hide GetTensor
7 years ago
sneaxiy d25395fc98 remove tensor core lock
7 years ago
Yancey1989 82b42e31f0 polish unittest test=develop
7 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
peizhilin 813c2ce539 fix timer test=develop
7 years ago
wopeizl 7ab501264d Merge pull request #15069 from wopeizl/windows/dsosupport
7 years ago
guru4elephant ff739449ab Merge pull request #15018 from guru4elephant/add_timer
7 years ago