Commit Graph

269 Commits (4d774953c6cb584f084129746b4d2aea0e59237a)

Author SHA1 Message Date
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
dzhwinter 6fb28796f5
memory (#13143)
7 years ago
dzhwinter f05520060e
fix style (#13142)
7 years ago
fengjiayi 0236966b68 follow commits
7 years ago
fengjiayi 5398e1a3a6 fix bugs
7 years ago
fengjiayi f79ca23115 fix bugs
7 years ago
fengjiayi c501826f42 use framework::RWLock
7 years ago
fengjiayi 1f36a4c27c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CudnnHolder
7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe
7 years ago
luotao1 7169f9378c fix mkldnn include format
7 years ago
fengjiayi 15cc9128be fix compile error
7 years ago
fengjiayi 407ff0bdbc use CudnnHolder in conv_cudnn_op
7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace
7 years ago
Yan Chunwei 902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
dzhwinter d361624c1d
platform module (#12932)
7 years ago
dzhwinter eca4563e5d
operators module (#12938)
7 years ago
tensor-tang 0d46f518ae refine avx condition and warning
7 years ago
tensor-tang 4e538db14d refine jit space
7 years ago
tensor-tang ec59f0d454 add cpu vec
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
Michał Gallus cd32ddac12 Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669)
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
dzhwinter 2673798ddb
"fix float16 ShuffleDownSync Bug" (#12756)
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang ff92b6ba81
Merge pull request #12531 from tensor-tang/refine/op/gru
7 years ago
Chen Weihang 1e961b145c
Merge pull request #12591 from chenwhql/enforce_msg_polish
7 years ago
Yan Chunwei 0a641ba326
add ratio to profiler (#12701)
7 years ago
tensor-tang c588c64a76 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
chenweihang da39d84a48 refine by reviewer's advice
7 years ago
tensor-tang 1ab1d03c62 fix missing macro condition
7 years ago
Qiao Longfei e8fcb71bed
Merge pull request #12620 from jacquesqiao/timeline-support-pure-cpu
7 years ago
tensor-tang 3bf3e77ac8 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
qiaolongfei 5a6c3cd9e0 fix profiler dead lock
7 years ago
tensor-tang a50889f523 introduce xbyak
7 years ago
qiaolongfei 3f2aa91970 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into timeline-support-pure-cpu
7 years ago
qiaolongfei e008600b08 optimize code
7 years ago
qiaolongfei 7c649e06c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into timeline-support-pure-cpu
7 years ago
Sylwester Fraczek d74bb6ab9c fix ut for mkldnn 0.15 - added forcing layout NCHW in mkldnn conv tests
7 years ago
chenweihang b1dd4149b9 adjust enforce test cases
7 years ago
chenweihang 61052cdbc6 polish high frequency enforce error message
7 years ago
qiaolongfei 954d680b40 fix test_parallel_do.py
7 years ago
tensor-tang 836068569f Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
qiaolongfei 4c5bcd7859 add guard to profiler
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago