Commit Graph

383 Commits (d93b2d0365355430f3db723dc3e278851b7a88b4)

Author SHA1 Message Date
chengduozh 82d2903b63 Fix fast ParallelExe bug
6 years ago
sneaxiy 2002e71da8 fix pinned allocator
6 years ago
sneaxiy 21fdf8e87d add unittest for allocator_facade.cc
6 years ago
gongweibao 078223b3e3
Add rpc timeline. (#13900)
6 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
6 years ago
Yibing Liu 6b795d424c
Merge pull request #13901 from kuke/seq_slice_py
6 years ago
dzhwinter e41a3fcd68 fix update to develop hang problem.
6 years ago
Qiao Longfei 681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
6 years ago
Yibing Liu 16b2c6dc78 Add py api for sequence_slice_op
6 years ago
chengduo 2c9839c847
add cuda version display (#13885)
6 years ago
wanghaoshuang 3ae9645084 compile in linux
6 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
6 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
6 years ago
Qiao Longfei 5428cb9908
Profiler support merge data of all thread (#13811)
6 years ago
sneaxiy 4c672ab1a2 Merge reyoung:rewrite_allocation
6 years ago
tensor-tang ea7dc9cbf6 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
6 years ago
Xin Pan ab798a2832 clarify the fraction_of_gpu_memory flag
6 years ago
Yu Yang 15076c325e Add comments and polish code style
6 years ago
Yu Yang 29f66c2408 Polish code
6 years ago
Yu Yang 8e3fdc6e65 Fix SetDevice on init
6 years ago
Yu Yang 524f6e9b36 Refine code
6 years ago
Yu Yang 5cf395beaf Fix bug in uts
6 years ago
dzhwinter 2d00e65819
namespace issue (#13543)
6 years ago
Yu Yang 58ed412f68 refactor(memory): rewrite memory allocation and make it extentable
6 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
6 years ago
tensor-tang dee5d35c20 refine vmul
6 years ago
chengduo 1d91a49d2f
Some trivial optimization (#13530)
6 years ago
dzhwinter 7806c5625f
fix enforce (#13544)
6 years ago
dzhwinter 97636a9fcf
"fix link error" (#13545)
6 years ago
sneaxiy 612e1a3155 modification
7 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
dzhwinter 85f8dd1c77 debug version
7 years ago
dzhwinter e1999538eb debug the device context
7 years ago
dzhwinter 372caf4000 windows staff
7 years ago
dzhwinter c3e1fb5a3e add demo
7 years ago
Krzysztof Binias 2ed7982d09
Merge pull request #13327 from kbinias/kbinias/conv-weights-converted-once
7 years ago
Krzysztof Binias accdecc681 Correcting Lint errors
7 years ago
Krzysztof Binias 1ce9e9dc30 Renaming decision variable
7 years ago
Krzysztof Binias 1658958fe6 Reusing converted weights
7 years ago
Yang Yu 8331e835a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_CudnnHolder_bug
7 years ago
Wu Yi f90c7865f0
Benchmark tool for imgnet (#12305)
7 years ago
JiabinYang e322fc4e0e add error info for nccl not found
7 years ago
fengjiayi 7b577b92e0 fix a memory bug in CudnnHolder
7 years ago
fengjiayi 82a1b35b9b Revert "Revert "Add CudnnHolder and use it in Conv and ConvTranspose op""
7 years ago
guochaorong 151e169eb7
Revert "Add CudnnHolder and use it in Conv and ConvTranspose op"
7 years ago
dzhwinter 6fb28796f5
memory (#13143)
7 years ago
dzhwinter f05520060e
fix style (#13142)
7 years ago
fengjiayi 0236966b68 follow commits
7 years ago
fengjiayi 5398e1a3a6 fix bugs
7 years ago
dzhwinter dbe90cc0f6 merge develop branch
7 years ago
fengjiayi f79ca23115 fix bugs
7 years ago
fengjiayi c501826f42 use framework::RWLock
7 years ago
fengjiayi 1f36a4c27c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CudnnHolder
7 years ago
fengjiayi b0aca8824d make CudnnHolder thread safe
7 years ago
luotao1 7169f9378c fix mkldnn include format
7 years ago
fengjiayi 15cc9128be fix compile error
7 years ago
fengjiayi 407ff0bdbc use CudnnHolder in conv_cudnn_op
7 years ago
fengjiayi 04bfd5c10c add CudnnHolder to manage cudnn_handle and workspace
7 years ago
Yan Chunwei 902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
dzhwinter b78394ea57 done
7 years ago
dzhwinter b74af56bbc cpu compile is done
7 years ago
dzhwinter 78aab05b71 fix more op errors
7 years ago
dzhwinter cd8f3e9ed0 operator module is done
7 years ago
dzhwinter d361624c1d
platform module (#12932)
7 years ago
dzhwinter 2ec589a24e float.h fixed
7 years ago
dzhwinter 7dceb8a080 check some operators
7 years ago
dzhwinter d7f98f37a7 more platform is done
7 years ago
dzhwinter efd0884fa9 add op registry
7 years ago
dzhwinter eca4563e5d
operators module (#12938)
7 years ago
dzhwinter 488a2dd2e8 with ir node
7 years ago
dzhwinter cfbf1ba305 add source
7 years ago
dzhwinter c1ad52f768 pre-commit
7 years ago
dzhwinter 89f95ea25e merge develop branch
7 years ago
dzhwinter 34f8c9b6f5 windows port
7 years ago
tensor-tang 0d46f518ae refine avx condition and warning
7 years ago
tensor-tang 4e538db14d refine jit space
7 years ago
tensor-tang ec59f0d454 add cpu vec
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
Michał Gallus cd32ddac12 Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669)
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
dzhwinter 17602eab94 windows port of malloc
7 years ago
dzhwinter 2673798ddb
"fix float16 ShuffleDownSync Bug" (#12756)
7 years ago
dzhwinter 5c88cd2af5 remove werror in windows
7 years ago
dzhwinter 64ce1210aa "windows support"
7 years ago
dzhwinter 36878d78cc comment out backtarce
7 years ago
dzhwinter 335398f18b dlfnh
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang ff92b6ba81
Merge pull request #12531 from tensor-tang/refine/op/gru
7 years ago
Chen Weihang 1e961b145c
Merge pull request #12591 from chenwhql/enforce_msg_polish
7 years ago
Yan Chunwei 0a641ba326
add ratio to profiler (#12701)
7 years ago
tensor-tang c588c64a76 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
chenweihang da39d84a48 refine by reviewer's advice
7 years ago
tensor-tang 1ab1d03c62 fix missing macro condition
7 years ago
Qiao Longfei e8fcb71bed
Merge pull request #12620 from jacquesqiao/timeline-support-pure-cpu
7 years ago
tensor-tang 3bf3e77ac8 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago