Commit Graph

243 Commits (8e4ad008fbced7ccea7a7351f999b31a8798754b)

Author SHA1 Message Date
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang 7355d41834 1. Add imperative gperf profiler
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
Qiyang Min 1f4aa7a202 Imperative remove all descs (#16045)
6 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
6 years ago
Qiyang Min 187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
6 years ago
minqiyang ac88c62a5b Reset output var's pre_op pointer when op was destructed
6 years ago
mozga-intel 68a9ead17a The flag of mkldnn is enabled iff it is necessary
6 years ago
minqiyang efb2f2baf8 Fix bugs
6 years ago
minqiyang b420ec3a92 invoke backward_hooks after reduce op's depcounts map
6 years ago
minqiyang 2b3510bc50 Add imperative python tracer
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
sneaxiy d331e97af8 fix compiler place compare
6 years ago
sneaxiy e6ff549849 small fix doc
6 years ago
sneaxiy 796e221efc fix api arg0
6 years ago
Zhen Wang bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
dzhwinter 381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
6 years ago
xuezhong eeaa2066e5 add device info to tensor
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031 6e84eb131f expose peak gpu memory API to python test=develop (#15529)
6 years ago
WangZhen 2175292634 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
6 years ago
WangZhen c67b29c178 fix some bugs of graph.to_program and get_pass.
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
6 years ago
Zeng Jinle 2480a3df7d
Merge pull request #15496 from sneaxiy/lazy_allocator2
6 years ago
Zeng Jinle dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
6 years ago
Xin Pan 58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
6 years ago
sneaxiy 51227bd447 lazy_allocator
6 years ago
minqiyang c8965dc1ab Polish code
6 years ago
sneaxiy ef788603d4 merge develop
6 years ago
Paddle CI 289aba750a Polish code
6 years ago
WangZhen b913463e83 Update according to the reviewers' suggestion. test=develop
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
flame d60751fb71
add python inference api (#15248)
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago
minqiyang 31a1cd8ce5 Align the first batch of gpu resnet
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen 451896fce4 init quantization.
6 years ago
minqiyang dbd4d058af Add static implementation and fix fc layer
6 years ago
Xin Pan e395f2c6a3 polish codes
6 years ago
Xin Pan 9a4314f025 imperative gan
6 years ago
Qiao Longfei cd31b90a46 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago