Commit Graph

248 Commits (f8ed2c229e01863798543ba3cde922726e08696d)

Author SHA1 Message Date
sneaxiy f8ed2c229e try to fix ci error
7 years ago
sneaxiy a93a9eef8f add op registry type
7 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
7 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
7 years ago
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
7 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
7 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
7 years ago
minqiyang 7355d41834 1. Add imperative gperf profiler
7 years ago
minqiyang 42e96a029f Accelerate CPU part
7 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
7 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
7 years ago
Qiyang Min 1f4aa7a202 Imperative remove all descs (#16045)
7 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
7 years ago
Qiyang Min 187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
7 years ago
minqiyang ac88c62a5b Reset output var's pre_op pointer when op was destructed
7 years ago
mozga-intel 68a9ead17a The flag of mkldnn is enabled iff it is necessary
7 years ago
minqiyang efb2f2baf8 Fix bugs
7 years ago
minqiyang b420ec3a92 invoke backward_hooks after reduce op's depcounts map
7 years ago
minqiyang 2b3510bc50 Add imperative python tracer
7 years ago
Xin Pan 32d5a16036 resolve conflicts
7 years ago
Xin Pan 26e32e095a allow compiler to use graph
7 years ago
sneaxiy d331e97af8 fix compiler place compare
7 years ago
sneaxiy e6ff549849 small fix doc
7 years ago
sneaxiy 796e221efc fix api arg0
7 years ago
Zhen Wang bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
7 years ago
dzhwinter 381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
7 years ago
xuezhong eeaa2066e5 add device info to tensor
7 years ago
dzhwinter 04e9776aef add details. test=develop
7 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
7 years ago
liuwei1031 6e84eb131f expose peak gpu memory API to python test=develop (#15529)
7 years ago
WangZhen 2175292634 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
7 years ago
WangZhen c67b29c178 fix some bugs of graph.to_program and get_pass.
7 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
7 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
7 years ago
Zeng Jinle 2480a3df7d
Merge pull request #15496 from sneaxiy/lazy_allocator2
7 years ago
Zeng Jinle dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
7 years ago
Xin Pan 58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
7 years ago
sneaxiy 51227bd447 lazy_allocator
7 years ago
minqiyang c8965dc1ab Polish code
7 years ago
sneaxiy ef788603d4 merge develop
7 years ago
Paddle CI 289aba750a Polish code
7 years ago
WangZhen b913463e83 Update according to the reviewers' suggestion. test=develop
7 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
7 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
7 years ago
flame d60751fb71
add python inference api (#15248)
7 years ago
dzhwinter 8f3b252392 squash commits. test=develop
7 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
7 years ago
minqiyang 31a1cd8ce5 Align the first batch of gpu resnet
7 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
7 years ago