785 Commits (8ebffc78c9f999759a35921c71b83226200d8561)

Author SHA1 Message Date
Qiao Longfei ff01d70583 fix style
6 years ago
Qiao Longfei dab7f36909 optimize code test=develop
6 years ago
Qiao Longfei cf0511f21e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 43c82376cb use one graph
6 years ago
dzhwinter 660e410655 Merge pull request #15855 from dzhwinter/fix/nightly_test
6 years ago
Xin Pan a6e3cd5eb7 Merge pull request #15425 from panyx0718/api
6 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
dzhwinter a71f2fbe4f fix default value. test=develop
6 years ago
乔龙飞 Qiao Longfei ec8e878200 Merge pull request #15840 from jacquesqiao/revert-15684-revert-15661-fix-cpu-broadcast
6 years ago
Qiao Longfei 2b7931d5c9 refine code test=develop
6 years ago
Xin Pan 0362ef75f4 fix
6 years ago
Xin Pan 12a0e2ed9d polish codes
6 years ago
Xin Pan 19d78f6797 polish
6 years ago
Qiao Longfei ecedd531c1 fix code bug test=develop
6 years ago
Qiao Longfei f4f4816b0c fix gpu error test=develop
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Qiao Longfei 3f9263f67e optimize style test=develop
6 years ago
Qiao Longfei 4233d0a820 add more comment test=develop
6 years ago
Qiao Longfei 3bccc1e6e2 optimize broadcast logic test=develop
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Qiao Longfei 12f6b8c3d6 change the include of ThreadPool.h test=develop
6 years ago
Qiao Longfei 7f3be09045 fix multi graph test=develop
6 years ago
Qiao Longfei 31a05d3efd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 62f1248ff5 fix use gpu test=develop
6 years ago
Xin Pan 6019054cdd Merge pull request #15716 from Yancey1989/refine_pg
6 years ago
Dun a83e470405 Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Qiao Longfei cc71e89499 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
dzhwinter f2e8409f5a Merge pull request #15795 from dzhwinter/fix/block_desc
6 years ago
tensor-tang e1c707fe9c fix warnings (#15790)
6 years ago
Yancey1989 4b193db14c polish code test=develop
6 years ago
dzhwinter 6deb17ed8c fix default value. test=develop
6 years ago
dzhwinter 089d262c41 fix default value. test=develop
6 years ago
dzhwinter 9c92d0304f fix default value. test=develop
6 years ago
Yancey1989 d5090c892d polish code test=develop
6 years ago
Yancey1989 0f8bd73cc9 cleanup code test=develop
6 years ago
Yancey1989 5677c9d4ee update comment test=develop
6 years ago
Yancey1989 642fd68ce0 update by comment test=develop
6 years ago
dzhwinter d94a314db5 add reference. test=develop
6 years ago
dzhwinter 591ad33e32 polish code for reading. test=develop
6 years ago
dzhwinter 18afb77e78 polish code for reading. test=develop
6 years ago
dzhwinter 684b572307 polish code for reading. test=develop
6 years ago
dzhwinter 3787e61fca polish code for reading. test=develop
6 years ago
dzhwinter c1455e606d Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
6 years ago
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
Yancey1989 7cd6de37f5 fix cpu test=develop
6 years ago
Yancey1989 bd0d44af24 fix build failed test=develop
6 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
6 years ago
Yancey1989 73005ee00d cleanup code test=develop
6 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
6 years ago
Yancey1989 f3463ecb6e refine pg execution
6 years ago
dzhwinter 283573c6aa add details. test=develop
6 years ago
乔龙飞 Qiao Longfei 45b19cbc9a Revert "Revert "cpu reduce mode did not need to broadcast params test=develop""
6 years ago
dzhwinter 6d6ddcfe15 add details. test=develop
6 years ago
dzhwinter 11afbe0f53 add details. test=develop
6 years ago
tensor-tang e49706c80e Merge pull request #15659 from GBuella/add_to_string
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
乔龙飞 Qiao Longfei 6e0e706198 Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei 97b143fb49 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
6 years ago
Qiao Longfei ffd0d1d216 clean need_broadcast_var_ test=develop
6 years ago
Qiao Longfei fbadd4b60c follow comment test=develop
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
Qiao Longfei c4ded17e8c async mode support dist train
6 years ago
Qiao Longfei 84367cf8bc support async mode in dist mode parallel executor
6 years ago
Qiao Longfei e72637ddd2 ThreadedSSAGraphExecutor support num_iteration_per_run test=develop
6 years ago
Qiao Longfei a7152613f7 Merge branch 'fix-cpu-broadcast' of ssh://github.com/jacquesqiao/Paddle into add-communicator
6 years ago
Qiao Longfei 76072261f8 fix compiler
6 years ago
Qiao Longfei b99db0e2c2 cpu reduce mode did not need to broadcast test=develop
6 years ago
Qiao Longfei 5cf0092825 add more log and fix test_dist_base in multi_batch_merge_pass
6 years ago
Gabor Buella 4975a9050a Tests - add some missing to_string calls
6 years ago
Qiao Longfei b1fe8d4570 add a check for async_ssa_graph_exe test=develop
6 years ago
Qiao Longfei 16af1dbc7b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
dzhwinter 4ef34916a4 enhanced print message. test=develop
6 years ago
dzhwinter ce0394bcd0 merge develop branch. test=develop
6 years ago
dzhwinter cca71532eb add skip send.recv test=develop
6 years ago
dzhwinter 9f001c6525 skip dist. test=develop
6 years ago
dzhwinter 2561a6fc59 follow comment. test=develop
6 years ago
dzhwinter 2a5ecb68b0 follow comment. test=develop
6 years ago
dzhwinter e537634d16 delete graph print pass. test=develop
6 years ago
dzhwinter 5cab99a686 fuck windows. rerun windows ci. test=develop
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
dzhwinter 9e87fbebb7 rerun windows ci. test=develop
6 years ago
dzhwinter 32a2014939 refine build strategy. test=develop
6 years ago
dzhwinter 8156fedf56 merge develop branch. test=develop
6 years ago
Qiao Longfei 02dab46ab8 add some debug info
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
dzhwinter d6d3e6afe2 add more skip strategy
6 years ago
Qiao Longfei be738a646e add some debug infor
6 years ago
dzhwinter 2739096eec compatibable with python side mem_opt
6 years ago
Qiao Longfei a66115bed5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei fab8457e6b code optimize
6 years ago
gongweibao d303270a0e revert test=develop (#15535)
6 years ago
Qiao Longfei ada43e89c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
gongweibao d54494ba87 cleanup test=develop (#15347)
6 years ago
sneaxiy ef788603d4 merge develop
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago
sneaxiy eac5a0aa0c Merge develop
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen e2ff300b02 add UT for quantization.
6 years ago
WangZhen 451896fce4 init quantization.
6 years ago
Qiao Longfei 88d71fa2f9 support num_iteration_per_run
6 years ago
gongweibao 7cd4dd7ce4 Hide varhandle members. (#15382)
6 years ago
Qiao Longfei ea66979684 can run
6 years ago
Qiao Longfei 92a6c7a049 init async ssa executor
6 years ago
flame fb63cd89d4 Add python ir graph API (#14917)
6 years ago
sneaxiy 594dc4d8f0 partial gc 1st version
6 years ago
minqiyang 68a07328fa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
6 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
6 years ago
chengduo eabb2105fa Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yancey1989 4ad9de74dd disable sync nccl by default test=develop
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 94c80347b6 update by comment
6 years ago
Wu Yi 227e0c4518 fix nccl2 mode startup test=develop (#15132)
6 years ago
Yancey1989 35cda13e9f fix unittest test=develop
6 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
wopeizl 7ab501264d Merge pull request #15069 from wopeizl/windows/dsosupport
6 years ago
Yancey1989 4743c9cd5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
minqiyang ef7d563db9 Add changes back
6 years ago
minqiyang a318a490ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Zeng Jinle c0bcff00dc Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
chengduo fe8495a758 [WIP] Refine MultiDevSSAGraph (#15040)
6 years ago
Qiyang Min 0238a3bb4f Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
minqiyang 8ed0233924 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Yancey1989 1a4f79a7de fix unittest test=develop
6 years ago
Yancey1989 86bb583881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 845bfd5807 cleanup code
6 years ago
peizhilin 1e7f83e60a add cuda dso support for windows
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Wu Yi 856f0da0fe Fp16 training (#14992)
6 years ago
minqiyang 68b86d6665 Change default value to align with the original react
6 years ago
minqiyang 250e893745 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang 8149a07a41 Fix wait stream two times bug
6 years ago
minqiyang 0a4b6fc056 Remove unnessesary code
6 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
minqiyang 6fabbd8fb8 Polish code and remove spin lock
6 years ago
sneaxiy c1f7e54f62 merge develop
6 years ago
typhoonzero da87f7a698 Revert "[Feature] Fp16 training for resnet50 (#14850)"
6 years ago
chengduo 550e7e410b Code Clean parallel_executor.py (#14849)
6 years ago
Wu Yi 3d750f9c5a [Feature] Fp16 training for resnet50 (#14850)
6 years ago
minqiyang 679d1a9e0b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
sneaxiy ae6f46a1a9 rewrite variable type
6 years ago
dzhwinter 7cd24b1318 add ir memory optimize. (#14530)
6 years ago
Yancey1989 06936a2ff5 fix 1gpu test=develop
6 years ago
Yancey1989 d3a4da5cf6 fix comment test=develop
6 years ago
Yancey1989 49870f507d delete unused code test=develop
6 years ago
Yancey1989 a7d6b1f921 code cleanup test=develop
6 years ago
Yancey1989 a760a550b0 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 fd144954ed redefine api test=develop
6 years ago
minqiyang 81651fca45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
gongweibao 0b1c7d838c Add brpc serialization support. (#11430)
6 years ago
Yancey1989 4a4ccac1d0 update by comment test=develop
6 years ago
minqiyang 3bb9b0cac2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang ad6ae0b071 1. Add SpinLock
6 years ago
Yancey1989 c722b1dcb6 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yu Yang 4ecdb6f486 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
sneaxiy 2328bee1cc fix Windows compile bug
6 years ago
Yancey1989 23eb8c4299 fix ci test=develop
6 years ago
Yancey1989 106e285236 add unittest for parllelgraph mode test=develop
6 years ago
minqiyang b75bd29c3a Remove debug info
6 years ago
Yu Yang c00e07cda0 Fix distribute compile
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
Yancey1989 82726402be exception safe
6 years ago
minqiyang 570338699b Add debug info
6 years ago
sneaxiy 7923042365 merge develop
6 years ago
Yancey1989 79082c9459 fix pyreader failed
6 years ago
Yancey1989 2dda19f756 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
gongweibao f1fb64b17f Add reduce sparse tensor feature. (#14757)
6 years ago
sneaxiy eb8252466b polish code
6 years ago
Yancey1989 47740ace28 fix performance
6 years ago
Yancey1989 220db4f334 clean code
6 years ago
Yancey1989 cb8a24be14 clean code
6 years ago
sneaxiy 387bac46b5 refine code
6 years ago
Yancey1989 c9de6f1b05 init parallel graph mode
6 years ago
Wu Yi 29d9fb53fc [Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
6 years ago
sneaxiy e694d0c2e4 fix while_op eager deletion bug
6 years ago
sneaxiy 35a2578426 fix bug
6 years ago
zhang wenhui abbe382e1e Revert "Add EstiminateFlops"
6 years ago
sneaxiy 64ad051b9a merge develop
6 years ago
sneaxiy c47c451a00 fix bug
6 years ago
Yu Yang 0f0e197914 Merge pull request #14666 from reyoung/feature/estiminate_flops
6 years ago
Wang Guibao 41e19eb431 AsyncExecutor (#14627)
6 years ago
sneaxiy 096673f675 refactor eager deletion
6 years ago
Yu Yang 589b863b98 Add EstiminateFlops
6 years ago
Qiao Longfei 8ebde595c9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
Qiao Longfei 145c535750 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
gongweibao 867c312bc4 Fix allreduce dependency order. (#14586)
6 years ago
Qiao Longfei 9589babe12 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
Qiao Longfei af2f5fc824 fix some bugs
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
peizhilin 703b26e697 add profiler, parallel_executor back
6 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Xin Pan 8cfda7ee0c Merge pull request #14382 from panyx0718/fix4
6 years ago
Zeng Jinle 38d32c98b8 merge develop
6 years ago
sneaxiy eb18d532a5 fix num_threads in fast_pe
6 years ago
Xin Pan 759ffca423 some improvements
6 years ago
Xin Pan 99dffb91d6 allow to repeatedly share and update BuildStrategy
6 years ago
sneaxiy d231e55065 merge develop
6 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
chengduo c5b6573a5a Fix input<tensor> (#14208)
6 years ago
minqiyang 3db9fad764 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
Xin Pan ff6c809bfc Merge pull request #14251 from panyx0718/fix
6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
6 years ago
Xin Pan 59c66532e7 add more logs and comments
6 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang c28beb8a3c test(Pe): add dry run tests for pe (#14254)
6 years ago
Xin Pan cb2d33a851 resolve conflict
6 years ago
Xin Pan 8c11d3fed6 clean up
6 years ago
Xin Pan 0a89650507 fix more tests
6 years ago
Xin Pan a3b27e3237 fix
6 years ago
Xin Pan f25eb9a71d fix some tests.
6 years ago
Xin Pan adf5615e54 clean kGraphOp
6 years ago
Xin Pan 2e14999942 clean1
6 years ago
Xin Pan 34b401fc6c clean up a global graph attr.
6 years ago
Zeng Jinle 8ac2242b6e Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Wu Yi 306236c2c0 feature/DC asgd (#12722)
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
sneaxiy 203027ca86 test=develop
6 years ago
sneaxiy cf1944af2a test=develop
6 years ago
Yan Xu d10b8efcc0 Merge pull request #14152 from Yancey1989/add_fused_broadcast_unittest
6 years ago
Yu Yang c21597cf07 fix(PE): use shared_ptr<BlockingQueue> for cross thread communication (#14136)
6 years ago
chengduo ed087f8232 refine op_handle (#14178)
6 years ago
Yancey1989 6bfa6a0a33 add fused broadcast op unit test, test=develop
6 years ago
sneaxiy 2494ca83ab test=develop
6 years ago
sneaxiy f2eed667c0 test=develop
6 years ago
sneaxiy cef8cc81db merge develop
6 years ago
Qiao Longfei 3d4e050802 fix compile, optimize code test=develop
6 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Qiao Longfei d26ff8cb2d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
Wu Yi 26200f2e42 [1.1] [project] train imagenet using large batch size (#13766)
6 years ago
sneaxiy 2414f92f54 test=develop
6 years ago
sneaxiy 45559d042c move to pass
6 years ago
sneaxiy a314a80cdb merge develop
6 years ago
seiriosPlus c34610f86d Fix lookup table at CPU Reduce strategy, test=develop
6 years ago
Qiao Longfei fad42fe7cc broadcast handle not inited parameter
6 years ago
sneaxiy 1af3fe8c35 test=develop
6 years ago
Xin Pan d5d09672c8 better fix
6 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops
6 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug
6 years ago
sneaxiy fb6201e93e test=develop
6 years ago
sneaxiy d87569134c test=develop
6 years ago