Commit Graph

2194 Commits (34890fd3b129f85f28489453ddd1d5f62dd526f7)

Author SHA1 Message Date
Qiao Longfei 97b143fb49 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
6 years ago
dzhwinter 381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
6 years ago
Qiao Longfei ffd0d1d216 clean need_broadcast_var_ test=develop
6 years ago
Qiao Longfei fbadd4b60c follow comment test=develop
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
baojun f4a0e68481 Fix ngraph compile WITH_DISTRIBUTE=ON (#15636)
6 years ago
Dun Liang 1905f1a108 bug fix && test=develop
6 years ago
Qiao Longfei 2171aa77f1 async ssa exe only support local mode
6 years ago
Qiao Longfei c4ded17e8c async mode support dist train
6 years ago
Qiao Longfei 84367cf8bc support async mode in dist mode parallel executor
6 years ago
Qiao Longfei e72637ddd2 ThreadedSSAGraphExecutor support num_iteration_per_run test=develop
6 years ago
Qiao Longfei a7152613f7 Merge branch 'fix-cpu-broadcast' of ssh://github.com/jacquesqiao/Paddle into add-communicator
6 years ago
Qiao Longfei 76072261f8 fix compiler
6 years ago
Qiao Longfei b99db0e2c2 cpu reduce mode did not need to broadcast test=develop
6 years ago
Qiao Longfei 5cf0092825 add more log and fix test_dist_base in multi_batch_merge_pass
6 years ago
Gabor Buella 4975a9050a Tests - add some missing to_string calls
6 years ago
Qiao Longfei b1fe8d4570 add a check for async_ssa_graph_exe test=develop
6 years ago
Qiao Longfei 16af1dbc7b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 381f383989 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
wopeizl c1e18b13aa
Merge pull request #15635 from wopeizl/fixbuildissue
6 years ago
dzhwinter 5d30b55de1 rerun ci. test=develop
6 years ago
dzhwinter 4ef34916a4 enhanced print message. test=develop
6 years ago
peizhilin 238ef94702 fix the build issue on gpu mode for win
6 years ago
dzhwinter ce0394bcd0 merge develop branch. test=develop
6 years ago
Xin Pan 74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
6 years ago
dzhwinter cca71532eb add skip send.recv test=develop
6 years ago
dzhwinter 9f001c6525 skip dist. test=develop
6 years ago
Yan Chunwei dc5e25fc7f
remove dot marked node (#15606)
6 years ago
dzhwinter 2561a6fc59 follow comment. test=develop
6 years ago
dzhwinter 2a5ecb68b0 follow comment. test=develop
6 years ago
dzhwinter 9f693fcac4 rerun ci. test=develop
6 years ago
dzhwinter e537634d16 delete graph print pass. test=develop
6 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031 6e84eb131f expose peak gpu memory API to python test=develop (#15529)
6 years ago
dzhwinter 5cab99a686 fuck windows. rerun windows ci. test=develop
6 years ago
dzhwinter 9c9ad7d40b Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
Yan Chunwei 897789b16e
fix save_inferece_model bug (#15365)
6 years ago
dzhwinter 9e87fbebb7 rerun windows ci. test=develop
6 years ago
dzhwinter 6f9904e99a rerun windows ci. test=develop
6 years ago
dzhwinter a52be7c081 refine build strategy. test=develop
6 years ago
dzhwinter 32a2014939 refine build strategy. test=develop
6 years ago
Krzysztof Binias b1bdcd4de8 Make separate folders for mkldnn codes
6 years ago
dzhwinter 06f2448848 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 8156fedf56 merge develop branch. test=develop
6 years ago
Qiao Longfei d6c0dcaa16 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Jiabin Yang fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
Qiao Longfei c7e3868007 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Qiao Longfei 02dab46ab8 add some debug info
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
dzhwinter d6d3e6afe2 add more skip strategy
6 years ago
Yan Chunwei b62b756b28
add version support (#15469)
6 years ago
tensor-tang 3c224e7e79
Merge pull request #15537 from baojun-nervana/rm_ngraph_operator
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Jiabin Yang 10bc9ffc2d
Merge pull request #15518 from JiabinYang/fix/refine_error_message
6 years ago
Qiao Longfei be738a646e add some debug infor
6 years ago
Qiao Longfei 62549e0714 add GenParentScopeTreeDebugInfo
6 years ago
dzhwinter 2739096eec compatibable with python side mem_opt
6 years ago
Qiao Longfei a66115bed5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei fab8457e6b code optimize
6 years ago
gongweibao d303270a0e
revert test=develop (#15535)
6 years ago
baojun-nervana 8e9308a51a mv ngraph_bridge to ngraph directory test=develop
6 years ago
baojun-nervana da3f9cc512 rm ngraph_operator.cc test=develop
6 years ago
Qiao Longfei ada43e89c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
gongweibao d54494ba87
cleanup test=develop (#15347)
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
nhzlx 0779e35544 fix two bug:
6 years ago
Qiao Longfei ca5d96bb3d complete send lod tensor
6 years ago
Zeng Jinle dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
6 years ago
Xin Pan 58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
6 years ago
Qiao Longfei be72940b76 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
sneaxiy ef788603d4 merge develop
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago
sneaxiy eac5a0aa0c Merge develop
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen e2ff300b02 add UT for quantization.
6 years ago
WangZhen 451896fce4 init quantization.
6 years ago
Qiao Longfei f3210b60ba fix copy_memory and share_memory
6 years ago
Qiao Longfei 9958775b31 add NewTmpScope to scope
6 years ago
Qiao Longfei 7021979bc2 init communicator
6 years ago
Qiao Longfei 69484f71e0 remote communicator
6 years ago
Qiao Longfei 88d71fa2f9 support num_iteration_per_run
6 years ago
gongweibao 7cd4dd7ce4
Hide varhandle members. (#15382)
6 years ago
Qiao Longfei ea66979684 can run
6 years ago
Qiao Longfei afda840126 init communicator
6 years ago
Qiao Longfei 92a6c7a049 init async ssa executor
6 years ago
tensor-tang 3759c1db8c
Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
6 years ago
mozga-intel cba729404d Enable softmax operator for a ngraph engine
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
乔龙飞 Qiao Longfei b14d4cdd75
Merge pull request #14890 from jacquesqiao/multithread-sparse-adam
6 years ago
Qiao Longfei 9b4fe283e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
peizhilin 5e450833bd test=develop
6 years ago
peizhilin eea75a1d93 fix issue when type is invalid
6 years ago
peizhilin 9adb158e5b Merge remote-tracking branch 'upstream/develop' into debug/support
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
chengduo 46d01d798e
Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang ca6fdc6e33 refine and fix test
6 years ago
tensor-tang a89296ac1f add repeated fc relu pass
6 years ago
Xin Pan 50b4ac08b0 fix
6 years ago
Xin Pan a1bfb35dd6 try fix py2
6 years ago
Xin Pan 6a18c0f9ff
Merge pull request #15278 from chengduoZH/revert_remove_workspace_handle_in_conv2d_cudnn
6 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
6 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
6 years ago
tensor-tang fc9fbab6a0
Merge pull request #15271 from tensor-tang/fix/typo
6 years ago
chengduo 064512aa47
Remove workspace_handle in conv_cudnn (#15186)
6 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
6 years ago
tensor-tang ab9c4b2a9f refine seqpool concat pass and remove unused nodes
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
flame fb63cd89d4
Add python ir graph API (#14917)
6 years ago
tensor-tang a0a27bd240 add seqpool concat fuse pass tester
6 years ago
sneaxiy 594dc4d8f0 partial gc 1st version
6 years ago
tensor-tang 8e086a8521 follow comment and fix typo
6 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
6 years ago
peizhilin c1235c935f add the enable_debug flag
6 years ago
Xin Pan 7b73fc9e1a
Merge pull request #15089 from panyx0718/api
6 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
6 years ago
Zeng Jinle e29f10d315
Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
6 years ago
Zeng Jinle 7b638f2781
Merge pull request #15218 from sneaxiy/fix_same_name_func
6 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
6 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
6 years ago
sneaxiy bc205ef374 fix same name func
6 years ago
xuezhong c0bc818688
Merge pull request #15188 from velconia/add_pyramid_dnn_support
6 years ago
sneaxiy 4a443ffc98 merge develop
6 years ago
sneaxiy 7c7342bf12 fix scope.var()
6 years ago
Tao Luo 4d9aa1745a
Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
6 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
6 years ago
minqiyang b76695418a Polish log
6 years ago
minqiyang 1bfbc0d963 Polish code
6 years ago
minqiyang 7f45b9511a Polish code
6 years ago
minqiyang 68a07328fa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
6 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
6 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
6 years ago
chengduo eabb2105fa
Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
6 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
6 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
baojun-nervana f0cde74564 Update ngraph with elt-wise relu test=develop
6 years ago
Xin Pan 8ae9094e07 polish and resolve conflicts
6 years ago
Xin Pan 5e928e579a try unify Executor and ParallelExecutor
6 years ago
Yan Xu a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
6 years ago
Yancey1989 4ad9de74dd disable sync nccl by default test=develop
6 years ago
Yancey1989 db603398b7 disable parallel graph executor by default
6 years ago
Xin Pan 087af6a686
Merge pull request #15131 from panyx0718/clean
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 94c80347b6 update by comment
6 years ago
Qiyang Min 23761beaef
Merge pull request #14971 from velconia/imperative_mnist
6 years ago
Wu Yi 227e0c4518
fix nccl2 mode startup test=develop (#15132)
6 years ago
Xin Pan 9186451f60 hide GetTensor
6 years ago
Yancey1989 35cda13e9f fix unittest test=develop
6 years ago
minqiyang 2547f9d1b8 Polish code
6 years ago
minqiyang 09e2e66236 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
minqiyang 858e903231 Add unittest for operator
6 years ago
wopeizl 7ab501264d
Merge pull request #15069 from wopeizl/windows/dsosupport
6 years ago
minqiyang 6a5f604607 Support stop_gradients var in imperative backward
6 years ago
guru4elephant ff739449ab
Merge pull request #15018 from guru4elephant/add_timer
6 years ago
Qiyang Min e29cbfe4f7
Merge pull request #14829 from velconia/accelerate_ddpg
6 years ago
Tao Luo 9c2cbfb89e
Merge pull request #15093 from baojun-nervana/intel/cmake
6 years ago
Zeng Jinle 25b49a0896
Merge pull request #14933 from sneaxiy/rewrite_ddim
6 years ago
Wu Yi a8bc05b5ff
Refactor distributed RPC (#15075)
6 years ago
baojun-nervana 555fbc10d8 upgrade ngraph to v0.10.1 test=develop
6 years ago
baojun-nervana c714c36482 simplify logic test=develop
6 years ago
Xin Pan 3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
6 years ago
sneaxiy 73896eeb94 merge develop
6 years ago
Wu Yi e26cced7cc
refine batch merge pass (#14777)
6 years ago
Yancey1989 4743c9cd5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy 9a3a246cb5 fix py35 compile error
6 years ago
Zhaolong Xing 4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
minqiyang ef7d563db9 Add changes back
6 years ago
minqiyang a318a490ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
chengduo fe8495a758
[WIP] Refine MultiDevSSAGraph (#15040)
6 years ago
dongdaxiang 82335cd88c Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Xin Pan b91a7a9d30 clear operator changes
6 years ago
Xin Pan f52b514dcd call kernel
6 years ago
Xin Pan 4e80e04f23 fix
6 years ago
Xin Pan 61491ce250 clean
6 years ago
Xin Pan ce7e503cbe refactor to avoid scope.
6 years ago
Qiyang Min 0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
Houjiang Chen 242d3c71a6
Merge pull request #15031 from hjchen2/develop
6 years ago
Xin Pan 71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
6 years ago
Qiao Longfei 3b294e2e2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
6 years ago
minqiyang 8ed0233924 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Zeng Jinle 9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
6 years ago
sneaxiy b56aca82e9 merge develop
6 years ago
sneaxiy ee83ce75bf try to fix py35 compile error
6 years ago
sneaxiy 3e917a934a add scope_pool
6 years ago
Yancey1989 1a4f79a7de fix unittest test=develop
6 years ago
Yancey1989 86bb583881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 495e73d766 enable gc
6 years ago
Yancey1989 28cdfbc2b0 delete comment code
6 years ago
Yancey1989 845bfd5807 cleanup code
6 years ago
peizhilin 2388d0e7d6 Revert "cherry-pick the #12759"
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
peizhilin 1e7f83e60a add cuda dso support for windows
6 years ago
tangwei12 dc8eca826e
code style fix, test=develop (#15045)
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
nhzlx a6aa8ea771 faster rcnn input is presistable. (fix it in paddle-trt)
6 years ago
hjchen2 956cf92145 Fix conv_elementwise_add2_act pass
6 years ago
Tao Luo 69659f4ae2
Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time
6 years ago
sneaxiy 179acc60b3 fix conflict with develop
6 years ago
wopeizl 09bd8fa67a
Merge pull request #15035 from wopeizl/debug/improvement1
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
dongdaxiang 2df1d80767 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
Wu Yi 856f0da0fe
Fp16 training (#14992)
6 years ago
Brian Liu e821b12f57 Fix issue which cause abnormal CPU usage in stack op
6 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago
nhzlx ce3782c193 add affine_channel fuse.
6 years ago
peizhilin 7f6d8acecb cherry-pick the #12759
6 years ago
sneaxiy 3a2afbf02e polish code
6 years ago
tensor-tang 05d1121b22
Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
6 years ago
tensor-tang 9d4f1d468a
Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
6 years ago
sneaxiy 68d91cd594 add copy ctor
6 years ago
dongdaxiang 3b3cb4ea55 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
sneaxiy e02f67eff7 rewrite unsafe_cast
6 years ago
minqiyang 68b86d6665 Change default value to align with the original react
6 years ago
dongdaxiang 2dee8f6cd5 add TrainFilesWithTimer in async_executor
6 years ago
dongdaxiang d434fcbaa6 add TrainFilesWithTimer in async_executor
6 years ago
minqiyang 250e893745 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Xin Pan 103f08f50e
Merge pull request #14910 from panyx0718/clean3
6 years ago
Zeng Jinle 0021b05b19
Merge pull request #14993 from sneaxiy/fix_check_lod
6 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
6 years ago
sneaxiy a30c5373eb use std::is_sorted
6 years ago
minqiyang 8149a07a41 Fix wait stream two times bug
6 years ago
sneaxiy b8051e7927 merge develop
6 years ago
Tao Luo df1e4e2f10 fix check_lod
6 years ago
minqiyang 0a4b6fc056 Remove unnessesary code
6 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
minqiyang 6fabbd8fb8 Polish code and remove spin lock
6 years ago
Zeng Jinle 95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
6 years ago
mozga-intel 7048caf9a0 Enable top_k operator for a ngraph
6 years ago
mozga-intel ecfa68ecaa Enable fill_constant operator for a ngraph
6 years ago
sneaxiy 600f6d8272 polish code
6 years ago
sneaxiy 7f6e513b1f fix mac ci bug
6 years ago
sneaxiy c1f7e54f62 merge develop
6 years ago