Commit Graph

240 Commits (a6072055be98fe21c898ad4685faaedd591c6b93)

Author SHA1 Message Date
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Qiao Longfei 728bbaa4e3
add cache_update_mutex_ for operator test=develop (#17124)
6 years ago
Tao Luo 5babcd02dd
Revert "remove unnecessary prepare_data (#17080)" (#17432)
6 years ago
Zeng Jinle 5e5e7b3305
fix data_type error message (#17312)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao a61ed9782e
fix log level test=develop (#16554)
6 years ago
liuwei1031 278debab71
fix comments of 16410, test=develop (#16499)
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
6 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
Liu Yiqun 1041e18c47 Refine codes.
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
Jiabin Yang fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago
乔龙飞 Qiao Longfei b14d4cdd75
Merge pull request #14890 from jacquesqiao/multithread-sparse-adam
6 years ago
peizhilin 5e450833bd test=develop
6 years ago
peizhilin eea75a1d93 fix issue when type is invalid
6 years ago
peizhilin c1235c935f add the enable_debug flag
7 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
7 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
7 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
7 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Xin Pan 3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
7 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
7 years ago
Xin Pan b91a7a9d30 clear operator changes
7 years ago
Xin Pan f52b514dcd call kernel
7 years ago
Xin Pan 4e80e04f23 fix
7 years ago
Xin Pan 61491ce250 clean
7 years ago
Xin Pan ce7e503cbe refactor to avoid scope.
7 years ago
Qiyang Min 0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
7 years ago
Qiao Longfei 3b294e2e2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
7 years ago
peizhilin 2388d0e7d6 Revert "cherry-pick the #12759"
7 years ago
Tao Luo 69659f4ae2
Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time
7 years ago
sneaxiy dde3afe7b7 Merge develop
7 years ago
Brian Liu e821b12f57 Fix issue which cause abnormal CPU usage in stack op
7 years ago
peizhilin 7f6d8acecb cherry-pick the #12759
7 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
sneaxiy c1f7e54f62 merge develop
7 years ago
Qiao Longfei d76bda50c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Xin Pan 969ad966ba all converted
7 years ago
minqiyang 679d1a9e0b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
sneaxiy 74a8e6b032 merge develop
7 years ago
Xin Pan 1fe3ac352a move more and fix while
7 years ago
sneaxiy ae6f46a1a9 rewrite variable type
7 years ago
Xin Pan 9ef8a76873 convert more
7 years ago
Xin Pan 876993887b convert more interface to avoid scope
7 years ago
Xin Pan 8c19f0bfe3 fix
7 years ago
Xin Pan 4dd61e7260 convert GetInputVarPtrs and GetOutputVarPtrs
7 years ago
Xin Pan 52d3903a12 fix
7 years ago
Xin Pan 0e0983cc1d convert more infer shape
7 years ago
Xin Pan 62eb43ba98 convert more
7 years ago
Xin Pan f897bd16c0 clean
7 years ago
Xin Pan 70981f5d79 clean
7 years ago
Xin Pan fb8ae30331 fix
7 years ago
Xin Pan eaf8ba35b5 change input
7 years ago
Xin Pan 840e6729e2 inject context
7 years ago
Xin Pan bbff0df320 try cache variables
7 years ago
Xin Pan 52bc4ee75a delay infer scope
7 years ago
Qiao Longfei 3bd54ed769 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei fac8702269 adam support multithread
7 years ago
minqiyang 3bb9b0cac2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
7 years ago
minqiyang ad6ae0b071 1. Add SpinLock
7 years ago
Yu Yang 4ecdb6f486 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
minqiyang b75bd29c3a Remove debug info
7 years ago