Commit Graph

191 Commits (ab57d3893ea2cfe8b002ed4a82e88a0d40b2f1e8)

Author SHA1 Message Date
Leo Zhao ff77dea969 not use transferscope cache in cpu case (#18578)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Qiao Longfei 728bbaa4e3
add cache_update_mutex_ for operator test=develop (#17124)
6 years ago
Tao Luo 5babcd02dd
Revert "remove unnecessary prepare_data (#17080)" (#17432)
6 years ago
Zeng Jinle 5e5e7b3305
fix data_type error message (#17312)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao a61ed9782e
fix log level test=develop (#16554)
6 years ago
liuwei1031 278debab71
fix comments of 16410, test=develop (#16499)
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
6 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
Liu Yiqun 1041e18c47 Refine codes.
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
Jiabin Yang fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago