183 Commits (bc037c13c728f6d17d48408f986c78dea11d4a89)

Author SHA1 Message Date
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao a61ed9782e fix log level test=develop (#16554)
6 years ago
liuwei1031 278debab71 fix comments of 16410, test=develop (#16499)
6 years ago
gongweibao eb83abeac3 Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle c7c6eeb44e Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
liuwei1031 8d22bc17a4 Memory optimize (#16410)
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
6 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83 Revert "cache runtime_context"
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
Tao Luo 4ef6f738c3 Merge pull request #16154 from luotao1/infershape_example
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
wopeizl 85709f4378 restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
Liu Yiqun 1041e18c47 Refine codes.
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7 add op type in check nan/inf (#15986)
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Dun a83e470405 Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
chengduo ad61e1b22c fix potential bug (#15688)
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
Jiabin Yang fd286f3596 Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago
乔龙飞 Qiao Longfei b14d4cdd75 Merge pull request #14890 from jacquesqiao/multithread-sparse-adam
6 years ago
peizhilin 5e450833bd test=develop
6 years ago
peizhilin eea75a1d93 fix issue when type is invalid
6 years ago
peizhilin c1235c935f add the enable_debug flag
6 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
6 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
6 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago