Commit Graph

196 Commits (569951c418fb3c9f82cbdde9fda3910cc7033bff)

Author SHA1 Message Date
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
6 years ago
Zeng Jinle 0af8549750 fix seg fault of share lod, test=develop (#19573)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
Leo Zhao ff77dea969 not use transferscope cache in cpu case (#18578)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Qiao Longfei 728bbaa4e3
add cache_update_mutex_ for operator test=develop (#17124)
6 years ago
Tao Luo 5babcd02dd
Revert "remove unnecessary prepare_data (#17080)" (#17432)
6 years ago
Zeng Jinle 5e5e7b3305
fix data_type error message (#17312)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
gongweibao a61ed9782e
fix log level test=develop (#16554)
6 years ago
liuwei1031 278debab71
fix comments of 16410, test=develop (#16499)
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Zeng Jinle c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
sneaxiy 7000ec85d9 fix some op grad maker
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
7 years ago
luotao1 bfdab00e5b Merge branch 'develop' into core_opt_choose_kernel
7 years ago
luotao1 6c6a39222b Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel
7 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
7 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
7 years ago
luotao1 cc0ae1f1a1 refine with comments
7 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
7 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
7 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
7 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
7 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
7 years ago
luotao1 fe78a92e6e refine with comments
7 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
7 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
7 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
7 years ago
Liu Yiqun 1041e18c47 Refine codes.
7 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
7 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
7 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
7 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
7 years ago
luotao1 9773f38f99 cache runtime_context
7 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
7 years ago
Xin Pan 5dd281f738 polish
7 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
7 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
7 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
7 years ago