2404 Commits (aa5307ce5b0856cc2eb21186c8bc4c9276638d41)

Author SHA1 Message Date
chengduo 794a195881
fix fuse optimizer ops (#17102)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
Zeng Jinle 842ded14b0
fix reference_count_pass,test=develop (#17060)
6 years ago
Tao Luo d9cd989825
Merge pull request #17048 from luotao1/fix_runtime_cache_bug
6 years ago
chengduo cc31681687
use fast executor as default (#17044)
6 years ago
chengduo a2be4b4d91
Add fuse momenutum ops (#16745)
6 years ago
luotao1 490e746269
fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
wopeizl 51a0243a56
fix nccl wrapper on windows
6 years ago
Zeng Jinle 1202d3fc74
Refine model gpu memory (#16993)
6 years ago
Yibing Liu 3c375751f8
Support seq len equal to 0 in sequence ops (#16935)
6 years ago
jiaqi 8bcba3db84
Merge pull request #16896 from xjqbest/develop
6 years ago
guru4elephant bbc6c5714f
Merge pull request #16887 from guru4elephant/add_nccl_context_pybind
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
dongdaxiang 2ab2869c2d
fix GPU compile error problem
6 years ago
dongdaxiang 466d177d09
add pybind dependency
6 years ago
xjqbest 10991e00a9
fix bug of num > INT_MAX
6 years ago
xjqbest 241120d94d
fix bug of num > INT_MAX
6 years ago
xjqbest dac70ad4c5
fix bug of num > INT_MAX
6 years ago
xjqbest 74471397cf
fix bug of num > INT_MAX
6 years ago
dongdaxiang b091139049
add nccl wrapper for python API
6 years ago
dongdaxiang fff795e5c8
add nccl_wrapper
6 years ago
乔龙飞 Qiao Longfei 82cff5ec42
Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder
6 years ago
Yibing Liu 4267a81afc
Correct the lod level of compiled time in lod_reset (#16790)
6 years ago
chengduo e9409665f7
Refine Fuse Optimize Ops (#16810)
6 years ago
chengduo d105c06b50
Replace ThreadedExecutor with FastThreadedExecutor (#16650)
6 years ago
Qiao Longfei 1526a3e4da
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Yihua Xu 93cedfdb9c
Fix the order while sorting the operators (#16756)
6 years ago
Qiao Longfei afc56949c1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
liuwei1031 85363848a1
Security issue (#16774)
6 years ago
guru4elephant aa46caf3d9
Merge pull request #16765 from guru4elephant/gpu_dataset_train
6 years ago
dongdaxiang 3c2d236815
remove all warnings
6 years ago
Yiqun Liu 112f16143b
Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
liuwei1031 2e07c19a9c
disable memory_optimize and inpalce strategy by default, test=develop (#16760)
6 years ago
dongdaxiang ea07eb8cd2
remove comment in data_feed.cc
6 years ago
dongdaxiang 05464e7c5c
add gpu training for Executor.train_from_dataset
6 years ago
Qiao Longfei 0608f8ca56
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Zeng Jinle 9f7b027dce
fix activation grad op desc maker (#16715)
6 years ago
liuwei1031 fdb719a1bf
avoid optimize variable used in subblock, test=develop (#16739)
6 years ago
liuwei1031 a18ef10c87
only use the latest version variable for inplace strategy (#16736)
6 years ago
Tao Luo 5c364cda3c
Merge pull request #16711 from luotao1/has_attr
6 years ago
chengduo 55b15db5af
Add unit test for fuse all_reduce ops (#16699)
6 years ago
luotao1 4098ba29ed
reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95
Merge branch 'develop' into has_attr
6 years ago
Tao Luo ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
luotao1 6afc97ca6b
reduce hasAttr elapsed time in RunImpl
6 years ago
gongweibao 8b793d0efd
Fix DGC bug. (#16697)
6 years ago
Yiqun Liu 3fe8cb0dd7
Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
xjqbest 6a57e8075a
remove trainer_id in datafeed and dataset
6 years ago
luotao1 695f2db6a0
update expected_kernel_cache_pass
6 years ago
luotao1 226596a296
Merge branch 'develop' into core_opt_choose_kernel
6 years ago