yuyang18 | 86a61c177f | Add ScopeBufferedSSAGraphExecutor | 7 years ago
yuyang18 | 7c777dd549 | Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy | 7 years ago
yuyang18 | 08295f9877 | Add build strategy | 7 years ago
yuyang18 | e5281b3c2d | Clean code & add execution strategy | 7 years ago
typhoonzero | 928418a9ac | Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op | 7 years ago
typhoonzero | f5840d8925 | follow comments | 7 years ago
chengduoZH | 97cb5479ae | change PE strategy | 7 years ago
typhoonzero | a529d790b6 | Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op | 7 years ago
typhoonzero | d9320dcd94 | complete code | 7 years ago
chengduoZH | c891189568 | update sparse gradient parameter with reduce and broadcast | 7 years ago
chengduoZH | 5ff1ef36ee | update sparse parameter | 7 years ago
yangyaming | 82571deb89 | Change `customize_loss_grad` to `use_default_grad_scale`. | 7 years ago
Yu Yang | 54ada9449e | Add demo for recordio train/test and parallel executor | 7 years ago
Yu Yang | 7a395881d4 | Add customize_loss_grad option to PE | 7 years ago
Yu Yang | 5305c5f845 | Correctly implement destructor of ParallelExecutor | 7 years ago
fengjiayi | fbe562478d | Merge pull request #9994 from reyoung/feature/debug | 7 years ago
    Fix bugs in local_scopes
Yu Yang | 06fb055a2f | New group | 7 years ago
Yu Yang | 71a2e6b73c | Reverse create var | 7 years ago
Yu Yang | 89728f8e66 | update | 7 years ago
Yu Yang | eb2e4eeade | Debug | 7 years ago
Yu Yang | b4aaa00a8a | Polish logic of ParallelExecutor | 7 years ago
Yu Yang | ad73b331c7 | Eagerly drop local scope in iteration (#9838) | 7 years ago
    * Eagerly drop local scope in iteration
    * Correct create var
    * Fix typo
    * Debug
fengjiayi | 90084a25d2 | Merge pull request #9743 from JiayiFeng/modify_readers_to_fit_parallel_executor | 7 years ago
    Modify readers to fit the parallel executor
wanghaoshuang | 19c1a68ee9 | Fix lost of LoD while splitting tensor in parallel executor. | 7 years ago
JiayiFeng | ee178d5aeb | fix bugs | 7 years ago
chengduoZH | 7e7611d067 | when the number of samples of current batch is less than the count of devices, let it crash. | 7 years ago
qingqing01 | 2b7e5bd366 | Support testing during training by ParallelExecutor. (#9738) | 7 years ago
    * Support testing during training by ParallelExecutor.
    * Add unit test.
    * Improve the interface.
    * Follow comments.
Xin Pan | 4bbfa9eccb | Add feed to ParallelExecutor | 7 years ago
Xin Pan | b123ce88a1 | Add enable/disable for delayed ops | 7 years ago
Xin Pan | d0ac92531d | Improve ParallelExecutor performance | 7 years ago
qiaolongfei | 9a101cfc08 | clean code | 7 years ago
qiaolongfei | 997e9a1fd2 | fix mac compile | 7 years ago
chengduoZH | 60d0a0594e | refine parallel | 7 years ago
Yu Yang | 3aa2a8ffcf | Follow comments | 7 years ago
Yu Yang | 02aaecca35 | Fix CPU compile | 7 years ago
Yu Yang | edfd741e3a | Add simple python wrapper for ParallelExecutor | 7 years ago
Yu Yang | a7b0d5bd26 | Clean code | 7 years ago
Yu Yang | e3144393e3 | Extract Executors to indie modules | 7 years ago
Yu Yang | c70b60dd70 | Make executor steal graph inside | 7 years ago
Yu Yang | 4c3361cda8 | Extract GraphExecutor | 7 years ago
Yu Yang | b123e43bf9 | extract multi devices graph builder | 7 years ago
Yu Yang | dd73d18bb7 | Extract SSAGraph | 7 years ago
Yu Yang | 79989c9025 | Add SSA builder | 7 years ago
Yu Yang | 64d7a30271 | Extract SSAGraph | 7 years ago
Yu Yang | 8dec4ad7a1 | Use int not Place for vars | 7 years ago
Yu Yang | 3181501013 | Rerange code | 7 years ago
Yu Yang | f28ae6e4b1 | Reorganize Code | 7 years ago
Yu Yang | 5c333e4143 | Add dctor for dev_ctx | 7 years ago
Yu Yang | 15f5f10ed5 | AddInput/AddOutput for OpHandle | 7 years ago
Yu Yang | 5368e50d84 | Reorganize code | 7 years ago