Zeng Jinle
f855c05fff
Merge pull request #13520 from sneaxiy/enhance_eager_delete
...
Enhance eager delete and sparse Adam
7 years ago
Yu Yang
3043f51b3a
Merge pull request #13511 from reyoung/fix_ce
...
Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
Wu Yi
29c63d180f
[Feature] dist op role and lr op role, to support memory optimize with dist training ( #13220 )
...
* wip
* clean up
* should fix running with memopt
* add ut
* mark lr schedule op role
* hide lr_schedule_guard
* use op_role_var instead of ufind
* unify dist test name
* wip for py3 support
* fix var deref
* fix python3 mem_opt order
* remove comments
7 years ago
sneaxiy
0a36ef3c18
enhance eager deletion
7 years ago
Yu Yang
a6c8d6b9a2
Revert "Merge pull request #13431 from chengduoZH/refine_lod"
...
This reverts commit bd79e04667
, reversing
changes made to 6b4d290c18
.
7 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
7 years ago
Xin Pan
23ba766252
Merge pull request #13475 from panyx0718/ir5
...
avoid creating dangling ir::Node.
7 years ago
Xin Pan
0bd7a67eaf
avoid creating dangling ir::Node.
...
Node should be created by Graph::CreateXXX so that
they are managed by graph.
7 years ago
chengduoZH
6534f8527a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_lod
7 years ago
Xin Pan
a83a4fab5c
Merge pull request #13441 from panyx0718/ir2
...
simplify and hide bcast_params
7 years ago
Xin Pan
e5b322051b
clean
7 years ago
chengduoZH
def2a8b1b0
Async memcpy
7 years ago
sneaxiy
612e1a3155
modification
7 years ago
sneaxiy
d0b2453ecd
merge develop
7 years ago
sneaxiy
24ea39c4c6
feature/eager_delete_tensor
7 years ago
Yancey1989
757f9683ab
update comment text
7 years ago
Yancey1989
9ee1b7bc04
add some comments
7 years ago
Yancey1989
bad4ea192e
update by comment
7 years ago
Yancey1989
1664899b63
update
7 years ago
Yancey1989
1e1b6622fd
update by comment
7 years ago
Yancey1989
5ce1a960a5
move bcast op into pass
7 years ago
Xin Pan
5ae052bd14
remove some unused codes
7 years ago
dzhwinter
ab1097cd8e
Feature/template ( #13093 )
...
* remove template operator
* "fix compile"
* "fix ci"
* "fix ci"
7 years ago
Yu Yang
819af27d63
Merge pull request #13079 from reyoung/feature/fix_bug
...
Fix bug when loss@GRAD is reused.
7 years ago
Wu Yi
0b8067c0dc
fix dist train reduce mode ( #13068 )
...
* fix dist train reduce mode
* fix previous fix
7 years ago
yuyang18
555f03d8f0
Fix bug when loss@GRAD is reused.
7 years ago
Wu Yi
0ee6fed05b
Refine dist rpc deps ( #12899 )
...
* refine dist train RPC deps
* clean up
* clean up
* fix ut
* remove input for fetch_barrier
* follow comments
7 years ago
Wu Yi
c6f212a339
Merge pull request #12847 from typhoonzero/add_scope_info_in_graphviz
...
Add scope info in graphviz debug
7 years ago
Wu Yi
b8da70c370
Resovle multi gpu async deps ( #12828 )
...
* dist transpiler add control dependency var between send and recv
* fix async deps
* follow comments and refine
* fix deps connect for rpc ops
7 years ago
typhoonzero
dd7a79158b
add scope info in graphviz debug
7 years ago
yuyang18
b8029fd650
Follow comments
7 years ago
yuyang18
05cadf1b24
Add FastExecutor
7 years ago
yuyang18
c6eb7a89ff
Merge branch 'feature/better_exception_holder' into feature/fast_executor
7 years ago
yuyang18
aac80ef4cc
Polish API of exception holder
7 years ago
yuyang18
d49763a87d
Stash
7 years ago
Xin Pan
626abfc33a
code clean up and renaming
...
Reduce one level of inheritence.
7 years ago
Yu Yang
c4f8afa258
Fix bug when memopt optimize loss.grad and use ParallelExecutor
7 years ago
chengduo
97a77512b4
Fix the order of sum ( #12562 )
...
* fix the order of sum
* add doc
* check whether need to copy
* follow comments
7 years ago
Xin Pan
caf10b474f
make profiler use thread_id from g_thread_id
...
Add a few more RecordEvent.
Cleanup
7 years ago
fengjiayi
bc1b7b96ec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_exception_holder
7 years ago
qiaolongfei
c8c8c01a23
fix mac build of graph_executor
7 years ago
fengjiayi
3e4083ed1f
Make exception handling of threaded_ssa_graph_executor an independent class
7 years ago
Xin Pan
4b8ae523c4
Merge pull request #12367 from panyx0718/ir_pass
...
Ir pass
7 years ago
Xin Pan
398cfb47b1
disable dist_se_resnext since it's not stable yet.
...
fix fluid_benchmark.py
7 years ago
Xin Pan
ab72d28a5e
clean up and correctness check
7 years ago
Xin Pan
aa1085ddc5
all passes
...
add doc
7 years ago
Xin Pan
e4d7d7ae8f
pass refactoring
7 years ago
Xin Pan
142e832d21
pass registration
7 years ago
qiaolongfei
64e7902530
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
qiaolongfei
754e96a30c
distribute lookup table work with parallel executor
7 years ago