chengduo
4ba6d062f8
Fix fluid.data of executor ( #20365 )
...
* fix fluid.data
test=document_fix
* fix doc batch dim
test=document_fix
6 years ago
chengduo
d77e770353
Polish executor doc ( #20244 )
...
* polish executor doc
test=develop
6 years ago
flame
fe18cfdb4f
hide with inference optim API ( #17355 )
6 years ago
chengduo
4278518fb0
Update CompiledProgram ( #18919 )
...
* use PE for compiler
test=develop
6 years ago
chengduo
3f4c088ad8
prune the feed op in compiler ( #18997 )
...
test=develop
6 years ago
chengduo
d23603322e
Remove compile from PE ( #19080 )
...
* remove compile from PE
test=develop
6 years ago
Zeng Jinle
a802da650b
Feature/mem opt pass refactor ( #18735 )
...
* first version memory optimize pass, test=develop
* remove move_tensor_sharing_pass, test=develop
* refine code comments, add unittests, test=develop
* turn off memory_optimize by default, test=develop
* follow huihuang's comments, test=develop
* follow chengduoZH's comments, test=develop
* fix grammar error, add const qualifier, fix pass_test exception message, test=develop
* follow chengduoZH's comments 2nd, test=develop
6 years ago
gongweibao
c0a82748cf
Polish backwards optimizer dependency codes and use more default values. ( #18255 )
6 years ago
Zeng Jinle
d3003a1620
Feature/buffer_shared_inplace ( #17911 )
...
* feature/buffer_shared_inplace, test=develop
* refine code, test=develop
* fix elementwise_add op cpu inplace and sum inplace bug, test=develop
* add unittest and debug log, test=develop
* fix parallel_executor scope bug, polish code, test=develop
* fix sum op, activation op, single_in_place_inference bug, test=develop
* remove kLocalExecScopeName, test=develop
* fix unittest,test=develop
* fix out_var first version bug, test=develop
* follow comments,test=develop
6 years ago
chengduo
8ed33bf91f
Fix Bug-prone code of PE ( #18354 )
...
* update pe reduce config
test=develop
* drop the local_exe_scopes of the previous parallel_executor
test=develop
6 years ago
liuwei1031
4151d90c16
improve the hint message of memory optimize, test=develop ( #18260 )
6 years ago
whs
5df65e506d
Add Ligth-NAS for PaddleSlim ( #17679 )
...
* Add auto pruning strategy.
1. Fix compressor.
2. Enhence graph executor.
3. Add SAController
4. Add auto pruning strategy.
5. Add unitest for auto pruning strategy.
test=develop
* Init light-nas
* Add light nas.
* Some fix.
test=develop
* Fix sa controller.
test=develop
* Fix unitest of light nas.
test=develop
* Fix setup.py.in and API.spec.
test=develop
* Fix unitest.
1. Fix unitest on windows.
2. Fix package importing in tests directory.
* 1. Remove unused comments.
2. Expose eval_epoch option.
3. Remove unused function in search_agent.
4. Expose max_client_num to yaml file.
5. Move flops constraint to on_epoch_begin function
test=develop
* Fix light nas strategy.
test=develop
* Make controller server stable.
test=develop
* 1. Add try exception to compressor.
2. Remove unitest of light-nas for windows.
test=develop
* Add comments
Enhence controller
test=develop
* Fix comments.
test=develop
6 years ago
gongweibao
65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. ( #17263 )
6 years ago
chengduo
8f534696b7
Polish Executor and Compiler doc ( #17262 )
...
* polish doc
test=develop
* updata parallel executor doc
test=develop
* update API.spec
test=develop
* polish code
test=develop
6 years ago
Zeng Jinle
1202d3fc74
Refine model gpu memory ( #16993 )
...
* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop
* refine models gpu mem
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop
* follow comments
test=develop
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
sneaxiy
bb166a1e10
fix API.spec
...
test=develop
6 years ago
sneaxiy
3a09693f5c
change API name
...
test=develop
6 years ago
sneaxiy
161b8ddcaa
Merge develop
6 years ago
sneaxiy
4b073c95dc
fix compiler
...
test=develop
6 years ago
qingqing01
8ad672a287
Support sync batch norm. ( #16121 )
...
* Support Sync Batch Norm.
* Note, do not enable it in one device.
Usage:
build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
loss_name=loss_mean.name,
build_strategy=build_strategy)
6 years ago
sneaxiy
ac0e0f5181
merge develop
...
test=develop
6 years ago
Yan Xu
30568473ec
fix broadcast on mp mode ( #15951 )
...
* fix broadcast with mp mode
* polish code test=develop
* fix bcast strategy test=develop
* fic cpplint test=develop
* fix py3 failed test=develop
* fix comment test=develop
* update comment test=develop
6 years ago
Zhen Wang
4ab0607634
fix NaN bugs.
6 years ago
sneaxiy
b80d76f784
merge develop
6 years ago
chengduo
2f1b3afa6f
fix compiler_py bug ( #16122 )
...
test=develop
6 years ago
liuwei1031
caadd0581d
add IfElse test case for ir memory optimize ( #15998 )
...
* add ir memory optimize test case for IfElse op, test=develop
* fix some unitttest failure by force using the python memory_optimize, test=develop
* tweak comments, test=develop
* fix unittest, test=develop
* fix unittest, test=develop
6 years ago
chengduo
ae37f82964
Unified ParallelExecutor and Compiler ( #15970 )
...
* Unified ParallelExecutor and Compiler
6 years ago
sneaxiy
69b1ebdfa5
merge develop
...
test=develop
6 years ago
Xin Pan
a6e3cd5eb7
Merge pull request #15425 from panyx0718/api
...
Pass graph to parallel executor instead of program
6 years ago
sneaxiy
a8c4324d3c
fix hang bug
6 years ago
Xin Pan
8b1672fe76
follow comments
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
ec8e878200
Merge pull request #15840 from jacquesqiao/revert-15684-revert-15661-fix-cpu-broadcast
...
fix cpu broadcast
6 years ago
Qiao Longfei
2b7931d5c9
refine code test=develop
6 years ago
sneaxiy
1e4c0a6f72
merge develop
6 years ago
Xin Pan
32d5a16036
resolve conflicts
...
test=develop
6 years ago
Xin Pan
26e32e095a
allow compiler to use graph
...
test=develop
6 years ago
Zeng Jinle
c420375185
Merge pull request #15806 from sneaxiy/fix-compiler
...
Fix compiler.py place compare bug
6 years ago
sneaxiy
7160cb0f32
decoupled reader
...
test=develop
6 years ago
sneaxiy
d331e97af8
fix compiler place compare
...
test=develop
6 years ago
dzhwinter
07ee40c6e9
fix default value. test=develop
6 years ago
dzhwinter
283573c6aa
add details. test=develop
6 years ago
乔龙飞 Qiao Longfei
45b19cbc9a
Revert "Revert "cpu reduce mode did not need to broadcast params test=develop""
6 years ago
dzhwinter
51d1e8cd06
add details. test=develop
6 years ago
乔龙飞 Qiao Longfei
6e0e706198
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei
16ec4b8c8b
clean code test=develop
6 years ago
Qiao Longfei
abf17226f8
fix code style test=develop
6 years ago
Qiao Longfei
76072261f8
fix compiler
...
test=develop
6 years ago
dzhwinter
0a63234c85
follow comments. test=develop
6 years ago
flame
d60751fb71
add python inference api ( #15248 )
...
add python inference api
6 years ago