dongdaxiang
b091139049
add nccl wrapper for python API
6 years ago
Yiqun Liu
112f16143b
Add an option to enable the cache of expected kernel in train phase. ( #16724 )
...
* Add an option to enable the cache of expected kernel in train phase.
test=develop
* Change the default value of cache_expected_kernel to true.
6 years ago
chengduo
55b15db5af
Add unit test for fuse all_reduce ops ( #16699 )
...
* test fuse all_reduce
6 years ago
Yiqun Liu
3fe8cb0dd7
Enable the runtime_context_cache pass in train phase ( #16640 )
...
* Try to enable the runtime_context_cache pass in train phase.
* Put the append of runtime_context_cache pass ahead of multi_dev passes.
test=develop
6 years ago
guru4elephant
7d653f0aed
Merge pull request #16652 from xjqbest/dataset_merge_develop
...
fix dataset bug
6 years ago
xjqbest
6a57e8075a
remove trainer_id in datafeed and dataset
...
test=develop
6 years ago
Yan Xu
b4c3a6aa0b
[Imperative] implement imperative NCCLParallelContext ( #16477 )
...
add NCCLParallelContext for parallel dygraph
6 years ago
xjqbest
271b7147cc
fix dataset bug
...
test=develop
6 years ago
chengduo
b75a69bad6
Add Stream for fetch op handle ( #16600 )
...
* expose fuse broadcast ops
6 years ago
乔龙飞 Qiao Longfei
21622ca30b
Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
...
Add async ssa graph executor communicator
6 years ago
sneaxiy
10249c0b78
Merge develop
...
test=develop
6 years ago
Qiao Longfei
adf272bcec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
xjqbest
9b84e8e66b
fix code style
...
test=develop
6 years ago
xjqbest
a99c8d0c29
fix client to client communication bug
...
test=develop
6 years ago
sneaxiy
33473890f3
Merge develop
...
test=develop
6 years ago
dongdaxiang
720647e17f
rebase current develop and fix conflict
...
test=develop
6 years ago
dongdaxiang
45eb6f0765
run pre-commit check files and fix code style problem
...
test=develop
6 years ago
xjqbest
e95cafd9a7
fix code style & add dataset testcase
...
test=develop
6 years ago
xjqbest
be74de2c61
fix code style & fix register bug & add release_memory
...
test=develop
6 years ago
xujiaqi01
a5b1a0e12b
support multi dataset && add init model && fix bug
6 years ago
dongdaxiang
b7a202aa38
add distributed optimizer factory
6 years ago
dongdaxiang
f612877797
add incubate for unified API
6 years ago
dongdaxiang
317eb0aad3
add incubate for unified API
6 years ago
xujiaqi01
ecfc7df913
add dataset factory && fix style
6 years ago
xujiaqi01
3cea00bd52
store memory data in Dataset && fix bug
6 years ago
dongdaxiang
cc4def6ba5
fix some conflict for compilation
6 years ago
heqiaozhi
9bca1926c1
refactor & fix bug
6 years ago
xjqbest
2e9a836c6f
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
e36bbcc871
fix some typo and CMakefile.txt
6 years ago
xjqbest
824b84d185
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
be757096da
add pybind for fleet
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
chengduo
1096746cbf
Fuse Adam And SGD ops ( #15933 )
...
* fuse optimizer
6 years ago
sneaxiy
2c836ff914
check default grad maker
...
test=develop
6 years ago
Zeng Jinle
69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
...
Revert "Fix allocator bug"
6 years ago
chengduo
ed61d67c73
Fix the interface of Pass::Apply ( #16484 )
...
* modify the interface of Pass::Allay
test=develop
* Polish code
test=develop
* Fix Travis CI
test=develop
* fix Pass::Apply interface
test=develop
* Fix Travis CI
test=develop
6 years ago
Zeng Jinle
174d0d0b90
Revert "Fix allocator bug"
...
add include headers to fix travis-ci
test=develop
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Zeng Jinle
644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
...
Fix allocator bug
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
wopeizl
c300b1ba69
Tensor index ( #16223 )
...
* extend the slice function for python
test=develop
6 years ago
Xin Pan
f8c279b11c
Merge pull request #16454 from panyx0718/imperative2
...
polish deepCF model to support real dataset
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo
4f2278f032
Add doc for CPUPlace CUDAPlace CUDAPinPlace ( #16442 )
...
test=develop
6 years ago
sneaxiy
78fb3a62e0
fix env variable settting bug
...
test=develop
6 years ago
sneaxiy
2d92b6be98
merge develop
...
test=develop
6 years ago
Xin Pan
fd24ab47ab
polish
...
test=develop
6 years ago
sneaxiy
a7d0ac50b8
Merge develop
6 years ago
sneaxiy
7000ec85d9
fix some op grad maker
...
fix ctest eager deletion disable bug
test=develop
6 years ago
sneaxiy
f8ed2c229e
try to fix ci error
...
test=develop
6 years ago
sneaxiy
c20db6357b
split PR
...
test=develop
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
sneaxiy
a93a9eef8f
add op registry type
...
refine gc code
test=develop
6 years ago
sneaxiy
953214ad97
add more unittest
...
modify allocator strategy
remove changes of legacy buddy_allocator
test=develop
6 years ago
chengduo
f26ba5bddd
Fuse AllReduce ( #15921 )
...
* fuse all_reduce
test=develop
* add fuse_parameter_groups_size
test=develop
* Polish code
test=develop
* Fix travis-ci
test=develop
* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
test=develop
* Add SetGroupAccordingToMemorySize
test=develop
* fix multi_devices_graph
test=develop
* reset params_grads
test=develop
* Polish code
test=develop
6 years ago
Tao Luo
7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy
fd23262e0c
merge develop, fix conflict
...
test=develop
6 years ago
Qiyang Min
c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
...
Implement imperative infer var type
6 years ago
Tao Luo
dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
...
cache runtime_context
6 years ago
sneaxiy
161b8ddcaa
Merge develop
6 years ago
minqiyang
b40e41fbd1
Polish code style
...
test=develop
6 years ago
Qiyang Min
8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
...
Improve imperative mode training speed
6 years ago
minqiyang
36dce65bb3
Take DataType and VarType apart
...
test=develop
6 years ago
minqiyang
438bca9c3d
Implement Runtime Var Type Inference
...
test=develop
6 years ago
luotao1
1b59bed989
Merge branch 'develop' into runtime_context
6 years ago
qingqing01
8ad672a287
Support sync batch norm. ( #16121 )
...
* Support Sync Batch Norm.
* Note, do not enable it in one device.
Usage:
build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
loss_name=loss_mean.name,
build_strategy=build_strategy)
6 years ago
minqiyang
7355d41834
1. Add imperative gperf profiler
...
2. Add binutils 2.27 in manylinux support
test=develop
6 years ago
luotao1
b2898c0f57
Merge branch 'develop' into runtime_context
...
test=develop
6 years ago
minqiyang
98dfb492bb
Release GIL lock
6 years ago
sneaxiy
ac0e0f5181
merge develop
...
test=develop
6 years ago
minqiyang
42e96a029f
Accelerate CPU part
6 years ago
sneaxiy
682f2dbf29
merge develop
...
test=develop
6 years ago
sneaxiy
2c4fcaa683
merge develop
6 years ago
luotao1
d94fd97230
add runtime_context_cache_pass
...
test=develop
6 years ago
Yan Xu
30568473ec
fix broadcast on mp mode ( #15951 )
...
* fix broadcast with mp mode
* polish code test=develop
* fix bcast strategy test=develop
* fic cpplint test=develop
* fix py3 failed test=develop
* fix comment test=develop
* update comment test=develop
6 years ago
baojun
e3c37bd564
remove const_cast and refactor ngraph engine code ( #15925 )
...
* remove concast_cast and refactor code test=develop
* reduce flag use test=develop
6 years ago
Zhen Wang
ac6ef06ffa
Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang
01eddf125c
Not add graph copy construction method. test=develop
6 years ago
Zhen Wang
1b9c8d5f06
add clone function for IrGraph. test=develop
6 years ago
Qiyang Min
1f4aa7a202
Imperative remove all descs ( #16045 )
...
* Remove Desc in Forward Pass
* Refactor VarBase
* Add dbg info
* Only check type in imperative mode
* Polish code and support optimizer
test=develop
* Fix stop gradient problem in PyLayer
test=develop
6 years ago
Zeng Jinle
472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
...
Enhance gc
6 years ago
wopeizl
a38db3cb99
Fixrecordio ( #16124 )
...
* fix recordio on win
test=develop
* test=develop
* test=develop
* fix code style
test=develop
* test=develop
6 years ago
sneaxiy
b80d76f784
merge develop
6 years ago
sneaxiy
732fa00eaf
disable gc in recurrent_op currently
...
test=develop
6 years ago
Tao Luo
6f2581e4c5
Merge pull request #16090 from lidanqing-intel/paddle-int32
...
Add PaddleDType INT32 support
6 years ago
Zhaolong Xing
3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
...
Four points for enhancing Paddle-TRT
6 years ago
nhzlx
a9ed427749
cant not pass ci
...
add if use static engine for trt
test=develop
6 years ago
lidanqing
4aeb261da9
Add INT32 support. INT32 in last switch case
...
test=develop
6 years ago
sneaxiy
2a639d5c2a
add allocator chain to fix bug
...
test=develop
6 years ago
Qiao Longfei
8744f9a083
fix parallel executor async mode
6 years ago
Qiao Longfei
e70b1727ef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei
847e4f4e85
pure async mode train
6 years ago
sneaxiy
3334c279d0
add sample_generator
...
test=develop
6 years ago
Qiyang Min
187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
...
Imperative backward hooks
6 years ago
minqiyang
ac88c62a5b
Reset output var's pre_op pointer when op was destructed
6 years ago
sneaxiy
69b1ebdfa5
merge develop
...
test=develop
6 years ago
mozga-intel
68a9ead17a
The flag of mkldnn is enabled iff it is necessary
...
test=develop
6 years ago
Zhen Wang
e00c7a2e26
Merge pull request #15830 from wzzju/add_ir_node_encapsulation
...
add IrNode&IrVarNode&IrOpNode. test=develop
6 years ago
Qiao Longfei
f768fbf715
support multi graph
...
test=develop
6 years ago
minqiyang
efb2f2baf8
Fix bugs
...
test=develop
6 years ago
Qiao Longfei
cf0511f21e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
Zhen Wang
548931456c
update some functions' names according to the suggestion. test=develop
6 years ago
sneaxiy
c545f1ed8f
unify API
...
test=develop
6 years ago
minqiyang
b420ec3a92
invoke backward_hooks after reduce op's depcounts map
...
test=develop
6 years ago
Qiyang Min
4bd28b304b
Merge pull request #15831 from velconia/imperative_engine
...
Imperative training network to the end
6 years ago
sneaxiy
b17541a9c1
fix hang bug
6 years ago
minqiyang
84bf4d7b06
Move ClearBlock into OpBase and VarBase's destructor
...
test=develop
6 years ago
minqiyang
2b3510bc50
Add imperative python tracer
6 years ago
minqiyang
a15a3fc314
Polish code
...
test=develop
6 years ago
sneaxiy
1e4c0a6f72
merge develop
6 years ago
minqiyang
9dc64edfd9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_engine
...
test=develop
6 years ago
Xin Pan
32d5a16036
resolve conflicts
...
test=develop
6 years ago
Xin Pan
26e32e095a
allow compiler to use graph
...
test=develop
6 years ago
minqiyang
8fe0c0c52c
implement backward refs
6 years ago
Qiao Longfei
cc71e89499
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
minqiyang
74551758cc
Polish code
...
test=develop
6 years ago
minqiyang
f53e1d5c4b
implement ClearBlock
6 years ago
sneaxiy
7160cb0f32
decoupled reader
...
test=develop
6 years ago
sneaxiy
d331e97af8
fix compiler place compare
...
test=develop
6 years ago
sneaxiy
e6ff549849
small fix doc
...
test=release/1.3
6 years ago
sneaxiy
796e221efc
fix api arg0
...
test=release/1.3
6 years ago
minqiyang
52e5ee60bd
Add debug info
6 years ago
Zhen Wang
bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
Gabor Buella
da9c94da33
Clang build fixes ( #15628 )
...
* Remove some superfluous std::move calls
The std:move triggered a build error (with -Werror):
```
[ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move]
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^~~~~~~~~~ ~
1 error generated.
```
See: https://reviews.llvm.org/D7633
* Remove a superfluous lambda capture from framework/operator.h
```
[ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture]
[this](Variable* var) { return var; });
^~~~
1 error generated.
```
Changing it to `return it->second;`, as is in the function below.
* Rethrow an exception (instead of copying it)
```
[ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move]
throw exception;
^~~~~~~~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying
throw exception;
^~~~~~~~~
std::move(exception)
```
See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message.
* Remove an unused variable
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field]
const Scope& scope_;
^
```
* struct ComputationOpHandle -> class ComputationOpHandle
```
[ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags]
class ComputationOpHandle;
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here
struct ComputationOpHandle : public OpHandleBase {
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here?
class ComputationOpHandle;
^~~~~
struct
1 error generated.
```
* Fix name() methods under fluid/operators
```
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15:
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19:
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]
virtual const char* name() const = 0;
^
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here
virtual const char* name() const = 0;
^
```
test=develop
6 years ago
dzhwinter
381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
...
refactor optimize pass.
6 years ago
xuezhong
eeaa2066e5
add device info to tensor
...
test=develop
6 years ago
dzhwinter
04e9776aef
add details. test=develop
6 years ago
Qiao Longfei
16af1dbc7b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
peizhilin
3a4110f960
fix ci broken randomly and disable some warnings
...
test=develop
6 years ago
dzhwinter
4f01de6378
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031
6e84eb131f
expose peak gpu memory API to python test=develop ( #15529 )
...
* expose peak gpu memory API to python test=develop
* add unittest for peak gpu memory monitoring test=develop
* add pybind change test=develop
* add mutex to gpu mem usage monitor test=develop
* update benchmark flag definition file test=develop
* tweak unittest for memory monitoring test=develop
6 years ago
WangZhen
2175292634
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
...
test=develop
6 years ago
dzhwinter
06f2448848
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
Yan Chunwei
655179089f
AnalysisConfig remove contrib namespace ( #15540 )
6 years ago
Qiao Longfei
d6c0dcaa16
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
WangZhen
c67b29c178
fix some bugs of graph.to_program and get_pass.
6 years ago
dzhwinter
ee3aae56cd
merge develop branch. test=develop
6 years ago
Zhaolong Xing
97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
...
add trt int8 support
6 years ago
WangZhen
c8095eeb82
add freeze pass, and UT is passed.
6 years ago
Qiao Longfei
ada43e89c3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
...
Optimize cpp reader
6 years ago
nhzlx
36abc964df
fix pybind problem: add an enum to AnalysisConfig
...
test=develop
6 years ago
Zeng Jinle
2480a3df7d
Merge pull request #15496 from sneaxiy/lazy_allocator2
...
Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models
6 years ago
WangZhen
dde19a0ff8
add quantization freeze pass.
6 years ago
Zeng Jinle
dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
...
Turn on remove_unnecessary_lock by default
6 years ago
Xin Pan
58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
...
Imperative Resnet
6 years ago
sneaxiy
51227bd447
lazy_allocator
...
test=develop
6 years ago
minqiyang
c8965dc1ab
Polish code
...
test=develop
6 years ago
sneaxiy
ef788603d4
merge develop
...
test=develop
6 years ago
Zhen Wang
58727e8e6d
Merge pull request #15455 from wzzju/graph_quantization
...
Graph quantization pass. TODO(Add public API comments.)
6 years ago
Tao Luo
fef3fd6d62
Merge pull request #15452 from luotao1/legacy_option
...
remove legacy compiler option
6 years ago
Paddle CI
289aba750a
Polish code
...
test=develop
6 years ago
WangZhen
b913463e83
Update according to the reviewers' suggestion. test=develop
6 years ago
sneaxiy
d8568acd19
turn on remove_unnecessary_lock
...
test=develop
6 years ago
WangZhen
3ce6172052
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
WangZhen
59e5cc51d6
Add quantization transform pass and UT.
6 years ago
flame
d60751fb71
add python inference api ( #15248 )
...
add python inference api
6 years ago
dzhwinter
8f3b252392
squash commits. test=develop
6 years ago
Tao Luo
cf29ea1592
remove legacy ANDROID option
6 years ago
Qiao Longfei
45578c1b48
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
minqiyang
8ce198b2e1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
...
test=develop
6 years ago
minqiyang
31a1cd8ce5
Align the first batch of gpu resnet
6 years ago
Dun
9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op ( #15313 )
...
* mem opt
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine with cub test=develop
* fix mkldnn test && remove comments && test=develop
* polish code && test=develop
* add only_forward test && test=develop
6 years ago
Xin Pan
9a9c690e71
Merge pull request #15343 from panyx0718/imperative3
...
add a GAN model in imperative mode
6 years ago
WangZhen
e2ff300b02
add UT for quantization.
6 years ago
WangZhen
451896fce4
init quantization.
6 years ago
Qiao Longfei
88d71fa2f9
support num_iteration_per_run
6 years ago
Qiao Longfei
ea66979684
can run
6 years ago
minqiyang
dbd4d058af
Add static implementation and fix fc layer
6 years ago
Xin Pan
e395f2c6a3
polish codes
...
test=develop
6 years ago
minqiyang
315b133e67
Add single GPU support to imperative
6 years ago
Xin Pan
9a4314f025
imperative gan
...
test=develop
6 years ago
Qiao Longfei
cd31b90a46
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
...
test=develop
6 years ago
peizhilin
9adb158e5b
Merge remote-tracking branch 'upstream/develop' into debug/support
6 years ago
minqiyang
ddfb9f1123
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
Xin Pan
d1220f23bb
Merge pull request #15229 from panyx0718/imperative
...
support python codes in the imperative model
6 years ago
minqiyang
d0b640dca1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
minqiyang
687171d22b
Move from shared_ptr to raw pointer
...
test=develop
6 years ago
peizhilin
439691f5bd
adjust the shlwapi on windows
...
test=develop
6 years ago
flame
fb63cd89d4
Add python ir graph API ( #14917 )
6 years ago
minqiyang
08e2a5d611
Polish tracer code
...
test=develop
6 years ago
minqiyang
cded24768c
Remove shared_ptr holder for VarBase
...
test=develop
6 years ago
minqiyang
c8d1a8e909
Change var_ and grad_ to shared_ptr
6 years ago
minqiyang
7aab39af15
Change grads to VarBase
6 years ago
Qiao Longfei
edad60e612
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Xin Pan
7b73fc9e1a
Merge pull request #15089 from panyx0718/api
...
try unify Executor and ParallelExecutor
6 years ago
Xin Pan
9597fd05e9
polish
...
test=develop
6 years ago
Xin Pan
7aad6afd49
forward and backward
...
test=develop
6 years ago
Xin Pan
2349acea48
checkpoint
...
test=develop
6 years ago
Xin Pan
11d4d39cd7
forward working
...
test=develop
6 years ago
Xin Pan
b629133375
checkpoint runnable PyLayer
...
test=develop
6 years ago
peizhilin
a6f5ceee74
add the python callstack for debug support test=develop
6 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
6 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=developt
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove uncessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
6 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
6 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
6 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
6 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
6 years ago
Xin Pan
61491ce250
clean
...
test=develop
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb
.
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
minqiyang
3de0f612e8
Polish code
...
test=develop
7 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
7 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
7 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
7 years ago
chengduo
550e7e410b
Code Clean parallel_executor.py ( #14849 )
...
* refine parallel_executor
* remove uncessary code
test=develop
7 years ago
minqiyang
3cd10a7c4f
Add Conv2D forward
...
test=develop
7 years ago
minqiyang
8d88c5a87d
Shameless copy
7 years ago
Xin Pan
6324032602
MLP forward backward
...
test=develop
7 years ago
colourful-tree
44ad2f4479
Merge pull request #14873 from colourful-tree/develop
...
add pslib(pserver) to paddle, an industrial scale high performance parameter server library
7 years ago
dzhwinter
7cd24b1318
add ir memory optimize. ( #14530 )
...
* follow comments. test=develop
* Fix typo
* fix compile error. test=develop
* merge develop branch. test=develop
* Remove set_equal
* Polish code
* Delete unused functions
test=develop
* polish code. test=develop
* follow comment
* polish code.
* fix windows compile error. test=develop
* fix op handle.
* rerun ci. test=develop
* rerun ci. test=develop
* rerun macci. test=develop
* polish code. test=develop
* rewrite sort code. test=develop
* remove unused code. test=develop
* fix tests. test=develop
* fix conflict. test=develop
* follow comment. test=develop
* merge develop branch. test=develop
* fix tests. test=develop
* remove ToTypeIndex. test=develop
* rerun ci. test=develop
7 years ago
guru4elephant
a79a3ea2f0
Merge branch 'develop' into develop
7 years ago
wopeizl
0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
...
fix script issue
7 years ago
Yu Yang
bacf1d2399
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
gongweibao
0b1c7d838c
Add brpc serialization support. ( #11430 )
7 years ago
peizhilin
23dec78772
fix script issue
...
test=develop
7 years ago
dongdaxiang
e52bb816e3
add copyright to __init__.py in distributed folder
7 years ago
sneaxiy
f0df62f136
add more unittest case
...
test=develop
7 years ago
Yu Yang
4ecdb6f486
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
...
test=develop
7 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
7 years ago
heqiaozhi
f81957a753
refine cmake for pslib & pre_define
7 years ago
sneaxiy
deb0d41cea
fix cmake
...
fix cmake again
test=develop
7 years ago
sneaxiy
e7c5c9d2de
remove unnecesary code
...
test=develop
7 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
7 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
7 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
7 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
7 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
7 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
7 years ago
sneaxiy
7923042365
merge develop
...
test=develop
7 years ago
Qiao Longfei
0a7c7e97af
test zero output of split_selected_rows_op
...
test=develop
7 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
7 years ago
heqiaozhi
016a068775
stop server
7 years ago
sneaxiy
8760d23c7d
featue/py_func
7 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
7 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
7 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
7 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
7 years ago
heqiaozhi
575ae7c6c3
refine pslib inferface & fix some bugs
7 years ago