guru4elephant
326bf8291a
add Run Prepared Ctx ( #17616 )
...
add Run Prepared Ctx, fix pybind problem
6 years ago
flame
2280f185d7
BuildStrategy api comment ( #17348 )
...
Python examples of fluid.layers.io.double_buffer and some BuildStrategy's methods.
6 years ago
guru4elephant
7f8bc49d00
polish_executor_and_add_ctx_cache ( #17536 )
...
* polish_executor_and_add_ctx_cache
6 years ago
Zeng Jinle
c6189637cd
Fix allocator bug ( #16712 )
...
* Revert "Revert "Fix allocator bug""
This reverts commit 174d0d0b90.
* Revert "fix travis ci"
This reverts commit 5656fa9f7c.
test=develop
* add inlined_vector.h, test=develop
* add inlined_vector_test,test=develop
6 years ago
Qiao Longfei
92e7d5d7cc
fix distribute doc test=develop ( #17318 )
...
* fix distribute doc
6 years ago
Qiao Longfei
58f7695ab2
Async exe support communicator ( #17386 )
...
Async exe support communicator
6 years ago
Tao Luo
32da5e9c3d
remove unused expected_kernel_cache_pass ( #17486 )
...
test=develop
6 years ago
Yan Xu
0217555530
polish parallel dygraph code ( #17164 )
...
* add var grad hook test=develop
6 years ago
Jiabin Yang
d7df4e5e5b
Fix/Fix memory leak in dygraph ( #17394 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by add FLAGS_cudnn_deterministic on new tests
* test=develop, fix memory leak in dygraph mode
* test=develop, fix memory leak in dygraph mode
* test=develop, polish code
* test=develop, polish code
* test=develop, polish code
6 years ago
Zhen Wang
4a1b7fec96
Add setting Scope function for the graph class ( #17417 )
...
* add set_not_owned function for graph
* add scope set. test=develop
* add scope_ptr enforce not null before setting.test=develop
6 years ago
jiaqi
66d51206b1
add save/load model, shrink table, cvm, config file & fix pull dense bug ( #17118 )
...
* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop
* fix global shuffle bug, fix pull dense bug, fix release memory bug, fix shrink error
add client flush, add get data size
test=develop
* fix global shuffle bug
test=develop
* fix global shuffle bug
test=develop
* fix code style
test=develop
* fix code style & modify pslib cmake
test=develop
* fix error of _role_maker
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix windows compile error of fleet
test=develop
* fix global shuffle bug
* add comment
test=develop
* update pslib.cmake
test=develop
* fix fill sparse bug
test=develop
* fix push sparse bug
test=develop
6 years ago
Tao Luo
68ec0a6f74
make parallel_executor support FLAGS_use_mkldnn ( #17341 )
...
* make parallel_executor support FLAGS_use_mkldnn
test=develop
* add warning when set mkldnn_enabled_op_types_ in non-mkldnn env
test=develop
6 years ago
Jiabin Yang
4624d7c642
test=develop, add gradient sort backward strategy ( #17125 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by add FLAGS_cudnn_deterministic on new tests
6 years ago
chengduo
bc833945a4
Add DropLocalExeScopes in ParallelExecutor ( #17297 )
...
* reset drop local scope counter
test=develop
6 years ago
qingqing01
e32c9888f5
Double backward of conv2d. ( #17211 )
...
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algo searching code into conv_cudnn_helper.h.
- Now use it in conv2d_grad_grad.
- Will simplify the searching code in conv2d and conv2d_grad in the next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support fetching empty variables; return None in Python.
6 years ago
lujun
e388a1fb66
Repair api example ( #17221 )
...
Fix the following API examples:
paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace
6 years ago
chengduo
04bd413acb
Code Clean: Move all pass to paddle::framework::ir ( #17228 )
...
* move pass to ir
* polish code
test=develop
* fix dependency
test=develop
6 years ago
Zeng Jinle
f2fa3f7300
fix api doc,test=develop ( #17241 )
6 years ago
石晓伟
a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 ( #17156 )
...
* cherry-pick commit from 8877054
* cherry-pick commit from 3f0b97d
* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
(cherry picked from commit 8643dbc233)
* Cherry-Pick from 16662 : Anakin subgraph cpu support
(cherry picked from commit 7ad182e16c)
* Cherry-pick from 1662, 16797.. : add anakin int8 support
(cherry picked from commit e14ab180fe)
* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4
(cherry picked from commit 4b9fa42307)
* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
Support ShuffleNet and MobileNet-v2, test=release/1.4
(cherry picked from commit a6fb066f90)
* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4
(cherry picked from commit 8121b3eccb)
* 1. add shuffle_channel_detect
(cherry picked from commit 6efdea8997)
* update shuffle_channel op convert, test=release/1.4
(cherry picked from commit e4726a066f)
* Modify symbol export rules
test=develop
6 years ago
Zeng Jinle
c5eeecca7c
Fix tensor_py.h ( #17195 )
...
* fix tensor_py,test=develop
* change class name,test=develop
6 years ago
Zeng Jinle
5dfe2ab9e8
Fix mem leak when converting Tensor to numpy array ( #17182 )
...
* fix mem leak when converting Tensor to numpy array
test=develop
* remove unused unittest,test=develop
* follow comments, test=develop
* fix dygraph bug,test=develop
6 years ago
Yan Xu
0b07eef118
ParallelDyGraph with GPU collective mode ( #16827 )
...
implement dygraph.parallel.DataParallel to hook reduce op.
6 years ago
guru4elephant
03d469ad98
Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1
...
fix nccl wrapper on windows
6 years ago
liuwei1031
a770ce0615
add doc for memory_optimize, test=develop ( #17010 )
...
* add doc for memory_optimize, test=develop
* update doc, test=develop
* doc update, test=develop
6 years ago
qingqing01
ea42e431f8
Speed unit testing. ( #16978 )
...
* Speed affine_channel_op unit testing
* Add check in tensor_py
* Fix ONLY_CPU Compiling
6 years ago
wopeizl
51a0243a56
fix nccl wrapper on windows
...
test=develop
6 years ago
Zeng Jinle
1202d3fc74
Refine model gpu memory ( #16993 )
...
* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop
* refine models gpu mem
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop
* follow comments
test=develop
6 years ago
guru4elephant
bbc6c5714f
Merge pull request #16887 from guru4elephant/add_nccl_context_pybind
...
Add nccl context pybind
6 years ago
gongweibao
cbdb8a17b1
Polish DGC code ( #16818 )
6 years ago
dongdaxiang
466d177d09
add pybind dependency
...
test=develop
6 years ago
dongdaxiang
4aa6f679b5
add pybind dependency
...
test=develop
6 years ago
dongdaxiang
b091139049
add nccl wrapper for python API
6 years ago
Yiqun Liu
112f16143b
Add an option to enable the cache of expected kernel in train phase. ( #16724 )
...
* Add an option to enable the cache of expected kernel in train phase.
test=develop
* Change the default value of cache_expected_kernel to true.
6 years ago
chengduo
55b15db5af
Add unit test for fuse all_reduce ops ( #16699 )
...
* test fuse all_reduce
6 years ago
Yiqun Liu
3fe8cb0dd7
Enable the runtime_context_cache pass in train phase ( #16640 )
...
* Try to enable the runtime_context_cache pass in train phase.
* Put the append of runtime_context_cache pass ahead of multi_dev passes.
test=develop
6 years ago
guru4elephant
7d653f0aed
Merge pull request #16652 from xjqbest/dataset_merge_develop
...
fix dataset bug
6 years ago
xjqbest
6a57e8075a
remove trainer_id in datafeed and dataset
...
test=develop
6 years ago
Yan Xu
b4c3a6aa0b
[Imperative] implement imperative NCCLParallelContext ( #16477 )
...
add NCCLParallelContext for parallel dygraph
6 years ago
xjqbest
271b7147cc
fix dataset bug
...
test=develop
6 years ago
chengduo
b75a69bad6
Add Stream for fetch op handle ( #16600 )
...
* expose fuse broadcast ops
6 years ago
乔龙飞 Qiao Longfei
21622ca30b
Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
...
Add async ssa graph executor communicator
6 years ago
sneaxiy
10249c0b78
Merge develop
...
test=develop
6 years ago
Qiao Longfei
adf272bcec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
xjqbest
9b84e8e66b
fix code style
...
test=develop
6 years ago
xjqbest
a99c8d0c29
fix client to client communication bug
...
test=develop
6 years ago
sneaxiy
33473890f3
Merge develop
...
test=develop
6 years ago
dongdaxiang
720647e17f
rebase current develop and fix conflict
...
test=develop
6 years ago
dongdaxiang
45eb6f0765
run pre-commit check files and fix code style problem
...
test=develop
6 years ago
xjqbest
e95cafd9a7
fix code style & add dataset testcase
...
test=develop
6 years ago
xjqbest
be74de2c61
fix code style & fix register bug & add release_memory
...
test=develop
6 years ago
xujiaqi01
a5b1a0e12b
support multi dataset && add init model && fix bug
6 years ago
dongdaxiang
b7a202aa38
add distributed optimizer factory
6 years ago
dongdaxiang
f612877797
add incubate for unified API
6 years ago
dongdaxiang
317eb0aad3
add incubate for unified API
6 years ago
xujiaqi01
ecfc7df913
add dataset factory && fix style
6 years ago
xujiaqi01
3cea00bd52
store memory data in Dataset && fix bug
6 years ago
dongdaxiang
cc4def6ba5
fix some conflict for compilation
6 years ago
heqiaozhi
9bca1926c1
refactor & fix bug
6 years ago
xjqbest
2e9a836c6f
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
e36bbcc871
fix some typos and CMakeLists.txt
6 years ago
xjqbest
824b84d185
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
be757096da
add pybind for fleet
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
chengduo
1096746cbf
Fuse Adam And SGD ops ( #15933 )
...
* fuse optimizer
6 years ago
sneaxiy
2c836ff914
check default grad maker
...
test=develop
6 years ago
Zeng Jinle
69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
...
Revert "Fix allocator bug"
6 years ago
chengduo
ed61d67c73
Fix the interface of Pass::Apply ( #16484 )
...
* modify the interface of Pass::Apply
test=develop
* Polish code
test=develop
* Fix Travis CI
test=develop
* fix Pass::Apply interface
test=develop
* Fix Travis CI
test=develop
6 years ago
Zeng Jinle
174d0d0b90
Revert "Fix allocator bug"
...
add include headers to fix travis-ci
test=develop
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Zeng Jinle
644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
...
Fix allocator bug
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
wopeizl
c300b1ba69
Tensor index ( #16223 )
...
* extend the slice function for python
test=develop
6 years ago
Xin Pan
f8c279b11c
Merge pull request #16454 from panyx0718/imperative2
...
polish deepCF model to support real dataset
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
chengduo
4f2278f032
Add doc for CPUPlace CUDAPlace CUDAPinPlace ( #16442 )
...
test=develop
6 years ago
sneaxiy
78fb3a62e0
fix env variable setting bug
...
test=develop
6 years ago
sneaxiy
2d92b6be98
merge develop
...
test=develop
6 years ago
Xin Pan
fd24ab47ab
polish
...
test=develop
6 years ago
sneaxiy
a7d0ac50b8
Merge develop
6 years ago
sneaxiy
7000ec85d9
fix some op grad maker
...
fix ctest eager deletion disable bug
test=develop
6 years ago
sneaxiy
f8ed2c229e
try to fix ci error
...
test=develop
6 years ago
sneaxiy
c20db6357b
split PR
...
test=develop
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
sneaxiy
a93a9eef8f
add op registry type
...
refine gc code
test=develop
6 years ago
sneaxiy
953214ad97
add more unittest
...
modify allocator strategy
remove changes of legacy buddy_allocator
test=develop
6 years ago
chengduo
f26ba5bddd
Fuse AllReduce ( #15921 )
...
* fuse all_reduce
test=develop
* add fuse_parameter_groups_size
test=develop
* Polish code
test=develop
* Fix travis-ci
test=develop
* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
test=develop
* Add SetGroupAccordingToMemorySize
test=develop
* fix multi_devices_graph
test=develop
* reset params_grads
test=develop
* Polish code
test=develop
6 years ago
Tao Luo
7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy
fd23262e0c
merge develop, fix conflict
...
test=develop
6 years ago
Qiyang Min
c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
...
Implement imperative infer var type
6 years ago
Tao Luo
dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
...
cache runtime_context
6 years ago
sneaxiy
161b8ddcaa
Merge develop
6 years ago
minqiyang
b40e41fbd1
Polish code style
...
test=develop
6 years ago
Qiyang Min
8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
...
Improve imperative mode training speed
6 years ago
minqiyang
36dce65bb3
Take DataType and VarType apart
...
test=develop
6 years ago
minqiyang
438bca9c3d
Implement Runtime Var Type Inference
...
test=develop
6 years ago
luotao1
1b59bed989
Merge branch 'develop' into runtime_context
6 years ago
qingqing01
8ad672a287
Support sync batch norm. ( #16121 )
...
* Support Sync Batch Norm.
* Note, do not enable it in one device.
Usage:
    build_strategy = fluid.BuildStrategy()
    build_strategy.sync_batch_norm = True
    binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)
6 years ago
minqiyang
7355d41834
1. Add imperative gperf profiler
...
2. Add binutils 2.27 in manylinux support
test=develop
6 years ago
luotao1
b2898c0f57
Merge branch 'develop' into runtime_context
...
test=develop
6 years ago
minqiyang
98dfb492bb
Release GIL lock
6 years ago
sneaxiy
ac0e0f5181
merge develop
...
test=develop
6 years ago
minqiyang
42e96a029f
Accelerate CPU part
6 years ago
sneaxiy
682f2dbf29
merge develop
...
test=develop
6 years ago
sneaxiy
2c4fcaa683
merge develop
6 years ago
luotao1
d94fd97230
add runtime_context_cache_pass
...
test=develop
6 years ago
Yan Xu
30568473ec
fix broadcast on mp mode ( #15951 )
...
* fix broadcast with mp mode
* polish code test=develop
* fix bcast strategy test=develop
* fix cpplint test=develop
* fix py3 failed test=develop
* fix comment test=develop
* update comment test=develop
6 years ago
baojun
e3c37bd564
remove const_cast and refactor ngraph engine code ( #15925 )
...
* remove const_cast and refactor code test=develop
* reduce flag use test=develop
6 years ago
Zhen Wang
ac6ef06ffa
Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang
01eddf125c
Not add graph copy construction method. test=develop
6 years ago
Zhen Wang
1b9c8d5f06
add clone function for IrGraph. test=develop
6 years ago
Qiyang Min
1f4aa7a202
Imperative remove all descs ( #16045 )
...
* Remove Desc in Forward Pass
* Refactor VarBase
* Add dbg info
* Only check type in imperative mode
* Polish code and support optimizer
test=develop
* Fix stop gradient problem in PyLayer
test=develop
6 years ago
Zeng Jinle
472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
...
Enhance gc
6 years ago
wopeizl
a38db3cb99
Fixrecordio ( #16124 )
...
* fix recordio on win
test=develop
* test=develop
* test=develop
* fix code style
test=develop
* test=develop
6 years ago
sneaxiy
b80d76f784
merge develop
6 years ago
sneaxiy
732fa00eaf
disable gc in recurrent_op currently
...
test=develop
6 years ago
Tao Luo
6f2581e4c5
Merge pull request #16090 from lidanqing-intel/paddle-int32
...
Add PaddleDType INT32 support
6 years ago
Zhaolong Xing
3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
...
Four points for enhancing Paddle-TRT
6 years ago
nhzlx
a9ed427749
cannot pass ci
...
add if use static engine for trt
test=develop
6 years ago
lidanqing
4aeb261da9
Add INT32 support. INT32 in last switch case
...
test=develop
6 years ago
sneaxiy
2a639d5c2a
add allocator chain to fix bug
...
test=develop
6 years ago
Qiao Longfei
8744f9a083
fix parallel executor async mode
6 years ago
Qiao Longfei
e70b1727ef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei
847e4f4e85
pure async mode train
6 years ago
sneaxiy
3334c279d0
add sample_generator
...
test=develop
6 years ago
Qiyang Min
187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
...
Imperative backward hooks
6 years ago
minqiyang
ac88c62a5b
Reset output var's pre_op pointer when op was destructed
6 years ago
sneaxiy
69b1ebdfa5
merge develop
...
test=develop
6 years ago
mozga-intel
68a9ead17a
The flag of mkldnn is enabled iff it is necessary
...
test=develop
6 years ago
Zhen Wang
e00c7a2e26
Merge pull request #15830 from wzzju/add_ir_node_encapsulation
...
add IrNode&IrVarNode&IrOpNode. test=develop
6 years ago
Qiao Longfei
f768fbf715
support multi graph
...
test=develop
6 years ago
minqiyang
efb2f2baf8
Fix bugs
...
test=develop
6 years ago
Qiao Longfei
cf0511f21e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
Zhen Wang
548931456c
update some functions' names according to the suggestion. test=develop
6 years ago
sneaxiy
c545f1ed8f
unify API
...
test=develop
6 years ago
minqiyang
b420ec3a92
invoke backward_hooks after reduce op's depcounts map
...
test=develop
6 years ago
Qiyang Min
4bd28b304b
Merge pull request #15831 from velconia/imperative_engine
...
Imperative training network to the end
6 years ago
sneaxiy
b17541a9c1
fix hang bug
6 years ago
minqiyang
84bf4d7b06
Move ClearBlock into OpBase and VarBase's destructor
...
test=develop
6 years ago
minqiyang
2b3510bc50
Add imperative python tracer
6 years ago
minqiyang
a15a3fc314
Polish code
...
test=develop
6 years ago
sneaxiy
1e4c0a6f72
merge develop
6 years ago
minqiyang
9dc64edfd9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_engine
...
test=develop
6 years ago
Xin Pan
32d5a16036
resolve conflicts
...
test=develop
6 years ago
Xin Pan
26e32e095a
allow compiler to use graph
...
test=develop
6 years ago
minqiyang
8fe0c0c52c
implement backward refs
6 years ago
Qiao Longfei
cc71e89499
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
minqiyang
74551758cc
Polish code
...
test=develop
6 years ago
minqiyang
f53e1d5c4b
implement ClearBlock
6 years ago
sneaxiy
7160cb0f32
decoupled reader
...
test=develop
6 years ago
sneaxiy
d331e97af8
fix compiler place compare
...
test=develop
6 years ago
sneaxiy
e6ff549849
small fix doc
...
test=release/1.3
6 years ago
sneaxiy
796e221efc
fix api arg0
...
test=release/1.3
6 years ago
minqiyang
52e5ee60bd
Add debug info
6 years ago
Zhen Wang
bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
Gabor Buella
da9c94da33
Clang build fixes ( #15628 )
...
* Remove some superfluous std::move calls
The std::move triggered a build error (with -Werror):
```
[ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move]
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^~~~~~~~~~ ~
1 error generated.
```
See: https://reviews.llvm.org/D7633
* Remove a superfluous lambda capture from framework/operator.h
```
[ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture]
[this](Variable* var) { return var; });
^~~~
1 error generated.
```
Changing it to `return it->second;`, as is in the function below.
* Rethrow an exception (instead of copying it)
```
[ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move]
throw exception;
^~~~~~~~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying
throw exception;
^~~~~~~~~
std::move(exception)
```
See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message.
* Remove an unused variable
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field]
const Scope& scope_;
^
```
* struct ComputationOpHandle -> class ComputationOpHandle
```
[ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags]
class ComputationOpHandle;
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here
struct ComputationOpHandle : public OpHandleBase {
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here?
class ComputationOpHandle;
^~~~~
struct
1 error generated.
```
* Fix name() methods under fluid/operators
```
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15:
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19:
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]
virtual const char* name() const = 0;
^
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here
virtual const char* name() const = 0;
^
```
test=develop
6 years ago
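The first fix in the commit above (dropping `std::move` around a returned temporary) can be illustrated with a small sketch. This is not code from the Paddle repository: `Tracked`, `Make`, `WithMove`, `WithoutMove`, `CountMoves`, and `CheckCounts` are hypothetical names introduced only to show why Clang's `-Wpessimizing-move` flags the pattern. The counts asserted in `CheckCounts` assume the compiler elides copies as GCC and Clang do by default (and as C++17 guarantees for these cases).

```cpp
#include <cassert>
#include <utility>

// Hypothetical type that counts how often it is move-constructed.
struct Tracked {
  static int moves;
  Tracked() = default;
  Tracked(const Tracked&) = default;
  Tracked(Tracked&&) noexcept { ++moves; }
};
int Tracked::moves = 0;

// Returns a temporary, analogous to CreateAllocatorWithChunk() in the log.
Tracked Make() { return Tracked(); }

// Pattern flagged by -Wpessimizing-move: std::move turns the temporary
// into an xvalue, so the return value must be move-constructed.
Tracked WithMove() { return std::move(Make()); }

// Fixed pattern: returning the temporary directly allows copy elision.
Tracked WithoutMove() { return Make(); }

// Resets the counter, calls fn once, and reports the moves it caused.
int CountMoves(Tracked (*fn)()) {
  Tracked::moves = 0;
  Tracked t = fn();
  (void)t;
  return Tracked::moves;
}

// The std::move version costs one move; the plain version costs none.
void CheckCounts() {
  assert(CountMoves(WithMove) == 1);
  assert(CountMoves(WithoutMove) == 0);
}
```

Calling `CheckCounts()` from a test or `main` exercises both variants; the extra move is harmless for a trivial type like this one but can matter for objects that are expensive to move.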
dzhwinter
381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
...
refactor optimize pass.
6 years ago
xuezhong
eeaa2066e5
add device info to tensor
...
test=develop
6 years ago
dzhwinter
04e9776aef
add details. test=develop
6 years ago
Qiao Longfei
16af1dbc7b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
peizhilin
3a4110f960
fix ci broken randomly and disable some warnings
...
test=develop
6 years ago
dzhwinter
4f01de6378
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031
6e84eb131f
expose peak gpu memory API to python test=develop ( #15529 )
...
* expose peak gpu memory API to python test=develop
* add unittest for peak gpu memory monitoring test=develop
* add pybind change test=develop
* add mutex to gpu mem usage monitor test=develop
* update benchmark flag definition file test=develop
* tweak unittest for memory monitoring test=develop
6 years ago
WangZhen
2175292634
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
...
test=develop
6 years ago
dzhwinter
06f2448848
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
Yan Chunwei
655179089f
AnalysisConfig remove contrib namespace ( #15540 )
6 years ago
Qiao Longfei
d6c0dcaa16
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
WangZhen
c67b29c178
fix some bugs of graph.to_program and get_pass.
6 years ago
dzhwinter
ee3aae56cd
merge develop branch. test=develop
6 years ago
Zhaolong Xing
97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
...
add trt int8 support
6 years ago
WangZhen
c8095eeb82
add freeze pass, and UT is passed.
6 years ago
Qiao Longfei
ada43e89c3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
...
Optimize cpp reader
6 years ago
nhzlx
36abc964df
fix pybind problem: add an enum to AnalysisConfig
...
test=develop
6 years ago
Zeng Jinle
2480a3df7d
Merge pull request #15496 from sneaxiy/lazy_allocator2
...
Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models
6 years ago
WangZhen
dde19a0ff8
add quantization freeze pass.
6 years ago
Zeng Jinle
dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
...
Turn on remove_unnecessary_lock by default
6 years ago
Xin Pan
58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
...
Imperative Resnet
6 years ago
sneaxiy
51227bd447
lazy_allocator
...
test=develop
6 years ago
minqiyang
c8965dc1ab
Polish code
...
test=develop
6 years ago
sneaxiy
ef788603d4
merge develop
...
test=develop
6 years ago
Zhen Wang
58727e8e6d
Merge pull request #15455 from wzzju/graph_quantization
...
Graph quantization pass. TODO(Add public API comments.)
6 years ago
Tao Luo
fef3fd6d62
Merge pull request #15452 from luotao1/legacy_option
...
remove legacy compiler option
6 years ago
Paddle CI
289aba750a
Polish code
...
test=develop
6 years ago
WangZhen
b913463e83
Update according to the reviewers' suggestion. test=develop
6 years ago
sneaxiy
d8568acd19
turn on remove_unnecessary_lock
...
test=develop
6 years ago
WangZhen
3ce6172052
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
WangZhen
59e5cc51d6
Add quantization transform pass and UT.
6 years ago
flame
d60751fb71
add python inference api ( #15248 )
...
add python inference api
6 years ago
dzhwinter
8f3b252392
squash commits. test=develop
6 years ago
Tao Luo
cf29ea1592
remove legacy ANDROID option
6 years ago
Qiao Longfei
45578c1b48
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
minqiyang
8ce198b2e1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
...
test=develop
6 years ago
minqiyang
31a1cd8ce5
Align the first batch of gpu resnet
6 years ago
Dun
9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op ( #15313 )
...
* mem opt
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine with cub test=develop
* fix mkldnn test && remove comments && test=develop
* polish code && test=develop
* add only_forward test && test=develop
6 years ago
Xin Pan
9a9c690e71
Merge pull request #15343 from panyx0718/imperative3
...
add a GAN model in imperative mode
6 years ago
WangZhen
e2ff300b02
add UT for quantization.
6 years ago
WangZhen
451896fce4
init quantization.
6 years ago
Qiao Longfei
88d71fa2f9
support num_iteration_per_run
6 years ago
Qiao Longfei
ea66979684
can run
6 years ago
minqiyang
dbd4d058af
Add static implementation and fix fc layer
6 years ago
Xin Pan
e395f2c6a3
polish codes
...
test=develop
6 years ago
minqiyang
315b133e67
Add single GPU support to imperative
6 years ago
Xin Pan
9a4314f025
imperative gan
...
test=develop
6 years ago
Qiao Longfei
cd31b90a46
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
...
test=develop
6 years ago
peizhilin
9adb158e5b
Merge remote-tracking branch 'upstream/develop' into debug/support
6 years ago
minqiyang
ddfb9f1123
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
Xin Pan
d1220f23bb
Merge pull request #15229 from panyx0718/imperative
...
support python codes in the imperative model
6 years ago
minqiyang
d0b640dca1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
6 years ago
minqiyang
687171d22b
Move from shared_ptr to raw pointer
...
test=develop
6 years ago
peizhilin
439691f5bd
adjust the shlwapi on windows
...
test=develop
6 years ago
flame
fb63cd89d4
Add python ir graph API ( #14917 )
6 years ago
minqiyang
08e2a5d611
Polish tracer code
...
test=develop
6 years ago
minqiyang
cded24768c
Remove shared_ptr holder for VarBase
...
test=develop
6 years ago
minqiyang
c8d1a8e909
Change var_ and grad_ to shared_ptr
6 years ago
minqiyang
7aab39af15
Change grads to VarBase
6 years ago
Qiao Longfei
edad60e612
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Xin Pan
7b73fc9e1a
Merge pull request #15089 from panyx0718/api
...
try unify Executor and ParallelExecutor
6 years ago
Xin Pan
9597fd05e9
polish
...
test=develop
6 years ago
Xin Pan
7aad6afd49
forward and backward
...
test=develop
6 years ago
Xin Pan
2349acea48
checkpoint
...
test=develop
6 years ago
Xin Pan
11d4d39cd7
forward working
...
test=develop
6 years ago
Xin Pan
b629133375
checkpoint runnable PyLayer
...
test=develop
6 years ago
peizhilin
a6f5ceee74
add the python callstack for debug support test=develop
6 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
6 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=develop
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove unnecessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
6 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
6 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
6 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
6 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
6 years ago
Xin Pan
61491ce250
clean
...
test=develop
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb.
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
minqiyang
3de0f612e8
Polish code
...
test=develop
7 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Feature/py_func op
7 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
7 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
7 years ago
chengduo
550e7e410b
Code Clean parallel_executor.py ( #14849 )
...
* refine parallel_executor
* remove unnecessary code
test=develop
7 years ago
minqiyang
3cd10a7c4f
Add Conv2D forward
...
test=develop
7 years ago
minqiyang
8d88c5a87d
Shameless copy
7 years ago