yuyang18
017bba1664
Add op role
7 years ago
yuyang18
08295f9877
Add build strategy
7 years ago
yuyang18
e5281b3c2d
Clean code & add execution strategy
7 years ago
typhoonzero
928418a9ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH
97cb5479ae
change PE strategy
7 years ago
typhoonzero
d9320dcd94
complete code
7 years ago
fengjiayi
d11b8e56e5
fix
7 years ago
yangyaming
f456cd8079
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
yangyaming
82571deb89
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Yu Yang
0c24b3f937
Clean memcpy async
7 years ago
Yu Yang
7a395881d4
Add customize_loss_grad option to PE
7 years ago
Yu Yang
f738691777
Merge pull request #9740 from dzhwinter/memory/activation
...
"polish activation"
7 years ago
Liu Yiqun
7ffbcbcaf0
Add flush of program desc to update the proto information.
7 years ago
dzhwinter
1fc9fe773d
merge develop
7 years ago
Yiqun Liu
598035f985
Fix a bug in save_inference_model and prune when the program is initailized by load_inference_model ( #10011 )
...
* Fix bug in save_inference_model and prune when the program is initialized by load_inference_program.
* Save the transpiled program instead.
7 years ago
dzhwinter
425a1e76a2
Merge remote-tracking branch 'origin/develop' into memory/activation_memory
7 years ago
Yu Yang
b4aaa00a8a
Polish logic of ParallelExecutor
7 years ago
Yu Yang
2ab12ca248
Add comments and clean code
7 years ago
Yu Yang
a822f8ddbb
Add wait
7 years ago
Yu Yang
e9e27e0f32
Revert
7 years ago
Yu Yang
e6bc7bb09a
Tuning code
7 years ago
Yu Yang
0ca28b8507
Sync Copy
7 years ago
dzhwinter
e54f203c55
"move to a new PR"
7 years ago
Yang Yang(Tony)
a64edbf14e
delete backward.cc related code on the python side ( #9854 )
7 years ago
Yang Yang(Tony)
b26f505002
remove net op and cond_op ( #9663 )
...
* remove net op and cond_op
* fix cpplint
* fix dependency
* delete backward_test; fix compile
* disable batch_norm backward
* rm test_net.py
* make batchnorm test independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* delete unused code
* clean up
7 years ago
typhoonzero
dfc60251bf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multigpumultinode
7 years ago
fengjiayi
90084a25d2
Merge pull request #9743 from JiayiFeng/modify_readers_to_fit_parallel_executor
...
Modify readers to fit the parallel executor
7 years ago
fengjiayi
a84b81502c
Remove Readers' HasNext()
7 years ago
typhoonzero
0bf799a523
wip testing
7 years ago
Xin Pan
326b434ab5
Merge pull request #9699 from panyx0718/test
...
Avoid init_p2p all the times
7 years ago
Yu Yang
a4bf635795
Add gitignores
7 years ago
qingqing01
2b7e5bd366
Support testing during training by ParallelExecutor. ( #9738 )
...
* Support testing during training by ParallelExecutor.
* Add unit test.
* Improve the interface.
* Follow comments.
7 years ago
Yi Wang
25ad6884bb
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into cpplint-memory-detail
7 years ago
Luo Tao
50e036a4ed
fix compiler error on `tensor_py.h`
7 years ago
Yi Wang
eebb205324
Update CMakeLists
7 years ago
Yi Wang
1543c4cf6a
Fix cpplint errors of paddle/fluid/pybind and add some tests ( #9694 )
...
* cpplint test and add tesnor_py_test.cc
* Update
* Update
7 years ago
Xin Pan
b138d29c38
Avoid init_p2p all the times
7 years ago
Yi Wang
797a7184ac
Unify Fluid code to Google C++ style ( #9685 )
7 years ago
lgone2000
517f619501
fix pybind.cc compile error ( #9681 )
7 years ago
chengduoZH
4ff237f93c
follow comments
7 years ago
chengduoZH
17842e33c3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/expose_CUDAPinnedPlace_to_python
7 years ago
chengduoZH
8e4e155c52
add PyCUDAPinnedTensorSetFromArray
7 years ago
chengduoZH
6af178356b
expose CUDAPinnedPlace to Python
7 years ago
Luo Tao
09b53c086d
add remove_var from c++ end
7 years ago
Xin Pan
b123ce88a1
Add enable/disable for delayed ops
7 years ago
Yu Yang
50e7e25db3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Yu Yang
edfd741e3a
Add simple python wrapper for ParallelExecutor
7 years ago
sabreshao
e0ac6bc436
CMake refine for HIP support.
...
Fix CI.
7 years ago
Yu Yang
d42117e742
Set NumThreads
7 years ago
Yu Yang
9cb8f50302
Complete fetch op
7 years ago
sabreshao
45c988d86a
Demostration of cmake refine for HIP support.
...
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
7 years ago
Yu Yang
5e87cd7574
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
qingqing01
45073b7c39
Always synchronize when copy data on GPU from C++ to Numpy array. ( #9110 )
7 years ago
Kexin Zhao
e26f1123da
Add fp16 mul op support and bind paddle fp16 to numpy fp16 ( #9017 )
...
* add fp16 mul op support
* small fix
* fix bug
* small fix
* fix PADDLE_WITH_CUDA compiling issue
* reorg code
* test for pybind
* treate as float16 as uint16_t in pybind
* bind np.float16 to paddle float16
* small fix
* clean code
* remove redundancy
* fix mul_op test
* address comments
* small fix
* add is_float16_supported func
7 years ago
Yu Yang
baef1124fb
ParallelExecutor And dependency engine
7 years ago
Yu Yang
e13aec601a
Merge pull request #8830 from reyoung/feature/recordio_file_reader
...
Feature/recordio file reader
7 years ago
QI JUN
f7e9fe57d3
[Memory]More memory optimization policy ( #8690 )
...
* add memopt level
* add opt level for image classification demo
* clean code
* add delete op
* clean code
* test machine translation demo
* clean code
* clean code
* skip fill constant with force cpu
* clean code
* clean code
* refine code
* clean code
* fix bug
7 years ago
Yu Yang
fea43077f6
Refine
7 years ago
Yu Yang
72be7a6151
Complete RecordIO reader op
7 years ago
Xin Pan
b9ec24c6e9
Extend current profiler for timeline and more features.
7 years ago
typhoonzero
a50230add6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_raw_var_type
7 years ago
typhoonzero
9b5be6ef43
fix short connection again
7 years ago
Yu Yang
eebfb71d3f
Merge pull request #8524 from reyoung/feature/unique_name_guard
...
Moving unique_name to python
7 years ago
Yu Yang
e8cb97b8a2
Moving unique_name to python
...
* Add reset and guard to unique_name
7 years ago
Yu Yang
7a9098a60e
Add block.fwd_block_id ( #8489 )
...
* Add block.fwd_block_id
* fix bug in memory optimization transpiler
* Change DFS to BFS
* Add comments
7 years ago
kavyasrinet
77ee8fb240
Exposing Channel to be used as a Variable and integrating with Fluid ( #8486 )
...
* Adding set_capacity method support
* Adding Python for make_channel
* Updating notest_concurrency
* Write python for make_channel method
* Write python for make_channel method
* Fix make_channel and test
* Placeholder ops for channel send, recv and close
* Adding ToTypeIndex method to var_type.h
* Add var_type.h to channel:
* Added POD_Type to the method
* Add CHANNEL to executor
* Updated get and set DataType to accomodate Channels
* Updating get and set to incorporate channels
* Adding CHANNEL as supported VarType in protobuf
* Removing unecessary import
* Fixing VarDesc to adapt to Channel as VarType
* Add channel.h to executor
* Remove innclude from channel
* Updated var_type to support Channel as var type
* Adding get_channel to pybind
* Added ChannelHolder
* Adding make_channel as an op
* Adding ChannelHolder in channel
* Fixing typo
* Commenting out operators in concurrency
* Removing totypeid right now since we don't need it.
* Reverting python changes
* Fixing typo in framework.py
* Modify comments for ReaderHolder
7 years ago
武毅
c490f1b38b
Merge pull request #8049 from typhoonzero/no_counter_on_pserver
...
Enhancement/transpiler rename grad vars to add trainer id, so RPC call can be retried.
7 years ago
Yang Yang
ec01f635f5
merge develop
7 years ago
Abhinav Arora
c7ad26d6a4
[WIP] Move DataType enum inside VarType ( #8447 )
...
* Move Pod Types from DataType enum to Type enum
* Fixed data_type.h
* Fix type in TensorDesc
* Add comment to framework.proto
* Fixed type in data_type.h
* Updated format of type in data_type.h
* Fix var_desc.h
* Fix op_kernel_type.h
* Fixed data_type_transform_test.cc
* Fix operator.h
* Fixed data_type_transform.cc
* Fixed op_kernel_type_test.cc
* Fix operator.cc
* Fixed data_layout_transform_test.cc
* Fix var_desc.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* fixed protobuf.cc
* Fix data_layout_transform_test.cc and op_kernel_type_test.cc
* Fixed rnn_memory_helper_op.cc
* Fix progrma_desc_test.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fix operator_test.cc
* Fixed fill_constant_op.cc
* Fixed gaussian_random_op.cc
* Fixed uniform_random_op.cc
* Fixed edit_distance_op.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fixed rnn_memory_helper_op.cc
* Fixed chunk_eval_op.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* Fixed cast_op.h
* Fixed cast_op.h
* Fix fill constant op
* Fixed clang for assign_value_op.cc
* Fix one_hot_op.h
* Fix one_hot_op.cc
* Fix fill_op.cc
* Fixed sum_op.cc
* Fixed sum_op clang
* Fix uniform_random_op.cc
* Fix gaussian_random_op.cc
* Fix backward.cc
* Fix protobuf.cc
* Fixed prune_test.cc
* Fixed op_registry_test.cc
* Fix data_device_transform_test.cu
* Fix travis error
* Fixed one_hot_op.cu
* Fixed op_registry_test.cc
* Fixed nccl_op.cc
* Fixing python tests
* Revert "Fixing python tests"
This reverts commit fccaa4c5818ed9f379ea1ce4315066cc78076c64.
* Fixing Pybind to remove data type
* Fixing tensor.py
* Updated the new files:
* Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
7 years ago
Yang Yang
5229ccbdc7
merge develop
7 years ago
Yi Wang
cfffb1a362
Update tensor_util.h ( #8422 )
...
* Update tensor_util.h
* Update with moved TensorDesc
* Fix tensur_utils.cu
* Update
* Update
* Update
* Update
* Make tensor_util.cu a symbolic link
7 years ago
typhoonzero
1494673dcb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
Yang Yang
7129fa3c90
merge develop
7 years ago
Abhinav Arora
fcadb45251
Separate VarType from VarDesc in framework.proto and fix all related compiler errors ( #8414 )
...
* Refine Type system
* Fixing type inference
* Fixed create_reader_op.cc
* Fix var_desc.h
* Fixed executor.cc
* Fix shape_inference.h
* Fixed create_reader_op.cc
* Fix tensor_util.h
* Fixed var_type_inference_test.cc
* Fix shape_inference.cc
* Fixed sum_op.c
* Fixed read_op.cc
* Fix var_type.h
* Fixed beam_search_decode_op.cc
* sendrecvop_utils.cc
* Fix operator.cc
* Fixed lookup_table_op.cc
* Fixed op_desc.cc
* Fixed get_places_op.cc
* Fixed lod_rank_table_op.cc
* Fixed beam_search_op.cc
* Fix var_desc.cc
* Fixed lod_tensor_to_array_op.cc
* Fixed while_op.cc
* Fix program_desc_test.cc
* tensor_array_read_write_op.cc
* Fix assign_op.cc
* Fix executor.cc
* Fix protobuf.cc
* Fix protobuf.cc
7 years ago
Yang Yang
da97d9dfd6
merge develop
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
typhoonzero
11bcb43a44
fix merge issue
7 years ago
Tao Luo
b56f4a4ee2
move code from /paddle/string to /paddle/fluid/string ( #8363 )
7 years ago
typhoonzero
b0096361c7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
Yang Yang
cd9e660d4b
merge develop
7 years ago
Luo Tao
77f04fd97a
move paddle/pybind/pybind.h to paddle/fluid/pybind/pybind.h, and cancel the test_parallel_op temporary
7 years ago
Yi Wang
bc7be8320e
Update pre-commit
7 years ago
Yi Wang
154368b1a0
Re-add the lack pybind.h
7 years ago
Yi Wang
fc374821dd
Correct #include path
7 years ago
Yi Wang
90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago