Yancey
a585b585dd
Batch barrier in send/recv op ( #7847 )
...
* initialize batch barrier
* add some comments
* update
* fix batch barrier
* use sendvariable rpc interface to send batch barrier
* fix comment
* fix method
* fix by comment
* fix by comment
7 years ago
Yancey
0f0ce4e5ec
Fix cpplint ( #7914 )
7 years ago
chengduo
a026f52b73
refine channel ( #7910 )
7 years ago
武毅
8a6a339eb4
Merge pull request #7889 from helinwang/send_err
...
Fix send op data race
7 years ago
chengduo
9beec1212b
Add Channel ( #7442 )
...
* add Channle
* refine Channel
7 years ago
Helin Wang
0e1109cdf3
Fix send op data race
...
std::vector<bool> is not safe for concurrent write, even to difference indices.
More discussion:
https://stackoverflow.com/questions/48452611/is-stdfuturewait-a-memory-barrier-i-can-not-explain-this-data-race
7 years ago
helinwang
1ab1181460
aysnc send/recv, seriliaze/deserialize using threadpool. ( #7705 )
...
* aysnc send/recv, seriliaze/deserialize using threadpool
* implement paralell deserialization correctly
7 years ago
typhoonzero
5f4d9130f0
merge codes
7 years ago
typhoonzero
ae19d2ea1e
fix comm issues
7 years ago
typhoonzero
f233b936c7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into distributed_split_selectedrows
7 years ago
typhoonzero
02ea349101
enhance dist train performance
7 years ago
helinwang
052c05be34
Merge pull request #7537 from Yancey1989/distributed_gpu
...
Fluid distributed supports CUDA place
7 years ago
gongweibao
535fefb7e8
Fix grpc bugs ( #7435 )
...
Fix grpc bugs
7 years ago
Yancey1989
9de1809590
fluid distributed on CUDA place
7 years ago
gongweibao
da3087ada1
Async GRPC sendrecv ( #7133 )
...
Async GRPC sendrecv
7 years ago
Yancey
e5fe8935fb
send_recv variables ( #7161 )
...
* send_recv variable
* delete unused logs
* fix ci failed
* update
* resize tensor before tensor copy
* add selectedrows unit test
* check rows
7 years ago
typhoonzero
d2ded51adf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sendrecv_ut
7 years ago
Luo Tao
761b329793
unify the indentation of license
7 years ago
typhoonzero
4dde9a0057
fix send recv unit test
7 years ago
dzhwinter
0d2235aadf
GPUPlace to CUDAPlace ( #6960 )
7 years ago
typhoonzero
b848416166
follow comments
7 years ago
typhoonzero
b1b7af400f
support multi node
7 years ago
typhoonzero
7be79231e1
wip multi-trainer
7 years ago
typhoonzero
1e549563d5
multi trainers
7 years ago
typhoonzero
9508c72685
wip: should fix variable recreate
7 years ago
typhoonzero
b4cd7f3d75
wip need ut
7 years ago
typhoonzero
489b9695e4
wip for testing
7 years ago
武毅
e1b8c27acb
Add back print_operators_doc ( #5970 )
...
* add back print_operators_doc
* fix style check
* fix style check
7 years ago
Yu Yang
985e4ab62d
Add Python wrap of conv2d_transpose and its unittest ( #5946 )
...
* Add Python wrap of conv2d_transpose and its unittest
* Follow comments
* Fix format
7 years ago
武毅
0a8a86e0c9
Send recv op ( #5520 )
...
* WIP send recv op
* WIP send recv
* put grpc impl in details
* put grpc impl in details
* update wip
* update proto
* update proto
* update proto
* clean cmake
* wip on op implementations
* wip on op implementations
* compile ok adding ut
* wip unitest
* add extern cares for linking
* wip add ut
* working version send recv
* revert optimizer.py
* update test cmake
* add libtool to dockerfile
* update cmake dependency
* update cmake depends
* update cmake grpc depends
* fix cmake dependency
* fix compile error
* fix compile
* follow comments
* update
* update copyfrom
7 years ago
Yang Yang(Tony)
18f0c40a97
feature/while_grad_op ( #5554 )
...
* first commit
* Python API for while op
* Python Unittest for simple while_op forward
* fix out to be list
* Fix UT
* VarType
* Fix several bugs
* Fix bug
* Fix bug
* Fix Bug
* Fix bug
* Fix unittest
* Remove debug log
* Add comments
* add PADDLE_ENFORCE
* while_grad_op first commit
* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs
* not sure how to setdim of while outputs
* push for test
* add executor vlog
* fix bug of while_op cond
* Several enhancement for code
1. Backward always infer shape & infer var type. Since there are RENAME
variables will be created when creating backward operator, but their
shape & var types are not inferenced.
2. Never use SomePtr-> directly, since every pointer could be nullptr if
it is a function return value. Add `detail::Ref` to cast pointer to
reference safely.
3. Enhance error message for backward.
4. Infer data type of variable in `sum` and `tensor_write`
* Fix bugs of while_op gradient
* Fix several bugs of while_op grad
* fix fill zeros like
* fix 3 >= 3
* fix place holder shouldn't be null
* fail on sum op
* Fix SumOp of TensorList
* clean up
* pass while test
* fix test_array_write_read
* pass sum op
* Support int/int64 for fill_constant_batch_size_like
* Fix compile
7 years ago
Yi Wang
4558807c48
Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
8 years ago
Yu Yang
84500f9487
Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
...
By shell command
```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
8 years ago
Yu Yang
07915c95ec
Renamed to strided_memcpy and prettify unittests
...
Add unittests for Crop and Concat
8 years ago
Yu Yang
3a4897ab15
Add TensorCopy method
...
A method to copy a tensor with stride and dimension. It is useful
for Crop, Concat, etc.
8 years ago