Yang Yang
d002f60af9
merge develop
7 years ago
Yang Yang
e0e45c052e
merge develop
7 years ago
Yang Yang
28b240bbcf
delete todo in MergeLoDTensor
7 years ago
Yu Yang
b8a17987ec
Feature/parallel for bug fix ( #7474 )
...
* Fix ParallelDo not support empty input gradient
* Polish ParallelDo and fix several bugs
* Fix CI
* Fix CI
7 years ago
Yan Chunwei
ac73900942
enhance/add lod check ( #7439 )
7 years ago
Qiao Longfei
23df6c4478
Add get lod for debug ( #7375 )
...
* add GetLoD for debug
* add LoDToString
* optimize if
* typo
* add lod_tensor to operator's dependency
7 years ago
Yang Yang
208f950ccc
delete todo
7 years ago
Yang Yang
8c1025d66f
first commit
7 years ago
Yang Yang(Tony)
4bcc0b64cb
[WIP] feature/parallel_gpu ( #7293 )
...
feature/parallel_gpu
7 years ago
Yu Yang
ce6dad3b35
Rename CopyFrom to Copy for tensors ( #7292 )
...
* Rename Tensor::CopyFrom to Tensor::Copy
* Fix CI
* Fix compile
7 years ago
Yu Yang
1dad4bb2e9
Remove unused LoDTensor methods ( #7247 )
...
* Remove unused LoDTensor methods
* Update
7 years ago
Yang Yu
8496b2e41b
Refine parallel_do
7 years ago
Yang Yu
60e27d110b
Merge branch 'develop' of github.com:baidu/Paddle into parallel_do
7 years ago
Yancey
e5fe8935fb
send_recv variables ( #7161 )
...
* send_recv variable
* delete unused logs
* fix ci failed
* update
* resize tensor before tensor copy
* add selectedrows unit test
* check rows
7 years ago
Yang Yang
9313233297
merge develop
7 years ago
Yang Yu
63e3150772
Update code
7 years ago
Yang Yang
cb0b81f97b
add << lodtensor
7 years ago
Yancey
2cdef424d9
Implement selectedrows serialize and deserialize ( #7042 )
...
* implement selectedrows serialize and deserialize
* make serialize/deserialize as global function
* recover send_imp.cc
* delete unused brackets
* fix compile error
* serialize version in LodTensor and SelecetedRows
* fix ci
* fix ci
7 years ago
Luo Tao
761b329793
unify the indentation of license
7 years ago
dzhwinter
0d2235aadf
GPUPlace to CUDAPlace ( #6960 )
7 years ago
Yang Yang
f879ef23c3
pass forward backward runtime
7 years ago
Yang Yang
f899150e0a
pass forward runtime
7 years ago
Yu Yang
e445b3ff20
Move framework.proto to proto namespace ( #6718 )
...
* Move framework.proto to proto namespace
* Fix compile
* Fix compile
* Fix Compile
7 years ago
武毅
0a8a86e0c9
Send recv op ( #5520 )
...
* WIP send recv op
* WIP send recv
* put grpc impl in details
* put grpc impl in details
* update wip
* update proto
* update proto
* update proto
* clean cmake
* wip on op implementations
* wip on op implementations
* compile ok adding ut
* wip unitest
* add extern cares for linking
* wip add ut
* working version send recv
* revert optimizer.py
* update test cmake
* add libtool to dockerfile
* update cmake dependency
* update cmake depends
* update cmake grpc depends
* fix cmake dependency
* fix compile error
* fix compile
* follow comments
* update
* update copyfrom
7 years ago
Yu Yang
f72729d407
Feature/rnn to array to lod tensor ( #5411 )
...
* Add LoDRankTable
LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.
* Add skeleton for array_to_lod_tensor and lod_tensor_to_array
* Add VarType::LoDTensorArray
* Add PyBind of LoDTensorArray
* Add InferVarType
* Add first unittest
* Add ut
* Add unittest
* Add unittest
* Add unittests
* update
* init
* add infershape for lod_tensor_to_array_op
* compelete array_to_lod_tensor_op
* copy data
* clean code
* clean code
* Fix unittest data
* fix bugs
* fix compile error
* Refine TensorToArrayOp
* refactor array_to_lod_tensor
* Unittest
* fix bugs
* Fix unittest
* Fix unittest
* debug
* Debug
* Fix unittest
* clean code
* refactor
* use ostream
* update test
* fix gpu build error
* make gpu test pass
8 years ago
fengjiayi
d05c182e93
Add LoD's slice and append function ( #5368 )
...
* Add GetFineGrainedLoDLength and AppendLoD
* Follow comments and fix bugs
* fix a compile error
* fix a compile bug
8 years ago
Yu Yang
efc2464f6c
Feature/save op ( #5090 )
...
* Init
* Stash
* Polish SaveLoadOp
* Fix CI
* Polish code
* Save GPU Tensor
* Stash
* Fix CI
8 years ago
Yan Chunwei
9a0233de9f
Feature/tensor array lod pack ( #5007 )
8 years ago
dzhwinter
fd2eb55071
"Serialize LoDTensor, Save/Restore model" ( #4602 )
...
* "add model format design doc"
* "add restore function"
* "add parse protobuf"
* "move necessary information to saver.proto"
* "format code"
* "add gpu option"
* "add lod info"
* "add saveop python test wrapper"
* "checkpoint reuse save operator"
* "rewrite model format design doc"
* "async support needed"
* "fix run once"
* "fix doc based on comments"
* "refine based on comments"
* "fix based comments"
* "remove persistable flag from framework.proto"
* "add IndicateDataType to restore op"
* "add save test"
* "modify save restore code"
* "modified the restore logic"
* rm checkpoint_op.cc
* rm test_checkpoint_op.py
* "get inputs outputs name from execution context"
* Saving each variable to a independent file
* Fix bugs
* Rewrite save_restore_op_test with new Python framework
* Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase
* Refine unit test of SaveOp and RestoreOp
* fix compile errorwq
8 years ago
Yan Chunwei
db7b11719b
change lod tensor to absolute offsets ( #4952 )
8 years ago
Yan Chunwei
e43073e898
design of TensorArray to simplify varient/dynamic RNNs ( #4447 )
...
* some enforce change
* add LoDTensor::NumElements(id,id)
* update
* init
* finish
* refacter interfaces
8 years ago
Yan Chunwei
fdc4d4abce
rename LoDTensor::Slice to Shrink ( #4343 )
8 years ago
Yan Chunwei
c0523f24b0
rename LOD to LoD for short of "Level of Details" ( #3936 )
8 years ago
qijun
f03811b06f
init refine LODTensor
8 years ago
Yan Chunwei
1ee633d1d2
remove detail from LODTensor ( #3364 )
...
* remove SliceCopied
* remove SliceCopied
* rename SliceShared to SliceLevels, SliceInLevel
* merge lod_tensor/details
* remove lod_start_pos_'s shared_ptr
* make lod-tensor a special tensor
* add clone to lod_tensor
* add lod tensor impl
* add lodtensor clone test
* init lod
* add equal
* merge LOD and its methods
* recover tensor and variable
* change thrust to host_vector
8 years ago
Yan Chunwei
ede02d7d79
LODTensor (Level of details, or Level of sequences Tensor). ( #3109 )
...
* add lodtensor
* add reshape of lod
* add details
* rename Elements/Levels
* size_t and vector reserve
* add details
* add const& std::shared_ptr
* add lod_tensor_impl.h
* remove a shared_ptr
8 years ago