It will be used for the LoD information in LoDTensor, since LoD is a
copy-on-write field.
Copying LoD information between operators is quite slow; for ResNet it
costs roughly 10% of the total time, including data reading.
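A minimal sketch of the copy-on-write idea, assuming a simplified LoD type; the names here are illustrative, not the actual Paddle API:

```cpp
// Copy-on-write LoD field: readers share one underlying vector via a
// shared_ptr, and a private deep copy is made only on the first write.
#include <memory>
#include <vector>

using Vector = std::vector<size_t>;
using LoD = std::vector<Vector>;

class LoDHolder {
 public:
  // Read access never copies; it just dereferences the shared pointer.
  const LoD& Get() const { return *lod_; }

  // Write access detaches first if the data is shared with another holder.
  LoD* GetMutable() {
    if (lod_.use_count() > 1) {
      lod_ = std::make_shared<LoD>(*lod_);  // deep copy on first write only
    }
    return lod_.get();
  }

 private:
  std::shared_ptr<LoD> lod_ = std::make_shared<LoD>();
};
```

Copying a `LoDHolder` between operators then costs only a refcount bump instead of a deep copy of the whole offset table.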
* "add c++ side kernel selection"
* "add multiple kernel op test"
* "kernel selection only support cudnn"
* "better formatter"
* "small fix with UseCPU"
* "depends on change interface Get(Place, Library)"
* "fix CI"
* "fix python cudnn test"
* "leave the register cudnn op to another PR"
* "fix CI"
* "use all kernel by default"
* "fix CI"
* implement SelectedRows serialize and deserialize
* make serialize/deserialize global functions
* recover send_imp.cc
* delete unused brackets
* fix compile error
* serialize version in LoDTensor and SelectedRows
* fix ci
* fix ci
* implement a simple threadpool (see the sketch after this group of commits)
* unlock before cv.notify
* add done function
* add lock with GetAvailable function
* delete done_
* use call_once in GetInstance
* update by comment
* update comment
* enhance unit test for multi-threaded tasks
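A minimal sketch of the threadpool these commits describe, with illustrative names rather than the actual Paddle interface. It also shows the "unlock before cv.notify" point: the queue mutex is released before `notify_one`, so a woken worker is not immediately blocked on a lock the notifier still holds:

```cpp
#include <condition_variable>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

class ThreadPool {
 public:
  explicit ThreadPool(size_t num_threads) {
    for (size_t i = 0; i < num_threads; ++i) {
      workers_.emplace_back([this] {
        for (;;) {
          std::function<void()> task;
          {
            std::unique_lock<std::mutex> lock(mu_);
            cv_.wait(lock, [this] { return done_ || !tasks_.empty(); });
            if (done_ && tasks_.empty()) return;
            task = std::move(tasks_.front());
            tasks_.pop();
          }
          task();  // run the task outside the lock
        }
      });
    }
  }

  void Run(std::function<void()> task) {
    {
      std::lock_guard<std::mutex> lock(mu_);
      tasks_.push(std::move(task));
    }  // unlock before notify, so the woken worker can take the mutex at once
    cv_.notify_one();
  }

  ~ThreadPool() {
    {
      std::lock_guard<std::mutex> lock(mu_);
      done_ = true;
    }
    cv_.notify_all();
    for (auto& w : workers_) w.join();
  }

 private:
  std::vector<std::thread> workers_;
  std::queue<std::function<void()>> tasks_;
  std::mutex mu_;
  std::condition_variable cv_;
  bool done_ = false;
};
```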
* Add LoDRankTable
LoD Rank Table stores the `level` of `lod`, ordered by sequence length in
descending order. It is useful when implementing dynamic RNN, and it is
shared by the dynamic RNN memory, dynamic RNN slice-input, and dynamic RNN
slice-output operators.
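A hypothetical sketch of how such a rank table could be built from one LoD level (the names are illustrative):

```cpp
// Build rank-table items (original index, sequence length) from one LoD
// level, sorted by length in descending order, so the dynamic RNN operators
// can process sequences longest-first while sharing the same ordering.
#include <algorithm>
#include <cstddef>
#include <vector>

struct RankItem {
  size_t index;   // position of the sequence in the original batch
  size_t length;  // number of time steps in the sequence
};

std::vector<RankItem> BuildRankTable(const std::vector<size_t>& level) {
  // `level` holds offsets: {0, 3, 5, 9} describes sequences of lengths
  // 3, 2, and 4.
  std::vector<RankItem> items;
  for (size_t i = 0; i + 1 < level.size(); ++i) {
    items.push_back({i, level[i + 1] - level[i]});
  }
  std::stable_sort(items.begin(), items.end(),
                   [](const RankItem& a, const RankItem& b) {
                     return a.length > b.length;  // descending by length
                   });
  return items;
}
```

For example, the level `{0, 3, 5, 9}` yields the items `(2, 4), (0, 3), (1, 2)`.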
* Add InferVarType
* add sparse support for sum op
* typo fix
* fix gpu build error
* fix unittest error
* typo fix
* infer var type and shape in op_test
* follow comments
* fix build error
* bypass some unittests that depend on NetOp
* Simplify Gradient Check
* Stash
* Extract apply_backward_pass to backward.py
Rename apply_backward_pass to append_backward_ops
* Use graph API to check gradient
* Fix ci
* Fix CI
* Fix backward for double precision
* Stash
* Fix CI
* Fix ci
* Ignore GRU test
* Ignore xe op
* Fix CI
* Fix softmax with xe gradient
The correct equation should be IG = OG * d_softmax_with_xe(), where IG is
the input gradient and OG is the output gradient.
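For reference, assuming `d_softmax_with_xe` denotes the combined derivative of softmax followed by cross-entropy, the standard result is:

```latex
% p = softmax(x) over the logits x; y is the one-hot label distribution.
L = -\sum_j y_j \log p_j, \qquad
p_i = \frac{e^{x_i}}{\sum_k e^{x_k}}, \qquad
\frac{\partial L}{\partial x_i} = p_i - y_i
% so the input gradient is the output gradient scaled elementwise:
% IG_i = OG \cdot (p_i - y_i)
```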
* Fix typo
* Fix merge error
* Disable LRN
* "add model format design doc"
* "add restore function"
* "add parse protobuf"
* "move necessary information to saver.proto"
* "format code"
* "add gpu option"
* "add lod info"
* "add saveop python test wrapper"
* "checkpoint reuse save operator"
* "rewrite model format design doc"
* "async support needed"
* "fix run once"
* "fix doc based on comments"
* "refine based on comments"
* "fix based comments"
* "remove persistable flag from framework.proto"
* "add IndicateDataType to restore op"
* "add save test"
* "modify save restore code"
* "modified the restore logic"
* rm checkpoint_op.cc
* rm test_checkpoint_op.py
* "get inputs outputs name from execution context"
* Save each variable to an independent file
* Fix bugs
* Rewrite save_restore_op_test with new Python framework
* Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase
* Refine unit test of SaveOp and RestoreOp
* fix compile error
* Implement FC layer with helper
* Update LayerHelper
* Add debug string for Python ProtoBuf
and Rename `Sync` to `Flush`
* Add check of ProtoBuf initialization
* Layer wrapper for FC
* Fix unittest
* Fix CI
* Add code generator
* AttributeChecker: better error logs and specialize bool
Since many types can be implicitly cast to bool
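To illustrate why bool warrants special handling (a hedged sketch, not the actual AttributeChecker code): almost any arithmetic value converts to bool implicitly, so a purely generic check could silently accept values like `3` or `2.5` as a bool attribute instead of reporting a clear type error:

```cpp
// Generic getter rejects mismatched types; bool gets a full specialization
// so that values which merely *cast* to bool produce a clear error.
#include <stdexcept>
#include <string>
#include <variant>

using Attribute = std::variant<bool, int, float, std::string>;

template <typename T>
T GetAttr(const Attribute& attr, const std::string& name) {
  if (const T* v = std::get_if<T>(&attr)) return *v;
  throw std::invalid_argument("Attribute '" + name + "' has the wrong type");
}

template <>
bool GetAttr<bool>(const Attribute& attr, const std::string& name) {
  if (const bool* b = std::get_if<bool>(&attr)) return *b;
  // Accept a literal 0/1 int, but nothing else that merely casts to bool.
  if (const int* i = std::get_if<int>(&attr)) {
    if (*i == 0 || *i == 1) return *i == 1;
  }
  throw std::invalid_argument("Attribute '" + name + "' must be a bool");
}
```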
* Complete mlp, fit_a_line
* Expose get global scope
* Make global scope not thread-safe
1. There is no need to make the global scope thread-safe, since it will be
invoked only from the Python main thread.
2. Do not free the global scope when C++ exits. Let the OS free the memory;
otherwise, we would need to handle the destruction-order dependencies.
See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
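That style-guide rule leads to the usual "leaky singleton" shape, sketched here with illustrative names (`Scope` is a stand-in for the real class):

```cpp
// Constructed on first use and intentionally never destroyed: the OS
// reclaims the memory at process exit, so there is no destruction order to
// get wrong. No locking, since only the Python main thread calls it.
struct Scope { /* variable storage elided */ };

Scope& GetGlobalScope() {
  static Scope* g_scope = new Scope();  // intentionally leaked
  return *g_scope;
}
```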
* Fix
* Implementation of simple conv_2d layer
* Stash
* Remove private data members in OpRegister
* Fix bugs
* Stash
* Expose FeedFetchList as VarType
* Change ProgramDesc so it is not a global variable
* Polish code style
* Stash
* Correctly implement BlockDesc destructor
* Correctly implement BlockDesc destructor
* Unify `program` as the parameter name
* Fix bugs
* Add unittest
* Fix unit test error
* Remove unused functions
* Add clone for Python Program
* Working on executor
* Stash
* Add glog as a dependency of ops
* Use VLOG to log information that is helpful when debugging Paddle
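For readers unfamiliar with glog's verbose logging: `VLOG(n)` messages print only when the verbosity level is at least `n`, e.g. when running with `GLOG_v=3`. A hypothetical call site (the real Paddle code differs):

```cpp
#include <glog/logging.h>

void Compute() {
  VLOG(3) << "entering Compute";          // visible with GLOG_v >= 3
  VLOG(10) << "per-element detail here";  // only at very verbose levels
}
```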
* Expose VarDesc::persistable to Python
* Test executor
* Complete unittest
* Polish code
* Fix merge error
* Follow comment
* Polish Python Code
* Compare OpDescBind directly
* add target to Backward; generate vars in the block when calling backward
* modify backward_test
* fix executor_test
* set var desc default type to LOD_TENSOR
* update backward_test
* insert loss at the top level of backward
* create grad vars for all blocks in the current program
* optimize code
* update test_program.py
* only create vars for newly created blocks during backward