dzhwinter
71305e5f90
"polish code based on comment"
7 years ago
dangqingqing
b50c33fd00
Use fixed activation in the lstm kernel, since there is some bug in the activation function pointer. It will be fixed later.
7 years ago
wanghaoshuang
35e79448a0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into seq_expand_op
7 years ago
Yu Yang
b84e822651
Cast Operator ( #5149 )
...
* Cast Operator
Cast input variable to other data type
* Fix compile error
* Add cast op
* Follow comments
7 years ago
Yu Yang
46a13e37d7
Polish Accuracy Op ( #5191 )
...
* Accuracy does not support float/double, only support integers
* Polish error message when an operator does not support some device.
7 years ago
QI JUN
008f40ce09
support sparse output for lookup table grad op ( #5145 )
...
* add sparse support for sum op
* typo fix
* fix gpu build error
* fix unittest error
* typo fix
* infer var type and shape in op_test
* follow comments
* fix build error
* bypass some unittests depend on NetOp
* support sparse output for lookup table grad op
* refine codes
* fix gpu build error
* fix lookup table grad gpu kernel
* fix ci
* fix ci
* fix ci
* fix bug in lookup_table_grad op
* fix bug in test_word2vec
* register double kernel for some operators
* set is_sparse=True in test_word2vec
* fix lookup table grad op CUDA kernel bug
* disable test_modified_huber_loss_op temporarily
* disable test_lstm_unit_op temporarily
7 years ago
Yu Yang
8f6c0a0fad
Extract InferShape to many cc files ( #5174 )
...
* Shrink Operator.h
* Fix CI compile
7 years ago
chengduoZH
6bdf5c1417
fix bug
7 years ago
fengjiayi
6783dcee9e
Python API for inference model saving/load ( #5020 )
...
* Add `dump_to_file()` for ProgramDescBind in pybind
* Update
* Add utility.py
* typo
* Fix bugs
* Move add_feed/fetch_components to utility.py
* Complete dump
* Follow comments
* Change output of Prune() from inference to pointer
* Expose Prune() to Python
* Complete save/load API of inference model
* Fix errors
* Debugging
* Complete unit tests
* follow comments
7 years ago
zchen0211
52eb42cf7c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into batch-norm-latest
7 years ago
Yu Yang
2a5edec03e
Add debug logs in scope, meta_cache and memory ( #5170 )
...
* Add debug logs in scope, meta_cache and memory
* Add missing deps
7 years ago
zchen0211
03789a7df4
batch norm fully tortured and passed
7 years ago
Yu Yang
b067639621
Fix clang compile ( #5171 )
7 years ago
Abhinav Arora
1a26f5a548
Adding the Sign Op for L1 Weight Decay Regularization ( #5138 )
7 years ago
zchen0211
8a07aff4d7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into batch-norm-latest
7 years ago
zchen0211
822cf9785b
more test and bn fix
7 years ago
chengduo
09ed52831b
Merge branch 'develop' into Add_deconv3d_op
7 years ago
chengduo
e248866449
Merge branch 'develop' into Add_conv3d_gemm_op
7 years ago
chengduoZH
51113cfe52
write together
7 years ago
chengduo
5a95792925
Merge pull request #4636 from chengduoZH/Add_pool_cudnn_op
...
Add pool2d cudnn op
7 years ago
武毅
6ef9da8ef7
fix compile error ( #5160 )
...
* fix compile error
* remove unittest
* disable huber loss unittest
7 years ago
chengduoZH
ed120ee741
Add unit test
7 years ago
chengduoZH
bf3ae0631e
remove conflict
7 years ago
Tao Luo
c1fd1dc744
Merge pull request #5154 from tensor-tang/merge
...
refine check macro
7 years ago
chengduoZH
cadee843b8
follow comments
7 years ago
yangyaming
d2b10cc0b1
Refine doc and fix data type of metrics.
7 years ago
Tao Luo
92c32799e7
Merge pull request #4864 from luotao1/maxseq
...
add Max strategy for sequence_pool op
7 years ago
caoying03
cca383cfba
follow comments.
7 years ago
Yang yaming
fd5199fdf7
Merge pull request #3989 from pkuyym/fix-3923-r
...
Add huber loss operator.
7 years ago
Luo Tao
f086f5649b
Merge branch 'develop' into maxseq
7 years ago
武毅
24d988ce5c
Merge pull request #4063 from typhoonzero/auc_op
...
Auc op
7 years ago
tensor-tang
aecfeb7257
refine check macro
7 years ago
chengduoZH
df48b43b91
fix clear zero method and remove useless code
7 years ago
yangyaming
ac5f42184f
Using static_cast to make more robust.
7 years ago
caoying03
3afb9dc88a
use double in unittest.
7 years ago
chengduoZH
b9edcc4a1b
sss
7 years ago
yangyaming
97bfc0dfae
Add comments.
7 years ago
武毅
2000cafe75
Merge pull request #5132 from typhoonzero/fix_ft_job_converge
...
fix ft job converge
7 years ago
Yu Yang
8f4476b893
Add device.Wait() in fetch_op ( #5141 )
7 years ago
chengduoZH
0ab012cf7f
fix doc
7 years ago
caoying03
bce4f7d6eb
follow comments.
7 years ago
Abhinav Arora
94fb7ba4fa
Adding L1 norm op for L1 regularization ( #5058 )
...
* Adding L1 norm op for L1 regularization
* Addressing code review feedback
* Address code review feedback
* Change variable names to match google style guide
7 years ago
helinwang
20d9b20a2a
Fix CI style check.
7 years ago
Yu Yang
b44f4ccbeb
Make InferShape as a field in OpInfo ( #5139 )
...
* Op developer can add `InferShape` to any operator
7 years ago
QI JUN
7f8574c0f5
add sparse support for sum op ( #5093 )
...
* add sparse support for sum op
* typo fix
* fix gpu build error
* fix unittest error
* typo fix
* infer var type and shape in op_test
* follow comments
* fix build error
* bypass some unittests depend on NetOp
7 years ago
Dong Zhihong
37842d802d
rerun ci
7 years ago
Dong Zhihong
f632706c18
fix based on comment
7 years ago
Yu Yang
be00b0c4d6
Gradient check use graph ( #5027 )
...
* Simplify Gradient Check
* Stash
* Extract apply_backward_pass to backward.py
Rename apply_backward_pass to append_backward_ops
* Use graph API to check gradient
* Fix ci
* Fix CI
* Fix backward for double precision
* Stash
* Fix CI
* Fix ci
* Ignore GRU test
* Ignore xe op
* Fix CI
* Fix softmax with xe gradient
The correct equation should be IG = OG * (d_softmax_with_xe())
* Fix typo
* Fix merge error
* Disable LRN
7 years ago
Qiao Longfei
fc68290bcc
update _create_op_func_ and support generate dropout layer ( #5134 )
7 years ago
Dong Zhihong
52200523d6
"polish code based on comment"
7 years ago
Dong Zhihong
6cce5268ed
"fixed based on comment"
7 years ago
fengjiayi
aa379ccb5e
Add functions of restoring ProgramDescBind from ProgramDesc ( #5109 )
...
* complete restoring program_bind from program_desc
* Fix bugs
* fix compile errors
* fix errors and add unit tests
* rename some vars
* Follow comments
7 years ago
Dong Zhihong
16a39d24f3
fix conflict
7 years ago
Dong Zhihong
dbfd1302e1
"FIX CI"
7 years ago
Dong Zhihong
6bc261b933
fix ci
7 years ago
chengduoZH
25df892932
remove conflict
7 years ago
Siddharth Goyal
66476fc7b7
Add proximal adagrad optimizer ( #5128 )
7 years ago
yangyaming
65dbbd57af
Add and pass unittests.
7 years ago
chengduoZH
db1bb8224a
follow comments
7 years ago
typhoonzero
cec5e6511b
fix ft job converge
7 years ago
chengduo
8e3ecf5d11
Merge pull request #4814 from chengduoZH/Add_sequence_project_op
...
Add sequence_conv_op and sequence_projection functor
7 years ago
gongweibao
9d142d5060
Local response normalize. ( #4426 )
...
Add local response normalize
7 years ago
dangqingqing
bd680f157f
fix compiling warning.
7 years ago
chengduoZH
99c6f44a5a
follow comments
7 years ago
Luo Tao
746f2a2e36
only compute the first max value in backward
7 years ago
dangqingqing
bcc0dad7df
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lstm_bp
7 years ago
dangqingqing
ac3370a467
Add unit testing for gemv and fix the gradient check for bias.
7 years ago
chengduoZH
dcb3da591e
refine code
7 years ago
Zhaolong Xing
4273b3513a
Merge pull request #4473 from NHZlX/fix_merge_model
...
refine paddle_merge_model
7 years ago
chengduoZH
56bbfd1af2
Add deconv3d op
7 years ago
Yu Yang
aa3de3571d
Polish unit test for xe, generate probabilities ( #5096 )
...
* Cross Entropy Wrong
* Fix XE
* Polish gradient check for xe
* Fix compile
7 years ago
yangyaming
06c7c8c80e
Add CPU kernel.
7 years ago
chengduoZH
1bb0e2943b
Add pool2d cudnn
7 years ago
chengduoZH
eafbbc11a0
write conv2d and conv3d together
7 years ago
dangqingqing
2e0298744e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lstm_bp
7 years ago
dangqingqing
cd38286684
Add gradient check unit testing and fix bug.
7 years ago
Yu Yang
efc2464f6c
Feature/save op ( #5090 )
...
* Init
* Stash
* Polish SaveLoadOp
* Fix CI
* Polish code
* Save GPU Tensor
* Stash
* Fix CI
7 years ago
Dong Zhihong
4b9cf0e8b1
"add disable"
7 years ago
zhouxiao-coder
9b0f092853
change input data type
7 years ago
Yan Chunwei
39a6f43bcc
fix bug ( #5106 )
7 years ago
Yang Yang(Tony)
6cc2ce010a
add fill constant batch size like op ( #5057 )
7 years ago
Dong Zhihong
626ff3b79e
"polish cmake file"
7 years ago
Luo Tao
368737e41b
Merge branch 'develop' into maxseq
7 years ago
Abhinav Arora
b0a267c0b8
Adding the squared L2 norm operator for L2 regularization ( #5030 )
...
* Adding the L2 loss operator for L2 regularization
* Renaming l2_loss op to squared_l2_norm_op
* Addressing code review feedback
7 years ago
caoying03
4c63086921
Merge branch 'develop' into crf
7 years ago
Tao Luo
b68f2d209a
Merge pull request #5049 from tensor-tang/mkldnn_bn
...
enable mkldnn_batch_norm
7 years ago
武毅
97fcaef032
Merge pull request #5083 from typhoonzero/fix_sparse_update_size
...
fix sparse update size
7 years ago
Dong Zhihong
2573ac1448
"remove python side test case to another PR."
7 years ago
Dong Zhihong
4e165f4ea3
"fix create output variable bug"
7 years ago
Helin Wang
01df52aaad
Fix pserver SGD with momentum checkpoint recovery.
7 years ago
Qiao Longfei
56b723c40d
Cudnn batch norm op ( #5067 )
...
* init cudnn batch norm op
* rename batch_norm_cudnn_op.cc batch_norm_op.cu
* correct name style
* add ExtractNCWHD, simplify code
* fix ExtractNCWHD
* use CUDNN_ENFORCE instead of PADDLE_ENFORCE
7 years ago
helinwang
629cbdae01
Merge pull request #5095 from helinwang/zero
...
Fix new remote updater sending all 0 param / grad to pserver
7 years ago
Dong Zhihong
61c1b0469a
"fix multigpu testcase"
7 years ago
Dong Zhihong
38d3adfeb6
"add multioperator testcase"
7 years ago
Helin Wang
8383e48f6a
Fix new remote updater sending all 0 param / grad to pserver
7 years ago
helinwang
ea63390736
Merge pull request #5091 from helinwang/init
...
Fix new pserver having random momentum initial values
7 years ago
Yan Chunwei
9a0233de9f
Feature/tensor array lod pack ( #5007 )
7 years ago
zhouxiao-coder
c22f7fcd17
add positive_negative_pair_op evaluator
7 years ago
Dong Zhihong
94992a990b
"add multiop testcase"
7 years ago
Helin Wang
238abc4ad3
Fix new pserver having random momentum initial values
7 years ago
Dong Zhihong
bab5e53c03
Merge remote-tracking branch 'origin/develop' into feature/multigpu
7 years ago
Dong Zhihong
11cf3e3a43
"refactorization of nccl test case"
7 years ago
Dong Zhihong
6d1493a460
"add bcast c++ test case"
7 years ago
Dong Zhihong
5200c657a7
"move Tensor to LoDTensor"
7 years ago
caoying03
c74107bfdc
fix backward computation.
7 years ago
tensor-tang
5ba1e1e10a
Merge remote-tracking branch 'upstream/develop' into mkldnn_bn
7 years ago
typhoonzero
a2412ce070
fix sparse update size
7 years ago
tensor-tang
703947928f
refine comment and code
7 years ago
xzl
acfdc312f9
support trainconfig and modelconfig of MergedModel
7 years ago
Luo Tao
0f67a82728
add test_Expand and simplify the gserver/tests/CMakeLists
7 years ago
dangqingqing
d2bd7357d2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lstm_bp
7 years ago
dangqingqing
3d8b6ebcf8
Add LSTM backward implementation.
7 years ago
xzl
0dc4b29879
deal with conflict
7 years ago
chengduoZH
9f7c9875a9
fix doc
7 years ago
chengduoZH
3f8a7b55b8
remove conflict
7 years ago
Yu Yang
6c0b383672
Add VarType::STEP_SCOPES for RNN ( #5056 )
7 years ago
Qiao Longfei
ee998a9c44
CPU Batch Norm Op ( #4964 )
...
* init batch norm op
* prepare input output
* compute mean_out var_out save_mean save_var on CPU
* active is test
* use eigen to do computation
* complete batch norm forward
* set default momentum to 0.9
* add batch norm grad op in CPU
* add tensor_format and NHWC support, add python test
* add test training
* add batch norm gradient test
* improve comment, fix forward Python UnitTest
* add gradient test
* fix eigen warning
* follow name style
* fix a bug
* change float to T
* add simple forward test
* test with different place
* add backward test
* refine python test
* remove old python test code
* code clean
* follow code style
* update comment
7 years ago
Dong Zhihong
63fb41b399
"redefine the initop from kernel to OpBase"
7 years ago
helinwang
b54990e951
Merge pull request #5053 from helinwang/serialization
...
Fix parameter server checkpoint serialization crash
7 years ago
Dong Zhihong
1e8474b9f1
"delete python ops testcase"
7 years ago
dzhwinter
fd2eb55071
"Serialize LoDTensor, Save/Restore model" ( #4602 )
...
* "add model format design doc"
* "add restore function"
* "add parse protobuf"
* "move necessary information to saver.proto"
* "format code"
* "add gpu option"
* "add lod info"
* "add saveop python test wrapper"
* "checkpoint reuse save operator"
* "rewrite model format design doc"
* "async support needed"
* "fix run once"
* "fix doc based on comments"
* "refine based on comments"
* "fix based comments"
* "remove persistable flag from framework.proto"
* "add IndicateDataType to restore op"
* "add save test"
* "modify save restore code"
* "modified the restore logic"
* rm checkpoint_op.cc
* rm test_checkpoint_op.py
* "get inputs outputs name from execution context"
* Saving each variable to a independent file
* Fix bugs
* Rewrite save_restore_op_test with new Python framework
* Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase
* Refine unit test of SaveOp and RestoreOp
* fix compile error
7 years ago
Dong Zhihong
0990c87bf6
checkin nccl operator
7 years ago
Helin Wang
f28b4d6805
Fix parameter server checkpoint serialization
7 years ago
Dong Zhihong
da1181bfc6
Merge remote-tracking branch 'origin/develop' into feature/multigpu
7 years ago
Dong Zhihong
ef257e6d96
write nccl c++ test case
7 years ago
tensor-tang
8845218636
add batchnorm layer in simple test and branch test
7 years ago
tensor-tang
ad6b531917
add unit test for mkldnn_batch_norm layer
7 years ago
tensor-tang
64eaeba1a8
enable mkldnn_batch_norm layer
7 years ago
tensor-tang
02fdf24115
enable copyFrom of MKLDNNMatrix
7 years ago
chengduoZH
05239b6ff5
fix functor
7 years ago
yangyaming
5939a17c47
Follow comments and adapt to new interface.
7 years ago
caoying03
6a630f2798
Merge branch 'develop' into crf
7 years ago
dangqingqing
0190d5d644
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into check_out_lod
7 years ago
dangqingqing
2e783663fa
Enable to output LoD in fetch_op and check output LoD in the op unit test.
7 years ago
chengduoZH
b15c69f59d
remove conflict
7 years ago
caoying03
427644b2fa
fix the computation kernels.
7 years ago
chengduoZH
4c6bccbe20
fix doc and remove useless code
7 years ago
yangyaming
0521161070
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-3923-r
7 years ago
yangyaming
9a3f50d806
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-3923-r
7 years ago
wanghaoshuang
97f1b98759
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into seq_expand_op
7 years ago
wanghaoshuang
296167446c
Rewrite sequence expand op
7 years ago
chengduoZH
61dbf4b926
fix im2col interface
7 years ago
Luo Tao
35434c33d0
fix compiler error in im2col.h
7 years ago
chengduoZH
154dbb4697
Add unit test
7 years ago
qingqing01
23bf6b2c15
Merge pull request #4887 from chengduoZH/fix_im2col_kocf_for_sequence
...
Add up, down, left and right padding for im2col.
7 years ago
Zhuoyuan
8fdc315acc
Merge pull request #4739 from zchen0211/develop
...
deconv op implementing ...
7 years ago
Dong Zhihong
50f04dcae3
"add init allreduce test"
7 years ago
Yu Yang
94e741d6f0
Use external project for NCCL ( #5028 )
7 years ago
Dong Zhihong
f6106ffa7e
merge develop into multigpu
7 years ago
Dong Zhihong
ec47565c23
"add reduce hash function"
7 years ago
Yi Wang
cdb5f2928a
Add a C++ program that prints operator document in JSON format ( #4981 )
...
* Add print_operators_doc.cc
* Update Escape
* Correct a bug
* Remove OpInfoMap::Iterate
* Update the print_operators_doc.cc
* Escape tab
* Use auto&
* Use auto&
* Remove trailing ,
* clang-format C++
7 years ago
Yang Yang(Tony)
db157eda45
New Op Test framework. ( #4962 )
...
Pass all forward op test
7 years ago
Dong Zhihong
423d7438a1
"add register gpu macro"
7 years ago
Yu Yang
bc15117403
Correct mul_op implementation ( #4988 )
...
* Correct mul_op implementation
* Restore the origin shape after mul
* Fix mul op
* Do not touch math_function
7 years ago
Yu Yang
43c6ff212e
Feature/nccl dso ( #5001 )
...
* "add nccl enforce"
* Dev
* Update comment
* Add nccl test
* Follow comments
7 years ago
QI JUN
fcd74e06b8
add book04.word2vec train test ( #5002 )
...
* init
* ensure ids in lookup table op must be a column vector
* add book4 configuration in test_layers
* debug test_book4
* add test_word2vec
* follow comments
* follow comments
7 years ago
QI JUN
40e7caf667
ensure ids in lookup table op must be a column vector ( #4987 )
...
* ensure ids in lookup table op must be a column vector
* follow comments
7 years ago
chengduoZH
09662da0be
follow comments
7 years ago
chengduoZH
811a3174f1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_im2col_kocf_for_sequence
7 years ago
chengduoZH
2947f5678e
follow comments
7 years ago
guosheng
4b84f07c70
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-ChunkEvalOp
7 years ago
guosheng
bb9d68dcb3
Add chunk_eval_op
7 years ago
chengduoZH
f2ccef26bf
Add sequence_conv_op
7 years ago
chengduoZH
0ab2c436ae
Add sequence_project_functor
7 years ago
dangqingqing
cf2608e383
update to the develop branch.
7 years ago
dangqingqing
64fe9bcc5c
Update lstm comments and fix bug.
7 years ago
Tao Luo
48173e8599
Merge pull request #5010 from tensor-tang/merge
...
fix crash on ds2
7 years ago
tensor-tang
386bb1fece
fix crash on ds2
7 years ago
wanghaoshuang
4e8fccff11
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into seq_expand_op
7 years ago
wanghaoshuang
d697b6a349
Modified code using LoDTensor
7 years ago
chengduoZH
ce960575cd
fix doc format and unit test
7 years ago
Luo Tao
70d9d953e6
rename sparse_vector to sparse_float_vector in tests
7 years ago
caoying03
c8d0d37c4d
Merge branch 'develop' into crf
7 years ago
Tao Luo
abce9eb750
Merge pull request #4953 from tensor-tang/merge_grad_gtest
...
refine the mkldnn logic
7 years ago
caoying03
80a5ee0052
fix forward and add backward.
7 years ago
Qiao Longfei
c91de280d7
CompileTime InferShape should find var recursively in stack of blocks ( #4998 )
...
* recursive find var in BlockDesc
* add HasVarRecursive and FindVarRecursive to BlockDesc
* fix FindVarRecursive
7 years ago
chengduoZH
4c19f9f429
fix backward
7 years ago
chengduoZH
6246be294f
clean gradient data
7 years ago
Qiao Longfei
54ffafa123
use context to get attribute ( #4997 )
7 years ago
Qiao Longfei
e7f627036a
fix InferShapeContext Has interface ( #4994 )
7 years ago
chengduoZH
4d112b7d04
remove conflict
7 years ago
chengduoZH
dc7d07358c
add padding up, down, left, right
7 years ago
chengduoZH
834b82f109
fix sequence_project_op forward and backward
7 years ago
Guo Sheng
d0cfbba429
Merge pull request #4496 from guoshengCS/add-NormOp
...
Add norm_op
7 years ago
Yang Yang(Tony)
fa050941b0
Merge pull request #4990 from tonyyang-svail/add_optional_in_op_proto
...
add optional in op proto
7 years ago
zchen0211
cc5e118b1e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
zchen0211
b3ab3ce0a1
deconv -> conv transpose
7 years ago
Yang Yang
c030589df6
add optional in op proto
7 years ago
dzhwinter
ac4f7598ed
Merge pull request #4985 from reyoung/add_nccl_to_docker_image
...
Add nccl to docker image
7 years ago
Yu Yang
784fc32bfa
Add nccl to docker image
7 years ago
zchen0211
64c5ecbedb
deconv
7 years ago
Yu Yang
e9e0d7d774
Correct the dependencies ( #4978 )
7 years ago
Yu Yang
86437a8dda
Global function, op_support_gpu ( #4980 )
7 years ago
Yu Yang
65906ef1d0
Several Enhancement
7 years ago
zchen0211
502e72591f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
QI JUN
d2f3c8bb7a
Merge pull request #4973 from QiJune/clang-build-error
...
fix clang build error
7 years ago
Yan Chunwei
db7b11719b
change lod tensor to absolute offsets ( #4952 )
7 years ago
qijun
7edc1d96c6
fix clang build error
7 years ago
Yan Chunwei
07ea9adec0
feature/dynamic recurrent op forward and backward ( #4799 )
7 years ago
kavyasrinet
5380a5471b
Adding Nesterov Momentum ( #4948 )
7 years ago
tensor-tang
5c892db64c
remove unused code
...
refine comments and bias
fix typo and todo
7 years ago
guosheng
b12988f008
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-NormOp
7 years ago
tensor-tang
5b5cb0781a
add branch tests for pool and fc
7 years ago
hedaoyuan
9a5562aade
Merge pull request #4963 from hedaoyuan/inference
...
Change the name of the export.map to paddle_capi.map which need to be…
7 years ago
wanghaoshuang
00ad7512cf
Use stream while memory::Copy in GPU mode
7 years ago
hedaoyuan
2bb2c318e9
Change the name of the export.map to paddle_capi.map which need to be released in each version.
7 years ago
Abhinav Arora
09c0c82ec9
Adding increment op ( #4940 )
...
* Adding increment op
* Fixing comment about step attribute
7 years ago
zchen0211
8e55736a20
deconv2d
7 years ago
Dong Zhihong
333045d7b2
"move nccl to another directory"
7 years ago
hedaoyuan
37bfd03f0c
Merge pull request #4932 from hedaoyuan/inference
...
Split paddle_capi_whole into paddle_nn_engine and paddle_layers two s…
7 years ago
hedaoyuan
5d2fe7cd91
Fix cc_library paddle_capi_whole.
7 years ago
Yu Yang
102a5f3499
Feature/remove global scope ( #4950 )
...
* Unify `set_feed_variable` to one method
* Move global scope to python, not in C++
7 years ago
QI JUN
9903e49f94
add test_fit_a_line ( #4936 )
...
* add test_fit_a_line
* Update
* fix persistable bug
* fix elementwise add bug
* set correct attr for bias op in fc layer
* set correct attr for bias op in fc layer
* Update
1. Add init_program to hold initializers
2. bug fix
* add test_fit_a_line
* fix persistable bug
* fix elementwise add bug
* fix type
* add gitignore
* Complete fit_a_line test
* revert code
* Clean up
* Revert "revert code"
This reverts commit eb1aa015cda4fc12b6dc778ada6c3507b98134f5.
* Refine
* Fix unit test
7 years ago
QI JUN
92369177ad
Merge pull request #4941 from QiJune/fix_elementwis_add_bug
...
fix elementwise add bug
7 years ago
Yu Yang
af4dac4ac3
Feature/free kid scope ( #4951 )
...
* Delete kid
* Delete local scope
7 years ago
tensor-tang
4f41eaf7f8
Merge remote-tracking branch 'upstream/develop' into merge_grad_gtest
7 years ago
tensor-tang
d75b00c221
refine the gtest log info and vlog order,
...
and change the size of test to make unit test faster
refine comment and log of mkldnnlayer
7 years ago
zchen0211
7eeaae1695
deconv
7 years ago
Yu Yang
42f2dd4041
Unify `set_feed_variable` to one method ( #4949 )
7 years ago
Yu Yang
c532b96741
Remove template parameter for Tensor methods ( #4937 )
...
* Remove template parameter for Tensor methods
* Also check the type is correct when data()
* Simplify holder_
* Fix accuracy_op
* Register Code
7 years ago
qijun
9e6404441c
fix elementwise add bug
7 years ago
zchen0211
d97a732f4f
deconv
7 years ago
zchen0211
e59ca752fc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
Abhinav Arora
77cac5cdb8
Removing updates of Beta1 power accumulators outside the op ( #4931 )
7 years ago
Abhinav Arora
11bebeb2dc
Removing updates of Beta1 and Beta2 power accumulators outside the op ( #4925 )
7 years ago
Yu Yang
3db5278301
Feature/py executor test ( #4922 )
...
* Implement FC layer with helper
* Update LayerHelper
* Add debug string for Python ProtoBuf
and Rename `Sync` to `Flush`
* Add check of ProtoBuf initialization
* Layer wrapper for FC
* Fix unittest
* Fix CI
* Add code generator
* AttributeChecker Better error log and specialize bool
Since lots of types can be cast to bool
* Complete mlp, fit_a_line
* Expose get global scope
* Make global scope not thread-safe
1. It is no need to make global scope thread-safe, since it will be
invoked in Python main thread.
2. Do not free the global scope when C++ exit. Let the OS free memories,
otherwise, we need to handle the destroy dependencies.
See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
* Fix
* Implementation of simple conv_2d layer
* Stash
* Remove private data members in OpRegister
* Fix bugs
* Stash
* Expose FeedFetchList as VarType
* Change ProgramDesc not a global variable
* Polish code style
* Stash
* Correct implement BlockDesc destructor
* Correct implement BlockDesc destructor
* Unify program as parameter name
* Fix bugs
* Add unittest
* Fix unit test error
* Remove unused functions
* Add clone for Python Program
* Working on executor
* Stash
* Add glog as dependencies of ops
* Use VLOG to log some information that is helpful when we debug Paddle
* Expose VarDesc::persistable to Python
* Test executor
* Complete unittest
* Polish code
* Fix merge error
* Follow comment
* Polish Python Code
7 years ago
hedaoyuan
56d5db8bea
Bug fix of libpaddle_capi_whole.a in x86.
7 years ago
hedaoyuan
747b541957
Follow comments
7 years ago
dangqingqing
694bc64aaf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lstm
7 years ago
dangqingqing
17e33738f2
Enhance unit testing and fix bug.
7 years ago
武毅
63ffe5250a
Cluster train doc for v2 API ( #2072 )
...
* update cluster train v2 doc
* WIP cluster train doc
* update
* cluster train doc
* add TOC for en doc
* fix sphinx build issue
* fix error links
* fix link errors
* fix image link
* polish cluster train docs
* update general distributed training document
* fix sphinx compile error
* fix doc image error
7 years ago
Yibing Liu
2073fb96cb
Enable learning rate annealing of Adam Optimizer
7 years ago
hedaoyuan
378dcb1833
Split paddle_capi_whole into paddle_nn_engine and paddle_layers two static libraries.
7 years ago
dangqingqing
8bec26be03
Add missing file of math/detail/CMakeLists.txt
7 years ago
dangqingqing
ee39b37a41
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into lstm
7 years ago
wanghaoshuang
74b283c9d6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into seq_expand_op
7 years ago
wanghaoshuang
a94b3dd9a7
Refine comments and function name
...
1. Add more comments and examples
2. Rename repeat_lod to expand_lod
3. Remove unused head file
7 years ago
tensor-tang
94e442d4b1
add cpp file of MKLDNNLayer
7 years ago
tensor-tang
9e38dafa29
change MKLDNNMatrix create interface since MatrixPtr is not always required
7 years ago
tensor-tang
c1914543b0
refine mkldnn logic, move reset buffers into MKLDNNLayer
7 years ago
dangqingqing
a461bf139d
Add missing file.
7 years ago
whs
c8c4ba0f65
Merge pull request #4918 from wanghaoshuang/fix_lod
...
Make LoDTensor::lod_element return start and end offset of sequence
7 years ago
zchen0211
e8cd4b7d86
deconv2d impl in full
7 years ago
dangqingqing
b325213150
update to develop branch.
7 years ago
dangqingqing
d1fbf50b9e
Add unit testing for forward implementation.
7 years ago
wanghaoshuang
edb6aba698
make lod_element return std::pair
7 years ago
zchen0211
43aad989bd
deconv
7 years ago
Yu Yang
47f773ddb2
Copy Constructor for ProgramDesc ( #4895 )
...
* Implement FC layer with helper
* Update LayerHelper
* Add debug string for Python ProtoBuf
and Rename `Sync` to `Flush`
* Add check of ProtoBuf initialization
* Layer wrapper for FC
* Fix unittest
* Fix CI
* Add code generator
* AttributeChecker Better error log and specialize bool
Since lots of types can be cast to bool
* Complete mlp, fit_a_line
* Implementation of simple conv_2d layer
* Fix bugs
* Change ProgramDesc not a global variable
* Polish code style
* Stash
* Correct implement BlockDesc destructor
* Correct implement BlockDesc destructor
* Unify program as parameter name
* Fix bugs
* Add unittest
* Fix unit test error
* Remove unused functions
* Add clone for Python Program
* Compare OpDescBind directly
7 years ago
wanghaoshuang
4018754d66
fix LoDTensor::lod_element to get last element in level
7 years ago
Yang Yang(Tony)
831927d58c
Merge pull request #4738 from tonyyang-svail/prune_impl
...
Prune implementation
7 years ago
Yu Yang
e9249d16cb
Add glog as dependencies of ops ( #4908 )
...
* Add glog as dependencies of ops
* Use VLOG to log some information that is helpful when we debug Paddle
* Fix Unittests
7 years ago
Yu Yang
f6e1d959d2
Expose VarDesc::persistable to Python ( #4911 )
7 years ago
Dong Zhihong
fdfc8f9baa
"switch to Init op"
7 years ago
QI JUN
7ad13fbfe9
Merge pull request #4876 from QiJune/sgd_op_sparse_kernel
...
add sparse update kernel for sgd operator
7 years ago
zchen0211
5ec55e7995
deconv impl
7 years ago
Yang Yang
c5b411c515
make compatible to new programDescBind
7 years ago
Yang Yang
39aa81e74e
Merge remote-tracking branch 'upstream/develop' into prune_impl
7 years ago
zchen0211
80ebc8d5e3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
kavyasrinet
c10b8e808f
Adding Proximal Gradient Descent ( #4848 )
...
* Adding Proximal Gradient Descent
* Fixing review comments
7 years ago
fengjiayi
a204fefe16
Fix several bugs in compile time backward and Protobuf desc ( #4894 )
...
* Implement FC layer with helper
* Update LayerHelper
* Add debug string for Python ProtoBuf
and Rename `Sync` to `Flush`
* Add check of ProtoBuf initialization
* Layer wrapper for FC
* Fix unittest
* Fix CI
* Add code generator
* AttributeChecker Better error log and specialize bool
Since lots of types can be cast to bool
* Complete mlp, fit_a_line
* Implementation of simple conv_2d layer
* Fix bugs
* Correct implement BlockDesc destructor
* Fix bugs
* Fix unit test error
* Follow comments
7 years ago
fengjiayi
af215a1a53
Design doc: Batch Normalization Operator ( #3748 )
...
* Add design doc of batch_norm_op
* Move batch_norm_op.png to operator/images
* Refine batch_norm_op design doc
7 years ago
Yu Yang
e747623e86
Change ProgramDesc not a global variable ( #4879 )
...
* Change ProgramDesc not a global variable
* Polish code style
* Correct implement BlockDesc destructor
* Unify program as parameter name
7 years ago
wanghaoshuang
31531ab581
Add backward kernel
7 years ago
chengduoZH
91db457fc0
follow comments
7 years ago
wanghaoshuang
8de04be786
Fix unittest
7 years ago
dangqingqing
2a8dbd130d
LSTM Operator forward implementation.
7 years ago
chengduoZH
40688d223e
refine im2col (up_pad,down_pad)
7 years ago
chengduoZH
d2c1408f91
fix im2col kocf for sequence projection
7 years ago
chengduoZH
1e60c9b2e8
Add sequence_project_op (use im2col)
7 years ago
wanghaoshuang
555ab3f5af
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into seq_expand_op
7 years ago
wanghaoshuang
23701ffaf0
Refine op
7 years ago
Yang Yang(Tony)
43c05e434d
Merge pull request #4855 from tonyyang-svail/executor_refine
...
Simplify executor
7 years ago
Yu Yang
8938a9b0ac
Correct implement BlockDesc destructor ( #4882 )
7 years ago
QI JUN
521514da36
fix conflict ( #4883 )
7 years ago
Markus Kliegl
164898277c
MatMul operator ( #4856 )
...
* initial matmul operator
Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.
For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.
7 years ago
dzhwinter
fd96914d23
Merge pull request #4571 from sidgoyal78/momentum_op
...
Add momentum operator
7 years ago
qijun
f9681459b2
fix gpu build error
7 years ago
Yu Yang
5d67677c60
Remove private data members in OpRegister ( #4871 )
7 years ago
qijun
ab8cc401e6
add sparse sgd operator unittest
7 years ago
qijun
330c509bc3
Merge remote-tracking branch 'baidu/develop' into sgd_op_sparse_kernel
7 years ago
qijun
182ce51c6d
add sparse kernel of sgd operator
7 years ago
zchen0211
98dccc9f21
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
Qiao Longfei
46f7854e5c
fix mac compile ( #4870 )
7 years ago
Qiao Longfei
df0946ebe2
Impl optimizer ( #4734 )
...
* init parameter base class
* optimize the Comments of optimizer
* basic implementation of optimizer
* add test_optimizer
* add no_grad_set to interface
* update optimizer.py
* python code can run
* fix some problem
* add sync_with_cpp to Python Program and Block
* sync vars and ops in block from cpp
* optimize code and add some comment
* add more check for sync
* update optimizer with return value of Backward
* rm unused code
* infer shape when create gradient variable
* update test_optimizer
* update test_program.py
* update backward test
* follow comment
7 years ago
zchen0211
451863dba2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
Luo Tao
426f7eee8e
simplify test_pool_py, add comments for different pooling strategy
7 years ago
tensor-tang
60b84856e7
refine the conf files, combine in one file
7 years ago
Luo Tao
2c1b35caaa
Merge branch 'develop' into maxseq
7 years ago
Luo Tao
a8a63d4c50
add MAX strategy for seqpool op
7 years ago
caoying03
3123e3cf5d
Merge branch 'develop' into crf
7 years ago
tensor-tang
b487c62aac
Merge remote-tracking branch 'upstream/develop' into merge_grad_gtest
7 years ago
tensor-tang
2dc6cea65c
add unit tests for mkldnn branches, and fix typo
7 years ago
Yu Yang
064c3695ec
Make global scope not thread-safe ( #4850 )
...
* Make global scope not thread-safe
1. It is no need to make global scope thread-safe, since it will be
invoked in Python main thread.
2. Do not free the global scope when C++ exit. Let the OS free memories,
otherwise, we need to handle the destroy dependencies.
See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
* Revert "FIX: Release CPU/GPU memory via deleter"
This reverts commit 8f80f5bc79.
7 years ago
Yang Yang
bdca4b37c4
change api based on design doc
7 years ago
Yang Yang
e0cee58c84
modify protobuf
7 years ago
Yang Yang
cf743ab064
merge develop
7 years ago
Yu Yang
73a8b78a72
Correct OpWithKernel's infershape ( #4847 )
...
They are public now
7 years ago
caoying03
cc220eec36
add forward computation of crf operator.
7 years ago
Qiao Longfei
fafc54d6c0
fix var_type_inference_test ( #4854 )
7 years ago
Yang Yang
f06637002d
simplify executor; pass compile
7 years ago