qingqing01
dd92fb2328
Merge pull request #6666 from qingqing01/mobile_mem
...
Fix error in function/GemmConvOp.cpp.
7 years ago
dangqingqing
10622ba3cf
Resolve conflicts.
7 years ago
dangqingqing
9d73950ec9
Add profiling tools for fluid.
7 years ago
sweetsky0901
7456d737b2
Merge pull request #6204 from sweetsky0901/my_spp_op
...
add spp (Spatial pyramid pooling) op
7 years ago
dzhwinter
24fda39220
Feature/global context ( #6537 )
...
* "add DeviceContextPool"
* "add devicecontextpool in pybind"
* "add comments in python side "
* "fix static link error"
* "fix CI error"
* "add executor.py"
* "fix CI error"
* "add with gpu macro"
* "remove comment out codes"
* "add TODO items"
* "update init devices"
7 years ago
QI JUN
93a2d9c59d
add more place test and rename Cudnn to CUDNN ( #6621 )
...
* add more place_test and rename Cudnn to CUDNN
* fix ci
7 years ago
fengjiayi
b3ea677a2b
update
7 years ago
sweetsky0901
5be5afb331
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_spp_op
7 years ago
sweetsky0901
1eac276300
add spp avg
7 years ago
chengduoZH
784740d8be
refine cos-sim-op
7 years ago
tensor-tang
9c27c13e46
follow comments using macro to separate the original implements
7 years ago
tensor-tang
84cb542c13
use intel openmp to speedup seq2batch when WITH_MKL
7 years ago
typhoonzero
17f9be55ad
update for multi trainer
7 years ago
typhoonzero
f8f80db163
update for multi trainer
7 years ago
dangqingqing
7968b66d72
Fix error in function/GemmConvOp.cpp.
7 years ago
qingqing01
480a5446d3
Merge pull request #6659 from qingqing01/mobile_mem
...
Reduce memory usage in conv layer and RoI layer for mobile inference.
7 years ago
Yibing Liu
c9076543a0
Merge pull request #6609 from kuke/fix_reshape_op
...
Enable reshape_op to support dimension inference
7 years ago
Yu Yang
1b0c7d7c7a
Simplify system_allocator and fix GPU_INFO ( #6653 )
7 years ago
dangqingqing
349609207e
Fix the error in function/GemmConvOp.
7 years ago
dangqingqing
0fce0fe698
Reduce memory usage in conv layer and RoI layer for mobile inference.
7 years ago
Yibing Liu
d37ed6cb13
polish code in reshape_op
7 years ago
Yibing Liu
5ac8a0bec8
Merge branch 'develop' of upstream into fix_reshape_op
7 years ago
Yu Yang
d5cab4f07c
Fix compile on CUDA9.1 & MacOS ( #6642 )
7 years ago
Guo Sheng
d40d28d81e
Merge pull request #6515 from guoshengCS/add-multiBatch-chunkEval
...
Add ChunkEvaluator for Multi-batches
7 years ago
sweetsky0901
929be3a4a5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
ffd4e8c172
modify xx_y to xxY
7 years ago
guosheng
a7fa2051f2
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-multiBatch-chunkEval-dev
7 years ago
tensor-tang
b858e17b38
Merge pull request #6592 from tensor-tang/fluid
...
add MKLDNNPlace
7 years ago
tensor-tang
bf269d67b3
fix place_test on MKLDNNPlace
7 years ago
tensor-tang
a92f057ed1
fix conflict of Place
7 years ago
tensor-tang
7728c53448
Merge remote-tracking branch 'upstream/develop' into fluid
...
Conflicts:
paddle/platform/place.h
7 years ago
tensor-tang
f271210595
fix undefined issue when with_gpu
7 years ago
tensor-tang
e0c3317646
add MKLDNNPlace
7 years ago
chengduoZH
479ee271ed
remove conflict
7 years ago
typhoonzero
1b20096a52
done
7 years ago
fengjiayi
9956d5f733
Merge pull request #6623 from JiayiFeng/dev_expose_GradOpMaker
...
Expose GradOpMaker to Python
7 years ago
fengjiayi
044a13d022
expose GradOpMaker to Python
7 years ago
Leding Li
991826317a
Add extern "C" to paddle_error_string ( #6611 )
7 years ago
fengjiayi
e11a561c12
update
7 years ago
fengjiayi
ecfd2ac8a0
Merge pull request #6614 from JiayiFeng/dev_rename_sub_block
...
Unify `step_block` and `block` to `sub_block`
7 years ago
dzhwinter
0e9b393b34
"derived cudnnDevice context" ( #6585 )
...
* "derived cudnnDevice context"
* "leave remove cudnn handle from CUDADeviceContext"
* "fix math function error"
7 years ago
fengjiayi
dafd449c68
Unify `step_block` and `block` to `sub_block`
7 years ago
guosheng
181db32649
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-multiBatch-chunkEval
7 years ago
Guo Sheng
7ab48aec00
Merge pull request #6586 from guoshengCS/enhance-ReduceOp
...
Enhance ReduceOp to support reducing over all elements
7 years ago
Yibing Liu
0a75ed6f5b
Add unit test for dimension inference in reshape_op
7 years ago
chengduo
2a7bc64caf
Merge pull request #6455 from chengduoZH/refine/conv_zero
...
Refine conv
7 years ago
Yibing Liu
685d1e3b33
Enable reshape_op to support dimension inference
7 years ago
typhoonzero
40d0fff2e5
single pserver workable version
7 years ago
chengduoZH
97c3de0cfb
follow comments
7 years ago
Tao Luo
d65316f833
Merge pull request #6573 from luotao1/paddledev
...
update paddledev to paddlepaddle
7 years ago
gangliao
8602d0250f
Merge pull request #6557 from gangliao/gift
...
Christmas gift: fix a typo
7 years ago
Tao Luo
e72b865cb1
Merge pull request #6530 from luotao1/atlas
...
remove ATLAS library
7 years ago
guosheng
842b485f6a
Enhance ReduceOp to support reducing over all elements
7 years ago
chengduoZH
aa2f2fb47e
remove conflict
7 years ago
typhoonzero
2b47fb3d25
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into simple_dist_train_api
7 years ago
typhoonzero
9508c72685
wip: should fix variable recreate
7 years ago
Yu Yang
0a8addf802
Make cast op support bool ( #6562 )
...
Also add `elemwise_sub/mul/abs/clip` layers
7 years ago
Luo Tao
0e18bc8836
update paddledev to paddlepaddle
7 years ago
Yu Yang
1ba8f7fe71
The comments in reshape_op are wrong ( #6565 )
7 years ago
liaogang
316ab4e07d
christmas gift: fix a typo
7 years ago
sweetsky0901
ea093283e6
for code review by zhaolong
7 years ago
sweetsky0901
89de58d990
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_spp_op
7 years ago
Tao Luo
79f0ab55de
Merge pull request #6473 from luotao1/with_test
...
typo WITH_TEST to WITH_TESTING
7 years ago
guosheng
29cd089ab2
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-multiBatch-chunkEval
7 years ago
sweetsky0901
73a9c853fa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
a3addcdc59
modify for some update in trunk
7 years ago
typhoonzero
b4cd7f3d75
wip need ut
7 years ago
wanghaoshuang
2cd510641a
Fix crop op doc
7 years ago
guosheng
63ce906b08
Refine ChunkEvaluator by following comments
7 years ago
chengduoZH
50003984d4
remove conflict
7 years ago
Luo Tao
e5dcefc4d3
remove ATLAS library
7 years ago
Zhaolong Xing
a91efdde69
Merge pull request #6177 from NHZlX/add_prelu_neon
...
add prelu neon impl
7 years ago
Tao Luo
c8d4efb20e
Merge pull request #6498 from luotao1/row_conv
...
fix warning in row_conv_op.cu
7 years ago
guosheng
1eaeacb2c8
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-multiBatch-chunkEval
7 years ago
sweetsky0901
4d8f39b853
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
b34df5f12c
add some doc
7 years ago
guosheng
9a89b041ba
Add ChunkEvaluator for multi-batches
7 years ago
Yu Yang
d918ccded3
Add fill_op ( #6477 )
...
* Add fill_op
* Fix bug
7 years ago
tensor-tang
252f1e4a34
Merge pull request #6493 from PaddlePaddle/release/0.11.0
...
merge Release/0.11.0 to develop
7 years ago
QI JUN
61ec0b9516
Refine device context ( #6433 )
...
There are mainly following fixes:
- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
7 years ago
Luo Tao
f3acdd3af9
fix warning in row_conv_op.cu
7 years ago
tensor-tang
f4f17e539b
skip mkl setting in v1 with Mac
7 years ago
hedaoyuan
7687756b2c
Merge pull request #6445 from hedaoyuan/float16
...
Use PADDLE_WITH_NATIVE_FP16 for float16_t.
7 years ago
kavyasrinet
35420cdf63
Updating the Latex equation for Adagrad ( #6009 )
...
* Updating the Latex equation for Adagrad
* Fixing Latex equations for adadelta, adam and adamax
7 years ago
Siddharth Goyal
4ff6bc175a
Add row conv operator ( #6013 )
...
* Fix documentation
* Address review comments
7 years ago
sweetsky0901
5fe4d7fb6b
modify a bug *input_nums
7 years ago
sweetsky0901
c65d2fc356
add inline
7 years ago
sweetsky0901
15b1e8e858
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
65b641bf66
add detection_output op
7 years ago
typhoonzero
489b9695e4
wip for testing
7 years ago
武毅
f650429ba3
Merge pull request #6419 from typhoonzero/recv_op_use_ser_prog
...
recv_op use serialized program
7 years ago
tensor-tang
38de512d29
Merge pull request #6459 from tensor-tang/develop
...
unify MKL macro definition
7 years ago
tensor-tang
69b44f2f19
unify MKL macro definition
7 years ago
Luo Tao
ddf20e589f
typo WITH_TEST to WITH_TESTING
7 years ago
Yiqun Liu
9592468609
Fix gcc4.9 ( #6442 )
...
* Fix compiling error of gcc4.9.
* Refine the check of cxx compiler flags in api/CMakeLists.txt.
7 years ago
chengduoZH
a6ef875885
refine conv
7 years ago
typhoonzero
308491a94a
update for simple dist train
7 years ago
hedaoyuan
578ad6d232
Use PADDLE_WITH_NATIVE_FP16 for float16_t.
7 years ago
hedaoyuan
ef45af113a
Merge pull request #6408 from hedaoyuan/fix_bug
...
Fix compile error in android.
7 years ago
sweetsky0901
b41894d147
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
fe177b6292
test detection_output cpu and gpu ok, but doc will be modified
7 years ago
qingqing01
5ba231d80b
Merge pull request #6374 from reyoung/feature/remove_device_context_finish
...
Remove DeviceContext::Finish
7 years ago
typhoonzero
986ca03ce2
update
7 years ago
typhoonzero
1c1fae6077
update recv op
7 years ago
chengduoZH
aa770198c7
add dilation in c++ code
7 years ago
typhoonzero
71655334c6
update
7 years ago
typhoonzero
4f1381eac3
recv_op use serialized program
7 years ago
Luo Tao
1d301731ac
refine the gen_docs in build.sh
7 years ago
hedaoyuan
e1247d8015
Fix compile error in android.
7 years ago
Yiqun Liu
00b64f6679
Add a c-api interface to initialize the thread environment of Paddle ( #5773 )
...
* Fix bug in MergeModel.cpp.
* Add a c-api interface to initialize the thread environment of Paddle and add a GPU example.
* Add some note for paddle_init_thread and move the implementation of paddle_error_string into a .cpp file.
* Add some comments.
7 years ago
Qiao Longfei
36fcc95cab
Nmt decoder train ( #6367 )
...
* init decoder_trainer
* can run
* fix lod
* add sharelod to cross_entropy_grad_op
* add avg_cost to fetch list
* modify learning rate
* can run
* optimize code
* add early exit
* fix print
* revert test_understand_sentiment_conv.py
* add act to fc
7 years ago
sweetsky0901
9e72cc5c9a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into detection_output
7 years ago
sweetsky0901
ca535d18ab
add detection_output code only
7 years ago
Luo Tao
3b1529d2e5
add print_operators_doc in travis ci
7 years ago
Abhinav Arora
113c026d12
Swish activation operator ( #6358 )
7 years ago
QI JUN
3a0a4586a3
refine GPU memory allocation policy ( #6373 )
...
* fix gpu memory allocation policy
* refine codes
* fix code style
* follow comments
7 years ago
tensor-tang
c096130a94
Merge pull request #6291 from tensor-tang/check_env
...
add script to check the cpu env
7 years ago
tensor-tang
32cc0db151
check if cmake has been installed
7 years ago
Tao Luo
8193cc55c4
Merge pull request #6371 from Yancey1989/install_dmidecode
...
Install dmidecode in product docker image
7 years ago
tensor-tang
f62dbebe20
follow comments and add limited version of dmidecode
7 years ago
Yang Yu
6b9567e0ac
Remove DeviceContext::Finish
7 years ago
Yu Yang
f291abfc53
Add HasCUDNN to detect if CUDNN is installed or not ( #6349 )
...
* Add HasCUDNN to detect if CUDNN is installed or not
* Fix CI
7 years ago
Tao Luo
a34fc8b36b
Merge pull request #6213 from tensor-tang/mkldnn_lrn
...
add MKLDNN LRN
7 years ago
Yancey1989
8dacb4050b
install dmidecode in product docker image
7 years ago
whs
e09e21beee
Merge pull request #6188 from wanghaoshuang/conv_fix
...
Make ConvTransProjection support for dilation
7 years ago
Abhinav Arora
e557611f39
Fix equations in sequence_pool op ( #6355 )
7 years ago
Tao Luo
de80c5696a
Merge pull request #6100 from guoshengCS/enhance-include-pool
...
Enhance AvgPooling to support both include_mode and exclude_mode
7 years ago
Abhinav Arora
0d4f050955
Fix equation in logical or op ( #6315 )
7 years ago
QI JUN
7910d96ab8
build paddle_python before build paddle_docs ( #6337 )
7 years ago
gongweibao
c7e739f542
Add LRN efficient GPU implement. ( #5894 )
...
Add LRN efficient GPU implement
7 years ago
kexinzhao
1d1555e2e5
Merge pull request #5716 from kexinzhao/float16
...
Add half precision float16 data type
7 years ago
whs
c22cf594f6
Merge pull request #6333 from wanghaoshuang/fix_type
...
Fix nce op warning about comparison of integers of different signs
7 years ago
qingqing01
5a1a04f648
Merge pull request #6326 from jacquesqiao/fix-int-overflow
...
fix int overflow
7 years ago
chengduo
b30e8bc00c
Merge pull request #6324 from chengduoZH/profiling/refine_elementwise_add
...
Refine code(operator++)
7 years ago
wanghaoshuang
6173f91cce
uncomment code
7 years ago
wanghaoshuang
83537c7ada
Fix warning about comparison of integers of different signs
7 years ago
chengduoZH
8711a9a22a
refine code
7 years ago
qiaolongfei
d303f7ae4f
fix int overflow
7 years ago
Yu Yang
229c2e7883
Feature/while op sentiment analysis ( #6282 )
...
* Add DataFeeder
A v2 API like data feeder for book demos.
We can feed data directly from reader.
* Fix CI
* Add an unittest for while/rnn op forward
* Add unittest for raw while op backward
* Fix CI
* Complete Dynamic RNN
7 years ago
kexinzhao
94a36b8cc7
fix clip op doc operation ( #6314 )
7 years ago
kexinzhao
dbf205002d
fix read and write tensor array op ( #6312 )
7 years ago
kexinzhao
a5167ce0b4
fix lod_array_length op equation ( #6307 )
7 years ago
kexinzhao
002a7b4d01
fix scatter op equation ( #6304 )
7 years ago
kexinzhao
16822fb702
fix latex equation for clip by norm op ( #6302 )
7 years ago
kexinzhao
2c1270e40c
fix maxout op latex equation ( #6303 )
7 years ago
Abhinav Arora
1d04b19ce8
Fix the rendering of latex equation for adamax op ( #6294 )
...
* Using latex fraction syntax in sigmoid and logsigmoid op
* Fixing the rendering of the latex equations in adamax operator
7 years ago
Qiao Longfei
161128bada
add chunk eval layer ( #6296 )
...
* add crf_decoding layer
* fix some typo
* init trunk_evaluator
* add trunk_evaluator layer
* update chunk_eval_op and test, change int32 to int64
* fix a numeric problem
* change layers.trunk_evaluator to layers.trunk_eval
* fix typo
* add precision_val
7 years ago
tensor-tang
62c00e00eb
add dimms locator info
7 years ago
qingqing01
1a8f20c6d1
Merge pull request #6284 from qingqing01/cuda_sync
...
Remove the CUDA stream synchronization between each operator.
7 years ago
tensor-tang
e670453518
add script to check the cpu env
7 years ago
Qiao Longfei
45c8a88a3e
add crf_decoding layer ( #6274 )
...
* add crf_decoding layer
* fix some typo
* fix test_crf_decoding_op
7 years ago
qingqing01
e760641a3a
Merge pull request #6233 from qingqing01/momentum_op
...
Refine and speedup momentum operator.
7 years ago
dangqingqing
4e451a34db
Remove the cuda stream synchronization between each operator.
7 years ago
chengduo
36444461dc
Merge pull request #6229 from chengduoZH/profiling/updata_elementwise_op
...
[Profiling] Update elementwise op
7 years ago
chengduoZH
23e38216a7
add dilation
7 years ago
dangqingqing
62acf7990f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into momentum_op
7 years ago
dangqingqing
d432b10d8a
Update cuda kernel and doc.
7 years ago
chengduoZH
37671ac053
follow comments
7 years ago
QI JUN
96a5f96cc1
fix bug in gpu default memory allocating policy ( #6268 )
7 years ago
chengduoZH
9e244a8cbe
follow comments
7 years ago
dangqingqing
e03b574e0e
Refine the Eigen usage for CPU implementation.
7 years ago
chengduoZH
54f09620cf
code refine
7 years ago
chengduoZH
488908e95b
refine cuda
7 years ago
dangqingqing
5bd1e73f5e
Refine and speedup momentum operator.
7 years ago
Yu Yang
d5e3279451
While op forward for sentimental analysis ( #6140 )
...
* Add DataFeeder
A v2 API like data feeder for book demos.
We can feed data directly from reader.
* Fix CI
* Add an unittest for while/rnn op forward
* Add unittest for raw while op backward
* Fix CI
7 years ago
typhoonzero
57f666fb56
update
7 years ago
tensor-tang
f13d725acf
add mkldnn_lrn python interface and add it to simple net
7 years ago
tensor-tang
343b1a962b
add mkldnn_lrn unit test
7 years ago
tensor-tang
54205c99b6
add MKLDNNLRNLayer
7 years ago
chengduoZH
fbbfe8b859
code refine
7 years ago
qingqing01
4786ad1457
Make the new framework independent of the old framework. ( #6201 )
7 years ago
sweetsky0901
141a323c34
fix a bug
7 years ago
sweetsky0901
0c4446868b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_spp_op
7 years ago
qingqing01
74725d0505
Merge pull request #6198 from qingqing01/lstm_style
...
Make lstm_op follow google code style.
7 years ago
sweetsky0901
8368e55be9
modify some doc
7 years ago
Tao Luo
0c15b6c24c
Merge pull request #6184 from luotao1/print_operators_doc
...
add WITH_DOC for print_operators_doc
7 years ago
Tao Luo
09fc307fd3
Merge pull request #5830 from luotao1/cmake
...
simplify the CMakeLists.txt of trainer/tests and gserver/tests
7 years ago
qingqing01
20b885f7b8
Merge pull request #6148 from qingqing01/lstm_doc
...
Fix the doc of LSTM operator.
7 years ago
sweetsky0901
531e7b6fa6
gpu test ok
7 years ago
qingqing01
e5b51c4d10
Make lstm_op follow google code style.
7 years ago
sweetsky0901
c75b4538ca
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_spp_op
7 years ago
sweetsky0901
5c057f9552
add spp op only can test ok
7 years ago
Siddharth Goyal
ea1a643425
Add hinge loss op ( #5837 )
...
* Add hinge loss op
* Update hinge-loss equation for proper latex
7 years ago
wanghaoshuang
b25ee3ae60
Fix ConvTransProjection bug.
...
1. Make ConvTransProjection support for dilation
2. Fix err config in Projection.conv unitest while deConv=true
7 years ago
Luo Tao
3e8c3638dc
add WITH_DOC for print_operators_doc in docker/build.sh
7 years ago
Luo Tao
45dfc34c21
Merge branch 'print_operators_doc' of https://github.com/luotao1/Paddle into luotao1-print_operators_doc
7 years ago
Luo Tao
362b7d8a5e
Rename gserver_test2 to gserver_test_with_python
7 years ago
Luo Tao
57157284b2
Merge branch 'develop' into cmake
7 years ago
Tao Luo
fb919382cc
Merge pull request #6088 from Xreki/fix_capi_build_with_python
...
Fix the compiling error when setting WITH_C_API=ON and WITH_PYTHON=ON.
7 years ago
xzl
0ff4ff346d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_prelu_neon
7 years ago
typhoonzero
c66c65cbdf
add grpc benchmark
7 years ago
xzl
fac96456c2
add prelu neon impl
7 years ago
武毅
067381e2e2
Merge pull request #6146 from typhoonzero/add_switch_distributed
...
Add switch for distributed support
7 years ago
Yancey
1b612d3a76
Fix grpc compile warning ( #6050 )
...
* fix grpc compile warn
* update
* -Wnon-virtual-dtor -> -Wno-non-virtual-dtor
7 years ago
QI JUN
d066b07f14
change GPU memory allocating policy ( #6159 )
...
* change GPU memory allocating policy
* fix potential overflow bug
7 years ago
chengduo
e50f35706a
code refine ( #6164 )
7 years ago
typhoonzero
dda277ba6c
update build.sh
7 years ago
Yiqun Liu
57dc8de934
Fix the linking error for iOS simulator (architecture x86_64). ( #6081 )
7 years ago
Yiqun Liu
7480291c62
Add version and commit information in capi config.h and use unofficial glog for Android API < 21. ( #6113 )
...
* Automatically configure the version and commit information in capi.
* Use the unofficial glog repository for building for Android (API < 21).
7 years ago
whs
10b2534ebc
Merge pull request #5480 from wanghaoshuang/nce_op
...
Add nce op
7 years ago
Yu Yang
1fe5acb25a
Expose sigmoid_cross_entropy_with_logits ( #6147 )
...
Also, change the `labels` to `label` for api consistency
7 years ago
dangqingqing
d4fcd2a59f
Fix the doc of LSTM operator.
7 years ago
Tao Luo
44e391449c
Merge pull request #6126 from emailweixu/fix_rnn_comments
...
Fix comments in sequence_rnn_(mixed/matched)_inputs.py
7 years ago
typhoonzero
1a852861b2
add switch for distributed support
7 years ago
Yiqun Liu
42708ded54
Enable the case N != ldc in EigenBlasGemm. ( #5976 )
...
* Enable the case N != ldc in EigenBlasGemm.
* Use MemoryHandle instead of direct calling of posix_memalign to alloc temporary memory.
* Use Eigen's slice() instead of a temporary memory.
* Add if-else for different cases in EigenBlasGemm (for N ?= ldc).
7 years ago
Yu Yang
8ac02279f2
Fix the performance problem of enforce ( #6085 )
...
* Fix performance problem of enforce
* Fix missing `;` in code
* Fix CI
7 years ago
xuwei06
d36db0d3ec
Fix comments in sequence_rnn_(mixed/matched)_inputs.py
7 years ago
guosheng
6b18b3cc9c
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-include-pool
7 years ago
guosheng
cb5a7a8b4c
Fix errors of GPU AvgPooling with the excludeMode argument
7 years ago
fengjiayi
4e564e4852
make WriteToArrayOp supporting empty tensor input ( #6030 )
7 years ago
fengjiayi
a38c151243
Add GetInputsElementDim ( #6091 )
7 years ago
Luo Tao
4c95301e98
add WITH_DOC for print_operators_doc
7 years ago
guosheng
192c00a7da
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-include-pool
7 years ago
guosheng
fe6af6b6ac
Enhance the AvgPooling to support optional exclude-mode
7 years ago
QI JUN
1238706d72
Refine unittest with setting gflags ( #5476 )
...
* add gflags for C++ unittest
7 years ago
Luo Tao
c864161532
Merge branch 'develop' into cmake
7 years ago
Liu Yiqun
5fc88244b5
Fix the compiling error when setting WITH_C_API=ON and WITH_PYTHON=ON.
7 years ago
Yu Yang
35453df18f
Fix ShareLoD bug ( #6084 )
...
Fix #6087
7 years ago
Yibing Liu
61d98f27ce
Merge pull request #6058 from kuke/refine_rank_loss_op
...
Revise comments in rank_loss_op
7 years ago
武毅
e1b8c27acb
Add back print_operators_doc ( #5970 )
...
* add back print_operators_doc
* fix style check
* fix style check
7 years ago
Yibing Liu
0d8483b45d
Merge pull request #6060 from kuke/refine_reshape_op
...
Amend license and comments in reshape_op
7 years ago
Yibing Liu
9907350d8a
Merge branch 'develop' of upstream into refine_rank_loss_op
7 years ago
Yibing Liu
da62d6cc24
fix the doc display problem in rank_loss_op
7 years ago
Yang yaming
990818f7c7
Merge pull request #6031 from pkuyym/fix-6016
...
Refine doc for smooth l1 loss op.
7 years ago
Yiqun Liu
00eceea06e
Fix the problem that building for Android fails with WITH_TESTING=ON. ( #6051 )
7 years ago
Kexin Zhao
36df67b17c
small fix
7 years ago
sweetsky0901
966a6ce6db
Merge pull request #5826 from sweetsky0901/my_unpool_max_2d
...
My unpool max 2d
7 years ago
Yibing Liu
4d1ee0ff12
Amend license and comments in reshape_op
7 years ago
Yibing Liu
ff8a677848
Revise comments in rank_loss_op
7 years ago
Luo Tao
8a5a8637f9
fix bug in trainer/tests/CMakeLists.txt
7 years ago
Luo Tao
da747841db
Merge branch 'develop' into cmake
7 years ago
chengduo
5a3d1362f7
Merge pull request #5951 from chengduoZH/fix_conv_doc
...
fix conv and conv_trans op doc
7 years ago
Kexin Zhao
4901184ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into float16
7 years ago
Guo Sheng
0066bbbaa3
Merge pull request #6021 from guoshengCS/fix-GRUOp-codestyle
...
Fix gru_op related code style
7 years ago
yangyaming
a5236265b7
Refine doc for smooth l1 loss op.
7 years ago
sweetsky0901
4ffb73fd3b
format ..
7 years ago
Kexin Zhao
41bd1f9115
fix gpu test, clean code and add cmake
7 years ago
chengduoZH
c339e1b786
remove conflict
7 years ago
sweetsky0901
5b449b6021
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
3206094b5e
format code
7 years ago
武毅
4ecbab42d8
Fix compile on cudnn7 ( #5982 )
...
* fix compile on cudnn7
* update
* update
* make silent
7 years ago
qingqing01
21053c168c
Merge pull request #5954 from qingqing01/nvprof
...
Add CUDA profiler tools in new framework.
7 years ago
guosheng
1ab03d49d5
Fix gru_op related code style in gpu_kernel
7 years ago
guosheng
3e552cdcac
Fix gru_op related code style
7 years ago
wanghaoshuang
29262ab24d
Fix unittest.
7 years ago
sweetsky0901
d2ee3c98df
format code
7 years ago
sweetsky0901
2d42fa74ec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
c52ed8de37
format code
7 years ago
kavyasrinet
dcf3ffd980
Adding log loss operator ( #5854 )
...
* Adding log loss operator
* Removing comments
7 years ago
sweetsky0901
bd561384bc
format code
7 years ago
sweetsky0901
d9673cad15
format code
7 years ago
Kexin Zhao
a5feb77159
address pr comment
7 years ago
sweetsky0901
821899ccd4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
6fc9a9fd69
modify for del T2 and doc update
7 years ago
guosheng
6ed135413a
Fix useGpu in HierarchicalSigmoidLayer
7 years ago
peterzhang2029
976f96a9a5
Merge pull request #5926 from peterzhang2029/hsigmoid_gpu
...
Fix hsigmoid_layer when using GPU.
7 years ago
Qiao Longfei
c975fe1bde
batch norm support matrix input ( #5980 )
...
* batch norm support matrix input
* update gpu code
* format code
7 years ago
fengjiayi
23b3fef062
Make 'scale_op' supporting int and int64 ( #5986 )
...
* Make 'scale_op' supporting int and int64
* refine .cu file
7 years ago
dangqingqing
696b0253e5
Refine paddle/v2/fluid/profiler.py.
7 years ago
peterzhang2029
b156c6a3c0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_gpu
7 years ago
wanghaoshuang
76a65a83a0
Fix comments style
7 years ago
Yu Yang
985e4ab62d
Add Python wrap of conv2d_transpose and its unittest ( #5946 )
...
* Add Python wrap of conv2d_transpose and its unittest
* Follow comments
* Fix format
7 years ago
wanghaoshuang
ab9d59c539
Fix double type error while using eigen api
7 years ago
wanghaoshuang
d0246e24e0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into nce_op
7 years ago
wanghaoshuang
ea7359c60b
Refine code and comments
...
1. Remove checking for num_neg_samples.
2. Fix dims of Output(Cost) and Input(Bias).
3. Renamed num_sampled_classes to num_neg_samples.
4. Add TODO for adding more distribution samplers.
5. Init grad_data of bias by zero.
6. Refine comments.
7. Register a kernel for type double.
7 years ago
peterzhang2029
ab1af66b12
--amend
7 years ago
peterzhang2029
bb61e90ffc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_gpu
7 years ago
peterzhang2029
3022a79082
add gpu test in test_LayerGrad
7 years ago
武毅
0a8a86e0c9
Send recv op ( #5520 )
...
* WIP send recv op
* WIP send recv
* put grpc impl in details
* put grpc impl in details
* update wip
* update proto
* update proto
* update proto
* clean cmake
* wip on op implementations
* wip on op implementations
* compile ok adding ut
* wip unitest
* add extern cares for linking
* wip add ut
* working version send recv
* revert optimizer.py
* update test cmake
* add libtool to dockerfile
* update cmake dependency
* update cmake depends
* update cmake grpc depends
* fix cmake dependency
* fix compile error
* fix compile
* follow comments
* update
* update copyfrom
7 years ago
Yu Yang
dc82a30908
Refine CheckStyle Script ( #5942 )
...
* Refine CheckStyle Script
* Disable linkchecker for build_doc.sh
7 years ago
Yancey
f96bc313e8
fix path env in build.sh ( #5948 )
7 years ago
sweetsky0901
ee0a794c27
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
57e68e5740
modify for code review by qingqing 2nd
7 years ago
Guo Sheng
0ce9bf77be
Merge pull request #5931 from guoshengCS/fix-ROIPoolOP-warn
...
Refine roi_pool_op to avoid warning
7 years ago
chengduoZH
9abc0e04c1
fix conv and conv_trans op doc
7 years ago
dangqingqing
623f62a7dc
Add cuda profiler tools and expose it in Python.
7 years ago
Wang Meng
95cdbfec19
Merge pull request #4859 from will-am/factorization_machine_layer
...
Add Factorization Machine Layer
7 years ago
Abhinav Arora
ef3420e2b9
Fix the latex comment syntax in sgd_op.cc ( #5940 )
...
* Fix the latex comment syntax in sgd_op.cc
* Change \textunderscore to \_
7 years ago
peterzhang2029
e6b540eacf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_gpu
7 years ago
guosheng
19a37ecde3
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-ROIPoolOP-warn
7 years ago
fengjiayi
90fc4a6cd5
Complete shrink_rnn_memory_op comments ( #5935 )
...
* Complete shrink_rnn_memory_op comments
* Update
7 years ago
peterzhang2029
539462839b
bug fix when using hsigmoid with gpu
7 years ago
QI JUN
b28b2f172b
refine test_recognize_digits_mlp and format codes ( #5937 )
7 years ago
dangqingqing
322d69f209
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into nvprof
7 years ago
dangqingqing
6cf2dcbc1f
Add cuda profiler tools.
7 years ago
sweetsky0901
f9c2a5c38e
modify for code review zcd
7 years ago
sweetsky0901
022b48e16f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
20654cf78a
modify for type check rewrite
7 years ago
guosheng
c8bb663141
Refine roi_pool_op to avoid warning
7 years ago
fengjiayi
33fa2dfbde
Complete max_sequence_len_op ( #5913 )
7 years ago
peterzhang2029
cda3a7747a
bug fix when using hsigmoid with gpu
7 years ago
qingqing01
54b399492a
Merge pull request #5919 from qingqing01/roi_pooling
...
Fix bug in RoI pooling.
7 years ago
武毅
a06bec1287
Conv cudnn 3d ( #5783 )
...
* conv cudnn 3d
* update test case
* update
* update
* follow comments and remove groups from helper
* update
* refine
* update
* follow comments2
* update
* fix compile
7 years ago
sweetsky0901
27cf7f3376
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
cfd7721b51
add unpool_op.h modify
7 years ago
dangqingqing
a21fe4ac0d
Fix bug in RoI pooling.
7 years ago
dangqingqing
082eb8c61c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into roi_pooling
7 years ago
dangqingqing
cc9a761a87
Fix bug in RoI pooling.
7 years ago
Yu Yang
a619695b06
Feature/enhance evaluator ( #5824 )
...
* Stash
* Stash
* Polish Evaluator
* Merge code
* Revert
7 years ago
sweetsky0901
a38bbc8610
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
c218961a6b
modify for code review by qingqing
7 years ago
dzhwinter
513b1e010f
"add floor, ceil, round op" ( #5898 )
...
* "add floor, ceil, round op"
* "reuse zero gradient"
* "fix divide zero"
* "fix numpy floor error"
7 years ago
dzhwinter
45062fe5d7
Feature/copytensor ( #5455 )
...
* "make global tensor function independently"
* "replace functor"
* "fix inline template error"
* "fix tensor array with CopyFrom"
* "fix other case use CopyFrom"
* "move the op interface hardly"
* "fix operators"
* "fix typo"
* "delete dynamic recurrent rnn and fix gru_unit in debugmode"
* "fix unique_ptr copy"
* "fix cuda copy"
* "fix namespace error"
* "removed nccl python test"
* "fix include error"
* "fix typo"
* fix copy util test
7 years ago
wanghaox
0690cca758
Merge pull request #5831 from wanghaox/roi_pool
...
Roi pool operator
7 years ago
wanghaox
cf5b598642
fix some issues
7 years ago
wanghaox
ef905598a2
fix some code issues
7 years ago
Qiao Longfei
65c859db7a
beam_search_decode support multi data type ( #5847 )
...
* beam_search_decode support multi data type
* add VisitDataType for beam search decode
* use Specialization to handle bool
* move Specialization of BeamSearchDecodeFunctor out of class
7 years ago
QI JUN
3a76062c84
support testing when training and handle dropout and batch_norm operator in testing mode ( #5734 )
...
* is_training to is_test in dropout op
* handle dropout and batch_norm operator when prune pdesc in testing mode
* handle dropout and batch_norm operator when prune pdesc in testing mode
* add get_inference_program method
* fix dropout op
* fix ci
* test data after each batch training
* refine code
* refine test_book3
* fix ci
* follow comments
7 years ago
Qiao Longfei
c9172c1cb3
Make enforce target ( #5889 )
...
* make enforce a target and dependent on nccl when gpu is enabled
* add some more dependency
7 years ago
fengjiayi
50d670ee06
Unify dtype and datatype ( #5869 )
...
* Change all `data_type` in Python to `dtype`
* Change `data_type` in C++ to `dtype`
* Refine
7 years ago
peterzhang2029
1ab1b092cd
Merge pull request #5875 from peterzhang2029/refine_op_doc
...
Fix latex syntax in bilinear tensor product operator comment.
7 years ago
peterzhang2029
4bdd97625b
refine the latex mark
7 years ago
Cao Ying
2c471db2c1
Merge pull request #5884 from lcy-seso/fix_latex
...
fix LaTeX syntax in three operators' comments.
7 years ago
qingqing01
52007ea662
Merge pull request #5872 from qingqing01/op_debug
...
Fix lstm_op and gru_op in debug mode.
7 years ago
caoying03
8ba62a5f94
fix LaTeX syntax in linear_chain_crf op.
7 years ago
Yang yaming
98700ceef5
Merge pull request #5867 from pkuyym/fix-5865
...
Fix LaTeX equation for huber_loss_op.cc.
7 years ago
Yu Yang
c077a6d57c
Feature/support int64 for sum ( #5832 )
...
* Support int64 for sum op
* Refine code
7 years ago
peterzhang2029
aacd94127b
refine bilinear tensor product doc
7 years ago
chengduo
e800c0d3f1
Merge pull request #5791 from chengduoZH/fix_conv_op
...
remove vector::erase
7 years ago
dangqingqing
7fb1f7a25f
Fix lstm_op and gru_op in debug mode.
7 years ago
Luo Tao
223323a474
Merge branch 'develop' into cmake
7 years ago
wangmeng28
89e63b138f
Merge remote-tracking branch 'upstream/develop' into factorization_machine_layer
7 years ago
yangyaming
3305c8766e
Add more comment.
7 years ago
sweetsky0901
ee4a5d2117
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
yangyaming
fb56a18109
Fix LaTeX equation for huber_loss_op.cc.
7 years ago
sweetsky0901
66b84366f1
modify for code review by wangyi
7 years ago
kavyasrinet
d883547bf0
Adding the FTRL optimizer. ( #5785 )
...
* Adding the FTRL optimizer
* Fixed the python test case
7 years ago
Cao Ying
657776012b
Merge pull request #5692 from peterzhang2029/add_bn_eq
...
Make epsilon in BatchNormLayer a configurable variable.
7 years ago
Luo Tao
b3e7c4bcf9
simplify the CMakeLists.txt of gserver/tests
7 years ago
wanghaox
36dd770a08
add roi operator unittest
7 years ago
peterzhang2029
6ab78aee4d
Merge pull request #5817 from peterzhang2029/capi_dense_fix
...
Bug fix in dense example of CAPI.
7 years ago
wanghaox
7960928883
add roi pool operator
7 years ago
peterzhang2029
82aaceba08
transform to c style
7 years ago
sweetsky0901
0112c5d640
format code
7 years ago
sweetsky0901
47bd0bb678
del printf
7 years ago
sweetsky0901
8ba8237a87
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
e2a5905eae
gpu test ok unpool2dmax
7 years ago
Luo Tao
e7cbde80c3
simplify the CMakeLists.txt of trainer/tests
7 years ago
Qiao Longfei
53bd51e3f4
07/Label semantic roles ( #5798 )
...
* init label_semantic_roles.py
* add linear_chain_crf and test
* complete test_linear_chain_crf
* correct last layer of db_lstm
* update optimizer and initializer
* update param_initializer of embedding_layer
* support load pre trained embedding
* rm unused parameter
* optimize code
* clean code
* fix test
* add todo
7 years ago
Guo Sheng
778b981e24
Merge pull request #5804 from guoshengCS/fix-GRUUnitOp-dev
...
Fix calculations in gru_unit_op to be consistent with gru_op
7 years ago
sweetsky0901
abb3357d1c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901
90f664d0b0
test unpool ok cpu
7 years ago
peterzhang2029
d6bd5b1954
bug fix in dense
7 years ago
peterzhang2029
5d3e816717
bug fix in dense
7 years ago
tensor-tang
63ee7290f2
remove the tmp buffer
7 years ago
chengduoZH
a93227a148
refine code
7 years ago
peterzhang2029
90e05a4b8c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_bn_eq
7 years ago
peterzhang2029
5502abb95b
refine docstrings
7 years ago
guosheng
b6b7ab63c6
Fix calculations in gru_unit_op to be consistent with gru_op
7 years ago
tensor-tang
55ecd6d29e
Merge pull request #5797 from luotao1/fix_warning
...
fix some warning with MKLDNN related codes and etc
7 years ago