dzhwinter
0e9b393b34
"derived cudnnDevice context" ( #6585 )
...
* "derived cudnnDevice context"
* "leave remove cudnn handle from CUDADeviceContext"
* "fix math function error"
7 years ago
Guo Sheng
7ab48aec00
Merge pull request #6586 from guoshengCS/enhance-ReduceOp
...
Enhance ReduceOp to support reducing over all elements
7 years ago
chengduo
2a7bc64caf
Merge pull request #6455 from chengduoZH/refine/conv_zero
...
Refine conv
7 years ago
chengduoZH
97c3de0cfb
follow comments
7 years ago
Tao Luo
d65316f833
Merge pull request #6573 from luotao1/paddledev
...
update paddledev to paddlepaddle
7 years ago
gangliao
8602d0250f
Merge pull request #6557 from gangliao/gift
...
Christmas gift: fix a typo
7 years ago
Tao Luo
e72b865cb1
Merge pull request #6530 from luotao1/atlas
...
remove ATLAS library
7 years ago
guosheng
842b485f6a
Enhance ReduceOp to support reducing over all elements
7 years ago
chengduoZH
aa2f2fb47e
remove conflict
7 years ago
Yu Yang
0a8addf802
Make cast op support bool ( #6562 )
...
Also add `elemwise_sub/mul/abs/clip` layers
7 years ago
Luo Tao
0e18bc8836
update paddledev to paddlepaddle
7 years ago
Yu Yang
1ba8f7fe71
The comments in reshape_op is wrong ( #6565 )
7 years ago
liaogang
316ab4e07d
christmas gift: fix a typo
7 years ago
Tao Luo
79f0ab55de
Merge pull request #6473 from luotao1/with_test
...
typo WITH_TEST to WITH_TESTING
7 years ago
Luo Tao
e5dcefc4d3
remove ATLAS library
7 years ago
Zhaolong Xing
a91efdde69
Merge pull request #6177 from NHZlX/add_prelu_neon
...
add prelu neon impl
7 years ago
Tao Luo
c8d4efb20e
Merge pull request #6498 from luotao1/row_conv
...
fix warning in row_conv_op.cu
7 years ago
Yu Yang
d918ccded3
Add fill_op ( #6477 )
...
* Add fill_op
* Fix bug
7 years ago
tensor-tang
252f1e4a34
Merge pull request #6493 from PaddlePaddle/release/0.11.0
...
merge Release/0.11.0 to develop
7 years ago
QI JUN
61ec0b9516
Refine device context ( #6433 )
...
There are mainly following fixes:
- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
7 years ago
Luo Tao
f3acdd3af9
fix warning in row_conv_op.cu
7 years ago
tensor-tang
f4f17e539b
skip mkl setting in v1 with Mac
7 years ago
hedaoyuan
7687756b2c
Merge pull request #6445 from hedaoyuan/float16
...
Use PADDLE_WITH_NATIVE_FP16 for float16_t.
7 years ago
kavyasrinet
35420cdf63
Updating the Latex equation for Adagrad ( #6009 )
...
* Updating the Latex equation for Adagrad
* Fixing Latex euqations for adadelta, adam and adamax
7 years ago
Siddharth Goyal
4ff6bc175a
Add row conv operator ( #6013 )
...
* Fix documentation
* Address review comments
7 years ago
武毅
f650429ba3
Merge pull request #6419 from typhoonzero/recv_op_use_ser_prog
...
recv_op use serialized program
7 years ago
tensor-tang
38de512d29
Merge pull request #6459 from tensor-tang/develop
...
unify MKL macro definition
7 years ago
tensor-tang
69b44f2f19
unify MKL macro definition
7 years ago
Luo Tao
ddf20e589f
typo WITH_TEST to WITH_TESTING
7 years ago
Yiqun Liu
9592468609
Fix gcc4.9 ( #6442 )
...
* Fix compiling error of gcc4.9.
* Refine the check of cxx compiler flags in api/CMakeLists.txt.
7 years ago
chengduoZH
a6ef875885
refine conv
7 years ago
hedaoyuan
578ad6d232
Use PADDLE_WITH_NATIVE_FP16 for float16_t.
7 years ago
hedaoyuan
ef45af113a
Merge pull request #6408 from hedaoyuan/fix_bug
...
Fix compile error in android.
7 years ago
qingqing01
5ba231d80b
Merge pull request #6374 from reyoung/feature/remove_device_context_finish
...
Remove DeviceContext::Finish
7 years ago
typhoonzero
986ca03ce2
update
7 years ago
typhoonzero
4f1381eac3
recv_op use serialized program
7 years ago
Luo Tao
1d301731ac
refine the gen_docs in build.sh
7 years ago
hedaoyuan
e1247d8015
Fix compile error in android.
7 years ago
Yiqun Liu
00b64f6679
Add a c-api interface to initialize the thread environment of Paddle ( #5773 )
...
* Fix bug in MergeModel.cpp.
* Add a c-api inferface to initilize the thread environment of Paddle and add a GPU example.
* Add some note for paddle_init_thread and move the inplementation of paddle_error_string into a .cpp file.
* Add some comments.
7 years ago
Qiao Longfei
36fcc95cab
Nmt decoder train ( #6367 )
...
* init decoder_trainer
* can run
* fix lod
* add sharelod to cross_entropy_grad_op
* add avg_cost to fetch list
* modify learning rate
* can run
* optimie code
* add early exit
* fix print
* revert test_understand_sentiment_conv.py
* add act to fc
7 years ago
Luo Tao
3b1529d2e5
add print_operators_doc in travis ci
7 years ago
Abhinav Arora
113c026d12
Swish activation operator ( #6358 )
7 years ago
QI JUN
3a0a4586a3
refine GPU memory allocation policy ( #6373 )
...
* fix gpu memory allocation policy
* refine codes
* fix code style
* follow comments
7 years ago
tensor-tang
c096130a94
Merge pull request #6291 from tensor-tang/check_env
...
add script to check the cpu env
7 years ago
tensor-tang
32cc0db151
check if cmake has been installed
7 years ago
Tao Luo
8193cc55c4
Merge pull request #6371 from Yancey1989/install_dmidecode
...
Install dmidecode in product docker image
7 years ago
tensor-tang
f62dbebe20
follow comments and add limited version of dmidecode
7 years ago
Yang Yu
6b9567e0ac
Remove DeviceContext::Finish
7 years ago
Yu Yang
f291abfc53
Add HasCUDNN to detect if CUDNN is installed or not ( #6349 )
...
* Add HasCUDNN to detect if CUDNN is installed or not
* Fix CI
7 years ago
Tao Luo
a34fc8b36b
Merge pull request #6213 from tensor-tang/mkldnn_lrn
...
add MKLDNN LRN
7 years ago