chengduo
62fed4cbb3
fix __shfl_down ( #10362 )
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
hedaoyuan
784e59406c
Bug fix of Im2ColMobileFunctor.
7 years ago
hedaoyuan
ed0a564c90
Optimize GemmConvMobileFunction.
7 years ago
hedaoyuan
b7c4b58d3d
Follow comments.
7 years ago
hedaoyuan
f453b7137f
Refine code.
7 years ago
hedaoyuan
a850dec991
Add dilation.
7 years ago
hedaoyuan
19547943ba
Add test for Im2ColMobileFunctor.
7 years ago
hedaoyuan
d775895e93
Add Im2ColMobileFunctor.
7 years ago
hedaoyuan
dbf1d75f57
Add a GemmConvMobileFunction.
7 years ago
dangqingqing
7968b66d72
Fix error in function/GemmConvOp.cpp.
7 years ago
dangqingqing
349609207e
Fix the error function/GemmConvOp.
7 years ago
dangqingqing
0fce0fe698
Reduce memory usage in conv layer and RoI layer for mobile inference.
7 years ago
Yiqun Liu
42708ded54
Enable the case N != ldc in EigenBlasGemm. ( #5976 )
...
* Enable the case N != ldc in EigenBlasGemm.
* Use MemoryHandle instead of direct calling of posix_memalign to alloc temporary memory.
* Use Eigen's slice() instead of a temporary memory.
* Add if-else for different cases in EigenBlasGemm (for N ?= ldc).
7 years ago
xzl
fbd8a3307c
regenerate the proto for support the dilation
7 years ago
xzl
469b3ad589
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dilation
7 years ago
yangyaming
04a351500f
Remove MulValu* and reduce time cost for unit test.
7 years ago
yangyaming
07f3f07ff3
MulValue --> ScaleSubRegion
7 years ago
yangyaming
cfde85bc52
CallBack --> Callback
7 years ago
yangyaming
cfad83ce89
Add MulValueLayer.
7 years ago
xzl
ee4140103d
add more dialtion test
8 years ago
xzl
d746e49f2e
add dilation to conv test
8 years ago
xzl
f0c3c49808
test exconv layerGrad and im2col
8 years ago
xzl
328169a955
im2col cpu gpu dilation support
8 years ago
xzl
7a5b38466a
support exconv dilation
8 years ago
Yi Wang
4558807c48
Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
8 years ago
Yu Yang
84500f9487
Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
...
By shell command
```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
8 years ago
Liu Yiqun
3fefee8a06
Use scalar implementation instead of neon implementation to avoid out of range memory access in the tail conv3x3.
8 years ago
Liu Yiqun
9e524fce1b
Remove the generation of executable, such as paddle_trainer, when setting WITH_C_API.
8 years ago
hedaoyuan
0146e8bfc5
Merge pull request #3843 from Xreki/build_android_clang
...
Enable the use of clang to cross-compile for Android
8 years ago
hedaoyuan
a8efed0910
Merge pull request #3792 from hedaoyuan/convolution
...
Neon Depthwise Convolution Transpose Function
8 years ago
Liu Yiqun
3360e9cdb8
Change the definition of vmlaq_laneq_f32 from template function to macro.
8 years ago
wanghaoshuang
e033569dd6
Fix format
8 years ago
wanghaoshuang
eb3c774b83
Fix format error
8 years ago
whs
a162ebce8b
Merge branch 'develop' into pixel_softmax_layer
8 years ago
Liu Yiqun
387b4e4eac
Change the declaration of EigenGemm from `class` to `struct`.
8 years ago
Liu Yiqun
8b15ac82fa
Move the definition of hl_cpu_gru_forward and hl_cpu_gru_backward to function/GruFunctor.h.
8 years ago
Liu Yiqun
c54c7d91a0
Use template to deliver const argument instead, to remove the compiling error "argument to __builtin_neon_vgetq_lane_f32 must be a constant integer".
8 years ago
Liu Yiqun
50c63dc9e7
Merge branch 'develop' into build_android_clang
8 years ago
hedaoyuan
90bf4f60ae
Add stride support 2 for NeonDepthwiseConvTranspose.
8 years ago
hedaoyuan
840104c99a
Add NeonDepthwiseConvTransposeFunction.
8 years ago
hedaoyuan
40d47fae95
[Refine code]Move class Padding into the NeonDepthwiseConv.h.
8 years ago
hedaoyuan
4b6b7251c1
Refine NeonDepthwiseConv.
8 years ago
hedaoyuan
f7e75a03cf
Refine the neon depthwise convolution code(separate the Function and kernel).
8 years ago
hedaoyuan
5df384d67f
Remove NeonDepthwiseConv.h
8 years ago
hedaoyuan
3a75b4b70c
Fix CMakeLists.text
8 years ago
hedaoyuan
227fdfb65d
Refine NeonDepthwiseConvFunction.
8 years ago
Liu Yiqun
f241773c4f
Support to use clang for Android cross-compiling.
8 years ago
hedaoyuan
f00c4112d2
Neon depthwise conv with filterSize = 4 and stride = 2.
8 years ago
hedaoyuan
6dcff9a444
Neon depthwise conv with filterSize = 3 and stride = 2.
8 years ago