Commit Graph

276 Commits (8f9480cc2cbc02bd96d237e8b52706e53d86e217)

Author SHA1 Message Date
hedaoyuan b7c4b58d3d Follow comments.
7 years ago
hedaoyuan f453b7137f Refine code.
7 years ago
hedaoyuan a850dec991 Add dilation.
7 years ago
hedaoyuan 19547943ba Add test for Im2ColMobileFunctor.
7 years ago
hedaoyuan d775895e93 Add Im2ColMobileFunctor.
7 years ago
hedaoyuan dbf1d75f57 Add a GemmConvMobileFunction.
7 years ago
dangqingqing 7968b66d72 Fix error in function/GemmConvOp.cpp.
7 years ago
dangqingqing 349609207e Fix the error function/GemmConvOp.
7 years ago
dangqingqing 0fce0fe698 Reduce memory usage in conv layer and RoI layer for mobile inference.
7 years ago
Yiqun Liu 42708ded54
Enable the case N != ldc in EigenBlasGemm. (#5976)
7 years ago
xzl fbd8a3307c regenerate the proto for support the dilation
7 years ago
xzl 469b3ad589 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dilation
7 years ago
yangyaming 04a351500f Remove MulValu* and reduce time cost for unit test.
7 years ago
yangyaming 07f3f07ff3 MulValue --> ScaleSubRegion
7 years ago
yangyaming cfde85bc52 CallBack --> Callback
7 years ago
yangyaming cfad83ce89 Add MulValueLayer.
7 years ago
xzl ee4140103d add more dialtion test
7 years ago
xzl d746e49f2e add dilation to conv test
7 years ago
xzl f0c3c49808 test exconv layerGrad and im2col
7 years ago
xzl 328169a955 im2col cpu gpu dilation support
7 years ago
xzl 7a5b38466a support exconv dilation
7 years ago
Yi Wang 4558807c48 Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
7 years ago
Yu Yang 84500f9487 Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
7 years ago
Liu Yiqun 3fefee8a06 Use scalar implementation instead of neon implementation to avoid out of range memory access in the tail conv3x3.
7 years ago
Liu Yiqun 9e524fce1b Remove the generation of executable, such as paddle_trainer, when setting WITH_C_API.
8 years ago
hedaoyuan 0146e8bfc5 Merge pull request #3843 from Xreki/build_android_clang
8 years ago
hedaoyuan a8efed0910 Merge pull request #3792 from hedaoyuan/convolution
8 years ago
Liu Yiqun 3360e9cdb8 Change the definition of vmlaq_laneq_f32 from template function to macro.
8 years ago
wanghaoshuang e033569dd6 Fix format
8 years ago
wanghaoshuang eb3c774b83 Fix format error
8 years ago
whs a162ebce8b Merge branch 'develop' into pixel_softmax_layer
8 years ago
Liu Yiqun 387b4e4eac Change the declaration of EigenGemm from `class` to `struct`.
8 years ago
Liu Yiqun 8b15ac82fa Move the definition of hl_cpu_gru_forward and hl_cpu_gru_backward to function/GruFunctor.h.
8 years ago
Liu Yiqun c54c7d91a0 Use template to deliver const argument instead, to remove the compiling error "argument to __builtin_neon_vgetq_lane_f32 must be a constant integer".
8 years ago
Liu Yiqun 50c63dc9e7 Merge branch 'develop' into build_android_clang
8 years ago
hedaoyuan 90bf4f60ae Add stride support 2 for NeonDepthwiseConvTranspose.
8 years ago
hedaoyuan 840104c99a Add NeonDepthwiseConvTransposeFunction.
8 years ago
hedaoyuan 40d47fae95 [Refine code]Move class Padding into the NeonDepthwiseConv.h.
8 years ago
hedaoyuan 4b6b7251c1 Refine NeonDepthwiseConv.
8 years ago
hedaoyuan f7e75a03cf Refine the neon depthwise convolution code(separate the Function and kernel).
8 years ago
hedaoyuan 5df384d67f Remove NeonDepthwiseConv.h
8 years ago
hedaoyuan 3a75b4b70c Fix CMakeLists.text
8 years ago
hedaoyuan 227fdfb65d Refine NeonDepthwiseConvFunction.
8 years ago
Liu Yiqun f241773c4f Support to use clang for Android cross-compiling.
8 years ago
hedaoyuan f00c4112d2 Neon depthwise conv with filterSize = 4 and stride = 2.
8 years ago
hedaoyuan 6dcff9a444 Neon depthwise conv with filterSize = 3 and stride = 2.
8 years ago
hedaoyuan b7885b087b Add DepthwiseConvKernel for filter size is 4.
8 years ago
hedaoyuan 0dffe68ca9 Add NeonDepthwiseConvFunction.
8 years ago
guosheng 117ce4cbc1 Change class to struct in GemmFunctor to avoid errors on special compilers
8 years ago
hedaoyuan 430e0e418b Follow comments.
8 years ago