Commit Graph

167 Commits (144854d2e86efce8858b34518c1ee5f4b48d9407)

Author SHA1 Message Date
Liu Yiqun 430adf43d1 Move the definition of hl_vec_add/sub/mul/div/max/min to hl_tensor_ops.h
8 years ago
Liu Yiqun 8f5d22b045 Add annotations.
8 years ago
Dong Li 4ed5f7dbb9 Disable neon when enable gpu.
8 years ago
Liu Yiqun 688305f82d Centralize the use of sse and neon instrinsic.
8 years ago
Jiangtao Hu 32a8508138 Support scalar computing.
8 years ago
liaogang 6237f6f57a revert clang-format
8 years ago
liaogang f27fd9dc28 follow comments
8 years ago
liaogang 665cc0e7b2 remove redundant mutex
8 years ago
liaogang 5b8fe87faf dlopen lapacke api and remove gfotran
8 years ago
Liu Yiqun 2ae3dd08f9 Merge branch 'develop' into build_arm
8 years ago
Luo Tao 53da530d90 package avg_gpu_backward
8 years ago
Liu Yiqun 2a601e025e Set the simd-related kernels used under arm toolchains.
8 years ago
Liu Yiqun 717f755cee Include arm_neon.h on arm platform.
8 years ago
Peng Li 36dbfe86a7 Change ctcStatus_t to hl_warpctc_status_t to keep consistency
8 years ago
Peng Li 448e60b5fe Fix macro bug in hl_warpctc_warp.cc to support double precision
8 years ago
Liang Zhao 046349dd40 Fix definition of hl_matrix_classification_error in hl_matrix_stub.h
8 years ago
Liang Zhao 8fded24c75 implement top k classification error in class matrix
8 years ago
xutianbing a948eea3ed clean unused code.
8 years ago
Haonan 7f4042ec86 fix stub error
8 years ago
Haonan 2558c3f15a revisions according to reviews
8 years ago
liaogang eda4254af0 Remove hl_cudart_wrap.cc in CMake
8 years ago
liaogang 80c1679284 Fix cudart bugs before initMain
8 years ago
liaogang 4d6aca4b33 Warpctc only support dynamic load
8 years ago
liaogang b090ce329a Fix conflict with develop
8 years ago
liaogang 572d8254ea Clean cmake
8 years ago
xutianbing ec6b13dbfc clean up unused code.
8 years ago
liaogang aee0857838 Clean Travis ci and fix bug
8 years ago
liaogang 0b956711d9 Add external_project_dependencies for targets
8 years ago
liaogang c8d0791acc Add common.h and remove DisableCopy and Typedefs
8 years ago
liaogang f09989a1b2 Remove utils/CommandLineParser.h
8 years ago
hedaoyuan bf32411191 Merge branch 'develop' of https://github.com/baidu/Paddle into cmrnorm
8 years ago
chenchaoxiu 18ebeec2ac Added support for cudnn v6 and cuda 8.0
8 years ago
hedaoyuan d11e2b4013 Remove some useless code
8 years ago
hedaoyuan 9171ab0ac1 Merge branch 'develop' of https://github.com/baidu/Paddle into cmrnorm
8 years ago
gangliao 6aed264f0a Merge pull request #896 from gangliao/glog
8 years ago
liaogang 3d0e73bd32 Remove custom glog-like and gflags-like macros
8 years ago
Yi Wang 35ccf9c21d Disable clang-format check on a trick part of our source code
8 years ago
Yi Wang 8777ff3fa6 Use yapf to auto format all BUILD and WORKSPACE files
8 years ago
Yu Yang f821b6b750 Fit pre-commit for clang-format 4.x
8 years ago
hedaoyuan 529f24c262 cpu cmrnorm
8 years ago
Yu Yang 579e591207 Try to fix unittest error
8 years ago
Yu Yang be1b70e64e Tuning travis
8 years ago
Yu Yang 068bfbb817 All file pass pre-commit hook
8 years ago
Yi Wang e9549cbb78 Change "Baidu, Inc" into "PaddlePaddle Authors"
8 years ago
Yu Yang 2368ca8f7b Merge pull request #663 from gangliao/docker
8 years ago
liaogang 613d7c812b Fix conflicts with develop branch
8 years ago
Yiqun Liu 4823075f95 Merge pull request #651 from Xreki/warpctc
8 years ago
hedaoyuan abdcb8e128 format some files
8 years ago
hedaoyuan 671db8deaa Merge branch 'develop' of https://github.com/baidu/Paddle into tensor_merge
8 years ago
hedaoyuan 7e0b51f28f some bugs fix
8 years ago
hedaoyuan e63f1e6952 merge from cooder
8 years ago
liaogang 26b2996b0a Upgrade compiler‘s minimum version
8 years ago
Liu Yiqun 18b85e558a Add a script to auto compile the warp-ctc submodule.
8 years ago
liaogang f340f37f02 Change atomicAdd to paddleAtomicAdd
8 years ago
Liu Yiqun a816443e11 Add submodule warp-ctc.
8 years ago
Liu Yiqun 4d487c6f35 Integrate warp-ctc as WarpCTCLayer, including unitest and layer interface.
8 years ago
liaogang e488001675 Merge conflict with hl_cuda_device.cc
8 years ago
Luo Tao 9ea0661a82 clang format off on some cuda .cc file
8 years ago
Luo Tao 80c68d38ff clang format .cc .h .cpp .c and .hpp file
8 years ago
Yu Yang e9f50bd50b Merge branch 'develop' into feature/add_clang_format_plugin
8 years ago
liaogang ccea3b026e Add style check for *.cc files in cuda directory
8 years ago
gangliao 049f9d3a1c Fix a pointer comparison bug in hl_dso_loader.cc
8 years ago
liaogang 20aac5bba1 Add style check for *.cc files in cuda directory
8 years ago
liaogang 2c84c1ecfb Add profiler object and update docs
8 years ago
liaogang 84cab2c763 Merge conflict with develop branch
8 years ago
liaogang 2e9ea1cece Add Gpu profiler interface
8 years ago
Yu Yang 836d61382f Update pre-commit-config
8 years ago
Haonan 5591292b7a modifications according to comments
8 years ago
Haonan 069d0004dc multi_binary_cross_entropy when ids vector is provided
8 years ago
qijun 9dd588b414 fix merge conflicts
8 years ago
qijun f173341fb2 Merge remote-tracking branch 'baidu/develop' into feature/sppnet
8 years ago
liaogang 0519cc6423 Merge branch 'develop' of https://github.com/baidu/Paddle into bilinear
8 years ago
luotao1 e6c83f4ec0 some tiny fixs (#406)
8 years ago
qijun 3553576e6e Merge remote-tracking branch 'baidu/develop' into feature/sppnet
8 years ago
qijun e2c0713589 follow comments
8 years ago
liaogang cc04a7d7ab Merge branch 'develop' of https://github.com/baidu/Paddle into bilinear
8 years ago
liaogang db1757556e Follow comments
8 years ago
qijun db569f293e fix merge conflict
8 years ago
hedaoyuan a07da94939 fix floating-point overflow problem of tanh (#355)
8 years ago
lzhao4ever 4905751a22 Add define for double getrf, getri (#381)
8 years ago
liaogang bd38facada Fix conflict
8 years ago
liaogang 57348806b5 Follow comments
8 years ago
gangliao 3424a4c0d8 Fix bug and redundant code in hl_dso_loader.cc (#306)
8 years ago
lzhao4ever 5f2059db05 Add matrix inverse (#240)
8 years ago
qijun 766a61c374 fix conflict with baidu/develop
8 years ago
qingqing01 45c81a414f Add job=time in trainer, refine cudnn_conv to reduce gpu memory and speed up training. (#218)
8 years ago
qijun cdac60f616 add SpatialPyramidPoolLayer c++ support
8 years ago
gangliao 6467c38202 Add default cuda system path (#192)
8 years ago
liaogang fd4eeaf59c Merge conflict with maxout layer
8 years ago
liaogang ddfff3a7fd Add bilinear interpolation layer
8 years ago
luotao1 3dd8c9bea4 add maxout layer, including interface and unittest (#229)
8 years ago
gangliao c13bdb15cd remove redundant HPPL_TYPE_DOUBLE (#200)
8 years ago
luotao1 91df606280 remove some copyfrom in AgentLayer and ExpandLayer, fix warning in seq2seq config (#183)
8 years ago
Mark 9f244e4a39 Should not compile the two files if -DWITH_AVX=OFF. (#163)
8 years ago
qingqing01 191fafe355 support rectangle padding, stride, window and input for PoolProjection (#115)
8 years ago
gangliao 0ab332242f Support MAC OS Sierra (#169)
8 years ago
hedaoyuan b52039bd11 some bug fix for sparse matrix (#133)
8 years ago
liaogang 23e47bb600 Merge remote-tracking branch 'upstream/master'
8 years ago
emailweixu b15a4783cb Correctly handling multiple inputs and integer inputs for recurrent_g… (#114)
8 years ago
liaogang 1d4bc47805 support gettid() on MAC OS X
9 years ago