武毅
0a8a86e0c9
Send recv op ( #5520 )
...
* WIP send recv op
* WIP send recv
* put grpc impl in details
* put grpc impl in details
* update wip
* update proto
* update proto
* update proto
* clean cmake
* wip on op implementations
* wip on op implementations
* compile ok adding ut
* wip unitest
* add extern cares for linking
* wip add ut
* working version send recv
* revert optimizer.py
* update test cmake
* add libtool to dockerfile
* update cmake dependency
* update cmake depends
* update cmake grpc depends
* fix cmake dependency
* fix compile error
* fix compile
* follow comments
* update
* update copyfrom
7 years ago
Qiao Longfei
c9a96575d5
py_test and test_image_classification_train support argument ( #5934 )
...
* py_test support argument, test_image_classification_train support argument
* use REMOVE_ITEM to rm item from list in cmake
7 years ago
Qiao Longfei
1f6002edc0
update gflags ( #5904 )
7 years ago
Yancey1989
cd29714af0
fix py unit test executable
7 years ago
dangqingqing
94e86897fa
update code and fix conflicts.
7 years ago
dangqingqing
082bc7af56
Use CUDA_ARCH_NAME=All in the paddle/scripts/docker/build.sh and remove 20 21(20) in cmake/cuda.cmake.
7 years ago
tensor-tang
1578c20aaf
add the missing macro PADDLE_USE_MKLDNN
7 years ago
tensor-tang
cece368d6f
Merge remote-tracking branch 'upstream/develop' into refine
7 years ago
Yiqun Liu
c808fbbfcb
Support the build for multiple architectures at one cmake command (iOS). ( #5677 )
...
* Support the build for multiple architectures at one cmake command (iOS).
* Update the documentations.
7 years ago
tensor-tang
8496eab45a
make mklml necessary when with_mkldnn
7 years ago
dangqingqing
3d080f3ad5
Refine cmake about CUDA to automatically detect GPU arch by default.
...
1. Automatically detect GPU arch by default.
2. Specify -DCUDA_ARCH_NAME=All when releasing PaddlePaddle new version
7 years ago
Luo Tao
49ac0480e6
fix a lot of warnings -Wunused-but-set-variable and Wunused-variable when compile openBlas
7 years ago
Yiqun Liu
d3ef2543f9
Fix compiling error for Android, and installing error for cmake of low version. ( #5660 )
7 years ago
Luo Tao
52272bc076
fix openblas bug: undefined reference to pthread_atfork
7 years ago
Tao Luo
97c1bb1694
Merge pull request #5521 from tensor-tang/develop
...
fix EQUAL unknown
7 years ago
Yu Yang
5e13e706f9
Fix CI compile ( #5526 )
7 years ago
tensor-tang
df105ac940
fix EQUAL unknown
7 years ago
Luo Tao
34d02f94b5
RollBACK the openblas.cmake
7 years ago
Luo Tao
7835d49384
remove PADDLE_USE_MKL
7 years ago
Luo Tao
5a5b729747
remove unused INTEL_MKL_ROOT etc.
7 years ago
tensor-tang
4fd432fdac
update mkldnn tag
...
and abandoned deprecated sum API interface
8 years ago
tensor-tang
a1856be5eb
update mklml tag
8 years ago
Yiqun Liu
bba6223598
Enable the build for iOS simulator. ( #5211 )
8 years ago
Yiqun Liu
288ffdd715
Correct the install command, static library name and typo in nccl.cmake. ( #5048 )
8 years ago
Qiao Longfei
ee998a9c44
CPU Batch Norm Op ( #4964 )
...
* init batch norm op
* prepare input output
* compute mean_out var_out save_mean save_var on CPU
* active is test
* use eigen to do computation
* complete batch norm forward
* set default momentum to 0.9
* add batch norm grad op in CPU
* add tensor_format and NHWC support, add python test
* add test training
* add batch norm gradient test
* improve comment, fix foward Python UnitTest
* add gradient test
* fix eigen warning
* follow name style
* fix a bug
* change float to T
* add simple forward test
* test with different place
* add backward test
* refine python test
* remove old python test code
* code clean
* follow code style
* update comment
8 years ago
Luo Tao
fb4744fd6a
fix nccl compiler error
8 years ago
Yu Yang
94e741d6f0
Use external project for NCCL ( #5028 )
8 years ago
Yu Yang
43c6ff212e
Feature/nccl dso ( #5001 )
...
* "add nccl enforce"
* Dev
* Update comment
* Add nccl test
* Follow comments
8 years ago
hedaoyuan
6316b40a2c
Merge pull request #4753 from hedaoyuan/inference
...
Use MinSizeRel compile third_party library when build for mobile infe…
8 years ago
helinwang
ce91f85ec5
Add GIT tag for all cmake dependencies. ( #4776 )
8 years ago
hedaoyuan
773d064ae0
Use MinSizeRel compile third_party library when build for mobile inference.
8 years ago
xzl
07e5d86787
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into protobuf_lite
8 years ago
xzl
54a03ab31d
add paddle_protobuf_generate_cpp() Func which could dynamic set RUNTIME_LITE mode for paddle
8 years ago
Yan Chunwei
843ed8e320
dynamic recurrent op forward c++ implentation ( #4597 )
8 years ago
hedaoyuan
aa1e92ec9a
Merge branch 'develop' of https://github.com/baidu/Paddle into inference
8 years ago
Yi Wang
f985700abf
Resolve conflict
8 years ago
Yu Yang
84500f9487
Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
...
By shell command
```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
8 years ago
Yi Wang
5f51d0afc4
Add -D PADDLE_WITH_CUDA in cmake/configure.cmake
8 years ago
hedaoyuan
d94dd75e3b
Remove string, scripts and retain cuda.
8 years ago
hedaoyuan
ea4672bea0
Remove optimizer.
8 years ago
hedaoyuan
33299ef972
Remove cuda.
8 years ago
hedaoyuan
3c66b307f7
Remove the pserver, trainer, evaluators and some useless gradientmachines when compile mobile inference library.
8 years ago
Qiao Longfei
d0ad82cff1
fix nv_library ( #4370 )
...
* fix nv_library
* fix symbol in gpu_info.h
8 years ago
Tao Luo
330e9929ec
Merge pull request #4310 from tensor-tang/vgg
...
enable VGG with MKLDNN layers
8 years ago
Yiqun Liu
2d8467ee9d
Merge pull request #4320 from Xreki/fix_android_linking_error
...
Fix bug in cc_library, when merging several libraries into one on Linux.
8 years ago
Liu Yiqun
dd2f477a50
Fix bug in cc_library, when merging several libraries into one on Linux.
8 years ago
tensor-tang
f96d31d552
only link iomp when with MKLDNN and MKLML
8 years ago
Liu Yiqun
35dd981db0
Fix linking error on mac.
8 years ago
Yiqun Liu
a9202e89cf
Merge pull request #3764 from Xreki/build_ios
...
Add the support of the cross-compiling for iOS devices
8 years ago
Liu Yiqun
46ee8ceecb
Move the sreach and link of veclib to iOS's toolchain file .
8 years ago