dzhwinter
85f8dd1c77
debug version
7 years ago
dzhwinter
bf1e76e401
"fix dso" ( #13407 )
7 years ago
dzhwinter
e1999538eb
debug the device context
7 years ago
dzhwinter
372caf4000
windows staff
7 years ago
dzhwinter
c3e1fb5a3e
add demo
7 years ago
luotao1
d230379b54
move anakin release to third_party/install/anakin
7 years ago
luotao1
6fae46a128
refine codes
7 years ago
luotao1
61cae53e79
support anakin for only-cpu environment
7 years ago
luotao1
9790acbde6
Merge branch 'library' of https://github.com/luotao1/Paddle into luotao1-library
7 years ago
luotao1
d7b4965785
auto generate paddle_inference_pass.h
7 years ago
Xin Pan
4529f70799
Merge pull request #13165 from gongweibao/fixgrpcurl
...
Fix grpc download url bug.
7 years ago
dzhwinter
379b471ee2
squash commit
7 years ago
luotao1
0639a32477
Merge branch 'develop' into library
7 years ago
gongweibao
f0e89b81d7
fix grpc cdn
7 years ago
dzhwinter
a0aa2ec8b5
build compile
7 years ago
dzhwinter
75681c0a79
switch to 9.2
7 years ago
dzhwinter
dbe90cc0f6
merge develop branch
7 years ago
luotao1
f3b7e18be9
add static library for analysis
7 years ago
dzhwinter
e09bb3171f
cuda windows
7 years ago
dzhwinter
f5329d6539
add some synatx
7 years ago
dzhwinter
b78394ea57
done
7 years ago
dzhwinter
b74af56bbc
cpu compile is done
7 years ago
luotao1
79d4a99d47
Merge branch 'develop' into anakin_v4
7 years ago
luotao1
e89936fc01
update anakin commit id due to fix 5117 compile bug
7 years ago
dzhwinter
26dbe35c54
add msvc flags and copy lib done
7 years ago
dzhwinter
dbd7896678
cmakelist windows ( #12927 )
...
* picked pr
* "fix ci"
7 years ago
dzhwinter
77c0aeb91e
boost ( #12929 )
...
* "fix ci"
* "windows tab"
* "fix ci"
7 years ago
dzhwinter
a4ffdf3088
gflags ( #12928 )
7 years ago
dzhwinter
669304f4e5
protobuf ( #12935 )
7 years ago
dzhwinter
5df6581101
merge_static_libs ( #12936 )
7 years ago
dzhwinter
593ac0f23e
openblas ( #12937 )
7 years ago
dzhwinter
c7e0ed8318
inference lib
7 years ago
dzhwinter
65f144aacc
fix commit
7 years ago
dzhwinter
89f95ea25e
merge develop branch
7 years ago
Tao Luo
3e3b5f4fda
Merge pull request #12675 from Sand3r-/fix-conv-mkldnn-0.15
...
Update MKLDNN to 0.15, fix convolution integration
7 years ago
dzhwinter
34f8c9b6f5
windows port
7 years ago
Tao Luo
decda738b0
fea/anakin compile with demo ( #12772 )
...
* anakin support x86
* fix code style
* add anakin ditu cnn demo
* add timer
* add rnn
* fix inference_anakin_cnn/rnn_test compile error
* make anakin_rnn_tester run
* add anakin_enable_op_time option
* update api/CMakeLists.txt
* enlarge the max_batch_size in anakin.config
* update with comments
7 years ago
dzhwinter
e23ddf6ae4
status ( #12764 )
7 years ago
minqiyang
d214dff13c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_anakin_in_manylinux1
7 years ago
minqiyang
8f8560744a
Reorder the cmake import and add CUDNN_INCLUDE_DIR into anakin cmake module
7 years ago
Yihua Xu
084d4a9e9e
Optimize CRF Decoding with AVX/AVX2/AVX512F instruction ( #12767 )
...
* Optimize CRF decoding with AVX/AVX2 instruction
* Enable the AVX2 flags for compiling
* Clean the code and decrease the count of multiply calculation
* Add the support of AVX512 instruction to optimize CRF Decoding
* Clean the code
* Enable the AVX512f flags for compiling
* Clean the code for the invaluable switch
* Fixed the issue to check AVX512F status
* Clean the code
* Add some explanation of the key points
7 years ago
dzhwinter
00463fdfe3
cudnn windows support ( #12757 )
...
* cudnn widndows
* "add comment"
* "windows support"
* "fix cmake error"
7 years ago
minqiyang
bfe8d6fa66
Polish code
7 years ago
minqiyang
e10e0d4a5b
Fix anakin build problem in manylinux1 docker image
7 years ago
dzhwinter
c5722eb03a
merge windows/cudnn
7 years ago
dzhwinter
17602eab94
windows port of malloc
7 years ago
dzhwinter
f1a7ae3d12
"fix cmake error"
7 years ago
dzhwinter
5c88cd2af5
remove werror in windows
7 years ago
luotao1
413bf9d494
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
Michal Gallus
6588d0e039
Update MKLDNN to 0.15, fix conv integration
7 years ago
dzhwinter
963a7457f5
"add comment"
7 years ago
dzhwinter
adec702af2
cudnn widndows
7 years ago
luotao1
447936551e
quick fix anakin on 5117 cpu
7 years ago
luotao1
546a26f081
add mklml depends for anakin
7 years ago
dzhwinter
d7873e1412
remove patchelf in windows ( #12710 )
...
* remove patchelf in windowls
* "follow comment"
7 years ago
luotao1
9f3789944c
use latest anakin commit
7 years ago
luotao1
3373535b21
fix specific cudnn include and library path
7 years ago
Tao Luo
1e1974c998
Merge pull request #12563 from luotao1/anakin_test
...
* make inference_anakin_test SERIAL
* add anakin compiler from github source code
* fix inference_lib_dist error
* add comment
* update anakin.cmake
* fix anakin-NOTFOUND compiler error
* modify the anakin_model download dir
7 years ago
Qiyang Min
4c3b165346
Merge pull request #12638 from velconia/add_py3_env_for_manylinux
...
Add py3 env for manylinux
7 years ago
tensor-tang
a50889f523
introduce xbyak
7 years ago
luotao1
a222d336ca
modify the anakin_model download dir
7 years ago
luotao1
22bc328951
fix anakin-NOTFOUND compiler error
7 years ago
luotao1
b2367f3661
update anakin.cmake
7 years ago
minqiyang
ed1f7df1c7
Fix mkldnn build bug in Ubuntu
7 years ago
luotao1
f4bcee1d6f
Merge branch 'develop' into anakin_test
7 years ago
luotao1
94042ccd2d
add comment
7 years ago
Luo Tao
64c0ba288a
fix inference_lib_dist error
7 years ago
sneaxiy
53052828e3
Merge pull request #12552 from sneaxiy/add_cub_support
...
Add CUB Library support in Paddle
7 years ago
sneaxiy
cef3585535
add cub support
7 years ago
Luo Tao
e8aa6d1283
add anakin compiler from github source code
7 years ago
chengduo
97a77512b4
Fix the order of sum ( #12562 )
...
* fix the order of sum
* add doc
* check whether need to copy
* follow comments
7 years ago
chengduo
93152b0bac
Fix the result of unit test ( #12520 )
...
* fix the result of unit test
* fix resnext
* compare the result of PE and Exe
* compare the result of reduce and allreduce
7 years ago
minqiyang
2e6bce597d
Add CUDA_VERSION to cudnn location detection
7 years ago
minqiyang
9812bb8b48
Fix pserver can NOT start with DebugString problem
7 years ago
cuichaowen
046de2acdb
Improve anakin feature ( #11961 )
7 years ago
Xin Pan
7a192d9e70
fix SERIAL tests
7 years ago
Qiyang Min
7da453630e
Merge pull request #12403 from velconia/fix_hang_up
...
Fix grpc destroy bug
7 years ago
minqiyang
b78ffde6d5
Add stopped sign for grpc client
7 years ago
minqiyang
bc5555b85d
Change git apply patches to cp because third_party is not managed in Paddle repo
7 years ago
Luo Tao
5ba4337698
unify libpaddle_inference_api into libpaddle_fluid
7 years ago
minqiyang
c0a551cb08
Change latest to 0.0.0
7 years ago
Qiyang Min
982d4230b9
Merge pull request #12145 from velconia/update_version_via_branch_name
...
Update version via branch name
7 years ago
Luo Tao
2e68abf47c
rename api.h to paddle_inference_api.h, put demo_ci in fluid_install_dir
7 years ago
Luo Tao
44b6a5f308
fix inference_lib.cmake and make demo_ci pass
7 years ago
minqiyang
da433f7927
Only get tag version when branch name matches release branch
7 years ago
Qiyang Min
cebf7c602d
Merge pull request #12095 from velconia/port_py3
...
Port py3
7 years ago
tensor-tang
908b53499c
disable libxsmm by default
7 years ago
minqiyang
bbed762b87
Port current code to python3
7 years ago
tensor-tang
32822b2a59
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
minqiyang
fdc06ab02e
Change python.cmake's version to 3.5
7 years ago
tensor-tang
7bb67b6788
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
minqiyang
fceaabdd81
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
Tao Luo
27aef26f57
Merge pull request #12009 from luotao1/warning
...
fix compile warning in inference related codes
7 years ago
tensor-tang
e8ae020a9d
explict cxx
7 years ago
tensor-tang
00ee6c3c17
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
minqiyang
b8ff38ae7a
Remove the new git branching model from this pr
7 years ago
minqiyang
1d8cbc1738
Change develop to latest and other branch will get the tag to decide which
...
version it is staying on
7 years ago
minqiyang
4365b24e2f
Add patches to grpc
7 years ago
minqiyang
d0deb472b8
Move grpc changes to patches
7 years ago
Luo Tao
fc3e7341fc
fix compile warning in inference related codes
7 years ago
minqiyang
fe49e46904
Fix the problem that CMake do not support {} regex
...
Change patch version to str
7 years ago
minqiyang
a947445a21
Replace new grpc package
7 years ago
minqiyang
1af7d5a2e8
Change the incorrect version result from UNKNOWN to 0
...
Replace {} to + in cmake regex match
7 years ago
minqiyang
37a4322112
Polish the code in setup.py.in
...
Change the PADDLE_VERSION in develop branch to latest
7 years ago
minqiyang
98f758d3d6
Change the grpc package version to the v1.10.x-paddle patch
7 years ago
tensor-tang
c3941745b3
add libxsmm_gemm
7 years ago
minqiyang
2c4fb585db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
minqiyang
0d04545e9c
Remove debug info
7 years ago
tensor-tang
17987eb3fc
link libxsmm
7 years ago
tensor-tang
763718c7a1
Introduce libxsmm
7 years ago
minqiyang
207d1b81fe
Add fixed grpc
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
Wu Yi
27d6962536
fix mac build ( #11873 )
...
* fix mac build
* add notes
* fix_mac_build
* update
7 years ago
minqiyang
1d6ecd3c4e
Change grpc version to 1.13.x
7 years ago
gongweibao
66c91911cf
Improve brpccmake ( #11842 )
7 years ago
Yan Chunwei
74dce39072
fix anakin compile manylinux ( #11861 )
7 years ago
chengduo
d2ad4a5c41
Init allocated memory for unit test ( #11657 )
...
* memory init
* add env
* refine anounce
* Add check for Nan
* Debug
* Add env for cc_test
* Add env for py_test and nv_test
* Remove py_test env
* Add env for py_test
* serial test_recognize_digits
* Test FLAGS_init_allocated_mem function for unit test
* Init allocated mem for op unit test
* Add env for all unit test
7 years ago
Tao Luo
fff6fa0f51
Merge pull request #11807 from typhoonzero/reduce_generic_cmake_dep
...
move find_fluid_modules
7 years ago
Yan Chunwei
cde5693bdd
fea/expose infrerence api so ( #11793 )
7 years ago
typhoonzero
53f217115d
move find_fluid_modules
7 years ago
Yan Chunwei
c228977727
add anakin release ( #11747 )
7 years ago
Tao Luo
2dae8a4631
Merge pull request #11596 from tensor-tang/refine/mklml/dyload
...
enable dynamic load mklml lib on fluid
7 years ago
Wu Yi
acfd177d0c
Retry rpc calls ( #11651 )
...
* make deadline configurable
* configurable deadline
* update
* fix grpc deadline exceeded
7 years ago
Jacek Czaja
98f3ad3ba1
- MKLDNN Softmax Grad Op
...
- Added hash function inside of MKLDNN softmax op to be used as handle for primitives stroing in a
context
- Style fixes to softmax mkldnn op
- Fixes after review
- Coding style
- Fix to style
- style fixes
- style fix
- style fixes
- Fix to cody style check
- Rephrasing a comment
fix t obroken merge
Fixes to rebase
Conflicts:
benchmark/fluid/models/machine_translation.py
cmake/external/mkldnn.cmake
paddle/fluid/operators/softmax_mkldnn_op.cc
- Bumped revision of MKL-DNN up to have softmax backward primitive
- Added choosing MKLDNN softmax grad operator
- First reuse of softmax backward
- Reinvented reusing for softmax
- Fix to crash in reinvented reuse
- Clang format fixes
- Clang format fixes
- Improved softmax mkldnn reuse mechanism
- clang format fixes
- Fix to broken merge
- Fix
7 years ago
tensor-tang
c475041405
link iomp as needed
7 years ago
tensor-tang
17c9e3d223
only deps mklml so, do not need link it
7 years ago
tensor-tang
f503f12925
enable dynamic load mklml lib on fluid
7 years ago
tensor-tang
9ff77a76de
fix mkldnn compile issue
7 years ago
Yan Chunwei
e42e6ea142
add inference lib to release ( #11482 )
7 years ago
tensor-tang
9169b3b802
Merge pull request #10789 from Xreki/core_fix_openblas_threads
...
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
Yan Chunwei
bd2a537b05
feature/anakin ci ( #11330 )
7 years ago
Wu Yi
34865f2de3
Trainer send term signal ( #11220 )
...
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
7 years ago
gongweibao
d9de6b8621
Add brpc surpport. ( #11263 )
7 years ago
Luo Tao
e116129f03
rewrite unittest of trt_activation_op
7 years ago
gongweibao
7f6503cb7c
Add comment of grpc.tar.xz ( #11153 )
7 years ago
yi.wu
d58955997e
fix term core only
7 years ago
tensor-tang
0e1c1276cf
fix inference_lib_dist deps
7 years ago
Qiao Longfei
391c274966
disable remove rpath from third party protoc ( #10939 )
7 years ago
Houjiang Chen
83f4e9e9a6
enable eigen multi-threads on mobile device ( #10938 )
7 years ago
Tao Luo
ec7f1e99ab
Merge pull request #10907 from reyoung/feature/remove_cpplint_while_compile
...
Remove cpplint in cmake
7 years ago
Tao Luo
174d884d2c
Merge pull request #10904 from luotao1/ci
...
refine fluid deploy on teamcity
7 years ago
yuyang18
a229734cbd
Remove cpplint in cmake
7 years ago
Luo Tao
654229b232
refine fluid deploy on teamcity
7 years ago
yuyang18
fcbf19bf93
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yu Yang
051a4b3916
Merge pull request #10840 from PaddlePaddle/revert-10836-update-becbos-url
...
Revert "update bcebos url so that it can also be downloaded in inner network"
7 years ago
yuyang18
b857ff1b45
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yu Yang
62559ace44
Merge pull request #10834 from reyoung/feature/sequnce_run_tests
...
Force some unittests serial
7 years ago
Yu Yang
d8a53ebc61
Revert "update bcebos url so that it can also be downloaded in inner network"
7 years ago
Qiao Longfei
942e2194c1
Merge pull request #10836 from jacquesqiao/update-becbos-url
...
update bcebos url so that it can also be downloaded in inner network
7 years ago
qiaolongfei
610ec4146a
update bcebos url so that it can also be downloaded in inner network
7 years ago
Tao Luo
565606a42d
Merge pull request #10827 from luotao1/paddle_script
...
change CMAKE_INSTALL_PREFIX in inference_lib_dist to FLUID_INSTALL_DIR
7 years ago
yuyang18
1b69c25c92
Merge branch 'feature/sequnce_run_tests' into feature/refine_parallel_executor
7 years ago
yuyang18
91007fe974
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18
1426d794ff
Force some unittests serial
7 years ago
Luo Tao
91d8373b18
change CMAKE_INSTALL_PREFIX to FLUID_INSTALL_DIR
7 years ago
Qiao Longfei
d54ad9f1a4
support user defined download url for mklml and boost ( #10819 )
7 years ago
Luo Tao
50e60ef321
add version and cmakecache in inference_lib
7 years ago
Liu Yiqun
50ba205d79
Merge branch 'develop' into core_fix_openblas_threads
7 years ago
Liu Yiqun
39eb871ddf
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
yuyang18
6db9c3c7d6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18
7a9eb21561
Make -j better
7 years ago
yuyang18
65c0535b5c
Use cached grpc
7 years ago
Tao Luo
651c934b2c
Merge pull request #10736 from luotao1/mkldnn_cmake
...
refine mkldnn cmake with official commit id
7 years ago
Yancey
062c811b6f
Merge pull request #10745 from Yancey1989/fix_bce_cdn_link
...
use cdn to speed up thirdparty package download
7 years ago
Yancey1989
57734901d9
use cdn to speed up thirdparty package download
7 years ago
Qiao Longfei
ca0dce7093
Merge pull request #10705 from jacquesqiao/add-mkldnn-to-paddle-lib
...
add mkldnn to paddle lib
7 years ago
Luo Tao
8fd5001223
refine mkldnn cmake with official commit id
7 years ago
Yu Yang
a06efd293e
Merge pull request #10731 from dzhwinter/fix/eigen_cmake
...
"change eigen mirror"
7 years ago
qiaolongfei
0764211960
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-mkldnn-to-paddle-lib
7 years ago
qiaolongfei
de3c5175c5
fix typo
7 years ago
dzhwinter
5129bddc2c
"change eigen mirror"
7 years ago
qiaolongfei
fdf99bbe6f
follow comment
7 years ago
Qiao Longfei
5c58eee552
Merge pull request #10690 from jacquesqiao/add-use-op-to-paddle-lib
...
add pybind.h to inference lib for static link
7 years ago
qiaolongfei
1aeb6c5c6d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-mkldnn-to-paddle-lib
7 years ago
qiaolongfei
269a62a2b5
add mkldnn to paddle lib
7 years ago
qiaolongfei
ff24f789cb
update pybind.h source dir
7 years ago
qiaolongfei
11a8814758
add pybind.h to inference lib for static link
7 years ago
yuyang18
dfbe06ccab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fix_ninja_build
7 years ago
yuyang18
dc6ce071d4
Polish cmake
7 years ago
qiaolongfei
2ddca7196d
update boost dst dir
7 years ago
qiaolongfei
efd425cb5e
only copy header file of boost
7 years ago
qiaolongfei
af2cd942bb
copy boost
7 years ago
Wu Yi
b3e5a88450
Patch mkldnn for build on gcc 4.8.2 ( #10616 )
...
* test_patch_mkldnn
* use patch file
* follow comments
* fix patch
7 years ago
Luo Tao
329655dd7b
update mklml version
...
disable building tests and examples when install mkldnn
skip -Werror=unused-result
7 years ago
Wojciech Uss
d0e8bd2905
bump mkldnn version
7 years ago
dzhwinter
eb6f9dd5de
Feature/cuda9 cudnn7 ( #10140 )
...
* "re-commit "
* "picked up"
* "fix ci"
* "fix pdb hang up issue in cuda 9"
7 years ago
dzhwinter
c26204027f
"fix cuda9 error" ( #10271 )
...
* "fix cuda9 error"
* "change commit id"
* "remote git tag"
7 years ago
Luo Tao
71f51ff64a
refine tensorrt cmake and dockerfile
7 years ago
Luo Tao
d4682247e1
auto find tensorrt library
7 years ago
typhoonzero
d9e2ff333e
grpc version to 1.10.x
7 years ago
Liu Yiqun
449bdde58a
Correct some typos.
7 years ago
Wu Yi
8d4d6eae5c
Merge pull request #9863 from typhoonzero/grpc_version_update
...
Update grpc version
7 years ago
Yiqun Liu
e90e7ab237
Remove the use of ARCHIVE_START/END ( #9844 )
...
* Add USE_OP of all operators and kernels and remove ARCHIVE_START/END in CMakeLists.txt of inference unittests.
* Remove ARCHIVE_START/END when linking inference shared library.
* Disable some fluid related cmake operations for cross-compiling.
7 years ago
typhoonzero
d798e3258d
update grpc version
7 years ago
Luo Tao
38f8676989
remove unused nccl.cmake
7 years ago
Yi Wang
8dbd9c394e
Fix part of the cpplint errors in fluid/platform ( #9802 )
7 years ago
Yi Wang
25ad6884bb
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into cpplint-memory-detail
7 years ago
Yi Wang
45bc4538c4
Update paddle_memory in CMakeLists.txt files
7 years ago
Yi Wang
e185502ebe
Fix cpplint errors with paddle/fluid/platform/dynload ( #9715 )
...
* Update source files.
* Update headers
* Update
* Update
* Update
* Update
* Fix a CMake dependency
7 years ago
Yi Wang
c839ec6c47
Update rcordio
7 years ago
Lei Wang
09b4a1a361
Build: generate all the build related files into one directory. ( #9512 )
7 years ago
qiaolongfei
72913dc2a6
change mklml download url to bce
7 years ago
Yang Yu
af230d9bef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Yang Yu
b0775588c0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Xi Chen
1daa96579c
adding comments for this fix
7 years ago
Xi Chen
06aaea8a64
Revert "test removal of redundant line"
...
This reverts commit d4f4935530
.
7 years ago
Xi Chen
d4f4935530
test removal of redundant line
7 years ago
Xi Chen
0e7413938a
added missing *.pb.h *.pb.cc generation to fix distribute build issue
7 years ago
Luo Tao
d573195dde
rm libmklml_gnu.so
7 years ago
Yu Yang
50e7e25db3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpp_parallel_executor
7 years ago
Luo Tao
ae820a34bc
Merge branch 'develop' into infer_mkl
7 years ago
qiaolongfei
ac94242ea9
change boost download url to speed up download
7 years ago
sabreshao
e50205e744
CMake refine for HIP support.
...
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
7 years ago
sabreshao
45c988d86a
Demostration of cmake refine for HIP support.
...
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
7 years ago
Yu Yang
6f0dfd89a4
Single GPU ParallelExecutor complete
7 years ago
Luo Tao
e42b8f8a11
fix mklml install path
7 years ago
Luo Tao
de13f0eb4e
Merge branch 'develop' into infer_mkl
7 years ago
QI JUN
7287630e83
Repair nccl op test ( #8575 )
...
* fix nccl op unit test
* fix build error
* format code
* refine nccl related unit test
* fix build error
* add setGPUData
* clean up
* follow comments
* rm test_nccl.cu
* follow comment
* rm wait
7 years ago
Luo Tao
5030681c36
add MKL for fluid static and shared library
7 years ago
Tao Luo
6f50dee4d5
compile and install the static library of fluid inference ( #7827 )
...
* compile and install the static library of fluid inference
* fix dynload_cuda not in CPU mode
* update shared library and adjust the deploy of openblas
* adjust the deploy of openblas
* * auto add all fluid modules for static library
* use libprotobuf.a instead of libprotobuf-lite.a for profiler
* use set_property to set the global varible instead of ENV
* add gpu depends of fluid modules, auto add inference_lib_dist depends
* change the condition of openblas_lib, and fix a typo
7 years ago
qiaolongfei
8ccb091fb4
fix snappy build on macos
7 years ago
Yu Yang
9dc69582de
Make recordio simple
7 years ago
dongzhihong
69c7991108
"add snappy library"
7 years ago
Yiqun Liu
ae2026e134
Fix building error of missing end-group for Android. ( #8680 )
7 years ago
Xin Pan
6720198731
Merge pull request #8663 from panyx0718/test2
...
First timeline version
7 years ago
Xin Pan
12843a3a53
Firt timeline version
7 years ago
Yu Yang
22b5c07a7d
Fix the compilation on CUDA 9.1/GCC 5.3
...
* Make CUPTI_LIB_PATH not passing by macro.
* Add missing header
7 years ago
Xin Pan
b9ec24c6e9
Extend current profiler for timeline and more features.
7 years ago
Yi Wang
9890bb5987
Explain why we choose an old version of boost ( #7967 )
7 years ago
Yang Yang
0717ff8b90
make boost library hosted on our server
7 years ago
Tao Luo
c77715e3af
Merge pull request #8410 from jczaja/prv-match-mkldnn-build-type-with-paddle-type
...
Fix MKLDNN build type to match Paddle build type
7 years ago
Jacek Czaja
5e1640f7e4
- Fix MKLDNN build type to match Paddle build type
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
Tao Luo
da02a5812c
refine inference_lib_dist after code move, and add it to docker/build.sh ( #8379 )
...
* refine inference_lib_dist after code move, and add it to docker/build.sh
* remove is_directory in inference_lib.cmake
7 years ago
Yu Yang
06f10942a8
Merge pull request #8275 from reyoung/feature/rewrite_vector
...
Rewrite mixed_vector.h
7 years ago
Yu Yang
ef1aba39a6
Rewrite mixed_vector.h
7 years ago
Liu Yiqun
e4d9989af0
Simplify the cmake of inference.
7 years ago
Tao Luo
6c3b78b7cf
Merge pull request #7977 from luotao1/inference_lib_dist
...
make inference_lib_dist for fluid inference shared library
7 years ago
Yu Yang
b41205d9a6
Disable BUILD_TESTS for warpctc ( #8210 )
...
* It will sightly faster compile and make warpctc compile
well on CUDA 9 and GCC 5.5
7 years ago
Luo Tao
f28dc9a68d
refine inference_lib.cmake
7 years ago
Luo Tao
59e4dd5797
add independent inference_lib.cmake
7 years ago
Luo Tao
446198dab6
Merge branch 'develop' into inference_lib_dist
7 years ago
Luo Tao
96d4bf5337
prevent make clean from cleaning ExternalProject boost
7 years ago
Luo Tao
d2caf777ae
set FLAGS_warpctc_dir to pass the test_warpctc_op unit test
7 years ago
Luo Tao
420f9c3873
Merge branch 'develop' into warpctc_deps
7 years ago
Liu Yiqun
119da44954
Merge branch 'develop' into core_add_inference_unittest
7 years ago
Luo Tao
9b5d41b636
make inference_lib_dist
7 years ago
Liu Yiqun
6ac2e079b3
Enable whole-archive flag in cc_test and use cc_test to rewrite the CMakeLists.txt of inference unittest.
7 years ago
yi.wu
90a5fd26a9
fix boost down link
7 years ago
yi.wu
a8c46f33d8
downgrade boost to fit manylinux
7 years ago
dangqingqing
e5fc79d88a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into boost_cmake
7 years ago
Luo Tao
ca636eedb9
remove libwarpctc.so in core.so and libpaddle_fluid.so
7 years ago
dangqingqing
a89e1b85fe
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into boost_cmake
7 years ago
dangqingqing
da018e31bd
Remove boost installing step and remove boost installation in Dockerfile scripts.
7 years ago
Luo Tao
5c056535ed
Merge branch 'develop' into fluid_infer
7 years ago
Tao Luo
eaa8d68006
Merge pull request #7693 from luotao1/openblas_install
...
refine openblas install lib when CAPI
7 years ago
dzhwinter
e983cc90fc
"fix decode bug" ( #7711 )
...
* "fix decode bug"
* "follow commnet"
* "fix error"
* "fix hook bug"
* fix based comment
* fix copyright
* fix based on comment
7 years ago
dangqingqing
564c6abda3
Add cmake for extern project of boost.
7 years ago
Luo Tao
b4685675df
refine openblas install lib when CAPI
7 years ago
Luo Tao
c0f0f2337e
add WITH_FLUID option and third party INSTALL for fluid api
7 years ago
gongweibao
535fefb7e8
Fix grpc bugs ( #7435 )
...
Fix grpc bugs
7 years ago
dzhwinter
b9b75377a2
Feature/hooks ( #7513 )
...
* add copyright hook
* add copyright hook
* refine copyright hook
* "test copyright hook"
* fix check style
* fix ci
7 years ago
Yiqun Liu
b5fda2723f
Port WarpCTC Operator ( #5107 )
...
* Add Seq2BatchFunctor, which will be used in WarpCTCOp.
* Implement WrapCTCFunctor and WrapCTCKernel.
* Add unittest of warpctc_op.
* Modify the check_output inferface in python unittest framework to allow check a subset of outputs.
* Use absolute offset lod in warpctc_op and related functors.
* Refine the comments of warpctc_op.
* The new python unittest supports checking a subset of the outputs, so revoke the previous change.
* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
* Update to the newest codes.
* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
7 years ago
tensor-tang
6de2049651
use the mkldnn shared lib so.0
7 years ago
tensor-tang
11ed2f2f93
package right mkldnn and mklml libs if enabled in capi
7 years ago
tensor-tang
6177cb5162
Merge remote-tracking branch 'upstream/develop' into context
7 years ago
Tao Luo
809122cb5a
Merge pull request #7097 from Xreki/core_inference_example
...
Add a simple C++ inference example for fluid
7 years ago
Tao Luo
6f347faafe
Merge pull request #6401 from Xreki/fix_build_android_openblas
...
Update the version of openblas for Android and iOS
7 years ago
tensor-tang
5bf5650dcf
generate mkldnn dummy target for static deps
7 years ago
Liu Yiqun
9b3f2c39f2
Add a simple example for fluid to do inference in C++ code.
7 years ago
hedaoyuan
0295b00066
Add libprotobuf-lite.a when install. ( #6340 )
...
* Add libprotobuf-lite.a when install.
* Fix protobuf.cmake
* Bug fix
7 years ago
Liu Yiqun
6dc0e663f4
Merge branch 'develop' into fix_build_android_openblas
7 years ago
Luo Tao
e5dcefc4d3
remove ATLAS library
7 years ago
tensor-tang
69b44f2f19
unify MKL macro definition
7 years ago
Liu Yiqun
2bdd3e43fa
Update the version of openblas.
7 years ago
Yancey1989
c53a9c8d44
compile cblas library as static
7 years ago
kexinzhao
1d1555e2e5
Merge pull request #5716 from kexinzhao/float16
...
Add half precision float16 data type
7 years ago
dzhwinter
e0ac34a620
"fix build cares" ( #6097 )
7 years ago
武毅
067381e2e2
Merge pull request #6146 from typhoonzero/add_switch_distributed
...
Add switch for distributed support
7 years ago
Yancey
1b612d3a76
Fix grpc compile warning ( #6050 )
...
* fix grpc compile warn
* update
* -Wnon-virtual-dtor -> -Wno-non-virtual-dtor
7 years ago
Yiqun Liu
7480291c62
Add version and commit information in capi config.h and use unofficial glog for Android API < 21. ( #6113 )
...
* Automatically configure the version and commit information in capi.
* Use the unofficial glog repository for building for Android (API < 21).
7 years ago
typhoonzero
1a852861b2
add switch for distributed support
7 years ago
qingqing01
5f0d081886
Merge pull request #6115 from qingqing01/protobuf-v3.2
...
Use protobuf v3.2.0 for MOBILE_INFERENCE compiling.
7 years ago
Yu Yang
3a8311f819
Fix compile error for gcc 6.3 ( #6112 )
7 years ago
dangqingqing
a0648ee449
Add comments.
7 years ago
dangqingqing
4d47683b1c
Use protobuf v3.2.0 for MOBILE_INFERENCE compiling.
7 years ago
QI JUN
1238706d72
Refine unittest with setting gflags ( #5476 )
...
* add gflags for C++ unittest
7 years ago
Yu Yang
dc91c4e3a4
Fix MacOS compile ( #6062 )
7 years ago
Kexin Zhao
4901184ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into float16
7 years ago
Kexin Zhao
41bd1f9115
fix gpu test, clean code and add cmake
7 years ago
武毅
4ecbab42d8
Fix compile on cudnn7 ( #5982 )
...
* fix compile on cudnn7
* update
* update
* make silent
7 years ago
Yu Yang
6375c8cacb
Fix MacOS compile ( #5978 )
...
* Fix MacOS compile
* Update GRPC
* Unset PROTOBUF_EXEC
7 years ago
武毅
0a8a86e0c9
Send recv op ( #5520 )
...
* WIP send recv op
* WIP send recv
* put grpc impl in details
* put grpc impl in details
* update wip
* update proto
* update proto
* update proto
* clean cmake
* wip on op implementations
* wip on op implementations
* compile ok adding ut
* wip unitest
* add extern cares for linking
* wip add ut
* working version send recv
* revert optimizer.py
* update test cmake
* add libtool to dockerfile
* update cmake dependency
* update cmake depends
* update cmake grpc depends
* fix cmake dependency
* fix compile error
* fix compile
* follow comments
* update
* update copyfrom
7 years ago
Qiao Longfei
c9a96575d5
py_test and test_image_classification_train support argument ( #5934 )
...
* py_test support argument, test_image_classification_train support argument
* use REMOVE_ITEM to rm item from list in cmake
7 years ago
Qiao Longfei
1f6002edc0
update gflags ( #5904 )
7 years ago
Yancey1989
cd29714af0
fix py unit test executable
7 years ago
dangqingqing
94e86897fa
update code and fix conflicts.
7 years ago
dangqingqing
082bc7af56
Use CUDA_ARCH_NAME=All in the paddle/scripts/docker/build.sh and remove 20 21(20) in cmake/cuda.cmake.
7 years ago
tensor-tang
1578c20aaf
add the missing macro PADDLE_USE_MKLDNN
7 years ago
tensor-tang
cece368d6f
Merge remote-tracking branch 'upstream/develop' into refine
7 years ago
Yiqun Liu
c808fbbfcb
Support the build for multiple architectures at one cmake command (iOS). ( #5677 )
...
* Support the build for multiple architectures at one cmake command (iOS).
* Update the documentations.
7 years ago
tensor-tang
8496eab45a
make mklml necessary when with_mkldnn
7 years ago
dangqingqing
3d080f3ad5
Refine cmake about CUDA to automatically detect GPU arch by default.
...
1. Automatically detect GPU arch by default.
2. Specify -DCUDA_ARCH_NAME=All when releasing PaddlePaddle new version
7 years ago
Luo Tao
49ac0480e6
fix a lot of warnings -Wunused-but-set-variable and Wunused-variable when compile openBlas
7 years ago
Yiqun Liu
d3ef2543f9
Fix compiling error for Android, and installing error for cmake of low version. ( #5660 )
7 years ago
Luo Tao
52272bc076
fix openblas bug: undefined reference to pthread_atfork
7 years ago
Tao Luo
97c1bb1694
Merge pull request #5521 from tensor-tang/develop
...
fix EQUAL unknown
7 years ago
Yu Yang
5e13e706f9
Fix CI compile ( #5526 )
7 years ago
tensor-tang
df105ac940
fix EQUAL unknown
7 years ago
Luo Tao
34d02f94b5
RollBACK the openblas.cmake
7 years ago
Luo Tao
7835d49384
remove PADDLE_USE_MKL
7 years ago
Luo Tao
5a5b729747
remove unused INTEL_MKL_ROOT etc.
7 years ago
tensor-tang
4fd432fdac
update mkldnn tag
...
and abandoned deprecated sum API interface
7 years ago
tensor-tang
a1856be5eb
update mklml tag
7 years ago
Yiqun Liu
bba6223598
Enable the build for iOS simulator. ( #5211 )
7 years ago
Yiqun Liu
288ffdd715
Correct the install command, static library name and typo in nccl.cmake. ( #5048 )
7 years ago
Qiao Longfei
ee998a9c44
CPU Batch Norm Op ( #4964 )
...
* init batch norm op
* prepare input output
* compute mean_out var_out save_mean save_var on CPU
* active is test
* use eigen to do computation
* complete batch norm forward
* set default momentum to 0.9
* add batch norm grad op in CPU
* add tensor_format and NHWC support, add python test
* add test training
* add batch norm gradient test
* improve comment, fix foward Python UnitTest
* add gradient test
* fix eigen warning
* follow name style
* fix a bug
* change float to T
* add simple forward test
* test with different place
* add backward test
* refine python test
* remove old python test code
* code clean
* follow code style
* update comment
7 years ago
Luo Tao
fb4744fd6a
fix nccl compiler error
7 years ago
Yu Yang
94e741d6f0
Use external project for NCCL ( #5028 )
7 years ago
Yu Yang
43c6ff212e
Feature/nccl dso ( #5001 )
...
* "add nccl enforce"
* Dev
* Update comment
* Add nccl test
* Follow comments
7 years ago
hedaoyuan
6316b40a2c
Merge pull request #4753 from hedaoyuan/inference
...
Use MinSizeRel compile third_party library when build for mobile infe…
7 years ago
helinwang
ce91f85ec5
Add GIT tag for all cmake dependencies. ( #4776 )
7 years ago
hedaoyuan
773d064ae0
Use MinSizeRel compile third_party library when build for mobile inference.
7 years ago
xzl
07e5d86787
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into protobuf_lite
7 years ago
xzl
54a03ab31d
add paddle_protobuf_generate_cpp() Func which could dynamic set RUNTIME_LITE mode for paddle
7 years ago
Yan Chunwei
843ed8e320
dynamic recurrent op forward c++ implentation ( #4597 )
7 years ago
hedaoyuan
aa1e92ec9a
Merge branch 'develop' of https://github.com/baidu/Paddle into inference
7 years ago
Yi Wang
f985700abf
Resolve conflict
7 years ago
Yu Yang
84500f9487
Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
...
By shell command
```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
7 years ago
Yi Wang
5f51d0afc4
Add -D PADDLE_WITH_CUDA in cmake/configure.cmake
7 years ago
hedaoyuan
d94dd75e3b
Remove string, scripts and retain cuda.
7 years ago
hedaoyuan
ea4672bea0
Remove optimizer.
7 years ago
hedaoyuan
33299ef972
Remove cuda.
7 years ago
hedaoyuan
3c66b307f7
Remove the pserver, trainer, evaluators and some useless gradientmachines when compile mobile inference library.
7 years ago
Qiao Longfei
d0ad82cff1
fix nv_library ( #4370 )
...
* fix nv_library
* fix symbol in gpu_info.h
7 years ago
Tao Luo
330e9929ec
Merge pull request #4310 from tensor-tang/vgg
...
enable VGG with MKLDNN layers
7 years ago
Yiqun Liu
2d8467ee9d
Merge pull request #4320 from Xreki/fix_android_linking_error
...
Fix bug in cc_library, when merging several libraries into one on Linux.
8 years ago
Liu Yiqun
dd2f477a50
Fix bug in cc_library, when merging several libraries into one on Linux.
8 years ago
tensor-tang
f96d31d552
only link iomp when with MKLDNN and MKLML
8 years ago
Liu Yiqun
35dd981db0
Fix linking error on mac.
8 years ago
Yiqun Liu
a9202e89cf
Merge pull request #3764 from Xreki/build_ios
...
Add the support of the cross-compiling for iOS devices
8 years ago
Liu Yiqun
46ee8ceecb
Move the sreach and link of veclib to iOS's toolchain file .
8 years ago
Luo Tao
70cadf9350
Merge branch 'develop' into use_op
8 years ago
Luo Tao
0597f4de99
auto generate USE_OP() in pybind.cc
8 years ago
Liu Yiqun
76338fb6a7
Merge branch 'develop' into build_ios
8 years ago
Liu Yiqun
9e524fce1b
Remove the generation of executable, such as paddle_trainer, when setting WITH_C_API.
8 years ago
Liu Yiqun
b4d710ce12
Merge branch 'develop' into build_ios
8 years ago
Liu Yiqun
5991a35e58
Install the automatically built, compiled libraries under third_party when
...
executing `make install` and WITH_C_API is set.
8 years ago
Liu Yiqun
7976876993
Add the finding of iOS's native vecLib.framework.
8 years ago
Liu Yiqun
d1f5f49826
Merge branch 'develop' into build_ios
8 years ago