heqiaozhi
419506f510
refine for compile pslib.so
6 years ago
heqiaozhi
a77fa67bbd
async_thread_trainer & libmct & pslib.cmake
6 years ago
Xin Pan
7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
...
Implementing ngraph engine
6 years ago
heqiaozhi
4798a8c7b8
pslib_brpc
6 years ago
heqiaozhi
038346c0c2
libmct
6 years ago
heqiaozhi
3c239cd640
pslib
6 years ago
Sang Ik Lee
24e70920db
Refactor some build settings.
...
test=develop
6 years ago
Sang Ik Lee
d6125a5eec
Include ngraph in inference demo build.
...
test=develop
6 years ago
Qiao Longfei
bcad29c680
gzstream depend on the zlib in thirdparty
...
test=develop
6 years ago
Qiao Longfei
35b79ab865
Merge pull request #13983 from jacquesqiao/add-ctr-reader
...
Add ctr reader
6 years ago
Qiao Longfei
1edd435da6
fix ci problem test=develop
6 years ago
Tao Luo
1538059ba3
Merge pull request #14595 from luotao1/clean_infer_library
...
clean inference include files
6 years ago
Qiao Longfei
668ae9083e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
6 years ago
wopeizl
05b7ee7eeb
Merge pull request #14545 from wopeizl/windows/online
...
Windows/online
6 years ago
Tao Luo
c0b3f93bff
clean inference include files
...
test=develop
6 years ago
minqiyang
e43f5bc77c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_resnet_ut_in_py36
...
test=develop
6 years ago
peizhilin
6250be4b5c
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
30849d1f20
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
6224e61fd9
Transpose-Flatten-Concat fusion operator. ( #14568 )
...
* Transpose-Flatten-Concat fusion operator.
* Add unit testing and fix bug.
6 years ago
peizhilin
e0d47cc941
test=develop
6 years ago
peizhilin
fe1788caab
Merge branch 'windows/build' into windows/online
6 years ago
peizhilin
36cd18b549
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
a89ed6998e
Merge branch 'windows/build' into windows/online
6 years ago
peizhilin
1afa9492af
Recover the profiler
6 years ago
minqiyang
8038cd10a9
Upgrade pybind11 to v2.2.4 to support Python3.7
...
test=develop
6 years ago
peizhilin
eb57780557
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
445fff24dc
add the bigobj option to NVCC compile
...
fix code style
6 years ago
sabreshao
61c5f13fcf
Fix cmake for AMDGPU platform ( #13801 )
...
* HIP cmake.
Enable whole archieve build for pybind library.
Disable two warning.
Rollback to C++11.
Link RCCL to WA gpu kernel loading issue.
Update eigen to fix build failure.
Add more include directories.
Fix O3 build failure.
Update eigen.
fix tensor_util_test segment fault issue
add more macro check in hip.cmake.
we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
Fix rocRAND load.
Update eigen to fix gru_unit_op and reduce_op.
Add HIP support to testing.
Update eigen to support int16 and int8 in arg min and arg max.
* add rocprim as cub library used by nv implementation
* Reduce build time in rocprim.
* Add rocprim introduction, remove useless cmake code.
* Remove useless flags and format cmake file.
6 years ago
Tao Luo
dfbdece55c
Merge pull request #14550 from kbinias/bumped-mkldnn-version-to-rls-v0.17
...
Bumped MKL-DNN version to 0.17
6 years ago
Krzysztof Binias
6cc6bf4074
Bumped MKL-DNN version to 0.17
...
test=develop
6 years ago
peizhilin
31de60959a
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
7c8c9dc9bf
fix unit test cases
6 years ago
sneaxiy
488610a65a
merge develop
...
test=develop
6 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
6 years ago
sneaxiy
3912545ffe
add dlpack support
...
test=develop
6 years ago
peizhilin
133d11d3bb
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
6e66fadb95
clean up the pre-definitions on windows
6 years ago
peizhilin
a7c8e42d6b
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
67562a6fcd
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
3630386a89
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
ee0fd78c81
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
fd7e643153
Convolution fusion operator. ( #14449 )
...
* Convolution fusion operator.
* Clean code
test=develop
6 years ago
Wojciech Uss
d36491c28a
add allocator.h copy
...
The allocator.h header file is required for C-API inference applications
test=develop
6 years ago
peizhilin
4a6769da84
re-organize the cmake file
6 years ago
peizhilin
cc319f64cb
disable avx on windows by default
...
test=develop
6 years ago
peizhilin
b1cd868086
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
b967e01cbe
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Wu Yi
d7bd0361cb
fix dist deps ( #14471 )
...
* fix dist deps test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
6 years ago
peizhilin
b6d8920424
Merge branch 'windows/build' into windows/online
6 years ago
peizhilin
a3e952f41d
add the jit back
...
fix compile error on windows
6 years ago
peizhilin
928efeed46
add the jit support, test=develop
6 years ago
peizhilin
1f0449aded
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
1cc23ef67d
merge from paddle:develop
7 years ago
peizhilin
1047b0b469
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
b942f4760a
fix cc_test on windows
7 years ago
Wu Yi
a2d9b34417
Refine operator cmake ( #14413 )
...
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
7 years ago
peizhilin
695e2aba5e
fix the gtest.cmake on windows
7 years ago
peizhilin
dc80be275d
code style
...
test=develop
7 years ago
peizhilin
6d0d5a76eb
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yan Chunwei
7796f65f89
fix inference on gpu out of mem ( #14414 )
...
* fix inference on gpu out of mem
the transfer logic in operator.cc will keep creating new scopes.
7 years ago
peizhilin
d1429ac4a5
add recordio support
7 years ago
peizhilin
be332a13bc
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang
83ddafb515
Splict cicheks jobs and expose anakin options ( #14327 )
...
* Split cichecks
test=develop
* feat(Anakin): expose anakin options to paddle cmake option
Expose ANAKIN_BUILD_FAT_BIN, ANAKIN_BUILD_CROSS_PLANTFORM to Paddle cmake option
test=develop
7 years ago
peizhilin
447bf7c80b
test=develop
7 years ago
peizhilin
203ec852cf
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
30ddc07a7e
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yan Chunwei
9f252e0032
Combine Inference Analysis with IR ( #13914 )
7 years ago
peizhilin
bb3f6bd31c
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
61fa5218b9
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Tao Luo
9d4425dd1c
Merge pull request #14227 from baojun-nervana/intel/ngraph_cmake
...
Adding Intel nGraph build
7 years ago
Yu Yang
8f9bfad246
perf(compile): speed up reduce_op compile by splitting files ( #14294 )
...
test=develop
7 years ago
peizhilin
13bfee1f85
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
dc339b78d7
fix code style
7 years ago
Yu Yang
792bf0b77f
Fix for cmake 1.11 ( #14350 )
...
test=develop
7 years ago
Sang Ik Lee
236e4d2635
Fix build issues on CentOS.
...
test=develop
7 years ago
peizhilin
9b558a8035
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
7638f0afb3
simplify the logic
7 years ago
peizhilin
efe9d44ae6
Merge branch 'windows/build' into windows/online
...
test=develop
7 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Tao Luo
34e9e59f4a
Merge pull request #14333 from kbinias/change-hardcoded-format-and-bump-mkldnn-version
...
Changed hardcoded format to any in convolution and bumped MKL-DNN version to 0.17rc
7 years ago
peizhilin
ded93a354a
fix the typo
7 years ago
peizhilin
9fa96147c2
fix the typo
7 years ago
Krzysztof Binias
080112276a
Fixed problem with array subscript is above array bounds in MKL-DNN jit_uni_reorder_utils.cpp:prb_simplify function
...
test=develop
7 years ago
peizhilin
c290c7a076
Merge branch 'windows/build' into windows/online
7 years ago
peizhilin
4ffa92d4f0
Merge branch 'develop' into windows/build
7 years ago
Krzysztof Binias
f1c1acf1ac
Changed hardcoded format to any in convolution and bumped MKL-DNN version to 0.17-rc
...
test=develop
7 years ago
peizhilin
45125ba538
fix share library issue
7 years ago
Zhaolong Xing
ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
peizhilin
52d3cd964e
fix
7 years ago
peizhilin
67eb357f87
Merge branch 'windows/build' into windows/online
7 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
peizhilin
3c439feadc
remove the duplicate flag
7 years ago
peizhilin
e3f7be959d
fix the debug flag for nvcc
7 years ago
peizhilin
130cdda65b
add gpu debug mode
7 years ago
peizhilin
77892124fb
online configuration
7 years ago
dzhwinter
2835e04409
merge develop branch. test=develop
7 years ago
Sang Ik Lee
f30c1ddb45
Include nGraph build.
...
test=develop
7 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
7 years ago
tensor-tang
d277a2e6ef
fix avx512f flag ( #14041 )
7 years ago
tensor-tang
94ab65d591
disable avx2 and avx512 flag
...
test=develop
7 years ago
peizhilin
9d67c1fb69
cpu build support
7 years ago
dzhwinter
eb2f7ed21b
refine tests. test=develop
7 years ago
dzhwinter
0a180584e6
clean cmake. test=develop
7 years ago
dzhwinter
316765839d
add back jit simd instructions. stage.
7 years ago
dzhwinter
bf2e4cb188
cleard. staged
7 years ago
dzhwinter
ebfe5a02b3
merge develop branch
7 years ago
Xin Pan
9ef19d4919
Merge pull request #14106 from luotao1/fix_cmake_warning
...
[1.1] fix cmake warning when ON_INFER=false
7 years ago
Qiao Longfei
74f77accfc
fix xxhash compile on macos
...
test=develop
7 years ago
Tao Luo
4928ff32a9
fix cmake warning when ON_INFER=false
...
test=develop
7 years ago
Yan Chunwei
ee74be3a49
[1.1] Bugfix/tensorarray ( #14044 )
7 years ago
minqiyang
7f7af5d412
Add xxhash deps to inference demo and trainer demo
...
test=develop
7 years ago
minqiyang
fe18adfbaa
Add fluid inference support
...
test=develop
7 years ago
minqiyang
40141f749b
Implement the unittest for hash op
...
test=develop
7 years ago
minqiyang
accb7b5d95
Polish code
7 years ago
minqiyang
8a0f26f45f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
7 years ago
minqiyang
d4f9aa0852
Add hash op implementation
7 years ago
wanghaoshuang
9e522a4495
update cmake
7 years ago
wanghaoshuang
f9e7cfb03c
save binary file
7 years ago
Jiabin Yang
5d6783f8ca
Merge pull request #13954 from typhoonzero/core_so_donot_depend_libpython
...
core.so do not link libpython test=develop
7 years ago
Qiao Longfei
92cbaa41eb
add GetTimeInSec
7 years ago
typhoonzero
726b91e471
update
7 years ago
Xin Pan
909e1341bd
Merge pull request #13966 from panyx0718/fix4
...
set default timeout to avoiding blocking CI
7 years ago
Qiao Longfei
a1e0f5abb7
add gzstream.cmake
7 years ago
Xin Pan
6de08b5eef
set default timeout to avoiding blocking CI
...
test=develop
7 years ago
Tao Luo
fc63aa72cc
add inference-only fluid library
7 years ago
dzhwinter
804dd7da04
merge conflict. both linux and windows pass.
7 years ago
dzhwinter
962061f0a3
windows fix
7 years ago
wanghaoshuang
b12f7c2399
compile in linux.
7 years ago
Tao Luo
3d976f3f18
rename inference_lib_dist to fluid_lib_dist
...
test=develop
7 years ago
wanghaoshuang
3ae9645084
compile in linux
7 years ago
Tao Luo
75bd0f188b
Merge pull request #13754 from luotao1/fast_math
...
disable EIGEN_FAST_MATH and use_fast_math
7 years ago
Jiabin Yang
34b81f606f
Merge pull request #13749 from JiabinYang/fix/mac_10.14
...
fix compile error on latest Mac OS 10.14
7 years ago
Tao Luo
28889caea5
disable EIGEN_FAST_MATH and use_fast_math
...
test=develop
7 years ago
JiabinYang
cea4952aea
test=develop
7 years ago
JiabinYang
9bd9535d0a
test=develop
7 years ago
JiabinYang
dcd6d9a1b7
test=develop
7 years ago
JiabinYang
0a2a2124c6
test=develop
7 years ago
JiabinYang
8e35b21bbb
test=develop
7 years ago
JiabinYang
dca9c7bb6d
test=develop
7 years ago
JiabinYang
887ebd8b6b
test=develop
7 years ago
Yan Chunwei
c8744d118d
fea/infer executor and concurrency performance issue bug fix ( #13451 )
...
- add naive executor
- fix concurrency performance issue
7 years ago
dzhwinter
3e71ad0cda
flags ( #13540 )
...
* flags
* "fix ci"
7 years ago
dzhwinter
b9fcf8e677
"configure" ( #13539 )
7 years ago
dzhwinter
79463ae784
generic cmake ( #13145 )
...
* generic cmake
* Update generic.cmake
7 years ago
tensor-tang
7f1aa1428a
update mklml version
7 years ago
chengduo
289acfa207
refien generic_cmake_ ( #13457 )
7 years ago
xzl
68e2125e59
fix dso cmake typo
7 years ago
dzhwinter
85f8dd1c77
debug version
7 years ago
dzhwinter
bf1e76e401
"fix dso" ( #13407 )
7 years ago
dzhwinter
e1999538eb
debug the device context
7 years ago