Zhou Wei
3e04ed2227
fix bug in CUDA_NVCC_FLAGS and CMAKE_CUDA_FLAGS, and eliminate some warnings, test=develop ( #24982 )
...
fix bug in CUDA_NVCC_FLAGS and CMAKE_CUDA_FLAGS
5 years ago
Zhang Ting
0cb0318253
update cub to 1.9.8, test=develop ( #24895 )
5 years ago
Yanghello
2ca2b90d62
fix cryptopp lib building bug in gcc8 ( #24945 )
5 years ago
T8T9
90d420b13c
add -DPADDLE_CUDA_BINVER ( #24928 )
...
* add -DPADDLE_CUDA_BINVER. test=develop, test=win_gpu
* nvcc picks up options set with add_compile_options; avoid using it if you don't want to pass arguments to nvcc. test=develop
* test=develop, test=win_gpu
5 years ago
Chen Weihang
4a702ef361
Support SelectedRows allreduce in multi-cards imperative mode ( #24690 )
...
* support selectedrows allreduce in multi-cards dygraph, test=develop
* remove useless import modules in unittests, test=develop
* add nccl cmake to get nccl version, test=develop
* add if-condition to compile correctly, test=develop
* add detailed version parsing for old nccl, test=develop
* polish cmake details, test=develop
* fix remove test cmake error, test=develop
* fix cmake condition, test=develop
* change unittest cmake list, test=develop
* fix unittest cmake rule, test=develop, test=framep0
5 years ago
T8T9
211ef78c1e
Builtin cuda ( #24904 )
...
* support CUDA using cmake built-in way (#24395 )
* support CUDA using cmake built-in way. test=develop
* test=develop
* cmake_minimum_required 3.10
* test=develop
5 years ago
silingtong123
fc4435174b
test=develop, fix the bug that the tensorrt package can't compile on windows ( #24860 )
...
* test=develop, fix a bug
* test=develop, remove the macro of PADDLE_DLL_INFERENCE
5 years ago
Yanghello
aa47356b74
Add crypto python ( #24836 )
...
* add crypto helper for paddle, test=develop
* cryptopp.cmake bug fixed, test=develop
* remove debug build type, test=develop
* fixed CMakeLists for new target, test=develop
* fix CI bug, test=develop
* add cmake option flag DWITH_CRYPTO, test=develop
* add crypto api for python, test=develop
* Revert "add crypto api for python, test=develop"
This reverts commit 3a1cfa9d055fab357f46e653a8786f96336f6b47.
* Revert "Add crypto api (#24694 )"
This reverts commit 5a7a517cde.
* Revert "Revert "Add crypto api (#24694 )""
This reverts commit f952b19fa7e8b7f9c57d31d78b9ffee1041c43ed.
* fixed cryptopp cmake building error, test=develop
* change WITH_CRYPTO building option to OFF, test=develop
* fixed cipher test failed, test=develop
* "add crypto api for python, test=develop"
This reverts commit 83fb55c0668d59afad2ad1e7e04d425c7c7dd189.
* travis CI bug fixed, test=develop
* fixed test in python3
* test=develop
* fixed unittest, test=develop
5 years ago
Yanghello
62b4ff7dd2
Fix Aes_cipher_test and cipher_utils_test failures ( #24816 )
5 years ago
Wilber
f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries ( #24743 )
5 years ago
Zhou Wei
8a9f06e62d
fix bug when compiling CPU inference library ( #24800 )
5 years ago
silingtong123
126d3d693b
support C++ inference shared library on windows ( #24672 )
...
* add SetCommandLineOption
* add the print_FLAGS function
* remove the test demo
* modify the location of macro
* add the 'WITH_STATIC_LIB' option on windows
* modify the macro of PD_INFER_DECL
* modify the function name
* modify the unittest
* modify the code style
5 years ago
Zhou Wei
d1047d0a69
add WITH_GPU for cudaerror download ( #24056 )
5 years ago
Zhou Wei
80ec2fe71c
fix windows bug that compiling .cu files uses MSVC dynamic C runtime ( #24729 )
5 years ago
Yanghello
5a7a517cde
Add crypto api ( #24694 )
5 years ago
Pei Yang
21ad122a4a
add more info to version.txt, test=develop ( #24551 )
5 years ago
Jinhua Liang
1ad6317bc4
fix compile error about cub ( #24648 )
5 years ago
Jacek Czaja
3292f0ef58
[onednn] elementwise add broadcasting support ( #24594 )
5 years ago
Wilber
4ec7287602
fix compile when with_nccl=off. test=develop ( #24444 )
5 years ago
Shibo Tao
30efee339a
Revert "support CUDA using cmake built-in way ( #24395 ). test=develop" ( #24468 )
...
This reverts commit 068d3690c6.
5 years ago
Shibo Tao
068d3690c6
support CUDA using cmake built-in way ( #24395 )
...
* support CUDA using cmake built-in way. test=develop
* test=develop
5 years ago
Pei Yang
8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop ( #24329 )
5 years ago
Tao Luo
9eedf05d2f
solve mklml memory leak on windows ( #24015 )
...
* solve mklml memory leak on windows
test=develop
* remove unused msvcr120.dll
test=develop
5 years ago
Guo Sheng
1fc6cc502a
Fix cusolver loader for Windows ( #24157 )
...
* Fix cusolver loader for Windows in dynamic_loader.cc. test=develop
* Fix missing CUSOLVER_ROUTINE_EACH_R1.
test=gpu
test=develop
* Mark cusolver as unsupported on Windows temporarily. test=develop
* Fix GetCusolverDsoHandle error message. test=develop
5 years ago
Tao Luo
34122e665e
update mklml.cmake to 2019.0.5 ( #24179 )
...
test=develop
5 years ago
Tao Luo
e3179ea2f5
refine ccache statistics show ( #24167 )
...
test=develop
5 years ago
Tao Luo
29e1968d63
Revert "update mklml.cmake to 2019.0.5 ( #24022 )" ( #24147 )
...
This reverts commit 652e804b41.
test=develop
5 years ago
Tao Luo
652e804b41
update mklml.cmake to 2019.0.5 ( #24022 )
...
* update mklml.cmake to 2019.0.5
test=develop
* update mklml.cmake with new version
test=develop
5 years ago
Zhou Wei
6f5669f9bf
Add note about the time cost and change HTTPS to HTTP to avoid download failures ( #24043 )
5 years ago
Zeng Jinle
d053dfd5fc
fix cuda arch detection ( #24036 )
5 years ago
Zhou Wei
7817003795
Optimize the error messages of paddle CUDA API ( #23816 )
...
* Optimize the error messages of paddle CUDA API, test=develop
* fix the error messages of paddle CUDA API, test=develop
* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL, test=develop
* remove build_ex_string, test=develop
* merge conflict, test=develop
5 years ago
Zhang Ting
b89dd86fb6
Update eigen ( #23203 )
...
* update eigen, test=develop
* remove patches, test=develop
* add definition of -fabi-version, test=develop
* add patch for TensorBlock.h, test=develop
* test windows, test=develop
* only update eigen for Linux, test=develop
* add code comments, test=develop
5 years ago
WangXi
752636f94f
cache dgc package ( #23941 )
5 years ago
Zhaolong Xing
c113302826
fix cuda9, volta, turing compile error ( #23730 )
5 years ago
zhangchunle
faf284a9b3
modify cmake/external/*.cmake ( #23710 )
5 years ago
mozga-intel
3baaee9aab
Remove: NGraph engine from PDPD repository ( #23545 )
...
* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop
* Remove ngraph from:
1. Cmake file
2. Python file
test=develop
5 years ago
石晓伟
9b82e4c183
change the cmake and apis of lite engine, test=develop ( #22934 )
...
* change the cmake and apis of lite engine, test=develop
* change the cmake of lite engine, test=develop
5 years ago
channings
a2e10930cf
update linspace, equal operators to API 2.0 ( #23274 )
...
* update linspace, equal operators to API 2.0, test=develop
* equal support higher performance CUDA kernel, test=develop
* update comment of equal&linspace operator, test=develop
* update comment of equal&linspace operator, test=develop
5 years ago
Adam
487f43bbcb
Update DNNL version to 1.3 ( #23204 )
5 years ago
Zhaolong Xing
430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. ( #23138 )
...
* add dynamic plugin support.
test=develop
* change emb eltwise layernorm to math function
test=develop
* add emb eltwise layernorm
test=develop
* can run dynamic shape ernie
test=develop
* fix ci
test=develop
* add ut for trt ernie dynamic
test=develop
* refine dynamic shape c++ interface.
test=develop
* fix comments
test=develop
* fix comments
test=develop
5 years ago
xujiaqi01
d0413e58d3
support get pslib version ( #22835 )
...
* get pslib version
* test=develop
5 years ago
Zhaolong Xing
8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse ( #22494 )
...
* 1. add embedding eltwise layernorm fuse
2. add embedding eltwise layernorm op
3. refine inplace_add_relu
4. refine fc_eltwise_layernorm
test=develop
* 1. refine fc
test=develop
* fix comments
test=develop
* fix comments
test=develop
5 years ago
石晓伟
ddb9b46fec
change the function in op_teller, test=develop ( #22794 )
...
* change the function in op_teller, test=develop
* correct the commit-id, test=develop
5 years ago
zhou wei
0fb5ea7814
fix bug that source code of third_party can't be cached correctly, and add cache for xbyak and openblas ( #22772 )
5 years ago
tianshuo78520a
433cef03e5
fix typo word ( #22784 )
5 years ago
hutuxian
175954d894
PaddleBox Framework Part2 ( #22466 )
...
* Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
* Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they cannot be distributed evenly.
* Remove CPU code in Pull/PushSparse; we will add it back once it is fully tested.
* Fix some known issues, such as copying persistable vars after running one epoch.
5 years ago
zhouwei25
7cf648b315
fix bug of the cmake variable protobuf_MSVC_STATIC_CRT ( #22598 )
5 years ago
Adam
608447bfd5
Update MKLDNN to v1.2 ( #22521 )
5 years ago
flame
1d503e6a9e
Golang inference API ( #22503 )
...
* support golang inference
5 years ago
石晓伟
53be3f07e9
update internal header files, test=develop ( #22379 )
5 years ago