Commit Graph

1341 Commits (fef3654b4e76f5e2cc9a5f71c1c047cef82192e5)

Author SHA1 Message Date
Shang Zhizhou 8699f38d08
Add TRT support for pruned transformer models; fix bug where TensorRT does not support DeletePass (#28517)
4 years ago
YUNSHEN XIE d3b2d07d6e
modified timeout value on windows (#28499)
4 years ago
YUNSHEN XIE 72c78e4dbb
exec ut no more than 15s 2 (#28441)
4 years ago
QingshuChen 6bba8e57b1
fix batch_norm_xpu bug & remove xpusimulator dependence (#28430)
4 years ago
Wilber 648b92c093
[sw] Update compile error for sw (#28419)
4 years ago
石晓伟 0d25d55a86
update the cmake cmd, test=develop (#28344)
4 years ago
wangchaochaohu 337d3832f3
refine (#28366)
4 years ago
Wilber 09fd2b2aab
Paddle support compile on sw (#27858)
4 years ago
Zhou Wei f41104efa3
fix compile out of memory temporary (#28346)
4 years ago
石晓伟 d9b5f1261c
update the version of pybind, test=develop (#28284)
4 years ago
XiaoguangHu d252219779
add git mirror url to speed up clone (#28241)
4 years ago
Zhou Wei 4877bd5944
fix CUDA9 error due to BuildCustomizations (#28222)
4 years ago
Wilber f935ca8a50
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
4 years ago
Zhou Wei 68c473e3e0
fix Automatic GPU detection failed on windows (#28148)
4 years ago
Zhou Wei 5d7000215a
fix dynamic_loader more safe and error message on windows (#28117)
4 years ago
lilong12 ff02173d01
add a comment, test=document_fix (#28008)
4 years ago
lilong12 afce32f32c
build gloo from source code instead of using the pre-compiled library (#27930)
4 years ago
Wilber 9005c5a260
Lite subgraph support arm cpu. (#27827)
4 years ago
chen zhiyu 6335e6a0a6
add musl option (#27798)
4 years ago
Wilber a2d08aa916
update for windows compile. (#27813)
4 years ago
Wojciech Uss 966447e338
Added support for quantization of fusion_gru (#27518)
4 years ago
Li Fuchen 1501a80f74
add support to float64 input of warpctc op. (#27399)
4 years ago
QingshuChen 6b727e08b1
support elementwise add, activation, matmul on Baidu Kunlun (#27143)
4 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
4 years ago
tianshuo78520a a2e0b7cb4a
update gcc8 in python3 ci docker (#26979)
4 years ago
Zhou Wei d20349b548
add unittest count ,install check on windows (#27492)
4 years ago
Wilber ec4155d7d0
windows lib size crop from 5.4G to 3.9G (#27477)
4 years ago
Zhou Wei 292b24aa6d
fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows (#27051)
4 years ago
Zhang Ting 906e7f921e
add fuse_bn_act op (#27230)
4 years ago
Wilber afe94903c3
Rename fluid_inference to paddle_inference. (#27422)
4 years ago
Pei Yang a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128)
4 years ago
Leo Chen c89f269c4d
Fix bug of handling blank characters in operators.cmake (#27310)
4 years ago
chalsliu cb34cf18c0
Set timeout value on windows and mac (#27197)
4 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
4 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
4 years ago
Jacek Czaja e005861598
[oneDNN]Introducing oneDNN 1.6 (#27137)
4 years ago
wangchaochaohu 3b8f5200a5
add dgc cuda11 support for Paddle (#27204)
4 years ago
wangchaochaohu c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 (#27124)
4 years ago
wangchaochaohu 9b7692b144
update gpu compute archs for cuda11 (#27039)
4 years ago
Adam Osewski c2c689582e
Update Paddle-Lite commit hash. (#26413)
5 years ago
YUNSHEN XIE ada1e12982
modified timeout value on windows and mac (#26690)
5 years ago
MRXLT 4ba6483665
fix infer compile bug (#26641)
5 years ago
Wilber 32ba8602c6
Enhance py_func error info message. (#26557)
5 years ago
YUNSHEN XIE 39fe0d35aa
find timeout unittests (#26371)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
Chen Weihang 17bcaef411
Revert "update cub to 1.9.8, test=develop (#24895)" (#26504)
5 years ago
Zhou Wei 62bd7ba13c
specify cuda arch when detection fails (#26420)
5 years ago
liuyuhui 935da32d25
【paddle.fleet】upgrade fleet: modify role_maker (#26038)
5 years ago
Leo Chen ffe52b4452
[OpDevOptimize] Add common infershape functions (#26096)
5 years ago
Jacek Czaja 615e8a20f6
- oneDNN update 1.3 -> 1.5 (#25202)
5 years ago
Zhou Wei 1f74b94d3f
fix compile warning on windows MSVC, fix paddle_build.bat more safe (#25933)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
石晓伟 521e70152f
fix cmake of lite, test=develop (#25680)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Wilber 0e23dc3a2d
[Compile] Update compile for arm FT2000 (#25651)
5 years ago
Aurelius84 6452fd425e
Add warning if gcc version > 8.2 (#25635)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Chen Weihang 404566e459
move nccl.h find code to avoid cmake error (#25589)
5 years ago
Zhaolong Xing 9df18b08f3
Disable windows static library generation (#25593)
5 years ago
Zhou Wei 1ab60544f2
windows CI scripts for xly,test=develop,test=win (#25533)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
Leo Chen c27413e9cb
Attempt to resolve the TLS problem (#25390)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Zhang Ting b9aeb68197
remove EIGEN_MAX_CPP_VER=11, test=develop (#25309)
5 years ago
MRXLT 3b8f0a64c2
Encryption infer (#25119)
5 years ago
Wilber 0371cf6f94
fix compile for lite subgraph. test=develop (#25285)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
Shibo Tao f8d5fd6f9b
generate dummy file using cmake configure_file function to avoid re-generating it. (#25161)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
T8T9 7046165670
remove ${CMAKE_VERSION} VERSION_LESS "3.3.0". (#25128)
5 years ago
石晓伟 6783441e70
fix repeat definitions in liengine.cc, test=develop (#25020)
5 years ago
T8T9 a73a4a8fe7
don't support cmake 3.12, 3.13, 3.14 (#25021)
5 years ago
Zhou Wei 3e04ed2227
fix bug in CUDA_NVCC_FALS and CMAKE_CUDA_FLAGS, and eliminate some warning,test=develop (#24982)
5 years ago
Zhang Ting 0cb0318253
update cub to 1.9.8, test=develop (#24895)
5 years ago
Yanghello 2ca2b90d62
fix cryptopp lib building bug in gcc8 (#24945)
5 years ago
T8T9 90d420b13c
add -DPADDLE_CUDA_BINVER (#24928)
5 years ago
Chen Weihang 4a702ef361
Support SelectedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
T8T9 211ef78c1e
Builtin cuda (#24904)
5 years ago
silingtong123 fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows (#24860)
5 years ago
Yanghello aa47356b74
Add crypto python (#24836)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
Zhou Wei 8a9f06e62d
fix bug when compile CPU inference library (#24800)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
Zhou Wei 80ec2fe71c
fix windows bug that compile .cu files use MSVC dynamic C runtime (#24729)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Pei Yang 21ad122a4a
add more info to version.txt, test=develop (#24551)
5 years ago
Jinhua Liang 1ad6317bc4
fix compile error about cub (#24648)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Wilber 4ec7287602
fix compile when with_nccl=off. test=develop (#24444)
5 years ago
Shibo Tao 30efee339a
Revert "support CUDA using cmake built-in way (#24395). test=develop" (#24468)
5 years ago
Shibo Tao 068d3690c6
support CUDA using cmake built-in way (#24395)
5 years ago
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
Tao Luo 34122e665e
update mklml.cmake to 2019.0.5 (#24179)
5 years ago
Tao Luo e3179ea2f5
refine ccache statistics show (#24167)
5 years ago