Commit Graph

1245 Commits (d12ac984bf24c5f625d418ea1d238f56b61e7551)

Author SHA1 Message Date
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
Tao Luo 34122e665e
update mklml.cmake to 2019.0.5 (#24179)
5 years ago
Tao Luo e3179ea2f5
refine ccache statistics show (#24167)
5 years ago
Tao Luo 29e1968d63
Revert "update mklml.cmake to 2019.0.5 (#24022)" (#24147)
5 years ago
Tao Luo 652e804b41
update mklml.cmake to 2019.0.5 (#24022)
5 years ago
Zhou Wei 6f5669f9bf
Add note about the time cost and change HTTPS to HTTP to avoid unable to download(#24043)
5 years ago
Zeng Jinle d053dfd5fc
fix cuda arch detection (#24036)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhang Ting b89dd86fb6
Update eigen (#23203)
5 years ago
WangXi 752636f94f
cache dgc package (#23941)
5 years ago
Zhaolong Xing c113302826
fix cuda9, volta, turing compile error (#23730)
5 years ago
zhangchunle faf284a9b3
modify cmake/external/*.cmake (#23710)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
石晓伟 9b82e4c183
change the cmake and apis of lite engine, test=develop (#22934)
5 years ago
channings a2e10930cf
update linspace, equal operators to API 2.0 (#23274)
5 years ago
Adam 487f43bbcb
Update DNNL version to 1.3 (#23204)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
xujiaqi01 d0413e58d3
support get pslib version (#22835)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
zhou wei 0fb5ea7814
fix bug that sourcecode of third_party can't be cached correctly,and add cache for xbyak and openblas (#22772)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
zhouwei25 7cf648b315
fix bug of the cmake variable protobuf_MSVC_STATIC_CRT (#22598)
5 years ago
Adam 608447bfd5
Update MKLDNN to v1.2 (#22521)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
石晓伟 53be3f07e9
update internal header files, test=develop (#22379)
5 years ago
Pei Yang 5a1a9a1e59
remove copying trt to inference lib, test=develop (#22470)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
Wilber 55b403e8a8 Modify lite commit id. (#22371)
5 years ago
石晓伟 24f9037e62
update external lite, test=develop (#22347)
5 years ago
Wilber 36afdbd3e1
modify lite commit id to support var_conv_2d cascade. test=develop (#22299)
6 years ago
Leo Chen 032e49c494
fix compile issue, test=develop (#22001)
6 years ago
silingtong123 4f1da4adcb remove the useless third_party library from C++ inference library (#22021)
6 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
6 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
6 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
6 years ago
Zhen Wang 46189b166d Add bn and relu fuse pass (#22048)
6 years ago
baojun f8516ccb53 Upgrade nGraph to use mkldnn v1.1 (#22154)
6 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
6 years ago
zhouwei25 4f7a2bd0d1 tweak the interface of cache_third_party function - expose the SOURCE_DIR for each external library (#21899)
6 years ago
Adam 700fdb1819 MKL-DNN 1.1 for Windows (#22089)
6 years ago
Adam c112b645c4 Update MKL-DNN to 1.1 (#21754)
6 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
6 years ago
zhouwei25 8b15acd71d remove patch command and file of warpctc to Improved quality of Paddle Repo (#21929)
6 years ago
zhouwei25 2df4be5d35 Fix openblas bug to support compile on windows when WITH_MKL=OFF (#21902)
6 years ago
zhouwei25 cad058ce19 remove patch command and file of grpc to Improved quality of Paddle Repo (#21778)
6 years ago
zhouwei25 a01663ca1f remove patch command and file of cares to Improved quality of Paddle Repo (#21776)
6 years ago
zhouwei25 3e1404d208 fix cp bug of warpctc repository,test=develop (#21901)
6 years ago
xujiaqi01 37896e9050
fix compile error when WITH_PSLIB=ON (#21702)
6 years ago
zhouwei25 34dc710641 fix wrong commitID with patch file of warpctc (#21755)
6 years ago
zhouwei25 03133c2c58 fix the bug that cannot pathch command for the second time (#21596)
6 years ago
baojun 45d2fa4e26 update ngraph to v0.27 test=develop (#21677)
6 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
6 years ago
Leo Chen 84b7267100
dygraph_grad_maker supports varbase without grad_var (#21524)
6 years ago
Leo Chen cdd46d7e02
Split VarBase from Python Variable for Dygraph (#21359)
6 years ago
silingtong123 4640178629 modify the personal repo address of eigen and warpctc (#21445)
6 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
6 years ago
Tao Luo 060bf8d0d5
Revert "revert flags.cmake (#21437)" (#21485)
6 years ago
gongweibao c93c9e5bfe revert flags.cmake test=develop (#21437)
6 years ago
Zhaolong Xing 6aa13f46cb
update openblas version (#21450)
6 years ago
zhouwei25 fce24315fb fix cub/threadpool include_dir to match setup.py.in,test=develop (#21436)
6 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
6 years ago
zhouwei25 b39f947698 Eliminate the impact on incremental compilation (#21410)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
Tao Luo d8e7d25274
make CUDA_ARCH_NAME default Auto (#21352)
6 years ago
silingtong123 4b429c190d package the CAPI inference library and third_party (#21299)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
zhouwei25 341dee0657 Cache 3rd source code, improve stability, reduce the compilation time (#21190)
6 years ago
Zeng Jinle 925280b96c
Change GCC version to be 8.2 in Dockerfile.GCC8 (#21222)
6 years ago
zhouwei25 c0dcb090a3 Determine whether to copy and link inference lib by ON_INFER (#20931)
6 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
6 years ago
zhouwei25 5d821578d9 fix bug when build openblas with a computer that has installed openblas before,test=develop (#21160)
6 years ago
Jeng Bai-Cheng 330b173c38 Better TensorRT support (#20858)
6 years ago
zhouwei25 d257355089 Remove useless code of openblas and fix the previous incorrect message (#21092)
6 years ago
Michał Gallus 6cc544aa28 Add Shallow clone to ExternalProjects (#21060)
6 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
zhouwei25 89bc18eec0 move more third party library related logic to third_party.cmake (#20927)
6 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
6 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
6 years ago
zhouwei25 394edd8647 fix mklml and cblas bug,test=develop (#20970)
6 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
6 years ago
zhouwei25 b741761098 Integration of third_party compilation structure (#20887)
6 years ago
wopeizl 3b31b74e20
remove the warning issue test=develop (#20718)
6 years ago
zhouwei25 bcd77e147c Cmake_generotor support has been added to enable multi-version VS support (#20755)
6 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
6 years ago
WangXi 507afa8a8a Fix dgc nan by stripping nccl from sparseReduce. (#20630)
6 years ago
石晓伟 48b27229a8
fix version.cmake, test=develop (#20606)
6 years ago
633WHU 12e4be0382 Dlpack support (#20039)
6 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
6 years ago
zhaoyuchen2018 e867366805
Add multihead op for ernie opt (#19933)
6 years ago
liym27 3aa331d97e fix conv2d and conv3d: (#20042)
6 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
gongweibao ae593e57fa
Add dgc source code to bos platform. (#19892)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
zhouwei25 b5a5d93bbe fix the dependencies of third party and inference lib (#19684)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago