1201 Commits (0c85fd0a1b75c52a495468248c22c08f1b414897)

Author SHA1 Message Date
Jinhua Liang 1ad6317bc4
fix compile error about cub (#24648)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Wilber 4ec7287602
fix compile when with_nccl=off. test=develop (#24444)
5 years ago
Shibo Tao 30efee339a
Revert "support CUDA using cmake built-in way (#24395). test=develop" (#24468)
5 years ago
Shibo Tao 068d3690c6
support CUDA using cmake built-in way (#24395)
5 years ago
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
Tao Luo 34122e665e
update mklml.cmake to 2019.0.5 (#24179)
5 years ago
Tao Luo e3179ea2f5
refine ccache statistics show (#24167)
5 years ago
Tao Luo 29e1968d63
Revert "update mklml.cmake to 2019.0.5 (#24022)" (#24147)
5 years ago
Tao Luo 652e804b41
update mklml.cmake to 2019.0.5 (#24022)
5 years ago
Zhou Wei 6f5669f9bf
Add note about the time cost and change HTTPS to HTTP to avoid unable to download(#24043)
5 years ago
Zeng Jinle d053dfd5fc
fix cuda arch detection (#24036)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhang Ting b89dd86fb6
Update eigen (#23203)
5 years ago
WangXi 752636f94f
cache dgc package (#23941)
5 years ago
Zhaolong Xing c113302826
fix cuda9, volta, turing compile error (#23730)
5 years ago
zhangchunle faf284a9b3
modify cmake/external/*.cmake (#23710)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
石晓伟 9b82e4c183
change the cmake and apis of lite engine, test=develop (#22934)
5 years ago
channings a2e10930cf
update linspace, equal operators to API 2.0 (#23274)
5 years ago
Adam 487f43bbcb
Update DNNL version to 1.3 (#23204)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
xujiaqi01 d0413e58d3
support get pslib version (#22835)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
zhou wei 0fb5ea7814
fix bug that sourcecode of third_party can't be cached correctly,and add cache for xbyak and openblas (#22772)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
zhouwei25 7cf648b315
fix bug of the cmake variable protobuf_MSVC_STATIC_CRT (#22598)
5 years ago
Adam 608447bfd5
Update MKLDNN to v1.2 (#22521)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
石晓伟 53be3f07e9
update internal header files, test=develop (#22379)
5 years ago
Pei Yang 5a1a9a1e59
remove copying trt to inference lib, test=develop (#22470)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
Wilber 55b403e8a8
Modify lite commit id. (#22371)
5 years ago
石晓伟 24f9037e62
update external lite, test=develop (#22347)
5 years ago
Wilber 36afdbd3e1
modify lite commit id to support var_conv_2d cascade. test=develop (#22299)
5 years ago
Leo Chen 032e49c494
fix compile issue, test=develop (#22001)
5 years ago
silingtong123 4f1da4adcb
remove the useless third_party library from C++ inference library (#22021)
5 years ago
zhouwei25 549e6de7ac
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
5 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
Zhen Wang 46189b166d
Add bn and relu fuse pass (#22048)
5 years ago
baojun f8516ccb53
Upgrade nGraph to use mkldnn v1.1 (#22154)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
zhouwei25 4f7a2bd0d1
tweak the interface of cache_third_party function - expose the SOURCE_DIR for each external library (#21899)
5 years ago
Adam 700fdb1819
MKL-DNN 1.1 for Windows (#22089)
5 years ago