Commit Graph

1012 Commits (28164b266f4639c48fad7923caebbc8fb4921b45)

Author SHA1 Message Date
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
wangchaochaohu 79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
5 years ago
Adam 56a714a19b
Add isCached() machinism to oneDNN pooling primitive (#24724)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Adam 586b587519
Add isCached() check in Softmax handler (#24637)
5 years ago
Leo Chen 1d03469685
use vector instead of pointer, test=develop (#24620)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
Adam dcf17f4813
Add isCached() mechanism to elementwise_add DNNL (#24563)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Guo Sheng 4a5de14426
Remove cusolver potrfBatched support on Windows. (#24338)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
wangchaochaohu 6bf26ef156
fix warning mac compiler (#24138)
5 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
5 years ago
wangchaochaohu 6ba7c3ac92
Reduce the construction time of fuction about profiler (#24117)
5 years ago
石晓伟 34d7d6aef0
declare the stream::Priority as enum class, test=develop (#24013)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
石晓伟 db6d867383
add boost dependency to cuda_stream (#24032)
5 years ago
石晓伟 d2584a7082
New feature: thread local allocator, test=develop (#23989)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhang Ting b89dd86fb6
Update eigen (#23203)
5 years ago
石晓伟 2d01cc85c4
DeviceContext Split, test=develop (#23737)
5 years ago
guofei c2a60bb1fa
Correct the wrong name in the flag comment (#22977)
5 years ago
Yi Liu 14e7041c6d
Fix CUDAHandleHolder destruction problem. (#23772)
5 years ago
Michał Gallus a63bcf9ae7
[DNNL][INT8][FP32] MatMul (#23395)
5 years ago
littletomatodonkey 1c08a2136e
test=develop, add addmm op (#23384)
5 years ago
Zeng Jinle 674355a097
fix GET_DATA_SAFELY ptr, test=develop (#23679)
5 years ago
silingtong123 c6d14bc839
show the exception messages of cpp inference library in msvc (#23702)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhang Ting 480530c4e3
API(place-related) error message enhancement (#23515)
5 years ago
Chen Weihang 16315d3d9e
Delete Ref & VectorRef and add GetDataSafely (#22997)
5 years ago
Leo Chen f297a33285
Dev/fix init flags (#23465)
5 years ago
Chen Weihang 7f1ad510bd
Add op inout check macro to simplify error message writing (#23430)
5 years ago
Adam da7c73f847
Delete is_test attribute from activation operators (#23318)
5 years ago
石晓伟 5c59d2139e
reverts the commit 23177, test=develop (#23363)
5 years ago
Yi Liu 0471476a18
fix nccl comm double free bug (#23344)
5 years ago
wangchaochaohu 1ee2a9a424
Profiler refine (#23294)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
石晓伟 75ebb48a91
supports thread-binding stream, test=develop (#23177)
5 years ago
Zeng Jinle 77b4dc80c9
code polish for adding const qualifier, test=develop, test=document_fix (#23248)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
wangchaochaohu 99db0cf762
remove debug log test=develop (#22994)
5 years ago
wangchaochaohu c979c9f2b0
refine the profiler print test=develop (#22968)
5 years ago
Zhang Ting ca9c8b417d
fix compute ratio of profile, test=develop (#22872)
5 years ago
wangchaochaohu dbb0b9b3b6
refine the profiler print (#22823)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
5 years ago
wangchaochaohu 8456c3f4dd
polish the profiler_help code (#22811)
5 years ago
wangchaochaohu 7578fcbac4
Profile code refine (#22800)
5 years ago
Adam 2b80e9a719
Add cpu_info without XBYAK (#22716)
5 years ago
Zhang Ting f97f3f9301
add framework overhead ratio in profile report (#22590)
5 years ago
wangchaochaohu 611411b90e
Fusion group profile support (#22718)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
wangchaochaohu a089072c8b
fix the profile print error (#22665)
5 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
5 years ago
Chen Weihang fe685cc185
fix enforce test error, test=develop (#22610)
5 years ago
Chen Weihang 266106da75
Fix mismatch with plus sign in the line (#22588)
5 years ago
Wilber de009152a7 Compile without nccl deps. [2/2] (#22484)
5 years ago
LielinJiang 2b1386b2b2
optimize performance of interpolate op (#22436)
5 years ago
wangchaochaohu 77dd0d97bb
use enum class to replace the usage of enum in some condition test=develop (#22464)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
5 years ago
wangchaochaohu 621d3e0b66
fix the bug of profile update (#22207)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Yiqun Liu 96980c2244
Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
5 years ago
wangchaochaohu c3876cf82d
add support for nested profiling event and printing in different level (#22061)
5 years ago
zhaoyuchen2018 3d4f2aa689
Refine stack op to improve xlnet performance, test=develop (#22142)
5 years ago
Zeng Jinle 4c2df8e4d4
fix allocator strategy comment, test=develop, test=document_fix (#22121)
5 years ago
bingyanghuang 7872d06ff4 Add explanation on conv grad for dims<3 (#22125)
5 years ago
Chen Weihang ba8414d3a5
replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109)
5 years ago
Jacek Czaja b0b27ff699 [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088)
5 years ago
Zeng Jinle 9587249442
polish allocator strategy doc, test=develop, test=document_fix (#22095)
5 years ago
Zeng Jinle d9f5d1eb29
ag allocator by default, test=develop (#21837)
5 years ago
Jacek Czaja ad8a9cb82c [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747)
5 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
5 years ago
Chen Weihang 2e9082250d
polish default error msg & cublas error hint, test=develop (#22032)
5 years ago
Chen Weihang 35ff1568e9 Add error message for cublas inItizalize failed (#21995)
5 years ago
Chen Weihang fbb42173a9
fix no hint problem when use ENFORCE for cuda, test=develop (#21994)
5 years ago
Chen Weihang 1fd1f06f11 Rename paddle throw error macro (#21657)
5 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
Zeng Jinle 97e76cb96d
refine dev_ctx.Wait() exception throw, test=develop (#21600)
5 years ago
Huihuang Zheng b241c7329c
Refine a Warning Which Can Occur Not Only During Init (#21546)
5 years ago
wangchaochaohu 932aca162d
Add Branch to avoid CPU profiler warning print (#21556)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
5 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
Huihuang Zheng a71f53d7ac
Add warning message when initialize GLOG failed. (#21487)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
5 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
5 years ago
wangchaochaohu 8293f21a52
Profile refine (#21258)
5 years ago
wangchaochaohu e0e205ea2d
fix the profiling bug test=develop (#21396)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
5 years ago
liuwei1031 d8b6cf2bcd
fix sporadically hang issue on windows(#21201)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Chen Weihang b3a3e6f60c change cuda enforce & add example (#21142)
5 years ago
Chen Weihang 27fa9c100b
add examples for resource exhausted error, test=develop (#21140)
5 years ago
Chen Weihang edd6680a71
Further simplify the C++ error info stack (#21093)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Zeng Jinle a710ccc0cb
refine error message of allocator again, test=develop (#21023)
5 years ago
wangchaochaohu 7695b713e1
gpu info query refine test=develop (#20904)
5 years ago
Chen Weihang 3358455c86
Polish and arrange code in enforce.h (#20901)
5 years ago
Chen Weihang 8b59ac3ad0 delete paddle infershape enforce marco (#20832)
5 years ago
Chen Weihang 1d1552d106
Make formatted ENFORCE stack adapt to more situations (#20826)
5 years ago
Adam 67b59ddb38 Minor MKL-DNN conv int8 performance fixes (#20753)
5 years ago
123malin 95e90aa102
test=develop, add communicator_is_sgd_optimizer flag (#20677)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
WangXi 507afa8a8a Fix dgc nan by stripping nccl from sparseReduce. (#20630)
5 years ago
lidanqing 46e93f7c86 Revert "Refactor conv computeINT8" (#20640)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Wilber 751812a674
enable cpu machine to run paddle in gpu lib
5 years ago
Zeng Jinle 1d1d221f26
refine allocator_flag, test=develop, test=document_fix (#20400)
5 years ago
danleifeng 425279a57b Improve elementwise operators performance in same dimensions. (#19763)
5 years ago
qingqing01 1a3eef026c
Enable users to create custom cpp op outside framework. (#19256)
5 years ago
liym27 24010472d4 fix pool2d pool3d,support asymmetric padding and channel_last (#19739)
5 years ago
Chen Weihang b916335025 Paddle error message stack shaping and optimization (#19895)
5 years ago
joanna.wozna.intel 1d32897c5c Fix test pool2d int8 mkldnn (#19976)
5 years ago
Zeng Jinle 37f76407b0
fix cuda dev_ctx allocator cmake deps, test=develop (#19953)
5 years ago
Jacek Czaja 5b07ca9cdd - ReImplemented pooling fwd mkldnn (#19911)
5 years ago
chengduo d7251a8e1e
Delete local execution scopes (#19749)
5 years ago
Zeng Jinle c7f36e7c00
Add lock to cudnn handle calls (#19845)
5 years ago
Zeng Jinle b25d1e758d
remove enforce.h file written, test=develop (#19897)
5 years ago
Jacek Czaja 619c797a7f [MKL-DNN] LRN refactoring (#19798)
5 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
5 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
5 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
5 years ago
Zeng Jinle 5eb381a3e2
refine reallocate of workspace size, test=develop (#19843)
5 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
5 years ago
Zeng Jinle 32b1151f5e
reduce default value of cudnn workspace size, test=develop (#19780)
5 years ago