995 Commits (5305b2749a4c5f913b0fa8b5ffe6ba616b621bab)

Author SHA1 Message Date
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 751305ecf0
Add flags to control call stack of error message (#25997)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Chen Weihang 9b5a65b819
refine init signal handler meg dumper (#25911)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
Chen Weihang 2469b578f5
Unified paddle error format when catch system signal (#25765)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
Chen Weihang a6abd92dfd
Polish install error hint message (#25531)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
GaoWei8 c10dcff12d
refine PADDLE_ENFORCE (#25456)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
Chen Weihang 7be285a66f
remove useless property, test=develop (#25461)
5 years ago
Jacek Czaja a5d1592f6c
Added missing oneDNN format (#25450)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
GaoWei8 ea7e532598
Refine PADDLE_ENFORCE (#25369)
5 years ago
GaoWei8 fb70682f00
fix PADDLE_ENFORCE (#25297)
5 years ago
Chen Weihang 5a959f6e6e
Refactor dynamic dso search functions (#25214)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
Chen Weihang 353ea9e8ad
Add default cudnn lib path (#25175)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Tao Luo 2996315fc9
fix profiler_test on win32 (#25073)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
wangchaochaohu feba131893
fix the sgement fault error of profiler in seqseq model test=develop (#24952)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
wangchaochaohu 79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
5 years ago
Adam 56a714a19b
Add isCached() machinism to oneDNN pooling primitive (#24724)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Adam 586b587519
Add isCached() check in Softmax handler (#24637)
5 years ago
Leo Chen 1d03469685
use vector instead of pointer, test=develop (#24620)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
Adam dcf17f4813
Add isCached() mechanism to elementwise_add DNNL (#24563)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Guo Sheng 4a5de14426
Remove cusolver potrfBatched support on Windows. (#24338)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
wangchaochaohu 6bf26ef156
fix warning mac compiler (#24138)
5 years ago
Guo Sheng a8c0fb4e86
Add cholesky_op (#23543)
5 years ago
wangchaochaohu 6ba7c3ac92
Reduce the construction time of fuction about profiler (#24117)
5 years ago
石晓伟 34d7d6aef0
declare the stream::Priority as enum class, test=develop (#24013)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
石晓伟 db6d867383
add boost dependency to cuda_stream (#24032)
5 years ago
石晓伟 d2584a7082
New feature: thread local allocator, test=develop (#23989)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhang Ting b89dd86fb6
Update eigen (#23203)
5 years ago
石晓伟 2d01cc85c4
DeviceContext Split, test=develop (#23737)
5 years ago
guofei c2a60bb1fa
Correct the wrong name in the flag comment (#22977)
5 years ago
Yi Liu 14e7041c6d
Fix CUDAHandleHolder destruction problem. (#23772)
5 years ago
Michał Gallus a63bcf9ae7
[DNNL][INT8][FP32] MatMul (#23395)
5 years ago
littletomatodonkey 1c08a2136e
test=develop, add addmm op (#23384)
5 years ago
Zeng Jinle 674355a097
fix GET_DATA_SAFELY ptr, test=develop (#23679)
5 years ago
silingtong123 c6d14bc839
show the exception messages of cpp inference library in msvc (#23702)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhang Ting 480530c4e3
API(place-related) error message enhancement (#23515)
5 years ago
Chen Weihang 16315d3d9e
Delete Ref & VectorRef and add GetDataSafely (#22997)
5 years ago
Leo Chen f297a33285
Dev/fix init flags (#23465)
5 years ago
Chen Weihang 7f1ad510bd
Add op inout check macro to simplify error message writing (#23430)
5 years ago
Adam da7c73f847
Delete is_test attribute from activation operators (#23318)
5 years ago
石晓伟 5c59d2139e
reverts the commit 23177, test=develop (#23363)
5 years ago
Yi Liu 0471476a18
fix nccl comm double free bug (#23344)
5 years ago
wangchaochaohu 1ee2a9a424
Profiler refine (#23294)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
石晓伟 75ebb48a91
supports thread-binding stream, test=develop (#23177)
5 years ago
Zeng Jinle 77b4dc80c9
code polish for adding const qualifier, test=develop, test=document_fix (#23248)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
6 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
6 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
6 years ago
wangchaochaohu 99db0cf762
remove debug log test=develop (#22994)
6 years ago
wangchaochaohu c979c9f2b0
refine the profiler print test=develop (#22968)
6 years ago
Zhang Ting ca9c8b417d
fix compute ratio of profile, test=develop (#22872)
6 years ago
wangchaochaohu dbb0b9b3b6
refine the profiler print (#22823)
6 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
6 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
6 years ago
wangchaochaohu 8456c3f4dd
polish the profiler_help code (#22811)
6 years ago
wangchaochaohu 7578fcbac4
Profile code refine (#22800)
6 years ago
Adam 2b80e9a719
Add cpu_info without XBYAK (#22716)
6 years ago
Zhang Ting f97f3f9301
add framework overhead ratio in profile report (#22590)
6 years ago
wangchaochaohu 611411b90e
Fusion group profile support (#22718)
6 years ago