Commit Graph

1388 Commits (c41fd033e5b4efd93a1ff738f7ee029a65075b50)

Author SHA1 Message Date
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
YUNSHEN XIE d8984a6b90
limit timeout value setting on linux (#26923)
5 years ago
Zhou Wei 1771d9f880
fix cache judge more safe (#26910)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
5 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
5 years ago
Adam 8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
Pei Yang e3f8e5cf5c
trt int8 support conv2d_transpose (#26636)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
chalsliu dc56c89822
Add the option to execute unit tests only at night (#26669)
5 years ago
LoveAn 02fc1fef8b
Fix the cmake-function named inference_download_and_uncompress on Windows (#26512)
5 years ago
YUNSHEN XIE a8b5741fb4
add a few unittests for setting timeout properity (#26630)
5 years ago
Pei Yang 379222c3f1
add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499)
5 years ago
Zhaolong Xing b7a86e92a8
fix dy shape bug in trt7.1 (#26273)
5 years ago
Pei Yang b757466b0d
fix trt dynamic ernie serialization unit test (#26228)
5 years ago
Wilber c84aa9c61f
update diff val. (#26242)
5 years ago
Wilber fb72b192e7
[DOC] Fix dead link (#26154)
5 years ago
lidanqing 65b97d6215
GRU model xnli dataset C++ tester (#25534)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
Pei Yang 9e9a569dae
add trt int8 support for elementwise_mul and scale (#25676)
5 years ago
Zhaolong Xing bcddefef39
[Fix Ut]: fix inference ut which exist bug on windows. (#25814)
5 years ago
Pei Yang eef98b7f86
add macro check for using TRT api dynamicRangeIsSet() (#25694)
5 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Zhaolong Xing 9df18b08f3
Disable windows static library generation (#25593)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Jeng Bai-Cheng fc93266b0a
Improve qkv transpose performance (#23919)
5 years ago
Chen Weihang 7be285a66f
remove useless property, test=develop (#25461)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
MRXLT 3b8f0a64c2
Encryption infer (#25119)
5 years ago
Wilber 4474fc1033
fix compile on windows. test=develop (#25310)
5 years ago
Wilber 0371cf6f94
fix compile for lite subgraph. test=develop (#25285)
5 years ago
Wojciech Uss 23a4f54b73
rename qat into quant (#24948)
5 years ago
Sylwester Fraczek 36abeff44f
adding elementwiseadd quantization (#25178)
5 years ago
Wojciech Uss 56fa3880e3
rename qat into quant in filenames only (#25194)
5 years ago
Wilber f78e161ea3
remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
iducn f282599229
disable unitest for gcc8(#25134)
5 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
5 years ago
石晓伟 9ab3cf039c
remove useless test_dot, test=develop (#24957)
5 years ago
石晓伟 6783441e70
fix repeat definitions in liengine.cc, test=develop (#25020)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
silingtong123 37bdb5269f
test=develop, add log message in the function UpdateDllFlag (#24937)
5 years ago
silingtong123 640196c446
test=develop, remove the tensorrt dll file from windows package (#24922)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
silingtong123 ef9b36873d
test=develop, remove the gflags/gflags.h form paddle_api.h (#24921)
5 years ago
silingtong123 fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows (#24860)
5 years ago
石晓伟 76cdbb84b0
ignore warnings of external libraries, test=develop (#24193)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Wilber 1e190a9e02
[Inference] [unittest] Inference unit tests rely on dynamic libraries (2) (#24859)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
Jacek Czaja d0307145a3
[oneDNN] disabling oneDNN inplace pass (#24406)
5 years ago
Tao Luo 72c370c8d2
remove unused test_multi_thread_helper.h (#24399)
5 years ago
Tao Luo 63da846de0
remove old inference C++ tests (#24368)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
Pei Yang 695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805)
5 years ago
Zhaolong Xing 35148d17f7
[BUG]: Head number can only be > 1 on multihead op (#23974)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zhaolong Xing 133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
lidanqing 2291634c5c
Add user local data preprocess support (#23692)
5 years ago
chenhaoze b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement (#23742)
5 years ago
Pei Yang c528f1d4f3
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672)
5 years ago
Pei Yang 015acdbfb1
Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
Zhaolong Xing 3acb047a20
[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396)
5 years ago
chenhaoze 9b06dd8628
Add three passes and api reference of paddle_pass_builder. test=develop (#23741)
5 years ago
Zhaolong Xing ed5766ffbc
refine act conv2d pool2d trt converter log (#23605)
5 years ago
Pei Yang 28f04c6a5e
refine shuffle channel errmsg, test=develop (#23520)
5 years ago
Tao Luo e4f1b1c5e1
solve mklml memory leak (#23557)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Pei Yang 3d5d217030
Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)", test=develop (#23642)
5 years ago
Pei Yang eb11633611
batch_norm trt converter error message, test=develop (#23620)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
Pei Yang cdc6d4e292
[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536)
5 years ago
Pei Yang 42655ef721
Add full_like op. (#23364)
5 years ago
石晓伟 36b82eae0e
refine the doc of paddle_api.h, test=develop (#23402)
5 years ago