1140 Commits (f04f2b232a22c9aba3ee4538ab708acf9f77c813)

Author SHA1 Message Date
Zhaolong Xing e89b12884a
Fix C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)
5 years ago
Aurelius84 99a9615a4b
Removing length dims constraints of seq_pad and seq_unpad (#19497)
5 years ago
pawelpiotrowicz 2c5c636514
Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840)
5 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
5 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
5 years ago
石晓伟 d004a0f50e
fix multi-thread exec of trt, test=develop (#19338)
5 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
5 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
5 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Tao Luo f05d2c519d
paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
baojun a3a4b6e570
Enable ngraph through build_strategy (#19266)
6 years ago
Yiqun Liu c5548178b0
Add a pass to enable the use of cudnn (#19346)
6 years ago
liuwei1031 d6cb1a4122
add dynamic C runtime support on windows, test=develop (#19502)
6 years ago
Yiqun Liu fcec365d29
Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
lidanqing 9240e5325c
add local user data conversion into full_pascalvoc_test_preprocess.py (#19283)
6 years ago
Adam 97d1db1874
Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
lidanqing 07a4d8f8d6
Fix mAP problem in unit test of int8 object detection test (#18946)
6 years ago
Adam b837689e97
Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Tao Luo 741ce8bb1a
inference_shared_library support profile (#16275)
6 years ago
mapingshuo 4ad7c9d5a7
[WIP] Add Imdb train demo (#18895)
6 years ago
silingtong123 fd3b666d8c
test=develop,Synchronize the contents of develop with release1.5 (#18937)
6 years ago
Zhaolong Xing 3816d221ff
Fix the CE error which caused by paddle-trt version (#18941)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
liuwei1031 0d99690809
fix several security bugs reported by security team (#18831)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
Leo Zhao 10eeed93d1
Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)
6 years ago
Huihuang Zheng cfce4994cf
Merge cuda 9/10 dockerfile with root dockerfile (#18693)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
guru4elephant d714bf037c
remove async executor and add data_feed.proto to the deps of train demo (#18659)
6 years ago
石晓伟 25d8079140
Fix Bitmain Predictor::Clone() (#18599)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
石晓伟 1529154821
Support Bitmain Anakin (#18542)
6 years ago
Leo Zhao ce38bb5341
use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
bingyanghuang 3fe6bf5ee6
fix command line bug in int8v2 readme (#18507)
6 years ago
石晓伟 047bba855b
Remove the obsolete cmake options (#18481)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
Michał Gallus 7023a86c3a
Fix Pooling output scale (#18186)
6 years ago
Michał Gallus 8409693272
Reset DeviceContext after quantization warmup (#18182)
6 years ago
Sylwester Fraczek 9252e8fa08
add int8 mkldnn prior_box (#17242)
6 years ago