Paddle

Commit Graph

Author	SHA1	Message	Date
mozga-intel	3baaee9aab	Remove: NGraph engine from PDPD repository (#23545 ) * Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop	5 years ago
Zhaolong Xing	6a23850a3f	add init value to varis in analysis config. (#23442 )	5 years ago
Pei Yang	7e439780d9	add full paddle_analysis_config.h APIs. (#23215 )	5 years ago
Zhaolong Xing	430b0099c9	[Paddle-TRT]: Ernie Dynamic shape support. (#23138 ) * add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop	5 years ago
Pei Yang	46b8d282dc	Add some inference API comments for AnalysisConfig (#23117 ) * add some API comments in paddle_analysis_config.h, test=develop * add some API comments in paddle_analysis_config.h, test=develop	5 years ago
Zhaolong Xing	dd67d44a50	[Paddle-TRT] : (Part1) Dynamic shape support (#22868 ) * change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop	5 years ago
GaoWei8	cdf5f6fb8c	Add an inference interface to disable FC padding (#22097 ) * Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error	5 years ago
石晓伟	e1b0d7cbb1	remove anakin from code, test=develop (#22420 )	5 years ago
石晓伟	ad0dfb17c1	[Feature] Lite subgraph (#22114 )	5 years ago
Pei Yang	443f604c3b	add DisableGlogInfo() to AnalysisConfig, test=develop (#20581 )	5 years ago
Zhaolong Xing	e89b12884a	FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969 ) * fix memory optimization type test=develop * 1. fix BUG: open trt and memory optim will trigger bug. 2. Clean memory optim bug. test=develop	5 years ago
Pei Yang	9cbc1eff2d	zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822 )	5 years ago
Yiqun Liu	c5548178b0	A a pass to enable the use of cudnn (#19346 ) * Add a interface to enable cudnn for inference. * Add cudnn_placement_pass. test=develop * Set the default value of cudnn_enabled_op_types to null. test=develop * Write the common basic class, placement_pass_base, to refine the codes. test=develop * Call EnableCUDNN in unittest. test=develop * Refine cudnn_placement_pass tester. * Enable the testing of cudnn_placement_pass in inference's unittest. test=develop * Add the check of op kernels. test=develop	6 years ago
Zhaolong Xing	61238d31f7	Trt fp16 support (#18860 ) * Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop * 1 add trt fp16 support test=develop	6 years ago
Tao Luo	076f833110	add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580 ) * add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop * enhance MkldnnPostReset test=develop * add comments for mkldnn_cache_capacity field test=develop	6 years ago
Zhaolong Xing	88b52a27fe	Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532 ) * Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop	6 years ago
Zhaolong Xing	ae576f3c68	fix: when use the load model from memory mode, the RAM occupy is high (#17788 ) test=develop	6 years ago
翟飞跃	993c703bcc	INT8 MKL-DNN v2 integrate to slim (#17634 ) * refactor PR 16865 * delete mergetool files * test=develop * test=develop * test=develop * test=develop * create dir for int8 model before call SaveOptimModel * test=develop * mkldnn int8 only support linux; test=develop * refine code; test=develop * remove comment; test=develop * refine code; test=develop * fix bug; test=develop * add exception for mkldnn_post_training_strategy * reuse int8v2 CAPI dataset; test=develop * fix accuracy check bug; test=develop * remove tab * convert files to unix format * test=develop * reduce CI time;test=develop * reduce CI time and refine code;test=develop * refine comment; test=develop * add cmake FLAGS;test=develop * remove predict_num;test=develop	6 years ago
mozga-intel	5eb81fe595	Capi for a ngraph engine (#17037 )	6 years ago
Zhaolong Xing	61221ebc28	TRT: Support set dynamic range in int8 mode. (#17524 ) * fluid int8 train and trt int8 predict align. trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference. enhance quant dequant fuse pass enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. i modify the c++ interface, but forget to modify the pybind code fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop	6 years ago
石晓伟	a72dbe9abf	Cherry-pick benchmark related changes from release/1.4 (#17156 ) * cherry-pick commit from `8877054` * cherry-pick commit from `3f0b97d` * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn (cherry picked from commit `8643dbc233`) * Cherry-Pick from 16662 : Anakin subgraph cpu support (cherry picked from commit `7ad182e16c`) * Cherry-pick from 1662, 16797.. : add anakin int8 support (cherry picked from commit `e14ab180fe`) * Cherry-pick from 16813 : change singleton to graph RegistBlock test=release/1.4 (cherry picked from commit `4b9fa42307`) * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2 Support ShuffleNet and MobileNet-v2, test=release/1.4 (cherry picked from commit `a6fb066f90`) * Cherry-pick : anakin subgraph add opt config layout argument #16846 test=release/1.4 (cherry picked from commit `8121b3eccb`) * 1. add shuffle_channel_detect (cherry picked from commit `6efdea8997`) * update shuffle_channel op convert, test=release/1.4 (cherry picked from commit `e4726a066f`) * Modify symbol export rules test=develop	6 years ago
石晓伟	5dea0bdd1b	Merge pull request #16498 from Shixiaowei02/feature/anakin-engine merge feature/anakin-engine to develop	6 years ago
Shixiaowei02	bddb2cd315	resolve conflicts with the develop branch test=develop	6 years ago
Wojciech Uss	09dfc7a2aa	C-API quantization core 2 (#16396 ) * C-API quantization core test=develop Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com> * Decouple Quantizer from AnalysisPredictor test=develop * fixes after review test=develop * renamed mkldnn quantize stuff test=develop * remove ifdef from header file test=develop	6 years ago
nhzlx	07dcf2856c	git cherry-pick from feature/anakin-engine: update anakin subgraph #16278	6 years ago
nhzlx	a25331bc26	cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189	6 years ago
nhzlx	c79f06d3d8	cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178	6 years ago
nhzlx	69d37f81d7	cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 support change input size	6 years ago
Tao Luo	7d2740db83	Revert "cache runtime_context"	6 years ago
luotao1	5ecdc49c6b	set enable_runtime_context_cache_ default false test=develop	6 years ago
luotao1	d94fd97230	add runtime_context_cache_pass test=develop	6 years ago
nhzlx	a9ed427749	cant not pass ci add if use static engine for trt test=develop	6 years ago
Sylwester Fraczek	543e53db05	fix typo releated->related	6 years ago
Yan Chunwei	e887d71958	fix ir debug config (#15571 )	6 years ago
Yan Chunwei	655179089f	AnalysisConfig remove contrib namespace (#15540 )	6 years ago
Zhaolong Xing	97b76c94c4	Merge pull request #15242 from NHZlX/trt_int8_ultimate_version add trt int8 support	6 years ago
Yan Chunwei	e2818c8608	add dynamic memory optim (#15457 )	6 years ago
nhzlx	027d24c831	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version	6 years ago
Yan Chunwei	885c4e57ab	fea/infer memory optim2 (#14953 )	6 years ago
nhzlx	312fe0ece1	add trt int8 calibration support fix comments test=develop	6 years ago
nhzlx	4e3522e5b4	add trt int8 support test=develop	6 years ago
Yan Chunwei	d09d6eadc0	make inference api work with Doxygen (#15195 )	6 years ago
Yan Chunwei	875a07c32d	refactor inference analysis api (#14634 )	6 years ago
nhzlx	71636e677d	add min_subgraph_size attr to tensorrt config test=develop	6 years ago
bingyanghuang	943ad4781f	One possible solution to add flexibility for mkldnn placement pass (#14768 ) * Choose to turn on use_mkldnn attribute v1 * Fix mkldnn_op empty bug * format change test=develop * fix ci test=develop * fix ci test and add test in dam test=develop * add example to dam compare test test=develop * review changes test=develop	6 years ago
Tao Luo	743cb840f1	update with comments test=develop	6 years ago
Tao Luo	405b2486db	support loading from memory test=develop	6 years ago
luotao1	a5c4b463c9	add SetMKLDNNThreadId api	6 years ago
Yiqun Liu	9e6b1c5f97	Refine tester of TensorRT engine (#14390 ) * Refine the tester for MixedRTPredictor. test=develop * Enable the profiler in TensorRT engine. * Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.	6 years ago
Yan Chunwei	9f252e0032	Combine Inference Analysis with IR (#13914 )	6 years ago

50 Commits (13b03e7ad6e9d555b972cca8e5e84ee0d4b61c99)