Paddle

Commit Graph

Author	SHA1	Message	Date
Pei Yang	181b1f5a30	adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746 )	5 years ago
zlsh80826	fdbe114b12	[Paddle-TRT] use float constant instead of double test=develop (#24544 )	5 years ago
Zhaolong Xing	f68d4fb3f1	fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517 ) test=develop	5 years ago
pawelpiotrowicz	db2b6b6568	Hide globals & redesign restore PR (#24279 ) test=develop	5 years ago
Jacek Czaja	8b88cd5167	[oneDNN] Fix to inplace pass (#24442 ) * - Disabling inplace pass test=develop - Disable cycles test=develop - fix test=develop - Enhancement to in-place - Lint fixes test=develop * - Lint fixes test=develop	5 years ago
Jacek Czaja	d0307145a3	[oneDNN] disabling oneDNN inplace pass (#24406 )	5 years ago
Tao Luo	72c370c8d2	remove unused test_multi_thread_helper.h (#24399 ) test=develop	5 years ago
Tao Luo	63da846de0	remove old inference C++ tests (#24368 )	5 years ago
Chen Weihang	aa0f254fbe	Add macro BOOST_GET to enrich the error information of boost :: get (#24175 ) * add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop	5 years ago
Tao Luo	c1df7048c7	add UT for mkldnn_cache_capacity (#24336 ) * add UT for mkldnn_cache_capacity test=develop * fix comparison of integer expressions of different signedness test=develop	5 years ago
Tao Luo	9eedf05d2f	solve mklml memory leak on windows (#24015 ) * solve mklml memory leak on windows test=develop * remove unused msvcr120.dll test=develop	5 years ago
石晓伟	17ac6e2580	update the analysis predictor for multi-stream support, test=develop (#24046 ) * update the analysis predictor, test=develop * update the unit test, test=develop * no priority set before the inferface determined, test=develop * interface name generalization, test=develop	5 years ago
lidanqing	61ec30f030	Update QAT INT8 2.0 doc (#24127 ) * update local data preprocess doc * update for 2.0 QAT test=develop test=document_fix * update benchmark data test=develop test=document_fix Co-authored-by: Wojciech Uss <wojciech.uss@intel.com>	5 years ago
Sylwester Fraczek	e1a7a88057	added reshape transpose matmul fuse pass (#23754 )	5 years ago
arlesniak	d31a174f51	added fusing matmul-transpose-reshape pass (#23866 )	5 years ago
Pei Yang	695a53c874	remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805 )	5 years ago
Zhaolong Xing	35148d17f7	[BUG]: Head number can only be > 1 on multihead op (#23974 ) * support the head number == 1 test=develop * fix slice op error. test=develop	5 years ago
Zhou Wei	7817003795	Optimize the error messages of paddle CUDA API (#23816 ) * Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop	5 years ago
Zhaolong Xing	133f1fc123	[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723 ) * refine ernie trt dynamic shape support 1. add slice op converter 2. add emb eltwise layernorm fp16 support test=develop * fix dynamic shape test ut test=develop * fix comments. test=develop * fix comments test=develop	5 years ago
guofei	2b896c1f6b	Support LoDTensorArray in fetch (#23645 ) * Support LoDTEnsorArray in fetch op test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop * Support LoDTensorArray in fetch test=develop	5 years ago
lidanqing	2291634c5c	Add user local data preprocess support (#23692 ) * add local data preprocess support for imagenet test=develop * add local data2bin tests test=develop * locally two tests passed test=develop * change according to reviews test=develop	5 years ago
chenhaoze	b7d185d6ca	OP clip, merge_lod_tensor, convert/elementwise error message enhancement (#23742 ) * OP clip, merge_lod_tensor, convert/elementwise error message enhancement. test=develop	5 years ago
Pei Yang	c528f1d4f3	[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23672 ) * add hard_sigmoid trt op converter * add hard_swish op converter and plugin. test=develop * add macro to adapt lower trt version. test=develop	5 years ago
Pei Yang	015acdbfb1	Refine error message of leaky_relu, tensorrt_engine, split, prelu op converter (#23661 )	5 years ago
joanna.wozna.intel	12ba05ce0c	Add scale-matmul fuse pass (#23734 )	5 years ago
Zhaolong Xing	3acb047a20	[Paddle-TRT]: add eltwise,pool2d, prelu, scale, concat, gelu dynamic shape support (#23396 ) * add elementwise pool2d, prelu, shuffle channel test=develop * add scale and refine concat eltwise conveter test=develop * refine elementwise converter test=develop * refine ut test and enforce error. test=develop * modify const cast test=develop	5 years ago
chenhaoze	9b06dd8628	Add three passes and api reference of paddle_pass_builder. test=develop (#23741 ) * Add three passes and api reference of paddle_pass_builder.h	5 years ago
Zhaolong Xing	ed5766ffbc	refine act conv2d pool2d trt converter log (#23605 ) * refine act conv2d pool2d fc, trt converter log test=develop * fix comments test=develop	5 years ago
Pei Yang	28f04c6a5e	refine shuffle channel errmsg, test=develop (#23520 )	5 years ago
Tao Luo	e4f1b1c5e1	solve mklml memory leak (#23557 )	5 years ago
mozga-intel	3baaee9aab	Remove: NGraph engine from PDPD repository (#23545 ) * Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop	5 years ago
Pei Yang	3d5d217030	Revert "[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536 )", test=develop (#23642 ) This reverts commit `cdc6d4e292`.	5 years ago
Pei Yang	eb11633611	batch_norm trt converter error message, test=develop (#23620 )	5 years ago
joanna.wozna.intel	ce08fdcf2b	Add support for INT8 matmul in C-API quantization (#23463 ) * Integrate matmul with cpu_quantize_pass test=develop * Add matmul checking scales test=develop * Change condition of matmul quantization test=develop * Remove redundant var test=develop	5 years ago
Pei Yang	cdc6d4e292	[Paddle-TRT] Add hard_sigmoid and hard_swish support(support MobilenetV3) (#23536 ) * add hard_sigmoid trt op converter * add hard_swish op converter and plugin. test=develop	5 years ago
Pei Yang	42655ef721	Add full_like op. (#23364 ) * add full_like op. test=develop * add dygraph support. test=develop * increase coverage. test=develop	5 years ago
石晓伟	36b82eae0e	refine the doc of paddle_api.h, test=develop (#23402 ) * refine the doc of paddle_api.h, test=develop * fix documents, test=develop	5 years ago
Zhaolong Xing	6a23850a3f	add init value to varis in analysis config. (#23442 )	5 years ago
Zhaolong Xing	70782e6379	[Inference doc]: refine paddle_api.h doc (#23354 ) * refine paddle api doc test=develop * fix comments test=develop	5 years ago
Pei Yang	7e439780d9	add full paddle_analysis_config.h APIs. (#23215 )	5 years ago
Zhaolong Xing	1a6ce8b910	add swish split gelu plugin dynamic support (#23305 ) test=develop	5 years ago
Jacek Czaja	2bb1b0e89e	[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315 )	5 years ago
石晓伟	708ded584e	pause the io_utils_test of int64 and resume after repair, test=develop (#23234 )	5 years ago
Wilber	0129f4b568	Add some inference API comments for AnalysisPredictor (#23242 ) * add inference api doc. test=develop	5 years ago
Zhaolong Xing	430b0099c9	[Paddle-TRT]: Ernie Dynamic shape support. (#23138 ) * add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop	5 years ago
Pei Yang	46b8d282dc	Add some inference API comments for AnalysisConfig (#23117 ) * add some API comments in paddle_analysis_config.h, test=develop * add some API comments in paddle_analysis_config.h, test=develop	5 years ago
Sylwester Fraczek	abee05a8c8	added mkldnn swish activation (#23041 )	5 years ago
Pei Yang	24db750386	fix trt int8 calib precision bug. test=develop (#23036 )	5 years ago
Wilber	db40ee86db	fix unittets. test=develop (#23018 )	5 years ago
Zhang Ting	137d6563fc	add check for assigned data, test=develop (#22960 )	5 years ago
Zhaolong Xing	8d6dc102fe	[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494 ) * 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop	5 years ago
Zhaolong Xing	dd67d44a50	[Paddle-TRT] : (Part1) Dynamic shape support (#22868 ) * change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop	5 years ago
Michał Gallus	0038bfbd1d	Prevent loading of warmup data in analyzer_int8 if enable_int8 is set to false (#22857 )	5 years ago
石晓伟	1861ca88f1	serialize the PaddleTensor, test=develop (#22810 ) * encapsulate the PaddleTensorToLoDTensor, test=develop * serialize the pd_tensor, test=develop * serialize tensors to file, test=develop	5 years ago
石晓伟	ddb9b46fec	change the function in op_teller, test=develop (#22794 ) * change the function in op_teller, test=develop * correct the commit-id, test=develop	5 years ago
liu zhengxi	324f2b3922	Fix inference c api PD_GetZeroCopyOutput lod (#22768 ) * fix inference c api lod, test=develop * fix capi lod problem and enrich tests, test=develop * delete useless header files and alter const_cast, test=develop	5 years ago
tianshuo78520a	433cef03e5	fix typo word (#22784 )	5 years ago
liu zhengxi	71ab0458e1	Fix pointer and c-api encapsulation (#22663 ) * refine pointer and c-api prototype, test=develop * fix new c api profile bug, test=develop * add unit tests, test=develop	5 years ago
GaoWei8	cdf5f6fb8c	Add an inference interface to disable FC padding (#22097 ) * Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error	5 years ago
tianshuo78520a	d2ba91aad1	fix typo words (#22653 )	5 years ago
flame	d97475d53b	fix CPU C inference API compile bug (#22702 )	5 years ago
flame	74eb82de19	fix go api bug (#22669 )	5 years ago
flame	f7eafca828	remove python inference warning (#22602 )	5 years ago
flame	1d503e6a9e	Golang inference API (#22503 ) * support golang inference	5 years ago
Zhaolong Xing	8acd745c25	[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486 ) * 1. optim multihead matmul: fuse three fc to multihtead matmul test=develop * fix conflict test=develop * fix comments test=develop	5 years ago
Wojciech Uss	4cddb43c5c	Add support for Ernie NLP model to the Slim QAT (#22506 ) * a test for Ernie QAT INT8 accuracy check test=develop * Remove NLP comparison test to split PRs test=develop * Fix typo and tabs, delete commented lines test=develop * re-combine the 2 PRs, test=develop Co-authored-by: Michał Gallus <sand3r@interia.eu> Co-authored-by: bingyanghuang <33643817+bingyanghuang@users.noreply.github.com>	5 years ago
Zhaolong Xing	54a325a52f	[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483 ) * add int8 op teller for trt. * refine trt int8 * add int8 op teller for trt. test=develop	5 years ago
Zhaolong Xing	ceda0b9b1a	[Fix BUG]: Core when multi thread + clone + paddle-trt (#22442 ) * add mutex for trt engine test=develop * add the test for copy_to_cpu test=develop	5 years ago
石晓伟	e1b0d7cbb1	remove anakin from code, test=develop (#22420 )	5 years ago
joanna.wozna.intel	3099d9d47c	Restore requantize squash (#22399 )	5 years ago
石晓伟	8cb04664b9	revert paddle_fluid.map, test=develop (#22236 )	5 years ago
liu zhengxi	07afc29e90	Make api.cc malloc consistent with paddle_api.h for PaddleBuf (#22255 )	5 years ago
silingtong123	4f1da4adcb	remove the useless third_party library from C++ inference library (#22021 ) * remove the useless third_party library from C++ inference library * revert removing the install directory	5 years ago
zhouwei25	549e6de7ac	faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164 )	5 years ago
Wilber	1230c110cb	[fluid-lite] adjust to relative error (#22232 ) - replace the fluid vs. lite precision comparison with relative error	5 years ago
Wojciech Uss	2e90c4eb0a	improve mkldnn_quantizer_config test code coverage (#22216 )	5 years ago
Wilber	5750152e80	support fluid-lite subgraph run resnet test=develop (#22191 ) - add a unit test that runs resnet in fluid-lite subgraph mode - update the git commit id of the Lite dependency	5 years ago
石晓伟	ad0dfb17c1	[Feature] Lite subgraph (#22114 )	5 years ago
Pei Yang	d8a9b134e3	fix trt instance_norm serialize bug. test=develop (#22152 )	5 years ago
Yiqun Liu	b1401fb74d	Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094 ) test=develop	5 years ago
Pei Yang	50bee83f71	add TRT support for instance_norm op (#21928 ) * add TRT support for instance_norm op	5 years ago
Pei Yang	0a51098a71	Add TRT support for BERT (#21135 ) * add gelu plugin * align trt bert with gpu * add support for fused fc with relu, * add unittest for bert trt	5 years ago
Michał Gallus	6192108408	[DNNL] 3D Fully-Connected (#21746 )	5 years ago
zhouwei25	e66f92d1ae	Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873 )	5 years ago
石晓伟	03479469a7	fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841 ) * fix multi-thread error of fc_gru_fuse_pass.cc, test=develop * export FLAGS and GLOG symbols, test=develop	5 years ago
lidanqing	9dff56e8e2	change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895 ) test=develop	5 years ago
Michał Gallus	253e664275	Disable memory opt pass when DNNL is on (#21826 ) * Disable memory opt pass when DNNL is on * Refine comment above mem optimization pass enablement test=develop	5 years ago
Michał Gallus	a5159d8480	Re-anble vgg and resnet101 models download (#21713 ) test=develop	5 years ago
石晓伟	2bb135825e	fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665 )	5 years ago
Chen Weihang	1fd1f06f11	Rename paddle throw error macro (#21657 ) * rename paddle throw error macro, test=develop * fix new error use case, test=develop	5 years ago
joanna.wozna.intel	d419b859c0	Add reshape int8 mkldnn op (#21428 ) * Add reshape int8 op test=develop * Change test to CPUPlace test=develop * Correct tests test=develop	5 years ago
Zhaolong Xing	fbbd94a6ce	there is bug for inference using auto grwoth allocator (#21621 ) test=develop	5 years ago
Adam	e81f0228df	MKL-DNN 1.0 Update (#20162 ) * MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop	5 years ago
rensilin	7f5d532a9c	fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461 ) * ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop	5 years ago
lidanqing	fbf9eca0d3	QAT Int8 document (#21360 ) * update benchmark for int8v2, QAT1, QAT2 accuracy and performance test=document_fix * change according to reviews test=develop test=document_fix * improve some descriptions and some models test=develop test=document_fix * update models benchmark data test=develop test=document_fix * update int8v2 and qat2 performance test=develop test=document_fix	5 years ago
Pei Yang	20d61414b4	fix glog warning, test=develop (#21573 )	5 years ago
Pei Yang	122b37ce62	make config option DisableGlogInfo() able to mute all inference logs (#21318 ) * make DisableGlogInfo able to mute all logs in inference.	5 years ago
Zhaolong Xing	da7748c53d	add conv, depthwise_conv, pooling (#20966 ) test=develop	5 years ago
GaoWei8	250a192181	Add ernie large c++ inference test (#21365 ) * add ernie-large test test=develop * add ernie large c++ inference test test=develop	5 years ago
Zhaolong Xing	b39c011637	specify the auto growth allocator for inference. (#21448 ) test=develop	5 years ago

1325 Commits (1c898b66d6c668048ab77ee33b2457687b8b36be)