Commit Graph

496 Commits (6df1ca54c869efe34723be8bee2ce15db7320a2a)

Author SHA1 Message Date
Wojciech Uss c26130f3a9 reuse C-API INT8 unit test application (#18077)
6 years ago
lidanqing 466254151a add Mobilienet ssd int8 analyzer tester (#18075)
6 years ago
石晓伟 42f12a4aca
fix ci test cmake test=develop (#18060)
6 years ago
Michał Gallus 8462e2b805 Disable MKLDNN FC in Resnet50 test (#18030)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
mozga-intel c1379bf238 [NGraph] Bert model for a capi, ngraph's support test=develop (#17844)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
Tao Luo b4b169467b
add fc_mkldnn_pass in compare_mkldnn (#17712)
6 years ago
Zhaolong Xing 4337009b92 fix trt ci timeout error (#17701)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
lidanqing 04b6c29ee0 Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
lidanqing 36757ed203 Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
flame e48dd92fc8
bug fix (#17392)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
wopeizl d9991dccdd
add parallel build script to ci … (#16901)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
Tao Luo 5b1565a7be
Merge pull request #16875 from lidanqing-intel/lidanqing/improve_preprocess_script
6 years ago
root 1965a22488 minus trt ci times.
6 years ago
lijianshe02 de26df440b add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
lidanqing de02d40e98 improve preprocess script and read from tar
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
liuwei1031 85363848a1
Security issue (#16774)
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
Tao Luo d5c8d4acfe reduce all analyzer_test ci elasped time
6 years ago
bingyanghuang 88ceda5134 MKLDNN INT8 v2 readme.md (#16515)
6 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
6 years ago
Wojciech Uss 9b6a029666 fix dataset reading and add support for full dataset (#16559)
6 years ago
lidanqing 2ca0de3cd4 fix preprocess script with processbar, integrity check and logs (#16608)
6 years ago
Tao Luo ce18710421 enhance analyzer_tests download
6 years ago
Wojciech Uss 2498395132 remove profiling from int8 test
6 years ago
Tao Luo 8f7b5883b8
Merge pull request #16529 from lidanqing-intel/lidanqing/preprocess-data
6 years ago
lidanqing 0d656996bf fix some bugs of unzip and reading val list
6 years ago
lidanqing b46e467abc add wget and unzip part and change data_dir
6 years ago
lidanqing 894aa9b235 change script file name and data_dir location
6 years ago
lidanqing 57f51e5b08 preprocess with PIL the full val dataset and save binary
6 years ago
Sylwester Fraczek 8ece7a9708 fixed url to dataset
6 years ago
Sylwester Fraczek fe21578a44 create test for quantized resnet50
6 years ago
Yihua Xu 57dc3c1943 Disable compare for Issue#16316 (#16466)
6 years ago
liuwei1031 de3b70a101
fix cdn issue, test=develop (#16423)
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Tao Luo c49b7855fa
Merge pull request #16120 from Xreki/fix_cmake_compress
6 years ago
Liu Yiqun 4e052e0ac9 Disable inference download for WIN32 temporary.
6 years ago
luotao1 1283833395 zero_copy tensor support INT32
6 years ago
luotao1 31c4e1d9fc Merge branch 'develop' into zero_copy
6 years ago
luotao1 9e2c7e69fb simplify the zero_copy tests
6 years ago
luotao1 aeee4cbe71 add compare between zerocopy and analysis
6 years ago
Liu Yiqun 6bb84b74b2 Change the download and compress command of cmake.
6 years ago
Tao Luo 25ca2ca001 change init_idx to INT32 in transformer_test
6 years ago
Tao Luo e5e7e9b865 Merge branch 'develop' into transformer_ut
6 years ago
Tao Luo 6f2581e4c5
Merge pull request #16090 from lidanqing-intel/paddle-int32
6 years ago
Zhaolong Xing 3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
luotao1 fad06cb928 unify ZeroCopy in analysis_test
6 years ago
lidanqing 4aeb261da9 Add INT32 support. INT32 in last switch case
6 years ago
luotao1 06aab1b493 refine SetCpuMathLibraryNumThreads
6 years ago
Tao Luo e3dd6970fc disable dam temporarily (#15860)
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Yiqun Liu e38dd91f04
Refine cmake's download function. (#15512)
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yan Chunwei e887d71958
fix ir debug config (#15571)
6 years ago
Tao Luo 3d0ecab41b add analyzer_transformer_test
6 years ago
Tao Luo 1a252f4be6
Merge pull request #15587 from luotao1/bert
6 years ago
Zhaolong Xing 90ffe74954
Merge pull request #15546 from NHZlX/fix_trt_utest_random_failed
6 years ago
luotao1 8f0c2b07f2 use embedding=128 bert model for test
6 years ago
luotao1 5504425eb3 fix compiler error, use len20 dataset for bert
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
luotao1 e31aef9f6e Merge branch 'develop' into fc500110-bert_test
6 years ago
nhzlx 95b98f27ae fix trt models utest failed.
6 years ago
Tao Luo b919190232
Merge pull request #15531 from jczaja/prv-googlenet-fix
6 years ago
Jacek Czaja 4aa7ef3c13 - Compensation fix to LRN MKL-DNN op
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
luotao1 353b5f06a7 refine analyzer_bert_test to pass the ci
6 years ago
nhzlx e6218c1d7b change the input to a smaller value
6 years ago
fuchang01 4a33a44f45 analyzer bert tester
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
Tao Luo 8f522c15ed
Merge pull request #15408 from luotao1/mm_dnn
6 years ago
Tao Luo 001827c270 test_analyzer_mm_dnn runs in serial
6 years ago
Tao Luo 668563088e add pyramid_dnn c++ inference test
6 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
bingyanghuang a152a5c731 Disable conv3d mkldnn in dam (#15335)
6 years ago
tensor-tang 1a95cd227d disable seqpool test on mac or without mkl
6 years ago
Tao Luo 2411ed4286 fix multi-threads in ZeroCopyProfile
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
tensor-tang 7035f051a8 adjust acc on mac
6 years ago
Tao Luo e33427da0d
Merge pull request #15280 from luotao1/random_test
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
tensor-tang 84e023eae5 adjust the acc since the refer result is too large
6 years ago
tensor-tang 4461a458a5 adjust diff since abs is too large
6 years ago
tensor-tang ca6fdc6e33 refine and fix test
6 years ago
tensor-tang 781cd0cf51 add multi threads test of seqpool test (#15293)
6 years ago
Tao Luo cbd1c7c01f fix CompareDeterministic error when test_all_data
6 years ago
tensor-tang 96786d3716 add compare_determine of seqpool1 test
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
Tao Luo 7d13d20769
Merge pull request #15245 from luotao1/rnn1_multi_thread
6 years ago
tensor-tang 54afcb7ec6 add compare zerocopy test with native result
6 years ago
tensor-tang 137060135e fix zerocopy size
6 years ago
tensor-tang 7461356723 add zerocopy for seqpool test
6 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
6 years ago
Tao Luo d43983b61d reduce threads number to avoid hang in CI
6 years ago
Tao Luo 197d0f2431 fix trt_model_tester to pass the ci
6 years ago
Tao Luo 71d9097a89 fix analyzer_test runs error in native_config
6 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
6 years ago
Tao Luo 6ca9a4810b
Merge pull request #15196 from luotao1/serial
6 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
tensor-tang b9c645639b workaround with third party cache
6 years ago
tensor-tang 484085693e update url and num_ops
6 years ago
tensor-tang cd94df8679 fix load and refine
6 years ago
tensor-tang 8e271896ae add test data for seqpool1
6 years ago
Tao Luo 85471533e0
Merge pull request #15079 from luotao1/analysis_test
6 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Tao Luo ecae157edf simplify some data record in analyzer_tester
6 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
Yihua Xu 0b0acfaa88 Add mkldnn item for porfile and compare usage.
6 years ago
tensor-tang d46a140dd9 add seq pool inference test
6 years ago
tensor-tang d4931a2abc support more input fake data
6 years ago
Tao Luo 91408e3122 fix analyzer_mm_dnn_tester fails when bs > 1
6 years ago
Tao Luo 22c71398e3 add MM_DNN inference test
6 years ago
luotao1 13367866cd add deterministic result unit-test
6 years ago
Tao Luo 2f55a04ec6 add refer result comparasion
6 years ago
Tao Luo 1a6d2cfe39 add test_analyzer_mobilenet
6 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Yan Chunwei a985949be9
Fea/fuse conv elementwise add fuse (#14669)
6 years ago
Yu Yang aa38fc4ce5 Fix compile
6 years ago
nhzlx 644c13a387 fix compile error
6 years ago
nhzlx afc51e6f82 add benchmark for trt
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
Yihua Xu 82eefceabe Add the profile_mkldnn flag for profile function(test=develop)
6 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
6 years ago
ZhenWang 6e48e47406 test=develop
6 years ago
ZhenWang e1da6cd754 add the normal dam and the small dam
6 years ago
ZhenWang d5947b0ed7 test=develop
6 years ago
ZhenWang 33b4963505 unify the normal and small dam model.
6 years ago
ZhenWang 8f2e556e65 support the small dam model. test=develop
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin 1afa9492af Recover the profiler
6 years ago
Yiqun Liu bf222f197d
Use sub scope in tensor_array_to_tensor op. (#14524)
6 years ago
luotao1 116979a40a refine api name
6 years ago
luotao1 e66b4c6bff adjust tester_helper to make multi-instance multi-thread work
6 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
6 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
6 years ago
peizhilin 7c8c9dc9bf fix unit test cases
6 years ago
Tao Luo 1d9b2a453c
Merge pull request #14508 from luotao1/warm_up_multi_thread
6 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
6 years ago
nhzlx faeb9b8aa9 fix compile rely problem
6 years ago
Tao Luo eb9b9becdc add warm up in TestMultiThreadPrediction
6 years ago
nhzlx 2a84054372 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
6 years ago
nhzlx b742d46520 fix demo ci bug on trt
6 years ago
Tao Luo 09ee266f8e disable two openblas test temporary
6 years ago
Yiqun Liu 8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op (#14487)
6 years ago
Tao Luo 9d29ebc010
Merge pull request #14306 from sfraczek/sfraczek/test-analyzer-mobilenet
6 years ago
Sylwester Fraczek d318583eb5 rename mobilenet dir to mobilenet_depthwise_conv
6 years ago
Tao Luo 1d867805b0 rollback analyzer_seq_conv1_tester
6 years ago
Tao Luo 5ef123c778 Merge branch 'develop' into dam_fc
6 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
6 years ago
Sylwester Fraczek 2412c27c2b
Merge branch 'develop' into sfraczek/test-analyzer-mobilenet
6 years ago
Tao Luo e0d4e04bdd fix some compiler warning
6 years ago
Tao Luo 8ea13e336a add in_num_col_dims for fc
6 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
6 years ago
nhzlx d6ff006903 add serial to trt test and do not print log for unused trt logs
6 years ago
Sylwester Fraczek b5f617fa9b make mobilenet test reuse resnet50 test
6 years ago
minqiyang 3da43dcae2 Because anakin do NOT use glog, so we revert anakin related change
6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
6 years ago
Sylwester Fraczek f395075efc rebased and stuff broke
6 years ago
Sylwester Fraczek a60957f386 addd test_analyzer_mobilenet
6 years ago
Xin Pan 80132933b7
Merge pull request #14281 from luotao1/face
6 years ago
Tao Luo eea36739cc refine test_helper.h
6 years ago
Tao Luo 2b791f1f63 unify analyzer_face_tester to analyzer_resnet50_tester
6 years ago
Tao Luo 1ead9318d5 remove unused code in test_helper.h to pass ci
6 years ago
Tao Luo 7a2887d212 add analyzer_face_tester
6 years ago
Tao Luo 2ec65ae0db download face_model in CMakeLists.txt
6 years ago
Tao Luo 2f9a5a2e0a add analyzer_face_tester
6 years ago
nhzlx 5700fafd0f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
6 years ago
nhzlx 86b99ac953 fix comments and fix bug
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
6 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
6 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
6 years ago