Commit Graph

485 Commits (d0a5620575a3ce94e0a7a5a20192e9307b0b9c93)

Author SHA1 Message Date
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
lidanqing 36757ed203 Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
flame e48dd92fc8
bug fix (#17392)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
wopeizl d9991dccdd
add parallel build script to ci … (#16901)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
Tao Luo 5b1565a7be
Merge pull request #16875 from lidanqing-intel/lidanqing/improve_preprocess_script
6 years ago
root 1965a22488 minus trt ci times.
6 years ago
lijianshe02 de26df440b add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
lidanqing de02d40e98 improve preprocess script and read from tar
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
liuwei1031 85363848a1
Security issue (#16774)
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
Tao Luo d5c8d4acfe reduce all analyzer_test ci elasped time
6 years ago
bingyanghuang 88ceda5134 MKLDNN INT8 v2 readme.md (#16515)
6 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
6 years ago
Wojciech Uss 9b6a029666 fix dataset reading and add support for full dataset (#16559)
6 years ago
lidanqing 2ca0de3cd4 fix preprocess script with processbar, integrity check and logs (#16608)
6 years ago
Tao Luo ce18710421 enhance analyzer_tests download
6 years ago
Wojciech Uss 2498395132 remove profiling from int8 test
6 years ago
Tao Luo 8f7b5883b8
Merge pull request #16529 from lidanqing-intel/lidanqing/preprocess-data
6 years ago
lidanqing 0d656996bf fix some bugs of unzip and reading val list
6 years ago
lidanqing b46e467abc add wget and unzip part and change data_dir
6 years ago
lidanqing 894aa9b235 change script file name and data_dir location
6 years ago
lidanqing 57f51e5b08 preprocess with PIL the full val dataset and save binary
6 years ago
Sylwester Fraczek 8ece7a9708 fixed url to dataset
6 years ago
Sylwester Fraczek fe21578a44 create test for quantized resnet50
6 years ago
Yihua Xu 57dc3c1943 Disable compare for Issue#16316 (#16466)
6 years ago
liuwei1031 de3b70a101
fix cdn issue, test=develop (#16423)
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Tao Luo c49b7855fa
Merge pull request #16120 from Xreki/fix_cmake_compress
6 years ago
Liu Yiqun 4e052e0ac9 Disable inference download for WIN32 temporary.
6 years ago
luotao1 1283833395 zero_copy tensor support INT32
6 years ago
luotao1 31c4e1d9fc Merge branch 'develop' into zero_copy
6 years ago
luotao1 9e2c7e69fb simplify the zero_copy tests
6 years ago
luotao1 aeee4cbe71 add compare between zerocopy and analysis
6 years ago
Liu Yiqun 6bb84b74b2 Change the download and compress command of cmake.
6 years ago
Tao Luo 25ca2ca001 change init_idx to INT32 in transformer_test
6 years ago
Tao Luo e5e7e9b865 Merge branch 'develop' into transformer_ut
6 years ago
Tao Luo 6f2581e4c5
Merge pull request #16090 from lidanqing-intel/paddle-int32
6 years ago
Zhaolong Xing 3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
luotao1 fad06cb928 unify ZeroCopy in analysis_test
6 years ago
lidanqing 4aeb261da9 Add INT32 support. INT32 in last switch case
6 years ago
luotao1 06aab1b493 refine SetCpuMathLibraryNumThreads
6 years ago
Tao Luo e3dd6970fc disable dam temporarily (#15860)
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Yiqun Liu e38dd91f04
Refine cmake's download function. (#15512)
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yan Chunwei e887d71958
fix ir debug config (#15571)
6 years ago
Tao Luo 3d0ecab41b add analyzer_transformer_test
6 years ago
Tao Luo 1a252f4be6
Merge pull request #15587 from luotao1/bert
6 years ago
Zhaolong Xing 90ffe74954
Merge pull request #15546 from NHZlX/fix_trt_utest_random_failed
6 years ago
luotao1 8f0c2b07f2 use embedding=128 bert model for test
6 years ago
luotao1 5504425eb3 fix compiler error, use len20 dataset for bert
6 years ago
Yan Chunwei 655179089f
AnalysisConfig remove contrib namespace (#15540)
6 years ago
luotao1 e31aef9f6e Merge branch 'develop' into fc500110-bert_test
6 years ago
nhzlx 95b98f27ae fix trt models utest failed.
6 years ago
Tao Luo b919190232
Merge pull request #15531 from jczaja/prv-googlenet-fix
6 years ago
Jacek Czaja 4aa7ef3c13 - Compensation fix to LRN MKL-DNN op
6 years ago
Yan Chunwei e2818c8608
add dynamic memory optim (#15457)
6 years ago
luotao1 353b5f06a7 refine analyzer_bert_test to pass the ci
6 years ago
nhzlx e6218c1d7b change the input to a smaller value
6 years ago
fuchang01 4a33a44f45 analyzer bert tester
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
Tao Luo 8f522c15ed
Merge pull request #15408 from luotao1/mm_dnn
7 years ago
Tao Luo 001827c270 test_analyzer_mm_dnn runs in serial
7 years ago
Tao Luo 668563088e add pyramid_dnn c++ inference test
7 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
7 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
7 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
7 years ago
bingyanghuang a152a5c731 Disable conv3d mkldnn in dam (#15335)
7 years ago
tensor-tang 1a95cd227d disable seqpool test on mac or without mkl
7 years ago
Tao Luo 2411ed4286 fix multi-threads in ZeroCopyProfile
7 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
7 years ago
tensor-tang 7035f051a8 adjust acc on mac
7 years ago
Tao Luo e33427da0d
Merge pull request #15280 from luotao1/random_test
7 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
7 years ago
tensor-tang 84e023eae5 adjust the acc since the refer result is too large
7 years ago
tensor-tang 4461a458a5 adjust diff since abs is too large
7 years ago
tensor-tang ca6fdc6e33 refine and fix test
7 years ago
tensor-tang 781cd0cf51 add multi threads test of seqpool test (#15293)
7 years ago
Tao Luo cbd1c7c01f fix CompareDeterministic error when test_all_data
7 years ago
tensor-tang 96786d3716 add compare_determine of seqpool1 test
7 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
7 years ago
Tao Luo 7d13d20769
Merge pull request #15245 from luotao1/rnn1_multi_thread
7 years ago
tensor-tang 54afcb7ec6 add compare zerocopy test with native result
7 years ago
tensor-tang 137060135e fix zerocopy size
7 years ago
tensor-tang 7461356723 add zerocopy for seqpool test
7 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
7 years ago
Tao Luo d43983b61d reduce threads number to avoid hang in CI
7 years ago
Tao Luo 197d0f2431 fix trt_model_tester to pass the ci
7 years ago
Tao Luo 71d9097a89 fix analyzer_test runs error in native_config
7 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
7 years ago
Tao Luo 6ca9a4810b
Merge pull request #15196 from luotao1/serial
7 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
7 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
7 years ago
tensor-tang b9c645639b workaround with third party cache
7 years ago
tensor-tang 484085693e update url and num_ops
7 years ago
tensor-tang cd94df8679 fix load and refine
7 years ago
tensor-tang 8e271896ae add test data for seqpool1
7 years ago
Tao Luo 85471533e0
Merge pull request #15079 from luotao1/analysis_test
7 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
7 years ago
Tao Luo ecae157edf simplify some data record in analyzer_tester
7 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
7 years ago
peizhilin 01c00b07dd fix test issues on windows
7 years ago
Yihua Xu 0b0acfaa88 Add mkldnn item for porfile and compare usage.
7 years ago
tensor-tang d46a140dd9 add seq pool inference test
7 years ago
tensor-tang d4931a2abc support more input fake data
7 years ago
Tao Luo 91408e3122 fix analyzer_mm_dnn_tester fails when bs > 1
7 years ago
Tao Luo 22c71398e3 add MM_DNN inference test
7 years ago
luotao1 13367866cd add deterministic result unit-test
7 years ago
Tao Luo 2f55a04ec6 add refer result comparasion
7 years ago
Tao Luo 1a6d2cfe39 add test_analyzer_mobilenet
7 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
Yan Chunwei a985949be9
Fea/fuse conv elementwise add fuse (#14669)
7 years ago
Yu Yang aa38fc4ce5 Fix compile
7 years ago
nhzlx 644c13a387 fix compile error
7 years ago
nhzlx afc51e6f82 add benchmark for trt
7 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
7 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
7 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
Yihua Xu 82eefceabe Add the profile_mkldnn flag for profile function(test=develop)
7 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
7 years ago
ZhenWang 6e48e47406 test=develop
7 years ago
ZhenWang e1da6cd754 add the normal dam and the small dam
7 years ago
ZhenWang d5947b0ed7 test=develop
7 years ago
ZhenWang 33b4963505 unify the normal and small dam model.
7 years ago
ZhenWang 8f2e556e65 support the small dam model. test=develop
7 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
7 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 1afa9492af Recover the profiler
7 years ago
Yiqun Liu bf222f197d
Use sub scope in tensor_array_to_tensor op. (#14524)
7 years ago
luotao1 116979a40a refine api name
7 years ago
luotao1 e66b4c6bff adjust tester_helper to make multi-instance multi-thread work
7 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
Tao Luo 1d9b2a453c
Merge pull request #14508 from luotao1/warm_up_multi_thread
7 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
7 years ago
nhzlx faeb9b8aa9 fix compile rely problem
7 years ago
Tao Luo eb9b9becdc add warm up in TestMultiThreadPrediction
7 years ago
nhzlx 2a84054372 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
7 years ago
nhzlx b742d46520 fix demo ci bug on trt
7 years ago
Tao Luo 09ee266f8e disable two openblas test temporary
7 years ago
Yiqun Liu 8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op (#14487)
7 years ago
Tao Luo 9d29ebc010
Merge pull request #14306 from sfraczek/sfraczek/test-analyzer-mobilenet
7 years ago
Sylwester Fraczek d318583eb5 rename mobilenet dir to mobilenet_depthwise_conv
7 years ago
Tao Luo 1d867805b0 rollback analyzer_seq_conv1_tester
7 years ago
Tao Luo 5ef123c778 Merge branch 'develop' into dam_fc
7 years ago
Yiqun Liu 9e6b1c5f97
Refine tester of TensorRT engine (#14390)
7 years ago
Sylwester Fraczek 2412c27c2b
Merge branch 'develop' into sfraczek/test-analyzer-mobilenet
7 years ago
Tao Luo e0d4e04bdd fix some compiler warning
7 years ago
Tao Luo 8ea13e336a add in_num_col_dims for fc
7 years ago
Yan Chunwei 9f252e0032
Combine Inference Analysis with IR (#13914)
7 years ago
nhzlx d6ff006903 add serial to trt test and do not print log for unused trt logs
7 years ago
Sylwester Fraczek b5f617fa9b make mobilenet test reuse resnet50 test
7 years ago
minqiyang 3da43dcae2 Because anakin do NOT use glog, so we revert anakin related change
7 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
7 years ago
Sylwester Fraczek f395075efc rebased and stuff broke
7 years ago
Sylwester Fraczek a60957f386 addd test_analyzer_mobilenet
7 years ago
Xin Pan 80132933b7
Merge pull request #14281 from luotao1/face
7 years ago
Tao Luo eea36739cc refine test_helper.h
7 years ago
Tao Luo 2b791f1f63 unify analyzer_face_tester to analyzer_resnet50_tester
7 years ago
Tao Luo 1ead9318d5 remove unused code in test_helper.h to pass ci
7 years ago
Tao Luo 7a2887d212 add analyzer_face_tester
7 years ago
Tao Luo 2ec65ae0db download face_model in CMakeLists.txt
7 years ago
Tao Luo 2f9a5a2e0a add analyzer_face_tester
7 years ago
nhzlx 5700fafd0f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
7 years ago
nhzlx 86b99ac953 fix comments and fix bug
7 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
7 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
7 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
7 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
7 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails)
7 years ago
Sylwester Fraczek 741cb33bd9 test multithreading
7 years ago
Tao Luo 316bc9bfc9 fix typo and warning in analyzer_resnet50_test
7 years ago
tensor-tang 40f8456a4f refine fuse pattern and attr
7 years ago
tensor-tang 0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
7 years ago
Tao Luo ef09862450 fix analyzer_rnn2_test
7 years ago
Tao Luo e5b4643ad8 add profile_mkldnn test
7 years ago
Tao Luo 7d680be5a3 Merge branch 'develop' into mkldnn_test
7 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
7 years ago
Tao Luo b819684370 add compare_mkldnn test
7 years ago
Tao Luo 6ea9d1b595 add analysis_predictor in vis_demo
7 years ago