Commit Graph

1130 Commits (b34933d9ee3b61dbbd642fd02f244c36d0d14550)

Author SHA1 Message Date
石晓伟 04ea7cb069
modify the access level of anakin engine (#18015)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Zhaolong Xing 4e8d5a034f
Light mem reuse strategy for inference. (#17925)
6 years ago
mozga-intel c1379bf238 [NGraph] Bert model for a capi, ngraph's support test=develop (#17844)
6 years ago
石晓伟 d008260fa8
update the initialization of anakin subgraph (#17880)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
Tao Luo e089e454a1
make omp thread num default 1 after inference run (#17801)
6 years ago
Tao Luo b4b169467b
add fc_mkldnn_pass in compare_mkldnn (#17712)
6 years ago
Zhaolong Xing 4337009b92 fix trt ci timeout error (#17701)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
lidanqing 04b6c29ee0 Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
Zhen Wang 8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. (#17639)
6 years ago
Zeng Jinle 4aa931dd85
Code clean of Allocator (#17602)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
Zhaolong Xing 38da103034 fix trt ci bug temporary. (#17565)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
lidanqing 36757ed203 Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
liuwei1031 ba70cc499e
fix security bugs : (#17464)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
6 years ago
wopeizl ca3ba378c7
fix the random compilation failure on windows test=develop (#17475)
6 years ago
Zhen Wang 4a1b7fec96
Add setting Scope function for the graph class (#17417)
6 years ago
flame e48dd92fc8
bug fix (#17392)
6 years ago
Zhaolong Xing 7a3bb061d8
fix: (#17279)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
Tao Luo d9cd989825
Merge pull request #17048 from luotao1/fix_runtime_cache_bug
6 years ago
tangwei12 13295d90d9
load persistables with selected rows, test=develop (#17047)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
wopeizl d9991dccdd
add parallel build script to ci … (#16901)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
nhzlx bc6b0ca1f4 fix trt anakin subgraph compile rely
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
Tao Luo 5b1565a7be
Merge pull request #16875 from lidanqing-intel/lidanqing/improve_preprocess_script
6 years ago
root 1965a22488 minus trt ci times.
6 years ago
Tao Luo ca8b8fa0bd
Merge pull request #16830 from Superjomn/fix/tmp-memory-optim
6 years ago
lijianshe02 de26df440b add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
lidanqing de02d40e98 improve preprocess script and read from tar
6 years ago
superjomn f58c3ec189 fix memory optim temporarily
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
liuwei1031 85363848a1
Security issue (#16774)
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
Tao Luo ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
Tao Luo d5c8d4acfe reduce all analyzer_test ci elasped time
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
bingyanghuang 88ceda5134 MKLDNN INT8 v2 readme.md (#16515)
6 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
6 years ago
Yan Chunwei 044ae2497d
fix identity temporarily (#15942)
6 years ago
Wojciech Uss ec2750b3c2 fix repeating passes (#16606)
6 years ago
Wojciech Uss 9b6a029666 fix dataset reading and add support for full dataset (#16559)
6 years ago
lidanqing 2ca0de3cd4 fix preprocess script with processbar, integrity check and logs (#16608)
6 years ago
Tao Luo ce18710421 enhance analyzer_tests download
6 years ago
石晓伟 5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Shixiaowei02 7b9fc71076 update tensorrt subgraph_util test=develop
6 years ago
Wojciech Uss 2498395132 remove profiling from int8 test
6 years ago
Zhaolong Xing 3e6aa498d6
Merge pull request #16526 from NHZlX/refine_trt_anakin
6 years ago
Tao Luo 8f7b5883b8
Merge pull request #16529 from lidanqing-intel/lidanqing/preprocess-data
6 years ago
Tao Luo 5b24002389
Merge pull request #16399 from sfraczek/sfraczek/analyzer_int8_resnet50_test
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
lidanqing 0d656996bf fix some bugs of unzip and reading val list
6 years ago
nhzlx d065b5bf2b Anakin ssd support
6 years ago
lidanqing b46e467abc add wget and unzip part and change data_dir
6 years ago
lidanqing 894aa9b235 change script file name and data_dir location
6 years ago
lidanqing 57f51e5b08 preprocess with PIL the full val dataset and save binary
6 years ago
chengduo ed61d67c73
Fix the interface of Pass::Apply (#16484)
6 years ago
Sylwester Fraczek 8ece7a9708 fixed url to dataset
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
Sylwester Fraczek fe21578a44 create test for quantized resnet50
6 years ago
Michał Gallus 2d8b7b3a76 Refine default MKL-DNN Pass order (#16490)
6 years ago
Wojciech Uss 09dfc7a2aa
C-API quantization core 2 (#16396)
6 years ago
Yihua Xu 57dc3c1943 Disable compare for Issue#16316 (#16466)
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx 45b3766fdf fix comments
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
liuwei1031 de3b70a101
fix cdn issue, test=develop (#16423)
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
6 years ago
Tao Luo 294cdf6f48
Merge pull request #16177 from fc500110/remove_visualizer
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
nhzlx 4f4daa4b66 cherry-pick from feature/anakin-engine: add data type for zero copy #16313
6 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
flame a32d420043 cherry-pick from feature/anakin-engine: batch norm (#16110)
6 years ago
flame 0945b97f07 cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020)
6 years ago