Commit Graph

539 Commits (ad6e3dd69cd915dd61287e96de7ec4ae132d24a5)

Author SHA1 Message Date
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
wopeizl 83c4f7721f use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
wopeizl d9991dccdd add parallel build script to ci … (#16901)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
Tao Luo ca8b8fa0bd Merge pull request #16830 from Superjomn/fix/tmp-memory-optim
6 years ago
lijianshe02 de26df440b add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
superjomn f58c3ec189 fix memory optim temporarily
6 years ago
liuwei1031 85363848a1 Security issue (#16774)
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
6 years ago
Yan Chunwei 044ae2497d fix identity temporarily (#15942)
6 years ago
Wojciech Uss ec2750b3c2 fix repeating passes (#16606)
6 years ago
Wojciech Uss 9b6a029666 fix dataset reading and add support for full dataset (#16559)
6 years ago
石晓伟 5dea0bdd1b Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Shixiaowei02 bddb2cd315 resolve conflicts with the develop branch test=develop
6 years ago
nhzlx d065b5bf2b Anakin ssd support
6 years ago
Michał Gallus 2d8b7b3a76 Refine default MKL-DNN Pass order (#16490)
6 years ago
Wojciech Uss 09dfc7a2aa C-API quantization core 2 (#16396)
6 years ago
nhzlx 953bdde058 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx 45b3766fdf fix comments
6 years ago
liuwei1031 de3b70a101 fix cdn issue, test=develop (#16423)
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
6 years ago
luotao1 056599a738 add expected_kernel_cache_pass
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
nhzlx 4f4daa4b66 cherry-pick from feature/anakin-engine: add data type for zero copy #16313
6 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83 Revert "cache runtime_context"
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
luotao1 1510b866b6 turn off runtime_context_cache for tensorrt
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
luotao1 1283833395 zero_copy tensor support INT32
6 years ago
luotao1 31c4e1d9fc Merge branch 'develop' into zero_copy
6 years ago
Tao Luo e5e7e9b865 Merge branch 'develop' into transformer_ut
6 years ago
Tao Luo 6f2581e4c5 Merge pull request #16090 from lidanqing-intel/paddle-int32
6 years ago
Zhaolong Xing 3d63aa0a11 Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
nhzlx a9ed427749 cant not pass ci
6 years ago
luotao1 fad06cb928 unify ZeroCopy in analysis_test
6 years ago
lidanqing 4aeb261da9 Add INT32 support. INT32 in last switch case
6 years ago
luotao1 06aab1b493 refine SetCpuMathLibraryNumThreads
6 years ago
nhzlx 3c40cb767b 7 refine zero copy
6 years ago
Yiqun Liu 1616c32acf Add the include of cudnn.h to enable the use of CUDNN_VERSION. (#15961)
6 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx 06a088a199 fix comments and fix cpplint
6 years ago
nhzlx 0ed63b2108 6. delete useless predictor id
6 years ago
Sylwester Fraczek 1943119fc5 fix typo memeroy->memory
6 years ago
Sylwester Fraczek 8bc604571f fix typo seriazlized->serialized
6 years ago
Sylwester Fraczek 543e53db05 fix typo releated->related
6 years ago
tensor-tang e1c707fe9c fix warnings (#15790)
6 years ago
nhzlx 2070fb246d 4. do the trt_engine optim during init.
6 years ago
Yan Chunwei 3a5d6e5e64 move passes to src to avoid different behavior in deployment (#15705)
6 years ago
Yan Chunwei c00ed19df2 add more comment (#15603)
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
Chunwei d85c2e4e5c fix anakin compile dependency
6 years ago
qingqing01 943d972878 Fix analysis predictor when loading the persistable RAW type variable. (#15613)
6 years ago
Yan Chunwei e887d71958 fix ir debug config (#15571)
6 years ago
Yan Chunwei 897789b16e fix save_inferece_model bug (#15365)
6 years ago
Tao Luo 3d0ecab41b add analyzer_transformer_test
6 years ago
Yan Chunwei 655179089f AnalysisConfig remove contrib namespace (#15540)
6 years ago
qingqing01 a6910f900e Always create variables in analysis_predictor before OptimizeInferenceProgram. (#15533)
6 years ago
Yan Chunwei b62b756b28 add version support (#15469)
6 years ago
Yan Chunwei 526790e652 infer get program (#15511)
6 years ago
Zhaolong Xing 97b76c94c4 Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Yan Chunwei e2818c8608 add dynamic memory optim (#15457)
6 years ago
nhzlx 92cf4a4c6b fix comments
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 9641324995 fix comments
6 years ago
nhzlx 484b3bc801 When cudnn version < 7100, there is problem with conv_fusion.
6 years ago
flame d60751fb71 add python inference api (#15248)
6 years ago
Yan Chunwei 885c4e57ab fea/infer memory optim2 (#14953)
6 years ago
Yan Chunwei c9e5aa19c1 get tensor API add more comments (#15345)
7 years ago
Yan Chunwei e84234b551 make clone thread safe (#15363)
7 years ago
Zhaolong Xing 236201c222 Merge pull request #15350 from NHZlX/fix_bug_for_precditor
7 years ago
Yan Chunwei e07900d317 cache tensor ptr in ZeroCopyTensor (#15352)
7 years ago
Yan Chunwei b7916440ff hot fix the Native clone (#15344)
7 years ago
nhzlx b95f2ff8fe fix win build bug
7 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
7 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
7 years ago
Yiqun Liu 568cc2ffa8 Optimize while_op for test (#14764)
7 years ago
nhzlx b2ba3471fd fix analysis config bug.
7 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
7 years ago
tensor-tang a89296ac1f add repeated fc relu pass
7 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
7 years ago
wopeizl 5d9edb4124 Merge pull request #15156 from wopeizl/windows/fixgpuissue
7 years ago
tensor-tang 146e942c65 Merge pull request #15250 from tensor-tang/refine/seqpool/feed
7 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
7 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
7 years ago
peizhilin e239558e56 remove the dismatch enclosure to avoid warning message test=develop
7 years ago
Tao Luo 2b11c710b3 Merge pull request #15249 from NHZlX/fix_trt_demo_ci
7 years ago
tensor-tang 137060135e fix zerocopy size
7 years ago
nhzlx e7d83389e6 fix demo ci bug
7 years ago
nhzlx 4e3522e5b4 add trt int8 support
7 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
7 years ago
Yan Chunwei d09d6eadc0 make inference api work with Doxygen (#15195)
7 years ago
Yan Chunwei 875a07c32d refactor inference analysis api (#14634)
7 years ago
tensor-tang 516fe301ee add comment in case of empty name
7 years ago
tensor-tang dca68cdf97 throw error when name not find
7 years ago
tensor-tang cd94df8679 fix load and refine
7 years ago
Zhaolong Xing 4048cfa9da Merge pull request #15048 from NHZlX/add_affine_channel_fuse
7 years ago
Zeng Jinle c0bcff00dc Merge pull request #14962 from sneaxiy/rewrite_variable_type
7 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
7 years ago
nhzlx 02e17396c2 fix comments
7 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
7 years ago
sneaxiy dde3afe7b7 Merge develop
7 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
7 years ago
nhzlx ce3782c193 add affine_channel fuse.
7 years ago
qingqing01 51a9fca323 Async memory copy (#15013)
7 years ago
sneaxiy ae6f46a1a9 rewrite variable type
7 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
Zhaolong Xing a9fb34fad8 Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
7 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
7 years ago
wopeizl 0f085f0a5a Merge pull request #14892 from wopeizl/windows/port3
7 years ago
nhzlx fcc93d96d5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
7 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
nhzlx 4e4a777243 add conv+elementwiseadd pass
7 years ago
Yan Chunwei a985949be9 Fea/fuse conv elementwise add fuse (#14669)
7 years ago
Yu Yang 04a570b463 Fix ut
7 years ago
peizhilin 23dec78772 fix script issue
7 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
7 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
7 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
7 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
Tao Luo 42359e88a4 clean code
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
Xin Pan 7e0801d4ed Merge pull request #14441 from baojun-nervana/intel/ngraph_op
7 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
7 years ago
Yan Chunwei 4b7617740e fix container not cleared (#14231)
7 years ago
nhzlx 49c28b8c52 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
7 years ago
Sang Ik Lee 24e70920db Refactor some build settings.
7 years ago
Sang Ik Lee d6125a5eec Include ngraph in inference demo build.
7 years ago
Tao Luo b4de023ee1 Merge pull request #14636 from Superjomn/fix/word2vec
7 years ago
nhzlx d3e140a572 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
7 years ago
nhzlx 900fbb83f9 add params sync pass
7 years ago
superjomn 9c665c81ae update
7 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
Yiqun Liu 726f2cefe3 Fix bug of referencing a temporary variable. (#14614)
7 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
7 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
qingqing01 39ec80def4 Remove the memory copy of feeding data in C++ inference API (#14577)
7 years ago
peizhilin 1afa9492af Recover the profiler
7 years ago
Yiqun Liu bf222f197d Use sub scope in tensor_array_to_tensor op. (#14524)
7 years ago
dzhwinter 840c1b29ad test=develop (#14562)
7 years ago
luotao1 116979a40a refine api name
7 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
7 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
7 years ago
nhzlx faeb9b8aa9 fix compile rely problem
7 years ago
nhzlx b742d46520 fix demo ci bug on trt
7 years ago
hjchen2 a8c077df7c Implement leaky relu tensorRT converter
7 years ago
Superjomn e878a8e885 update
7 years ago
superjomn 4bf6817cbc fix gpu load model
7 years ago
Zhaolong Xing 2f27c048cc Merge pull request #14440 from hjchen2/develop
7 years ago
hjchen2 21f33b4274 Complete PRelu plugin and Conv2d transpose op converter
7 years ago
Sylwester Fraczek 8a1eeec579 add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278)
7 years ago
dzhwinter d3aed98d86 Merge pull request #14320 from wopeizl/windows/online
7 years ago
Yiqun Liu 9e6b1c5f97 Refine tester of TensorRT engine (#14390)
7 years ago
peizhilin 1a9008c420 code style fix
7 years ago
nhzlx ddb120357c Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_trt_plugin
7 years ago
peizhilin 30ddc07a7e Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yan Chunwei 9f252e0032 Combine Inference Analysis with IR (#13914)
7 years ago
nhzlx d38fd6a0fc add plugin support and offer an simple split sample
7 years ago
peizhilin ca60e1d34d Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Qiyang Min 698698f2fa Merge branch 'develop' into fix_vlog
7 years ago
qingqing01 abe209234f Exhaustive search for cuDNN conv. (#14286)
7 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
7 years ago
Tao Luo 813e54efbd Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
7 years ago
minqiyang 3db9fad764 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
minqiyang 3da43dcae2 Because anakin do NOT use glog, so we revert anakin related change
7 years ago
Tao Luo 387610aae1 Merge pull request #14325 from luotao1/fix_test_analysis_predictor
7 years ago
peizhilin 45125ba538 fix share library issue
7 years ago
Zhaolong Xing ba8b5619a3 Revert "cherry picked windows patches."
7 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
7 years ago
Tao Luo 381bea0a16 fix test_analysis_predictor
7 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
7 years ago
peizhilin 869487a2b7 Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
7 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
7 years ago
qingqing01 ce7d9b0799 Exhaustive search for cuDNN conv. (#14043)
7 years ago
peizhilin 9d67c1fb69 cpu build support
7 years ago
dzhwinter 60f70b174d test=develop
7 years ago
dzhwinter cc02353d10 test=develop
7 years ago
dzhwinter eb2f7ed21b refine tests. test=develop
7 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
7 years ago