Commit Graph

488 Commits (5f2e837847db9fff219333e03f4867abaa75768c)

Author SHA1 Message Date
luotao1 413bf9d494 disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter f36818d532
"windows testing easier" (#12739)
7 years ago
nhzlx 2bdd20be22 add support for global pooling for trt
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
luotao1 9f3789944c use latest anakin commit
7 years ago
Yan Chunwei e765dead86
add profiler to fluid inference (#12707)
7 years ago
Zhaolong Xing 83c85f34e8
Merge pull request #12598 from NHZlX/add_tensorrt_softmax
7 years ago
Tao Luo 1e1974c998
Merge pull request #12563 from luotao1/anakin_test
7 years ago
Wu Yi 8b77448d5f
hide misc APIs (#12540)
7 years ago
luotao1 a222d336ca modify the anakin_model download dir
7 years ago
luotao1 22bc328951 fix anakin-NOTFOUND compiler error
7 years ago
luotao1 b2367f3661 update anakin.cmake
7 years ago
xzl 29ad9794bb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1 f4bcee1d6f Merge branch 'develop' into anakin_test
7 years ago
luotao1 94042ccd2d add comment
7 years ago
Yan Chunwei 7555cfe33a
fix inference double free bug (#12613)
7 years ago
Luo Tao 64c0ba288a fix inference_lib_dist error
7 years ago
nhzlx 641f32da8c add softmax op converter
7 years ago
nhzlx 943950c190 refine graph draw
7 years ago
nhzlx 7a019cd608 merge develop
7 years ago
nhzlx e823ce68bb filter redundant output
7 years ago
nhzlx c69ae865db fix comments
7 years ago
Luo Tao e8aa6d1283 add anakin compiler from github source code
7 years ago
nhzlx e8954a36f5 merge develop
7 years ago
nhzlx 32a9e050bc mapping the variable name inside the subgraph
7 years ago
Luo Tao cf74473244 make inference_anakin_test SERIAL
7 years ago
superjomn ebe1920626 add comment
7 years ago
superjomn 3c5e15de03 disable anakin test
7 years ago
Zhaolong Xing d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
7 years ago
nhzlx d50f776b27 merge develop
7 years ago
nhzlx 64a08f840f increase the test batch
7 years ago
nhzlx c7e6a11bc1 merge develop
7 years ago
nhzlx 0015df1b12 modify op converter for conv2d
7 years ago
gongweibao 819ac3df0a
Modify style (#12465)
7 years ago
cuichaowen 046de2acdb Improve anakin feature (#11961)
7 years ago
nhzlx c13efe02d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx a5c96af33c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yan Chunwei dcfbc6a661
inference analyzer as bin (#12450)
7 years ago
Yan Chunwei 31a2c87688
fea/lightly support lod (#12451)
7 years ago
nhzlx 5fcdd81da7 tiny modify
7 years ago
nhzlx f05c7fb8ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx 6f6d552790 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Superjomn 4d2405d851 inference analysis support ssa
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
nhzlx 98948b975e wrong added file
7 years ago
nhzlx 830aa12c1a add elementwise init code
7 years ago
Zhaolong Xing 85c4912755
Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago
nhzlx 4f71a3b12b fix a bug
7 years ago
Luo Tao 83e59257d0 fix manylinux1 Failed to publish artifacts
7 years ago
nhzlx c8adfb3451 add paddle_enforce
7 years ago
nhzlx 5533400720 fix comments
7 years ago
Luo Tao 5ba4337698 unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx 01566fb61b 1. support mutil batch utest 2. support pool op
7 years ago
nhzlx 990741aa85 add weight's dim assert
7 years ago
nhzlx 21890ca0cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
tensor-tang 7b63b85086 fix mismatch of infer api (#12342)
7 years ago
nhzlx fc41eb40b1 add conv2d trt converter
7 years ago
nhzlx 4d49e61ab8 fix comments
7 years ago
nhzlx bcd67bdd71 add assert for GetOutput
7 years ago
nhzlx 7382f98600 1. set ut batch > 1 2. readd the mul op(utest will be added later)
7 years ago
nhzlx bd64979fe9 the argument should not be a const one
7 years ago
nhzlx f42ea48996 deal with conflict
7 years ago
nhzlx 82527696e7 1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface
7 years ago
nhzlx 2372daff1d there is no batchsize concept in tensorrt's tensor
7 years ago
Yan Chunwei 9e0a94f069
inference-api code clean (#12274)
7 years ago
Yan Chunwei b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger (#12266)
7 years ago
qiaolongfei 0e30c9d6fb fix mac build
7 years ago
Tao Luo 3694fd5c4a
Merge pull request #12109 from emailweixu/cpu_only
7 years ago
nhzlx d384d39a68 add Temporarily add code with bug
7 years ago
Tao Luo a8f0931428
Merge pull request #12229 from luotao1/api_doc
7 years ago
Luo Tao 43c1481f88 fix dead link in high_level_api.md
7 years ago
tensor-tang d4691cedec fix mac build
7 years ago
Luo Tao 2e68abf47c rename api.h to paddle_inference_api.h, put demo_ci in fluid_install_dir
7 years ago
Luo Tao 44b6a5f308 fix inference_lib.cmake and make demo_ci pass
7 years ago
Luo Tao af1e54acd8 fix compiler error after move
7 years ago
Luo Tao 369dfb3d0f move contrib/inference to paddle/fluid/inference/api
7 years ago
Wei Xu 264e8305b0 Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Luo Tao b1a1124d36 fix compiler and run error in static library
7 years ago
Luo Tao 24ced1d0b9 add independent demo for test static fluid library
7 years ago
Yan Chunwei 0cefb9461f
add topological sortting (#12059)
7 years ago
tensor-tang f92024470b
Merge pull request #12052 from tensor-tang/refine/infer/api/static
7 years ago
tensor-tang 2238ea56de paddle fluid static lib symbol hidden
7 years ago
Luo Tao fc3e7341fc fix compile warning in inference related codes
7 years ago
tensor-tang 3df99e72ab Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
7 years ago
dzhwinter 4ed0b62476
Move fluid::framework::InitDevices into fluid::platform (#11757)
7 years ago
Yan Chunwei 4f555909ce
analysis/code clean (#11964)
7 years ago
sneaxiy 3f9292c6e6 fix merge conflict
7 years ago
sneaxiy dd70fb4393 fix type comparation bugs
7 years ago
Xin Pan a9086bf320 also move a few other dir to legacy/
7 years ago
Yan Chunwei 5e2656449c
add inference-analysis doc (#11813)
7 years ago
gongweibao c2165ffa7b Fix codesytle (#11836)
7 years ago
fengjiayi aab47cc08d fix Mac compile errors (#11829)
7 years ago
superjomn ba99bc2384 update
7 years ago
superjomn f1224945ba fix analysis compile bug
7 years ago
Yan Chunwei 5082642bdb
feature/analysis to support sub-graph for TRT engine (#11538)
7 years ago
tensor-tang e3a96300bb move SetNumThreads to platform
7 years ago
tensor-tang 1f09ddf806 Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
gongweibao 19958eeb71
fix (#11590)
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago
gongweibao 4dda54aa5a
Fix unlikely (#11537)
7 years ago
Yan Chunwei d734595978
Feature/pass manager (#11440)
7 years ago
tensor-tang 609dccfb55
Merge pull request #11395 from tensor-tang/fix
7 years ago
tensor-tang 0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yan Chunwei 5fd142c3fd
bugfix/trt engine op (#11487)
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 9169b3b802
Merge pull request #10789 from Xreki/core_fix_openblas_threads
7 years ago
tensor-tang 6a32f19865 fix unknown use_mkldnn
7 years ago
gongweibao d9de6b8621
Add brpc surpport. (#11263)
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
Yan Chunwei 145aaa4b49
loose threshold of TRT for CI in different model (#11305)
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Luo Tao f6fb51a164 add test_mode in trt/activation_op
7 years ago
Luo Tao c73977af03 Merge branch 'develop' into trt
7 years ago
tensor-tang 9cf1f351d2 refine nlp test
7 years ago
tensor-tang 3a294042c8
Merge pull request #11233 from tensor-tang/multithreads
7 years ago
Yan Chunwei 4f95bc9463
feature/trt engine op test (#11182)
7 years ago
tensor-tang 944bdee738 Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
tensor-tang 6840953305 refine nlp multi-threads
7 years ago
Luo Tao e116129f03 rewrite unittest of trt_activation_op
7 years ago
Yan Chunwei df87e63baa
add dfg graphviz pass (#11211)
7 years ago
tensor-tang 6ae7cbe252 follow comments
7 years ago
tensor-tang 99d00cce93 follow comment: refine where time started
7 years ago
tensor-tang 38f8182df6 work around with dummy test
7 years ago
tensor-tang eaeb76c419 add some comments
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173
fix compile error (#11119)
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb
feature/add TRT fc converter (#11043)
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e
Move sync_mode device ctx from grpc server (#10881)
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772
inference API little fix (#11069)
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Yan Chunwei 211e131525
feature/tensorrt engine op (#11001)
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
tensor-tang c00843f4e8 enable multi-threads
7 years ago
Yancey d92a75bee4
Merge pull request #10550 from Yancey1989/overlap_send_op
7 years ago
Yan Chunwei f5fc9c3bc1
feature/mul converter (#10841)
7 years ago
Yancey1989 60d827a8b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
Xin Pan 2f0df56422 add inference interface impl
7 years ago
tensor-tang 400f5e7c3c add threads test
7 years ago
tensor-tang ce20dfa236 enable more choices
7 years ago
tensor-tang 602e28bf1c use the actual data
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yu Yang d736fb8047 Disable unstable test (#10920)
7 years ago
tensor-tang 1b8b253ec1 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 98fb8e58fd test infer nlp
7 years ago
Yan Chunwei b1d446856c
fix inference api (#10867)
7 years ago
yuyang18 fcbf19bf93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yan Chunwei 1153144fbb
Inference analysis/init data flow graph analysis (#10776)
7 years ago
yuyang18 1b69c25c92 Merge branch 'feature/sequnce_run_tests' into feature/refine_parallel_executor
7 years ago
yuyang18 91007fe974 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18 1426d794ff Force some unittests serial
7 years ago
tensor-tang 406c1dd143
Merge pull request #10701 from tensor-tang/usemkldnn
7 years ago
Yancey1989 952fa04009 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Liu Yiqun 50ba205d79 Merge branch 'develop' into core_fix_openblas_threads
7 years ago
Liu Yiqun 39eb871ddf Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
yuyang18 6db9c3c7d6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
daminglu ae1990731d
Test word2vec (#10779)
7 years ago
Xin Pan 8e3e65ff93
Merge pull request #10526 from panyx0718/infer_profile2
7 years ago
Wu Yi ebc7303990
listen_and_serv use local scope (#10663)
7 years ago
yuyang18 ceb150e9fa Merge remote-tracking branch 'yx/fix_bce_cdn_link' into feature/refine_parallel_executor
7 years ago
Yancey1989 274df85ca6 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
yuyang18 8a42c4749e Disable tests
7 years ago
Kexin Zhao eec1ac8638 fix warning
7 years ago
tensor-tang 661826a70a enable MKLDNN inference test
7 years ago
Yancey1989 00efc4ccfa Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
yuyang18 7c777dd549 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
Luo Tao 1992f70920 Merge branch 'develop' into refine_relu_test
7 years ago
Luo Tao be41c2ffa6 Merge branch 'develop' into refine_relu_test
7 years ago
Yiqun Liu b7026f79a9
Fix a bug related to dispensable inputs and refine the inference unittest (#10527)
7 years ago
Yan Chunwei 674bd839cd
OpConverter change BlockDesc to proto::BlockDesc (#10623)
7 years ago
Luo Tao 4f5f0be769 use the latest buffer to update the convert
7 years ago
Yan Chunwei de81ccb5cb
feature/analysis node representation (#10522)
7 years ago
Yancey1989 b35ea1a4d6 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Luo Tao a3ba264c47 Merge branch 'develop' into refine_relu_test
7 years ago
Lei Wang f3ffec23cf
CI: exit when fail any step. (#10579)
7 years ago
Tao Luo 28de0ea404
Merge pull request #10545 from luotao1/fix_tensorrt_engine
7 years ago
Luo Tao 4a5ebb6806 fix tensorrt_engine compiler error
7 years ago
Xin Pan dcb77813aa Revert "CI: rerun failed tests. (#10536)"
7 years ago
Lei Wang 0446220e01
CI: rerun failed tests. (#10536)
7 years ago
Tao Luo 303277f002
Merge pull request #10437 from panyx0718/infer2
7 years ago
Xin Pan f093a7b332 allow inference test to generate timeline
7 years ago
Luo Tao 40b8b634f9 Merge branch 'develop' into refine_relu_test
7 years ago
Yan Chunwei 819038113e Feature/engine refactor (#10497)
7 years ago
Yan Chunwei 6eeb819538
feature/inference analysis dot (#10494)
7 years ago
Xin Pan 6728d96d89 follow comments
7 years ago
Xin Pan 3de43a87ef Add a multi-dim add layer test.
7 years ago
Luo Tao 0ae97e8a5b Merge branch 'develop' into refine_relu_test
7 years ago
chengduoZH e00c1ee10f fix split var test
7 years ago
Luo Tao 89dcb0bd15 refine EngineIOConverter, and use io_convert in test_trt_activation_op
7 years ago
Tao Luo 3356fb3c6e
Merge pull request #10461 from luotao1/refine_convert
7 years ago
Luo Tao 53b401d589 refine io_convert and op_convert
7 years ago
Xin Pan 0c518888fa
Merge pull request #10430 from panyx0718/infer
7 years ago
Yan Chunwei 2a2c83b9e6 feature/convert tensorrt io (#10440)
7 years ago
Xin Pan 9fccf46270 reword comments
7 years ago
Xin Pan cdd52f3a30 Add comment to explain how to run inference test
7 years ago
Tao Luo 4646c0f35d
Merge pull request #10144 from luotao1/tr_convert_init
7 years ago
Kexin Zhao 7a86069422 Add float16 demo code and put float16 work in contrib/float16 folder (#10331)
7 years ago
Luo Tao beb1245560 add relu converter and unit-test
7 years ago
Abhinav Arora 55f0d84029
Fix Cpplint Issues in fluid/inference/tensorrt/ (#10318)
7 years ago
Luo Tao 9945265f09 Merge branch 'develop' into tr_convert_init
7 years ago
whs 2f9fa9b721
Merge pull request #10167 from wanghaoshuang/fluid_init
7 years ago
Luo Tao 6f6f330423 update the register method
7 years ago
Kexin Zhao 0ecc6fa8f3 Add float16 transpiler and image classification example (#10109)
7 years ago
Abhinav Arora 83b1a8f6bf
Pending more CPPLint errors in fluid/operators/math (#10243)
7 years ago
Abhinav Arora f457d5da06
Fix more CPPLint errors (#10218)
7 years ago
wanghaoshuang 848fb00215 Fix comments.
7 years ago
Luo Tao 326221acec Merge branch 'develop' into tr_convert_init
7 years ago
Abhinav Arora 4c8ff72615
Fix CPPLint errors with rxecutor (#10212)
7 years ago
Luo Tao c4e3010b14 use template to do registry
7 years ago
Yan Chunwei 2d57158e2b
fea/init tensorrt engine (#10003)
7 years ago
Luo Tao d599de5c41 auto registray op converters
7 years ago
Luo Tao 48473dddf4 Merge branch 'develop' into tr_convert_init
7 years ago
wanghaoshuang a4b452a2d6 Remove initP2P(bool) and init function in framework.
7 years ago
wanghaoshuang e4708565f4 Fix cpplint format.
7 years ago
wanghaoshuang 48b7b54321 Refine code.
7 years ago
wanghaoshuang 1bdea0a8d2 Add init interface for customize devices.
7 years ago
Luo Tao 42febfa928 tensorrt convert init
7 years ago
Luo Tao 71f51ff64a refine tensorrt cmake and dockerfile
7 years ago
Abhinav Arora 744ebcfa18
Fix CPPlint issues in fluid/inference (#10075)
7 years ago
Luo Tao d4682247e1 auto find tensorrt library
7 years ago
Yan Chunwei 186659798f
add tensorrt build support(#9891)
7 years ago
Liu Yiqun 449bdde58a Correct some typos.
7 years ago
Liu Yiqun 2762959f79 Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun 339be6254e Refine the order of arguments.
7 years ago
Yiqun Liu e90e7ab237
Remove the use of ARCHIVE_START/END (#9844)
7 years ago
Liu Yiqun bf485999f4 Merge branch 'develop' into core_inference_prepare
7 years ago
Yu Yang 5ceea265bb Disable unstable unittest
7 years ago
Liu Yiqun 720f6196ea Change the seed and make it not fixed for multi-threads cases.
7 years ago
Liu Yiqun e24172eb54 Simplify the inference unittest of fit a line and add some comment.
7 years ago
Liu Yiqun bdb21f6bc3 Merge branch 'develop' into core_inference_multi_thread
7 years ago
Liu Yiqun 93e9905482 Add unittest for calling CreateVariables manually.
7 years ago
Liu Yiqun a9855e4afd Merge branch 'develop' into core_inference_fix_run
7 years ago