489 Commits (dc5a7b906d18e1b0d26fe65e69d92e21966463fa)

Author SHA1 Message Date
dzhwinter bfa9b268de fix elementwise
7 years ago
Jiabin Yang 6ba2b22279 Merge pull request #13096 from JiabinYang/fix_mac
7 years ago
luotao1 fb077c17e6 add shared library for analysis
7 years ago
tensor-tang 9f02497b23 follow comment
7 years ago
tensor-tang 713e86486d bugfix ditu test
7 years ago
tensor-tang 63b38ca40b add lac test
7 years ago
tensor-tang 663a11ac7c bugfix and follow comment
7 years ago
nhzlx 5ec2fb0c93 add flexibledfs for find path between two nodes
7 years ago
luotao1 f3b7e18be9 add static library for analysis
7 years ago
luotao1 0fbe0a7a28 add multi-thread ut for ditu-rnn
7 years ago
luotao1 b3cd2ae88b Merge branch 'develop' into ner_ut2
7 years ago
Yan Chunwei af15f6f038 fea/refine fuse (#13076)
7 years ago
luotao1 07cb64adc0 add unit-test for chinese_ner
7 years ago
Xin Pan 823c4f87be Merge pull request #13058 from panyx0718/infer
7 years ago
Jiabin Yang cceffca6bf Update api_impl.cc
7 years ago
Jiabin Yang 5d5b70ad79 Update CMakeLists.txt
7 years ago
JiabinYang 7c7d3d6172 Fix mac
7 years ago
Yan Chunwei cfa6bbb755 move nodeid from graph to node (#13065)
7 years ago
Xin Pan 5adf118ab5 polish
7 years ago
Xin Pan c558f059ad fix
7 years ago
Xin Pan 4794d9cf70 use fast RunPrepareContext for inference
7 years ago
Yan Chunwei 902f19b46a fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
Xin Pan 2bb15f437c Merge pull request #12791 from panyx0718/ir3
7 years ago
Xin Pan 880cb8c4c3 clean
7 years ago
Xin Pan 1a67061fee graph to program pass
7 years ago
nhzlx 478eeabdd4 refine uttest of api_tensorrt_subgraph_engine
7 years ago
nhzlx 791aa7f49d merge develop
7 years ago
dzhwinter 4fcc293617 memory module (#12931)
7 years ago
nhzlx 3de4556659 concat op && map cnn model support
7 years ago
luotao1 9c7fde45a7 enhance test_analyzer to profile ditu inference demo
7 years ago
Tao Luo decda738b0 fea/anakin compile with demo (#12772)
7 years ago
Yan Chunwei 9ee698e605 enhance/ditu rnn with fc fuse (#12831)
7 years ago
nhzlx c999895e93 merge develop
7 years ago
nhzlx 276950291a 1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei 896a37b6e3 fea/link ir to inference analysis and fc fuse support (#12789)
7 years ago
nhzlx ff052c0e6f merge develop
7 years ago
nhzlx c6a5c4b0c0 add comments for execute in ut_helper
7 years ago
tangwei12 99f74be561 Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
7 years ago
luotao1 808e5b1748 fix tensorrt compiler bug
7 years ago
nhzlx 1bf9d9e90c fix comments
7 years ago
tangwei12 cfb12f09bf fix some teeny mistakes
7 years ago
Tao Luo 7decbaaa13 Merge pull request #12762 from luotao1/anakin_cuda_env
7 years ago
nhzlx 144b20c160 add batch norm op converter
7 years ago
nhzlx 14311bb094 merge develop
7 years ago
Zhaolong Xing e5674f6dde Merge pull request #12753 from NHZlX/add_benchmark
7 years ago
Zhaolong Xing 310708726b Merge pull request #12761 from NHZlX/global_pooling_trt
7 years ago
nhzlx 1e92baf746 fix comments
7 years ago
nhzlx ce7f361a80 fix comments
7 years ago
nhzlx df9cbabcee add pool2d test for global_pooling true
7 years ago
Yan Chunwei 6fe5547db7 switch NodeAttr to boost::varient (#12539)
7 years ago
nhzlx 133ec69625 add batch norm trt converter
7 years ago
luotao1 413bf9d494 disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter f36818d532 "windows testing easier" (#12739)
7 years ago
nhzlx 2bdd20be22 add support for global pooling for trt
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
luotao1 9f3789944c use latest anakin commit
7 years ago
Yan Chunwei e765dead86 add profiler to fluid inference (#12707)
7 years ago
Zhaolong Xing 83c85f34e8 Merge pull request #12598 from NHZlX/add_tensorrt_softmax
7 years ago
Tao Luo 1e1974c998 Merge pull request #12563 from luotao1/anakin_test
7 years ago
Wu Yi 8b77448d5f hide misc APIs (#12540)
7 years ago
luotao1 a222d336ca modify the anakin_model download dir
7 years ago
luotao1 22bc328951 fix anakin-NOTFOUND compiler error
7 years ago
luotao1 b2367f3661 update anakin.cmake
7 years ago
xzl 29ad9794bb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1 f4bcee1d6f Merge branch 'develop' into anakin_test
7 years ago
luotao1 94042ccd2d add comment
7 years ago
Yan Chunwei 7555cfe33a fix inference double free bug (#12613)
7 years ago
Luo Tao 64c0ba288a fix inference_lib_dist error
7 years ago
nhzlx 641f32da8c add softmax op converter
7 years ago
nhzlx 943950c190 refine graph draw
7 years ago
nhzlx 7a019cd608 merge develop
7 years ago
nhzlx e823ce68bb filter redundant output
7 years ago
nhzlx c69ae865db fix comments
7 years ago
Luo Tao e8aa6d1283 add anakin compiler from github source code
7 years ago
nhzlx e8954a36f5 merge develop
7 years ago
nhzlx 32a9e050bc mapping the variable name inside the subgraph
7 years ago
Luo Tao cf74473244 make inference_anakin_test SERIAL
7 years ago
superjomn ebe1920626 add comment
7 years ago
superjomn 3c5e15de03 disable anakin test
7 years ago
Zhaolong Xing d7dd0868db Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
7 years ago
nhzlx d50f776b27 merge develop
7 years ago
nhzlx 64a08f840f increase the test batch
7 years ago
nhzlx c7e6a11bc1 merge develop
7 years ago
nhzlx 0015df1b12 modify op converter for conv2d
7 years ago
gongweibao 819ac3df0a Modify style (#12465)
7 years ago
cuichaowen 046de2acdb Improve anakin feature (#11961)
7 years ago
nhzlx c13efe02d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx a5c96af33c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yan Chunwei dcfbc6a661 inference analyzer as bin (#12450)
7 years ago
Yan Chunwei 31a2c87688 fea/lightly support lod (#12451)
7 years ago
nhzlx 5fcdd81da7 tiny modify
7 years ago
nhzlx f05c7fb8ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx 6f6d552790 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Superjomn 4d2405d851 inference analysis support ssa
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
nhzlx 98948b975e wrong added file
7 years ago
nhzlx 830aa12c1a add elementwise init code
7 years ago
Zhaolong Xing 85c4912755 Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago
nhzlx 4f71a3b12b fix a bug
7 years ago
Luo Tao 83e59257d0 fix manylinux1 Failed to publish artifacts
7 years ago
nhzlx c8adfb3451 add paddle_enforce
7 years ago
nhzlx 5533400720 fix comments
7 years ago
Luo Tao 5ba4337698 unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx 01566fb61b 1. support mutil batch utest 2. support pool op
7 years ago
nhzlx 990741aa85 add weight's dim assert
7 years ago
nhzlx 21890ca0cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
tensor-tang 7b63b85086 fix mismatch of infer api (#12342)
7 years ago
nhzlx fc41eb40b1 add conv2d trt converter
7 years ago
nhzlx 4d49e61ab8 fix comments
7 years ago
nhzlx bcd67bdd71 add assert for GetOutput
7 years ago
nhzlx 7382f98600 1. set ut batch > 1 2. readd the mul op(utest will be added later)
7 years ago
nhzlx bd64979fe9 the argument should not be a const one
7 years ago
nhzlx f42ea48996 deal with conflict
7 years ago
nhzlx 82527696e7 1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface
7 years ago
nhzlx 2372daff1d there is no batchsize concept in tensorrt's tensor
7 years ago
Yan Chunwei 9e0a94f069 inference-api code clean (#12274)
7 years ago
Yan Chunwei b42ced8eda bugfix/tensorrt analysis fix subgraph trigger (#12266)
7 years ago
qiaolongfei 0e30c9d6fb fix mac build
7 years ago
Tao Luo 3694fd5c4a Merge pull request #12109 from emailweixu/cpu_only
7 years ago
nhzlx d384d39a68 add Temporarily add code with bug
7 years ago
Tao Luo a8f0931428 Merge pull request #12229 from luotao1/api_doc
7 years ago
Luo Tao 43c1481f88 fix dead link in high_level_api.md
7 years ago
tensor-tang d4691cedec fix mac build
7 years ago
Luo Tao 2e68abf47c rename api.h to paddle_inference_api.h, put demo_ci in fluid_install_dir
7 years ago
Luo Tao 44b6a5f308 fix inference_lib.cmake and make demo_ci pass
7 years ago
Luo Tao af1e54acd8 fix compiler error after move
7 years ago
Luo Tao 369dfb3d0f move contrib/inference to paddle/fluid/inference/api
7 years ago
Wei Xu 264e8305b0 Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Luo Tao b1a1124d36 fix compiler and run error in static library
7 years ago
Luo Tao 24ced1d0b9 add independent demo for test static fluid library
7 years ago
Yan Chunwei 0cefb9461f add topological sortting (#12059)
7 years ago
tensor-tang f92024470b Merge pull request #12052 from tensor-tang/refine/infer/api/static
7 years ago
tensor-tang 2238ea56de paddle fluid static lib symbol hidden
7 years ago
Luo Tao fc3e7341fc fix compile warning in inference related codes
7 years ago
tensor-tang 3df99e72ab Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
7 years ago
dzhwinter 4ed0b62476 Move fluid::framework::InitDevices into fluid::platform (#11757)
7 years ago
Yan Chunwei 4f555909ce analysis/code clean (#11964)
7 years ago
sneaxiy 3f9292c6e6 fix merge conflict
7 years ago
sneaxiy dd70fb4393 fix type comparation bugs
7 years ago
Xin Pan a9086bf320 also move a few other dir to legacy/
7 years ago
Yan Chunwei 5e2656449c add inference-analysis doc (#11813)
7 years ago
gongweibao c2165ffa7b Fix codesytle (#11836)
7 years ago
fengjiayi aab47cc08d fix Mac compile errors (#11829)
7 years ago
superjomn ba99bc2384 update
7 years ago
superjomn f1224945ba fix analysis compile bug
7 years ago
Yan Chunwei 5082642bdb feature/analysis to support sub-graph for TRT engine (#11538)
7 years ago
tensor-tang e3a96300bb move SetNumThreads to platform
7 years ago
tensor-tang 1f09ddf806 Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
gongweibao 19958eeb71 fix (#11590)
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago
gongweibao 4dda54aa5a Fix unlikely (#11537)
7 years ago
Yan Chunwei d734595978 Feature/pass manager (#11440)
7 years ago
tensor-tang 609dccfb55 Merge pull request #11395 from tensor-tang/fix
7 years ago
tensor-tang 0ddc5d8631 Merge pull request #11258 from tensor-tang/refine
7 years ago
tensor-tang 6c1cf60950 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yan Chunwei 5fd142c3fd bugfix/trt engine op (#11487)
7 years ago
tensor-tang c453573286 Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang 9169b3b802 Merge pull request #10789 from Xreki/core_fix_openblas_threads
7 years ago
tensor-tang 6a32f19865 fix unknown use_mkldnn
7 years ago
gongweibao d9de6b8621 Add brpc surpport. (#11263)
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
Yan Chunwei 145aaa4b49 loose threshold of TRT for CI in different model (#11305)
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Luo Tao f6fb51a164 add test_mode in trt/activation_op
7 years ago
Luo Tao c73977af03 Merge branch 'develop' into trt
7 years ago
tensor-tang 9cf1f351d2 refine nlp test
7 years ago
tensor-tang 3a294042c8 Merge pull request #11233 from tensor-tang/multithreads
7 years ago
Yan Chunwei 4f95bc9463 feature/trt engine op test (#11182)
7 years ago
tensor-tang 944bdee738 Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
tensor-tang 6840953305 refine nlp multi-threads
7 years ago
Luo Tao e116129f03 rewrite unittest of trt_activation_op
7 years ago
Yan Chunwei df87e63baa add dfg graphviz pass (#11211)
7 years ago
tensor-tang 6ae7cbe252 follow comments
7 years ago
tensor-tang 99d00cce93 follow comment: refine where time started
7 years ago
tensor-tang 38f8182df6 work around with dummy test
7 years ago
tensor-tang eaeb76c419 add some comments
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173 fix compile error (#11119)
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb feature/add TRT fc converter (#11043)
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8 Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e Move sync_mode device ctx from grpc server (#10881)
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772 inference API little fix (#11069)
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Yan Chunwei 211e131525 feature/tensorrt engine op (#11001)
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago