Commit Graph

488 Commits (5f2e837847db9fff219333e03f4867abaa75768c)

Author SHA1 Message Date
Xin Pan 4313d870a2 refine
7 years ago
Xin Pan c69cf6dde8 fix
7 years ago
Xin Pan 926e1077ca version
7 years ago
tensor-tang ca973139fe
Merge pull request #13285 from tensor-tang/refine/ut/lac
7 years ago
tensor-tang 5a2fc5b52f fix print error
7 years ago
tensor-tang 3ea19b7596 fix bug and fc pass ut
7 years ago
tensor-tang acfdbf0293 enable ner analysis test and refine lac
7 years ago
tensor-tang df0c695618 fix fusion gru pass and enable it
7 years ago
luotao1 d4c3fe9a44 clean api_anakin_engine_rnn_tester
7 years ago
tensor-tang 7eebb90523 fix conflicts
7 years ago
tensor-tang 3c3ad1e4cf
Merge branch 'develop' into refine/ut/lac
7 years ago
tensor-tang ca30127e0a fix compile error undef registrar pass
7 years ago
tensor-tang 0618077971 Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang 6b104c90d3 fix profile
7 years ago
luotao1 00c7230996 Merge branch 'develop' into all_data
7 years ago
Yan Chunwei 6de0a18d5f
Refine/text classification support data (#13256)
7 years ago
Tao Luo 11b22883be
Merge pull request #12738 from luotao1/anakin_cpu
7 years ago
Xin Pan 883bbe1958
Merge pull request #13238 from panyx0718/clean
7 years ago
luotao1 4c283d87ef Merge branch 'develop' into all_data
7 years ago
luotao1 61cae53e79 support anakin for only-cpu environment
7 years ago
Yan Chunwei 225ecee5ea
refine/text classification tester (#13244)
7 years ago
tensor-tang 4d774953c6 enable fc gru fuse pass
7 years ago
tensor-tang 09016df8df make analyzer run
7 years ago
luotao1 fa5036aac8 add test_all_data in test_analyzer_ner
7 years ago
Xin Pan 18442a6088 rename pass.h/.cc to analysis_pass
7 years ago
tensor-tang 12b483c0db Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
luotao1 b4fa3dbda3 unify PrintTime of analysis unit-test
7 years ago
luotao1 f615ba2f8f update the multi-thread unit-tests
7 years ago
luotao1 35cff5e00d Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei 9df2d8b5ba
test/add text-classification test (#13081)
7 years ago
luotao1 1a373fbb0d add result check for multi-thread UT
7 years ago
luotao1 2dc23ffaa8 Merge branch 'develop' into multi-thread2
7 years ago
luotao1 8cb92fb18e speedup the download of inference_demo
7 years ago
luotao1 39ed148714 fix multi-thread hang temporary
7 years ago
luotao1 459d4cc811 Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo 907696709f
Merge pull request #13133 from luotao1/library
7 years ago
Jiabin Yang d091dd02a0 fix mac compile error 0903 (#13184)
7 years ago
Yan Chunwei 796c87d563
bugfix/fusion lstm (#13185)
7 years ago
luotao1 ae44efffee fix ci error
7 years ago
tensor-tang d83187dba8 enable lac analysis test
7 years ago
luotao1 6f18217386 fix codestyle
7 years ago
luotao1 d7b4965785 auto generate paddle_inference_pass.h
7 years ago
luotao1 0639a32477 Merge branch 'develop' into library
7 years ago
luotao1 f507e5c1f2 update multi-threads UT
7 years ago
luotao1 37d1a6685c Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo 737a033ed0
Merge pull request #13140 from dzhwinter/windows/inference_api
7 years ago
dzhwinter b4d43030ff
windows inference fix (#13141)
7 years ago
Yan Chunwei 597b73053d
refine/fc lstm fusion link (#13158)
7 years ago
tensor-tang 1e7ccf9f45
Merge pull request #13126 from tensor-tang/fea/infer/ut/lac-new
7 years ago
dzhwinter bfa9b268de fix elementwise
7 years ago
Jiabin Yang 6ba2b22279
Merge pull request #13096 from JiabinYang/fix_mac
7 years ago
luotao1 fb077c17e6 add shared library for analysis
7 years ago
tensor-tang 9f02497b23 follow comment
7 years ago
tensor-tang 713e86486d bugfix ditu test
7 years ago
tensor-tang 63b38ca40b add lac test
7 years ago
tensor-tang 663a11ac7c bugfix and follow comment
7 years ago
nhzlx 5ec2fb0c93 add flexibledfs for find path between two nodes
7 years ago
luotao1 f3b7e18be9 add static library for analysis
7 years ago
luotao1 0fbe0a7a28 add multi-thread ut for ditu-rnn
7 years ago
luotao1 b3cd2ae88b Merge branch 'develop' into ner_ut2
7 years ago
Yan Chunwei af15f6f038
fea/refine fuse (#13076)
7 years ago
luotao1 07cb64adc0 add unit-test for chinese_ner
7 years ago
Xin Pan 823c4f87be
Merge pull request #13058 from panyx0718/infer
7 years ago
Jiabin Yang cceffca6bf
Update api_impl.cc
7 years ago
Jiabin Yang 5d5b70ad79
Update CMakeLists.txt
7 years ago
JiabinYang 7c7d3d6172 Fix mac
7 years ago
Yan Chunwei cfa6bbb755
move nodeid from graph to node (#13065)
7 years ago
Xin Pan 5adf118ab5 polish
7 years ago
Xin Pan c558f059ad fix
7 years ago
Xin Pan 4794d9cf70 use fast RunPrepareContext for inference
7 years ago
Yan Chunwei 902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006)
7 years ago
Xin Pan 2bb15f437c
Merge pull request #12791 from panyx0718/ir3
7 years ago
Xin Pan 880cb8c4c3 clean
7 years ago
Xin Pan 1a67061fee graph to program pass
7 years ago
nhzlx 478eeabdd4 refine uttest of api_tensorrt_subgraph_engine
7 years ago
nhzlx 791aa7f49d merge develop
7 years ago
dzhwinter 4fcc293617
memory module (#12931)
7 years ago
nhzlx 3de4556659 concat op && map cnn model support
7 years ago
luotao1 9c7fde45a7 enhance test_analyzer to profile ditu inference demo
7 years ago
Tao Luo decda738b0
fea/anakin compile with demo (#12772)
7 years ago
Yan Chunwei 9ee698e605
enhance/ditu rnn with fc fuse (#12831)
7 years ago
nhzlx c999895e93 merge develop
7 years ago
nhzlx 276950291a 1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei 896a37b6e3
fea/link ir to inference analysis and fc fuse support (#12789)
7 years ago
nhzlx ff052c0e6f merge develop
7 years ago
nhzlx c6a5c4b0c0 add comments for execute in ut_helper
7 years ago
tangwei12 99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
7 years ago
luotao1 808e5b1748 fix tensorrt compiler bug
7 years ago
nhzlx 1bf9d9e90c fix comments
7 years ago
tangwei12 cfb12f09bf fix some teeny mistakes
7 years ago
Tao Luo 7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
7 years ago
nhzlx 144b20c160 add batch norm op converter
7 years ago
nhzlx 14311bb094 merge develop
7 years ago
Zhaolong Xing e5674f6dde
Merge pull request #12753 from NHZlX/add_benchmark
7 years ago
Zhaolong Xing 310708726b
Merge pull request #12761 from NHZlX/global_pooling_trt
7 years ago
nhzlx 1e92baf746 fix comments
7 years ago
nhzlx ce7f361a80 fix comments
7 years ago
nhzlx df9cbabcee add pool2d test for global_pooling true
7 years ago
Yan Chunwei 6fe5547db7
switch NodeAttr to boost::varient (#12539)
7 years ago
nhzlx 133ec69625 add batch norm trt converter
7 years ago
luotao1 413bf9d494 disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter f36818d532
"windows testing easier" (#12739)
7 years ago
nhzlx 2bdd20be22 add support for global pooling for trt
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
luotao1 9f3789944c use latest anakin commit
7 years ago
Yan Chunwei e765dead86
add profiler to fluid inference (#12707)
7 years ago
Zhaolong Xing 83c85f34e8
Merge pull request #12598 from NHZlX/add_tensorrt_softmax
7 years ago
Tao Luo 1e1974c998
Merge pull request #12563 from luotao1/anakin_test
7 years ago
Wu Yi 8b77448d5f
hide misc APIs (#12540)
7 years ago
luotao1 a222d336ca modify the anakin_model download dir
7 years ago
luotao1 22bc328951 fix anakin-NOTFOUND compiler error
7 years ago
luotao1 b2367f3661 update anakin.cmake
7 years ago
xzl 29ad9794bb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1 f4bcee1d6f Merge branch 'develop' into anakin_test
7 years ago
luotao1 94042ccd2d add comment
7 years ago
Yan Chunwei 7555cfe33a
fix inference double free bug (#12613)
7 years ago
Luo Tao 64c0ba288a fix inference_lib_dist error
7 years ago
nhzlx 641f32da8c add softmax op converter
7 years ago
nhzlx 943950c190 refine graph draw
7 years ago
nhzlx 7a019cd608 merge develop
7 years ago
nhzlx e823ce68bb filter redundant output
7 years ago
nhzlx c69ae865db fix comments
7 years ago
Luo Tao e8aa6d1283 add anakin compiler from github source code
7 years ago
nhzlx e8954a36f5 merge develop
7 years ago
nhzlx 32a9e050bc mapping the variable name inside the subgraph
7 years ago
Luo Tao cf74473244 make inference_anakin_test SERIAL
7 years ago
superjomn ebe1920626 add comment
7 years ago
superjomn 3c5e15de03 disable anakin test
7 years ago
Zhaolong Xing d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
7 years ago
nhzlx d50f776b27 merge develop
7 years ago
nhzlx 64a08f840f increase the test batch
7 years ago
nhzlx c7e6a11bc1 merge develop
7 years ago
nhzlx 0015df1b12 modify op converter for conv2d
7 years ago
gongweibao 819ac3df0a
Modify style (#12465)
7 years ago
cuichaowen 046de2acdb Improve anakin feature (#11961)
7 years ago
nhzlx c13efe02d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx a5c96af33c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yan Chunwei dcfbc6a661
inference analyzer as bin (#12450)
7 years ago
Yan Chunwei 31a2c87688
fea/lightly support lod (#12451)
7 years ago
nhzlx 5fcdd81da7 tiny modify
7 years ago
nhzlx f05c7fb8ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx 6f6d552790 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Superjomn 4d2405d851 inference analysis support ssa
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
nhzlx 98948b975e wrong added file
7 years ago
nhzlx 830aa12c1a add elementwise init code
7 years ago
Zhaolong Xing 85c4912755
Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago