Jiabin Yang
edb9e56934
Merge pull request #13401 from JiabinYang/mac/ci_unitest
...
add unitttest for mac on ci after some untest being disable
7 years ago
dzhwinter
85f8dd1c77
debug version
7 years ago
Yan Chunwei
3725f22442
Hotfix/api predictor ( #13383 )
...
* hotfix for PaddleTensor buffer.
7 years ago
Tao Luo
65b1fbb5d8
Merge pull request #13399 from luotao1/fix_cmake
...
fix text_classification download error
7 years ago
dzhwinter
e1999538eb
debug the device context
7 years ago
dzhwinter
372caf4000
windows staff
7 years ago
JiabinYang
87b11179e5
add unitttest for mac on ci after some untest being disable
7 years ago
tensor-tang
5fd2ffdce7
Merge pull request #13372 from tensor-tang/fea/ut/vis
...
add analysis vis ut
7 years ago
luotao1
e93c7b62dc
fix text_classification downlaod error
7 years ago
tensor-tang
26fc698f85
disable mkldnn fuse on ocr test
7 years ago
JiabinYang
9a9105018d
fix mac compile error in subgraph_splitter
7 years ago
tensor-tang
1a99302c14
refine and reuse code
7 years ago
tensor-tang
b7a64e8698
fix confilts
7 years ago
tensor-tang
4e4f952dea
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
tensor-tang
89d09e6594
Merge branch 'develop' into fea/ut/vis
7 years ago
Zhaolong Xing
c9995289f1
Merge pull request #13124 from NHZlX/fix_subgraph_bug
...
Fix tensorrt subgraph bug
7 years ago
Tao Luo
d4a5326ac6
Merge pull request #13387 from luotao1/nlp_multi_thread
...
add multi-thread for nlp unit-tests
7 years ago
Tao Luo
968a56b672
Merge pull request #13373 from sfraczek/conv-relu-pass-hotfix
...
hotfix for conv-relu pass
7 years ago
tensor-tang
0bd6476f67
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
luotao1
6b9ccd97a2
Merge branch 'develop' into nlp_multi_thread
7 years ago
luotao1
20b40cb06a
add multi-thread for nlp unit-tests
7 years ago
luotao1
29f5a93b5f
add analyzer_rnn2_test
7 years ago
nhzlx
0092ad3285
delete unused log
7 years ago
tensor-tang
dd0b2036c6
add note for use mkldnn
7 years ago
nhzlx
329a8c5283
merge develop
7 years ago
nhzlx
49bafc05bf
fix comments and set name for trt layer and ITensor
7 years ago
Sylwester Fraczek
dd149d469b
hotfix for conv-relu pass
7 years ago
tensor-tang
01f0f16884
enable mkldnn in infer api
7 years ago
tensor-tang
33c21291e7
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
tensor-tang
65f901b36f
disable fc gru temporarily
7 years ago
tensor-tang
539b3f300f
add ocr analysis ut
7 years ago
luotao1
b12322ce95
fix fusion_lstm unique_name bug
7 years ago
Tao Luo
f351ceb65a
Merge pull request #13345 from Superjomn/bugfix/lac_test
...
fix ner_test when bs>1
7 years ago
dzhwinter
c3e1fb5a3e
add demo
7 years ago
tensor-tang
8cbb3c0720
refine lac ut and fix fetch
7 years ago
superjomn
8e0fe035d4
fix ner_test when bs>1
7 years ago
nhzlx
df161e08f0
delete unuse ut
7 years ago
nhzlx
49b5b3c5b3
merge develop
7 years ago
nhzlx
03ff4f6892
fix subgraph bug!
7 years ago
Tao Luo
94b66bdb5d
Merge pull request #13326 from luotao1/analysis_test_refine
...
refine Analysis test
7 years ago
Xin Pan
17bf8713a5
Merge pull request #12988 from panyx0718/ir2
...
program and tensor versioning support
7 years ago
luotao1
9664c53c7c
fix cmake error to pass the ci
7 years ago
luotao1
81c21705b4
simplify inference/tests/api/CMakeLists.txt
7 years ago
luotao1
d0fbe78040
move analyzer_xxx_tester to inference/tests/api
7 years ago
luotao1
83af1b3b3e
move analyzer_rnn1_test out of analyzer_test
7 years ago
Yan Chunwei
2fd1bf2ea6
fea/add color log ( #13305 )
7 years ago
Yan Chunwei
5023530a8a
Refactor/remove sensitive ( #13314 )
7 years ago
Yan Chunwei
478a4e850e
refactor ir pattern ( #13304 )
7 years ago
Xin Pan
4313d870a2
refine
7 years ago
Xin Pan
c69cf6dde8
fix
7 years ago
Xin Pan
926e1077ca
version
7 years ago
tensor-tang
ca973139fe
Merge pull request #13285 from tensor-tang/refine/ut/lac
...
add analysis unit test of lac and ner
7 years ago
tensor-tang
5a2fc5b52f
fix print error
7 years ago
tensor-tang
3ea19b7596
fix bug and fc pass ut
7 years ago
tensor-tang
acfdbf0293
enable ner analysis test and refine lac
7 years ago
tensor-tang
df0c695618
fix fusion gru pass and enable it
7 years ago
luotao1
d4c3fe9a44
clean api_anakin_engine_rnn_tester
7 years ago
tensor-tang
7eebb90523
fix conflicts
7 years ago
tensor-tang
3c3ad1e4cf
Merge branch 'develop' into refine/ut/lac
7 years ago
tensor-tang
ca30127e0a
fix compile error undef registrar pass
7 years ago
tensor-tang
0618077971
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang
6b104c90d3
fix profile
7 years ago
luotao1
00c7230996
Merge branch 'develop' into all_data
7 years ago
Yan Chunwei
6de0a18d5f
Refine/text classification support data ( #13256 )
7 years ago
Tao Luo
11b22883be
Merge pull request #12738 from luotao1/anakin_cpu
...
support anakin for only-cpu environment
7 years ago
Xin Pan
883bbe1958
Merge pull request #13238 from panyx0718/clean
...
Clean
7 years ago
luotao1
4c283d87ef
Merge branch 'develop' into all_data
7 years ago
luotao1
61cae53e79
support anakin for only-cpu environment
7 years ago
Yan Chunwei
225ecee5ea
refine/text classification tester ( #13244 )
7 years ago
tensor-tang
4d774953c6
enable fc gru fuse pass
7 years ago
tensor-tang
09016df8df
make analyzer run
7 years ago
luotao1
fa5036aac8
add test_all_data in test_analyzer_ner
7 years ago
Xin Pan
18442a6088
rename pass.h/.cc to analysis_pass
7 years ago
tensor-tang
12b483c0db
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
luotao1
b4fa3dbda3
unify PrintTime of analysis unit-test
7 years ago
luotao1
f615ba2f8f
update the multi-thread unit-tests
7 years ago
luotao1
35cff5e00d
Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei
9df2d8b5ba
test/add text-classification test ( #13081 )
7 years ago
luotao1
1a373fbb0d
add result check for multi-thread UT
7 years ago
luotao1
2dc23ffaa8
Merge branch 'develop' into multi-thread2
7 years ago
luotao1
8cb92fb18e
speedup the download of inference_demo
7 years ago
luotao1
39ed148714
fix multi-thread hang temporary
7 years ago
luotao1
459d4cc811
Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo
907696709f
Merge pull request #13133 from luotao1/library
...
add static and shared Library for analysis and IR
7 years ago
Jiabin Yang
d091dd02a0
fix mac compile error 0903 ( #13184 )
7 years ago
Yan Chunwei
796c87d563
bugfix/fusion lstm ( #13185 )
7 years ago
luotao1
ae44efffee
fix ci error
7 years ago
tensor-tang
d83187dba8
enable lac analysis test
7 years ago
luotao1
6f18217386
fix codestyle
7 years ago
luotao1
d7b4965785
auto generate paddle_inference_pass.h
7 years ago
dzhwinter
379b471ee2
squash commit
7 years ago
luotao1
0639a32477
Merge branch 'develop' into library
7 years ago
luotao1
f507e5c1f2
update multi-threads UT
7 years ago
luotao1
37d1a6685c
Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo
737a033ed0
Merge pull request #13140 from dzhwinter/windows/inference_api
...
modify the timer
7 years ago
dzhwinter
b4d43030ff
windows inference fix ( #13141 )
...
* windows inference fix
* windows inference fix
7 years ago
Yan Chunwei
597b73053d
refine/fc lstm fusion link ( #13158 )
7 years ago
tensor-tang
1e7ccf9f45
Merge pull request #13126 from tensor-tang/fea/infer/ut/lac-new
...
add lac infer test
7 years ago
dzhwinter
a0aa2ec8b5
build compile
7 years ago
dzhwinter
75681c0a79
switch to 9.2
7 years ago
dzhwinter
bfa9b268de
fix elementwise
7 years ago
dzhwinter
dbe90cc0f6
merge develop branch
7 years ago
Jiabin Yang
6ba2b22279
Merge pull request #13096 from JiabinYang/fix_mac
...
Fix Mac compile error
7 years ago
luotao1
fb077c17e6
add shared library for analysis
7 years ago
tensor-tang
9f02497b23
follow comment
7 years ago
tensor-tang
713e86486d
bugfix ditu test
7 years ago
tensor-tang
63b38ca40b
add lac test
7 years ago
tensor-tang
663a11ac7c
bugfix and follow comment
7 years ago
nhzlx
5ec2fb0c93
add flexibledfs for find path between two nodes
7 years ago
luotao1
f3b7e18be9
add static library for analysis
7 years ago
luotao1
0fbe0a7a28
add multi-thread ut for ditu-rnn
7 years ago
luotao1
b3cd2ae88b
Merge branch 'develop' into ner_ut2
7 years ago
Yan Chunwei
af15f6f038
fea/refine fuse ( #13076 )
7 years ago
luotao1
07cb64adc0
add unit-test for chinese_ner
7 years ago
Xin Pan
823c4f87be
Merge pull request #13058 from panyx0718/infer
...
use fast RunPrepareContext for inference
7 years ago
Jiabin Yang
cceffca6bf
Update api_impl.cc
7 years ago
Jiabin Yang
5d5b70ad79
Update CMakeLists.txt
7 years ago
JiabinYang
7c7d3d6172
Fix mac
7 years ago
Yan Chunwei
cfa6bbb755
move nodeid from graph to node ( #13065 )
7 years ago
Xin Pan
5adf118ab5
polish
7 years ago
Xin Pan
c558f059ad
fix
7 years ago
Xin Pan
4794d9cf70
use fast RunPrepareContext for inference
7 years ago
Yan Chunwei
902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand ( #13006 )
7 years ago
dzhwinter
b78394ea57
done
7 years ago
Xin Pan
2bb15f437c
Merge pull request #12791 from panyx0718/ir3
...
graph to program pass
7 years ago
dzhwinter
b74af56bbc
cpu compile is done
7 years ago
Xin Pan
880cb8c4c3
clean
7 years ago
Xin Pan
1a67061fee
graph to program pass
...
fix a few other things
7 years ago
dzhwinter
78aab05b71
fix more op errors
7 years ago
nhzlx
478eeabdd4
refine uttest of api_tensorrt_subgraph_engine
7 years ago
nhzlx
791aa7f49d
merge develop
7 years ago
dzhwinter
7dceb8a080
check some operators
7 years ago
dzhwinter
4fcc293617
memory module ( #12931 )
...
* memory module
* "fix ci"
7 years ago
dzhwinter
488a2dd2e8
with ir node
7 years ago
nhzlx
3de4556659
concat op && map cnn model support
7 years ago
dzhwinter
89f95ea25e
merge develop branch
7 years ago
dzhwinter
34f8c9b6f5
windows port
7 years ago
luotao1
9c7fde45a7
enhance test_analyzer to profile ditu inference demo
7 years ago
Tao Luo
decda738b0
fea/anakin compile with demo ( #12772 )
...
* anakin support x86
* fix code style
* add anakin ditu cnn demo
* add timer
* add rnn
* fix inference_anakin_cnn/rnn_test compile error
* make anakin_rnn_tester run
* add anakin_enable_op_time option
* update api/CMakeLists.txt
* enlarge the max_batch_size in anakin.config
* update with comments
7 years ago
Yan Chunwei
9ee698e605
enhance/ditu rnn with fc fuse ( #12831 )
...
* make fc fuse work with ditu rnn
* add ditu rnn data download to CMAKE
7 years ago
nhzlx
c999895e93
merge develop
7 years ago
nhzlx
276950291a
1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei
896a37b6e3
fea/link ir to inference analysis and fc fuse support ( #12789 )
...
* link IR graph to analysis graph
* add clean code and update
* add infer_clean_pass
* add ir_pass_manager
* support fc fuse executation
* fix ir circle
7 years ago
nhzlx
ff052c0e6f
merge develop
7 years ago
nhzlx
c6a5c4b0c0
add comments for execute in ut_helper
7 years ago
tangwei12
99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
...
fix some teeny mistakes
7 years ago
luotao1
808e5b1748
fix tensorrt compiler bug
7 years ago
nhzlx
1bf9d9e90c
fix comments
7 years ago
tangwei12
cfb12f09bf
fix some teeny mistakes
7 years ago
Tao Luo
7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
...
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
nhzlx
144b20c160
add batch norm op converter
7 years ago
nhzlx
14311bb094
merge develop
7 years ago
Zhaolong Xing
e5674f6dde
Merge pull request #12753 from NHZlX/add_benchmark
...
modify tensorrt engine op from cpu mode to gpu
7 years ago
Zhaolong Xing
310708726b
Merge pull request #12761 from NHZlX/global_pooling_trt
...
Add support for global pooling for trt
7 years ago
nhzlx
1e92baf746
fix comments
7 years ago
nhzlx
ce7f361a80
fix comments
7 years ago
nhzlx
df9cbabcee
add pool2d test for global_pooling true
7 years ago
Yan Chunwei
6fe5547db7
switch NodeAttr to boost::varient ( #12539 )
7 years ago
nhzlx
133ec69625
add batch norm trt converter
7 years ago
luotao1
413bf9d494
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter
f36818d532
"windows testing easier" ( #12739 )
7 years ago
nhzlx
2bdd20be22
add support for global pooling for trt
7 years ago
nhzlx
f55e8901c8
merge develop
7 years ago
nhzlx
1600ba86f6
1. change tensorrt op from cpu to gpu
7 years ago
luotao1
9f3789944c
use latest anakin commit
7 years ago
Yan Chunwei
e765dead86
add profiler to fluid inference ( #12707 )
7 years ago
Zhaolong Xing
83c85f34e8
Merge pull request #12598 from NHZlX/add_tensorrt_softmax
...
Add tensorrt softmax
7 years ago
Tao Luo
1e1974c998
Merge pull request #12563 from luotao1/anakin_test
...
* make inference_anakin_test SERIAL
* add anakin compiler from github source code
* fix inference_lib_dist error
* add comment
* update anakin.cmake
* fix anakin-NOTFOUND compiler error
* modify the anakin_model download dir
7 years ago
Wu Yi
8b77448d5f
hide misc APIs ( #12540 )
...
* hide misc APIs
* update
* fix transformer test
* update API.spec
7 years ago
luotao1
a222d336ca
modify the anakin_model download dir
7 years ago
luotao1
22bc328951
fix anakin-NOTFOUND compiler error
7 years ago
luotao1
b2367f3661
update anakin.cmake
7 years ago
xzl
29ad9794bb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1
f4bcee1d6f
Merge branch 'develop' into anakin_test
7 years ago
luotao1
94042ccd2d
add comment
7 years ago
Yan Chunwei
7555cfe33a
fix inference double free bug ( #12613 )
7 years ago
Luo Tao
64c0ba288a
fix inference_lib_dist error
7 years ago
nhzlx
641f32da8c
add softmax op converter
7 years ago
nhzlx
943950c190
refine graph draw
7 years ago
nhzlx
7a019cd608
merge develop
7 years ago
nhzlx
e823ce68bb
filter redundant output
7 years ago
nhzlx
c69ae865db
fix comments
7 years ago
Luo Tao
e8aa6d1283
add anakin compiler from github source code
7 years ago
nhzlx
e8954a36f5
merge develop
7 years ago
nhzlx
32a9e050bc
mapping the variable name inside the subgraph
7 years ago
Luo Tao
cf74473244
make inference_anakin_test SERIAL
7 years ago
superjomn
ebe1920626
add comment
7 years ago
superjomn
3c5e15de03
disable anakin test
7 years ago
Zhaolong Xing
d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
...
Add tensorrt elementwise add
7 years ago
nhzlx
d50f776b27
merge develop
7 years ago
nhzlx
64a08f840f
increase the test batch
7 years ago
nhzlx
c7e6a11bc1
merge develop
7 years ago
nhzlx
0015df1b12
modify op converter for conv2d
7 years ago
gongweibao
819ac3df0a
Modify style ( #12465 )
7 years ago
cuichaowen
046de2acdb
Improve anakin feature ( #11961 )
7 years ago
nhzlx
c13efe02d9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx
a5c96af33c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yan Chunwei
dcfbc6a661
inference analyzer as bin ( #12450 )
7 years ago
Yan Chunwei
31a2c87688
fea/lightly support lod ( #12451 )
7 years ago
nhzlx
5fcdd81da7
tiny modify
7 years ago
nhzlx
f05c7fb8ae
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx
6f6d552790
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Superjomn
4d2405d851
inference analysis support ssa
7 years ago
minqiyang
e96fef2cf7
Fix inference api impl deps
7 years ago
Luo Tao
062556f938
Merge branch 'develop' into unify
7 years ago
nhzlx
98948b975e
wrong added file
7 years ago
nhzlx
830aa12c1a
add elementwise init code
7 years ago
Zhaolong Xing
85c4912755
Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
...
Add tensorrt pooling converter
7 years ago
tensor-tang
9788e5ab87
add flags to control num_threads
7 years ago
nhzlx
4f71a3b12b
fix a bug
7 years ago
Luo Tao
83e59257d0
fix manylinux1 Failed to publish artifacts
7 years ago
nhzlx
c8adfb3451
add paddle_enforce
7 years ago
nhzlx
5533400720
fix comments
7 years ago
Luo Tao
5ba4337698
unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx
01566fb61b
1. support mutil batch utest 2. support pool op
7 years ago
nhzlx
990741aa85
add weight's dim assert
7 years ago
nhzlx
21890ca0cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
tensor-tang
7b63b85086
fix mismatch of infer api ( #12342 )
7 years ago
nhzlx
fc41eb40b1
add conv2d trt converter
7 years ago
nhzlx
4d49e61ab8
fix comments
7 years ago
nhzlx
bcd67bdd71
add assert for GetOutput
7 years ago
nhzlx
7382f98600
1. set ut batch > 1 2. readd the mul op(utest will be added later)
7 years ago
nhzlx
bd64979fe9
the argument should not be a const one
7 years ago
nhzlx
f42ea48996
deal with conflict
7 years ago
nhzlx
82527696e7
1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface
7 years ago
nhzlx
2372daff1d
there is no batchsize concept in tensorrt's tensor
7 years ago
Yan Chunwei
9e0a94f069
inference-api code clean ( #12274 )
7 years ago
Yan Chunwei
b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger ( #12266 )
7 years ago
qiaolongfei
0e30c9d6fb
fix mac build
7 years ago
Tao Luo
3694fd5c4a
Merge pull request #12109 from emailweixu/cpu_only
...
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
nhzlx
d384d39a68
add Temporarily add code with bug
7 years ago
Tao Luo
a8f0931428
Merge pull request #12229 from luotao1/api_doc
...
fix dead link in high_level_api.md
7 years ago
Luo Tao
43c1481f88
fix dead link in high_level_api.md
7 years ago
tensor-tang
d4691cedec
fix mac build
7 years ago
Luo Tao
2e68abf47c
rename api.h to paddle_inference_api.h, put demo_ci in fluid_install_dir
7 years ago
Luo Tao
44b6a5f308
fix inference_lib.cmake and make demo_ci pass
7 years ago
Luo Tao
af1e54acd8
fix compiler error after move
7 years ago
Luo Tao
369dfb3d0f
move contrib/inference to paddle/fluid/inference/api
7 years ago
Wei Xu
264e8305b0
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Luo Tao
b1a1124d36
fix compiler and run error in static library
7 years ago
Luo Tao
24ced1d0b9
add independent demo for test static fluid library
7 years ago
Yan Chunwei
0cefb9461f
add topological sortting ( #12059 )
7 years ago
tensor-tang
f92024470b
Merge pull request #12052 from tensor-tang/refine/infer/api/static
...
inference api static lib symbol hidden
7 years ago
tensor-tang
2238ea56de
paddle fluid static lib symbol hidden
7 years ago
Luo Tao
fc3e7341fc
fix compile warning in inference related codes
7 years ago
tensor-tang
3df99e72ab
Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
...
fix conflicts
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
Yan Chunwei
4f555909ce
analysis/code clean ( #11964 )
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparation bugs
7 years ago
Xin Pan
a9086bf320
also move a few other dir to legacy/
7 years ago
Yan Chunwei
5e2656449c
add inference-analysis doc ( #11813 )
7 years ago
gongweibao
c2165ffa7b
Fix codesytle ( #11836 )
7 years ago
fengjiayi
aab47cc08d
fix Mac compile errors ( #11829 )
7 years ago
superjomn
ba99bc2384
update
7 years ago
superjomn
f1224945ba
fix analysis compile bug
7 years ago
Yan Chunwei
5082642bdb
feature/analysis to support sub-graph for TRT engine ( #11538 )
7 years ago
tensor-tang
e3a96300bb
move SetNumThreads to platform
7 years ago
tensor-tang
1f09ddf806
Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
gongweibao
19958eeb71
fix ( #11590 )
7 years ago
tensor-tang
f503f12925
enable dynamic load mklml lib on fluid
7 years ago
gongweibao
4dda54aa5a
Fix unlikely ( #11537 )
7 years ago
Yan Chunwei
d734595978
Feature/pass manager ( #11440 )
7 years ago
tensor-tang
609dccfb55
Merge pull request #11395 from tensor-tang/fix
...
remove mkldnn flag from gtest strdup for cpu
7 years ago
tensor-tang
0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
...
Refine test and scope lock
7 years ago
tensor-tang
6c1cf60950
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yan Chunwei
5fd142c3fd
bugfix/trt engine op ( #11487 )
7 years ago
tensor-tang
c453573286
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang
9169b3b802
Merge pull request #10789 from Xreki/core_fix_openblas_threads
...
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
tensor-tang
6a32f19865
fix unknown use_mkldnn
7 years ago
gongweibao
d9de6b8621
Add brpc surpport. ( #11263 )
7 years ago
Luo Tao
79d555b9f2
Merge branch 'develop' into mkldnn
7 years ago
Luo Tao
c6d230e03e
add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
Yan Chunwei
145aaa4b49
loose threshold of TRT for CI in different model ( #11305 )
7 years ago
tensor-tang
bfd42683ca
Merge remote-tracking branch 'ups/develop' into refine
7 years ago
Luo Tao
f6fb51a164
add test_mode in trt/activation_op
7 years ago
Luo Tao
c73977af03
Merge branch 'develop' into trt
7 years ago
tensor-tang
9cf1f351d2
refine nlp test
7 years ago
tensor-tang
3a294042c8
Merge pull request #11233 from tensor-tang/multithreads
...
Fix abort issue in cpu multi-threads
7 years ago
Yan Chunwei
4f95bc9463
feature/trt engine op test ( #11182 )
7 years ago
tensor-tang
944bdee738
Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
tensor-tang
6840953305
refine nlp multi-threads
7 years ago
Luo Tao
e116129f03
rewrite unittest of trt_activation_op
7 years ago
Yan Chunwei
df87e63baa
add dfg graphviz pass ( #11211 )
7 years ago
tensor-tang
6ae7cbe252
follow comments
7 years ago
tensor-tang
99d00cce93
follow comment: refine where time started
7 years ago
tensor-tang
38f8182df6
work around with dummy test
7 years ago
tensor-tang
eaeb76c419
add some comments
7 years ago
tensor-tang
9c687a9789
Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang
7e9f0790e0
fix scope in thread
7 years ago
Yan Chunwei
9503dbb173
fix compile error ( #11119 )
7 years ago
tensor-tang
3206bcd929
refine log and add QPS
7 years ago
tensor-tang
06adccf6eb
Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang
4a24c238c1
refine code
7 years ago
Yan Chunwei
0c0c5df4cb
feature/add TRT fc converter ( #11043 )
7 years ago
tensor-tang
a4822ed897
add thread setting
7 years ago
fengjiayi
d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
...
Refine ProgramDesc copy
7 years ago
tensor-tang
5387562576
add multi-thread test
7 years ago
fengjiayi
31f0533c5d
fix compile errors
7 years ago
fengjiayi
04ccbed5b8
fix a compile error
7 years ago
gongweibao
4fb7cc7f5e
Move sync_mode device ctx from grpc server ( #10881 )
7 years ago
tensor-tang
733718c3e7
remove the ugly test
7 years ago
Yan Chunwei
97b7502772
inference API little fix ( #11069 )
7 years ago
tensor-tang
708bec2e56
add test
7 years ago
tensor-tang
d13dd3b6a7
revert profiling
7 years ago
tensor-tang
4d11c8e9c6
retest single thread
7 years ago
Yan Chunwei
211e131525
feature/tensorrt engine op ( #11001 )
7 years ago
tensor-tang
77599415ba
enable read dataset
7 years ago
tensor-tang
c00843f4e8
enable multi-threads
7 years ago
Yancey
d92a75bee4
Merge pull request #10550 from Yancey1989/overlap_send_op
...
overlap send ops and backward ops
7 years ago
Yan Chunwei
f5fc9c3bc1
feature/mul converter ( #10841 )
7 years ago
Yancey1989
60d827a8b9
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989
20c24c05aa
singleton rpc_client
7 years ago
Yancey1989
ad6c0142c4
clean up codes
7 years ago
Xin Pan
2f0df56422
add inference interface impl
7 years ago
tensor-tang
400f5e7c3c
add threads test
7 years ago
tensor-tang
ce20dfa236
enable more choices
7 years ago
tensor-tang
602e28bf1c
use the actual data
7 years ago
Yancey1989
0aa6f9e934
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yu Yang
d736fb8047
Disable unstable test ( #10920 )
7 years ago
tensor-tang
1b8b253ec1
Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang
98fb8e58fd
test infer nlp
7 years ago
Yan Chunwei
b1d446856c
fix inference api ( #10867 )
7 years ago
yuyang18
fcbf19bf93
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yan Chunwei
1153144fbb
Inference analysis/init data flow graph analysis ( #10776 )
...
Add the demo of subgraph splitter
7 years ago
yuyang18
1b69c25c92
Merge branch 'feature/sequnce_run_tests' into feature/refine_parallel_executor
7 years ago
yuyang18
91007fe974
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18
1426d794ff
Force some unittests serial
7 years ago
tensor-tang
406c1dd143
Merge pull request #10701 from tensor-tang/usemkldnn
...
enable MKLDNN inference test
7 years ago
Yancey1989
952fa04009
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Liu Yiqun
50ba205d79
Merge branch 'develop' into core_fix_openblas_threads
7 years ago
Liu Yiqun
39eb871ddf
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
yuyang18
6db9c3c7d6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
daminglu
ae1990731d
Test word2vec ( #10779 )
7 years ago
Xin Pan
8e3e65ff93
Merge pull request #10526 from panyx0718/infer_profile2
...
allow inference test to generate timeline
7 years ago
Wu Yi
ebc7303990
listen_and_serv use local scope ( #10663 )
...
* listen_and_serv use localscope
* fix ut
7 years ago
yuyang18
ceb150e9fa
Merge remote-tracking branch 'yx/fix_bce_cdn_link' into feature/refine_parallel_executor
7 years ago
Yancey1989
274df85ca6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
yuyang18
8a42c4749e
Disable tests
7 years ago
Kexin Zhao
eec1ac8638
fix warning
7 years ago
tensor-tang
661826a70a
enable MKLDNN inference test
7 years ago
Yancey1989
00efc4ccfa
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
yuyang18
7c777dd549
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exec_strategy
7 years ago
Luo Tao
1992f70920
Merge branch 'develop' into refine_relu_test
7 years ago
Luo Tao
be41c2ffa6
Merge branch 'develop' into refine_relu_test
7 years ago
Yiqun Liu
b7026f79a9
Fix a bug related to dispensable inputs and refine the inference unittest ( #10527 )
...
* Fix a bug related to dispensable inputs and refine the inference unittest.
* Fix the use of dispensable inputs in reshape_op.
* Polish the enforce statements.
* Fix an English writing typo.
7 years ago
Yan Chunwei
674bd839cd
OpConverter change BlockDesc to proto::BlockDesc ( #10623 )
7 years ago
Luo Tao
4f5f0be769
use the latest buffer to update the convert
7 years ago
Yan Chunwei
de81ccb5cb
feature/analysis node representation ( #10522 )
7 years ago
Yancey1989
b35ea1a4d6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Luo Tao
a3ba264c47
Merge branch 'develop' into refine_relu_test
7 years ago
Lei Wang
f3ffec23cf
CI: exit when fail any step. ( #10579 )
7 years ago
Tao Luo
28de0ea404
Merge pull request #10545 from luotao1/fix_tensorrt_engine
...
fix tensorrt_engine compiler error
7 years ago
Luo Tao
4a5ebb6806
fix tensorrt_engine compiler error
7 years ago
Xin Pan
dcb77813aa
Revert "CI: rerun failed tests. ( #10536 )"
...
This reverts commit 0446220e01
.
Reason:
Rerun failed test hides flaky test.
Flaky test can be bugs, for example, race condition.
Test shouldn't be flaky, if a test is flaky, it should be fixed.
7 years ago
Lei Wang
0446220e01
CI: rerun failed tests. ( #10536 )
...
* CI: rerun failed tests.
* fix check style error.
7 years ago
Tao Luo
303277f002
Merge pull request #10437 from panyx0718/infer2
...
Add a multi-dim add layer test.
7 years ago
Xin Pan
f093a7b332
allow inference test to generate timeline
...
generate timeline file
PYTHONPATH=/paddle/dev/my/build2/python/ python /paddle/dev/my/Paddle2/tools/timeline.py --profile_path=train=run_inference_profiler --timeline_path=/tmp/timeline_infer
visualize it
open url chrome://tracing
7 years ago
Luo Tao
40b8b634f9
Merge branch 'develop' into refine_relu_test
7 years ago
Yan Chunwei
819038113e
Feature/engine refactor ( #10497 )
...
* init refactor
* init
* update some comment
* fix build
* fix errorrr
* fix bug
* fix comment
* update
7 years ago
Yan Chunwei
6eeb819538
feature/inference analysis dot ( #10494 )
7 years ago
Xin Pan
6728d96d89
follow comments
7 years ago
Xin Pan
3de43a87ef
Add a multi-dim add layer test.
...
We need to figure out if tensorrt
use row-major or col-major for tensor
layerout inorder to do conversion.
7 years ago
Luo Tao
0ae97e8a5b
Merge branch 'develop' into refine_relu_test
7 years ago
chengduoZH
e00c1ee10f
fix split var test
7 years ago
Luo Tao
89dcb0bd15
refine EngineIOConverter, and use io_convert in test_trt_activation_op
7 years ago
Tao Luo
3356fb3c6e
Merge pull request #10461 from luotao1/refine_convert
...
refine io_convert and op_convert
7 years ago
Luo Tao
53b401d589
refine io_convert and op_convert
7 years ago
Xin Pan
0c518888fa
Merge pull request #10430 from panyx0718/infer
...
Add comment to explain how to run inference test
7 years ago
Yan Chunwei
2a2c83b9e6
feature/convert tensorrt io ( #10440 )
...
* init
* init
* add ut
* split singleton from base class
* add singleton
* ad singleton
7 years ago
Xin Pan
9fccf46270
reword comments
7 years ago
Xin Pan
cdd52f3a30
Add comment to explain how to run inference test
7 years ago
Tao Luo
4646c0f35d
Merge pull request #10144 from luotao1/tr_convert_init
...
tensorrt convert init
7 years ago
Kexin Zhao
7a86069422
Add float16 demo code and put float16 work in contrib/float16 folder ( #10331 )
...
* add test float16 inference accuracy example
* complete the test
* clean code
* add argument parse and refine tests
* add shell script
* add float16 benchmark code
* refine code
* prepare for contrib/float16
* put things in contrib float16 folder
* update benchmark result
* further update benchmark report
* add float16 inference report
* update report
7 years ago
Luo Tao
beb1245560
add relu converter and unit-test
7 years ago
Abhinav Arora
55f0d84029
Fix Cpplint Issues in fluid/inference/tensorrt/ ( #10318 )
...
* Fix CPPLint issues in fluid/inference/tensorrt/
* Fix compile errors
7 years ago
Luo Tao
9945265f09
Merge branch 'develop' into tr_convert_init
7 years ago
whs
2f9fa9b721
Merge pull request #10167 from wanghaoshuang/fluid_init
...
Add init interface for customize devices.
7 years ago
Luo Tao
6f6f330423
update the register method
7 years ago
Kexin Zhao
0ecc6fa8f3
Add float16 transpiler and image classification example ( #10109 )
...
* add float16 transpiler
* fix feed fetch target names mismatch
* fix cast op input change issue
* fix program desc flush error
* fix inconsistent var names in block desc bug
* code clean up
* add float16 infernce C++ example and fix prune bug
7 years ago
Abhinav Arora
83b1a8f6bf
Pending more CPPLint errors in fluid/operators/math ( #10243 )
...
* Fix CPPLint issue in test_engine
* Fix CPPLint errors in operators/math
* Fix compilation
7 years ago
Abhinav Arora
f457d5da06
Fix more CPPLint errors ( #10218 )
...
* Fix more CPPLint issues
* Fix more CPPLint issues
* Fix more CPPLint issues
* Fix CPPLint issues in operators/math and operators/reader
7 years ago
wanghaoshuang
848fb00215
Fix comments.
7 years ago
Luo Tao
326221acec
Merge branch 'develop' into tr_convert_init
7 years ago
Abhinav Arora
4c8ff72615
Fix CPPLint errors with rxecutor ( #10212 )
7 years ago
Luo Tao
c4e3010b14
use template to do registry
7 years ago
Yan Chunwei
2d57158e2b
fea/init tensorrt engine ( #10003 )
7 years ago
Luo Tao
d599de5c41
auto registray op converters
7 years ago
Luo Tao
48473dddf4
Merge branch 'develop' into tr_convert_init
7 years ago
wanghaoshuang
a4b452a2d6
Remove initP2P(bool) and init function in framework.
7 years ago
wanghaoshuang
e4708565f4
Fix cpplint format.
7 years ago
wanghaoshuang
48b7b54321
Refine code.
7 years ago
wanghaoshuang
1bdea0a8d2
Add init interface for customize devices.
7 years ago
Luo Tao
42febfa928
tensorrt convert init
7 years ago
Luo Tao
71f51ff64a
refine tensorrt cmake and dockerfile
7 years ago
Abhinav Arora
744ebcfa18
Fix CPPlint issues in fluid/inference ( #10075 )
7 years ago
Luo Tao
d4682247e1
auto find tensorrt library
7 years ago
Yan Chunwei
186659798f
add tensorrt build support( #9891 )
7 years ago
Liu Yiqun
449bdde58a
Correct some typos.
7 years ago
Liu Yiqun
2762959f79
Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun
339be6254e
Refine the order of arguments.
7 years ago
Yiqun Liu
e90e7ab237
Remove the use of ARCHIVE_START/END ( #9844 )
...
* Add USE_OP of all operators and kernels and remove ARCHIVE_START/END in CMakeLists.txt of inference unittests.
* Remove ARCHIVE_START/END when linking inference shared library.
* Disable some fluid related cmake operations for cross-compiling.
7 years ago
Liu Yiqun
bf485999f4
Merge branch 'develop' into core_inference_prepare
7 years ago
Yu Yang
5ceea265bb
Disable unstable unittest
7 years ago
Liu Yiqun
720f6196ea
Change the seed and make it not fixed for multi-threads cases.
7 years ago
Liu Yiqun
e24172eb54
Simplify the inference unittest of fit a line and add some comment.
7 years ago
Liu Yiqun
bdb21f6bc3
Merge branch 'develop' into core_inference_multi_thread
7 years ago
Liu Yiqun
93e9905482
Add unittest for calling CreateVariables manually.
7 years ago
Liu Yiqun
a9855e4afd
Merge branch 'develop' into core_inference_fix_run
7 years ago
Yi Wang
f31a0da363
Restore inference CMakeLists.txt
7 years ago
Yi Wang
e831bd43b0
Add ARCHIVE_START/END back
7 years ago
Liu Yiqun
90f3a421c7
Change the argument's type from reference to pointer.
7 years ago
Yi Wang
080e442671
Update
7 years ago
Yi Wang
0564e74fe5
Update
7 years ago
Yi Wang
e9ba79c880
Update
7 years ago
Liu Yiqun
7b40f7ce4a
Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun
208fcf5225
Merge branch 'develop' into core_inference_multi_thread
7 years ago
Yi Wang
eebb205324
Update CMakeLists
7 years ago
Yi Wang
5bb7d59e3a
Fix cpplint errors with paddle/fluid/inference ( #9702 )
...
* Correct inference
* Update
* Update
7 years ago
Yi Wang
797a7184ac
Unify Fluid code to Google C++ style ( #9685 )
7 years ago
Lei Wang
09b4a1a361
Build: generate all the build related files into one directory. ( #9512 )
7 years ago
Liu Yiqun
27f553b377
Add the check of CPU results and GPU results in multi-thread unittest.
7 years ago
Liu Yiqun
9cba062252
Add inferface to change the feed/fetch_holder_name.
7 years ago
Liu Yiqun
fbd3604cad
Split Executor.Run to Executor.Prepare and Executor.RunPreparedContext for inference.
7 years ago
Liu Yiqun
5419da6e7a
Fix bug caused by block_id.
7 years ago
Liu Yiqun
0968753454
Enable the test of not creating variables every time.
7 years ago
Liu Yiqun
1885818016
Add multi-thread inference example.
7 years ago
Liu Yiqun
961151f17a
Disable the link flags on Mac.
7 years ago
Liu Yiqun
6c614814da
Limit the symbol table of fluid shared library.
7 years ago
Liu Yiqun
a8e8507767
Refine the profile codes for inference.
7 years ago
QI JUN
47ca1814f3
fix mac build error ( #8856 )
7 years ago
Yiqun Liu
a032f56f7c
Add profiling information for inference example ( #8748 )
...
* Add profiling information for inference example, recognize digits.
* Refine the profiling method.
* Correct the use of RecordEvent and simplify recognize_digits.
7 years ago
Tao Luo
6f50dee4d5
compile and install the static library of fluid inference ( #7827 )
...
* compile and install the static library of fluid inference
* fix dynload_cuda not in CPU mode
* update shared library and adjust the deploy of openblas
* adjust the deploy of openblas
* * auto add all fluid modules for static library
* use libprotobuf.a instead of libprotobuf-lite.a for profiler
* use set_property to set the global varible instead of ENV
* add gpu depends of fluid modules, auto add inference_lib_dist depends
* change the condition of openblas_lib, and fix a typo
7 years ago
Liu Yiqun
efb6ba3531
Merge branch 'develop' into core_refine_inference
7 years ago
Luo Tao
bb36084949
fix error directory of fluid inference unitest
7 years ago
Siddharth Goyal
a040239d3a
Add conv test case for inference-recognize digits ( #8466 )
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
Liu Yiqun
2d74b5f9ba
Refine the Python API load/save_inference_model.
7 years ago
Liu Yiqun
b44917d09b
Implement IsPersistable() in c++.
7 years ago
Liu Yiqun
f95e05a388
Refine the inference unittests.
7 years ago
Liu Yiqun
899ba0d05a
Merge branch 'develop' into core_refine_inference
7 years ago
Liu Yiqun
c796e013c6
Refine the inference unittests.
7 years ago
Yu Yang
6f625f9c2f
Disable unstable unittest
7 years ago
kexinzhao
e800597bcf
Fix include path in inference test codes ( #8349 )
...
* fix absolute include path
* Remove test_helper.h in old location
* update include path
7 years ago
Yi Wang
35e61b3e7e
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into move_to_fluid
7 years ago
Yi Wang
fc374821dd
Correct #include path
7 years ago
Yi Wang
90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago