Sylwester Fraczek
b5f617fa9b
make mobilenet test reuse resnet50 test
7 years ago
Sylwester Fraczek
1987d45e75
add comment for depthwise pass
7 years ago
minqiyang
87450b9ad4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
peizhilin
4ffa92d4f0
Merge branch 'develop' into windows/build
7 years ago
Tao Luo
813e54efbd
Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
...
Revert "cherry picked windows patches."
7 years ago
minqiyang
3db9fad764
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
minqiyang
3da43dcae2
Because anakin do NOT use glog, so we revert anakin related change
...
test=develop
7 years ago
Tao Luo
387610aae1
Merge pull request #14325 from luotao1/fix_test_analysis_predictor
...
fix test_analysis_predictor
7 years ago
peizhilin
45125ba538
fix share library issue
7 years ago
Zhaolong Xing
ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
minqiyang
fcc0452c8b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
Tao Luo
381bea0a16
fix test_analysis_predictor
...
test=develop
7 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
7 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
Wojciech Uss
7fd640b882
added additional call to graph_viz_pass
...
test=develop
7 years ago
dzhwinter
234a1d9248
Merge remote-tracking branch 'origin/develop' into windows/debug
...
test=develop
7 years ago
Sylwester Fraczek
f395075efc
rebased and stuff broke
7 years ago
Sylwester Fraczek
a60957f386
addd test_analyzer_mobilenet
7 years ago
Xin Pan
80132933b7
Merge pull request #14281 from luotao1/face
...
refine analysis_resnet50_tester
7 years ago
Tao Luo
eea36739cc
refine test_helper.h
...
test=develop
7 years ago
Tao Luo
2b791f1f63
unify analyzer_face_tester to analyzer_resnet50_tester
...
test=develop
7 years ago
Tao Luo
1ead9318d5
remove unused code in test_helper.h to pass ci
...
test=develop
7 years ago
dzhwinter
2835e04409
merge develop branch. test=develop
7 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
7 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
7 years ago
Tao Luo
7a2887d212
add analyzer_face_tester
...
test=develop
7 years ago
Tao Luo
2ec65ae0db
download face_model in CMakeLists.txt
...
test=develop
7 years ago
Tao Luo
2f9a5a2e0a
add analyzer_face_tester
7 years ago
nhzlx
5700fafd0f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
...
test=develop
7 years ago
nhzlx
86b99ac953
fix comments and fix bug
7 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
7 years ago
Zhen Wang
4dbc01841d
Nlp dam ( #14248 )
...
* add dam test
* update fuse_statis
* use separated dam model.
* Revert "use separated dam model."
This reverts commit 13e775c86f909b164b7cc1d35a8a24b964ec622e.
* test=develop
* modify the cmake file about infer test, test=develop.
* remove one comment, test=develop.
7 years ago
peizhilin
71d7980f69
fix build issue 1
7 years ago
peizhilin
9d67c1fb69
cpu build support
7 years ago
dzhwinter
60f70b174d
test=develop
7 years ago
Tao Luo
d2a56f7909
Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
...
add depthwise conv mkldnn pass
7 years ago
dzhwinter
cc02353d10
test=develop
7 years ago
dzhwinter
eb2f7ed21b
refine tests. test=develop
7 years ago
Xin Pan
08d22cf7e1
Merge pull request #14091 from panyx0718/fix2
...
add program check
7 years ago
Tao Luo
fe8f178582
fix word2vec related inference unit-tests ( #14203 )
7 years ago
dzhwinter
1ace55c8ee
merge develop branch
7 years ago
Yan Chunwei
06e508ab58
fix simple_on_word2vec random fail ( #14171 )
7 years ago
dzhwinter
316765839d
add back jit simd instructions. stage.
7 years ago
Sylwester Fraczek
4e2aaf01bc
add depthwise conv mkldnn pass
...
added depthwise conv mkldnn pass which for MKLDNN changes depthwise_conv operator to conv operator because for mkldnn this is the same api
test=develop
7 years ago
dzhwinter
bf2e4cb188
cleard. staged
7 years ago
Yan Chunwei
70ce6dcd67
fix api_impl ci error ( #14140 )
7 years ago
Xin Pan
a943134a97
fix a few more tests
...
test=develop
7 years ago
dzhwinter
ebfe5a02b3
merge develop branch
7 years ago
JiabinYang
7c45e77c41
test=develop
7 years ago
Xin Pan
aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
...
disable some tests
7 years ago
superjomn
5f7fda0b07
disable some tests
...
test=develop
7 years ago
Tao Luo
d3534d2b14
refine warning message
...
test=develop
7 years ago
Tao Luo
79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
...
fix test resnet50 multi-threading on mkldnn
7 years ago
Tao Luo
4928ff32a9
fix cmake warning when ON_INFER=false
...
test=develop
7 years ago
Yan Chunwei
ee74be3a49
[1.1] Bugfix/tensorarray ( #14044 )
7 years ago
Qiyang Min
33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
...
[1.1] Add hash_op implementation
7 years ago
minqiyang
7f7af5d412
Add xxhash deps to inference demo and trainer demo
...
test=develop
7 years ago
minqiyang
fe18adfbaa
Add fluid inference support
...
test=develop
7 years ago
dzhwinter
7141debe38
add cudnn back. staged.
7 years ago
Sylwester Fraczek
2098b42584
review fixes (Teamcity fails)
...
test=develop
7 years ago
dzhwinter
09409bad4d
staged. test speed=49ms in 1080.
7 years ago
Tao Luo
8ab953e37c
auto insert infer_graph_clean_pass as the default first one
...
test=develop
7 years ago
Tao Luo
ea2bdd192d
Merge branch 'develop' into remove_unused_code
7 years ago
Sylwester Fraczek
741cb33bd9
test multithreading
7 years ago
dzhwinter
468467f391
update real incnet tester
7 years ago
Zhaolong Xing
2256fae45d
Merge pull request #13938 from NHZlX/ocr_attention_support
...
ceil pool mode support for ocr attention model.
7 years ago
dzhwinter
abe8e207c4
clean demo_ci
7 years ago
dzhwinter
597d92179b
clean demo_ci
7 years ago
Tao Luo
f7bbcfa913
remove unused code in paddle_inference_api.h
...
test=develop
7 years ago
dzhwinter
c6dcffc61a
lb. add debug output
7 years ago
nhzlx
ae8f26072d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
...
test=develop
7 years ago
dzhwinter
607080e888
windows static library
7 years ago
Tao Luo
316bc9bfc9
fix typo and warning in analyzer_resnet50_test
...
test=develop
7 years ago
Tao Luo
42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
...
MKLDNN conv+elementwise_add fusion for residual connections in Resnet
7 years ago
Tomasz Patejko
aa35aaa1ab
MKLDNN conv + elementwise_add fusion: fixing formatting
...
test=develop
7 years ago
Tomasz Patejko
1676094697
MKLDNN conv + elementwise_add fusion: turn on residual connection pass when CAPI is used.
...
test=develop
7 years ago
tensor-tang
40f8456a4f
refine fuse pattern and attr
...
test=develop
7 years ago
tensor-tang
cbbacb2534
Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
...
test=develop
7 years ago
tensor-tang
603ba5e01d
add seqconv eltadd relu pass
7 years ago
Tao Luo
da722d6d9b
Merge pull request #13858 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Fuse Conv + Bias using Pass
7 years ago
Tao Luo
a4b48f70c1
Merge pull request #13997 from wojtuss/wojtuss/do-not-enable-mkldnn-twice
...
do not enable MKL-DNN twice
7 years ago
Michał Gallus
f9ca31811d
Remove use mkldnn from config in resnet50 test
...
test=develop
7 years ago
Michal Gallus
91e8fbac2f
Enable MKLDNN in Resnet50Tester
...
test=develop
7 years ago
Michal Gallus
582f59c190
Conv+Bias fuse
7 years ago
Wojciech Uss
e6f480ec44
add comment on the default first pass
7 years ago
Wojciech Uss
2cf258e381
remove redundant pass list
7 years ago
Wojciech Uss
5632019f0f
add MKL-DNN placement pass
...
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
7 years ago
tensor-tang
0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
...
Fix jit exp
7 years ago
Wojciech Uss
5083ec3a1b
do not enable MKL-DNN twice
...
After the MKL-DNN placement pass there is no need to enable MKL-DNN
in operators via executor
test=develop
7 years ago
Wojciech Uss
c3b70aece9
Add MKL-DNN placement pass ( #13958 )
...
* add MKL-DNN placement pass
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
* remove redundant pass list
* add comment on the default first pass
* fix test for conv+relu mkldnn fuse
7 years ago
Wojciech Uss
4a368a4901
add ifdef guard for MKL-DNN placement pass
...
test=develop
7 years ago
Tao Luo
305034f5b3
Merge pull request #13909 from luotao1/mkldnn_test
...
refine mkldnn test in analyzer_tests
7 years ago
superjomn
b77e4f4978
update
...
test=develop
7 years ago
Tao Luo
ef09862450
fix analyzer_rnn2_test
...
test=develop
7 years ago
Tao Luo
e5b4643ad8
add profile_mkldnn test
...
test=develop
7 years ago
Tao Luo
7d680be5a3
Merge branch 'develop' into mkldnn_test
7 years ago
Tao Luo
6a4e9230ed
Merge branch 'develop' into mkldnn_test
7 years ago
Tao Luo
b819684370
add compare_mkldnn test
...
test=develop
7 years ago
nhzlx
b970c6d5d0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
...
test=develop
7 years ago
nhzlx
32072d31b5
fix demo ci error on manylinux
7 years ago
Tao Luo
6ea9d1b595
add analysis_predictor in vis_demo
...
test=develop
7 years ago
Tao Luo
f444a7226e
Merge branch 'develop' into clean_inference_lib
7 years ago
Tao Luo
3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
...
clean CreatePaddlePredictor
7 years ago
Tao Luo
41eeb771e8
Merge branch 'develop' into clean_inference_lib
7 years ago
Tao Luo
b854d959a5
update with comments
7 years ago
nhzlx
2b5edfbc37
Add ceil model pooling for trt (ocr attention)
...
test=develop
7 years ago
Tao Luo
75bb0babef
Merge branch 'develop' into mkldnn_test
7 years ago
Yan Chunwei
6809238d97
fix analysis predictor profile ( #13896 )
7 years ago
nhzlx
9d98ca0424
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
...
test=develop
7 years ago
nhzlx
849a6874ad
fix googlenet bug with relu
7 years ago
Tao Luo
a35e7f4bae
adjust demo_ci with fluid_inference_install_dir
...
test=develop
7 years ago
tensor-tang
dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
...
fix default number of threads when inference with or without MKLDNN
7 years ago
Tao Luo
bd77460182
refine mkldnn test in analyzer_tests
...
test=develop
7 years ago
dzhwinter
e41a3fcd68
fix update to develop hang problem.
7 years ago
superjomn
1cfd2b51a7
update
...
test=develop
7 years ago
dzhwinter
804dd7da04
merge conflict. both linux and windows pass.
7 years ago
dzhwinter
962061f0a3
windows fix
7 years ago
superjomn
28459592cc
update
...
test=develop
7 years ago
Zhaolong Xing
7413fa458f
Merge pull request #13838 from NHZlX/add_trt_pad_op
...
Add trt pad op converter
7 years ago
superjomn
e2bd40ca82
update
...
test=develop
7 years ago
wanghaoshuang
3ae9645084
compile in linux
7 years ago
superjomn
049fcbe125
update
...
test=develop
7 years ago
superjomn
f5c0221c17
clean CreatePaddlePredictor
...
test=develop
7 years ago
nhzlx
320c78e16f
fix commets
...
test=develop
7 years ago
nhzlx
efa5bac7ad
fix demo_ci bug in vis_demo.cc
...
test=develop
7 years ago
tensor-tang
dc5a7b906d
fix default number of threads when inference with or without MKLDNN
...
test=develop
7 years ago
nhzlx
0cb88c34be
add op converter
7 years ago
Tao Luo
9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo
ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Pass: Fuse Conv + Bias
7 years ago
Tao Luo
16b1beb244
Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
...
Sfraczek/conv bn fuse pass
7 years ago
Michal Gallus
40b17be4b0
Pass: Fuse Conv + Bias
...
test=develop
7 years ago
nhzlx
d347ea689a
fix comments
7 years ago
nhzlx
f3af90d121
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
...
test=develop
7 years ago
nhzlx
f569095084
add tensorrt api lib to paddle_fluid
7 years ago
Tao Luo
84a55155ec
revert with_fast_math to ON
...
test=develop
7 years ago
dzhwinter
a46e30aa6d
enhance isinf/isnan in tensor util, avoid copy back to cpu ( #12688 )
...
* "avoid copy back to cpu"
* "add infinity support"
* "fix ci"
* "add cpu macro"
* rerun ci; test=develop
* "fix api"
test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
7 years ago
Sylwester Fraczek
78f98294c2
conv bn fuse pass
...
review fix
review from hshen14 fix
test=develop
fix error in broadcast and code cleanup
rename bias -> eltwise and added macro to shorten code
formatting
7 years ago
Tao Luo
28889caea5
disable EIGEN_FAST_MATH and use_fast_math
...
test=develop
7 years ago
Tao Luo
d770b9bda3
Merge pull request #13663 from luotao1/resnet50_ut
...
add resnet50 inference unit-test
7 years ago
Michal Gallus
09d9d77a8f
Enable MKLDNN in Naive Executor
...
test=develop
7 years ago
Tao Luo
a89afd4c22
Merge pull request #13685 from luotao1/naive_cmake
...
update libpaddle_fluid.a/so
7 years ago
luotao1
9cbf2023ab
rollback paddle_inference_helper.h to helper.h
...
test=develop
7 years ago
Tao Luo
824a82d728
Merge pull request #13672 from luotao1/gen_fluid_library
...
reduce inference ci time
7 years ago
luotao1
d55d7e04fd
update libpaddle_fluid.so with zeroCopy
...
test=develop
7 years ago
Xin Pan
425a882165
Merge pull request #13643 from panyx0718/ir2
...
clean up channel
7 years ago
luotao1
a989a4e7c2
refine paddle_inference_helper.h
7 years ago
tensor-tang
ede4b230be
Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
...
Adding fused_embedding_fc_lstm op
7 years ago
Xin Pan
ddd60581b7
clean up channel
...
test=develop
7 years ago
Tao Luo
cfbd71c223
reduce inference ci time
...
test=develop
7 years ago
JiabinYang
358b386953
test=develop
7 years ago
Tao Luo
21ee30595b
clean some CMakeLists
...
test=develop
7 years ago
Tao Luo
b31905c54d
Merge branch 'develop' into resnet50_ut
7 years ago
Tao Luo
1dcd6ee532
add resnet50 inference UT
7 years ago
Yan Chunwei
c8744d118d
fea/infer executor and concurrency performance issue bug fix ( #13451 )
...
- add naive executor
- fix concurrency performance issue
7 years ago
Jacek Czaja
910cd415f2
- Disabled embedding_fc_lstm_fuse by defult and
...
extended test_text_classification ot use new op
7 years ago
Jacek Czaja
7ab5626dee
- Added initial pass for embedding-fc-lstm
...
- Added draft of new operator
- Added fused embedding fc lstm files
- First time embedding_fc_lstm_fuse_pass was invoked in
test_text_classification
- Added Embedding pattern
- Not crashing
- Enabled draft of embedding_fc_lstm pass (does it job)
- First working (Seqcompute only) version
- Removed diagnostic comment
- First enabling of BatchCompute
- Disabling pass for embedding with is_sparse and is_distributed
- Cosmetics
- Style
- Style
7 years ago
Yan Chunwei
9e8d372ff4
hide attention lstm fuse ( #13615 )
7 years ago
nhzlx
6c81230683
update code for config change
...
test=develop
7 years ago
nhzlx
5c57e15044
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
Tao Luo
f67483bf3b
add seq_conv UT ( #13517 )
...
* add multi_label UT
* rename, fix typo, add fuse_statis check
7 years ago
Tao Luo
c07b2a97a9
Merge pull request #13521 from Sand3r-/mgallus/fix-pooling-ceiled-size
...
Enable MKL-DNN in Analysis Predictor
7 years ago
Michal Gallus
f465b03ef9
Enable MKLDNN in Analysis Predictor
...
Also fix MKL-DNN pooling integration for ceil mode
7 years ago
Yan Chunwei
e426cdae32
fix inference output with lod ( #13557 )
7 years ago
Yan Chunwei
5de14c6b96
refine inference api ( #13518 )
7 years ago
dzhwinter
c66a8d2cd8
add guide ( #13332 )
...
* add guide
* "fix doc"
* Update windows_inference.md
Looks like there is a little problem in markdown format writing of head lines
7 years ago
dzhwinter
24447ec517
flags ( #13541 )
7 years ago
dzhwinter
4fd5eb2255
"refine cmake" ( #13546 )
7 years ago
dzhwinter
97636a9fcf
"fix link error" ( #13545 )
7 years ago
nhzlx
baae7e4f63
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
2763321684
fix comments
7 years ago
Yan Chunwei
90bc14da24
simple fix on inference tester helper ( #13507 )
7 years ago
nhzlx
0514882bc5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
1f6c9dbad4
fix typo
7 years ago
nhzlx
f277f53c33
out of memory... i bet it's the last time commit for this pr
7 years ago
Tao Luo
b75887514e
Refine infer api test ( #13472 )
...
* refine analyzer_nlp_tester
* refine analyzer_rnn/vis_tester
7 years ago
nhzlx
0c51170052
fix the ut test error :)
7 years ago
nhzlx
4801beb101
add arguments for trt config
7 years ago
nhzlx
4c52be07dd
fix ut error
7 years ago
nhzlx
94a57f1d83
add trt config to arguments
7 years ago
nhzlx
68fb818aa8
add ut of trt common models
7 years ago
Tao Luo
2d89849125
add WITH_INFERENCE_API_TEST option ( #13425 )
7 years ago
nhzlx
cc4a7661c6
merge develop
7 years ago
nhzlx
d40402f9b7
add dropout and sigmoid op converter
7 years ago
Jiabin Yang
edb9e56934
Merge pull request #13401 from JiabinYang/mac/ci_unitest
...
add unitttest for mac on ci after some untest being disable
7 years ago
dzhwinter
85f8dd1c77
debug version
7 years ago
Yan Chunwei
3725f22442
Hotfix/api predictor ( #13383 )
...
* hotfix for PaddleTensor buffer.
7 years ago
Tao Luo
65b1fbb5d8
Merge pull request #13399 from luotao1/fix_cmake
...
fix text_classification download error
7 years ago
dzhwinter
e1999538eb
debug the device context
7 years ago
dzhwinter
372caf4000
windows staff
7 years ago
JiabinYang
87b11179e5
add unitttest for mac on ci after some untest being disable
7 years ago
tensor-tang
5fd2ffdce7
Merge pull request #13372 from tensor-tang/fea/ut/vis
...
add analysis vis ut
7 years ago
luotao1
e93c7b62dc
fix text_classification downlaod error
7 years ago
tensor-tang
26fc698f85
disable mkldnn fuse on ocr test
7 years ago
JiabinYang
9a9105018d
fix mac compile error in subgraph_splitter
7 years ago
tensor-tang
1a99302c14
refine and reuse code
7 years ago
tensor-tang
b7a64e8698
fix confilts
7 years ago
tensor-tang
4e4f952dea
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
tensor-tang
89d09e6594
Merge branch 'develop' into fea/ut/vis
7 years ago
Zhaolong Xing
c9995289f1
Merge pull request #13124 from NHZlX/fix_subgraph_bug
...
Fix tensorrt subgraph bug
7 years ago
Tao Luo
d4a5326ac6
Merge pull request #13387 from luotao1/nlp_multi_thread
...
add multi-thread for nlp unit-tests
7 years ago
Tao Luo
968a56b672
Merge pull request #13373 from sfraczek/conv-relu-pass-hotfix
...
hotfix for conv-relu pass
7 years ago
tensor-tang
0bd6476f67
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
luotao1
6b9ccd97a2
Merge branch 'develop' into nlp_multi_thread
7 years ago
luotao1
20b40cb06a
add multi-thread for nlp unit-tests
7 years ago
luotao1
29f5a93b5f
add analyzer_rnn2_test
7 years ago
nhzlx
0092ad3285
delete unused log
7 years ago
tensor-tang
dd0b2036c6
add note for use mkldnn
7 years ago
nhzlx
329a8c5283
merge develop
7 years ago
nhzlx
49bafc05bf
fix comments and set name for trt layer and ITensor
7 years ago
Sylwester Fraczek
dd149d469b
hotfix for conv-relu pass
7 years ago
tensor-tang
01f0f16884
enable mkldnn in infer api
7 years ago
tensor-tang
33c21291e7
Merge remote-tracking branch 'ups/develop' into fea/ut/vis
7 years ago
tensor-tang
65f901b36f
disable fc gru temporarily
7 years ago
tensor-tang
539b3f300f
add ocr analysis ut
7 years ago
luotao1
b12322ce95
fix fusion_lstm unique_name bug
7 years ago
Tao Luo
f351ceb65a
Merge pull request #13345 from Superjomn/bugfix/lac_test
...
fix ner_test when bs>1
7 years ago
dzhwinter
c3e1fb5a3e
add demo
7 years ago
tensor-tang
8cbb3c0720
refine lac ut and fix fetch
7 years ago
superjomn
8e0fe035d4
fix ner_test when bs>1
7 years ago
nhzlx
df161e08f0
delete unuse ut
7 years ago
nhzlx
49b5b3c5b3
merge develop
7 years ago
nhzlx
03ff4f6892
fix subgraph bug!
7 years ago
Tao Luo
94b66bdb5d
Merge pull request #13326 from luotao1/analysis_test_refine
...
refine Analysis test
7 years ago
Xin Pan
17bf8713a5
Merge pull request #12988 from panyx0718/ir2
...
program and tensor versioning support
7 years ago
luotao1
9664c53c7c
fix cmake error to pass the ci
7 years ago
luotao1
81c21705b4
simplify inference/tests/api/CMakeLists.txt
7 years ago
luotao1
d0fbe78040
move analyzer_xxx_tester to inference/tests/api
7 years ago
luotao1
83af1b3b3e
move analyzer_rnn1_test out of analyzer_test
7 years ago
Yan Chunwei
2fd1bf2ea6
fea/add color log ( #13305 )
7 years ago
Yan Chunwei
5023530a8a
Refactor/remove sensitive ( #13314 )
7 years ago
Yan Chunwei
478a4e850e
refactor ir pattern ( #13304 )
7 years ago
Xin Pan
4313d870a2
refine
7 years ago
Xin Pan
c69cf6dde8
fix
7 years ago
Xin Pan
926e1077ca
version
7 years ago
tensor-tang
ca973139fe
Merge pull request #13285 from tensor-tang/refine/ut/lac
...
add analysis unit test of lac and ner
7 years ago
tensor-tang
5a2fc5b52f
fix print error
7 years ago
tensor-tang
3ea19b7596
fix bug and fc pass ut
7 years ago
tensor-tang
acfdbf0293
enable ner analysis test and refine lac
7 years ago
tensor-tang
df0c695618
fix fusion gru pass and enable it
7 years ago
luotao1
d4c3fe9a44
clean api_anakin_engine_rnn_tester
7 years ago
tensor-tang
7eebb90523
fix conflicts
7 years ago
tensor-tang
3c3ad1e4cf
Merge branch 'develop' into refine/ut/lac
7 years ago
tensor-tang
ca30127e0a
fix compile error undef registrar pass
7 years ago
tensor-tang
0618077971
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
tensor-tang
6b104c90d3
fix profile
7 years ago
luotao1
00c7230996
Merge branch 'develop' into all_data
7 years ago
Yan Chunwei
6de0a18d5f
Refine/text classification support data ( #13256 )
7 years ago
Tao Luo
11b22883be
Merge pull request #12738 from luotao1/anakin_cpu
...
support anakin for only-cpu environment
7 years ago
Xin Pan
883bbe1958
Merge pull request #13238 from panyx0718/clean
...
Clean
7 years ago
luotao1
4c283d87ef
Merge branch 'develop' into all_data
7 years ago
luotao1
61cae53e79
support anakin for only-cpu environment
7 years ago
Yan Chunwei
225ecee5ea
refine/text classification tester ( #13244 )
7 years ago
tensor-tang
4d774953c6
enable fc gru fuse pass
7 years ago
tensor-tang
09016df8df
make analyzer run
7 years ago
luotao1
fa5036aac8
add test_all_data in test_analyzer_ner
7 years ago
Xin Pan
18442a6088
rename pass.h/.cc to analysis_pass
7 years ago
tensor-tang
12b483c0db
Merge remote-tracking branch 'ups/develop' into refine/ut/lac
7 years ago
luotao1
b4fa3dbda3
unify PrintTime of analysis unit-test
7 years ago
luotao1
f615ba2f8f
update the multi-thread unit-tests
7 years ago
luotao1
35cff5e00d
Merge branch 'develop' into multi-thread2
7 years ago
Yan Chunwei
9df2d8b5ba
test/add text-classification test ( #13081 )
7 years ago
luotao1
1a373fbb0d
add result check for multi-thread UT
7 years ago
luotao1
2dc23ffaa8
Merge branch 'develop' into multi-thread2
7 years ago
luotao1
8cb92fb18e
speedup the download of inference_demo
7 years ago
luotao1
39ed148714
fix multi-thread hang temporary
7 years ago
luotao1
459d4cc811
Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo
907696709f
Merge pull request #13133 from luotao1/library
...
add static and shared Library for analysis and IR
7 years ago
Jiabin Yang
d091dd02a0
fix mac compile error 0903 ( #13184 )
7 years ago
Yan Chunwei
796c87d563
bugfix/fusion lstm ( #13185 )
7 years ago
luotao1
ae44efffee
fix ci error
7 years ago
tensor-tang
d83187dba8
enable lac analysis test
7 years ago
luotao1
6f18217386
fix codestyle
7 years ago
luotao1
d7b4965785
auto generate paddle_inference_pass.h
7 years ago
dzhwinter
379b471ee2
squash commit
7 years ago
luotao1
0639a32477
Merge branch 'develop' into library
7 years ago
luotao1
f507e5c1f2
update multi-threads UT
7 years ago
luotao1
37d1a6685c
Merge branch 'develop' into multi-thread2
7 years ago
Tao Luo
737a033ed0
Merge pull request #13140 from dzhwinter/windows/inference_api
...
modify the timer
7 years ago
dzhwinter
b4d43030ff
windows inference fix ( #13141 )
...
* windows inference fix
* windows inference fix
7 years ago
Yan Chunwei
597b73053d
refine/fc lstm fusion link ( #13158 )
7 years ago
tensor-tang
1e7ccf9f45
Merge pull request #13126 from tensor-tang/fea/infer/ut/lac-new
...
add lac infer test
7 years ago
dzhwinter
a0aa2ec8b5
build compile
7 years ago
dzhwinter
75681c0a79
switch to 9.2
7 years ago
dzhwinter
bfa9b268de
fix elementwise
7 years ago
dzhwinter
dbe90cc0f6
merge develop branch
7 years ago
Jiabin Yang
6ba2b22279
Merge pull request #13096 from JiabinYang/fix_mac
...
Fix Mac compile error
7 years ago
luotao1
fb077c17e6
add shared library for analysis
7 years ago
tensor-tang
9f02497b23
follow comment
7 years ago
tensor-tang
713e86486d
bugfix ditu test
7 years ago
tensor-tang
63b38ca40b
add lac test
7 years ago
tensor-tang
663a11ac7c
bugfix and follow comment
7 years ago
nhzlx
5ec2fb0c93
add flexibledfs for find path between two nodes
7 years ago
luotao1
f3b7e18be9
add static library for analysis
7 years ago
luotao1
0fbe0a7a28
add multi-thread ut for ditu-rnn
7 years ago
luotao1
b3cd2ae88b
Merge branch 'develop' into ner_ut2
7 years ago
Yan Chunwei
af15f6f038
fea/refine fuse ( #13076 )
7 years ago
luotao1
07cb64adc0
add unit-test for chinese_ner
7 years ago
Xin Pan
823c4f87be
Merge pull request #13058 from panyx0718/infer
...
use fast RunPrepareContext for inference
7 years ago
Jiabin Yang
cceffca6bf
Update api_impl.cc
7 years ago
Jiabin Yang
5d5b70ad79
Update CMakeLists.txt
7 years ago
JiabinYang
7c7d3d6172
Fix mac
7 years ago
Yan Chunwei
cfa6bbb755
move nodeid from graph to node ( #13065 )
7 years ago
Xin Pan
5adf118ab5
polish
7 years ago
Xin Pan
c558f059ad
fix
7 years ago
Xin Pan
4794d9cf70
use fast RunPrepareContext for inference
7 years ago
Yan Chunwei
902f19b46a
fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand ( #13006 )
7 years ago
dzhwinter
b78394ea57
done
7 years ago
Xin Pan
2bb15f437c
Merge pull request #12791 from panyx0718/ir3
...
graph to program pass
7 years ago
dzhwinter
b74af56bbc
cpu compile is done
7 years ago
Xin Pan
880cb8c4c3
clean
7 years ago
Xin Pan
1a67061fee
graph to program pass
...
fix a few other things
7 years ago
dzhwinter
78aab05b71
fix more op errors
7 years ago
nhzlx
478eeabdd4
refine uttest of api_tensorrt_subgraph_engine
7 years ago
nhzlx
791aa7f49d
merge develop
7 years ago
dzhwinter
7dceb8a080
check some operators
7 years ago
dzhwinter
4fcc293617
memory module ( #12931 )
...
* memory module
* "fix ci"
7 years ago
dzhwinter
488a2dd2e8
with ir node
7 years ago
nhzlx
3de4556659
concat op && map cnn model support
7 years ago
dzhwinter
89f95ea25e
merge develop branch
7 years ago
dzhwinter
34f8c9b6f5
windows port
7 years ago
luotao1
9c7fde45a7
enhance test_analyzer to profile ditu inference demo
7 years ago
Tao Luo
decda738b0
fea/anakin compile with demo ( #12772 )
...
* anakin support x86
* fix code style
* add anakin ditu cnn demo
* add timer
* add rnn
* fix inference_anakin_cnn/rnn_test compile error
* make anakin_rnn_tester run
* add anakin_enable_op_time option
* update api/CMakeLists.txt
* enlarge the max_batch_size in anakin.config
* update with comments
7 years ago
Yan Chunwei
9ee698e605
enhance/ditu rnn with fc fuse ( #12831 )
...
* make fc fuse work with ditu rnn
* add ditu rnn data download to CMAKE
7 years ago
nhzlx
c999895e93
merge develop
7 years ago
nhzlx
276950291a
1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei
896a37b6e3
fea/link ir to inference analysis and fc fuse support ( #12789 )
...
* link IR graph to analysis graph
* add clean code and update
* add infer_clean_pass
* add ir_pass_manager
* support fc fuse executation
* fix ir circle
7 years ago
nhzlx
ff052c0e6f
merge develop
7 years ago
nhzlx
c6a5c4b0c0
add comments for execute in ut_helper
7 years ago
tangwei12
99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
...
fix some teeny mistakes
7 years ago
luotao1
808e5b1748
fix tensorrt compiler bug
7 years ago
nhzlx
1bf9d9e90c
fix comments
7 years ago
tangwei12
cfb12f09bf
fix some teeny mistakes
7 years ago
Tao Luo
7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
...
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
nhzlx
144b20c160
add batch norm op converter
7 years ago
nhzlx
14311bb094
merge develop
7 years ago
Zhaolong Xing
e5674f6dde
Merge pull request #12753 from NHZlX/add_benchmark
...
modify tensorrt engine op from cpu mode to gpu
7 years ago
Zhaolong Xing
310708726b
Merge pull request #12761 from NHZlX/global_pooling_trt
...
Add support for global pooling for trt
7 years ago
nhzlx
1e92baf746
fix comments
7 years ago
nhzlx
ce7f361a80
fix comments
7 years ago
nhzlx
df9cbabcee
add pool2d test for global_pooling true
7 years ago
Yan Chunwei
6fe5547db7
switch NodeAttr to boost::varient ( #12539 )
7 years ago
nhzlx
133ec69625
add batch norm trt converter
7 years ago
luotao1
413bf9d494
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter
f36818d532
"windows testing easier" ( #12739 )
7 years ago
nhzlx
2bdd20be22
add support for global pooling for trt
7 years ago
nhzlx
f55e8901c8
merge develop
7 years ago
nhzlx
1600ba86f6
1. change tensorrt op from cpu to gpu
7 years ago
luotao1
9f3789944c
use latest anakin commit
7 years ago
Yan Chunwei
e765dead86
add profiler to fluid inference ( #12707 )
7 years ago
Zhaolong Xing
83c85f34e8
Merge pull request #12598 from NHZlX/add_tensorrt_softmax
...
Add tensorrt softmax
7 years ago
Tao Luo
1e1974c998
Merge pull request #12563 from luotao1/anakin_test
...
* make inference_anakin_test SERIAL
* add anakin compiler from github source code
* fix inference_lib_dist error
* add comment
* update anakin.cmake
* fix anakin-NOTFOUND compiler error
* modify the anakin_model download dir
7 years ago
Wu Yi
8b77448d5f
hide misc APIs ( #12540 )
...
* hide misc APIs
* update
* fix transformer test
* update API.spec
7 years ago
luotao1
a222d336ca
modify the anakin_model download dir
7 years ago
luotao1
22bc328951
fix anakin-NOTFOUND compiler error
7 years ago
luotao1
b2367f3661
update anakin.cmake
7 years ago
xzl
29ad9794bb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1
f4bcee1d6f
Merge branch 'develop' into anakin_test
7 years ago
luotao1
94042ccd2d
add comment
7 years ago
Yan Chunwei
7555cfe33a
fix inference double free bug ( #12613 )
7 years ago
Luo Tao
64c0ba288a
fix inference_lib_dist error
7 years ago
nhzlx
641f32da8c
add softmax op converter
7 years ago
nhzlx
943950c190
refine graph draw
7 years ago
nhzlx
7a019cd608
merge develop
7 years ago
nhzlx
e823ce68bb
filter redundant output
7 years ago
nhzlx
c69ae865db
fix comments
7 years ago
Luo Tao
e8aa6d1283
add anakin compiler from github source code
7 years ago
nhzlx
e8954a36f5
merge develop
7 years ago
nhzlx
32a9e050bc
mapping the variable name inside the subgraph
7 years ago
Luo Tao
cf74473244
make inference_anakin_test SERIAL
7 years ago
superjomn
ebe1920626
add comment
7 years ago
superjomn
3c5e15de03
disable anakin test
7 years ago
Zhaolong Xing
d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
...
Add tensorrt elementwise add
7 years ago
nhzlx
d50f776b27
merge develop
7 years ago
nhzlx
64a08f840f
increase the test batch
7 years ago
nhzlx
c7e6a11bc1
merge develop
7 years ago
nhzlx
0015df1b12
modify op converter for conv2d
7 years ago
gongweibao
819ac3df0a
Modify style ( #12465 )
7 years ago
cuichaowen
046de2acdb
Improve anakin feature ( #11961 )
7 years ago
nhzlx
c13efe02d9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx
a5c96af33c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yan Chunwei
dcfbc6a661
inference analyzer as bin ( #12450 )
7 years ago
Yan Chunwei
31a2c87688
fea/lightly support lod ( #12451 )
7 years ago
nhzlx
5fcdd81da7
tiny modify
7 years ago
nhzlx
f05c7fb8ae
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx
6f6d552790
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Superjomn
4d2405d851
inference analysis support ssa
7 years ago
minqiyang
e96fef2cf7
Fix inference api impl deps
7 years ago
Luo Tao
062556f938
Merge branch 'develop' into unify
7 years ago
nhzlx
98948b975e
wrong added file
7 years ago
nhzlx
830aa12c1a
add elementwise init code
7 years ago
Zhaolong Xing
85c4912755
Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
...
Add tensorrt pooling converter
7 years ago
tensor-tang
9788e5ab87
add flags to control num_threads
7 years ago
nhzlx
4f71a3b12b
fix a bug
7 years ago
Luo Tao
83e59257d0
fix manylinux1 Failed to publish artifacts
7 years ago
nhzlx
c8adfb3451
add paddle_enforce
7 years ago
nhzlx
5533400720
fix comments
7 years ago
Luo Tao
5ba4337698
unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx
01566fb61b
1. support mutil batch utest 2. support pool op
7 years ago
nhzlx
990741aa85
add weight's dim assert
7 years ago
nhzlx
21890ca0cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
tensor-tang
7b63b85086
fix mismatch of infer api ( #12342 )
7 years ago
nhzlx
fc41eb40b1
add conv2d trt converter
7 years ago
nhzlx
4d49e61ab8
fix comments
7 years ago
nhzlx
bcd67bdd71
add assert for GetOutput
7 years ago
nhzlx
7382f98600
1. set ut batch > 1 2. readd the mul op(utest will be added later)
7 years ago
nhzlx
bd64979fe9
the argument should not be a const one
7 years ago
nhzlx
f42ea48996
deal with conflict
7 years ago
nhzlx
82527696e7
1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface
7 years ago
nhzlx
2372daff1d
there is no batchsize concept in tensorrt's tensor
7 years ago
Yan Chunwei
9e0a94f069
inference-api code clean ( #12274 )
7 years ago
Yan Chunwei
b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger ( #12266 )
7 years ago
qiaolongfei
0e30c9d6fb
fix mac build
7 years ago
Tao Luo
3694fd5c4a
Merge pull request #12109 from emailweixu/cpu_only
...
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
nhzlx
d384d39a68
add Temporarily add code with bug
7 years ago
Tao Luo
a8f0931428
Merge pull request #12229 from luotao1/api_doc
...
fix dead link in high_level_api.md
7 years ago
Luo Tao
43c1481f88
fix dead link in high_level_api.md
7 years ago
tensor-tang
d4691cedec
fix mac build
7 years ago
Luo Tao
2e68abf47c
rename api.h to paddle_inference_api.h, put demo_ci in fluid_install_dir
7 years ago
Luo Tao
44b6a5f308
fix inference_lib.cmake and make demo_ci pass
7 years ago
Luo Tao
af1e54acd8
fix compiler error after move
7 years ago
Luo Tao
369dfb3d0f
move contrib/inference to paddle/fluid/inference/api
7 years ago
Wei Xu
264e8305b0
Fixed unittests for WITH_GPU=OFF and WITH_DISTRIBUTE=OFF build
7 years ago
Luo Tao
b1a1124d36
fix compiler and run error in static library
7 years ago
Luo Tao
24ced1d0b9
add independent demo for test static fluid library
7 years ago
Yan Chunwei
0cefb9461f
add topological sortting ( #12059 )
7 years ago
tensor-tang
f92024470b
Merge pull request #12052 from tensor-tang/refine/infer/api/static
...
inference api static lib symbol hidden
7 years ago
tensor-tang
2238ea56de
paddle fluid static lib symbol hidden
7 years ago
Luo Tao
fc3e7341fc
fix compile warning in inference related codes
7 years ago
tensor-tang
3df99e72ab
Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
...
fix conflicts
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
Yan Chunwei
4f555909ce
analysis/code clean ( #11964 )
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparation bugs
7 years ago
Xin Pan
a9086bf320
also move a few other dir to legacy/
7 years ago
Yan Chunwei
5e2656449c
add inference-analysis doc ( #11813 )
7 years ago
gongweibao
c2165ffa7b
Fix codesytle ( #11836 )
7 years ago
fengjiayi
aab47cc08d
fix Mac compile errors ( #11829 )
7 years ago
superjomn
ba99bc2384
update
7 years ago
superjomn
f1224945ba
fix analysis compile bug
7 years ago
Yan Chunwei
5082642bdb
feature/analysis to support sub-graph for TRT engine ( #11538 )
7 years ago
tensor-tang
e3a96300bb
move SetNumThreads to platform
7 years ago
tensor-tang
1f09ddf806
Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
gongweibao
19958eeb71
fix ( #11590 )
7 years ago
tensor-tang
f503f12925
enable dynamic load mklml lib on fluid
7 years ago
gongweibao
4dda54aa5a
Fix unlikely ( #11537 )
7 years ago
Yan Chunwei
d734595978
Feature/pass manager ( #11440 )
7 years ago
tensor-tang
609dccfb55
Merge pull request #11395 from tensor-tang/fix
...
remove mkldnn flag from gtest strdup for cpu
7 years ago
tensor-tang
0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
...
Refine test and scope lock
7 years ago
tensor-tang
6c1cf60950
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yan Chunwei
5fd142c3fd
bugfix/trt engine op ( #11487 )
7 years ago