Xin Pan
880cb8c4c3
clean
7 years ago
Xin Pan
1a67061fee
graph to program pass
...
fix a few other things
7 years ago
dzhwinter
4fcc293617
memory module ( #12931 )
...
* memory module
* "fix ci"
7 years ago
luotao1
9c7fde45a7
enhance test_analyzer to profile ditu inference demo
7 years ago
Tao Luo
decda738b0
fea/anakin compile with demo ( #12772 )
...
* anakin support x86
* fix code style
* add anakin ditu cnn demo
* add timer
* add rnn
* fix inference_anakin_cnn/rnn_test compile error
* make anakin_rnn_tester run
* add anakin_enable_op_time option
* update api/CMakeLists.txt
* enlarge the max_batch_size in anakin.config
* update with comments
7 years ago
Yan Chunwei
9ee698e605
enhance/ditu rnn with fc fuse ( #12831 )
...
* make fc fuse work with ditu rnn
* add ditu rnn data download to CMAKE
7 years ago
nhzlx
c999895e93
merge develop
7 years ago
nhzlx
276950291a
1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei
896a37b6e3
fea/link ir to inference analysis and fc fuse support ( #12789 )
...
* link IR graph to analysis graph
* add clean code and update
* add infer_clean_pass
* add ir_pass_manager
* support fc fuse execution
* fix ir circle
7 years ago
nhzlx
ff052c0e6f
merge develop
7 years ago
nhzlx
c6a5c4b0c0
add comments for execute in ut_helper
7 years ago
tangwei12
99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
...
fix some teeny mistakes
7 years ago
luotao1
808e5b1748
fix tensorrt compiler bug
7 years ago
nhzlx
1bf9d9e90c
fix comments
7 years ago
tangwei12
cfb12f09bf
fix some teeny mistakes
7 years ago
Tao Luo
7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
...
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
nhzlx
144b20c160
add batch norm op converter
7 years ago
nhzlx
14311bb094
merge develop
7 years ago
Zhaolong Xing
e5674f6dde
Merge pull request #12753 from NHZlX/add_benchmark
...
modify tensorrt engine op from cpu mode to gpu
7 years ago
Zhaolong Xing
310708726b
Merge pull request #12761 from NHZlX/global_pooling_trt
...
Add support for global pooling for trt
7 years ago
nhzlx
1e92baf746
fix comments
7 years ago
nhzlx
ce7f361a80
fix comments
7 years ago
nhzlx
df9cbabcee
add pool2d test for global_pooling true
7 years ago
Yan Chunwei
6fe5547db7
switch NodeAttr to boost::variant ( #12539 )
7 years ago
nhzlx
133ec69625
add batch norm trt converter
7 years ago
luotao1
413bf9d494
disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
dzhwinter
f36818d532
"windows testing easier" ( #12739 )
7 years ago
nhzlx
2bdd20be22
add support for global pooling for trt
7 years ago
nhzlx
f55e8901c8
merge develop
7 years ago
nhzlx
1600ba86f6
1. change tensorrt op from cpu to gpu
7 years ago
luotao1
9f3789944c
use latest anakin commit
7 years ago
Yan Chunwei
e765dead86
add profiler to fluid inference ( #12707 )
7 years ago
Zhaolong Xing
83c85f34e8
Merge pull request #12598 from NHZlX/add_tensorrt_softmax
...
Add tensorrt softmax
7 years ago
Tao Luo
1e1974c998
Merge pull request #12563 from luotao1/anakin_test
...
* make inference_anakin_test SERIAL
* add anakin compiler from github source code
* fix inference_lib_dist error
* add comment
* update anakin.cmake
* fix anakin-NOTFOUND compiler error
* modify the anakin_model download dir
7 years ago
Wu Yi
8b77448d5f
hide misc APIs ( #12540 )
...
* hide misc APIs
* update
* fix transformer test
* update API.spec
7 years ago
luotao1
a222d336ca
modify the anakin_model download dir
7 years ago
luotao1
22bc328951
fix anakin-NOTFOUND compiler error
7 years ago
luotao1
b2367f3661
update anakin.cmake
7 years ago
xzl
29ad9794bb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_softmax
7 years ago
luotao1
f4bcee1d6f
Merge branch 'develop' into anakin_test
7 years ago
luotao1
94042ccd2d
add comment
7 years ago
Yan Chunwei
7555cfe33a
fix inference double free bug ( #12613 )
7 years ago
Luo Tao
64c0ba288a
fix inference_lib_dist error
7 years ago
nhzlx
641f32da8c
add softmax op converter
7 years ago
nhzlx
943950c190
refine graph draw
7 years ago
nhzlx
7a019cd608
merge develop
7 years ago
nhzlx
e823ce68bb
filter redundant output
7 years ago
nhzlx
c69ae865db
fix comments
7 years ago
Luo Tao
e8aa6d1283
add anakin compiler from github source code
7 years ago
nhzlx
e8954a36f5
merge develop
7 years ago