Embed Paddle Inference in Your Application

Paddle inference offers APIs in C and C++.

A model trained by Paddle can be deployed by following the steps below:

Let's explain the steps in detail.

Optimize the native Fluid Model

The native model obtained from the training phase needs to be optimized for inference. The optimization covers the following:

Remove noise such as cost operators, which are not needed for inference;
Prune computation branches that do not contribute to the output;
Remove extraneous variables;
Reuse memory in the native Fluid executor;
Translate the model storage format into that of a third-party engine, so that the inference API can use the engine for acceleration.

An official tool performs this optimization; run paddle_inference_optimize --help for more information.

Read paddle_inference_api.h for more information.