Jiabin Yang d091dd02a0 fix mac compile error 0903 (#13184 )		7 years ago
demo_ci	windows inference fix (#13141 )	7 years ago
CMakeLists.txt	fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006 )	7 years ago
README.md	move contrib/inference to paddle/fluid/inference/api	7 years ago
analysis_predictor.cc	fea/refine fuse (#13076 )	7 years ago
analysis_predictor.h	fea/refine fuse (#13076 )	7 years ago
api.cc	fea/anakin compile with demo (#12772 )	7 years ago
api_anakin_engine.cc	fea/anakin compile with demo (#12772 )	7 years ago
api_anakin_engine.h	fea/anakin compile with demo (#12772 )	7 years ago
api_anakin_engine_rnn_tester.cc	fea/anakin compile with demo (#12772 )	7 years ago
api_anakin_engine_tester.cc	Improve anakin feature (#11961 )	7 years ago
api_impl.cc	Merge pull request #13140 from dzhwinter/windows/inference_api	7 years ago
api_impl.h	add unit-test for chinese_ner	7 years ago
api_impl_tester.cc	inference-api code clean (#12274 )	7 years ago
api_tensorrt_subgraph_engine.cc	use fast RunPrepareContext for inference	7 years ago
api_tensorrt_subgraph_engine_tester.cc	refine uttest of api_tensorrt_subgraph_engine	7 years ago
api_tester.cc	inference-api code clean (#12274 )	7 years ago
helper.cc	fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006 )	7 years ago
helper.h	fix mac compile error 0903 (#13184 )	7 years ago
high_level_api.md	fix dead link in high_level_api.md	7 years ago
high_level_api_cn.md	fix some teeny mistakes	7 years ago
paddle_inference_api.h	fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006 )	7 years ago
timer.h	fix elementwise	7 years ago

README.md

Embed Paddle Inference in Your Application

Paddle inference offers APIs in C and C++.

A model trained with Paddle can be deployed in two steps:

Optimize the native model;
Write the deployment code.

Let's explain the steps in detail.

Optimize the native Fluid Model

The native model obtained from the training phase needs to be optimized for inference. The optimization does the following:

Remove noise, such as cost operators that inference does not need;
Prune computation branches that do not contribute to the output;
Remove extraneous variables;
Reuse memory in the native Fluid executor;
Convert the model storage format to that of a third-party engine, so that the inference API can use the engine for acceleration.

There is an official tool for the optimization; run paddle_inference_optimize --help for more information.
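
To make the pruning idea from the list above concrete, here is a minimal sketch in C++. It is not the code used by paddle_inference_optimize: the Op struct, PruneProgram, and ExamplePrunedTypes are hypothetical names made up for illustration, and the sketch assumes the operators are stored in execution order. It walks the program backwards from the requested outputs and keeps only the operators that contribute to them, which is also how the cost operators drop out.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// Hypothetical, simplified operator description (not Paddle's own type).
struct Op {
  std::string type;
  std::vector<std::string> inputs;
  std::vector<std::string> outputs;
};

// Keep only the operators that contribute to `targets`.
// Assumes `ops` is in execution (topological) order, so a single
// backward pass is enough to find every needed operator.
std::vector<Op> PruneProgram(const std::vector<Op>& ops,
                             const std::vector<std::string>& targets) {
  std::set<std::string> needed(targets.begin(), targets.end());
  std::vector<bool> keep(ops.size(), false);

  for (std::size_t i = ops.size(); i-- > 0;) {
    bool produces_needed = false;
    for (const auto& out : ops[i].outputs) {
      if (needed.count(out) > 0) {
        produces_needed = true;
        break;
      }
    }
    if (!produces_needed) continue;  // nothing downstream uses this op
    keep[i] = true;
    // The inputs of a kept operator become needed in turn.
    needed.insert(ops[i].inputs.begin(), ops[i].inputs.end());
  }

  std::vector<Op> pruned;
  for (std::size_t i = 0; i < ops.size(); ++i) {
    if (keep[i]) pruned.push_back(ops[i]);
  }
  return pruned;
}

// Usage example: a tiny training program pruned down to its inference part.
std::vector<std::string> ExamplePrunedTypes() {
  const std::vector<Op> program = {
      {"mul", {"x", "w"}, {"hidden"}},
      {"softmax", {"hidden"}, {"prediction"}},
      {"cross_entropy", {"prediction", "label"}, {"cost"}},
      {"mean", {"cost"}, {"avg_cost"}},
  };
  std::vector<std::string> types;
  for (const auto& op : PruneProgram(program, {"prediction"})) {
    types.push_back(op.type);
  }
  return types;
}
```

With prediction as the only target, the sketch keeps mul and softmax and drops cross_entropy and mean, because nothing on the path to prediction reads cost or avg_cost.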

Write the deployment code

Read paddle_inference_api.h for more information.