History

Yan Chunwei bd2a537b05 feature/anakin ci (#11330 )		7 years ago
..
demo	fix gpu fraction	7 years ago
CMakeLists.txt	feature/anakin ci (#11330 )	7 years ago
README.md	mv contrib to paddle/ for unified compile (#10815 )	7 years ago
paddle_inference_api.cc	feature/inference api demo impl (#10825 )	7 years ago
paddle_inference_api.h	add Anakin api for paddle (#11228 )	7 years ago
paddle_inference_api_anakin_engine.cc	feature/anakin ci (#11330 )	7 years ago
paddle_inference_api_anakin_engine.h	feature/anakin ci (#11330 )	7 years ago
paddle_inference_api_anakin_engine_tester.cc	feature/anakin ci (#11330 )	7 years ago
paddle_inference_api_impl.cc	make infer init explicit	7 years ago
paddle_inference_api_impl.h	make infer init explicit	7 years ago
test_paddle_inference_api.cc	fix develop build issue (#10978 )	7 years ago
test_paddle_inference_api_impl.cc	fix compiler error in high-level api	7 years ago

Embed Paddle Inference in Your Application

Paddle inference offers the APIs in C and C++ languages.

One can easily deploy a model trained by Paddle following the steps as below:

Let's explain the steps in detail.

Optimize the native Fluid Model

The native model that get from the training phase needs to be optimized for that.

Clean the noise such as the cost operators that do not need inference;
Prune unnecessary computation fork that has nothing to do with the output;
Remove extraneous variables;
Memory reuse for native Fluid executor;
Translate the model storage format to some third-party engine's, so that the inference API can utilize the engine for acceleration;

We have an official tool to do the optimization, call paddle_inference_optimize --help for more information.

Read paddle_inference_api.h for more information.