parent
63fe8128d3
commit
d1f1351f9b
@ -0,0 +1,36 @@
|
||||
# BENCHMARK
|
||||
|
||||
This document gives the prediction time-consuming benchmark of PaddleOCR Ultra Lightweight Chinese Model (8.6M) on each platform.
|
||||
|
||||
## TEST DATA
|
||||
* 500 images were randomly sampled from the Chinese public data set [ICDAR2017-RCTW](https://github.com/PaddlePaddle/PaddleOCR/blob/develop/doc/doc_ch/datasets.md#ICDAR2017-RCTW-17).
|
||||
Most of the pictures in the set were collected in the wild through mobile phone cameras.
|
||||
Some are screenshots.
|
||||
These pictures show various scenes, including street scenes, posters, menus, indoor scenes and screenshots of mobile applications.
|
||||
|
||||
## MEASUREMENT
|
||||
The predicted time-consuming indicators on the four platforms are as follows:
|
||||
|
||||
| Long size(px) | T4(s) | V100(s) | Intel Xeon 6148(s) | Snapdragon 855(s) |
|
||||
| :---------: | :-----: | :-------: | :------------------: | :-----------------: |
|
||||
| 960 | 0.092 | 0.057 | 0.319 | 0.354 |
|
||||
| 640 | 0.067 | 0.045 | 0.198 | 0.236 |
|
||||
| 480 | 0.057 | 0.043 | 0.151 | 0.175 |
|
||||
|
||||
Explanation:
|
||||
* The evaluation time-consuming stage is the complete stage from image input to result output, including image
|
||||
pre-processing and post-processing.
|
||||
* ```Intel Xeon 6148``` is the server-side CPU model. Intel MKL-DNN is used in the test to accelerate the CPU prediction speed.
|
||||
To use this operation, you need to:
|
||||
* Update to the latest version of PaddlePaddle: https://www.paddlepaddle.org.cn/documentation/docs/zh/install/Tables.html#whl-dev
|
||||
Please select the corresponding mkl version wheel package according to the CUDA version and Python version of your environment,
|
||||
for example, CUDA10, Python3.7 environment, you should:
|
||||
|
||||
```
|
||||
# Obtain the installation package
|
||||
wget https://paddle-wheel.bj.bcebos.com/0.0.0-gpu-cuda10-cudnn7-mkl/paddlepaddle_gpu-0.0.0-cp37-cp37m-linux_x86_64.whl
|
||||
# Installation
|
||||
pip3.7 install paddlepaddle_gpu-0.0.0-cp37-cp37m-linux_x86_64.whl
|
||||
```
|
||||
* Use parameters ```--enable_mkldnn True``` to turn on the acceleration switch when making predictions
|
||||
* ```Snapdragon 855``` is a mobile processing platform model.
|
@ -0,0 +1,35 @@
|
||||
# DATA ANNOTATION TOOLS
|
||||
|
||||
There are the commonly used data annotation tools, which will be continuously updated. Welcome to contribute tools~
|
||||
|
||||
1.**labelImg**
|
||||
|
||||
* Tool description: Rectangular label
|
||||
|
||||
* Tool address: https://github.com/tzutalin/labelImg
|
||||
|
||||
* Sketch diagram:
|
||||
|
||||
![labelimg](C:\Users\USER\Desktop\labelimg.jpg)
|
||||
|
||||
|
||||
|
||||
2.**roLabelImg**
|
||||
|
||||
* Tool description: Label tool rewritten based on labelImg, supporting rotating rectangular label
|
||||
|
||||
* Tool address: https://github.com/cgvict/roLabelImg
|
||||
|
||||
* Sketch diagram:![roLabelImg](C:\Users\USER\Desktop\roLabelImg.png)
|
||||
|
||||
|
||||
|
||||
3.**labelme**
|
||||
|
||||
* Tool description: Support four points, polygons, circles and other labels
|
||||
|
||||
* Tool address: https://github.com/wkentaro/labelme
|
||||
|
||||
* Sketch diagram:
|
||||
|
||||
![labelme](C:\Users\USER\Desktop\labelme.jpg)
|
@ -0,0 +1,11 @@
|
||||
# DATA SYNTHESIS TOOLS
|
||||
|
||||
In addition to open source data, users can also use synthesis tools to synthesize data.
|
||||
There are the commonly used data synthesis tools, which will be continuously updated. Welcome to contribute tools~
|
||||
|
||||
* [Text_renderer](https://github.com/Sanster/text_renderer)
|
||||
* [SynthText](https://github.com/ankush-me/SynthText)
|
||||
* [SynthText_Chinese_version](https://github.com/JarveeLee/SynthText_Chinese_version)
|
||||
* [TextRecognitionDataGenerator](https://github.com/Belval/TextRecognitionDataGenerator)
|
||||
* [SynthText3D](https://github.com/MhLiao/SynthText3D)
|
||||
* [UnrealText](https://github.com/Jyouhou/UnrealText/)
|
@ -0,0 +1,28 @@
|
||||
# Handwritten OCR dataset
|
||||
Here we have sorted out the commonly used handwritten OCR dataset datasets, which are being updated continuously. We welcome you to contribute datasets ~
|
||||
- [Institute of automation, Chinese Academy of Sciences - handwritten Chinese dataset](#Institute of automation, Chinese Academy of Sciences - handwritten Chinese dataset)
|
||||
- [NIST handwritten single character dataset - English](#NIST handwritten single character dataset - English)
|
||||
|
||||
<a name="Institute of automation, Chinese Academy of Sciences - handwritten Chinese dataset"></a>
|
||||
## Institute of automation, Chinese Academy of Sciences - handwritten Chinese dataset
|
||||
- **Data source**:http://www.nlpr.ia.ac.cn/databases/handwriting/Download.html
|
||||
- **Data introduction**:
|
||||
* It includes online and offline handwritten data,`HWDB1.0~1.2` has totally 3895135 handwritten single character samples, which belong to 7356 categories (7185 Chinese characters and 171 English letters, numbers and symbols);`HWDB2.0~2.2` has totally 5091 pages of images, which are divided into 52230 text lines and 1349414 words. All text and text samples are stored as grayscale images. Some sample words are shown below.
|
||||
|
||||
![](../datasets/CASIA_0.jpg)
|
||||
|
||||
- **Download address**:http://www.nlpr.ia.ac.cn/databases/handwriting/Download.html
|
||||
- **使用建议**:Data for single character, white background, can form a large number of text lines for training. White background can be processed into transparent state, which is convenient to add various backgrounds. For the case of semantic needs, it is suggested to extract single character from real corpus to form text lines.
|
||||
|
||||
|
||||
<a name="NIST handwritten single character dataset - English"></a>
|
||||
## NIST handwritten single character dataset - English(NIST Handprinted Forms and Characters Database)
|
||||
|
||||
- **Data source**: [https://www.nist.gov/srd/nist-special-database-19](https://www.nist.gov/srd/nist-special-database-19)
|
||||
|
||||
- **Data introduction**: NIST19 dataset is suitable for handwritten document and character recognition model training. It is extracted from the handwritten sample form of 3600 authors and contains 810000 character images in total. Nine of them are shown below.
|
||||
|
||||
![](../datasets/nist_demo.png)
|
||||
|
||||
|
||||
- **Download address**: [https://www.nist.gov/srd/nist-special-database-19](https://www.nist.gov/srd/nist-special-database-19)
|
Loading…
Reference in new issue