fix readme

5 years ago · dae9e014d2
parent b346f0b3ec
commit dae9e014d2
4 changed files with 415 additions and 487 deletions
--- a/model_zoo/official/cv/lenet_quant/Readme.md
+++ b/model_zoo/official/cv/lenet_quant/Readme.md
--- a/model_zoo/official/cv/mobilenetv2_quant/Readme.md
+++ b/model_zoo/official/cv/mobilenetv2_quant/Readme.md
@ -1,100 +1,187 @@
-# MobileNetV2 Quantization Aware Training
+# Contents

-MobileNetV2 is a significant improvement over MobileNetV1 and pushes the state of the art for mobile visual recognition including classification, object detection and semantic segmentation.
+- [MobileNetV2 Description](#mobilenetv2-description)
+- [Model Architecture](#model-architecture)
+- [Dataset](#dataset)
+- [Features](#features)
+    - [Mixed Precision](#mixed-precision)
+- [Environment Requirements](#environment-requirements)
+- [Script Description](#script-description)
+    - [Script and Sample Code](#script-and-sample-code)
+    - [Training Process](#training-process)
+    - [Evaluation Process](#evaluation-process)
+        - [Evaluation](#evaluation)
+- [Model Description](#model-description)
+    - [Performance](#performance)
+        - [Training Performance](#evaluation-performance)
+        - [Inference Performance](#evaluation-performance)
+- [Description of Random Situation](#description-of-random-situation)
+- [ModelZoo Homepage](#modelzoo-homepage)

-MobileNetV2 builds upon the ideas from MobileNetV1, using depthwise separable convolution as efficient building blocks. However, V2 introduces two new features to the architecture: 1) linear bottlenecks between the layers, and 2) shortcut connections between the bottlenecks1.
+# [MobileNetV2 Description](#contents)

-Training MobileNetV2 with ImageNet dataset in MindSpore with quantization aware training.

-This is the simple and basic tutorial for constructing a network in MindSpore with quantization aware.
+MobileNetV2 is tuned to mobile phone CPUs through a combination of hardware- aware network architecture search (NAS) complemented by the NetAdapt algorithm and then subsequently improved through novel architecture advances.Nov 20, 2019.

-In this readme tutorial, you will:
+[Paper](https://arxiv.org/pdf/1905.02244) Howard, Andrew, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang et al. "Searching for MobileNetV2." In Proceedings of the IEEE International Conference on Computer Vision, pp. 1314-1324. 2019.

-1. Train a MindSpore fusion MobileNetV2 model for ImageNet from scratch using `nn.Conv2dBnAct` and `nn.DenseBnAct`.
-2. Fine tune the fusion model by applying the quantization aware training auto network converter API `convert_quant_network`, after the network convergence then export a quantization aware model checkpoint file.
+This is the quantitative network of MobileNetV2.

-[Paper](https://arxiv.org/pdf/1801.04381) Sandler, Mark, et al. "Mobilenetv2: Inverted residuals and linear bottlenecks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.
+# [Model architecture](#contents)

-# Dataset
+The overall network architecture of MobileNetV2 is show below:

-Dataset use: ImageNet
+[Link](https://arxiv.org/pdf/1905.02244)

- Dataset size: about 125G
-	- Train: 120G, 1281167 images: 1000 directories
-	- Test: 5G, 50000 images: images should be classified into 1000 directories firstly, just like train images
+# [Dataset](#contents)
+
+Dataset used: [imagenet](http://www.image-net.org/)
+
+- Dataset size: ~125G, 1.2W colorful images in 1000 classes
+	- Train: 120G, 1.2W images
+	- Test: 5G, 50000 images
 - Data format: RGB images.
-	- Note: Data will be processed in src/dataset.py 
+	- Note: Data will be processed in src/dataset.py
+
+
+# [Features](#contents)
+
+## [Mixed Precision(Ascend)](#contents)

-# Environment Requirements
+The [mixed precision](https://www.mindspore.cn/tutorial/zh-CN/master/advanced_use/mixed_precision.html) training method accelerates the deep learning neural network training process by using both the single-precision and half-precision data formats, and maintains the network precision achieved by the single-precision training at the same time. Mixed precision training can accelerate the computation process, reduce memory usage, and enable a larger model or batch size to be trained on specific hardware.
+For FP16 operators, if the input data type is FP32, the backend of MindSpore will automatically handle it with reduced precision. Users could check the reduced-precision operators by enabling INFO log and then searching ‘reduce precision’.

- Hardware（Ascend)
-  - Prepare hardware environment with Ascend processor. If you want to try Ascend, please send the [application form](https://obs-9be7.obs.cn-east-2.myhuaweicloud.com/file/other/Ascend%20Model%20Zoo%E4%BD%93%E9%AA%8C%E8%B5%84%E6%BA%90%E7%94%B3%E8%AF%B7%E8%A1%A8.docx) to ascend@huawei.com. Once approved, you can get the resources. 
+# [Environment Requirements](#contents)
+
+- Hardware:Ascend
+  - Prepare hardware environment with Ascend. If you want to try Ascend  , please send the [application form](https://obs-9be7.obs.cn-east-2.myhuaweicloud.com/file/other/Ascend%20Model%20Zoo%E4%BD%93%E9%AA%8C%E8%B5%84%E6%BA%90%E7%94%B3%E8%AF%B7%E8%A1%A8.docx) to ascend@huawei.com. Once approved, you can get the resources.
 - Framework
  - [MindSpore](http://10.90.67.50/mindspore/archive/20200506/OpenSource/me_vm_x86/)
- For more information, please check the resources below：
-  - [MindSpore tutorials](https://www.mindspore.cn/tutorial/zh-CN/master/index.html) 
+- For more information, please check the resources below
+  - [MindSpore tutorials](https://www.mindspore.cn/tutorial/zh-CN/master/index.html)
  - [MindSpore API](https://www.mindspore.cn/api/zh-CN/master/index.html)


-# Script description
+# [Script description](#contents)

-## Script and sample code
+## [Script and sample code](#contents)

 ```python
-├── mobilenetv2_quant        
-  ├── Readme.md                      
+├── mobileNetv2_quant
+  ├── Readme.md     # descriptions about MobileNetV2-Quant
  ├── scripts
-  │   ├──run_train_quant.sh                  
-  │   ├──run_infer_quant.sh
-  ├── src                              
-  │   ├──config.py                     
-  │   ├──dataset.py
-  │   ├──luanch.py       
-  │   ├──lr_generator.py                                 
-  │   ├──mobilenetV2.py
-  ├── train.py
-  ├── eval.py
-
+  │   ├──run_train_quant.sh   # shell script for train on Ascend
+  │   ├──run_infer_quant.sh    # shell script for evaluation on Ascend
+  ├── src
+  │   ├──config.py      # parameter configuration
+  │   ├──dataset.py     # creating dataset
+  │   ├──launch.py      # start python script
+  │   ├──lr_generator.py     # learning rate config
+  │   ├──mobilenetV2.py      # MobileNetV2 architecture
+  │   ├──utils.py       # supply the monitor module
+  ├── train.py      # training script
+  ├── eval.py       # evaluation script
+  ├── export.py     # export checkpoint files into air/onnx
 ```

-### Fine-tune for quantization aware training
+## [Training process](#contents)
+
+### Usage
+

-Fine tune the fusion model by applying the quantization aware training auto network converter API `convert_quant_network`, after the network convergence then export a quantization aware model checkpoint file.
+You can start training using python or shell scripts. The usage of shell scripts as follows:

- sh run_train_quant.sh Ascend [DEVICE_NUM] [SERVER_IP(x.x.x.x)] [VISIABLE_DEVICES(0,1,2,3,4,5,6,7)] [DATASET_PATH] [CKPT_PATH]
+- Ascend: sh run_train_quant.sh Ascend [DEVICE_NUM] [VISIABLE_DEVICES(0,1,2,3,4,5,6,7)] [RANK_TABLE_FILE] [DATASET_PATH] [CKPT_PATH]

-You can just run this command instead.
+### Launch

-``` bash
->>> sh run_train_quant.sh Ascend 4 192.168.0.1 0,1,2,3 ~/imagenet/train/ ~/mobilenet.ckpt
 ```
+# training example
+  shell:
+      Ascend: sh run_train_quant.sh Ascend 8 10.222.223.224 0,1,2,3,4,5,6,7 ~/imagenet/train/ mobilenet_199.ckpt
+```
+
+### Result

-Training result will be stored in the example path. Checkpoints will be stored at `. /checkpoint` by default, and training log  will be redirected to `./train/train.log` like followings. 
+Training result will be stored in the example path. Checkpoints will be stored at `. /checkpoint` by default, and training log  will be redirected to `./train/train.log` like followings.

-``` 
->>> epoch: [  0/60], step:[  624/  625], loss:[5.258/5.258], time:[140412.236], lr:[0.100]
->>> epoch time: 140522.500, per step time: 224.836, avg loss: 5.258
->>> epoch: [  1/60], step:[  624/  625], loss:[3.917/3.917], time:[138221.250], lr:[0.200]
->>> epoch time: 138331.250, per step time: 221.330, avg loss: 3.917
 ```
+epoch: [  0/200], step:[  624/  625], loss:[5.258/5.258], time:[140412.236], lr:[0.100]
+epoch time: 140522.500, per step time: 224.836, avg loss: 5.258
+epoch: [  1/200], step:[  624/  625], loss:[3.917/3.917], time:[138221.250], lr:[0.200]
+epoch time: 138331.250, per step time: 221.330, avg loss: 3.917
+```
+
+## [Eval process](#contents)

-### Evaluate quantization aware training model
+### Usage

-Evaluate a MindSpore fusion MobileNetV2 model for ImageNet by applying the quantization aware training, like:
+You can start training using python or shell scripts. The usage of shell scripts as follows:

- sh run_infer_quant.sh Ascend [DATASET_PATH] [CHECKPOINT_PATH]
+- Ascend: sh run_infer_quant.sh Ascend [DATASET_PATH] [CHECKPOINT_PATH]

-You can just run this command instead.
+### Launch

-``` bash
->>> sh run_infer_quant.sh Ascend ~/imagenet/val/ ~/train/mobilenet-60_625.ckpt
 ```
+# infer example
+  shell:
+      Ascend: sh run_infer_quant.sh Ascend ~/imagenet/val/ ~/train/mobilenet-60_1601.ckpt
+```
+
+> checkpoint can be produced in training process.

-Inference result will be stored in the example path, you can find result like the followings in `val.log`. 
+### Result

-``` 
->>> result: {'acc': 0.71976314102564111} ckpt=/path/to/checkpoint/mobilenet-60_625.ckpt
+Inference result will be stored in the example path, you can find result like the followings in `./val/infer.log`.
+
+```
+result: {'acc': 0.71976314102564111}
 ```

-# ModelZoo Homepage  
- [Link](https://gitee.com/mindspore/mindspore/tree/master/mindspore/model_zoo)
+# [Model description](#contents)
+
+## [Performance](#contents)
+
+### Training Performance
+
+| Parameters                 | MobilenetV2                                                |
+| -------------------------- | ---------------------------------------------------------- |
+| Model Version              | V2                                                         |
+| Resource                   | Ascend 910, cpu:2.60GHz 56cores, memory:314G               |
+| uploaded Date              | 06/06/2020                                                 |
+| MindSpore Version          | 0.3.0                                                      |
+| Dataset                    | ImageNet                                                   |
+| Training Parameters        | src/config.py                                              |
+| Optimizer                  | Momentum                                                   |
+| Loss Function              | SoftmaxCrossEntropy                                        |
+| outputs                    | ckpt file                                                  |
+| Loss                       | 1.913                                                      |
+| Accuracy                   |                                                            |
+| Total time                 | 16h                                                        |
+| Params (M)                 | batch_size=192, epoch=60                                   |
+| Checkpoint for Fine tuning |                                                            |
+| Model for inference        |                                                            |
+
+#### Evaluation Performance
+
+| Parameters                 |                               |
+| -------------------------- | ----------------------------- |
+| Model Version              | V2                            |
+| Resource                   | Ascend 910                    |
+| uploaded Date              | 06/06/2020                    |
+| MindSpore Version          | 0.3.0                         |
+| Dataset                    | ImageNet, 1.2W                |
+| batch_size                 | 130(8P)                       |
+| outputs                    | probability                   |
+| Accuracy                   | ACC1[71.78%] ACC5[90.90%]     |
+| Speed                      | 200ms/step                    |
+| Total time                 | 5min                          |
+| Model for inference        |                               |
+
+# [Description of Random Situation](#contents)
+
+In dataset.py, we set the seed inside “create_dataset" function. We also use random seed in train.py.
+
+# [ModelZoo Homepage](#contents)
+
+Please check the official [homepage](https://gitee.com/mindspore/mindspore/tree/master/model_zoo).
--- a/model_zoo/official/cv/mobilenetv2_quant/src/mobilenetV2_quant.py
+++ b/model_zoo/official/cv/mobilenetv2_quant/src/mobilenetV2_quant.py
@ -1,218 +0,0 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# ============================================================================
-"""MobileNetV2 Quant model define"""
-
-import mindspore.nn as nn
-from mindspore.ops import operations as P
-
-__all__ = ['mobilenetV2_quant']
-
-_quant_delay = 200
-_ema_decay = 0.999
-_symmetric = False
-_per_channel = False
-
-
-def _make_divisible(v, divisor, min_value=None):
-    if min_value is None:
-        min_value = divisor
-    new_v = max(min_value, int(v + divisor / 2) // divisor * divisor)
-    # Make sure that round down does not go down by more than 10%.
-    if new_v < 0.9 * v:
-        new_v += divisor
-    return new_v
-
-
-class GlobalAvgPooling(nn.Cell):
-    """
-    Global avg pooling definition.
-
-    Args:
-
-    Returns:
-        Tensor, output tensor.
-
-    Examples:
-        >>> GlobalAvgPooling()
-    """
-
-    def __init__(self):
-        super(GlobalAvgPooling, self).__init__()
-        self.mean = P.ReduceMean(keep_dims=False)
-
-    def construct(self, x):
-        x = self.mean(x, (2, 3))
-        return x
-
-
-class ConvBNReLU(nn.Cell):
-    """
-    Convolution/Depthwise fused with Batchnorm and ReLU block definition.
-
-    Args:
-        in_planes (int): Input channel.
-        out_planes (int): Output channel.
-        kernel_size (int): Input kernel size.
-        stride (int): Stride size for the first convolutional layer. Default: 1.
-        groups (int): channel group. Convolution is 1 while Depthiwse is input channel. Default: 1.
-
-    Returns:
-        Tensor, output tensor.
-
-    Examples:
-        >>> ConvBNReLU(16, 256, kernel_size=1, stride=1, groups=1)
-    """
-
-    def __init__(self, in_planes, out_planes, kernel_size=3, stride=1, groups=1):
-        super(ConvBNReLU, self).__init__()
-        padding = (kernel_size - 1) // 2
-        conv = nn.Conv2dBnFoldQuant(in_planes, out_planes, kernel_size, stride,
-                                    pad_mode='pad', padding=padding, quant_delay=_quant_delay, group=groups,
-                                    per_channel=_per_channel, symmetric=_symmetric)
-        layers = [conv, nn.ReLU()]
-        self.features = nn.SequentialCell(layers)
-        self.fake = nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay, min_init=0, quant_delay=_quant_delay)
-
-    def construct(self, x):
-        output = self.features(x)
-        output = self.fake(output)
-        return output
-
-
-class InvertedResidual(nn.Cell):
-    """
-    Mobilenetv2 residual block definition.
-
-    Args:
-        inp (int): Input channel.
-        oup (int): Output channel.
-        stride (int): Stride size for the first convolutional layer. Default: 1.
-        expand_ratio (int): expand ration of input channel
-
-    Returns:
-        Tensor, output tensor.
-
-    Examples:
-        >>> ResidualBlock(3, 256, 1, 1)
-    """
-
-    def __init__(self, inp, oup, stride, expand_ratio):
-        super(InvertedResidual, self).__init__()
-        assert stride in [1, 2]
-
-        hidden_dim = int(round(inp * expand_ratio))
-        self.use_res_connect = stride == 1 and inp == oup
-
-        layers = []
-        if expand_ratio != 1:
-            layers.append(ConvBNReLU(inp, hidden_dim, kernel_size=1))
-        layers.extend([
-            # dw
-            ConvBNReLU(hidden_dim, hidden_dim, stride=stride, groups=hidden_dim),
-            # pw-linear
-            nn.Conv2dBnFoldQuant(hidden_dim, oup, kernel_size=1, stride=1, pad_mode='pad', padding=0, group=1,
-                                 per_channel=_per_channel, symmetric=_symmetric, quant_delay=_quant_delay),
-            nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay, quant_delay=_quant_delay)
-        ])
-        self.conv = nn.SequentialCell(layers)
-        self.add = P.TensorAdd()
-        self.add_fake = nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay, quant_delay=_quant_delay)
-
-    def construct(self, x):
-        identity = x
-        x = self.conv(x)
-        if self.use_res_connect:
-            x = self.add(identity, x)
-            x = self.add_fake(x)
-        return x
-
-
-class MobileNetV2Quant(nn.Cell):
-    """
-    MobileNetV2Quant architecture.
-
-    Args:
-        class_num (Cell): number of classes.
-        width_mult (int): Channels multiplier for round to 8/16 and others. Default is 1.
-        has_dropout (bool): Is dropout used. Default is false
-        inverted_residual_setting (list): Inverted residual settings. Default is None
-        round_nearest (list): Channel round to . Default is 8
-    Returns:
-        Tensor, output tensor.
-
-    Examples:
-        >>> MobileNetV2Quant(num_classes=1000)
-    """
-
-    def __init__(self, num_classes=1000, width_mult=1.,
-                 has_dropout=False, inverted_residual_setting=None, round_nearest=8):
-        super(MobileNetV2Quant, self).__init__()
-        block = InvertedResidual
-        input_channel = 32
-        last_channel = 1280
-        # setting of inverted residual blocks
-        self.cfgs = inverted_residual_setting
-        if inverted_residual_setting is None:
-            self.cfgs = [
-                # t, c, n, s
-                [1, 16, 1, 1],
-                [6, 24, 2, 2],
-                [6, 32, 3, 2],
-                [6, 64, 4, 2],
-                [6, 96, 3, 1],
-                [6, 160, 3, 2],
-                [6, 320, 1, 1],
-            ]
-
-        # building first layer
-        input_channel = _make_divisible(input_channel * width_mult, round_nearest)
-        self.out_channels = _make_divisible(last_channel * max(1.0, width_mult), round_nearest)
-        self.input_fake = nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay, quant_delay=_quant_delay)
-        features = [ConvBNReLU(3, input_channel, stride=2)]
-        # building inverted residual blocks
-        for t, c, n, s in self.cfgs:
-            output_channel = _make_divisible(c * width_mult, round_nearest)
-            for i in range(n):
-                stride = s if i == 0 else 1
-                features.append(block(input_channel, output_channel, stride, expand_ratio=t))
-                input_channel = output_channel
-        # building last several layers
-        features.append(ConvBNReLU(input_channel, self.out_channels, kernel_size=1))
-        # make it nn.CellList
-        self.features = nn.SequentialCell(features)
-        # mobilenet head
-        head = ([GlobalAvgPooling(),
-                 nn.DenseQuant(self.out_channels, num_classes, has_bias=True, per_channel=_per_channel,
-                               symmetric=_symmetric, quant_delay=_quant_delay),
-                 nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay)] if not has_dropout else
-                [GlobalAvgPooling(),
-                 nn.Dropout(0.2),
-                 nn.DenseQuant(self.out_channels, num_classes, has_bias=True, per_channel=_per_channel,
-                               symmetric=_symmetric, quant_delay=_quant_delay),
-                 nn.FakeQuantWithMinMax(ema=True, ema_decay=_ema_decay, quant_delay=_quant_delay)])
-        self.head = nn.SequentialCell(head)
-
-    def construct(self, x):
-        x = self.input_fake(x)
-        x = self.features(x)
-        x = self.head(x)
-        return x
-
-
-def mobilenetV2_quant(**kwargs):
-    """
-    Constructs a MobileNet V2 model
-    """
-    return MobileNetV2Quant(**kwargs)
--- a/model_zoo/official/cv/resnet50_quant/Readme.md
+++ b/model_zoo/official/cv/resnet50_quant/Readme.md