|
|
hai 1 día | |
|---|---|---|
| .. | ||
| README.md | hai 1 día | |
| metafile.yml | hai 1 día | |
| mobilenet-v3-large_8xb128_in1k.py | hai 1 día | |
| mobilenet-v3-small-050_8xb128_in1k.py | hai 1 día | |
| mobilenet-v3-small-075_8xb128_in1k.py | hai 1 día | |
| mobilenet-v3-small_8xb128_in1k.py | hai 1 día | |
| mobilenet-v3-small_8xb16_cifar10.py | hai 1 día | |
MobileNet V3 is initially described in the paper. MobileNetV3 parameters are obtained by NAS (network architecture search) search, and some practical results of V1 and V2 are inherited, and the attention mechanism of SE channel is attracted, which can be considered as a masterpiece. The author create two new MobileNet models for release: MobileNetV3-Large and MobileNetV3-Small which are targeted for high and low resource use cases. These models are then adapted and applied to the tasks of object detection and semantic segmentation. The author of MobileNet V3 measure its performance on Imagenet classification, COCO object detection, and Cityscapes segmentation.
Predict image
from mmpretrain import inference_model
predict = inference_model('mobilenet-v3-small-050_3rdparty_in1k', 'demo/bird.JPEG')
print(predict['pred_class'])
print(predict['pred_score'])
Use the model
import torch
from mmpretrain import get_model
model = get_model('mobilenet-v3-small-050_3rdparty_in1k', pretrained=True)
inputs = torch.rand(1, 3, 224, 224)
out = model(inputs)
print(type(out))
# To extract features.
feats = model.extract_feat(inputs)
print(type(feats))
Train/Test Command
Prepare your dataset according to the docs.
Train:
python tools/train.py configs/mobilenet_v3/mobilenet-v3-small_8xb128_in1k.py
Test:
python tools/test.py configs/mobilenet_v3/mobilenet-v3-small-050_8xb128_in1k.py https://download.openmmlab.com/mmclassification/v0/mobilenet_v3/mobilenet-v3-small-050_3rdparty_in1k_20221114-e0b86be1.pth
| Model | Pretrain | Params (M) | Flops (G) | Top-1 (%) | Top-5 (%) | Config | Download |
|---|---|---|---|---|---|---|---|
mobilenet-v3-small-050_3rdparty_in1k* |
From scratch | 1.59 | 0.02 | 57.91 | 80.19 | config | model |
mobilenet-v3-small-075_3rdparty_in1k* |
From scratch | 2.04 | 0.04 | 65.23 | 85.44 | config | model |
mobilenet-v3-small_8xb128_in1k |
From scratch | 2.54 | 0.06 | 66.68 | 86.74 | config | model | log |
mobilenet-v3-small_3rdparty_in1k* |
From scratch | 2.54 | 0.06 | 67.66 | 87.41 | config | model |
mobilenet-v3-large_8xb128_in1k |
From scratch | 5.48 | 0.23 | 73.49 | 91.31 | config | model | log |
mobilenet-v3-large_3rdparty_in1k* |
From scratch | 5.48 | 0.23 | 74.04 | 91.34 | config | model |
*Models with * are converted from the official repo. The config files of these models are only for inference. We haven't reproduce the training results.*
@inproceedings{Howard_2019_ICCV,
author = {Howard, Andrew and Sandler, Mark and Chu, Grace and Chen, Liang-Chieh and Chen, Bo and Tan, Mingxing and Wang, Weijun and Zhu, Yukun and Pang, Ruoming and Vasudevan, Vijay and Le, Quoc V. and Adam, Hartwig},
title = {Searching for MobileNetV3},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2019}
}