Real-Time Semantic Segmentation in Mobile device

This project is an example project of semantic segmentation for mobile real-time app.

The architecture is inspired by MobileNetV2 and U-Net.

LFW, Labeled Faces in the Wild, is used as a Dataset.

The goal of this project is to detect hair segments with reasonable accuracy and speed in mobile device. Currently, it achieves 0.89 IoU.

About speed vs accuracy, more details are available at my post.

Example application

iOS
Android (TODO)

Requirements

PyTorch 0.4
CoreML for iOS app.

About Model

At this time, there is only one model in this repository, MobileUNet.py. As a typical U-Net architecture, it has encoder and decoder parts, which consist of depthwise conv blocks proposed by MobileNets.

Input image is encoded to 1/32 size, and then decoded to 1/2. Finally, it scores the results and make it to original size.

Steps to training

Data Preparation

Data is available at LFW. To get mask images, refer issue #11 for more. After you got images and masks, put the images of faces and masks as shown below.

data/
  raw/
    images/
      0001.jpg
      0002.jpg
    masks/
      0001.ppm
      0002.ppm

Training

If you use 224 x 224 as input size, pre-trained weight of MobileNetV2 is available. Download it from A PyTorch implementation of MobileNetV2 and put weight file under weights directory.

python train_unet.py \
  --img_size=224 \
  --pre_trained='weights/mobilenet_v2.pth.tar'

If you use other input sizes, the model will be trained from scratch.

python train_unet.py --img_size=192

Dice coefficient is used as a loss function.

Pretrained model

Input size	IoU	Download
224	0.89	Google Drive

Converting

As the purpose of this project is to make model run in mobile device, this repository contains some scripts to convert models for iOS and Android.

coreml_converter.py
- It converts trained PyTorch model into CoreML model for iOS app.

TBD

Report speed vs accuracy in mobile device.
Convert pytorch to Android using TesorFlow Light

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.vscode		.vscode
assets		assets
nets		nets
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
config.py		config.py
coreml_converter.py		coreml_converter.py
dataset.py		dataset.py
eval_unet.ipynb		eval_unet.ipynb
eval_unet.py		eval_unet.py
loss.py		loss.py
mobilenet_v2.pth.tar		mobilenet_v2.pth.tar
train_unet.ipynb		train_unet.ipynb
train_unet.py		train_unet.py
trainer.py		trainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-Time Semantic Segmentation in Mobile device

Example application

Requirements

About Model

Steps to training

Data Preparation

Training

Pretrained model

Converting

TBD

About

Releases

Packages

Languages

License

akinoriosamura/mobile-segmentation-mobilenet-unet

Folders and files

Latest commit

History

Repository files navigation

Real-Time Semantic Segmentation in Mobile device

Example application

Requirements

About Model

Steps to training

Data Preparation

Training

Pretrained model

Converting

TBD

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages