Shape Robust Text Detection with Progressive Scale Expansion Network

Requirements

Python3
pyclipper
Polygon2
OpenCV
TensorFlow 2.0+

Introduction

(PSENet-tf2.0)Progressive Scale Expansion Network (PSENet) is a text detector which is able to well detect the arbitrary-shape text in natural scene. Besides, based on this text segmentation model, we got top 6 in MTWI 2018 Text Detection Challenge

Training (polygon)

CUDA_VISIBLE_DEVICES=0 python train_ic15.py

Testing (polygon)

CUDA_VISIBLE_DEVICES=0 python test_ic15.py --scale 1 --resume [path of model]

Training (quadrilateral)

CUDA_VISIBLE_DEVICES=0 python train_id41k.py

Testing (quadrilateral)

CUDA_VISIBLE_DEVICES=0 python test_id41k.py --scale 1 --resume [path of model]

Eval script for ICDAR 2015 and SCUT-CTW1500

cd eval
sh eval_ic15.sh
sh eval_ctw1500.sh

Performance (new version paper)

ICDAR 2015

Method	Extra Data	Precision (%)	Recall (%)	F-measure (%)	FPS (1080Ti)	Model
PSENet-1s (ResNet50)	-	81.49	79.68	80.57	1.6	baiduyun(extract code: rxti); OneDrive
PSENet-1s (ResNet50)	pretrain on IC17 MLT	86.92	84.5	85.69	1.6	baiduyun(extract code: aieo); OneDrive
PSENet-4s (ResNet50)	pretrain on IC17 MLT	86.1	83.77	84.92	3.8	baiduyun(extract code: aieo); OneDrive

SCUT-CTW1500

Method	Extra Data	Precision (%)	Recall (%)	F-measure (%)	FPS (1080Ti)	Model
PSENet-1s (ResNet50)	-	80.57	75.55	78.0	3.9	baiduyun(extract code: ksv7); OneDrive
PSENet-1s (ResNet50)	pretrain on IC17 MLT	84.84	79.73	82.2	3.9	baiduyun(extract code: z7ac); OneDrive
PSENet-4s (ResNet50)	pretrain on IC17 MLT	82.09	77.84	79.9	8.4	baiduyun(extract code: z7ac); OneDrive

Performance (old version paper)

ICDAR 2015 (training with ICDAR 2017 MLT)

Method	Precision (%)	Recall (%)	F-measure (%)
PSENet-4s (ResNet152)	87.98	83.87	85.88
PSENet-2s (ResNet152)	89.30	85.22	87.21
PSENet-1s (ResNet152)	88.71	85.51	87.08

ICDAR 2017 MLT

Method	Precision (%)	Recall (%)	F-measure (%)
PSENet-4s (ResNet152)	75.98	67.56	71.52
PSENet-2s (ResNet152)	76.97	68.35	72.40
PSENet-1s (ResNet152)	77.01	68.40	72.45

SCUT-CTW1500

Method	Precision (%)	Recall (%)	F-measure (%)
PSENet-4s (ResNet152)	80.49	78.13	79.29
PSENet-2s (ResNet152)	81.95	79.30	80.60
PSENet-1s (ResNet152)	82.50	79.89	81.17

ICPR MTWI 2018 Challenge 2

Method	Precision (%)	Recall (%)	F-measure (%)
PSENet-1s (ResNet152)	8.28	70.0	76

Results

Figure 3: The results on ICDAR 2015, ICDAR 2017 MLT and SCUT-CTW1500

Paper Link

[new version paper] https://arxiv.org/abs/1903.12473

[old version paper] https://arxiv.org/abs/1806.02559

Other Implements

[pytorch version (thanks @WenmuZhou)] (https://github.com/WenmuZhou/PSENet.pytorch)

[tensorflow1.x version (thanks @liuheng92)] https://github.com/liuheng92/tensorflow_PSENet

Thanks and collaborator

laizhihui @ lzh

Citation

@inproceedings{wang2019shape,
  title={Shape Robust Text Detection With Progressive Scale Expansion Network},
  author={Wang, Wenhai and Xie, Enze and Li, Xiang and Hou, Wenbo and Lu, Tong and Yu, Gang and Shao, Shuai},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={9336--9345},
  year={2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github		.github
MobileNetV2		MobileNetV2
MobileNetV3		MobileNetV3
__pycache__		__pycache__
dataset		dataset
eval		eval
figure		figure
models		models
pse		pse
util		util
LICENSE		LICENSE
README.md		README.md
metrics.py		metrics.py
mobilenet_v2.py		mobilenet_v2.py
pypse.py		pypse.py
test_ctw1500.py		test_ctw1500.py
test_id41k.py		test_id41k.py
train_ctw1500.py		train_ctw1500.py
train_id41k.py		train_id41k.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Shape Robust Text Detection with Progressive Scale Expansion Network

Requirements

Introduction

Training (polygon)

Testing (polygon)

Training (quadrilateral)

Testing (quadrilateral)

Eval script for ICDAR 2015 and SCUT-CTW1500

Performance (new version paper)

ICDAR 2015

SCUT-CTW1500

Performance (old version paper)

ICDAR 2015 (training with ICDAR 2017 MLT)

ICDAR 2017 MLT

SCUT-CTW1500

ICPR MTWI 2018 Challenge 2

Results

Paper Link

Other Implements

Thanks and collaborator

Citation

About

Releases

Sponsor this project

Packages

Languages

License

li10141110/PSENet-tf2

Folders and files

Latest commit

History

Repository files navigation

Shape Robust Text Detection with Progressive Scale Expansion Network

Requirements

Introduction

Training (polygon)

Testing (polygon)

Training (quadrilateral)

Testing (quadrilateral)

Eval script for ICDAR 2015 and SCUT-CTW1500

Performance (new version paper)

ICDAR 2015

SCUT-CTW1500

Performance (old version paper)

ICDAR 2015 (training with ICDAR 2017 MLT)

ICDAR 2017 MLT

SCUT-CTW1500

ICPR MTWI 2018 Challenge 2

Results

Paper Link

Other Implements

Thanks and collaborator

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages