Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee (Submitted on 3 Apr 2019)
The full paper is available at: https://arxiv.org/pdf/1904.01941.pdf
1、PyTorch>=0.4.1
2、torchvision>=0.2.1
3、opencv-python>=3.4.2
4、check requirements.txt
5、4 NVIDIA GPUs (we use 4 NVIDIA Titan X)
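A quick way to confirm that an environment meets the version minimums listed above is sketched below. It is only an illustration and not part of this repository: the helper names `version_tuple` and `check_requirements` are hypothetical, the distribution names (`torch`, `torchvision`, `opencv-python`) are assumed to be the usual PyPI ones, and the version parsing is deliberately simple (it keeps only the leading numeric parts).

```python
from importlib import metadata
from itertools import takewhile

# Minimums copied from the requirements list; package names are assumed PyPI names.
MINIMUMS = {"torch": "0.4.1", "torchvision": "0.2.1", "opencv-python": "3.4.2"}


def version_tuple(version):
    """Turn a string such as '1.13.1+cu117' into (1, 13, 1)."""
    parts = []
    # Drop any local build tag after '+', then read leading digits of each piece.
    for piece in version.split("+")[0].split("."):
        digits = "".join(takewhile(str.isdigit, piece))
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def check_requirements(minimums=MINIMUMS):
    """Map each package name to 'ok', 'too old (<version>)', or 'missing'."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        ok = version_tuple(installed) >= version_tuple(minimum)
        report[name] = "ok" if ok else f"too old ({installed})"
    return report


if __name__ == "__main__":
    for name, status in check_requirements().items():
        print(f"{name}: {status}")
```

The script does not check the GPU requirement, and requirements.txt remains the authoritative list.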
NOTE: These are old pre-trained models; I will upload links to the newly trained models.
Syndata: Syndata for baidu drive || Syndata for google drive
Syndata+IC15: Syndata+IC15 for baidu drive || Syndata+IC15 for google drive
Syndata+IC13+IC17: Syndata+IC13+IC17 for baidu drive || Syndata+IC13+IC17 for google drive
We also trained the model for the Persian language.
First, we created a Persian synthetic data set following https://arxiv.org/pdf/1604.06646.pdf, then trained the model on it.
Because no real Persian data set was available for training, we produced our own data set, which includes 400 images for training and 100 images for testing, and then trained the model on it: Persian data set
Also, because some letters in Persian are joined together, we changed the following constants:
- text_threshold=0.6.
- low_text=0.5.
- link_threshold=0.3.
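To make the role of these three constants concrete, here is a minimal pure-Python sketch of how such thresholds are commonly applied to a region score map and an affinity (link) score map in CRAFT-style post-processing. It is an illustration under assumptions, not this repository's code: the function `find_text_regions` is hypothetical, the score maps are toy nested lists, and a simple 4-connected flood fill stands in for a real connected-components routine.

```python
from collections import deque

# Values quoted above for Persian; everything else in this sketch is illustrative.
TEXT_THRESHOLD = 0.6
LOW_TEXT = 0.5
LINK_THRESHOLD = 0.3


def find_text_regions(region_map, affinity_map,
                      text_threshold=TEXT_THRESHOLD,
                      low_text=LOW_TEXT,
                      link_threshold=LINK_THRESHOLD):
    """Return one list of (row, col) pixels per accepted connected component."""
    height, width = len(region_map), len(region_map[0])
    # A pixel is foreground if either score map passes its own cutoff.
    mask = [[region_map[y][x] > low_text or affinity_map[y][x] > link_threshold
             for x in range(width)] for y in range(height)]
    seen = [[False] * width for _ in range(height)]
    regions = []
    for y in range(height):
        for x in range(width):
            if not mask[y][x] or seen[y][x]:
                continue
            # Flood fill (4-connectivity) to collect one component.
            component, queue = [], deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                component.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Keep the component only if its peak region score is confident enough.
            peak = max(region_map[cy][cx] for cy, cx in component)
            if peak >= text_threshold:
                regions.append(component)
    return regions


# Toy maps: two character peaks joined by a moderate affinity score.
region = [[0.9, 0.1, 0.1, 0.8], [0.1, 0.1, 0.1, 0.1]]
affinity = [[0.0, 0.4, 0.4, 0.0], [0.0, 0.0, 0.0, 0.0]]
print(len(find_text_regions(region, affinity)))                      # linked into one region
print(len(find_text_regions(region, affinity, link_threshold=0.5)))  # split into two regions
```

In this sketch, lowering link_threshold lets weaker affinity scores merge neighbouring characters into one region, while text_threshold discards components whose strongest region score is too low.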
Here are some examples of test images.