This is a PyTorch re-implementation of CRNN: Convolutional Recurrent Neural Network (paper). The features are summarized blow:
We use the synthetic dataset (mjsynth) released by Jaderberg et al. as the training data. The dataset contains 8 millions training images and their corresponding ground truth words. Such images are generated by a synthetic text engine and are highly realistic.
@article{Jaderberg14c, title={Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition}, author={Jaderberg, M. and Simonyan, K. and Vedaldi, A. and Zisserman, A.}, journal={arXiv preprint arXiv:1406.2227}, year={2014} }
- PyTorch 1.1.0
Extract training & test images:
$ python extract.py
$ python train.py
If you want to visualize during training, run in your terminal:
$ tensorboard --logdir runs
Pick 10 random examples from test set of mjsynth:
$ python demo.py