GitHub - apple1987/lstm_ctc_ocr: Use CTC + tensorflow to OCR

master:

both standard ctc and warpCTC
read data at one time

dev(current):

the pipline version of lstm_ctc_ocr, resize to same size

beta:

generate data on the fly(highest accuracy)
deal with multi-width image, padding to same width

How to use

run python ./lib/utils/genImg.py to generate the train images in train/, validation set in valand the file name shall has the format of 00000001_name.png, the number of process is set to 16.
python ./lib/lstm/utils/tf_records.py to generate tf_records file, which includes both images and labels(the img_path shall be changed to your image_path)
./train.sh for training ./test.shfor testing

Notice that,
the pipline version use warpCTC as default : please install the warpCTC tensorflow_binding first
if your machine does not support warpCTC, then use standard ctc version in the master branch

standard CTC: use tf.nn.ctc_loss to calculate the ctc loss

Dependency

python 3
tensorflow 1.0.1
captcha
warpCTC tensorflow_binding

Some details

The training data:

Notice that, sufficient amount of data is a must, otherwise, the network cannot converge.
parameters can be found in ./lstm.yml(higher priority) and lib/lstm/utils
some parameters need to be fined tune:

learning rate
decay step & decay rate
image_width
image_height
optimizer?

in ./lib/lstm/utils/tf_records.py, I resize the images to the same size. if you want to use your own data and use pipline to read data, the height of the image shall be the same.

Result

update: Notice that, different optimizer may lead to different resuilt.

The accurary is about 85%~92% (training on 128k images)

Read this blog for more details and this blog for how to use tf.nn.ctc_loss or warpCTC

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
fonts		fonts
lib		lib
lstm		lstm
.gitignore		.gitignore
README.md		README.md
test.sh		test.sh
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to use

Dependency

Some details

Result

About

Releases

Packages

Languages

apple1987/lstm_ctc_ocr

Folders and files

Latest commit

History

Repository files navigation

How to use

Dependency

Some details

Result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages