Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use this from js? #6

Open
alaa137 opened this issue May 25, 2019 · 1 comment
Open

How to use this from js? #6

alaa137 opened this issue May 25, 2019 · 1 comment

Comments

@alaa137
Copy link

alaa137 commented May 25, 2019

I have 2 questions...

  1. How to use these files from javascript? I've been working with files with .gz extensions...
  2. Is there a trained file that supports floating numbers and a colon? (like time. e.g. 4:25).

Thanks

@Shreeshrii
Copy link
Owner

Tesseract now supports the use of zipped files of traineddata. You could try zipping the traineddata file in .gz format and using it. I have personally not tried that option.

I have created different versions of digita traineddata as experiment, some with digita and period, others with more punctuation characters. One (or more) f those should have support for numbers and colon.

My suggestion will be to first try out the different traineddata files in command mode with the images that you need to OCR. Once you figure out which is best, zip it and then use in .js.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants