Recognize tables in images and restore them as Word documents.
- Step 1: python server.py
  Load the U-Net model, which extracts table lines from the input image.
- Step 2: python test.py
  Feed in the input image.
- Step 3: python image2word.py
  Restore the table using OpenCV and python-docx.
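To give a rough idea of what the restore step involves, here is a minimal sketch. It is not the code in image2word.py. It assumes the line-extraction step yields a binary mask (1 = table-line pixel) as a list of rows, and it only handles a regular grid without merged cells. The helper names `find_line_positions`, `grid_shape`, `write_empty_table` and the `min_fill` threshold are hypothetical; the python-docx calls used are `Document()`, `add_table(rows=..., cols=...)` and `save()`.

```python
def find_line_positions(mask, axis, min_fill=0.8):
    """Return the centre index of each run of rows (axis=0) or columns
    (axis=1) whose share of line pixels is at least min_fill."""
    height, width = len(mask), len(mask[0])
    if axis == 0:
        fills = [sum(row) / width for row in mask]
    else:
        fills = [sum(mask[y][x] for y in range(height)) / height
                 for x in range(width)]

    positions, start = [], None
    for i, fill in enumerate(fills + [0.0]):  # sentinel closes a trailing run
        if fill >= min_fill and start is None:
            start = i
        elif fill < min_fill and start is not None:
            positions.append((start + i - 1) // 2)
            start = None
    return positions


def grid_shape(mask):
    """Estimate (rows, cols) of a regular table from its line mask."""
    h_lines = find_line_positions(mask, axis=0)
    v_lines = find_line_positions(mask, axis=1)
    return max(len(h_lines) - 1, 0), max(len(v_lines) - 1, 0)


def write_empty_table(n_rows, n_cols, path):
    """Write an empty n_rows x n_cols table to a .docx file."""
    from docx import Document  # python-docx, imported only when needed

    doc = Document()
    table = doc.add_table(rows=n_rows, cols=n_cols)
    table.style = "Table Grid"  # built-in style that draws cell borders
    doc.save(path)
```

In a real pipeline the mask would more likely be a NumPy array produced with OpenCV, and the detected line positions would also be used to crop each cell for text recognition before filling the Word table.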
Steps 1 and 2 are not necessary if your PDF images are already fairly clean. This project cannot yet handle complex samples such as crumpled or colorful receipts; I am still working on it.
I am working on table recognition for inputs like this image and am struggling with the dataset. Optimistically, there could be a major improvement within a few weeks. If you are researching page layout and table recognition, please contact me: [email protected]