Krishna krishnapraveen7

16 followers · 104 following

Stars

Document

36 repositories

phamquiluan / jdeskew

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 133 11 Updated Jan 11, 2025

whoiskatrin / chart-gpt

AI tool to build charts based on text input

TypeScript 3,579 346 Updated Aug 22, 2023

tencentmusic / supersonic

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 3,152 595 Updated Mar 12, 2025

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,745 150 Updated Mar 11, 2025

castorini / hedwig

PyTorch deep learning models for document classification

Python 594 126 Updated Jul 21, 2023

dswang2011 / DocLLM

DocLLM: A layout-aware generative language model for multimodal document understanding

123 6 Updated Jan 3, 2024

GeorgeLuImmortal / DocLLM_reimplementation

22 Updated Mar 18, 2024

impira / docquery

An easy way to extract information from documents

Python 1,739 129 Updated May 3, 2023

karndeepsingh / Extract_key_information_Document_understanding

Jupyter Notebook 74 40 Updated Dec 26, 2022

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,143 1,531 Updated Jan 13, 2025

microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,501 276 Updated Jun 24, 2024

Zipstack / unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Python 4,857 406 Updated Mar 12, 2025

cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,367 3,154 Updated Mar 12, 2025

ai8hyf / TF-ID

TF-ID: Table/Figure IDentifier for academic papers

Python 229 15 Updated Jul 12, 2024

illuin-tech / colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,585 133 Updated Mar 12, 2025

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,818 146 Updated Dec 30, 2024

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,654 190 Updated Dec 27, 2024

kijai / ComfyUI-Florence2

Inference Microsoft Florence2 VLM

Python 1,020 70 Updated Mar 11, 2025

andimarafioti / florence2-finetuning

A quick exploration into fine-tuning Florence-2

Jupyter Notebook 303 28 Updated Sep 19, 2024

katanaml / sparrow

Data processing with ML, LLM and Vision LLM

Python 4,405 437 Updated Mar 12, 2025

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,392 480 Updated Mar 10, 2025

mlpc-ucsd / BLIVA

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 255 28 Updated Apr 14, 2024

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,125 625 Updated Feb 10, 2025

ucbepic / docetl

A system for agentic LLM-powered data processing and ETL

Python 1,701 155 Updated Mar 10, 2025

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.

Python 28,007 2,161 Updated Mar 12, 2025

lumina-ai-inc / chunkr

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 1,956 109 Updated Mar 12, 2025

VikParuchuri / tabled

Detect and extract tables to markdown and csv

Python 730 50 Updated Jan 24, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 20,102 1,633 Updated Mar 11, 2025

DocumindHQ / documind

Open-source platform for extracting structured data from documents using AI.

JavaScript 1,267 43 Updated Feb 21, 2025

QuivrHQ / MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,852 294 Updated Feb 21, 2025