Stars

Document

36 repositories

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 133 11 Updated Jan 11, 2025

AI tool to build charts based on text input

TypeScript 3,579 346 Updated Aug 22, 2023

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 3,152 595 Updated Mar 12, 2025

A Repo For Document AI

Python 2,745 150 Updated Mar 11, 2025

PyTorch deep learning models for document classification

Python 594 126 Updated Jul 21, 2023

DocLLM: A layout-aware generative language model for multimodal document understanding

123 6 Updated Jan 3, 2024

An easy way to extract information from documents

Python 1,739 129 Updated May 3, 2023

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,143 1,531 Updated Jan 13, 2025

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,501 276 Updated Jun 24, 2024

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Python 4,857 406 Updated Mar 12, 2025

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,367 3,154 Updated Mar 12, 2025

TF-ID: Table/Figure IDentifier for academic papers

Python 229 15 Updated Jul 12, 2024

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,585 133 Updated Mar 12, 2025

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,818 146 Updated Dec 30, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,654 190 Updated Dec 27, 2024

Inference for the Microsoft Florence2 VLM

Python 1,020 70 Updated Mar 11, 2025

Quick exploration into fine tuning florence 2

Jupyter Notebook 303 28 Updated Sep 19, 2024

Data processing with ML, LLM and Vision LLM

Python 4,405 437 Updated Mar 12, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,392 480 Updated Mar 10, 2025

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 255 28 Updated Apr 14, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,125 625 Updated Feb 10, 2025

A system for agentic LLM-powered data processing and ETL

Python 1,701 155 Updated Mar 10, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 28,007 2,161 Updated Mar 12, 2025

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 1,956 109 Updated Mar 12, 2025

Detect and extract tables to markdown and csv

Python 730 50 Updated Jan 24, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 20,102 1,633 Updated Mar 11, 2025

Open-source platform for extracting structured data from documents using AI.

JavaScript 1,267 43 Updated Feb 21, 2025

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,852 294 Updated Feb 21, 2025