- Bhaktapur, Nepal
- in/saurab-shrestha-092090182
- saurabh.kshitiz
Starred repositories
The code used to train and run inference with the ColPali architecture.
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
🚀🤖 Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
🔊 Text-Prompted Generative Audio Model
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features include speech-to-text with Nemo, text generation with Mistral-7…
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
An open-source RAG-based tool for chatting with your documents.
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
A Simplified PyTorch Implementation of Vision Transformer (ViT)
An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.
This is the official implementation of the paper: "iHuman: Instant Animatable Digital Humans From Monocular Videos" [ECCV 2024]
Famous Vision Language Models and Their Architectures
This repository implements an encoder-decoder model with attention for OCR.
An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
A curated list of awesome things related to Django
cgre23 / Open-Source-AI-Cookbook
Forked from huggingface/cookbook
Open-source AI cookbook
What, Why and How of LLMs.