Lists (1)
Sort Name ascending (A-Z)
Stars
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
A Comprehensive Benchmark for Document Parsing and Evaluation
源自PP-Structure的表格识别算法,模型转换为ONNX,推理引擎采用ONNXRuntime,部署简单,无内存泄露问题。
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Agent Framework / shim to use Pydantic with LLMs
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Handwriting Synthesis with RNNs ✏️
So your teacher asked you to upload written assignments? Hate writing assigments? This tool will help you convert your text to handwriting xD
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
⛅️ Home to Wrangler, the CLI for Cloudflare Workers®
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
OCR, layout analysis, reading order, table recognition in 90+ languages
An open-source RAG-based tool for chatting with your documents.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
DSPy: The framework for programming—not prompting—language models
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated graph based algorithm to handle the tasks.