Starred repositories
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
pix2tex: Using a ViT to convert images of equations into LaTeX code.
800,000 step-level correctness labels on LLM solutions to MATH problems
A generative world for general-purpose robotics & embodied AI learning.
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
DeepSeek Coder: Let the Code Write Itself
Customizable implementation of the self-instruct paper.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)