Lists (1)
Sort Last updated
Stars
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
SGLang is a fast serving framework for large language models and vision language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Train transformer language models with reinforcement learning.
Fine-tune LLM agents with online reinforcement learning
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
🦜🔗 Build context-aware reasoning applications
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
A high-throughput and memory-efficient inference and serving engine for LLMs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark