Stars
Official code for the publication "Large Language Models as Zero-shot Dialogue State Tracker through Function Calling" https://arxiv.org/abs/2402.10466
A Gradio web UI for Large Language Models with support for multiple inference backends.
Fully open reproduction of DeepSeek-R1
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024
[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
A summary of existing representative LLM text datasets.
Data and Code for Program of Thoughts (TMLR 2023)
LLM for Long Text Summary (Comprehensive Bulleted Notes)
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
ruptures: change point detection in Python