-
Nanjing University
- China
- http://www.lamda.nju.edu.cn/wangch/
Stars
An Autonomous LLM Agent for Complex Task Solving
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
800,000 step-level correctness labels on LLM solutions to MATH problems
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Community interface for generative AI
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Official repo for consistency models.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini/Claude LLM 应用。
Open Academic Research on Improving LLaMA to SOTA LLM
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
OpenFE: automated feature generation with expert-level performance
Toolkit of Causal Model-based Reinforcement Learning.
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
The official code of "Adversarial Counterfactual Environment Model Learning" (NeurIPS'23 spotlight)
A python module designed for agile RL algorithm developing.
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.