Highlights
- Pro
Stars
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Benchmarking the Spectrum of Agent Capabilities
Benchmarking Agentic LLM and VLM Reasoning On Games
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Training Large Language Model to Reason in a Continuous Latent Space
Flexible Python configuration system. The last one you will ever need.
A virtual environment for developing and evaluating automated scientific discovery agents.
Universal LLM Deployment Engine with ML Compilation
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Hydra is a framework for elegantly configuring complex applications
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …
High throughput synchronous and asynchronous reinforcement learning
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
Reinforcement learning on general 2D physics environments in JAX.
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
RAG that intelligently adapts to your use case, data, and queries
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
Machine Learning Journal for Intermediate to Advanced Topics.
Entropy Based Sampling and Parallel CoT Decoding