Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
๐งโ๐ซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaโฆ
Best Practices on Recommendation Systems
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Simple, unified interface to multiple Generative AI providers
Perform data science on data that remains in someone else's server
An elegant PyTorch deep reinforcement learning library.
Large World Model -- Modeling Text and Video with Millions Context
PyTorch implementations of deep reinforcement learning algorithms and environments
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
๐ค ๐๐ฒ๐ฎ๐ฟ๐ป for ๐ณ๐ฟ๐ฒ๐ฒ how to ๐ฏ๐๐ถ๐น๐ฑ an end-to-end ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป-๐ฟ๐ฒ๐ฎ๐ฑ๐ ๐๐๐ & ๐ฅ๐๐ ๐๐๐๐๐ฒ๐บ using ๐๐๐ ๐ข๐ฝ๐ best practices: ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + 12 ๐ฉ๐ข๐ฏ๐ฅ๐ด-๐ฐ๐ฏ ๐ญ๐ฆ๐ด๐ด๐ฐ๐ฏ๐ด
Minimal and Clean Reinforcement Learning Examples
Tensorforce: a TensorFlow library for applied reinforcement learning
Modularized Implementation of Deep RL Algorithms in PyTorch
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Top 200 deep learning Github repositories sorted by the number of stars.
Artificial intelligence for the Snake game.
TensorFlow implementation of Deep Reinforcement Learning papers
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
ChainerRL is a deep reinforcement learning library built on top of Chainer.
pyMetaheuristic: A Comprehensive Python Library for Optimization
Repository demonstrating best practices and patterns for implementing agentic workflows in Python, featuring modular, scalable, and reusable design patterns for intelligent automation.
Multi-Depot Vehicle Routing Problem solver using Deep RL, GA and Google OR-Tools
Machine learning, artificial intelligence, and data analytics built from scratch.
KTH Artificial Intelligence (DD2380) final project VRP implemented in IP, PDDL and RL