
29 stars written in Jupyter Notebook

Google Research

Jupyter Notebook 34,494 7,950 Updated Dec 13, 2024

A course in reinforcement learning in the wild

Jupyter Notebook 5,953 1,699 Updated Oct 24, 2024

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 3,943 1,049 Updated Jun 6, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,400 260 Updated Dec 12, 2024

"Deep Generative Modeling": Introductory Examples

Jupyter Notebook 1,088 178 Updated Sep 22, 2024
Jupyter Notebook 767 379 Updated Mar 12, 2024

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 640 69 Updated Oct 26, 2022

Chinese translation of "Machine Learning: A Probabilistic Perspective" (Kevin P. Murphy) and Python implementations of the algorithms in the book.

Jupyter Notebook 570 135 Updated Dec 9, 2024

PyTorch implementation of Soft Actor-Critic (SAC)

Jupyter Notebook 518 102 Updated Dec 5, 2021

RAD: Reinforcement Learning with Augmented Data

Jupyter Notebook 402 71 Updated Mar 29, 2021

Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK

Jupyter Notebook 169 21 Updated Nov 10, 2019

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Jupyter Notebook 156 25 Updated Mar 28, 2021
Jupyter Notebook 147 50 Updated Apr 20, 2020

DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN

Jupyter Notebook 120 14 Updated Dec 18, 2020
Jupyter Notebook 87 22 Updated Jan 25, 2022

RL experiments

Jupyter Notebook 69 34 Updated Nov 21, 2022

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Jupyter Notebook 67 8 Updated Jul 29, 2021

Codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

Jupyter Notebook 49 8 Updated May 2, 2023

Code for the paper "Batch size invariance for policy optimization"

Jupyter Notebook 46 16 Updated Apr 2, 2023

Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", which won 1st Prize at the 17th STePS.

Jupyter Notebook 16 4 Updated Nov 15, 2020

Prioritized Sequence Experience Replay

Jupyter Notebook 10 3 Updated Aug 16, 2021

A PyTorch implementation of the paper "A Geometric Perspective on Optimal Representations for Reinforcement Learning" by Bellemare et al.

Jupyter Notebook 8 1 Updated Jan 16, 2020

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 5 3 Updated Sep 9, 2020
Jupyter Notebook 4 3 Updated May 14, 2020

CS182 Final Project: aims to improve the generalization of common reinforcement learning algorithms (e.g. PPO w/ A2C) on the ProcGen suite of environments

Jupyter Notebook 1 Updated May 14, 2020
Jupyter Notebook 1 Updated Apr 29, 2019

Research repo exploring automatic attention-based methods to speed up contrastive learning in reinforcement learning environments

Jupyter Notebook 1 Updated Apr 25, 2020
Jupyter Notebook 1 Updated Jun 20, 2020