-
Karlsruhe Institute of Technology
- Karlsruhe
Stars
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
A python utility to read an mbox email file and output selected data to a csv file
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress.
Work in progress, not ready for use. -- Getting preference feedback from real humans.
Discontinued! See https://github.com/timokau/marvin-mk2/issues/34#issuecomment-1100656280 (Previously: "Making sure your PR gets a review and your reviews don't get lost.")
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
rk1a / imitation
Forked from HumanCompatibleAI/imitationClean PyTorch implementations of imitation and reward learning algorithms
Simple and easily configurable grid world environments for reinforcement learning
Simple behavioural cloning baseline solution for BASALT 2022
Benchmark environments for reward modelling and imitation learning algorithms.
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.
Open AI Gym version of Berkeley AI Pacman with images as states
A simple stochastic OpenAI environment for training RL agents
Kojoley / atari-py
Forked from openai/atari-pyAn `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.
Tutorial material on machine learning with dirty data in Python
Library for graphical models of decision making, based on pgmpy and networkx
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Book and code for Think Complexity, 2nd edition
Code for Allen Downey's book Think Complexity, published by O'Reilly Media.
Blame someone else for your bad code.
Clean PyTorch implementations of imitation and reward learning algorithms
An educational resource to help anyone learn deep reinforcement learning.