Skip to content
View mschweizer's full-sized avatar
  • Karlsruhe Institute of Technology
  • Karlsruhe

Block or report mschweizer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,502 116 Updated Jan 17, 2025

A dataset with political datasets

R 647 87 Updated Jan 25, 2025

A python utility to read an mbox email file and output selected data to a csv file

Python 2 Updated Mar 6, 2020

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,878 6,063 Updated Jul 13, 2023
Python 1 Updated Sep 14, 2023

Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress.

15 3 Updated Aug 7, 2021

Work in progress, not ready for use. -- Getting preference feedback from real humans.

Python 1 1 Updated Jun 11, 2024

Discontinued! See https://github.com/timokau/marvin-mk2/issues/34#issuecomment-1100656280 (Previously: "Making sure your PR gets a review and your reviews don't get lost.")

Python 19 9 Updated Apr 16, 2022

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,133 2,877 Updated Feb 2, 2025

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1 1 Updated May 28, 2024

Simple and easily configurable grid world environments for reinforcement learning

Python 2,168 617 Updated Jan 30, 2025

Simple behavioural cloning baseline solution for BASALT 2022

Python 29 20 Updated Nov 3, 2022

Benchmark environments for reward modelling and imitation learning algorithms.

Python 45 6 Updated Sep 19, 2023

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Python 314 69 Updated Nov 29, 2021

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Python 1,008 95 Updated Jun 16, 2024

Open AI Gym version of Berkeley AI Pacman with images as states

Python 13 10 Updated May 4, 2018

Deep Reinforcement Learning in Pac-man

Python 280 125 Updated Apr 26, 2024

A simple stochastic OpenAI environment for training RL agents

Python 89 34 Updated Feb 8, 2023

An `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.

C++ 183 35 Updated Jan 5, 2022

A collection of useful .gitignore templates

164,071 83,109 Updated Jan 29, 2025

Tutorial material on machine learning with dirty data in Python

Python 62 8 Updated Jul 7, 2024

Library for graphical models of decision making, based on pgmpy and networkx

Jupyter Notebook 101 15 Updated Sep 19, 2023

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Jupyter Notebook 633 119 Updated Jun 12, 2023

Book and code for Think Complexity, 2nd edition

Jupyter Notebook 746 668 Updated Sep 22, 2024

Code for Allen Downey's book Think Complexity, published by O'Reilly Media.

Jupyter Notebook 104 81 Updated Oct 1, 2024

Data workflow tool, like a "Make for data"

Clojure 1,479 110 Updated Apr 12, 2022

Blame someone else for your bad code.

Shell 11,030 264 Updated Dec 4, 2023

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1,388 257 Updated Jan 7, 2025

Version control for machine learning

Python 1,655 72 Updated Jan 28, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,444 2,267 Updated Aug 5, 2024
Next