mschweizer

Follow

Marvin Schweizer mschweizer

Follow

3 followers · 4 following

Karlsruhe Institute of Technology
Karlsruhe

Stars

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,502 116 Updated Jan 17, 2025

erikgahner / PolData

A dataset with political datasets

R 647 87 Updated Jan 25, 2025

cgranier / readmbox

A python utility to read an mbox email file and output selected data to a csv file

Python 2 Updated Mar 6, 2020

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 20,878 6,063 Updated Jul 13, 2023

dulpneto / exponential_rs_rl

Python 1 Updated Sep 14, 2023

ryan-p-randall / monthly-planning-files

Text files to help plan & log whatever it is you do. Bullet journal + pomodoro technique + text editors + cloud syncing = progress.

15 3 Updated Aug 7, 2021

timokau / prefq

Work in progress, not ready for use. -- Getting preference feedback from real humans.

Python 1 1 Updated Jun 11, 2024

timokau / marvin-mk2

Discontinued! See https://github.com/timokau/marvin-mk2/issues/34#issuecomment-1100656280 (Previously: "Making sure your PR gets a review and your reviews don't get lost.")

Python 19 9 Updated Apr 16, 2022

jax-ml / jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,133 2,877 Updated Feb 2, 2025

rk1a / imitation

Forked from HumanCompatibleAI/imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1 1 Updated May 28, 2024

Farama-Foundation / Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Python 2,168 617 Updated Jan 30, 2025

minerllabs / basalt-2022-behavioural-cloning-baseline

Simple behavioural cloning baseline solution for BASALT 2022

Python 29 20 Updated Nov 3, 2022

HumanCompatibleAI / seals

Benchmark environments for reward modelling and imitation learning algorithms.

Python 45 6 Updated Sep 19, 2023

mrahtz / learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Python 314 69 Updated Nov 29, 2021

TorchEnsemble-Community / Ensemble-Pytorch

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Python 1,008 95 Updated Jun 16, 2024

sohamghosh121 / PacmanGym

Open AI Gym version of Berkeley AI Pacman with images as states

Python 13 10 Updated May 4, 2018

tychovdo / PacmanDQN

Deep Reinforcement Learning in Pac-man

Python 280 125 Updated Apr 26, 2024

MartinThoma / banana-gym

A simple stochastic OpenAI environment for training RL agents

Python 89 34 Updated Feb 8, 2023

Kojoley / atari-py

Forked from openai/atari-py

An `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.

C++ 183 35 Updated Jan 5, 2022

github / gitignore

A collection of useful .gitignore templates

164,071 83,109 Updated Jan 29, 2025

dirty-data-science / python

Tutorial material on machine learning with dirty data in Python

Python 62 8 Updated Jul 7, 2024

causalincentives / pycid

Library for graphical models of decision making, based on pgmpy and networkx

Jupyter Notebook 101 15 Updated Sep 19, 2023

araffin / rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Jupyter Notebook 633 119 Updated Jun 12, 2023

AllenDowney / ThinkComplexity2

Book and code for Think Complexity, 2nd edition

Jupyter Notebook 746 668 Updated Sep 22, 2024

AllenDowney / ThinkComplexity

Code for Allen Downey's book Think Complexity, published by O'Reilly Media.

Jupyter Notebook 104 81 Updated Oct 1, 2024

Factual / drake

Data workflow tool, like a "Make for data"

Clojure 1,479 110 Updated Apr 12, 2022

jayphelps / git-blame-someone-else

Blame someone else for your bad code.

Shell 11,030 264 Updated Dec 4, 2023

HumanCompatibleAI / imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Python 1,388 257 Updated Jan 7, 2025

replicate / keepsake

Version control for machine learning

Python 1,655 72 Updated Jan 28, 2025

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,444 2,267 Updated Aug 5, 2024