Stars
TensorFlow code and pre-trained models for BERT
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
Path tracking with dynamic bicycle models
This is a search and optimization library
Code implementation for the NeurIPS 2022 paper "Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems".
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (A…