Skip to content
View ZhizhenQin's full-sized avatar

Highlights

  • Pro

Block or report ZhizhenQin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
13 stars written in Python
Clear filter

TensorFlow code and pre-trained models for BERT

Python 38,715 9,665 Updated Jul 23, 2024

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,050 4,886 Updated Aug 1, 2024

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Python 9,969 2,474 Updated Sep 22, 2022

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Python 3,591 523 Updated Feb 19, 2025

Soft Actor-Critic

Python 1,042 236 Updated Nov 29, 2023

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 870 125 Updated Oct 1, 2024

大麦网演唱会抢票程序

Python 419 132 Updated Dec 29, 2019

Path tracking with dynamic bicycle models

Python 97 18 Updated Jun 9, 2020

This is a search and optimization library

Python 23 5 Updated Apr 25, 2023

Fault-Tolerant Neural CBF

Python 10 3 Updated Feb 23, 2024

Code implementation for the NeurIPS 2022 paper "Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems".

Python 7 5 Updated Apr 16, 2023

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (A…

Python 2 1 Updated Nov 7, 2018
Python 1 1 Updated Jan 31, 2021