Shangding Gu chauncygu

Safe learning researcher

123 followers · 16 following

Berkeley, USA
https://people.eecs.berkeley.edu/~shangding.gu/

chauncygu Public

Updated Feb 28, 2025
gshangd.github.io Public

HTML Updated Feb 21, 2025
omnisafe Public
Forked from PKU-Alignment/omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python Apache License 2.0 Updated Feb 18, 2025
Safe-Reinforcement-Learning-Baselines Public

The repository is for safe reinforcement learning baselines.

reinforcement-learning robotics safety baseline safe-reinforcement-learning safe-robot-learning

Jupyter Notebook 595 86 Updated Jan 27, 2025
Safe-Multi-Agent-Isaac-Gym Public

Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.

benchmark robotics multi-agent-reinforcement-learning safe-reinforcement-learning

Python 58 9 Apache License 2.0 Updated Jan 10, 2025
semikong Public
Forked from aitomatic/semikong

First Open-Source Industry-Specific Model for Semiconductors

Python Apache License 2.0 Updated Nov 22, 2024
openr Public
Forked from openreasoner/openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python MIT License Updated Oct 23, 2024
sample-efficient-rl Public

Updated Aug 5, 2024
ray Public
Forked from ray-project/ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python Apache License 2.0 Updated Jul 17, 2024
Safe-Multi-Agent-Robosuite Public

Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.

benchmark robotics multi-agent-reinforcement-learning safe-reinforcement-learning

Python 17 4 MIT License Updated Jun 13, 2024
Safe-Multi-Agent-Mujoco Public

Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.

benchmark reinforcement-learning robotics safe

Python 57 10 MIT License Updated Jun 13, 2024
safety-gymnasium Public
Forked from PKU-Alignment/safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python Apache License 2.0 Updated May 14, 2024
tianshou Public
Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

Python MIT License Updated May 8, 2024
WizardLM Public
Forked from nlpxucan/WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python Updated May 2, 2024
Multi-Agent-Constrained-Policy-Optimisation Public

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

multi-agent-reinforcement-learning policy-optimization safe-reinforcement-learning

Python 161 27 Other Updated Apr 17, 2024
DexterousHands Public
Forked from PKU-MARL/DexterousHands

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Python Apache License 2.0 Updated Feb 7, 2024
Awesome-LLM-RL Public
Forked from 123penny123/Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

1 Updated Jan 23, 2024
alpaca_eval Public
Forked from tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook Apache License 2.0 Updated Aug 24, 2023
Grounding_LLMs_with_online_RL_work Public
Forked from flowersteam/Grounding_LLMs_with_online_RL

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python MIT License Updated Jun 29, 2023
Minigrid-work-python3.9 Public
Forked from Farama-Foundation/Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Python 1 Other Updated Jun 8, 2023
Minecraft-work-python3.8 Public
Forked from fogleman/Minecraft

Simple Minecraft-inspired program using Python and Pyglet

Python MIT License Updated Jun 7, 2023
fix-TEMPERA Public
Forked from tianjunz/TEMPERA

Python Updated Jun 6, 2023
tree-of-thought-llm Public
Forked from princeton-nlp/tree-of-thought-llm

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python MIT License Updated May 26, 2023
Safe-Policy-Optimization-Serial-Version Public
Forked from PKU-Alignment/Safe-Policy-Optimization

This is a benchmark repository for safe reinforcement learning algorithms

Python 2 Apache License 2.0 Updated May 13, 2023
mtenv Public
Forked from facebookresearch/mtenv

MultiTask Environments for Reinforcement Learning.

Python MIT License Updated May 9, 2023
ChatGPTAPIFree Public
Forked from ayaka14732/ChatGPTAPIFree

A simple and open-source proxy API that allows you to access OpenAI's ChatGPT API for free!

JavaScript Creative Commons Zero v1.0 Universal Updated Mar 27, 2023
README Public
Forked from guodongxiaren/README

An explanation of README file syntax, i.e., an introduction to GitHub Flavored Markdown syntax

The Unlicense Updated Mar 8, 2023
DB-Football Public
Forked from Shanghai-Digital-Brain-Laboratory/DB-Football

A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.

Python Other Updated Jan 24, 2023
rl_on_manifold Public
Forked from PuzeLiu/rl_on_manifold

Robot Reinforcement Learning on the Constraint Manifold

Python Updated Dec 15, 2022
TimeChamber-rl Public
Forked from inspirai/TimeChamber

A Massively Parallel Large Scale Self-Play Framework

Python MIT License Updated Oct 15, 2022
