- Berkeley, USA
- https://people.eecs.berkeley.edu/~shangding.gu/
-
-
-
omnisafe Public
Forked from PKU-Alignment/omnisafeJMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Python Apache License 2.0 UpdatedFeb 18, 2025 -
The repository is for safe reinforcement learning baselines.
-
Safe-Multi-Agent-Isaac-Gym Public
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
-
semikong Public
Forked from aitomatic/semikongFirst Open-Source Industry-Specific Model for Semiconductors
Python Apache License 2.0 UpdatedNov 22, 2024 -
openr Public
Forked from openreasoner/openrOpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Python MIT License UpdatedOct 23, 2024 -
-
ray Public
Forked from ray-project/rayRay is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 UpdatedJul 17, 2024 -
Safe-Multi-Agent-Robosuite Public
Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.
-
Safe-Multi-Agent-Mujoco Public
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
-
safety-gymnasium Public
Forked from PKU-Alignment/safety-gymnasiumNeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Python Apache License 2.0 UpdatedMay 14, 2024 -
tianshou Public
Forked from thu-ml/tianshouAn elegant PyTorch deep reinforcement learning library.
Python MIT License UpdatedMay 8, 2024 -
WizardLM Public
Forked from nlpxucan/WizardLMLLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Python UpdatedMay 2, 2024 -
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
-
DexterousHands Public
Forked from PKU-MARL/DexterousHandsThis is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
Python Apache License 2.0 UpdatedFeb 7, 2024 -
Awesome-LLM-RL Public
Forked from 123penny123/Awesome-LLM-RLA comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
1 UpdatedJan 23, 2024 -
alpaca_eval Public
Forked from tatsu-lab/alpaca_evalAn automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook Apache License 2.0 UpdatedAug 24, 2023 -
Grounding_LLMs_with_online_RL_work Public
Forked from flowersteam/Grounding_LLMs_with_online_RLWe perform functional grounding of LLMs' knowledge in BabyAI-Text
Python MIT License UpdatedJun 29, 2023 -
Minigrid-work-python3.9 Public
Forked from Farama-Foundation/MinigridSimple and easily configurable grid world environments for reinforcement learning
-
Minecraft-work-python3.8 Public
Forked from fogleman/MinecraftSimple Minecraft-inspired program using Python and Pyglet
Python MIT License UpdatedJun 7, 2023 -
-
tree-of-thought-llm Public
Forked from princeton-nlp/tree-of-thought-llmTree of Thoughts: Deliberate Problem Solving with Large Language Models
Python MIT License UpdatedMay 26, 2023 -
Safe-Policy-Optimization-Serial-Version Public
Forked from PKU-Alignment/Safe-Policy-OptimizationThis is a benchmark repository for safe reinforcement learning algorithms
-
mtenv Public
Forked from facebookresearch/mtenvMultiTask Environments for Reinforcement Learning.
Python MIT License UpdatedMay 9, 2023 -
ChatGPTAPIFree Public
Forked from ayaka14732/ChatGPTAPIFreeA simple and open-source proxy API that allows you to access OpenAI's ChatGPT API for free!
JavaScript Creative Commons Zero v1.0 Universal UpdatedMar 27, 2023 -
README Public
Forked from guodongxiaren/READMEREADME文件语法解读,即Github Flavored Markdown语法介绍
The Unlicense UpdatedMar 8, 2023 -
DB-Football Public
Forked from Shanghai-Digital-Brain-Laboratory/DB-FootballA Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
Python Other UpdatedJan 24, 2023 -
rl_on_manifold Public
Forked from PuzeLiu/rl_on_manifoldRobot Reinforcement Learning on the Constraint Manifold
Python UpdatedDec 15, 2022 -
TimeChamber-rl Public
Forked from inspirai/TimeChamberA Massively Parallel Large Scale Self-Play Framework
Python MIT License UpdatedOct 15, 2022