Skip to content
View Zhehui-Huang's full-sized avatar

Highlights

  • Pro

Block or report Zhehui-Huang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

Jupyter Notebook 43 3 Updated Feb 19, 2025
Python 12 2 Updated Aug 8, 2024

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Python 678 57 Updated Mar 3, 2025

Additional environments compatible with OpenAI gym

Python 137 50 Updated Jan 17, 2025

论文写作与资料分享

2,664 586 Updated Aug 7, 2022

Live posts of FeedBot on Libera.Chat

38 11 Updated Mar 3, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,438 712 Updated Mar 3, 2025

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,895 423 Updated Feb 12, 2025

Matplotlib styles for scientific plotting

Python 7,520 730 Updated Feb 21, 2025

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

Python 670 68 Updated Mar 2, 2025

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,780 495 Updated Jul 30, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,589 480 Updated Jan 8, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,074 1,529 Updated Jan 13, 2025

Some Conferences' accepted paper lists (including AI, ML, Robotic)

Python 1,040 75 Updated Jan 23, 2025

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 772 99 Updated Mar 3, 2025

An extension of the PyMARL codebase that includes additional algorithms and environment support

Python 568 151 Updated Sep 24, 2024

Reinforcement Learning Environments for Omniverse Isaac Gym

Python 937 227 Updated Jun 6, 2024

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,411 963 Updated Feb 27, 2025

Repository of continual learning papers

TeX 39 7 Updated Dec 30, 2021

Brain Agent for Large-Scale and Multi-Task Agent Learning

Python 94 14 Updated Jan 4, 2024

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Python 3,594 523 Updated Feb 19, 2025

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Jupyter Notebook 8,830 907 Updated Mar 3, 2025

⏰ AI conference deadline countdowns

JavaScript 5,782 1,001 Updated Sep 15, 2024

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Python 217 21 Updated Jul 10, 2022

Training scripts, training data, and experimental data for Neural Fly

Jupyter Notebook 168 42 Updated May 21, 2022

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,766 6,060 Updated Mar 3, 2025

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,209 798 Updated Feb 27, 2025

🌈谷粒-Chrome插件英雄榜, 为优秀的Chrome插件写一本中文说明书, 让Chrome插件英雄们造福人类~ ChromePluginHeroes, Write a Chinese manual for the excellent Chrome plugin, let the Chrome plugin heroes benefit the human~ 公众号「0加1」同步更新

JavaScript 22,196 2,282 Updated Dec 7, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 707 104 Updated Mar 23, 2024
Next