Bluixe

Wenhao Zhang Bluixe

11 followers · 16 following

Shanghai Jiao Tong University
Shanghai, China
wenhao.bluixe.cn

Highlights

Stars

UT-Austin-RPL / amago

a simple and scalable agent for training adaptive policies with sequence-based RL

Python 113 7 Updated Feb 20, 2025

TideDra / zotero-arxiv-daily

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 779 688 Updated Feb 28, 2025

codecrafters-io / build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Markdown 351,411 32,590 Updated Sep 3, 2024

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,709 129 Updated Jan 17, 2025

jon--lee / decision-pretrained-transformer

Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learning.

Python 59 9 Updated May 28, 2024

sjtu-marl / ZSC-Eval

This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. Pre-trained Agent Zoo: https://huggingface.co/Leoxxxxh/ZSC-Ev…

JavaScript 34 7 Updated Jan 13, 2025

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 83,735 11,744 Updated Mar 8, 2025

Healthcare-Robotics / assistive-gym

Assistive Gym, a physics-based simulation framework for physical human-robot interaction and robotic assistance.

Python 333 79 Updated Jan 26, 2024

eugenevinitsky / sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

Python 395 135 Updated Mar 6, 2025

histmeisah / Large-Language-Models-play-StarCraftII

TextStarCraft2,a pure language env which support llms play starcraft2

Python 250 18 Updated Dec 23, 2024

facebookresearch / nocturne

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 273 31 Updated Jun 18, 2024

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,279 391 Updated Mar 1, 2025

Stanford-ILIAD / Diverse-Conventions

Exploring techniques to generate diverse conventions in multi-agent settings

JavaScript 12 Updated Nov 14, 2023

yihuai-gao / genius-invokation-gym

原神七圣召唤模拟环境 Simulator of Genius Invocation

Python 48 11 Updated Apr 29, 2024

flick-ai / Genius-Invokation

七圣召唤强化学习环境

Python 35 4 Updated Sep 6, 2024

PKU-MARL / HARL

Official implementation of HARL algorithms based on PyTorch.

Python 608 70 Updated Oct 8, 2024

liyang619 / COLE-Platform

Overcooked human-AI experiment platform

Python 37 4 Updated Dec 21, 2023

windingwind / zotero-pdf-translate

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,349 386 Updated Mar 3, 2025

datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Python 3,042 656 Updated Jun 26, 2024

debauchee / barrier

Forked from deskflow/deskflow

Open-source KVM software

C 28,235 1,544 Updated Jun 22, 2024

Python 28 5 Updated Aug 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wenhao Zhang Bluixe

Highlights

Block or report Bluixe

Stars

UT-Austin-RPL / amago

TideDra / zotero-arxiv-daily

codecrafters-io / build-your-own-x

openreasoner / openr

jon--lee / decision-pretrained-transformer

sjtu-marl / ZSC-Eval

rustdesk / rustdesk

Healthcare-Robotics / assistive-gym

eugenevinitsky / sequential_social_dilemma_games

histmeisah / Large-Language-Models-play-StarCraftII

facebookresearch / nocturne

opendilab / DI-engine

Stanford-ILIAD / Diverse-Conventions

yihuai-gao / genius-invokation-gym

flick-ai / Genius-Invokation

PKU-MARL / HARL

liyang619 / COLE-Platform

windingwind / zotero-pdf-translate

datamllab / rlcard

debauchee / barrier

google-research / rliable

garrettj403 / SciencePlots

FuxiRL / DunkCityDynasty

bsarkar321 / madrona_rl_envs

shacklettbp / madrona

samjia2000 / HSP

leehe228 / LogisticsEnv

chandar-lab / Lifelong-Hanabi

Shanghai-Digital-Brain-Laboratory / BDM-DB1

lil-lab / cb2