Skip to content
View YangRui2015's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@DynaMath

Block or report YangRui2015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
71 results for source starred repositories
Clear filter

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 2,103 117 Updated Feb 5, 2025

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 9 2 Updated Dec 19, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 835 78 Updated Dec 24, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 23,549 1,997 Updated Feb 5, 2025

Demo-Driven Mobile Bi-Manual Manipulation Benchmark.

Python 135 18 Updated Jan 5, 2025
Python 52 2 Updated Nov 7, 2024

A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Python 17 1 Updated Nov 25, 2024

OGBench: Benchmarking Offline Goal-Conditioned RL

Python 102 22 Updated Oct 29, 2024

Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

Python 20 1 Updated Dec 4, 2024

Guide for surviving at UIUC (under development)

56 8 Updated Oct 6, 2024

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

276 16 Updated Sep 12, 2024

Official implementation of Diffusion Model-Based Predictor (DMBP) presented in ICLR2024.

Python 8 Updated May 24, 2024

Course Material for the UG Course COMP4901Y

Python 52 4 Updated May 12, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,715 120 Updated Dec 6, 2024

Word2World is an LLM-based PCG system that creates playable 2D world from stories

Python 45 5 Updated Nov 26, 2024

[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Python 71 1 Updated Sep 26, 2024

Recipes to train reward model for RLHF.

Python 1,133 80 Updated Jan 22, 2025

The Paper List on Data Contamination for Large Language Models Evaluation.

89 3 Updated Jan 10, 2025

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python 55 4 Updated Dec 24, 2024

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

Python 941 84 Updated Oct 12, 2024

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 269 8 Updated Nov 13, 2024

Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"

Python 13 Updated Nov 25, 2024

Prompt Learning using Metaheuristics

Python 136 5 Updated Feb 13, 2024

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 439 103 Updated Jan 21, 2025

Latent-based SR using MoE and frequency augmented VAE decoder

Python 153 4 Updated Nov 26, 2023
Python 2 2 Updated Oct 18, 2023
Python 14 1 Updated Jun 5, 2023

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 318 102 Updated Jan 23, 2025
Next