YangRui2015

🎯

Focusing

Rui YangRui2015

Do less and do better.

84 followers · 9 following

Lists (1)

✨ Inspiration

Stars

71 results for source starred repositories

FellouAI / eko

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 2,103 117 Updated Feb 5, 2025

OffDynamicsRL / off-dynamics-rl

Python 34 2 Updated Nov 22, 2024

qiancheng0 / EscapeBench

This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box

Python 9 2 Updated Dec 19, 2024

thu-ml / RoboticsDiffusionTransformer

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 835 78 Updated Dec 24, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 23,549 1,997 Updated Feb 5, 2025

chernyadev / bigym

Demo-Driven Mobile Bi-Manual Manipulation Benchmark.

Python 135 18 Updated Jan 5, 2025

ScalerLab / JudgeBench

Python 52 2 Updated Nov 7, 2024

ASTRAL-Group / SVIP_LLM_Inference_Verification

Python 9 1 Updated Nov 1, 2024

DynaMath / DynaMath

A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Python 17 1 Updated Nov 25, 2024

seohongpark / ogbench

OGBench: Benchmarking Offline Goal-Conditioned RL

Python 102 22 Updated Oct 29, 2024

YangRui2015 / Generalizable-Reward-Model

Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

Python 20 1 Updated Dec 4, 2024

Violet24K / HowToUIUC

Guide for surviving at UIUC (under development)

56 8 Updated Oct 6, 2024

WindyLab / LLM-RL-Papers

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

276 16 Updated Sep 12, 2024

zhyang2226 / DMBP

Official implementation of Diffusion Model-Based Predictor (DMBP), presented at ICLR 2024.

Python 8 Updated May 24, 2024

Relaxed-System-Lab / HKUST-COMP4901Y-2024spring

Course Material for the UG Course COMP4901Y

Python 52 4 Updated May 12, 2024

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,715 120 Updated Dec 6, 2024

umair-nasir14 / Word2World

Word2World is an LLM-based PCG system that creates playable 2D worlds from stories

Python 45 5 Updated Nov 26, 2024

2toinf / DecisionNCE

[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Python 71 1 Updated Sep 26, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,133 80 Updated Jan 22, 2025

lyy1994 / awesome-data-contamination

The Paper List on Data Contamination for Large Language Models Evaluation.

89 3 Updated Jan 10, 2025

YangRui2015 / RiC

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python 55 4 Updated Dec 24, 2024

LetheSec / HuggingFace-Download-Accelerator

Use HuggingFace's official download tool for high-speed downloads from a mirror site.

Python 941 84 Updated Oct 12, 2024

tianyi-lab / HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 269 8 Updated Nov 13, 2024

YangRui2015 / RIQL

Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"

Python 13 Updated Nov 25, 2024

research4pan / Plum

Prompt Learning using Metaheuristics

Python 136 5 Updated Feb 13, 2024

nicklashansen / tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 439 103 Updated Jan 21, 2025

tencent-ailab / Frequency_Aug_VAE_MoESR

Latent-based SR using MoE and frequency augmented VAE decoder

Python 153 4 Updated Nov 26, 2023

YangRui2015 / UWMSG

Python 2 2 Updated Oct 18, 2023

pcchenxi / LAPO-offlienRL

Python 14 1 Updated Jun 5, 2023

liuqh16 / LAG

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 318 102 Updated Jan 23, 2025