-
Tsinghua University
- Beijing
- https://chendrag.github.io/
Stars
RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
EDM2 and Autoguidance -- Official PyTorch implementation
Repo of paper "Free Process Rewards without Process Labels"
A curated list for awesome discrete diffusion models resources.
The unitree_il_lerobot open-source project is a modification of the LeRobot open-source training framework, enabling the training and testing of data collected using the dual-arm dexterous hands of…
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
SEED-Voken: A Series of Powerful Visual Tokenizers
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
This repository contains code for the paper Directo Preference Optimization with an Offset (ODPO).
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Code for ACL2024 paper - Adversarial Preference Optimization (APO).
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
This is the official implementation for ControlVAR.
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Neo4j 大规模 三元组 CVS 导入进数据库
A quick visualization tool for Jupyter and Neo4J