Skip to content
View ChenDRAG's full-sized avatar

Block or report ChenDRAG

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.

Python 496 29 Updated Jan 30, 2025

Fully open reproduction of DeepSeek-R1

Python 13,103 988 Updated Jan 30, 2025
8 Updated Jan 26, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,327 451 Updated Jan 28, 2025

EDM2 and Autoguidance -- Official PyTorch implementation

Python 619 28 Updated Dec 9, 2024

Repo of paper "Free Process Rewards without Process Labels"

Python 110 2 Updated Jan 16, 2025

A curated list for awesome discrete diffusion models resources.

211 7 Updated Jan 19, 2025

The unitree_il_lerobot open-source project is a modification of the LeRobot open-source training framework, enabling the training and testing of data collected using the dual-arm dexterous hands of…

Python 168 16 Updated Nov 15, 2024

Next-Token Prediction is All You Need

Python 1,977 79 Updated Oct 24, 2024

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Python 110 8 Updated Dec 19, 2024

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,206 387 Updated Jan 27, 2025

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

Python 123 8 Updated Nov 5, 2024

Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"

Python 23 Updated Nov 18, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 823 78 Updated Dec 24, 2024

Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)

Python 12 Updated Oct 29, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 818 32 Updated Jan 22, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,256 68 Updated Sep 27, 2024

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,726 478 Updated Jan 26, 2025

This repository contains code for the paper Directo Preference Optimization with an Offset (ODPO).

Python 11 Updated Jul 19, 2024

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,303 198 Updated Jan 13, 2025

Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 22,419 1,560 Updated Jan 26, 2025

Code for ACL2024 paper - Adversarial Preference Optimization (APO).

Python 49 3 Updated Jun 3, 2024

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Python 1,948 369 Updated Dec 24, 2024

This is the official implementation for ControlVAR.

Python 91 3 Updated Dec 10, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 463 48 Updated Feb 29, 2024

Neo4j 大规模 三元组 CVS 导入进数据库

Python 10 Updated Jul 31, 2020

A quick visualization tool for Jupyter and Neo4J

Python 137 31 Updated May 12, 2020

基于neo4j肝病知识图谱的问答系统

Python 361 116 Updated May 21, 2019

Graphs for Everyone

Java 13,717 2,413 Updated Jan 10, 2025
Next