Skip to content
View xinntao's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TencentARC @XPixelGroup

Block or report xinntao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,208 62 Updated Feb 19, 2025

Improving Video Generation with Human Feedback

Python 100 Updated Feb 12, 2025

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 261 9 Updated Jan 15, 2025

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 299 14 Updated Feb 7, 2025

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 492 15 Updated Dec 11, 2024

[ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation

85 Updated Dec 11, 2024

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 152 4 Updated Nov 8, 2024

Excalidraw app for mac. Powered by pure SwiftUI.

Swift 348 21 Updated Feb 21, 2025

Let your Claude able to think

TypeScript 14,378 1,685 Updated Jan 23, 2025

Deep Reinforcement Learning

3,576 607 Updated Dec 10, 2022

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,732 124 Updated Dec 6, 2024

Next-Token Prediction is All You Need

Python 2,003 78 Updated Oct 24, 2024

Kolors Team

Python 4,196 316 Updated Nov 13, 2024

Bring portraits to life!

Python 14,098 1,517 Updated Feb 13, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 834 32 Updated Feb 19, 2025

A PyTorch native library for large model training

Python 3,331 280 Updated Feb 21, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 405 11 Updated Sep 2, 2024
Python 355 15 Updated Oct 21, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 3,641 393 Updated Jan 3, 2025

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,207 378 Updated Feb 17, 2025

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,489 121 Updated Dec 17, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 20,038 5,956 Updated Feb 12, 2025

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 573 24 Updated Oct 25, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 895 71 Updated Nov 7, 2023

A simple HTML visualization tool for computer vision research 🛠️

Python 242 15 Updated Feb 13, 2025

Transparent Image Layer Diffusion using Latent Transparency

2,072 30 Updated Jun 16, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,778 225 Updated Sep 8, 2024

ICLR 2024 (Spotlight)

Python 745 20 Updated Mar 2, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,796 777 Updated Oct 31, 2024
Next