- http://physics.as.nyu.edu/page/home
- New York
- https://sites.google.com/nyu.edu/junzhicaospersonalwebsite/home
Starred repositories
DeepEP: an efficient expert-parallel communication library
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Janus-Series: Unified Multimodal Understanding and Generation Models
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
A generative world for general-purpose robotics & embodied AI learning.
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
A large-scale benchmark and learning environment.
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
🔊 Text-Prompted Generative Audio Model
Command-line program to download videos from YouTube.com and other video sites
A series of code large language models developed by PKU-KCL
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Real-time face swap and one-click video deepfake with only a single image
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
High-Resolution Image Synthesis with Latent Diffusion Models
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.