Starred repositories
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Source code for the privacy radar app (android/ohos)
Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
Mirai is a Server-Driven UI (SDUI) framework for Flutter. Mirai allows you to build beautiful cross-platform applications with JSON in real time.
FaceLift: Single Image to 3D Head with View Generation and GS-LRM
A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Enhance-A-Video: Better Generated Video for Free
Top ideas for a particular compmay to build some crazy projects.
A generative world for general-purpose robotics & embodied AI learning.
Flutter Grocery Shopping App (Mobile App, Web App)
🛍 A full E-commerce app with nice UI consists of on-boarding, login, sign-up, home, product details, cart and user profile.
Japanese-English dictionary app powered by Jisho.org.
美观易用且无广告的漫画和轻小说客户端, 同时支持MacOS,Windows,Android,iOS。类似动漫之家。
Cloud Gallery — An open-source Flutter project to easily manage, organize, and back up your photos and videos across local storage, Google Drive, and Dropbox, all in one place.
A mobile POS app written with Flutter, compatible Sunmi device
💸 An app created to help users manage a budget and purchases
A curated list of awesome open source Flutter apps
An open source flutter taxi - app for learning purpose(Provider & Bloc) using firebase as backend/server
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer