-
Peking University
- Shenzhen, Guangdong, China
- https://akaneqwq.github.io
Highlights
- Pro
Stars
AutoDL平台服务器适配梯子, 使用 Clash 作为代理工具
A generative world for general-purpose robotics & embodied AI learning.
[NeurIPS 2024] OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
HunyuanVideo: A Systematic Framework For Large Video Generation Model
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Code of "3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces"
assistant tools for attention visualization in deep learning
Official inference repo for FLUX.1 models
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[NeurIPS'22] This is an official implementation for "Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning".
This repo contains the code for 1D tokenizer and generator
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
[ICCV 2021] Authors official PyTorch implementation of the "WarpedGANSpace: Finding non-linear RBF paths in GAN latent space".
Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
[ECCV 2024] Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation - MMDMC Dataset
🔥 [ICLR 2025] FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
制作懂人情世故的大语言模型 | 涵盖提示词工程、RAG、Agent、LLM微调教程
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
unofficial implementation of Comfyui magic clothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis