Lists (14)
Sort Newest
Stars
Writing AI Conference Papers: A Handbook for Beginners
Source Han Sans | 思源黑体 | 思源黑體 | 思源黑體 香港 | 源ノ角ゴシック | 본고딕
Project of AI3604 Computer Vision, 2023 Fall, SJTU
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Source code for ECCV20 "GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images"
Image-to-Image Translation in PyTorch
PyTorch implementations of Generative Adversarial Networks.
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
GGUF Quantization support for native ComfyUI models
A Collection of Variational Autoencoders (VAE) in PyTorch.
Visualizer for neural network, deep learning and machine learning models
Various AI scripts. Mostly Stable Diffusion stuff.
A general fine-tuning kit geared toward diffusion models.
collection of diffusion model papers categorized by their subareas
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Industry leading face manipulation platform
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
SoftVC VITS Singing Voice Conversion
SGLang is a fast serving framework for large language models and vision language models.
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"