Skip to content
View guxu313's full-sized avatar

Highlights

  • Pro

Block or report guxu313

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"

Python 273 20 Updated Dec 13, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,778 2,290 Updated Aug 12, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,420 1,419 Updated Sep 5, 2024

Custom nodes for using MV-Adapter in ComfyUI.

Python 177 10 Updated Dec 20, 2024

The official Meta Llama 3 GitHub site

Python 27,563 3,142 Updated Aug 12, 2024

[ECCV-2024] This is the official implementation of ZeST.

Jupyter Notebook 374 24 Updated Sep 12, 2024

High-resolution models for human tasks.

Python 4,661 265 Updated Nov 18, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,143 359 Updated Aug 14, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,128 545 Updated Jul 17, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,471 847 Updated Aug 7, 2024
Python 274 19 Updated Aug 20, 2023

Easily create large video dataset from video urls

Python 554 66 Updated Jul 30, 2024

[NeurIPS D&B Track 2024] Official implementation of HumanVid

Python 267 3 Updated Oct 23, 2024

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Python 618 42 Updated Jul 17, 2024

Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema

TypeScript 2,100 197 Updated Nov 25, 2024

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,273 385 Updated Oct 25, 2023

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

JavaScript 609 49 Updated Sep 10, 2024
Python 58 4 Updated Oct 25, 2024

WebUI extension for ControlNet

Python 17,190 1,977 Updated Aug 12, 2024

[SIGGRAPH 2024] "EASI-Tex: Edge-Aware Mesh Texturing from Single Image", ACM Transactions on Graphics.

Python 113 9 Updated Jul 30, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,440 348 Updated Jun 28, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,727 5,503 Updated Dec 20, 2024

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,882 139 Updated Dec 20, 2023

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

Python 886 83 Updated Oct 12, 2024

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Jupyter Notebook 1,005 61 Updated Sep 21, 2023

T2I-Adapter

Python 3,518 212 Updated Jun 21, 2024

Your image is almost there!

Python 7,404 425 Updated Jul 26, 2024

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance. [CVPR 2024] Official PyTorch implementation

Python 92 3 Updated Jul 31, 2024
Next