hysts

ML Engineer

274 followers · 218 following

https://huggingface.co/hysts

Achievements

x2 x3

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

FunAudioLLM / InspireMusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 202 15 Updated Dec 12, 2024

freddyaboulton / gradio-webrtc

Realtime Video and Audio Streaming with WebRTC and Gradio

Python 102 13 Updated Dec 12, 2024

hustvl / ControlAR

Official code for "ControlAR: Controllable Image Generation with Autoregressive Models"

Python 153 4 Updated Dec 12, 2024

fallenshock / FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 47 1 Updated Dec 12, 2024

baaivision / See3D

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 345 8 Updated Dec 11, 2024

hkchengrex / MMAudio

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 155 9 Updated Dec 11, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 965 33 Updated Dec 12, 2024

radames / gradio_huggingfacehub_search

CSS 2 Updated Dec 6, 2024

Mark12Ding / SAM2Long

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Jupyter Notebook 515 15 Updated Dec 9, 2024

1jsingh / negtome

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 61 2 Updated Dec 6, 2024

flymin / MagicDriveDiT

Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”

Python 181 7 Updated Dec 10, 2024

nv-tlabs / L4GM-official

[NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model

Python 115 5 Updated Dec 2, 2024

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 3,822 182 Updated Dec 7, 2024

JiuhaiChen / Florence-VL

Python 138 5 Updated Dec 7, 2024

jiah-cloud / Align3R

[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Python 207 6 Updated Dec 12, 2024

TIGER-AI-Lab / OmniEdit

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"

56 1 Updated Dec 7, 2024

TencentARC / NVComposer

Boosting Generative Novel View Synthesis with Sparse and Unposed Images

Python 40 1 Updated Dec 9, 2024

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 868 46 Updated Dec 10, 2024

ByteFlow-AI / TokenFlow

🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 158 1 Updated Dec 12, 2024

IDEA-Research / TAPTR

[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

240 12 Updated Dec 11, 2024

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 3,041 849 Updated Dec 12, 2024

lehduong / OneDiffusion

Python 464 11 Updated Dec 12, 2024

yael-vinker / SketchAgent

Python 80 4 Updated Dec 6, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 5,602 390 Updated Dec 12, 2024

NIRVANALAN / GaussianAnything

High-quality and editable surfel Gaussian generation through native 3D diffusion.

Python 194 10 Updated Dec 12, 2024

yformer / EfficientTAM

Efficient Track Anything

Python 363 9 Updated Dec 12, 2024

gwang-kim / PersonaCraft

PyTorch implementation of "PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion"

37 Updated Dec 2, 2024

prs-eth / RollingDepth

Video Depth without Video Models

Python 339 10 Updated Dec 9, 2024

jasongzy / Make-It-Animatable

Official implementation of "Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters"

152 5 Updated Dec 5, 2024

wangjiangshan0725 / RF-Solver-Edit

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 306 7 Updated Dec 5, 2024