Lists (1)
Sort Oldest
Stars
Learning Flow Fields in Attention for Controllable Person Image Generation
Implement Region Attention for Flux model
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
MambaMIM: Pre-training Mamba with State Space Token-interpolation
Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package.
A powerful tool that translates ComfyUI workflows into executable Python code.
ControlNet++: All-in-one ControlNet for image generations and editing!
Enjoy the magic of Diffusion models!
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
VMamba: Visual State Space Models,code is based on mamba
[ECCV-2024] This is the official implementation of ZeST.
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
Scaling RWKV-Like Architectures for Diffusion Models
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫