Starred repositories
ASCII generator (image to text, image to image, video to video)
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
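"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing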
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Python scripts that are short but useful or interesting
A generative speech model for daily dialogue.
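Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler, Baidu Tieba post and comment-reply crawler, Zhihu Q&A article and comment crawler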
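AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks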
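OCR software, free and offline. Open-source, free offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in libraries for multiple languages.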
💿 Free software that works great, and also happens to be open-source Python.
21 Lessons, Get Started Building with Generative AI 🔗
Generate high-definition short videos with one click using large AI models (LLMs).
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Open-Sora: Democratizing Efficient Video Production for All
[WIP] Layer Diffusion for WebUI (via Forge)
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Taming Transformers for High-Resolution Image Synthesis
Stable Diffusion web UI
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Industry-leading face manipulation platform
[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis
Hardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"
[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models