Starred repositories
A programmer's guide to cooking at home (Simplified Chinese only).
🍃 JavaScript library for mobile-friendly interactive maps 🇺🇦
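Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments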
Free universal database tool and SQL client
A solution to visualize and explore 3D models in your browser.
Real-time face swap and one-click video deepfake with only a single image
Sync with https://github.com/MaaAssistantArknights/MaaAssistantArknights/tree/dev/resource
A high-throughput and memory-efficient inference and serving engine for LLMs
Interactively explore metagenomes and more from a web browser.
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Arknights assistant: a one-click tool for the daily tasks of Arknights, supporting all clients.
Open-Sora: Democratizing Efficient Video Production for All
Build AI-powered applications with React, Svelte, Vue, and Solid
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
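Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.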
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
An Open Source text-to-speech system built by inverting Whisper.
AI-powered speech denoising and enhancement
Robust Speech Recognition via Large-Scale Weak Supervision
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
LeafMachine2 is a modular suite of computer vision and machine learning algorithms that enables efficient identification, location, and measurement of vegetative, reproductive, and archival compone…