Starred repositories
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-time face swap for PC streaming or video calls
Easily train a good VC model with voice data <= 10 mins!
An open-source PAM tool alternative to CyberArk. 广受欢迎的开源堡垒机。
We write your reusable computer vision tools. 💜
A generative world for general-purpose robotics & embodied AI learning.
Zulip server and web application. Open-source team chat that helps teams stay productive and focused.
A modular graph-based Retrieval-Augmented Generation (RAG) system
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
An open-source RAG-based tool for chatting with your documents.
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
100+ Chinese Word Vectors 上百种预训练中文词向量
📺IPTV电视直播源更新项目『✨秒播级体验🚀』:支持IPv4/IPv6;支持自定义频道;支持本地源、组播源、酒店源、订阅源、关键字搜索;每天自动更新两次,结果可用于TVBox等播放软件;支持工作流、Docker(amd64/arm64/arm v7)、命令行、GUI运行方式 | IPTV live TV source update project
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频