Starred repositories
Command-line program to download videos from YouTube.com and other video sites
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
💖 High available distributed ip proxy pool, powerd by Scrapy and Redis
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Production First and Production Ready End-to-End Speech Recognition Toolkit
CODO是一款为用户提供企业多混合云、全球一站式DevOps、自动化运维、完全开源的云管理平台、自动化运维平台
Addon scripts, plugins, and skins for XBMC Media Center. Special for chinese laguage.
Machine Learning Project to identify an ID Card on an image