Stars
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
Large, modern dataset for speech recognition
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Robust Speech Recognition via Large-Scale Weak Supervision
Multilingual Voice Understanding Model
OK影视、tvbox配置文件,如果喜欢,请Fork自用。使用前请仔细阅读仓库说明,一旦使用将被视为你已了解。
自动收集的IPv4酒店电视直播源,自动测试播放速度,每日自动更新。 有CCTV央视卫视频道,及部分地方频道,播放流畅。也可在openwrt或群辉的docker运行。
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Tools for handling speech data in machine learning projects.
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
Server for https://github.com/fippo/rtcstats
KDE Plasma Desktop container designed for Kubernetes, supporting OpenGL EGL and GLX, Vulkan, and Wine/Proton for NVIDIA GPUs through WebRTC and HTML5, providing an open-source remote cloud/HPC grap…
Run GUI applications and desktops in docker and podman containers. Focus on security.
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
📝A simple and elegant markdown editor, available for Linux, macOS and Windows.
Package gorilla/websocket is a fast, well-tested and widely used WebSocket implementation for Go.