Stars
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
🔊 Text-Prompted Generative Audio Model
Open Language Profiles — English profile datasets from CEFR-J
Robust Speech Recognition via Large-Scale Weak Supervision
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with…
ChatterBot is a machine learning, conversational dialog engine for creating chat bots