krantas

krantas

Stars

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,233 291 Updated Nov 5, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,528 4,298 Updated Aug 19, 2024

openlanguageprofiles / olp-en-cefrj

Open Language Profiles — English profile datasets from CEFR-J

109 21 Updated Mar 25, 2020

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,489 8,781 Updated Dec 1, 2024

acheong08 / ChatGPT

Reverse engineered ChatGPT API

Python 28,055 4,479 Updated Aug 2, 2023

xiangyuecn / Recorder

html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式，支持pc和Android、iOS部分浏览器、Hybrid App（提供Android iOS App源码）、微信，提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

JavaScript 4,974 1,042 Updated Oct 20, 2024

google / live-transcribe-speech-engine

Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with…

Java 1,442 214 Updated Jul 20, 2022

gunthercox / ChatterBot

ChatterBot is a machine learning, conversational dialog engine for creating chat bots

Python 14,136 4,449 Updated Apr 24, 2024

ypwhs / shanbay_google_image

扇贝单词图片助手

JavaScript 5 1 Updated Nov 21, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly