Stars
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
VRChat Live Translation Application
A multi-platform application for memorizing Japanese language
PSX Lain's Low-Poly Model, Animation, PaperCraft.
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
tekigg / niconico-twitch
Forked from ThatKoffe/niconico-twitchNiconico themed overlay for Twitch chat
リアルタイムボイスチェンジャー Realtime Voice Changer
Script that organizes the Google Takeout archive into one big chronological folder
Twitch VOD tool which recovers all VODs including those that are sub only or deleted.
Adds Yeah! button to Twitter, essentially a public Like
Command-line program to download image galleries and collections from several image hosting sites
Timed subtitles of Serial Experiments Lain on PSX
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
OBS Studio - Free and open source software for live streaming and screen recording
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
discordier / sam
Forked from s-macke/SAMSoftware Automatic Mouth - Tiny Speech Synthesizer
A feature-rich command-line audio/video downloader
Auto search for movie/series on torrent, usenet, ddl, subtitles, streaming, predb and other sites. Adds links to IMDb pages from hundreds various sites. Adds movies/series to Radarr/Sonarr. Adds ex…
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
Easily train a good VC model with voice data <= 10 mins!
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A multi-voice TTS system trained with an emphasis on quality
Robust Speech Recognition via Large-Scale Weak Supervision
🔊 Text-Prompted Generative Audio Model