-
Pandrator Public
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant,…
-
whisperX_silero Public
Forked from m-bain/whisperXWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) with support for Silero VAD.
Python BSD 2-Clause "Simplified" License UpdatedFeb 8, 2025 -
easy_xtts_trainer Public
A command line utility to easily finetune XTTS models in a fully automated way. Developed for Pandrator.
-
Subdub Public
A command line Python app offering a video-to-dubbed-video workflow with transcription, translation and synchronisation. Developed for Pandrator.
-
breath-removal Public
Detect and remove or lower the volume of breathing in speech recordings.
-
LLM_Chat_Repo_Context Public
A tool for extracting and formatting repositories as plain text to share with Large Language Models in chat interfaces without direct repo access or API integration.
-
PyCropPDF Public
A small GUI app that overlays all pages of a PDF with transparency and enables batch cropping for all or even and odd pages separately. Intended to help safely remove headers and footers or margins…
-
RVC_CLI Public
Forked from blaisewf/rvc-cliRVC CLI enables seamless interaction with Retrieval-based Voice Conversion through commands or HTTP requests.
-
silero-api-server Public
Forked from ouoertheo/silero-api-server -
VoiceCraft_API Public
Forked from jasonppy/VoiceCraftWindows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild
-
NISQA-API Public
Forked from gabrielmittag/NISQAFastAPI implementation for NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment