-
whisper-timestamped Public
Forked from linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Python GNU Affero General Public License v3.0 Updated Jan 26, 2024 -
willow-inference-server Public
Forked from toverainc/willow-inference-server
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
Python Apache License 2.0 Updated Jan 19, 2024 -
stash-box Public
Forked from stashapp/stash-box
Stash App's own OpenSource video indexing and Perceptual Hashing MetaData API
TypeScript MIT License Updated Jan 10, 2024 -
yt-dlp Public
Forked from yt-dlp/yt-dlp
A youtube-dl fork with additional features and fixes
Python The Unlicense Updated Jan 9, 2024 -
OpenVoice Public
Forked from myshell-ai/OpenVoice
Instant voice cloning by MyShell.
Python Other Updated Jan 3, 2024 -
ComfyUI Public
Forked from comfyanonymous/ComfyUI
The most powerful and modular stable diffusion GUI, API and backend with a graph/nodes interface.
Python GNU General Public License v3.0 Updated Jan 2, 2024 -
WAAS Public
Forked from schibsted/WAAS
Whisper as a Service (GUI and API with queuing for OpenAI Whisper)
JavaScript Apache License 2.0 Updated Dec 31, 2023 -
RVC-Studio Public
Forked from SayanoAI/RVC-Studio
The best looking and most functional webui for RVC related tasks. See website for UI demo:
Python MIT License Updated Dec 26, 2023 -
pyannote-audio Public
Forked from pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Jupyter Notebook MIT License Updated Dec 19, 2023 -
whisper.cpp Public
Forked from ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
-
bark Public
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Jupyter Notebook MIT License Updated Dec 14, 2023 -
PySceneDetect Public
Forked from Breakthrough/PySceneDetect
🎥 Python and OpenCV-based scene cut/transition detection program & library.
Python Other Updated Dec 9, 2023 -
TTS Public
Forked from coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 Updated Dec 5, 2023 -
text-generation-webui Public
Forked from oobabooga/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Python GNU Affero General Public License v3.0 Updated Dec 5, 2023 -
segment-anything Public
Forked from facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Jupyter Notebook Apache License 2.0 Updated Dec 4, 2023 -
demucs Public
Forked from facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Python MIT License Updated Dec 3, 2023 -
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
Python MIT License Updated Dec 1, 2023 -
ffmprovisr Public
Forked from amiaopensource/ffmprovisr
Repository of useful FFmpeg commands for archivists!
HTML Updated Nov 26, 2023 -
whisperX Public
Forked from coqui-ai/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Python BSD 4-Clause "Original" or "Old" License Updated Nov 15, 2023 -
tinydiarize Public
Forked from akashmjn/tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
Python MIT License Updated Nov 6, 2023 -
awesome-ai-music-generation Public
Forked from Curated-Awesome-Lists/awesome-ai-music-generation
A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.
-
flowframes Public
Forked from n00mkrad/flowframes
Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)
Python GNU General Public License v3.0 Updated Nov 3, 2023 -
ffmpeg-scripts Public
Forked from NapoleonWils0n/ffmpeg-scripts
ffmpeg shell scripts
Shell BSD 3-Clause "New" or "Revised" License Updated Sep 26, 2023 -
pyannote-core Public
Forked from pyannote/pyannote-core
Advanced data structures for handling temporal segments with attached labels.
Jupyter Notebook Other Updated Jul 17, 2023 -
scenecut-extractor Public
Forked from slhck/scenecut-extractor
Extract scenecuts from video files using ffmpeg
Python Other Updated May 17, 2023 -
MachineVideoEditor Public
Forked from MachineEditor/MachineVideoEditor
This repository does not contain code; its purpose is issue tracking and the wiki.
Updated May 2, 2023 -
photonix Public
Forked from photonixapp/photonix
A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recog…
Python GNU Affero General Public License v3.0 Updated Mar 4, 2023 -
DeepVideoAnalytics Public
Forked from ml-lab/DeepVideoAnalytics
Analyze videos, perform detections, index frames & detected objects, search by examples
JavaScript Updated Jan 31, 2017