
Stars
Stable Diffusion web UI
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Command-line program to download videos from YouTube.com and other video sites
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The official gpt4free repository | various collection of powerful language models | o3 and deepseek r1, gpt-4.5
real time face swap and one-click video deepfake with only a single image
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Automatically generate and overlay subtitles for any video.
Verify the configuration of your OS X machine.
SALMONN: Speech Audio Language Music Open Neural Network
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
A tool for scanning and resolving DNS names into IP addresses