Stars
A latent text-to-image diffusion model
🔊 Text-Prompted Generative Audio Model
Instruct-tune LLaMA on consumer hardware
StableLM: Stability AI Language Models
stable diffusion webui colab
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.