Stars
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
A generative world for general-purpose robotics & embodied AI learning.
Reverse Engineering: Decompiling Binary Code with Large Language Models
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Driver and tools for controlling Lenovo Legion laptops in Linux including fan control and power mode.
Collection of leaked system prompts
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Open source code for AlphaFold 2.
Official Code for DragGAN (SIGGRAPH 2023)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Examples and guides for using the OpenAI API
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
TART: A plug-and-play Transformer module for task-agnostic reasoning
Fast & Simple repository for pre-training and fine-tuning T5-style models
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Segment Anything for Stable Diffusion WebUI
Port of OpenAI's Whisper model in C/C++
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/