Stars
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Trigger.dev is the open source background jobs platform.
Face registration and recognition system built with Docker, Uvicorn+FastAPI, Milvus, Redis, and mariadb-mysql
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Easily take an entire YouTube playlist and turn it into high-quality transcripts using Whisper.
smol-podcaster is your podcast production agent 🎙️
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Private Server and Remote JavaScript Console App
An open source application built using the new router, server components and everything new in Next.js 13.
A full-featured, hackable Next.js AI chatbot built by Vercel
A navigation UI ready to drop into your React Native application
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references und…
a fresh, modern & lightweight HTML5 game engine
Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platform
fast-stable-diffusion + DreamBooth
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Examples and guides for using the OpenAI API
Neural style transfer in PyTorch.