Stars
Efficient and general syntactical decoding for Large Language Models
azooKey: A Japanese Keyboard iOS Application Fully Developed in Swift
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
This is a repository for all workshop related materials.
170,000 player capable Minecraft game engine built in Rust.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
High-resolution models for human tasks.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Media Forensics / Fake Detection experiments in PyTorch. Implements Fighting Fake News: Image Splice Detection via Learned Self-Consistency
Exercises for exploring the Fibertree, Timeloop and Accelergy tools
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023
Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.
Code for "Detector-Free Structure from Motion", CVPR 2024
Source code for ARM side libraries for interfacing to Raspberry Pi GPU.
Official repository of Evolutionary Optimization of Model Merging Recipes
[WIP] Layer Diffusion for WebUI (via Forge)
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Official Code for Stable Cascade
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
Dino V2 for Classification, PCA Visualization, Instance Retrival: https://arxiv.org/abs/2304.07193
Luma Interactive Scenes (captures) Web Examples, use lumalabs.ai captures directly in your three.js or other WebGL projects!
3D Gaussian Splatting Renderer for WebGL