Stars
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Multilingual Voice Understanding Model
CLIP+MLP Aesthetic Score Predictor
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.
Python based web automation tool. Powerful and elegant.
Vue3 + Pinia 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
An open file format for infinite canvas data.
LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator.
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
Reference implementation for DPO (Direct Preference Optimization)
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
800,000 step-level correctness labels on LLM solutions to MATH problems
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
The fastest pure-Python PEG parser I can muster
Language Modeling with the H3 State Space Model
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Official PyTorch implementation of StyleGAN3