Stars
Stable Diffusion web UI
Robust Speech Recognition via Large-Scale Weak Supervision
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
get things from one computer to another, safely
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[being rewritten] Cross-platform iMessage POC
🍏 + 🎯 + 🐍 = Everything you need to query Apple's FindMy network!
Unofficial Qualcomm Firehose / Sahara / Streaming / Diag Tools :)
A free Linux distro with a Python-based userspace
The RF and reverse engineering framework for everyone. Follow and ⭐ to show your support!
Bringing high-performance HTTP/HTTPS and WebSocket servers to PyPy3 and Python3
QCSuper is a tool that communicates with Qualcomm-based phones and modems, allowing the capture of raw 2G/3G/4G radio frames, among other things.
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
Google Assistant for Single Board Computers
An MIT-licensed implementation of YOLOv9, YOLOv7, YOLO-RD