Stars
LiveKit real-time and server SDKs for Python
End-to-end stack for WebRTC. SFU media server and SDKs.
PyTorch Python Index mirror with PEP 503 “simple” compliance updates
OpenMMLab Pose Estimation Toolbox and Benchmark.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaki…
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Open Source framework for voice and multimodal conversational AI
A universal router for Solid inspired by Ember and React Router
Open-source UI component library and front-end development framework based on Tailwind CSS
Beautifully designed components. Built with Kobalte & corvu. Styled with Tailwind CSS.
A markdown editor that you can deploy on your own servers to achieve cloud storage and device synchronization(支持私有部署的云端存储双链笔记软件)
A self hosted virtual browser that runs in docker and uses WebRTC.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🚀 A framework helps you quickly build AI Native IDE products.
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
collect papers about human motion capture
An ultra fast cross-platform multiple screenshots module in pure Python using ctypes.
Build and run containers leveraging NVIDIA GPUs