Stars
Task-oriented AI agent framework for digital workers and vertical AI agents
Algorithmic Feature Extraction of Respiratory Data
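百聆 (Bailing) is a GPT-4o-style voice conversation bot built with ASR + LLM + TTS, with latency as low as 800 ms; it runs on low-end hardware and supports interruption
Install and activate Microsoft Office for macOS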
Cross-platform, customizable ML solutions for live and streaming media.
A knowledge-guided computer vision framework for strawberry fruit detection and growth modeling.
Simple, fast, and fair evaluation of remote physiological sensing models
AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI
Tools for merging pretrained large language models.
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Assessment of non-invasive blood pressure prediction from PPG and rPPG signals using deep learning
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
[CVPR2020] "Detecting Attended Visual Targets in Video"
A Driver Fatigue Detection Algorithm Based on Dynamic Tracking of Small Facial Targets Using YOLOv7
A YOLOv8 implementation for strawberry disease. Achieves over 10% improvement in mAP compared to the Mask R-CNN baseline.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Every front-end GUI client for ChatGPT, Claude, and other LLMs
NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing
Guided Diffusion Model for Molecular Inverse Design
HRnV-Calc software for heart rate n-variability and heart rate variability analysis
[TPAMI & ECCV 2022] Contrast-Phys & Contrast-Phys+ for facial video-based remote physiological signal measurement
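An online image annotation tool (rectangles, polygons; under continuous development) that can be used for training deep learning instance segmentation models (Mask R-CNN) and more.
A flexible and extensible framework for gait recognition. With OpenGait, you can focus on designing your own models and easily compare them with the state of the art.
Translate a video from one language to another and add dubbing. Also supports speech recognition and transcription, speech synthesis, and subtitle translation.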