![google logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/google/google.png)
Starred repositories
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
AI Call Center,呼叫中心,大模型呼入机器人,大模型呼出机器人,智能外呼,大模型机器人,智能电话外呼,大模型,FreeSWITCH大模型智能客服,大模型智能呼叫中心系统!LLM,Call,IPCC,Voice,AI,Call Center,FreeSWITCH,TTS,ASR,NLP!
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
A real-time voice conversation system based on WebSocket and LLM, integrating Automatic Speech Recognition (ASR), Large Language Model conversation (LLM), and Text-to-Speech (TTS) capabilities.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Multilingual Voice Understanding Model
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
🌐 The Internet OS! Free, Open-Source, and Self-Hostable.
👨🏻💻🎨 My personal site made by NextJS & Tailwind & Framer motion
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
A full Python implementation for real car surround view system
A simple multi-threaded distributed SSH brute-forcing tool written in Python
This is my final year project titled 'Computer vision-based detection for tooth decay and cavities'.
Smartee project: 3D Teeth Reconstruction from Orthodontic Photos
Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety …
Starred topics
![google logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/google/google.png)