Lists (1)
Sort Name ascending (A-Z)
Stars
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …
更新2008年版本的《上海交通大学生存手册》gitbook发布于https://survivesjtu.gitbook.io/survivesjtumanual/
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the …
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Common used path planning algorithms with animations.
Pseudo Streaming SenseVoice with Hotwords
A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
🚀🚀 「大模型」50分钟完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 50 min!
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
A memory framework for Large Language Models and Agents.
Agentic components of the Llama Stack APIs
A dockerized fake SSH server honeypot written in Go that logs login attempts.
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
PyTorch implementation of AnimeGANv2
使深信服(Sangfor)开发的非自由的 VPN 软件 EasyConnect 和 aTrust 运行在 docker 或 podman 中,并作为网关和/或提供 socks5、http 代理服务
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Finetune ModelScope's Text To Video model using Diffusers 🧨
Fine-Grained Open Domain Image Animation with Motion Guidance
Character Animation (AnimateAnyone, Face Reenactment)
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
WinMerge is an Open Source differencing and merging tool for Windows. WinMerge can compare both folders and files, presenting differences in a visual text format that is easy to understand and handle.
A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China