Stars
Gender recognition by voice and speech analysis
Enjoy the magic of Diffusion models!
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the …
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Animate a given image with animatediff and controlnet
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
A simple screen parsing tool towards pure vision based GUI agent
🎉 汇聚并整理飞书等公开分享文档链接,解决没有官方全局搜索痛点,让知识持续传递。A list cool, beauty, interesting doc of feishu.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Robust Speech Recognition via Large-Scale Weak Supervision
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
DSPy: The framework for programming—not prompting—language models
zc277584121 / graphrag
Forked from microsoft/graphragA modular graph-based Retrieval-Augmented Generation (RAG) system
A phoneme extractor tool for a free lipsync workflow in Unity. This is not made by me. it is made by rmemr
The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
LlamaIndex is a data framework for your LLM applications
📷 EasyPhoto | Your Smart AI Photo Generator.
Create agents that monitor and act on your behalf. Your agents are standing by!