-
sherpa-onnx Public
Forked from k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…
C++ Apache License 2.0 Updated Jan 3, 2025 -
echomimic_v2 Public
Forked from antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Python Apache License 2.0 Updated Dec 24, 2024 -
CosyVoice Public
Forked from FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python Apache License 2.0 Updated Dec 17, 2024 -
ClearerVoice-Studio Public
Forked from modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Python Apache License 2.0 Updated Dec 17, 2024 -
InspireMusic Public
Forked from FunAudioLLM/InspireMusic
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
Python Apache License 2.0 Updated Dec 16, 2024 -
SenseVoice Public
Forked from FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Python Other Updated Nov 29, 2024 -
fish-speech Public
Forked from fishaudio/fish-speech
Brand new TTS solution
Python Other Updated Nov 29, 2024 -
faiss Public
Forked from facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
C++ MIT License Updated Nov 26, 2024 -
RTranslator Public
Forked from niedev/RTranslator
Open source real-time translation app for Android that runs locally
C++ Apache License 2.0 Updated Nov 23, 2024 -
seamless_communication Public
Forked from facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Jupyter Notebook Other Updated Nov 14, 2024 -
GOT-OCR2.0 Public
Forked from Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Python Updated Oct 24, 2024 -
Deep-Live-Cam Public
Forked from hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Python GNU Affero General Public License v3.0 Updated Oct 23, 2024 -
CatVTON Public
Forked from Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…
Python Other Updated Oct 21, 2024 -
ecapture Public
Forked from gojue/ecapture
Capturing SSL/TLS plaintext without a CA certificate using eBPF. Supported on Linux/Android kernels for amd64/arm64.
C Apache License 2.0 Updated Oct 7, 2024 -
HivisionIDPhotos Public
Forked from Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. A lightweight algorithm for making AI ID photos.
Python Apache License 2.0 Updated Sep 28, 2024 -
FluxMusic Public
Forked from feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Python Other Updated Sep 6, 2024 -
CodeFormer Public
Forked from sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Python Other Updated Aug 11, 2024 -
Bark-Voice-Cloning Public
Forked from KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Jupyter Notebook MIT License Updated Aug 8, 2024 -
EchoMimic Public
Forked from antgroup/echomimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Python Apache License 2.0 Updated Jul 23, 2024 -
project-based-learning Public
Forked from practical-tutorials/project-based-learning
Curated list of project-based tutorials
MIT License Updated Jul 22, 2024 -
PyTorch-Tutorial-2nd Public
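Forked from TingsongYu/PyTorch-Tutorial-2nd
"PyTorch Practical Tutorial" (2nd edition): whether you are starting from zero, applying CV, NLP, or LLM projects, or moving on to advanced engineering deployment, it is all covered here. With this book's help, readers should be able to master PyTorch with ease and become excellent deep learning engineers.
Jupyter Notebook Updated Jun 16, 2024 -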
GitHub-English-Top-Charts Public
Forked from GrowingGit/GitHub-English-Top-Charts
Helps you discover excellent English projects without interference from projects in other spoken languages.
Python Other Updated Jun 16, 2024 -
GitHub-Chinese-Top-Charts Public
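Forked from GrowingGit/GitHub-Chinese-Top-Charts
🇨🇳 GitHub Chinese Top Charts, with separate "Software | Resources" lists for each language to pinpoint good Chinese projects. Take what you need and learn efficiently.
Java Other Updated Jun 16, 2024 -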
generative-ai-for-beginners Public
Forked from microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Jupyter Notebook MIT License Updated Jun 16, 2024 -
mamba Public
Forked from state-spaces/mamba
Mamba SSM architecture
Python Apache License 2.0 Updated Jun 7, 2024 -
ViViD Public
Forked from alibaba-yuanjing-aigclab/ViViD
ViViD: Video Virtual Try-on using Diffusion Models
Python Apache License 2.0 Updated Jun 7, 2024 -
ChatTTS Public
Forked from 2noise/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Jupyter Notebook Other Updated Jun 7, 2024 -
DeepLearning-500-questions Public
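Forked from scutan90/DeepLearning-500-questions
Deep Learning 500 Questions: explains hot topics in commonly used probability, linear algebra, machine learning, deep learning, computer vision, and more in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan 2018.06
JavaScript GNU General Public License v3.0 Updated Jun 4, 2024 -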
InstantID Public
Forked from instantX-research/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Python Apache License 2.0 Updated May 30, 2024 -
ComfyUI Public
Forked from comfyanonymous/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
Python GNU General Public License v3.0 Updated May 29, 2024