Skip to content
View Aknold's full-sized avatar

Block or report Aknold

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • sherpa-onnx Public

    Forked from k2-fsa/sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

    C++ Apache License 2.0 Updated Jan 3, 2025
  • EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

    Python Apache License 2.0 Updated Dec 24, 2024
  • CosyVoice Public

    Forked from FunAudioLLM/CosyVoice

    Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

    Python Apache License 2.0 Updated Dec 17, 2024
  • An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

    Python Apache License 2.0 Updated Dec 17, 2024
  • InspireMusic: A Unified Framework for Music, Song, Audio Generation.

    Python Apache License 2.0 Updated Dec 16, 2024
  • Multilingual Voice Understanding Model

    Python Other Updated Nov 29, 2024
  • Brand new TTS solution

    Python Other Updated Nov 29, 2024
  • faiss Public

    Forked from facebookresearch/faiss

    A library for efficient similarity search and clustering of dense vectors.

    C++ MIT License Updated Nov 26, 2024
  • RTranslator Public

    Forked from niedev/RTranslator

    Open source real-time translation app for Android that runs locally

    C++ Apache License 2.0 Updated Nov 23, 2024
  • Foundational Models for State-of-the-Art Speech and Text Translation

    Jupyter Notebook Other Updated Nov 14, 2024
  • Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

    Python Updated Oct 24, 2024
  • real time face swap and one-click video deepfake with only a single image

    Python GNU Affero General Public License v3.0 Updated Oct 23, 2024
  • CatVTON Public

    Forked from Zheng-Chong/CatVTON

    CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…

    Python Other Updated Oct 21, 2024
  • ecapture Public

    Forked from gojue/ecapture

    Capturing SSL/TLS plaintext without a CA certificate using eBPF. Supported on Linux/Android kernels for amd64/arm64.

    C Apache License 2.0 Updated Oct 7, 2024
  • ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

    Python Apache License 2.0 Updated Sep 28, 2024
  • FluxMusic Public

    Forked from feizc/FluxMusic

    Text-to-Music Generation with Rectified Flow Transformers

    Python Other Updated Sep 6, 2024
  • CodeFormer Public

    Forked from sczhou/CodeFormer

    [NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

    Python Other Updated Aug 11, 2024
  • Bark Voice Cloning and Voice Cloning for Chinese Speech

    Jupyter Notebook MIT License Updated Aug 8, 2024
  • EchoMimic Public

    Forked from antgroup/echomimic

    Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

    Python Apache License 2.0 Updated Jul 23, 2024
  • Curated list of project-based tutorials

    MIT License Updated Jul 22, 2024
  • 《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

    Jupyter Notebook Updated Jun 16, 2024
  • Help you discover excellent English projects and get rid of disturbing by other spoken language.

    Python Other Updated Jun 16, 2024
  • 🇨🇳 GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。

    Java Other Updated Jun 16, 2024
  • 18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

    Jupyter Notebook MIT License Updated Jun 16, 2024
  • mamba Public

    Forked from state-spaces/mamba

    Mamba SSM architecture

    Python Apache License 2.0 Updated Jun 7, 2024
  • ViViD: Video Virtual Try-on using Diffusion Models

    Python Apache License 2.0 Updated Jun 7, 2024
  • ChatTTS Public

    Forked from 2noise/ChatTTS

    ChatTTS is a generative speech model for daily dialogue.

    Jupyter Notebook Other Updated Jun 7, 2024
  • 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06

    JavaScript GNU General Public License v3.0 Updated Jun 4, 2024
  • InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

    Python Apache License 2.0 Updated May 30, 2024
  • ComfyUI Public

    Forked from comfyanonymous/ComfyUI

    The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

    Python GNU General Public License v3.0 Updated May 29, 2024