Stars
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
The code and data for GrammarGPT.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Multilingual Voice Understanding Model
a curated list of speech datasets (110+ datasets, 75+ easy to download)
Instant voice cloning by MIT and MyShell.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Stable Diffusion web UI
A latent text-to-image diffusion model
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Training LLMs with QLoRA + FSDP
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
A GUI client for Windows and Linux, support Xray core and sing-box-core and others
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Retrieval and Retrieval-augmented LLMs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Question Answering annotation platform - Plateforme d'annotation
Aligning pretrained language models with instruction data generated by themselves.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M