Skip to content
View Promethues3's full-sized avatar

Block or report Promethues3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SOTA Open Source TTS

Python 17,485 1,309 Updated Dec 21, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,152 1,045 Updated Dec 22, 2024

主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。

Jupyter Notebook 1,637 819 Updated Mar 16, 2022

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 5,683 1,110 Updated Dec 18, 2024

The code and data for GrammarGPT.

Python 165 9 Updated Oct 10, 2023

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,406 791 Updated Dec 21, 2024

Multilingual Voice Understanding Model

Python 3,774 335 Updated Nov 29, 2024

a curated list of speech datasets (110+ datasets, 75+ easy to download)

103 4 Updated Feb 15, 2023
Python 577 50 Updated Jun 7, 2024

Instant voice cloning by MIT and MyShell.

Python 30,152 2,982 Updated Dec 12, 2024

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 14,052 1,893 Updated Dec 21, 2024
Python 5,428 896 Updated Dec 15, 2024

Stable Diffusion web UI

Python 144,719 27,181 Updated Dec 17, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,834 10,228 Updated Jun 18, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,749 5,503 Updated Dec 22, 2024

通义千问 SFT试验

Jupyter Notebook 62 15 Updated Jan 6, 2024

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,432 189 Updated Nov 9, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,069 679 Updated Dec 4, 2024

A GUI client for Windows and Linux, support Xray core and sing-box-core and others

C# 72,015 11,793 Updated Dec 19, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,593 249 Updated Dec 17, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,999 591 Updated Dec 18, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,884 1,205 Updated Dec 12, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,951 4,169 Updated Dec 20, 2024

全局指针统一处理嵌套与非嵌套NER的Pytorch实现

Python 384 46 Updated Mar 23, 2023

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,190 387 Updated Sep 29, 2023

Question Answering annotation platform - Plateforme d'annotation

Python 87 21 Updated Apr 30, 2021

Aligning pretrained language models with instruction data generated by themselves.

Python 4,208 490 Updated Mar 27, 2023

Generative Judge for Evaluating Alignment

Python 221 14 Updated Jan 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,535 4,501 Updated Dec 21, 2024

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

C++ 11,864 2,263 Updated Aug 14, 2023
Next