JinmingChe

JinmingChe

1 follower · 4 following

Achievements

Highlights

Freeze-Omni Public
Forked from VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python Other Updated Dec 3, 2024
GLM-4-Voice Public
Forked from THUDM/GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python Apache License 2.0 Updated Oct 25, 2024
getNode Public
Forked from Flikify/getNode

每小时更新最新的Clash、v2ray节点信息

Shell Updated Oct 16, 2024
External-Attention-pytorch Public
Forked from xmu-xiaoma666/External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python MIT License Updated Aug 29, 2024
Qifusion-net Public

The net mudule of Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition

Python 2 Updated Jul 3, 2024
TIM-Net_SER Public
Forked from Jiaxin-Ye/TIM-Net_SER

[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python GNU General Public License v3.0 Updated May 15, 2024
pytorch-metric-learning Public
Forked from KevinMusgrave/pytorch-metric-learning

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python MIT License Updated Dec 16, 2023
vits_chinese_0829 Public
Forked from PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!

Python MIT License Updated Sep 19, 2023
so-vits-svc-5.0 Public
Forked from PlayVoice/whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python MIT License Updated Sep 11, 2023
auto_avsr Public
Forked from mpc001/auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Python Apache License 2.0 Updated Sep 3, 2023
audiocraft Public
Forked from facebookresearch/audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python MIT License Updated Aug 3, 2023
Whisper-Finetune Public
Forked from yeyupiaoling/Whisper-Finetune

微调Whisper语音识别模型，支持无时间戳数据训练，有时间戳数据训练、无语音数据训练。加速推理，支持Web部署、Windows桌面部署和Android部署

C Apache License 2.0 Updated Jul 30, 2023
AttentionIsOFFByOne Public
Forked from kyegomez/AttentionIsOFFByOne

Implementation of "Attention Is Off By One" by Evan Miller

Python MIT License Updated Jul 25, 2023
VITS-fast-fine-tuning Public
Forked from Plachtaa/VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python Apache License 2.0 Updated Jul 2, 2023
CIF-HieraDist Public
Forked from MingLunHan/CIF-HieraDist

[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation

Python Apache License 2.0 Updated Jun 16, 2023
generative-ai-roadmap Public
Forked from SeedV/generative-ai-roadmap

生成式AI的应用路线图 The roadmap of generative AI: use cases and applications

Creative Commons Attribution 4.0 International Updated Jun 11, 2023
FunASR Public
Forked from modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Python Other Updated Jun 8, 2023
wenet Public
Forked from wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

C++ Apache License 2.0 Updated Jun 7, 2023
whisper Public
Forked from openai/whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python MIT License Updated Jun 5, 2023
so-vits-svc Public
Forked from svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

Python BSD 3-Clause "New" or "Revised" License Updated May 23, 2023
ColossalAI Public
Forked from hpcaitech/ColossalAI

Making big AI models cheaper, easier, and scalable

Python Apache License 2.0 Updated Feb 15, 2023
dparn Public
Forked from Qinwen-Hu/dparn

Python Updated Nov 30, 2022
LPCNet Public
Forked from xiph/LPCNet

Efficient neural speech synthesis

C BSD 3-Clause "New" or "Revised" License Updated Sep 30, 2022
Comprehensive-Transformer-TTS Public
Forked from keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python MIT License Updated Sep 24, 2022
sound-separation Public
Forked from google-research/sound-separation

Python Apache License 2.0 Updated Sep 20, 2022
jieba Public
Forked from fxsjy/jieba

结巴中文分词

Python MIT License Updated Jul 17, 2022
chinese_speech_pretrain Public
Forked from TencentGameMate/chinese_speech_pretrain

chinese speech pretrained models

Shell Updated Jul 13, 2022
Leveraging-Self-Supervised-Learning-for-AVSR Public
Forked from LUMIA-Group/Leveraging-Self-Supervised-Learning-for-AVSR

Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition

Python MIT License Updated Jul 13, 2022
attention_keras Public
Forked from thushv89/attention_keras

Keras Layer implementation of Attention for Sequential models

Python MIT License Updated Jul 7, 2022
PerceptualAudio Public
Forked from pranaymanocha/PerceptualAudio

Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM

Python MIT License Updated Jun 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JinmingChe

Achievements

Achievements

Highlights

Block or report JinmingChe

Freeze-Omni Public

GLM-4-Voice Public

getNode Public

External-Attention-pytorch Public

Qifusion-net Public

TIM-Net_SER Public

pytorch-metric-learning Public

vits_chinese_0829 Public

so-vits-svc-5.0 Public

auto_avsr Public

audiocraft Public

Whisper-Finetune Public

AttentionIsOFFByOne Public

VITS-fast-fine-tuning Public

CIF-HieraDist Public

generative-ai-roadmap Public

FunASR Public

wenet Public

whisper Public

so-vits-svc Public

ColossalAI Public

dparn Public

LPCNet Public

Comprehensive-Transformer-TTS Public

sound-separation Public

jieba Public

chinese_speech_pretrain Public

Leveraging-Self-Supervised-Learning-for-AVSR Public

attention_keras Public

PerceptualAudio Public