Skip to content
View alufia's full-sized avatar
  • Yonsei University

Block or report alufia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

한국어 언어모델 다분야 사고력 벤치마크

Python 178 31 Updated Oct 17, 2024

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 900 49 Updated Dec 9, 2024

Repository for Accent Recognition (Hackathon @SLT2022)

Jupyter Notebook 24 9 Updated May 12, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,286 486 Updated Nov 16, 2024

AlphaFold 3 inference pipeline.

Python 5,560 652 Updated Dec 13, 2024

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 57 2 Updated Dec 13, 2024

A Survey of Spoken Dialogue Models (60 pages)

194 9 Updated Nov 28, 2024
Python 1 Updated Oct 31, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 309 31 Updated Sep 29, 2024

Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perform professional responsibilities

Python 28 3 Updated Dec 11, 2024
Python 131 232 Updated May 11, 2023

Learn how to use the Cognitive Services Python SDK with these samples

Python 173 199 Updated Mar 7, 2024

A collection of datasets for the purpose of emotion recognition/detection in speech.

HTML 306 42 Updated Sep 30, 2024

A Compact and Effective Pretrained Model for Speech Emotion Recognition

Python 29 1 Updated Jun 29, 2024

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Python 155 7 Updated Jun 17, 2024

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊

251 7 Updated Nov 2, 2024

Versatile Evaluation of Speech and Audio

Python 72 7 Updated Dec 14, 2024

A collection of dataset consists of a total of 8 English speech datasets for SER

Jupyter Notebook 12 Updated Oct 11, 2024

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 56 4 Updated Dec 4, 2024

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

116 2 Updated Jun 13, 2024
Python 88 10 Updated Aug 26, 2024

The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.

65 1 Updated Oct 4, 2024

UTokyo-SaruLab MOS Prediction System

Python 108 9 Updated Dec 9, 2024
Python 11 Updated Aug 19, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,795 8,681 Updated Dec 1, 2024

한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

347 17 Updated Jun 3, 2024

Machine learning speaker characteristics

Python 33 5 Updated Dec 13, 2024

The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these factors with real speech and noise datasets.

Python 24 Updated Dec 2, 2024

Banchmark for personality traits prediction with neural networks

Python 47 12 Updated Oct 7, 2024
Python 11 2 Updated Aug 28, 2024
Next