  • Yonsei University


Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Python 129 10 Updated Feb 28, 2025

Official repository for KoMT-Bench built by LG AI Research

Python 59 Updated Aug 8, 2024

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 13,768 1,389 Updated Feb 17, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch

Python 1,172 99 Updated Mar 2, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 391 24 Updated Mar 4, 2025

Investment Research for Everyone, Everywhere.

Python 36,621 3,320 Updated Mar 5, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 292 11 Updated Feb 28, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 9,131 1,338 Updated Feb 7, 2025

Everything you need to build state-of-the-art foundation models, end-to-end.

Python 7,594 536 Updated Mar 5, 2025

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Python 405 52 Updated Feb 24, 2025

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Python 81 4 Updated Dec 9, 2024

Official inference repo for FLUX.1 models

Python 20,583 1,447 Updated Feb 6, 2025

[NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"

Python 128 9 Updated Oct 8, 2024

Motion-Controllable Video Diffusion via Warped Noise

Python 786 41 Updated Feb 26, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,550 123 Updated Aug 13, 2024

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,970 195 Updated Mar 5, 2025

Audio Large Language Models

Python 417 26 Updated Feb 27, 2025

Multi-domain reasoning benchmark for Korean language models

Python 185 33 Updated Oct 17, 2024

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,039 75 Updated Mar 2, 2025

Repository for Accent Recognition (Hackathon @SLT2022)

Jupyter Notebook 25 9 Updated May 12, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,342 492 Updated Feb 12, 2025

AlphaFold 3 inference pipeline.

Python 6,165 761 Updated Mar 4, 2025

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 137 8 Updated Mar 5, 2025

A Survey of Spoken Dialogue Models (60 pages)

270 16 Updated Nov 28, 2024
Python 1 Updated Oct 31, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 320 32 Updated Sep 29, 2024

Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perform professional responsibilities

Python 32 5 Updated Dec 11, 2024
Python 135 245 Updated May 11, 2023

Learn how to use the Cognitive Services Python SDK with these samples

Python 182 202 Updated Mar 7, 2024