Mashiro009

Jupyter Notebook 63 4 Updated Oct 1, 2024

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

Python 319 25 Updated Jul 29, 2024
Python 17 6 Updated Apr 11, 2024

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Python 636 62 Updated Oct 27, 2024

Triton implementation of bi-directional (non-causal) linear attention

Python 33 1 Updated Jan 13, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,726 88 Updated Jan 17, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,087 348 Updated Jan 17, 2025

Fast and memory-efficient exact attention

Python 15,101 1,428 Updated Jan 18, 2025

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.

Python 43 5 Updated Jan 5, 2025

Utilizes ONNX Runtime for audio denoising.

Python 21 5 Updated Jan 17, 2025

This repository is the official implementation of the ECAI 2024 conference paper SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

Python 67 4 Updated Aug 13, 2024

A flexible, high-performance 3D simulator for Embodied AI research.

C++ 2,745 435 Updated Jan 17, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,028 143 Updated Jan 17, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 4,767 551 Updated Oct 22, 2024

Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch

Python 610 47 Updated Nov 27, 2024

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 74 6 Updated Dec 22, 2024
Python 7,162 560 Updated Jan 14, 2025
Python 12 3 Updated Nov 2, 2024
Python 192 25 Updated Dec 14, 2024
216 106 Updated Feb 6, 2018

Some papers about the Kalman filter

12 5 Updated Sep 4, 2019

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

HTML 468 60 Updated Mar 6, 2020

This is the repository for the speech enhancement model SyncFormer

9 Updated Nov 28, 2024

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 110 11 Updated Dec 11, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,249 754 Updated Jan 18, 2025
Python 1,140 41 Updated Nov 21, 2024

Code for the creation of CommonVoice-DEMAND speech enhancement datasets

Python 5 Updated Jul 24, 2023

Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"

Python 32 3 Updated Aug 7, 2024

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Python 365 62 Updated Sep 29, 2023