DB-AIAT: A dual-branch attention-in-attention transformer for single-channel speech enhancement

Abstract: Curriculum learning has begun to thrive in the speech enhancement area: it decouples the original spectrum estimation task into multiple easier sub-tasks to achieve better performance. Motivated by this, we propose a dual-branch attention-in-attention transformer, dubbed DB-AIAT, to handle both coarse- and fine-grained regions of the spectrum in parallel. From a complementary perspective, a magnitude masking branch estimates the overall spectral magnitude, while a complex refining branch compensates for the missing complex spectral details and restores phase information. Within each branch, we propose a novel attention-in-attention transformer to replace conventional RNNs and temporal convolutional networks for temporal sequence modeling. Specifically, the proposed attention-in-attention transformer consists of adaptive temporal-frequency attention transformer blocks and an adaptive hierarchical attention module, which together capture long-term time-frequency dependencies and further aggregate global hierarchical contextual information. Experimental results on the VoiceBank + DEMAND dataset show that the proposed method yields state-of-the-art performance (e.g., 3.31 PESQ and 94.7% STOI) over previous advanced systems with a relatively light model size (2.81 M parameters).
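Until the official code is released, here is a minimal PyTorch-style sketch of the dual-branch fusion described above: the magnitude branch masks the noisy magnitude (reusing the noisy phase), and the complex branch adds a residual real/imaginary refinement. The tensor layout `(B, 2, T, F)` and all names (`fuse_branches`, `mag_mask`, `complex_residual`) are illustrative assumptions, not the repository's actual interface.

```python
import torch

def fuse_branches(noisy_spec: torch.Tensor,
                  mag_mask: torch.Tensor,
                  complex_residual: torch.Tensor) -> torch.Tensor:
    # noisy_spec, complex_residual: (B, 2, T, F) with real/imag channels
    # mag_mask: (B, T, F), magnitude mask from the masking branch
    real, imag = noisy_spec[:, 0], noisy_spec[:, 1]
    mag = torch.sqrt(real ** 2 + imag ** 2 + 1e-8)
    phase = torch.atan2(imag, real)
    masked_mag = mag * mag_mask                       # coarse magnitude estimate
    coarse = torch.stack((masked_mag * torch.cos(phase),
                          masked_mag * torch.sin(phase)), dim=1)
    return coarse + complex_residual                  # fine-grained complex refinement
```

Similarly, a rough sketch of the attention-in-attention idea: each block applies self-attention along the time axis and then along the frequency axis, and a learned softmax weighting over the blocks' outputs stands in for the adaptive hierarchical attention module (AHAM). Embedding size, head count, and block count here are placeholders, not the paper's configuration.

```python
import torch
import torch.nn as nn

class TFAttentionBlock(nn.Module):
    """One temporal-frequency attention block (sketch)."""
    def __init__(self, embed_dim=64, n_heads=4):
        super().__init__()
        self.time_attn = nn.MultiheadAttention(embed_dim, n_heads, batch_first=True)
        self.freq_attn = nn.MultiheadAttention(embed_dim, n_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(embed_dim)
        self.norm2 = nn.LayerNorm(embed_dim)

    def forward(self, x):                              # x: (B, T, F, C)
        b, t, f, c = x.shape
        # Attention over time: fold frequency into the batch dimension.
        xt = x.permute(0, 2, 1, 3).reshape(b * f, t, c)
        xt = self.norm1(xt + self.time_attn(xt, xt, xt)[0])
        x = xt.reshape(b, f, t, c).permute(0, 2, 1, 3)
        # Attention over frequency: fold time into the batch dimension.
        xf = x.reshape(b * t, f, c)
        xf = self.norm2(xf + self.freq_attn(xf, xf, xf)[0])
        return xf.reshape(b, t, f, c)

class AIATransformer(nn.Module):
    """Attention-in-attention (sketch): stacked T-F blocks whose outputs are
    aggregated by learned softmax weights, standing in for AHAM."""
    def __init__(self, n_blocks=4, embed_dim=64):
        super().__init__()
        self.blocks = nn.ModuleList(TFAttentionBlock(embed_dim) for _ in range(n_blocks))
        self.aham_weights = nn.Parameter(torch.zeros(n_blocks))

    def forward(self, x):
        outs = []
        for blk in self.blocks:
            x = blk(x)
            outs.append(x)
        w = torch.softmax(self.aham_weights, dim=0)    # adaptive hierarchical weights
        return sum(wi * oi for wi, oi in zip(w, outs))
```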

Comparison with SOTA:

Experimental results:

Ablation study:

The source code will be released soon!
