Welcome to the WildDESED dataset repository! This dataset is designed to advance research in sound event detection (SED) within the challenging and diverse acoustic environments of domestic settings.
WildDESED is an extension of the original DESED dataset, created to reflect a wider variety of domestic scenarios by incorporating complex and unpredictable background noises. These enhancements make WildDESED a powerful resource for developing and evaluating noise-robust SED systems. For more listenable examples of WildDESESD, please go check out our project page.
- Diverse Scenarios: The dataset includes eight domestic scenarios such as "Morning Routine" and "Home Office," designed using large language models (LLMs) to ensure realism.
- Rich Acoustic Variety: Background noises include bird chirping, car passing by, fan noise, and many more, integrated to simulate real-life domestic environments.
The WildDESED dataset is organized into the following subsets same as the original DESED dataset:
- Synth Training Set: 10,000 synthetic recordings with strong annotations.
- Synth Validation Set: 2,500 synthetic recordings for model validation.
- Weak Set: 1,578 real recordings with weak annotations.
- Unlabeled Training Set: 14,412 real, unlabeled recordings.
- Test Set: 1,168 real recordings with strong annotations.
The dataset includes a variety of noise types categorized into four groups based on their acoustic characteristics:
- Ambient Environmental Sounds: Continuous background noises like light rain and wind blowing.
- Human-Related Sounds: Sporadic sounds such as footsteps and door closing.
- Mechanical Sounds: Noises like clock ticking and coffee machine operation.
- Nature and Outdoor Sounds: External noises including bird chirping and car passing by.
Each scenario combines these noises with target sound classes from the DESED dataset, ensuring that the resulting soundscapes are both realistic and challenging for SED systems.
Here are visual representations of two different scenarios:
Morning Routine | Pet Care |
---|---|
The WildDESED dataset is available for download here. Please ensure to read and comply with the dataset's licensing terms before use.
- Please first
pip install -r requirements.txt
Without curriculum learning:
python train_sed.py
With curriculum learning:
python train_sed_cl.py
PS: Please follow instructions in DCASE 2024 Task 4 to download the DESED. You may also download the DESED here.
If you use WildDESED dataset in your research, please cite our paper: Y. Xiao and R. K. Das, "WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System", in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024), 2024.
@inproceedings{Xiao2024WildDESED,
title={{WildDESED}: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System},
author={Yang Xiao and Rohan Kumar Das},
booktitle={Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024)},
pages = {196--200},
year={2024},
}
We welcome contributions to the WildDESED dataset! Please open an issue or submit a pull request if you have suggestions or improvements. You can also send me an email.
This dataset is based on the AudioSet provided by Google Inc. under the CC BY 4.0 license, which has been modified by Yang Xiao. The modifications include the addition of noise to the DESED dataset to create a distinct dataset.
[1] N. Turpault, et al. "Sound event detection in domestic environments with weakly labeled data and soundscape synthesis", (DCASE 2019 Workshop).
[2] J. F. Gemmeke, et al. "Audio Set: An Ontology and Human-Labeled Dataset for Audio Events", (ICASSP 2017).