The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Noisy Speech Synthesizer

This repository contains a script to synthesize noisy speech data from clean speech and noise files. The script allows you to specify the number of hours of data to generate and the range of Signal-to-Noise Ratio (SNR) values.

Prerequisites

  • Python 3.x
  • Required Python packages (install using pip install -r requirements.txt)

Installation

  1. Clone the repository:

    git clone <repository-url>
    cd <repository-directory>
  2. Install the required packages:

    pip install -r requirements.txt

Configuration

The script reads its parameters from a configuration file (noisyspeech_synthesizer.cfg). Review and update this file to match your setup before running the synthesizer.
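
For orientation, the file is an INI-style config whose default section is noisy_speech (see --cfg_str below). The snippet below is an illustrative sketch only; the key names and values are assumptions and should be checked against the copy of noisyspeech_synthesizer.cfg shipped with the repository.

[noisy_speech]
; illustrative values only - verify against the actual file
; output sampling rate in Hz
sampling_rate: 16000
; length of each synthesized clip, in seconds
audio_length: 10
; silence inserted between concatenated utterances, in seconds
silence_length: 0.2
; SNR range in dB and the number of discrete SNR levels to generate
snr_lower: 0
snr_upper: 40
total_snrlevels: 5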

Usage

To generate noisy speech data, run the following command:

python noisyspeech_synthesizer.py --cfg noisyspeech_synthesizer.cfg --total_hours <number_of_hours>

Arguments

  • --cfg: Path to the configuration file (default is noisyspeech_synthesizer.cfg).
  • --cfg_str: Section in the configuration file to use (default is noisy_speech).
  • --total_hours: Total hours of data to be created.

Example

python noisyspeech_synthesizer.py --cfg noisyspeech_synthesizer.cfg --total_hours 100

This command will generate 100 hours of noisy speech data.

Uploading to Hugging Face

To upload the generated noisy speech data to a Hugging Face dataset, save the following script as upload_to_huggingface.py:

import os
from datasets import Dataset, DatasetDict, Audio
import pandas as pd

def create_dataset(noisyspeech_dir):
    # List all noisy speech files
    noisy_files = [os.path.join(noisyspeech_dir, f) for f in os.listdir(noisyspeech_dir) if f.endswith('.wav')]

    # Create a dataframe with file paths and tags
    data = {'file': noisy_files, 'label': ['noisy_speech'] * len(noisy_files)}
    df = pd.DataFrame(data)

    # Convert dataframe to Hugging Face Dataset
    dataset = Dataset.from_pandas(df)

    # Cast the file-path column to an Audio feature so files are decoded as audio on access
    dataset = dataset.cast_column("file", Audio())

    return dataset

def upload_dataset(dataset, dataset_name):
    # Create a DatasetDict
    dataset_dict = DatasetDict({"train": dataset})

    # Push the dataset to the Hugging Face Hub
    dataset_dict.push_to_hub(dataset_name)

if __name__ == "__main__":
    # Directory containing the noisy speech files
    noisyspeech_dir = 'NoisySpeech_training'

    # Dataset name on Hugging Face
    dataset_name = 'rfhuang/audio-quality'

    # Create dataset
    dataset = create_dataset(noisyspeech_dir)

    # Upload dataset
    upload_dataset(dataset, dataset_name)

Instructions

  1. Ensure you are logged in to your Hugging Face account:

    huggingface-cli login
  2. Run the script to upload the dataset:

    python upload_to_huggingface.py

Replace 'rfhuang/audio-quality' with the name of your own dataset repository on the Hugging Face Hub.
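
As a quick sanity check after the push, you can load the dataset back from the Hub. This is a minimal sketch, assuming the dataset name above and the train split created by the upload script:

from datasets import load_dataset

# Load the uploaded split back from the Hugging Face Hub
ds = load_dataset("rfhuang/audio-quality", split="train")

# Inspect the schema and the first decoded audio example
print(ds)
print(ds[0]["file"]["sampling_rate"], len(ds[0]["file"]["array"]))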

License

This project is licensed under the MIT License.
