A react app for collecting speech-text pairs

Create speech datasets easily with a user-friendly interface by recording audio for a list of phrases.

Setup

Install

First, clone the repository and install the necessary dependencies:

git clone https://github.com/HPC-IF/simple-speech-collector
cd simple-speech-collector
npm install

Start the Server

Start the development server:

npm run dev

Open http://localhost:3000 in your browser to access the interface.

Usage

Place a phrases.txt file in the ./public directory to set the phrases that will be displayed in the interface.
For each phrase, you can record and play back the audio. You can submit multiple audio samples for each phrase.
Each time you access the page, you are assigned a random speaker ID.

The output dataset is structured as follows:

/dataset
├── wavs
│   ├── speaker1_0.wav
│   ├── speaker1_1.wav
│   ├── speaker1_2.wav
│   ├── speaker2_0.wav
│   ├── speaker2_1.wav
│   └── ...
└── metadata.txt

The metadata.txt file will have the following format:

speaker1_0|First phrase|First phrase
speaker1_1|Second phrase|Second phrase
speaker1_2|Third phrase|Third phrase
speaker2_0|First phrase|First phrase
speaker2_1|Second phrase|Second phrase
speaker2_2|Third phrase|Third phrase

This format is widely used for model training.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
public		public
src		src
.gitignore		.gitignore
README.md		README.md
components.json		components.json
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A react app for collecting speech-text pairs

Setup

Install

Start the Server

Usage

About

Releases

Packages

Languages

HPC-IF/simple-speech-collector

Folders and files

Latest commit

History

Repository files navigation

A react app for collecting speech-text pairs

Setup

Install

Start the Server

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages