Skip to content

A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.

License

Notifications You must be signed in to change notification settings

OscarVanL/LibriTTS-British-Accents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LibriTTS-British

This is a subset of the LibriTTS dataset that includes British speakers.

Speakers are sorted into libritts-english, libritts-irish, libritts-scottish, and libritts-welsh subsets.

Speakers were found using two resources, the LibriVox Accents Table and Ruth Golding's Blog, which both compile a list of British LibriVox audiobook speakers.

Please be aware that this dataset is likely not complete, and I make no promises of the regional accuracy.

Files are in a .tar.gz archive, split into 1GB chunks. This is because GitHub's LFS service imposes size limits, preventing the dataset being uploaded in a single file.

Download Mirrors

Kaggle dataset

Kaggle direct download

GitHub LFS

Note: GitHub LFS requires the purchase of "data packs", so I'd advise against using it.

  1. Install git lfs if you do not have this installed.
  2. Run git lfs install to set up lfs for your user account.
  3. git clone https://github.com/OscarVanL/LibriTTS-British-Accents

License

The original LibriTTS dataset was published under CC BY 4.0 licensing. This gives me permission to share and adapt this dataset as long as I give attribution. You can do the same with this dataset.

This dataset is licensed with CC BY 4.0.

About

A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published