Universal multilingual automatic speech transcription into IPA

ctaguchi/multipa

multipa

MultIPA is yet another model for automatic speech transcription into phonetic IPA. The idea is that if we train a multilingual speech-to-IPA model on a sufficient amount of good phoneme representations, the model's output will approximate phonetic transcriptions.

Note that the code in this repository is incomplete; we are still cleaning up the original code for publication here. The finalized code will be ready by the day of the presentation at INTERSPEECH 2023 (mid-August). We appreciate your patience!

Available training languages

At the moment, the following languages are available in the training data:

  • English
  • Finnish
  • Hungarian
  • Japanese
  • Maltese
  • Modern Greek
  • Polish
  • Tamil

Note that English was added after the INTERSPEECH 2023 paper was submitted. We aim to include more languages to better reflect linguistic diversity.
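
As a rough illustration of how the language list above might be represented when selecting training data, here is a minimal Python sketch. The names `TRAINING_LANGUAGES` and `select_languages` are hypothetical and are not part of this repository's code; the two-letter values are the standard ISO 639-1 codes for each language, and whether the training scripts use these codes is an assumption.

```python
# Hypothetical sketch: not part of the multipa codebase.
# Maps each training language listed above to its ISO 639-1 code.
TRAINING_LANGUAGES = {
    "English": "en",
    "Finnish": "fi",
    "Hungarian": "hu",
    "Japanese": "ja",
    "Maltese": "mt",
    "Modern Greek": "el",
    "Polish": "pl",
    "Tamil": "ta",
}


def select_languages(names):
    """Return ISO 639-1 codes for the requested languages, in order.

    Raises ValueError if a requested language is not in the list above.
    """
    unknown = [name for name in names if name not in TRAINING_LANGUAGES]
    if unknown:
        raise ValueError(f"Not available in the training data: {unknown}")
    return [TRAINING_LANGUAGES[name] for name in names]


if __name__ == "__main__":
    print(select_languages(["Japanese", "Polish"]))
```

Keeping the list in one mapping means a newly added language only needs a single new entry, and a request for an unsupported language fails early instead of silently producing an empty training set.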
