Note: for updates, see https://github.com/aalto-speech/dbca

Structure

  • scripts numbered 01-13 are meant to be run in succession
  • run.sh provides examples of running the scripts
  • exp/subset-d-1m/data contains the 1M sentence-pair dataset
  • exp/subset-d-1m/splits/*/*/*/ids_{train,test_full}.txt.gz contain the data splits with different compound divergences and different random initialisations
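
As a rough illustration of how the split files listed above might be consumed, the following sketch walks the splits directory and counts the entries in each pair of ID files. It assumes each `.txt.gz` file holds one ID per line, which the README does not state; the helper names `read_ids` and `find_splits` are hypothetical and not part of the repository.

```python
import gzip
from pathlib import Path


def read_ids(path):
    """Return the non-empty lines of a gzipped text file as a list of strings.

    Assumption: one ID per line (the README does not specify the format).
    """
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def find_splits(root="exp/subset-d-1m/splits"):
    """Yield (split_dir, train_path, test_path) for each directory with both ID files."""
    for train_path in sorted(Path(root).glob("*/*/*/ids_train.txt.gz")):
        test_path = train_path.with_name("ids_test_full.txt.gz")
        if test_path.exists():
            yield train_path.parent, train_path, test_path


if __name__ == "__main__":
    # Print the number of train and test IDs found in every split directory.
    for split_dir, train_path, test_path in find_splits():
        print(split_dir, len(read_ids(train_path)), len(read_ids(test_path)))
```

Run from the repository root, this prints one line per split directory. How the three directory levels map onto compound divergence and random initialisation is not covered here and should be checked against run.sh.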

Dependencies