Skip to content

Latest commit

 

History

History

preprocess

Preprocess Scripts


  • Note : the order of these scripts is IMPORTANT.
  • the preprocessing step is easy. but environment setting is not. please understand.
  • If you encounter any problems, please do not hesitate to email me or open an issue to the github.
  1. Transcribe piano wavs to midi
  1. synchronize midi
python pop_align.py DATA_DIR
  1. Estimate Pop's beats
python bpm_quantize.py DATA_DIR 
  1. get separated vocal track
python split_spleeter.py DATA_DIR
  1. caculate melody chroma accuracy
python melody_accuracy.py DATA_DIR 

Expected Structure

├── -7lV0oJ0QXc
│   ├── EHl_eQhgefw.beatinterval.npy
│   ├── EHl_eQhgefw.beatstep.npy
│   ├── EHl_eQhgefw.beattime.npy
│   ├── EHl_eQhgefw.mid
│   ├── EHl_eQhgefw.notes.npy
│   ├── EHl_eQhgefw.pitchshift.wav
│   ├── EHl_eQhgefw.qmidi.mid
│   ├── EHl_eQhgefw.qmix.flac
│   ├── EHl_eQhgefw.vocals.mp3
│   ├── EHl_eQhgefw.wav
│   └── The Beatles - With a Little Help from My Friends ____With A Little Help From My Friends - The Beatles _.txt
├── -7lV0oJ0QXc.mid
├── -7lV0oJ0QXc.wav
├── -7lV0oJ0QXc.yaml

Descriptions for each data

  1. *.beattime.npy
    • timesteps (unit : second) extracted using essentia. np.ndarray. (num_beats, )
  2. *.beatstep.npy
    • timesteps (unit : second) per every half-beat. it is calculated using linear interpolation of beattime.
  3. *.notes.npy
    • np.ndarray shape: (number_of_notes, 4)
    • each row contains : [onset(unit: index), offset(unit: index), pitch, velocity]
    • onset/offset values mean that the index of beatstep time.
    • for example,
      • beatstep = [0.6, 1.0, 1.4]
      • ```note = [0, 1, 77, 88]``
      • then note means a note starts from 0.6sec to 1.0sec, and its pitch is 77 and velocity is 88.