Lhx94As/merlion-ccs-2023-baseline
MERLIon CCS Baseline System

Results and description of the baseline system for the MERLIon CCS Challenge.

Example command to run the training script:

python train_conformer.py --dim 39 --train /home/challenge_feat_all_train.txt --test /home/devset_feats.txt --warmup 5000 --epochs

The file challenge_feat_all_train.txt is formatted as:

chunk_1_feature.npy 0
chunk_2_feature.npy 1

where 0 and 1 are language label indexes denoting English and Mandarin respectively.
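A file in this format can be parsed with a few lines of Python. This is a minimal sketch (the function name and return structure are illustrative, not from the repository), assuming each line holds an .npy feature path followed by an integer label:

```python
def load_feature_list(path):
    """Parse a feature list file: one '<npy_path> <label>' pair per line."""
    pairs = []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            # Split from the right so spaces in the path survive.
            npy_path, label = line.rsplit(maxsplit=1)
            pairs.append((npy_path, int(label)))
    return pairs
```

Each returned pair can then be loaded with numpy.load and fed to the training loop.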

Example command to compute the Equal Error Rate for Task 1 (Language Identification) using compute_eer_bac.py:

python compute_eer_bac.py --valid /your_utterance_to_language_index.txt --score /your_utterance_to_prediction.txt --trial /path_to_save_trial.txt

The file your_utterance_to_language_index.txt is formatted as:

chunk_1 0
chunk_2 1

where 0 and 1 are ground-truth language label indexes denoting English and Mandarin respectively.

The file your_utterance_to_prediction.txt contains the prediction labels and can take one of two formats.

First format:

chunk_1 0 5.12316
chunk_1 1 -12.66789

where 0 and 1 are the predicted language label indexes denoting English and Mandarin respectively, followed by the language prediction scores.

Second format:

chunk_1 5.12316 -12.66789

where the first language prediction score is for English followed by the language prediction score for Mandarin.

Example command to run diarization_validation.py (specific to our baseline system):

python diarization_validation.py --model /home/merlion/model.ckpt --audio /home/MERLIon-CCS-Challenge_Development-Set_v001/_CONFIDENTIAL/_audio/ --save /home/devset_diar/

If you have already computed the RTTMs for Task 2 (Language Diarization), the language diarization error rate and individual English and Mandarin error rates across the entire dataset can be computed by uncommenting the code in scoring_diar.py and running the following command:

python scoring_diar.py --predicting_file /your_folder_saved_prediction_rttm_files --ground_truth /your_folder_saved_ground_truth_rttm --result_output /expected_path_to_save_result

where:

  • --predicting_file is the folder containing all predicted RTTM files, named after the audio files (e.g., the predicted RTTM file for 123.wav should be 123.txt in the prediction folder).
  • --ground_truth is the folder containing all ground-truth RTTM files, named the same way (e.g., the ground-truth RTTM file for 123.wav should be 123.txt in the ground-truth folder).
  • --result_output is the folder in which to save the results.
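The filename convention above can be sketched as follows; this is an illustrative helper (function and folder names are placeholders, not part of scoring_diar.py) that pairs each predicted RTTM file with its ground-truth counterpart:

```python
import os

def pair_rttm_files(pred_dir, truth_dir):
    """Match predicted RTTM files to ground-truth files by filename."""
    pairs = []
    for name in sorted(os.listdir(pred_dir)):
        truth_path = os.path.join(truth_dir, name)
        if os.path.isfile(truth_path):
            pairs.append((os.path.join(pred_dir, name), truth_path))
        else:
            # A prediction with no matching reference cannot be scored.
            print(f"warning: no ground truth for {name}")
    return pairs
```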

We have also provided preprocess_train.py for training data processing (in case you need it), dev_process.py for Task 1, and dev_process_diar.py for Task 2 to help you develop your model.
