I noticed that in the WER computation code, a WER is first computed for each test utterance, and then these are averaged directly with `wer = round(np.mean(wers)*100,3)`. The number of characters per utterance varies a lot between long and short sentences, so isn't this way of computing it somewhat unreasonable? Why not compute WER over the total number of characters in the test set instead?
When we used your code to run automatic recognition on the model's test outputs and compute the metric, it amplified the impact of short-sentence bad cases, and the WER came out noticeably high.
Obviously, you use whichever metric comes out lower ^^
+1 I agree that computing over the total number of characters is the reasonable approach; see FAIR's code for reference: https://github.com/facebookresearch/av_hubert/blob/258fb50e155134eec2c4b49c2ae8de267075fd18/avhubert/infer_s2s.py#L258
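A minimal sketch contrasting the two aggregation schemes under discussion (the `edit_distance` helper and the toy data are illustrative, not taken from the repository's code). Per-utterance averaging, as in `np.mean(wers)`, weights every sentence equally regardless of length, so one short bad case can dominate; corpus-level WER pools edit counts over the total number of reference tokens, which is what the linked FAIR code does:

```python
import numpy as np

def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences (one-row DP)."""
    dp = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        prev, dp[0] = dp[0], i
        for j, h in enumerate(hyp, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1,          # deletion
                                     dp[j - 1] + 1,      # insertion
                                     prev + (r != h))    # substitution / match
    return dp[-1]

def per_utterance_wer(pairs):
    # Mean of each sentence's own WER: every sentence weighted equally.
    wers = [edit_distance(r, h) / len(r) for r, h in pairs]
    return round(np.mean(wers) * 100, 3)

def corpus_wer(pairs):
    # Total edits over total reference tokens: length-weighted.
    errors = sum(edit_distance(r, h) for r, h in pairs)
    total = sum(len(r) for r, _ in pairs)
    return round(errors / total * 100, 3)

# Toy data: one long perfect sentence (9 tokens) and one short bad case.
pairs = [
    ("the quick brown fox jumps over the lazy dog".split(),
     "the quick brown fox jumps over the lazy dog".split()),  # 0 errors / 9
    ("no".split(), "yes".split()),                            # 1 error  / 1
]
print(per_utterance_wer(pairs))  # 50.0 -- mean of 0% and 100%
print(corpus_wer(pairs))         # 10.0 -- 1 error over 10 reference tokens
```

With only one error in ten reference tokens, the corpus-level figure is 10%, while per-utterance averaging reports 50% because the one-token bad case counts as much as the nine-token perfect sentence.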