Could you provide the code for loading KeSpeech and the training code? #1

Open
zth-1024 opened this issue Nov 7, 2024 · 3 comments

@zth-1024

zth-1024 commented Nov 7, 2024

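Thank you very much for providing the module code. Could you also provide the code for processing the KeSpeech dataset and the training code? Many thanks.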

@JinmingChe
Owner

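The training module and the data loading module follow the WeNet format. You can convert the KeSpeech data into the data.list format, add an emotion label to each entry, and read the label and map it during the processing stage. Because of company requirements, only the model block code can be made public for now.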
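For readers following along, here is a minimal sketch of what a labeled data.list entry and the label mapping could look like. It assumes WeNet's raw data.list layout (one JSON object per line with `key`, `wav`, and `txt`) plus an extra `label` field. The `label` field name, the `LABEL_TO_ID` table, and the helpers `make_entry` and `map_label` are hypothetical illustrations, not the maintainer's code.

```python
import json

# Hypothetical label table: the thread does not list the actual classes.
LABEL_TO_ID = {"Mandarin": 0, "Zhongyuan": 1, "Southwestern": 2}


def make_entry(key, wav, txt, label):
    """Serialize one utterance as a JSON line.

    key/wav/txt follow WeNet's raw data.list layout; 'label' is an
    assumed extra field carrying the class name.
    """
    record = {"key": key, "wav": wav, "txt": txt, "label": label}
    return json.dumps(record, ensure_ascii=False)


def map_label(line, table=LABEL_TO_ID):
    """Parse one data.list line and attach the integer class id."""
    sample = json.loads(line)
    sample["label_id"] = table[sample["label"]]
    return sample


if __name__ == "__main__":
    line = make_entry("utt1", "/data/utt1.wav", "你好", "Mandarin")
    print(line)
    print(map_label(line))
```

In a WeNet-style pipeline, a step like `map_label` would presumably run right after the JSON line is parsed, so that later stages see an integer class id next to the audio and transcript.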

@zth-1024
Author

Understood, thank you for your reply.

@zth-1024
Author

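Hello, sorry to bother you again. Is it enough to process only the files under the Tasks/ASR folder and ignore Tasks/SubdialectID?
Then I would merge the data in train_phase1 and train_phase2 under Tasks/ASR into a single train set, and organize each data.list entry as id, path, text, label, as you described. Is that right?
When training the AID task, the ASR training set is still used, but only the samples whose label is a dialect are trained on. Is that correct?
Looking forward to your reply!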
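
The layout described in the questions above can be sketched as follows. This only illustrates the proposal and has not been confirmed by the maintainer. It assumes each split directory holds Kaldi-style `wav.scp` and `text` files plus a per-utterance label file, here called `utt2subdialect`. The file names, the `Mandarin` exclusion, and all helper names are assumptions.

```python
import json
from pathlib import Path


def read_kv(path):
    """Read a Kaldi-style 'utt_id value' file into a dict."""
    table = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.strip().split(maxsplit=1)
            if len(parts) == 2:  # skip lines with no value
                table[parts[0]] = parts[1]
    return table


def merge_splits(split_dirs, label_file="utt2subdialect"):
    """Merge several split directories into one list of records."""
    entries = []
    for d in map(Path, split_dirs):
        wavs = read_kv(d / "wav.scp")
        texts = read_kv(d / "text")
        labels = read_kv(d / label_file)  # assumed file name
        for utt in sorted(wavs):
            # Keep only utterances that have audio, text, and a label.
            if utt in texts and utt in labels:
                entries.append({"key": utt, "wav": wavs[utt],
                                "txt": texts[utt], "label": labels[utt]})
    return entries


def keep_dialects(entries, excluded=("Mandarin",)):
    """Drop samples whose label is in the excluded (non-dialect) set."""
    return [e for e in entries if e["label"] not in excluded]


def write_data_list(entries, out_path):
    """Write one JSON object per line, as in a WeNet raw data.list."""
    with open(out_path, "w", encoding="utf-8") as f:
        for e in entries:
            f.write(json.dumps(e, ensure_ascii=False) + "\n")
```

A hypothetical call would be `write_data_list(merge_splits(["Tasks/ASR/train_phase1", "Tasks/ASR/train_phase2"]), "train/data.list")`, with `keep_dialects` applied first if only dialect samples should be used.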
