-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chatglm3-6b-32k的中文测试结果远远低于README里的benchmark #59
Comments
用了新版的代码,分数已经和官方的一致了,问题应该出在chatglm3的build_chat部分~ |
嗯对,是这样的 |
请问官方发布的benchmark中各模型是如何解码的?greedy search(top_p=0, temperature=1)吗?@bys0318 |
请问这里用的是 greedy search解码吗?如果用generation_config里的跑出来差别大吗? |
是的 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
我个人在longbench的5个中文任务上测试了一下chatglm3-6b-32k的分数,用的默认的load方式和默认的generation_config参数,也用了greedy search的参数,但是结果远远低于README里记录的benchmark(分数如下所示),想请问一下你们测试的时候,是用的什么generation_config呀?
The text was updated successfully, but these errors were encountered: