Mucong Ding* · Chenghao Deng* · Jocelyn Choo · Zichu Wu · Aakriti Agrawal · Avi Schwarzschild · Tianyi Zhou · Tom Goldstein · John Langford · Anima Anandkumar · Furong Huang
This is the codebase for the paper "Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization" (https://arxiv.org/abs/2409.18433) by Mucong Ding*, Chenghao Deng*, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, Anima Anandkumar, Furong Huang.
Please cite our work if you find it helpful:
@inproceedings{
ding2024easyhardbench,
title={Easy2Hard-Bench: Standardized Difficulty Labels for Profiling {LLM} Performance and Generalization},
author={Mucong Ding and Chenghao Deng and Jocelyn Choo and Zichu Wu and Aakriti Agrawal and Avi Schwarzschild and Tianyi Zhou and Tom Goldstein and John Langford and Anima Anandkumar and Furong Huang},
booktitle={The Thirty-eighth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
year={2024},
}