-
UNICEF/iMMAP Inc.
- https://www.linkedin.com/in/inigoballester/
Stars
Benchmark
3 repositories
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
MLGym A New Framework and Benchmark for Advancing AI Research Agents