AI human on working!
Engineer and Data Scientist. I am currently working as an AI/ML developer. Enthusiastic to keep growing!
- Valencia
- in/diegoarcos
Stars
🧪 Evaluator | LLM
5 repositories
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Evaluation tool for LLM QA chains
Build, evaluate, understand, and fix LLM-based apps
Supercharge Your LLM Application Evaluations
ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models