I may be slow to respond.
Most of my work happens in private Repos :(
-
CMU
- CMU, Pittsburgh
- https://www.yuewu.ml/
Highlights
- Pro
Pinned Loading
-
microsoft/SmartPlay
microsoft/SmartPlay PublicSmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.