A quick view of high-performance convolutional neural network (CNN) inference engines on mobile devices.
A benchmark that challenges language models to code solutions for scientific problems
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation metrics.
Performance benchmarking for ML/AI workloads
LlamaEval is a rapid prototype developed during a hackathon to provide a user-friendly dashboard for evaluating and comparing Llama models using the TogetherAI API.
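A dashboard like LlamaEval typically boils down to sending the same prompt to several models and comparing the responses and latencies. Below is a minimal sketch of such a comparison loop using the `together` Python SDK; the model IDs and prompt are illustrative assumptions, not taken from the LlamaEval repository itself.

```python
# Minimal sketch: query several Llama models via the Together API and
# time their responses. Model IDs and the prompt are illustrative only.
import os
import time

from together import Together  # pip install together

client = Together(api_key=os.environ["TOGETHER_API_KEY"])

# Hypothetical model list; actual IDs depend on Together's current catalog.
MODELS = [
    "meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo",
    "meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo",
]

PROMPT = "Explain the difference between latency and throughput in one sentence."

for model in MODELS:
    start = time.perf_counter()
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": PROMPT}],
        max_tokens=128,
    )
    elapsed = time.perf_counter() - start
    answer = resp.choices[0].message.content
    print(f"{model} ({elapsed:.2f}s):\n{answer}\n")
```

A real evaluation dashboard would layer scoring metrics and result storage on top of a loop like this; the sketch only shows the per-model request-and-compare core.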