← back to glossary

models

Benchmark

A standardized test or set of tasks used to measure and compare AI model performance across specific capabilities like reasoning, coding, or language understanding.

Last updated 2026-05-12