models

Latency

The time between sending a request to an AI model and receiving its complete response, which directly affects whether an application feels responsive or sluggish to the end user.
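As a rough illustration of the definition above, latency can be measured by timing a request from the moment it is sent until the full response is back. The sketch below is only an assumption-laden example: `fake_model_call` is a hypothetical stand-in that sleeps instead of contacting a real model, and `measure_latency` is an illustrative helper name, not part of any particular library.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def measure_latency(call: Callable[[], T]) -> Tuple[T, float]:
    """Run `call` and return its result plus the elapsed wall-clock seconds."""
    start = time.perf_counter()          # timestamp just before the request
    result = call()                      # blocks until the complete response arrives
    elapsed = time.perf_counter() - start
    return result, elapsed


def fake_model_call() -> str:
    # Hypothetical stand-in for a real model request; sleeps to simulate work.
    time.sleep(0.05)
    return "response"


if __name__ == "__main__":
    text, seconds = measure_latency(fake_model_call)
    print(f"{text!r} arrived after {seconds * 1000:.0f} ms")
```

In practice the callable would wrap a real request, and a single measurement is noisy, so repeated runs give a more reliable picture of how responsive an application will feel.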

Last updated 2026-05-12