Latency

The time between sending a request to an AI model and receiving its complete response, which directly affects whether an application feels responsive or sluggish to the end user.

Related terms