Inter Token Latency (lower is better)
Time to First Token, TTFT (lower is better)
End-to-End Latency (lower is better)
Request Output Throughput (higher is better)
Successful requests (higher is better)
Error rate
Prompt tokens
Decoded tokens
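
The metrics above are only named here, not defined, so the following Python sketch shows one common way such numbers could be derived from per-request timing data. Everything in it is an assumption for illustration: the `RequestTrace` record, its field names, the `summarize` helper, and the choice to treat request output throughput as decoded tokens divided by end-to-end latency, averaged over successful requests. The benchmark behind these labels may define them differently.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List, Optional


@dataclass
class RequestTrace:
    """Hypothetical per-request record; field names are illustrative."""
    start: float               # time the request was sent, in seconds
    token_times: List[float]   # arrival time of each decoded token, in seconds
    prompt_tokens: int
    ok: bool = True            # False if the request failed


def ttft(r: RequestTrace) -> float:
    # Time to first token: delay between sending and the first decoded token.
    return r.token_times[0] - r.start


def inter_token_latency(r: RequestTrace) -> Optional[float]:
    # Mean gap between consecutive tokens; undefined for fewer than two tokens.
    n = len(r.token_times)
    if n < 2:
        return None
    return (r.token_times[-1] - r.token_times[0]) / (n - 1)


def end_to_end_latency(r: RequestTrace) -> float:
    # Delay between sending the request and receiving the last token.
    return r.token_times[-1] - r.start


def summarize(traces: List[RequestTrace]) -> Dict[str, Optional[float]]:
    # Only successful requests that produced tokens contribute to timing metrics.
    ok = [t for t in traces if t.ok and t.token_times]
    itls = [x for x in (inter_token_latency(t) for t in ok) if x is not None]
    # Assumed definition: decoded tokens per second of end-to-end latency.
    throughputs = [
        len(t.token_times) / end_to_end_latency(t)
        for t in ok
        if end_to_end_latency(t) > 0
    ]
    return {
        "inter_token_latency_s": mean(itls) if itls else None,
        "ttft_s": mean([ttft(t) for t in ok]) if ok else None,
        "end_to_end_latency_s": mean([end_to_end_latency(t) for t in ok]) if ok else None,
        "request_output_throughput_tok_per_s": mean(throughputs) if throughputs else None,
        "successful_requests": len(ok),
        "error_rate": (len(traces) - len(ok)) / len(traces) if traces else None,
        "prompt_tokens": sum(t.prompt_tokens for t in ok),
        "decoded_tokens": sum(len(t.token_times) for t in ok),
    }
```

Calling `summarize` on a list of traces returns one dictionary with an entry per metric in the list above. Failed requests count toward the error rate but are left out of the latency and token totals, which is itself a design choice rather than a fixed rule.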