Performance Benchmarks

Compare throughput, latency, and time-to-first-token across different hardware configurations.

Loading metrics...
Performance metrics are averages from recent observations. Throughput is tokens/query/second (higher is better). Latency and TTFT (Time To First Token) are in seconds (lower is better).