Nvmix

Model Monitoring

Real-time performance metrics for all deployed endpoints.

All Systems Operational

Total Requests (24h)

1,284,392

+12% vs yesterday

Avg Latency

115ms

-8ms vs yesterday

Error Rate

0.24%

-0.1% vs yesterday

Active Models

0 / 0

Requests per Second

Live — updates every 2s

55 RPS
12 seconds ago to now

Endpoint Metrics

Model | Region | RPS | p50 | p95 | p99 | Error % | Status

Latency Distribution

< 50ms: 8%
50–100ms: 31%
100–200ms: 42%
200–500ms: 16%
> 500ms: 3%
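
One way the bucket shares above could be computed from raw request latencies is sketched below. This is an illustration only: the function name `latency_distribution`, the constants, and the boundary handling are assumptions, since the dashboard does not say which bucket a value exactly on an edge (for example 100ms or 500ms) belongs to. Here each edge value is counted in the higher bucket.

```python
from bisect import bisect_right

# Bucket edges in milliseconds, taken from the labels shown on the dashboard.
EDGES_MS = [50, 100, 200, 500]
LABELS = ["< 50ms", "50–100ms", "100–200ms", "200–500ms", "> 500ms"]


def latency_distribution(latencies_ms):
    """Return the percentage of requests that fall in each latency bucket.

    Assumption: a value exactly on an edge is counted in the higher bucket
    (so 50 goes to "50–100ms" and 500 goes to "> 500ms").
    """
    counts = [0] * len(LABELS)
    for value in latencies_ms:
        # bisect_right gives the index of the first edge greater than value,
        # which is also the index of the bucket the value belongs to.
        counts[bisect_right(EDGES_MS, value)] += 1

    total = len(latencies_ms)
    if total == 0:
        return {label: 0.0 for label in LABELS}
    return {label: 100.0 * count / total for label, count in zip(LABELS, counts)}
```

For example, `latency_distribution([10, 60, 70, 150, 600])` puts one of five samples in the first bucket, two in the second, one in the third, none in the fourth, and one in the last.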

Alert Rules

High error rate

Error rate > 5%

High latency

p95 Latency > 500ms

Low throughput

RPS < 1 for 5 min

Model offline

Status = sleeping
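
The four rules above can be read as simple threshold checks against a per-endpoint metrics snapshot. The sketch below shows one possible evaluation, not the dashboard's actual alerting logic. The `EndpointSnapshot` class, its field names, and the `triggered_alerts` function are hypothetical, and "RPS < 1 for 5 min" is modeled as a counter of how long RPS has stayed below 1.

```python
from dataclasses import dataclass


@dataclass
class EndpointSnapshot:
    # Hypothetical field names; the dashboard does not publish a schema.
    error_rate_pct: float      # error rate as a percentage, e.g. 0.24
    p95_latency_ms: float      # 95th percentile latency in milliseconds
    seconds_below_1_rps: float # how long RPS has been continuously below 1
    status: str                # e.g. "active" or "sleeping"


def triggered_alerts(snapshot):
    """Return the names of the alert rules that the snapshot violates."""
    alerts = []
    if snapshot.error_rate_pct > 5:
        alerts.append("High error rate")
    if snapshot.p95_latency_ms > 500:
        alerts.append("High latency")
    # "RPS < 1 for 5 min": assumed to mean at least 300 continuous seconds.
    if snapshot.seconds_below_1_rps >= 5 * 60:
        alerts.append("Low throughput")
    if snapshot.status == "sleeping":
        alerts.append("Model offline")
    return alerts
```

With the headline figures shown on this page (0.24% error rate, 115ms latency used here as a stand-in for p95), a snapshot with normal throughput and an active status would trigger no alerts.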

Region Health

us-east-1

142ms
62%

eu-west-1

88ms
28%

ap-southeast-1

0%

Recent Errors

RateLimitExceeded

llama3-sentiment-v2

10:33:12

×3

TimeoutError

gpt2-code-assistant

09:58:44

×1

InvalidInputError

llama3-sentiment-v2

08:21:09

×7