Dashboard

Production

Real-time monitoring for your LLM operations

All systems operational

Total cost

$2,847-12.5%

Savings from cache: $847

Cache hit rate

94.2%+8.3%

12,480 requests cached

Avg latency

142ms-23%

P99: 892ms

Total requests

1.2M+15.2%

842 errors (0.07%)

Live activity

4,281 req/min

12:45:32|gpt-4-turbo

Summarize the quarterly financial report...

892ms

12:45:28|gpt-4-turboHIT

Generate product description for SKU-4521...

45ms

12:45:21|claude-3-sonnet

Analyze customer feedback sentiment...

1240ms

12:45:15|gpt-3.5-turbo

Translate support ticket to Spanish...

234ms

12:45:08|gpt-4-turboHIT

Extract entities from contract document...

38ms

12:44:59|claude-3-opus

Code review for authentication module...

Model distribution

108.2K total

gpt-4-turbo45,200

claude-3-sonnet28,300

gpt-3.5-turbo21,800

claude-3-opus12,900

Semantic Cache

Intelligent request matching

Saved

$847

Avg match

94ms

Quick actions

Create prompt

Add a new versioned prompt

Configure model

Set up a new LLM provider

View cache

Inspect semantic cache entries

Export report

Download usage analytics