Get latency histogram
GET /api/logs/histogram/latency
Returns time-bucketed latency statistics (average, p90, p95, p99).
Query Parameters
Comma-separated list of providers to filter by
Comma-separated list of models to filter by
Comma-separated list of statuses to filter by
Comma-separated list of object types to filter by
Comma-separated list of selected key IDs to filter by
Comma-separated list of virtual key IDs to filter by
Comma-separated list of routing rule IDs to filter by
Comma-separated list of routing engines to filter by
Start time filter (date-time, RFC3339 format)
End time filter (date-time, RFC3339 format)
Minimum latency filter
Maximum latency filter
Minimum tokens filter
Maximum tokens filter
Minimum cost filter
Maximum cost filter
Only show logs with missing cost
Search in request/response content
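The start and end time filters above expect RFC3339 timestamps. The following is a minimal sketch of one way to produce such strings in Python; the helper names `rfc3339` and `time_window` are hypothetical and not part of the API, and the query parameter names themselves are not shown here because they are not listed in this section.

```python
from datetime import datetime, timedelta, timezone


def rfc3339(moment: datetime) -> str:
    """Format a timezone-aware datetime as an RFC3339 timestamp in UTC."""
    if moment.tzinfo is None:
        # Naive datetimes are ambiguous, so refuse them instead of guessing.
        raise ValueError("expected a timezone-aware datetime")
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def time_window(end: datetime, hours: int) -> tuple[str, str]:
    """Return (start, end) RFC3339 strings covering the `hours` before `end`."""
    return rfc3339(end - timedelta(hours=hours)), rfc3339(end)


# Illustrative values only: a six-hour window ending at a fixed instant.
start, end = time_window(datetime(2019, 8, 24, 14, 15, 22, tzinfo=timezone.utc), 6)
print(start, end)
```

Converting to UTC before formatting keeps the output in the same `Z`-suffixed form used by the `timestamp` field in the sample response.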
curl -X GET "http://localhost:8080/api/logs/histogram/latency"
Response Body
application/json
{
  "buckets": [
    {
      "timestamp": "2019-08-24T14:15:22Z",
      "avg_latency": 0,
      "p90_latency": 0,
      "p95_latency": 0,
      "p99_latency": 0,
      "total_requests": 0
    }
  ],
  "bucket_size_seconds": 0
}

{
  "event_id": "string",
  "type": "string",
  "is_bifrost_error": true,
  "status_code": 0,
  "error": {
    "type": "string",
    "code": "string",
    "message": "string",
    "param": "string",
    "event_id": "string"
  },
  "extra_fields": {
    "provider": "openai",
    "model_requested": "string",
    "request_type": "string"
  }
}
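As a rough illustration of consuming the success response shown above, the sketch below parses a body with the documented fields (`buckets`, `avg_latency`, `p99_latency`, `total_requests`, `bucket_size_seconds`) and derives two summary values. The sample numbers are made up for the example, and the helper names `slowest_bucket` and `overall_avg_latency` are hypothetical, not part of the API.

```python
import json

# Made-up sample data in the documented response shape.
SAMPLE_BODY = """
{
  "buckets": [
    {"timestamp": "2019-08-24T14:00:00Z", "avg_latency": 100,
     "p90_latency": 250, "p95_latency": 300, "p99_latency": 400,
     "total_requests": 10},
    {"timestamp": "2019-08-24T15:00:00Z", "avg_latency": 200,
     "p90_latency": 500, "p95_latency": 700, "p99_latency": 900,
     "total_requests": 30}
  ],
  "bucket_size_seconds": 3600
}
"""


def slowest_bucket(histogram: dict):
    """Return the bucket with the highest p99 latency, or None if there are none."""
    buckets = histogram.get("buckets", [])
    if not buckets:
        return None
    return max(buckets, key=lambda bucket: bucket["p99_latency"])


def overall_avg_latency(histogram: dict) -> float:
    """Combine per-bucket averages into one average, weighted by request count."""
    buckets = histogram.get("buckets", [])
    total = sum(bucket["total_requests"] for bucket in buckets)
    if total == 0:
        return 0.0
    weighted = sum(b["avg_latency"] * b["total_requests"] for b in buckets)
    return weighted / total


histogram = json.loads(SAMPLE_BODY)
print(slowest_bucket(histogram)["timestamp"])
print(overall_avg_latency(histogram))
```

Only the average is recombined across buckets here. Percentiles such as p99 cannot be merged by averaging per-bucket values, so the sketch reports the worst bucket instead of an overall p99.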