Get latency histogram

GET/api/logs/histogram/latency

Returns time-bucketed latency percentiles (avg, p90, p95, p99).

Query Parameters

providers?string

Comma-separated list of providers to filter by

models?string

Comma-separated list of models to filter by

status?string

Comma-separated list of statuses to filter by

objects?string

Comma-separated list of object types to filter by

selected_key_ids?string

Comma-separated list of selected key IDs to filter by

virtual_key_ids?string

Comma-separated list of virtual key IDs to filter by

routing_rule_ids?string

Comma-separated list of routing rule IDs to filter by

routing_engine_used?string

Comma-separated list of routing engines to filter by

start_time?string

Start time filter (RFC3339 format)

Formatdate-time

end_time?string

End time filter (RFC3339 format)

Formatdate-time

min_latency?number

Minimum latency filter

max_latency?number

Maximum latency filter

min_tokens?integer

Minimum tokens filter

max_tokens?integer

Maximum tokens filter

min_cost?number

Minimum cost filter

max_cost?number

Maximum cost filter

missing_cost_only?boolean

Only show logs with missing cost

content_search?string

Search in request/response content

Response Body

`application/json`

curl -X GET "http://localhost:8080/api/logs/histogram/latency"

{
  "buckets": [
    {
      "timestamp": "2019-08-24T14:15:22Z",
      "avg_latency": 0,
      "p90_latency": 0,
      "p95_latency": 0,
      "p99_latency": 0,
      "total_requests": 0
    }
  ],
  "bucket_size_seconds": 0
}

{
  "event_id": "string",
  "type": "string",
  "is_bifrost_error": true,
  "status_code": 0,
  "error": {
    "type": "string",
    "code": "string",
    "message": "string",
    "param": "string",
    "event_id": "string"
  },
  "extra_fields": {
    "provider": "openai",
    "model_requested": "string",
    "request_type": "string"
  }
}

{
  "event_id": "string",
  "type": "string",
  "is_bifrost_error": true,
  "status_code": 0,
  "error": {
    "type": "string",
    "code": "string",
    "message": "string",
    "param": "string",
    "event_id": "string"
  },
  "extra_fields": {
    "provider": "openai",
    "model_requested": "string",
    "request_type": "string"
  }
}

Get latency histogram

Query Parameters

Response Body

200application/json

400application/json

500application/json

`application/json`

`application/json`

`application/json`