Create image
/openai/v1/images/generationsGenerates images from text prompts using OpenAI-compatible format.
Note: Azure OpenAI deployments are also supported via the Azure integration endpoint.
Note: This endpoint also works without the /v1 prefix (e.g., /openai/images/generations).
Request Body
application/json
TypeScript Definitions
Use the request body type in TypeScript.
Response Body
application/json
application/json
curl -X POST "http://localhost:8080/openai/v1/images/generations" \ -H "Content-Type: application/json" \ -d '{ "model": "string", "prompt": "string" }'{
"created": 0,
"data": [
{
"url": "http://example.com",
"b64_json": "string",
"revised_prompt": "string",
"index": 0
}
],
"background": "string",
"output_format": "string",
"quality": "string",
"size": "string",
"usage": {
"input_tokens": 0,
"input_tokens_details": {
"image_tokens": 0,
"text_tokens": 0
},
"total_tokens": 0,
"output_tokens": 0,
"output_tokens_details": {
"image_tokens": 0,
"text_tokens": 0
}
}
}{
"event_id": "string",
"type": "string",
"is_bifrost_error": true,
"status_code": 0,
"error": {
"type": "string",
"code": "string",
"message": "string",
"param": "string",
"event_id": "string"
},
"extra_fields": {
"provider": "openai",
"model_requested": "string",
"request_type": "string"
}
}{
"event_id": "string",
"type": "string",
"is_bifrost_error": true,
"status_code": 0,
"error": {
"type": "string",
"code": "string",
"message": "string",
"param": "string",
"event_id": "string"
},
"extra_fields": {
"provider": "openai",
"model_requested": "string",
"request_type": "string"
}
}Create embeddings (OpenAI format) POST
Creates embedding vectors for the input text. Note: This endpoint also works without the /v1 prefix (e.g., /openai/embeddings).
Create response (OpenAI Responses API) POST
Creates a response using OpenAI Responses API format. Supports streaming via SSE. Async inference: Send x-bf-async: true to submit the request as a background job and receive a job ID immediately. Poll with x-bf-async-id: <job-id> to retrieve the result. When the job is still processing, the response status will not be completed. When completed, the full response with output_text will be returned. See Async Inference for details. Note: This endpoint also works without the /v1 prefix (e.g., /openai/responses).