Skip to Content
API AccessChat Completions

Chat completions

POST /v1/chat/completions Content-Type: application/json Authorization: Bearer sk-steinkauz-live-...

Request and response bodies follow the OpenAI Chat Completions API  for supported fields. For parameter reference, streaming behavior, and response structure, see the OpenAI documentation:

Steinkauz supports a subset of OpenAI fields. Unsupported parameters return 400 invalid_request_error.

Non-streaming

curl -sS "$STEINKAUZ_BASE_URL/v1/chat/completions" \ -H "Content-Type: application/json" \ -H "Authorization: Bearer $STEINKAUZ_API_KEY" \ -d '{ "model": "steinkauz-gateway/openai/gpt-5.5", "messages": [{ "role": "user", "content": "Hello 👋" }] }'

Streaming

Set "stream": true. The response is text/event-stream with chat.completion.chunk objects, ending with data: [DONE]. See OpenAI streaming  for the event format.

curl -sS -N "$STEINKAUZ_BASE_URL/v1/chat/completions" \ -H "Content-Type: application/json" \ -H "Authorization: Bearer $STEINKAUZ_API_KEY" \ -d '{ "model": "steinkauz-gateway/openai/gpt-5.5", "messages": [{ "role": "user", "content": "Hello 👋" }], "stream": true }'

Tools

Function tools and tool_choice are supported where the underlying model and provider allow them. Behavior is consistent with the web chat.

OpenAI SDK (Python)

from openai import OpenAI client = OpenAI( api_key="sk-steinkauz-live-YOUR_SECRET", base_url="https://chat.steinkauz.ai/v1", ) completion = client.chat.completions.create( model="steinkauz-gateway/openai/gpt-5.5", messages=[{"role": "user", "content": "Hello 👋"}], ) print(completion.choices[0].message.content)

OpenAI SDK (JavaScript)

import OpenAI from "openai"; const client = new OpenAI({ apiKey: process.env.STEINKAUZ_API_KEY, baseURL: "https://chat.steinkauz.ai/v1", }); const completion = await client.chat.completions.create({ model: "steinkauz-gateway/openai/gpt-5.5", messages: [{ role: "user", content: "Hello 👋" }], }); console.log(completion.choices[0]?.message?.content);

Usage tracking

Successful completions record inference usage against your account and API key. View breakdowns under Settings → Usage. On Gateway plans, usage counts toward your shared inference budget; on BYOK plans, usage is recorded for reference while you are billed by your providers directly.

Last updated on