Inference
The OpenAI-compatible inference surface. Chat completions is the primary endpoint and runs the full feature path — guardrails, prompts, A/B tests, caching.
- POST
/v1/chat/completionschat.completions.create() - POST
/v1/completionscompletions.create() - POST
/v1/embeddingsembeddings.create() - POST
/v1/images/generationsimages.generate() - POST
/v1/audio/speechaudio.speech.create() - POST
/v1/audio/transcriptionsaudio.transcriptions.create()