Skip to main content
Dahl exposes models through the OpenAI-compatible GET /v1/models endpoint. Always read IDs from the live response — IDs can change as the Gonka network adds or rotates capacity.
curl https://inference.dahl.global/v1/models

Currently available

These models are typically returned from production (verify with GET /v1/models):
Model IDNotes
MiniMaxAI/MiniMax-M2.7Default recommendation — general chat and coding
moonshotai/Kimi-K2.6Long-context and reasoning workloads
GLM 5.2Coming soon

Default choice

Start with MiniMax M2.7 (MiniMaxAI/MiniMax-M2.7). It is the default in the Dahl chat UI and matches the examples on inference.dahl.global.

Using a model in chat completions

Set the model field to the exact id string from GET /v1/models:
curl https://inference.dahl.global/v1/chat/completions \
  -H "Authorization: Bearer $DAHL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "MiniMaxAI/MiniMax-M2.7",
    "messages": [
      { "role": "user", "content": "Explain recursion in one paragraph." }
    ]
  }'

Streaming

Add "stream": true to the JSON body for token streaming. Dahl forwards streaming responses from the upstream network in OpenAI SSE format.

When a model fails

If you receive a 4xx or 502 mentioning the model:
  1. Call GET /v1/models again
  2. Switch to an id present in the response
  3. Retry with short backoff if the network reports overload (503 or timeouts)
See Quickstart for a full first-request walkthrough.