Models - Dahl

Dahl exposes models through the OpenAI-compatible GET /v1/models endpoint. Always read IDs from the live response, because they can change as the Gonka network adds or rotates capacity.

curl https://inference.dahl.global/v1/models
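The same lookup can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: it assumes the response follows the usual OpenAI list shape (`{"object": "list", "data": [{"id": ...}]}`), and the helper names `extract_model_ids` and `fetch_model_ids` are illustrative.

```python
import json
import urllib.request

BASE_URL = "https://inference.dahl.global/v1"


def extract_model_ids(payload: dict) -> list[str]:
    """Return the `id` of every entry in an OpenAI-style model list."""
    # Assumed shape: {"object": "list", "data": [{"id": "..."}, ...]}
    return [entry["id"] for entry in payload.get("data", []) if "id" in entry]


def fetch_model_ids(base_url: str = BASE_URL, timeout: float = 10.0) -> list[str]:
    """Call GET /v1/models and return the IDs it lists."""
    with urllib.request.urlopen(f"{base_url}/models", timeout=timeout) as resp:
        return extract_model_ids(json.load(resp))
```

Calling `fetch_model_ids()` performs the network request; `extract_model_ids` can be exercised on a saved response without network access.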

Currently available

These models are typically returned from production (verify with GET /v1/models):

| Model ID | Notes |
| --- | --- |
| `MiniMaxAI/MiniMax-M2.7` | Default recommendation: general chat and coding |
| `moonshotai/Kimi-K2.6` | Long-context and reasoning workloads |
| GLM 5.2 | Coming soon |

Default choice

Start with MiniMax M2.7 (MiniMaxAI/MiniMax-M2.7). It is the default in the Dahl chat UI and matches the examples on inference.dahl.global.
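Because IDs can rotate, a client may want to confirm that the default is actually listed before using it. The sketch below shows one way to do that; falling back to the first listed ID is an assumption made for illustration, not documented Dahl behavior, and `choose_model` is a hypothetical helper.

```python
DEFAULT_MODEL = "MiniMaxAI/MiniMax-M2.7"


def choose_model(available_ids: list[str], preferred: str = DEFAULT_MODEL) -> str:
    """Return `preferred` if it is currently listed, else the first listed ID."""
    if not available_ids:
        raise ValueError("GET /v1/models returned no model IDs")
    if preferred in available_ids:
        return preferred
    # Assumption: any listed model is an acceptable substitute for the default.
    return available_ids[0]
```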

Using a model in chat completions

Set the model field to the exact id string from GET /v1/models:

curl https://inference.dahl.global/v1/chat/completions \
  -H "Authorization: Bearer $DAHL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "MiniMaxAI/MiniMax-M2.7",
    "messages": [
      { "role": "user", "content": "Explain recursion in one paragraph." }
    ]
  }'
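A rough Python equivalent of the curl call above, using only the standard library. The helper names are illustrative, and reading the reply from `choices[0].message.content` assumes the standard OpenAI chat completion response shape.

```python
import json
import urllib.request

BASE_URL = "https://inference.dahl.global/v1"


def build_chat_request(model: str, prompt: str, api_key: str,
                       base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST /v1/chat/completions request without sending it."""
    body = {
        "model": model,  # exact id string from GET /v1/models
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat(request: urllib.request.Request, timeout: float = 60.0) -> str:
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        payload = json.load(resp)
    # Assumed OpenAI-style response: choices[0].message.content
    return payload["choices"][0]["message"]["content"]
```

For example, `send_chat(build_chat_request("MiniMaxAI/MiniMax-M2.7", "Explain recursion in one paragraph.", os.environ["DAHL_API_KEY"]))` would mirror the curl request, given `import os` and a key in the environment.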

Streaming

Add "stream": true to the JSON body for token streaming. Dahl forwards streaming responses from the upstream network in OpenAI SSE format.
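To show what consuming that stream might look like, here is a small parser sketch. It assumes the usual OpenAI chunk layout, where each `data:` line carries JSON with `choices[].delta.content` and the stream ends with `data: [DONE]`; `iter_stream_text` is an illustrative name, not part of any Dahl SDK.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines until [DONE]."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # assumed end-of-stream sentinel
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

In practice the lines would come from the HTTP response body, decoded as UTF-8 one line at a time, and each yielded fragment can be printed as it arrives.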

When a model fails

If you receive a 4xx or 502 mentioning the model:

1. Call GET /v1/models again.
2. Switch to an id present in the response.
3. Retry with short backoff if the network reports overload (503 or timeouts).
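The recovery steps above can be sketched as a small retry loop. Everything here is an assumption for illustration: `send` and `list_models` are hypothetical callables supplied by the caller (`send(model)` returns an HTTP status and body, `list_models()` returns current IDs), and the attempt count, delays, and choice of the first listed ID as the replacement are arbitrary.

```python
import time
from typing import Callable


def complete_with_fallback(
    send: Callable[[str], tuple[int, str]],
    list_models: Callable[[], list[str]],
    model: str,
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> tuple[str, str]:
    """Return (model_used, body), switching model or backing off on failure."""
    for attempt in range(max_attempts):
        try:
            status, body = send(model)
        except TimeoutError:
            status, body = None, "timed out"
        if status is not None and status < 400:
            return model, body
        if status is None or status == 503:
            # Overload or timeout: wait briefly, doubling the delay each attempt.
            sleep(base_delay * 2 ** attempt)
            continue
        if status == 502 or 400 <= status < 500:
            # Re-read the live list and switch if our model is no longer there.
            ids = list_models()
            if ids and model not in ids:
                model = ids[0]
                continue
        raise RuntimeError(f"request failed with HTTP {status}: {body}")
    raise RuntimeError(f"gave up after {max_attempts} attempts")
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays. A 4xx for a model that is still listed is raised immediately, since switching IDs would not fix it.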

See Quickstart for a full first-request walkthrough.
