GET /v1/models endpoint. Always read IDs from the live response — IDs can change as the Gonka network adds or rotates capacity.
Currently available
These models are typically returned from production (verify withGET /v1/models):
| Model ID | Notes |
|---|---|
MiniMaxAI/MiniMax-M2.7 | Default recommendation — general chat and coding |
moonshotai/Kimi-K2.6 | Long-context and reasoning workloads |
| GLM 5.2 | Coming soon |
Default choice
Start with MiniMax M2.7 (MiniMaxAI/MiniMax-M2.7). It is the default in the Dahl chat UI and matches the examples on inference.dahl.global.
Using a model in chat completions
Set themodel field to the exact id string from GET /v1/models:
Streaming
Add"stream": true to the JSON body for token streaming. Dahl forwards streaming responses from the upstream network in OpenAI SSE format.
When a model fails
If you receive a 4xx or 502 mentioning the model:- Call
GET /v1/modelsagain - Switch to an
idpresent in the response - Retry with short backoff if the network reports overload (
503or timeouts)