$45
visiontools
MODALITIES
→
INPUT PRICE
$1
OUTPUT PRICE
$2
CONTEXT
200K
RELEASED
Nov 25, 2025
| Provider | Cache | Uptime | Chat | |||
|---|---|---|---|---|---|---|
| — | $1 | $2 | Cache read$0.1 |
Capabilities
Input modalities
fileimagetext
Output modalities
text
Features
include_reasoningmax_tokensreasoningresponse_formatstopstructured_outputstemperaturetool_choicetoolstop_kverbosity
1
Get your API key
Create an API key from the Tokens page, then set it as an environment variable:
export ONLIST_API_KEY=sk-...2
Make your first request
Endpoints
POST
https://onlist.io/v1/chat/completionsOpenAI Chat Completions format
Request Headers
Authorization:Bearer $ONLIST_API_KEY
Content-Type:application/json
Model:anthropic/claude-opus-4-5
POST
https://onlist.io/v1/responsesOpenAI Responses format
Request Headers
Authorization:Bearer $ONLIST_API_KEY
Content-Type:application/json
Model:anthropic/claude-opus-4-5
POST
https://onlist.io/v1/messagesAnthropic Messages format
Request Headers
Authorization:Bearer $ONLIST_API_KEY
Content-Type:application/json
Model:anthropic/claude-opus-4-5
Code samples
curl https://onlist.io/v1/chat/completions \
-H "Authorization: Bearer $ONLIST_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "anthropic/claude-opus-4-5",
"messages": [
{
"role": "user",
"content": "Explain quantum entanglement in one paragraph."
}
]
}'Replace $ONLIST_API_KEY with the API key from your token settings.
Authentication
All requests must include an Authorization: Bearer <TOKEN> header. Generate tokens from the Tokens page; tokens can be scoped to specific models, groups, IP ranges, and rate limits.
3
Enable streaming
Add "stream": true to receive partial responses as server-sent events in real time.
Streaming example
curl https://onlist.io/v1/chat/completions \
-H "Authorization: Bearer $ONLIST_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "anthropic/claude-opus-4-5",
"messages": [
{
"role": "user",
"content": "Write a haiku about recursion."
}
],
"stream": true
}'Supported parameters
| Name | Type | Description |
|---|---|---|
include_reasoning | ||
max_tokens | integer | Maximum number of tokens to generate in the completion. |
reasoning | ||
response_format | object | Specifies the output format. Use {"type": "json_object"} for JSON mode. |
stop | string | array | Up to 4 sequences where the API will stop generating tokens. |
structured_outputs | ||
temperature | number | Sampling temperature between 0 and 2. Higher values make output more random. |
tool_choice | string | object | Controls which tool is called. "auto", "none", "required", or a specific function. |
tools | array | A list of tools the model may call. Currently supports functions. |
top_k | integer | Limits token selection to the k most likely candidates at each step. |
verbosity |
These are the request parameters this model accepts. Parameter semantics follow the OpenAI Chat Completions specification.