Data Browser

This page lists LLM providers and their models, with context limits, pricing, capabilities, and release details. It is generated automatically from API data.

Statistics

Provider Count: 149    Model Count: 5404    Last Updated: 6/15/2026, 5:31:32 AM

Capabilities Legend: 🧠 Reasoning   🔧 Tools   📎 Attachment   🌡️ Temperature
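
The NewAPI Ratios values in the tables below appear to be derived from the listed prices: for most rows, Model equals the input price divided by 2, Completion equals the output price divided by the input price, and Cache equals the Cache Read price divided by the input price. The Python sketch below reproduces that relationship. The function name `newapi_ratios` and the $2-per-1M-token baseline are assumptions inferred from the rows on this page, not taken from any official documentation.

```python
# Assumption inferred from the tables: a Model ratio of 1.0 corresponds
# to an input price of $2 per 1M tokens.
BASE_USD_PER_1M = 2.0


def newapi_ratios(input_usd, output_usd, cache_read_usd=None):
    """Derive ratio values from per-1M-token USD prices.

    Returns None for free models (input price 0), which the tables
    show as "-" instead of ratios.
    """
    if input_usd == 0:
        return None
    ratios = {
        "model": round(input_usd / BASE_USD_PER_1M, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# Sample prices copied from the gpt-5.4-mini-2026-03-17 and
# claude-opus-4-7 rows of the 302.AI table.
print(newapi_ratios(0.75, 4.5))
print(newapi_ratios(5, 25, cache_read_usd=0.5))
```

A few rows that also list an Input Audio price (for example Gemini 2.5 Flash under AIHubMix) show ratios that seem to be computed from the audio price instead of the text input price, so a check like this one can help spot such rows.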

302.AI

📖 API Address | 📚 Official Documentation

Model | Model ID | Context | Output | Pricing (USD per 1M tokens) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
gpt-5.4-mini-2026-03-17 gpt-5.4-mini-2026-03-17 400K 128K Input: $0.75
Output: $4.5
Model: 0.375
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-19
chatgpt-4o-latest chatgpt-4o-latest 128K 16.4K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-08-08
gpt-5.4-nano-2026-03-17 gpt-5.4-nano-2026-03-17 400K 128K Input: $0.2
Output: $1.25
Model: 0.100
Completion: 6.250
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-19
kimi-k2-0905-preview kimi-k2-0905-preview 262.1K 262.1K Input: $0.632
Output: $2.53
Model: 0.316
Completion: 4.003
🔧 🌡️ 2025-06 In: text
Out: text
Released: 2025-09-05
grok-4.20-beta-0309-non-reasoning grok-4.20-beta-0309-non-reasoning 2M 30K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-16
gemini-2.5-flash-nothink gemini-2.5-flash-nothink 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-06-24
Qwen-Plus qwen-plus 1M 32.8K Input: $0.12
Output: $1.2
Model: 0.060
Completion: 10.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-07-23
glm-4.7 glm-4.7 204.8K 131.1K Input: $0.286
Output: $1.142
Model: 0.143
Completion: 3.993
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
qwen3-235b-a22b-instruct-2507 qwen3-235b-a22b-instruct-2507 128K 65.5K Input: $0.29
Output: $1.143
Model: 0.145
Completion: 3.941
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-30
GLM-4.5V glm-4.5v 64K 16.4K Input: $0.29
Output: $0.86
Model: 0.145
Completion: 2.966
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-12
claude-opus-4-5 claude-opus-4-5 200K 64K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-25
gemini-2.5-pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-06-17
gpt-5 gpt-5 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-08
claude-haiku-4-5-20251001 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-16
kimi-k2-thinking-turbo kimi-k2-thinking-turbo 262.1K 262.1K Input: $1.265
Output: $9.119
Model: 0.632
Completion: 7.209
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Released: 2025-09-05
claude-3-5-haiku-20241022 claude-3-5-haiku-20241022 200K 8.2K Input: $0.8
Output: $4
Model: 0.400
Completion: 5.000
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
GLM-4.5 glm-4.5 131.1K 98.3K Input: $0.286
Output: $1.142
Model: 0.143
Completion: 3.993
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
gpt-5-pro gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-08
grok-4.20-beta-0309-reasoning grok-4.20-beta-0309-reasoning 2M 30K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-16
gemini-2.5-flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-06-17
gpt-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
MiniMax-M2.1 MiniMax-M2.1 1M 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-12-19
gemini-2.5-flash-lite-preview-09-2025 gemini-2.5-flash-lite-preview-09-2025 1M 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-09-26
doubao-seed-1-6-vision-250815 doubao-seed-1-6-vision-250815 256K 32K Input: $0.114
Output: $1.143
Model: 0.057
Completion: 10.026
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-09-30
claude-opus-4-1-20250805 claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
qwen3-max-2025-09-23 qwen3-max-2025-09-23 258K 65.5K Input: $0.86
Output: $3.43
Model: 0.430
Completion: 3.988
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-24
glm-4.7-flashx glm-4.7-flashx 200K 131.1K Input: $0.0715
Output: $0.429
Model: 0.036
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-20
glm-5.1 glm-5.1 200K 131.1K Input: $0.86
Output: $3.5
Model: 0.430
Completion: 4.070
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-10
glm-4.6 glm-4.6 204.8K 131.1K Input: $0.286
Output: $1.142
Model: 0.143
Completion: 3.993
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
kimi-k2-thinking kimi-k2-thinking 262.1K 262.1K Input: $0.575
Output: $2.3
Model: 0.287
Completion: 4.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Released: 2025-09-05
claude-sonnet-4-5 claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-30
glm-4.5-x glm-4.5-x 128K 16.4K Input: $1.143
Output: $2.29
Model: 0.572
Completion: 2.003
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-29
DeepSeek-V3.2-Thinking deepseek-v3.2-thinking 128K 128K Input: $0.29
Output: $0.43
Model: 0.145
Completion: 1.483
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-12-01
claude-sonnet-4-6-thinking claude-sonnet-4-6-thinking 1M 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08 In: text, image, pdf
Out: text
Released: 2026-02-18
Updated: 2026-03-13
claude-opus-4-7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-17
grok-4-1-fast-non-reasoning grok-4-1-fast-non-reasoning 2M 30K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-11-20
gpt-5.4-nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Model: 0.100
Completion: 6.250
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-19
claude-opus-4-1-20250805-thinking claude-opus-4-1-20250805-thinking 200K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03 In: text, image
Out: text
Released: 2025-05-27
glm-4.5-airx glm-4.5-airx 128K 16.4K Input: $0.572
Output: $1.714
Model: 0.286
Completion: 2.997
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-29
grok-4.1 grok-4.1 200K 64K Input: $2
Output: $10
Model: 1.000
Completion: 5.000
📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-11-18
gemini-2.5-flash-preview-09-2025 gemini-2.5-flash-preview-09-2025 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-09-26
claude-opus-4-5-20251101 claude-opus-4-5-20251101 200K 64K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-25
claude-opus-4-20250514 claude-opus-4-20250514 200K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
gemini-3-pro-image-preview gemini-3-pro-image-preview 32.8K 64K Input: $2
Output: $120
Model: 1.000
Completion: 60.000
📎 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-11-20
gemini-2.5-flash-image gemini-2.5-flash-image 32.8K 32.8K Input: $0.3
Output: $30
Model: 0.150
Completion: 100.000
📎 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-10-08
glm-for-coding glm-for-coding 200K 131.1K Input: $0.086
Output: $0.343
Model: 0.043
Completion: 3.988
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-30
gpt-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-12
Qwen-Flash qwen-flash 1M 32.8K Input: $0.022
Output: $0.22
Model: 0.011
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
claude-opus-4-6-thinking claude-opus-4-6-thinking 1M 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-05 In: text, image, pdf
Out: text
Released: 2026-02-06
Updated: 2026-03-13
claude-sonnet-4-20250514 claude-sonnet-4-20250514 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
gpt-5.1-chat-latest gpt-5.1-chat-latest 128K 16.4K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-14
gpt-5.2-chat-latest gpt-5.2-chat-latest 128K 16.4K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-12
grok-4-fast-reasoning grok-4-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-09-23
gpt-4.1-nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
claude-sonnet-4-5-20250929-thinking claude-sonnet-4-5-20250929-thinking 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03 In: text, image, pdf
Out: text
Released: 2025-09-30
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $4.8
Model: 0.300
Completion: 8.000
🔧 🌡️ - In: text
Out: text
Released: 2026-03-19
MiniMax-M2 MiniMax-M2 1M 128K Input: $0.33
Output: $1.32
Model: 0.165
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-10-26
gemini-3.1-flash-image-preview gemini-3.1-flash-image-preview 131.1K 32.8K Input: $0.5
Output: $60
Model: 0.250
Completion: 120.000
📎 🌡️ 2025-01 In: text, image, pdf
Out: text, image
Released: 2026-02-27
Qwen3-235B-A22B qwen3-235b-a22b 128K 16.4K Input: $0.29
Output: $2.86
Model: 0.145
Completion: 9.862
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-29
ministral-14b-2512 ministral-14b-2512 128K 128K Input: $0.33
Output: $0.33
Model: 0.165
Completion: 1.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2025-12-16
GLM-4.6V glm-4.6v 128K 32.8K Input: $0.145
Output: $0.43
Model: 0.072
Completion: 2.966
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
claude-haiku-4-5 claude-haiku-4-5 200K 64K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-16
gpt-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Cache Write: $0
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
doubao-seed-1-6-thinking-250715 doubao-seed-1-6-thinking-250715 256K 16K Input: $0.121
Output: $1.21
Model: 0.060
Completion: 10.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-07-15
gpt-5.4-mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Model: 0.375
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-19
gpt-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
doubao-seed-code-preview-251028 doubao-seed-code-preview-251028 256K 32K Input: $0.17
Output: $1.14
Model: 0.085
Completion: 6.706
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-11-11
claude-opus-4-6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-06
Updated: 2026-03-13
qwen3-coder-480b-a35b-instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $0.86
Output: $3.43
Model: 0.430
Completion: 3.988
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
claude-sonnet-4-5-20250929 claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-30
Deepseek-Reasoner deepseek-reasoner 128K 128K Input: $0.29
Output: $0.43
Model: 0.145
Completion: 1.483
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
grok-4-1-fast-reasoning grok-4-1-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-11-20
gemini-3-pro-preview gemini-3-pro-preview 1M 64K Input: $2
Output: $12
Model: 1.000
Completion: 6.000
📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-11-19
gpt-5-thinking gpt-5-thinking 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-08-08
gpt-5-mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-08
gpt-4.1-mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Model: 0.200
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GLM-5V-Turbo glm-5v-turbo 200K 131.1K Input: $0.72
Output: $3.2
Model: 0.360
Completion: 4.444
📎 🧠 🔧 🌡️ - In: text, image, video, audio, pdf
Out: text
Released: 2026-04-02
gpt-5.4-pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Cache Read: $0
Cache Write: $0
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
glm-4.5-air glm-4.5-air 131.1K 98.3K Input: $0.1143
Output: $0.286
Model: 0.057
Completion: 2.502
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
claude-sonnet-4-6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-18
Updated: 2026-03-13
grok-4-fast-non-reasoning grok-4-fast-non-reasoning 2M 30K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-09-23
gemini-3-flash-preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-12-18
Deepseek-Chat deepseek-chat 128K 8.2K Input: $0.29
Output: $0.43
Model: 0.145
Completion: 1.483
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-11-29
MiniMax-M1 MiniMax-M1 1M 128K Input: $0.132
Output: $1.254
Model: 0.066
Completion: 9.500
🔧 🌡️ - In: text
Out: text
Released: 2025-06-16
grok-4.20-multi-agent-beta-0309 grok-4.20-multi-agent-beta-0309 2M 30K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-16
glm-5 glm-5 204.8K 131.1K Input: $0.6
Output: $2.6
Model: 0.300
Completion: 4.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Qwen-Max-Latest qwen-max-latest 131.1K 8.2K Input: $0.343
Output: $1.372
Model: 0.172
Completion: 4.000
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
mistral-large-2512 mistral-large-2512 128K 262.1K Input: $1.1
Output: $3.3
Model: 0.550
Completion: 3.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2025-12-16
MiniMax-M2.7 MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2026-03-19
claude-3-5-haiku-latest claude-3-5-haiku-latest 200K 8.2K Input: $0.8
Output: $4
Model: 0.400
Completion: 5.000
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Qwen3-30B-A3B qwen3-30b-a3b 128K 8.2K Input: $0.11
Output: $1.08
Model: 0.055
Completion: 9.818
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-29
deepseek-v3.2 deepseek-v3.2 128K 8.2K Input: $0.29
Output: $0.43
Model: 0.145
Completion: 1.483
🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-12-01
doubao-seed-1-8-251215 doubao-seed-1-8-251215 224K 64K Input: $0.114
Output: $0.286
Model: 0.057
Completion: 2.509
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-18
claude-opus-4-5-20251101-thinking claude-opus-4-5-20251101-thinking 200K 64K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03 In: text, image
Out: text
Released: 2025-11-25
gemini-2.0-flash-lite gemini-2.0-flash-lite 2M 8.2K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🌡️ 2024-11 In: text, image
Out: text
Released: 2025-06-16
glm-5-turbo glm-5-turbo 200K 131.1K Input: $0.72
Output: $3.2
Model: 0.360
Completion: 4.444
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16
gpt-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-14
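
Prices in the Pricing (1M) column are quoted per one million tokens, so the cost of a single request is each token count multiplied by its price and divided by 1,000,000. A minimal sketch follows, using the claude-opus-4-7 row above as sample data. The function name `request_cost_usd`, the token counts, and the assumption that cached input tokens are billed at the Cache Read price (with Cache Write ignored) are illustrative and not taken from any provider's billing documentation.

```python
TOKENS_PER_UNIT = 1_000_000  # prices are quoted per 1M tokens


def request_cost_usd(prices, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request.

    prices: dict with 'input' and 'output' per-1M-token prices and an
    optional 'cache_read' price. Assumption: cached input tokens are
    billed at 'cache_read' (falling back to 'input' if it is absent).
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    cache_price = prices.get("cache_read", prices["input"])
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * prices["input"]
        + cached_tokens * cache_price
        + output_tokens * prices["output"]
    )
    return total / TOKENS_PER_UNIT


# Prices copied from the claude-opus-4-7 row; token counts are made up.
opus_4_7 = {"input": 5.0, "output": 25.0, "cache_read": 0.5}
cost = request_cost_usd(opus_4_7, input_tokens=100_000,
                        output_tokens=2_000, cached_tokens=80_000)
print(f"${cost:.2f}")
```

With these sample numbers, 20,000 uncached input tokens cost $0.10, 80,000 cached tokens cost $0.04, and 2,000 output tokens cost $0.05.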

Abacus

📖 API Address | 📚 Official Documentation

Model | Model ID | Context | Output | Pricing (USD per 1M tokens) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
o3 o3 200K 100K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Route LLM route-llm 128K 16.4K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-01-01
Grok Code Fast 1 grok-code-fast-1 256K 16.4K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-09-01
GPT-5.3 Codex XHigh gpt-5.3-codex-xhigh 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Llama 3.3 70B Versatile llama-3.3-70b-versatile 128K 32.8K Input: $0.59
Output: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-06
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-25
GPT-5 gpt-5 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Qwen3 Max qwen3-max 131.1K 16.4K Input: $1.2
Output: $6
Model: 0.600
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
o3-pro o3-pro 200K 100K Input: $20
Output: $40
Model: 10.000
Completion: 2.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-06-10
Claude Opus 4.1 claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2025-08-05
Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning 2M 16.4K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-11-17
Claude Opus 4.5 claude-opus-4-5-20251101 200K 64K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Codex Max gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Claude Opus 4 claude-opus-4-20250514 200K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2025-05-14
GPT-5.3 Chat Latest gpt-5.3-chat-latest 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-01
o3-mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
Kimi K2 Turbo Preview kimi-k2-turbo-preview 256K 8.2K Input: $0.15
Output: $8
Model: 0.075
Completion: 53.333
🔧 🌡️ - In: text
Out: text
Released: 2025-07-08
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Grok 4 grok-4-0709 256K 16.4K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-07-09
Claude Sonnet 4 claude-sonnet-4-20250514 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2025-05-14
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
GPT-5.1 Chat Latest gpt-5.1-chat-latest 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 Chat Latest gpt-5.2-chat-latest 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2026-01-01
GPT-4.1 Nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o (2024-11-20) gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🔧 🌡️ 2024-10 In: text, image, audio
Out: text
Released: 2024-11-20
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image, pdf
Out: text
Released: 2025-02-19
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Model: 1.250
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Model: 1.000
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Opus 4.6 claude-opus-4-6 200K 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Qwen 2.5 Coder 32B qwen-2.5-coder-32b 128K 8.2K Input: $0.79
Output: $0.79
Model: 0.395
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-11
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
GPT-5 Mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 Mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Model: 0.200
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Model: 0.025
Completion: 8.000
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Claude Sonnet 4.6 claude-sonnet-4-6 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning 2M 16.4K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-07-09
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-12-17
GPT-4o Mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-07-18
GPT-5 Codex gpt-5-codex 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $1
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, audio, video, pdf
Out: text
Released: 2026-03-01
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Llama 4 Maverick 17B 128E Instruct FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 1M 32.8K Input: $0.14
Output: $0.59
Model: 0.070
Completion: 4.214
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 3.1 405B Instruct Turbo meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo 128K 4.1K Input: $3.5
Output: $3.5
Model: 1.750
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 3.1 8B Instruct meta-llama/Meta-Llama-3.1-8B-Instruct 128K 4.1K Input: $0.02
Output: $0.05
Model: 0.010
Completion: 2.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Qwen3 Coder 480B A35B Instruct Qwen/qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $0.29
Output: $1.2
Model: 0.145
Completion: 4.138
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-22
QwQ 32B Qwen/QwQ-32B 32.8K 32.8K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-28
Qwen3 235B A22B Instruct Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 8.2K Input: $0.13
Output: $0.6
Model: 0.065
Completion: 4.615
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-01
Qwen3 32B Qwen/Qwen3-32B 128K 8.2K Input: $0.09
Output: $0.29
Model: 0.045
Completion: 3.222
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Qwen 2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct 128K 8.2K Input: $0.11
Output: $0.38
Model: 0.055
Completion: 3.455
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-19
GPT-OSS 120B openai/gpt-oss-120b 128K 32.8K Input: $0.08
Output: $0.44
Model: 0.040
Completion: 5.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-05
GLM-4.7 zai-org/glm-4.7 128K 8.2K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-01
GLM-4.5 zai-org/glm-4.5 128K 8.2K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.6 zai-org/glm-4.6 128K 8.2K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-01
GLM-5 zai-org/glm-5 204.8K 131.1K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
DeepSeek R1 deepseek-ai/DeepSeek-R1 128K 8.2K Input: $3
Output: $7
Model: 1.500
Completion: 2.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus 128K 8.2K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-01
DeepSeek V3.2 deepseek-ai/DeepSeek-V3.2 128K 8.2K Input: $0.27
Output: $0.4
Model: 0.135
Completion: 1.481
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-15
DeepSeek V3.1 deepseek/deepseek-v3.1 128K 8.2K Input: $0.55
Output: $1.66
Model: 0.275
Completion: 3.018
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20

abliteration.ai

📖 API Address | 📚 Official Documentation

Model | Model ID | Context | Output | Pricing (USD per 1M tokens) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
Abliterated Model abliterated-model 150K 8.2K Input: $3
Output: $3
Model: 1.500
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-06

AIHubMix

📚 Official Documentation

Model | Model ID | Context | Output | Pricing (USD per 1M tokens) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
Coding MiniMax M2.7 coding-minimax-m2.7 204.8K 128.1K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
GLM-5.1 (Alibaba Cloud) alicloud-glm-5.1 200K 128K Input: $0.84
Output: $3.38
Cache Read: $0.169
Cache Write: $1.05625
Model: 0.420
Completion: 4.024
Cache: 0.201
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
Claude Sonnet 4.6 Thinking claude-sonnet-4-6-think 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Gemini 3.1 Flash Lite gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $1
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2026-05-07
DeepSeek V4 Flash (Alibaba Cloud) alicloud-deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Xiaomi MiMo-V2.5-Pro xiaomi-mimo-v2.5-pro 1M 131.1K Input: $1.1
Output: $3.3
Cache Read: $0.22
Model: 0.550
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
Doubao Seed 2.0 Code Preview doubao-seed-2-0-code-preview 256K 128K Input: $0.48
Output: $2.41
Cache Read: $0.09644
Model: 0.240
Completion: 5.021
Cache: 0.201
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
Coding Xiaomi MiMo-V2.5-Pro coding-xiaomi-mimo-v2.5-pro 1M 131.1K Input: $0.2
Output: $0.6
Cache Read: $0.04
Model: 0.100
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
Doubao Seed 2.0 Pro doubao-seed-2-0-pro 256K 128K Input: $0.48
Output: $2.41
Cache Read: $0.09644
Model: 0.240
Completion: 5.021
Cache: 0.201
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
DeepSeek V4 Flash (DeepSeek) deep-deepseek-v4-flash 1M 384K Input: $0.154
Output: $0.308
Cache Read: $0.0308
Model: 0.077
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Grok 4.3 grok-4.3 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-01
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Xiaomi MiMo-V2.5 (free) xiaomi-mimo-v2.5-free 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
Claude Opus 4.7 Thinking claude-opus-4-7-think 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Coding GLM 5.1 (free) coding-glm-5.1-free 200K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-11
Xiaomi MiMo-V2.5-Pro (free) xiaomi-mimo-v2.5-pro-free 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
Xiaomi MiMo-V2.5 xiaomi-mimo-v2.5 1M 131.1K Input: $0.44
Output: $2.2
Cache Read: $0.088
Model: 0.220
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Qwen3.6 Flash qwen3.6-flash 991K 64K Input: $0.17
Output: $1.01
Cache Read: $0.0169
Cache Write: $0.21125
Model: 0.085
Completion: 5.941
Cache: 0.099
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
Coding Xiaomi MiMo-V2.5 coding-xiaomi-mimo-v2.5 1M 131.1K Input: $0.08
Output: $0.4
Cache Read: $0.016
Model: 0.040
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-13
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2026-02-19
Doubao Seed 2.0 Mini 260428 doubao-seed-2-0-mini-260428 256K 128K Input: $0.03
Output: $0.28
Cache Read: $0.00564
Input Audio: $0.423
Model: 0.211
Completion: 0.662
Cache: 0.013
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-28
Doubao Seed 2.0 Lite 260428 doubao-seed-2-0-lite-260428 256K 128K Input: $0.08
Output: $0.51
Cache Read: $0.01692
Input Audio: $1.269
Model: 0.634
Completion: 0.402
Cache: 0.013
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-28
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
MiniMax M2.7 minimax-m2.7 204.8K 128K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Claude Opus 4.6 Thinking claude-opus-4-6-think 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GPT-5.1 Codex mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Coding GLM 5.1 coding-glm-5.1 200K 128K Input: $0.06
Output: $0.22
Cache Read: $0.013
Model: 0.030
Completion: 3.667
Cache: 0.217
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-11
Coding MiniMax M2.7 (Free) coding-minimax-m2.7-free 204.8K 128.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Kimi K2.6 kimi-k2.6 262.1K 32.8K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2026-02-19
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
DeepSeek V4 Pro (DeepSeek) deep-deepseek-v4-pro 1M 384K Input: $0.478
Output: $0.956
Cache Read: $0.004302
Model: 0.239
Completion: 2.000
Cache: 0.009
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
GLM 5 Vision Turbo glm-5v-turbo 200K 128K Input: $0.7042
Output: $3.09848
Cache Read: $0.169008
Model: 0.352
Completion: 4.400
Cache: 0.240
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-05-09
GLM-5.1 (Z.ai) zai-glm-5.1 200K 128K Input: $0.845
Output: $3.38
Cache Read: $0.183112
Model: 0.422
Completion: 4.000
Cache: 0.217
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-12-17
Coding MiniMax M2.7 Highspeed coding-minimax-m2.7-highspeed 204.8K 128.1K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.6 Max Preview qwen3.6-max-preview 240K 64K Input: $1.27
Output: $7.61
Cache Read: $0.1268
Cache Write: $1.585
Model: 0.635
Completion: 5.992
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-05-09
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
Qwen3.6 Plus qwen3.6-plus 991K 64K Input: $0.28
Output: $1.69
Cache Read: $0.0282
Cache Write: $0.3525
Model: 0.140
Completion: 6.036
Cache: 0.101
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-05-09
DeepSeek V4 Pro (Alibaba Cloud) alicloud-deepseek-v4-pro 1M 384K Input: $1.69
Output: $3.38
Cache Read: $0.13
Model: 0.845
Completion: 2.000
Cache: 0.077
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
GPT-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
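
Note on the NewAPI Ratios column: for text-only rows, the listed ratios look consistent with a simple derivation from the Pricing (1M) column. The sketch below is an inference from the rows above, not a documented formula. It assumes a model ratio of 1.0 corresponds to $2 per 1M input tokens, and that the completion and cache ratios are the output and cache-read prices divided by the input price. The function name `newapi_ratios` and the constant `BASE_PRICE_PER_1M` are hypothetical. Rows that list audio prices appear to use a different basis and are not covered, and free models (shown as "-") return no ratios.

```python
from typing import Optional

# Assumption: a model ratio of 1.0 corresponds to $2 per 1M input tokens.
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Optional[dict]:
    """Derive ratios from per-1M-token USD prices (text-only rows).

    Returns None for free models, which the tables show as "-".
    """
    if input_price <= 0:
        return None
    ratios = {
        # Input price relative to the assumed $2 base.
        "model": round(input_price / BASE_PRICE_PER_1M, 3),
        # Output price relative to input price.
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        # Cache-read price relative to input price.
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# GPT-5.5 row above: Input $5, Output $30, Cache Read $0.5
print(newapi_ratios(5.0, 30.0, 0.5))
```

For the GPT-5.5 row this gives model 2.5, completion 6.0 and cache 0.1, matching the listed values. The Doubao Seed 2.0 Pro row (Input $0.48, Output $2.41, Cache Read $0.09644) gives 0.24, 5.021 and 0.201 in the same way.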

Alibaba

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3-Omni Flash qwen3-omni-flash 65.5K 16.4K Input: $0.43
Output: $1.66
Input Audio: $3.81
Output Audio: $15.11
Model: 1.905
Completion: 3.966
🧠 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen Plus qwen-plus 1M 32.8K Input: $0.4
Output: $1.2
Reasoning: $4
Model: 0.200
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 65.5K Input: $0.45
Output: $2.25
Model: 0.225
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime 65.5K 16.4K Input: $0.52
Output: $1.99
Input Audio: $4.57
Output Audio: $18.13
Model: 2.285
Completion: 3.967
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
Qwen3 32B qwen3-32b 131.1K 16.4K Input: $0.7
Output: $2.8
Reasoning: $8.4
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime 32.8K 2K Input: $0.27
Output: $1.07
Input Audio: $4.44
Output Audio: $8.89
Model: 2.220
Completion: 2.002
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-05-08
Qwen Plus Character (Japanese) qwen-plus-character-ja 8.2K 512 Input: $0.5
Output: $1.4
Model: 0.250
Completion: 2.800
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K Input: $0.5
Output: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3.7 Plus qwen3.7-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-06-02
Updated: 2026-06-04
Qwen3.6 35B-A3B qwen3.6-35b-a3b 262.1K 65.5K Input: $0.248
Output: $1.485
Model: 0.124
Completion: 5.988
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-17
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3 Max qwen3-max 262.1K 65.5K Input: $1.2
Output: $6
Model: 0.600
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen2.5-Omni 7B qwen2-5-omni-7b 32.8K 2K Input: $0.1
Output: $0.4
Input Audio: $6.76
Model: 3.380
Completion: 0.059
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Open Weights
Released: 2024-12
Qwen3 8B qwen3-8b 131.1K 8.2K Input: $0.18
Output: $0.7
Reasoning: $2.1
Model: 0.090
Completion: 3.889
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5 14B Instruct qwen2-5-14b-instruct 131.1K 8.2K Input: $0.35
Output: $1.4
Model: 0.175
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.5
Output: $6
Model: 0.250
Completion: 12.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
QVQ Max qvq-max 131.1K 8.2K Input: $1.2
Output: $4.8
Model: 0.600
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-03-25
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0.1875
Output: $1.125
Cache Write: $0.234375
Model: 0.094
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct 131.1K 8.2K Input: $2.8
Output: $8.4
Model: 1.400
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3-VL Plus qwen3-vl-plus 262.1K 32.8K Input: $0.2
Output: $1.6
Reasoning: $4.8
Model: 0.100
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-09-23
Qwen-VL OCR qwen-vl-ocr 34.1K 4.1K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-28
Updated: 2025-04-13
Qwen-MT Turbo qwen-mt-turbo 16.4K 8.2K Input: $0.16
Output: $0.49
Model: 0.080
Completion: 3.063
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen-MT Plus qwen-mt-plus 16.4K 8.2K Input: $2.46
Output: $7.37
Model: 1.230
Completion: 2.996
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen3.5 Plus qwen3.5-plus 1M 65.5K Input: $0.4
Output: $2.4
Reasoning: $2.4
Model: 0.200
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Qwen-Omni Turbo qwen-omni-turbo 32.8K 2K Input: $0.07
Output: $0.27
Input Audio: $4.44
Output Audio: $8.89
Model: 2.220
Completion: 2.002
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-01-19
Updated: 2025-03-26
Qwen2.5 72B Instruct qwen2-5-72b-instruct 131.1K 8.2K Input: $1.4
Output: $5.6
Model: 0.700
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen Flash qwen-flash 1M 32.8K Input: $0.05
Output: $0.4
Model: 0.025
Completion: 8.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-07-28
Qwen3-VL 235B-A22B qwen3-vl-235b-a22b 131.1K 32.8K Input: $0.7
Output: $2.8
Reasoning: $8.4
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen3-VL 30B-A3B qwen3-vl-30b-a3b 131.1K 32.8K Input: $0.2
Output: $0.8
Reasoning: $2.4
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen-VL Max qwen-vl-max 131.1K 8.2K Input: $0.8
Output: $3.2
Model: 0.400
Completion: 4.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-04-08
Updated: 2025-08-13
Qwen3.5 27B qwen3.5-27b 262.1K 65.5K Input: $0.3
Output: $2.4
Model: 0.150
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen Max qwen-max 32.8K 8.2K Input: $1.6
Output: $6.4
Model: 0.800
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
Qwen3 235B-A22B qwen3-235b-a22b 131.1K 16.4K Input: $0.7
Output: $2.8
Reasoning: $8.4
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-LiveTranslate Flash Realtime qwen3-livetranslate-flash-realtime 53.2K 4.1K Input: $10
Output: $10
Input Audio: $10
Output Audio: $38
Model: 5.000
Completion: 3.800
🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-22
Qwen3.6 27B qwen3.6-27b 262.1K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-22
Qwen3.5 35B-A3B qwen3.5-35b-a3b 262.1K 65.5K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
QwQ Plus qwq-plus 131.1K 8.2K Input: $0.8
Output: $2.4
Model: 0.400
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-03-05
Qwen2.5 32B Instruct qwen2-5-32b-instruct 131.1K 8.2K Input: $0.7
Output: $2.8
Model: 0.350
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3.5 397B-A17B qwen3.5-397b-a17b 262.1K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-15
Qwen3 Coder Flash qwen3-coder-flash 1M 65.5K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
Qwen3 14B qwen3-14b 131.1K 8.2K Input: $0.35
Output: $1.4
Reasoning: $4.2
Model: 0.175
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-ASR Flash qwen3-asr-flash 53.2K 4.1K Input: $0.035
Output: $0.035
Model: 0.018
Completion: 1.000
- 2024-04 In: audio
Out: text
Released: 2025-09-08
Qwen Turbo qwen-turbo 1M 16.4K Input: $0.05
Output: $0.2
Reasoning: $0.5
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-11-01
Updated: 2025-04-28
Qwen2.5 7B Instruct qwen2-5-7b-instruct 131.1K 8.2K Input: $0.175
Output: $0.7
Model: 0.087
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct 131.1K 8.2K Input: $0.35
Output: $1.05
Model: 0.175
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3.6 Max Preview qwen3.6-max-preview 262.1K 65.5K Input: $1.3
Output: $7.8
Cache Read: $0.13
Cache Write: $1.625
Model: 0.650
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-04-20
Qwen3.5 122B-A10B qwen3.5-122b-a10b 262.1K 65.5K Input: $0.4
Output: $3.2
Model: 0.200
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen-VL Plus qwen-vl-plus 131.1K 8.2K Input: $0.21
Output: $0.63
Model: 0.105
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-01-25
Updated: 2025-08-15
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
DeepSeek R1 deepseek-r1 128K - Input: $4
Output: $16
Model: 2.000
Completion: 4.000
- - In: text
Out: text
-

Alibaba (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen2.5-Math 72B Instruct qwen2-5-math-72b-instruct 4.1K 3.1K Input: $0.574
Output: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
DeepSeek R1 0528 deepseek-r1-0528 131.1K 16.4K Input: $0.574
Output: $2.294
Model: 0.287
Completion: 3.997
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
Qwen3-Omni Flash qwen3-omni-flash 65.5K 16.4K Input: $0.058
Output: $0.23
Input Audio: $3.584
Output Audio: $7.168
Model: 1.792
Completion: 2.000
🧠 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen Plus qwen-plus 1M 32.8K Input: $0.115
Output: $0.287
Reasoning: $1.147
Model: 0.058
Completion: 2.496
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 65.5K Input: $0.216
Output: $0.861
Model: 0.108
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5-Coder 7B Instruct qwen2-5-coder-7b-instruct 131.1K 8.2K Input: $0.144
Output: $0.287
Model: 0.072
Completion: 1.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-11
DeepSeek V3 deepseek-v3 65.5K 8.2K Input: $0.287
Output: $1.147
Model: 0.143
Completion: 3.997
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime 65.5K 16.4K Input: $0.23
Output: $0.918
Input Audio: $3.584
Output Audio: $7.168
Model: 1.792
Completion: 2.000
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-09-15
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 32.8K 16.4K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3 32B qwen3-32b 131.1K 16.4K Input: $0.287
Output: $1.147
Reasoning: $2.868
Model: 0.143
Completion: 3.997
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime 32.8K 2K Input: $0.23
Output: $0.918
Input Audio: $3.584
Output Audio: $7.168
Model: 1.792
Completion: 2.000
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-05-08
Qwen2.5-Math 7B Instruct qwen2-5-math-7b-instruct 4.1K 3.1K Input: $0.144
Output: $0.287
Model: 0.072
Completion: 1.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K Input: $0.144
Output: $0.574
Model: 0.072
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen Long qwen-long 10M 8.2K Input: $0.072
Output: $0.287
Model: 0.036
Completion: 3.986
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-01-25
Qwen Math Turbo qwen-math-turbo 4.1K 3.1K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-09-19
Qwen3 Max qwen3-max 262.1K 65.5K Input: $0.861
Output: $3.441
Model: 0.430
Completion: 3.997
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen2.5-Omni 7B qwen2-5-omni-7b 32.8K 2K Input: $0.087
Output: $0.345
Input Audio: $5.448
Model: 2.724
Completion: 0.063
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Open Weights
Released: 2024-12
Qwen3 8B qwen3-8b 131.1K 8.2K Input: $0.072
Output: $0.287
Reasoning: $0.717
Model: 0.036
Completion: 3.986
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5 14B Instruct qwen2-5-14b-instruct 131.1K 8.2K Input: $0.144
Output: $0.431
Model: 0.072
Completion: 2.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
GLM-5.1 glm-5.1 202.8K 128K Input: $0.87
Output: $3.48
Cache Read: $0.17
Model: 0.435
Completion: 4.000
Cache: 0.195
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-14
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.144
Output: $1.434
Model: 0.072
Completion: 9.958
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
QVQ Max qvq-max 131.1K 8.2K Input: $1.147
Output: $4.588
Model: 0.574
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-03-25
Qwen Plus Character qwen-plus-character 32.8K 4.1K Input: $0.115
Output: $0.287
Model: 0.058
Completion: 2.496
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Moonshot Kimi K2 Thinking kimi-k2-thinking 262.1K 16.4K Input: $0.574
Output: $2.294
Model: 0.287
Completion: 3.997
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-06
DeepSeek R1 deepseek-r1 131.1K 16.4K Input: $0.574
Output: $2.294
Model: 0.287
Completion: 3.997
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3.5 Flash qwen3.5-flash 1M 65.5K Input: $0.172
Output: $1.72
Reasoning: $1.72
Model: 0.086
Completion: 10.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-23
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0.1875
Output: $1.125
Cache Write: $0.234375
Model: 0.094
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct 131.1K 8.2K Input: $2.294
Output: $6.881
Model: 1.147
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3-VL Plus qwen3-vl-plus 262.1K 32.8K Input: $0.143353
Output: $1.433525
Reasoning: $4.300576
Model: 0.072
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-09-23
Qwen-VL OCR qwen-vl-ocr 34.1K 4.1K Input: $0.717
Output: $0.717
Model: 0.358
Completion: 1.000
🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-28
Updated: 2025-04-13
Qwen-MT Turbo qwen-mt-turbo 16.4K 8.2K Input: $0.101
Output: $0.28
Model: 0.051
Completion: 2.772
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen Math Plus qwen-math-plus 4.1K 3.1K Input: $0.574
Output: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-08-16
Updated: 2024-09-19
Qwen-MT Plus qwen-mt-plus 16.4K 8.2K Input: $0.259
Output: $0.775
Model: 0.130
Completion: 2.992
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen3.5 Plus qwen3.5-plus 1M 65.5K Input: $0.573
Output: $3.44
Reasoning: $3.44
Model: 0.286
Completion: 6.003
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Qwen-Omni Turbo qwen-omni-turbo 32.8K 2K Input: $0.058
Output: $0.23
Input Audio: $3.584
Output Audio: $7.168
Model: 1.792
Completion: 2.000
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-01-19
Updated: 2025-03-26
Qwen2.5 72B Instruct qwen2-5-72b-instruct 131.1K 8.2K Input: $0.574
Output: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
DeepSeek R1 Distill Qwen 7B deepseek-r1-distill-qwen-7b 32.8K 16.4K Input: $0.072
Output: $0.144
Model: 0.036
Completion: 2.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
DeepSeek V3.1 deepseek-v3-1 131.1K 65.5K Input: $0.574
Output: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen2.5-Coder 32B Instruct qwen2-5-coder-32b-instruct 131.1K 8.2K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-11
Qwen Flash qwen-flash 1M 32.8K Input: $0.022
Output: $0.216
Model: 0.011
Completion: 9.818
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-07-28
DeepSeek V3.2 Exp deepseek-v3-2-exp 131.1K 65.5K Input: $0.287
Output: $0.431
Model: 0.143
Completion: 1.502
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Moonshot Kimi K2 Instruct moonshot-kimi-k2-instruct 131.1K 8.2K Input: $0.574
Output: $2.294
Model: 0.287
Completion: 3.997
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-01
DeepSeek R1 Distill Qwen 14B deepseek-r1-distill-qwen-14b 32.8K 16.4K Input: $0.144
Output: $0.431
Model: 0.072
Completion: 2.993
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3-VL 235B-A22B qwen3-vl-235b-a22b 131.1K 32.8K Input: $0.286705
Output: $1.14682
Reasoning: $2.867051
Model: 0.143
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen Deep Research qwen-deep-research 1M 32.8K Input: $7.742
Output: $23.367
Model: 3.871
Completion: 3.018
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Moonshot Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0.574
Output: $2.411
Model: 0.287
Completion: 4.200
🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Qwen3-VL 30B-A3B qwen3-vl-30b-a3b 131.1K 32.8K Input: $0.108
Output: $0.431
Reasoning: $1.076
Model: 0.054
Completion: 3.991
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen-VL Max qwen-vl-max 131.1K 8.2K Input: $0.23
Output: $0.574
Model: 0.115
Completion: 2.496
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-04-08
Updated: 2025-08-13
DeepSeek R1 Distill Qwen 1.5B deepseek-r1-distill-qwen-1-5b 32.8K 16.4K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen Max qwen-max 131.1K 8.2K Input: $0.345
Output: $1.377
Model: 0.172
Completion: 3.991
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
MiniMax-M2.5 MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Qwen3 235B-A22B qwen3-235b-a22b 131.1K 16.4K Input: $0.287
Output: $1.147
Reasoning: $2.868
Model: 0.143
Completion: 3.997
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
DeepSeek R1 Distill Llama 8B deepseek-r1-distill-llama-8b 32.8K 16.4K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Moonshot Kimi K2.6 kimi-k2.6 262.1K 16.4K Input: $0.929
Output: $3.858
Model: 0.465
Completion: 4.153
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $0.861
Output: $3.441
Model: 0.430
Completion: 3.997
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
QwQ Plus qwq-plus 131.1K 8.2K Input: $0.23
Output: $0.574
Model: 0.115
Completion: 2.496
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-03-05
Qwen2.5 32B Instruct qwen2-5-32b-instruct 131.1K 8.2K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3.5 397B-A17B qwen3.5-397b-a17b 262.1K 65.5K Input: $0.43
Output: $2.58
Reasoning: $2.58
Model: 0.215
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-16
Tongyi Intent Detect V3 tongyi-intent-detect-v3 8.2K 1K Input: $0.058
Output: $0.144
Model: 0.029
Completion: 2.483
🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen3 Coder Flash qwen3-coder-flash 1M 65.5K Input: $0.144
Output: $0.574
Model: 0.072
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
DeepSeek R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b 32.8K 16.4K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
QwQ 32B qwq-32b 131.1K 8.2K Input: $0.287
Output: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-12
Qwen3 14B qwen3-14b 131.1K 8.2K Input: $0.144
Output: $0.574
Reasoning: $1.434
Model: 0.072
Completion: 3.986
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-ASR Flash qwen3-asr-flash 53.2K 4.1K Input: $0.032
Output: $0.032
Model: 0.016
Completion: 1.000
- 2024-04 In: audio
Out: text
Released: 2025-09-08
Qwen Doc Turbo qwen-doc-turbo 131.1K 8.2K Input: $0.087
Output: $0.144
Model: 0.043
Completion: 1.655
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
GLM-5 glm-5 202.8K 16.4K Input: $0.86
Output: $3.15
Model: 0.430
Completion: 3.663
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
Qwen Turbo qwen-turbo 1M 16.4K Input: $0.044
Output: $0.087
Reasoning: $0.431
Model: 0.022
Completion: 1.977
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-11-01
Updated: 2025-07-15
Qwen2.5 7B Instruct qwen2-5-7b-instruct 131.1K 8.2K Input: $0.072
Output: $0.144
Model: 0.036
Completion: 2.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct 131.1K 8.2K Input: $0.287
Output: $0.717
Model: 0.143
Completion: 2.498
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3.6 Max Preview qwen3.6-max-preview 245.8K 65.5K Input: $1.32
Output: $7.9
Cache Read: $0.132
Model: 0.660
Completion: 5.985
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-20
Updated: 2026-04-21
Qwen-VL Plus qwen-vl-plus 131.1K 8.2K Input: $0.115
Output: $0.287
Model: 0.058
Completion: 2.496
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-01-25
Updated: 2025-08-15
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
MiniMax-M2.7 MiniMax/MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
kimi/kimi-k2.5 kimi/kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
siliconflow/deepseek-r1-0528 siliconflow/deepseek-r1-0528 163.8K 32.8K Input: $0.5
Output: $2.18
Model: 0.250
Completion: 4.360
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
Updated: 2025-11-25
siliconflow/deepseek-v3.1-terminus siliconflow/deepseek-v3.1-terminus 163.8K 65.5K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-29
Updated: 2025-11-25
siliconflow/deepseek-v3-0324 siliconflow/deepseek-v3-0324 163.8K 163.8K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-26
Updated: 2025-11-25
siliconflow/deepseek-v3.2 siliconflow/deepseek-v3.2 163.8K 65.5K Input: $0.27
Output: $0.42
Model: 0.135
Completion: 1.556
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-03
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

Alibaba Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM-4.7 glm-4.7 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0.1875
Output: $1.125
Cache Write: $0.234375
Model: 0.094
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen3 Max qwen3-max-2026-01-23 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-01-23
Qwen3.5 Plus qwen3.5-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
MiniMax-M2.5 MiniMax-M2.5 196.6K 24.6K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Qwen3 Coder Next qwen3-coder-next 262.1K 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-03
GLM-5 glm-5 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02

Alibaba Coding Plan (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM-4.7 glm-4.7 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0.1875
Output: $1.125
Cache Write: $0.234375
Model: 0.094
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen3 Max qwen3-max-2026-01-23 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-01-23
Qwen3.5 Plus qwen3.5-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-02-16
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01-27
MiniMax-M2.5 MiniMax-M2.5 196.6K 24.6K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Qwen3 Coder Next qwen3-coder-next 262.1K 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-03
GLM-5 glm-5 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02

Alibaba Token Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
GLM-5.1 glm-5.1 202.8K 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Wan2.7 Image Pro wan2.7-image-pro 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-05-29
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Qwen Image 2.0 qwen-image-2.0 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-03-03
MiniMax-M2.5 MiniMax-M2.5 196.6K 24.6K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Kimi K2.6 kimi-k2.6 262.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Qwen Image 2.0 Pro qwen-image-2.0-pro 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-03-03
Wan2.7 Image wan2.7-image 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-05-29
GLM-5 glm-5 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
DeepSeek V3.2 deepseek-v3.2 131.1K 65.5K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-12-03
Updated: 2025-12-05
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02

Alibaba Token Plan (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
GLM-5.1 glm-5.1 202.8K 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Wan2.7 Image Pro wan2.7-image-pro 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-05-29
Qwen3.6 Flash qwen3.6-flash 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Qwen Image 2.0 qwen-image-2.0 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-03-03
MiniMax-M2.5 MiniMax-M2.5 196.6K 24.6K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Kimi K2.6 kimi-k2.6 262.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Qwen Image 2.0 Pro qwen-image-2.0-pro 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-03-03
Wan2.7 Image wan2.7-image 8.2K - Input: $0
Output: $0
- 🌡️ - In: text
Out: image
Released: 2026-05-29
GLM-5 glm-5 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
DeepSeek V3.2 deepseek-v3.2 131.1K 65.5K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-12-03
Updated: 2025-12-05
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02

aliyun-bailian

📖 API Address
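
Many rows in the table below are priced in CNY and carry the note "(CNY pricing, multiply by USD/CNY rate for NewAPI)". One reading of that note, sketched in Python: the listed Model ratio is computed from the CNY price, so it is scaled by an exchange rate (USD per 1 CNY) before use. Completion and Cache are price quotients and stay unchanged. The function name `cny_ratio_to_usd` and the 0.14 rate are illustrative assumptions, not values from this page.

```python
def cny_ratio_to_usd(model_ratio_cny: float, usd_per_cny: float) -> float:
    """Scale a CNY-based model ratio into a USD-based one.

    usd_per_cny is the exchange rate (USD for 1 CNY); supply a current value.
    """
    if usd_per_cny <= 0:
        raise ValueError("usd_per_cny must be positive")
    return round(model_ratio_cny * usd_per_cny, 4)


# gte-rerank-v2 row: Input ¥0.8, Model 0.400 (CNY-based).
# 0.14 is a placeholder rate, not a quoted exchange rate.
print(cny_ratio_to_usd(0.400, usd_per_cny=0.14))
```

The rate is left as a required parameter because it changes over time and this page does not state one.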

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
animate-anyone-gen2 animate-anyone-gen2 - - Per Second Standard: ¥0.08 Model: 0.080
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
animate-anyone-template-gen2 animate-anyone-template-gen2 - - Per Second Standard: ¥0.08 Model: 0.080
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
cosyvoice-v1 cosyvoice-v1 - - ¥2/10K chars Model: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
cosyvoice-v2 cosyvoice-v2 - - ¥2/10K chars Model: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
cosyvoice-v3-plus cosyvoice-v3-plus - - ¥2/10K chars Model: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
cosyvoice-v3 cosyvoice-v3 - - ¥0.4/10K chars Model: 0.400
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr-2025-08-25 fun-asr-2025-08-25 - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr-mtl-2025-08-25 fun-asr-mtl-2025-08-25 - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr-mtl fun-asr-mtl - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr-realtime-2025-09-15 fun-asr-realtime-2025-09-15 - - ¥0.00033/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr-realtime fun-asr-realtime - - ¥0.00033/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
fun-asr fun-asr - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
gte-rerank-v2 gte-rerank-v2 - - Input: ¥0.8
Output: -
Model: 0.400
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
image-out-painting image-out-painting - - ¥0.18/img Model: 0.180
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
multimodal-embedding-v1 multimodal-embedding-v1 - - Text: ¥0.7/1K
Image: ¥0.9/1K
- - - In: text
Out: text
-
paraformer-8k-v2 paraformer-8k-v2 - - ¥0.00008/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
paraformer-realtime-8k-v2 paraformer-realtime-8k-v2 - - ¥0.00024/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
paraformer-realtime-v2 paraformer-realtime-v2 - - ¥0.00024/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
paraformer-v2 paraformer-v2 - - ¥0.00008/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qvq-max-2025-03-25 qvq-max-2025-03-25 - - Input: ¥8
Output: ¥32
Model: 4.000
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-coder-turbo-2024-09-19 qwen-coder-turbo-2024-09-19 - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-coder-turbo-latest qwen-coder-turbo-latest - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-flash-2025-07-28 qwen-flash-2025-07-28 - - Input: ¥0.15
Output: ¥1.5
Input (128K-256K): ¥0.6
Input (256K-1M): ¥1.2
Output (128K-256K): ¥6
Output (256K-1M): ¥12
Model: 0.600
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-flash qwen-flash - - Input: ¥0.15
Output: ¥1.5
Cache Read: ¥0.015
Cache Read (128K-256K): ¥0.06
Cache Read (256K-1M): ¥0.12
Cache Write: ¥0.188
Cache Write (128K-256K): ¥0.75
Cache Write (256K-1M): ¥1.5
Input (128K-256K): ¥0.6
Input (256K-1M): ¥1.2
Output (128K-256K): ¥6
Output (256K-1M): ¥12
Model: 0.600
Completion: 10.000
Cache: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-image-edit qwen-image-edit - - ¥0.3/img Model: 0.300
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-image-plus qwen-image-plus - - ¥0.2/img Model: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-image qwen-image - - ¥0.25/img Model: 0.250
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-long-latest qwen-long-latest - - Input: ¥0.5
Output: ¥2
Model: 0.250
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-long qwen-long - - Input: ¥0.5
Output: ¥2
Model: 0.250
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-max-latest qwen-max-latest - - Input: ¥2.4
Output: ¥9.6
Model: 1.200
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-max qwen-max - - Input: ¥2.4
Output: ¥9.6
Cache Read: ¥0.48
Model: 1.200
Completion: 4.000
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-mt-image qwen-mt-image - - ¥0.003/img Model: 0.003
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-mt-plus qwen-mt-plus - - Input: ¥1.8
Output: ¥5.4
Model: 0.900
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-mt-turbo qwen-mt-turbo - - Input: ¥0.7
Output: ¥1.95
Model: 0.350
Completion: 2.786
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-omni-turbo-latest qwen-omni-turbo-latest - - Text Input: ¥0.4
Vision Input: ¥1.5
Audio Input: ¥25
Output: ¥50
Multi Output: ¥50
Text Output (multimodal input): ¥4.5
Text Output (text-only input): ¥1.6
- - - In: text
Out: text
-
qwen-omni-turbo-realtime-latest qwen-omni-turbo-realtime-latest - - Text Input: ¥1.6
Vision Input: ¥6
Audio Input: ¥25
Output: ¥50
Multi Output: ¥50
Text Output (multimodal input): ¥18
Text Output (text-only input): ¥6.4
- - - In: text
Out: text
-
qwen-omni-turbo-realtime qwen-omni-turbo-realtime - - Text Input: ¥1.6
Vision Input: ¥6
Audio Input: ¥25
Output: ¥50
Multi Output: ¥50
Text Output (multimodal input): ¥18
Text Output (text-only input): ¥6.4
- - - In: text
Out: text
-
qwen-omni-turbo qwen-omni-turbo - - Text Input: ¥0.4
Vision Input: ¥1.5
Audio Input: ¥25
Output: ¥50
Audio Input Cache: ¥5
Multi Output: ¥50
Text Output (multimodal input): ¥4.5
Text Output (text-only input): ¥1.6
Text Input Cache: ¥0.08
Vision Input Cache: ¥0.3
- - - In: text
Out: text
-
qwen-plus-2024-09-19 qwen-plus-2024-09-19 - - Input: ¥0.8
Output: ¥2
Model: 0.400
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-plus-latest qwen-plus-latest - - Input: ¥0.8
Output: ¥2
Input (128K-256K): ¥2.4
Input (256K-1M): ¥4.8
Output (128K-256K): ¥20
Output (256K-1M): ¥48
Thinking Input: ¥0.8
Thinking Input (128K-256K): ¥2.4
Thinking Input (256K-1M): ¥4.8
Thinking Output: ¥8
Thinking Output (128K-256K): ¥24
Thinking Output (256K-1M): ¥64
Model: 2.400
Completion: 13.333
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-plus qwen-plus - - Input: ¥0.8
Output: ¥2
Cache Read: ¥0.08
Cache Read (128K-256K): ¥0.24
Cache Read (256K-1M): ¥0.48
Cache Write: ¥1
Cache Write (128K-256K): ¥3
Cache Write (256K-1M): ¥6
Input (128K-256K): ¥2.4
Input (256K-1M): ¥4.8
Output (128K-256K): ¥20
Output (256K-1M): ¥48
Thinking Cache Read: ¥0.08
Thinking Cache Read (128K-256K): ¥0.24
Thinking Cache Read (256K-1M): ¥0.48
Thinking Cache Write: ¥1
Thinking Cache Write (128K-256K): ¥3
Thinking Cache Write (256K-1M): ¥6
Thinking Input: ¥0.8
Thinking Input (128K-256K): ¥2.4
Thinking Input (256K-1M): ¥4.8
Thinking Output: ¥8
Thinking Output (128K-256K): ¥24
Thinking Output (256K-1M): ¥64
Model: 2.400
Completion: 13.333
Cache: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-turbo-latest qwen-turbo-latest - - Input: ¥0.3
Output: ¥0.6
Thinking Input: ¥0.3
Thinking Output: ¥3
Model: 0.150
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-turbo qwen-turbo - - Input: ¥0.3
Output: ¥0.6
Cache Read: ¥0.06
Thinking Cache Read: ¥0.06
Thinking Input: ¥0.3
Thinking Output: ¥3
Model: 0.150
Completion: 10.000
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-vl-max-latest qwen-vl-max-latest - - Input: ¥1.6
Output: ¥4
Model: 0.800
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-vl-max qwen-vl-max - - Input: ¥1.6
Output: ¥4
Cache Read: ¥0.32
Model: 0.800
Completion: 2.500
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-vl-ocr-latest qwen-vl-ocr-latest - - VL: ¥5/1K - - - In: text
Out: text
-
qwen-vl-ocr qwen-vl-ocr - - VL: ¥5/1K - - - In: text
Out: text
-
qwen-vl-plus-latest qwen-vl-plus-latest - - Input: ¥0.8
Output: ¥2
Model: 0.400
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen-vl-plus qwen-vl-plus - - Input: ¥0.8
Output: ¥2
Cache Read: ¥0.16
Model: 0.400
Completion: 2.500
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-14b-instruct-1m qwen2.5-14b-instruct-1m - - Input: ¥1
Output: ¥3
Model: 0.500
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-14b-instruct qwen2.5-14b-instruct - - Input: ¥1
Output: ¥3
Model: 0.500
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-32b-instruct qwen2.5-32b-instruct - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-3b-instruct qwen2.5-3b-instruct - - Input: ¥0.3
Output: ¥0.9
Model: 0.150
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-72b-instruct qwen2.5-72b-instruct - - Input: ¥4
Output: ¥12
Model: 2.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-7b-instruct-1m qwen2.5-7b-instruct-1m - - Input: ¥0.5
Output: ¥1
Model: 0.250
Completion: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-7b-instruct qwen2.5-7b-instruct - - Input: ¥0.5
Output: ¥1
Model: 0.250
Completion: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-coder-14b-instruct qwen2.5-coder-14b-instruct - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-coder-32b-instruct qwen2.5-coder-32b-instruct - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-coder-7b-instruct qwen2.5-coder-7b-instruct - - Input: ¥1
Output: ¥2
Model: 0.500
Completion: 2.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-omni-7b qwen2.5-omni-7b - - Text Input: ¥0.6
Vision Input: ¥2
Audio Input: ¥38
Output: ¥76
Multi Output: ¥76
Text Output (multimodal input): ¥6
Text Output (text-only input): ¥2.4
- - - In: text
Out: text
-
qwen2.5-vl-32b-instruct qwen2.5-vl-32b-instruct - - Input: ¥8
Output: ¥24
Model: 4.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-vl-3b-instruct qwen2.5-vl-3b-instruct - - Input: ¥1.2
Output: ¥3.6
Model: 0.600
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-vl-72b-instruct qwen2.5-vl-72b-instruct - - Input: ¥16
Output: ¥48
Model: 8.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen2.5-vl-7b-instruct qwen2.5-vl-7b-instruct - - Input: ¥2
Output: ¥5
Model: 1.000
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-0.6b qwen3-0.6b - - Input: ¥0.3
Output: ¥1.2
Thinking Input: ¥0.3
Thinking Output: ¥3
Model: 0.150
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-1.7b qwen3-1.7b - - Input: ¥0.3
Output: ¥1.2
Thinking Input: ¥0.3
Thinking Output: ¥3
Model: 0.150
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-14b qwen3-14b - - Input: ¥1
Output: ¥4
Thinking Input: ¥1
Thinking Output: ¥10
Model: 0.500
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-235b-a22b-instruct-2507 qwen3-235b-a22b-instruct-2507 - - Input: ¥2
Output: ¥8
Model: 1.000
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-235b-a22b-thinking-2507 qwen3-235b-a22b-thinking-2507 - - Thinking Input: ¥2
Thinking Output: ¥20
Model: 1.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-235b-a22b qwen3-235b-a22b - - Input: ¥2
Output: ¥8
Thinking Input: ¥2
Thinking Output: ¥20
Model: 1.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-30b-a3b qwen3-30b-a3b - - Input: ¥0.75
Output: ¥3
Thinking Input: ¥0.75
Thinking Output: ¥7.5
Model: 0.375
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-32b qwen3-32b - - Input: ¥2
Output: ¥8
Thinking Input: ¥2
Thinking Output: ¥20
Model: 1.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-4b qwen3-4b - - Input: ¥0.3
Output: ¥1.2
Thinking Input: ¥0.3
Thinking Output: ¥3
Model: 0.150
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-8b qwen3-8b - - Input: ¥0.5
Output: ¥2
Thinking Input: ¥0.5
Thinking Output: ¥5
Model: 0.250
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-asr-flash-2025-09-08 qwen3-asr-flash-2025-09-08 - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-asr-flash qwen3-asr-flash - - ¥0.00022/s Model: 0.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-30b-a3b-instruct qwen3-coder-30b-a3b-instruct - - Input: ¥1.5
Output: ¥6
Input (32K-128K): ¥2.25
Input (128K-256K): ¥3.75
Input (256K-1M): ¥7.5
Output (32K-128K): ¥9
Output (128K-256K): ¥15
Output (256K-1M): ¥37.5
Model: 3.750
Completion: 5.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-480b-a35b-instruct qwen3-coder-480b-a35b-instruct - - Input: ¥6
Output: ¥24
Input (32K-128K): ¥9
Input (128K-256K): ¥15
Input (256K-1M): ¥30
Output (32K-128K): ¥36
Output (128K-256K): ¥60
Output (256K-1M): ¥300
Model: 15.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-flash qwen3-coder-flash - - Input: ¥1
Output: ¥4
Cache Read: ¥0.1
Cache Read (32K-128K): ¥0.15
Cache Read (128K-256K): ¥0.25
Cache Read (256K-1M): ¥0.5
Cache Write: ¥1.25
Cache Write (32K-128K): ¥1.875
Cache Write (128K-256K): ¥3.125
Cache Write (256K-1M): ¥6.25
Input (32K-128K): ¥1.5
Input (128K-256K): ¥2.5
Input (256K-1M): ¥5
Output (32K-128K): ¥6
Output (128K-256K): ¥10
Output (256K-1M): ¥25
Model: 2.500
Completion: 5.000
Cache: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-plus-2025-07-22 qwen3-coder-plus-2025-07-22 - - Input: ¥4
Output: ¥16
Input (32K-128K): ¥6
Input (128K-256K): ¥10
Input (256K-1M): ¥20
Output (32K-128K): ¥24
Output (128K-256K): ¥40
Output (256K-1M): ¥200
Model: 10.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-plus-2025-09-23 qwen3-coder-plus-2025-09-23 - - Input: ¥4
Output: ¥16
Input (32K-128K): ¥6
Input (128K-256K): ¥10
Input (256K-1M): ¥20
Output (32K-128K): ¥24
Output (128K-256K): ¥40
Output (256K-1M): ¥200
Model: 10.000
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-coder-plus qwen3-coder-plus - - Input: ¥4
Output: ¥16
Cache Read: ¥0.4
Cache Read (32K-128K): ¥0.6
Cache Read (128K-256K): ¥1
Cache Read (256K-1M): ¥2
Cache Write: ¥5
Cache Write (32K-128K): ¥7.5
Cache Write (128K-256K): ¥12.5
Cache Write (256K-1M): ¥25
Input (32K-128K): ¥6
Input (128K-256K): ¥10
Input (256K-1M): ¥20
Output (32K-128K): ¥24
Output (128K-256K): ¥40
Output (256K-1M): ¥200
Model: 10.000
Completion: 10.000
Cache: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-max-2025-09-23 qwen3-max-2025-09-23 - - Input: ¥6
Output: ¥24
Input (32K-128K): ¥10
Input (128K-256K): ¥15
Output (32K-128K): ¥40
Output (128K-256K): ¥60
Model: 7.500
Completion: 4.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-max-preview qwen3-max-preview - - Input: ¥6
Output: ¥24
Cache Read: ¥1.2
Cache Read (32K-128K): ¥2
Cache Read (128K-256K): ¥3
Input (32K-128K): ¥10
Input (128K-256K): ¥15
Output (32K-128K): ¥40
Output (128K-256K): ¥60
Model: 7.500
Completion: 4.000
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-max qwen3-max - - Input: ¥6
Output: ¥24
Cache Read: ¥0.6
Cache Read (32K-128K): ¥1
Cache Read (128K-256K): ¥1.5
Cache Write: ¥7.5
Cache Write (32K-128K): ¥12.5
Cache Write (128K-256K): ¥18.75
Input (32K-128K): ¥10
Input (128K-256K): ¥15
Output (32K-128K): ¥40
Output (128K-256K): ¥60
Model: 7.500
Completion: 4.000
Cache: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-omni-30b-a3b-captioner qwen3-omni-30b-a3b-captioner - - Audio Input: ¥15.8
Multi Output: ¥12.7
Text Output (multimodal input): ¥12.7
- - - In: text
Out: text
-
qwen3-omni-flash-2025-09-15 qwen3-omni-flash-2025-09-15 - - Text Input: ¥1.8
Vision Input: ¥3.3
Audio Input: ¥15.8
Output: ¥62.6
Multi Output: ¥62.6
Text Output (multimodal input): ¥12.7
Text Output (text-only input): ¥6.9
Thinking Audio Input: ¥15.8
Thinking Text Output (multimodal input): ¥12.7
Thinking Text Output (text-only input): ¥6.9
Thinking Text Input: ¥1.8
Thinking Vision Input: ¥3.3
- - - In: text
Out: text
-
qwen3-omni-flash-realtime-2025-09-15 qwen3-omni-flash-realtime-2025-09-15 - - Text Input: ¥2.2
Vision Input: ¥3.9
Audio Input: ¥18.9
Output: ¥75.1
Multi Output: ¥75.1
Text Output (multimodal input): ¥15.2
Text Output (text-only input): ¥8.3
- - - In: text
Out: text
-
qwen3-omni-flash-realtime qwen3-omni-flash-realtime - - Text Input: ¥2.2
Vision Input: ¥3.9
Audio Input: ¥18.9
Output: ¥75.1
Multi Output: ¥75.1
Text Output (multimodal input): ¥15.2
Text Output (text-only input): ¥8.3
- - - In: text
Out: text
-
qwen3-omni-flash qwen3-omni-flash - - Text Input: ¥1.8
Vision Input: ¥3.3
Audio Input: ¥15.8
Output: ¥62.6
Multi Output: ¥62.6
Text Output (multimodal input): ¥12.7
Text Output (text-only input): ¥6.9
Thinking Audio Input: ¥15.8
Thinking Text Output (multimodal input): ¥12.7
Thinking Text Output (text-only input): ¥6.9
Thinking Text Input: ¥1.8
Thinking Vision Input: ¥3.3
- - - In: text
Out: text
-
qwen3-tts-flash-2025-09-18 qwen3-tts-flash-2025-09-18 - - ¥0.8/10K chars Model: 0.800
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-tts-flash-realtime-2025-09-18 qwen3-tts-flash-realtime-2025-09-18 - - ¥1/10K chars Model: 1.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-tts-flash-realtime qwen3-tts-flash-realtime - - ¥1/10K chars Model: 1.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-tts-flash qwen3-tts-flash - - ¥0.8/10K chars Model: 0.800
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-vl-plus-2025-09-23 qwen3-vl-plus-2025-09-23 - - Input: ¥1
Output: ¥10
Input (32K-128K): ¥1.5
Input (128K-256K): ¥3
Output (32K-128K): ¥15
Output (128K-256K): ¥30
Model: 1.500
Completion: 10.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwen3-vl-plus qwen3-vl-plus - - Input: ¥1
Output: ¥10
Cache Read: ¥0.2
Cache Read (32K-128K): ¥0.3
Cache Read (128K-256K): ¥0.6
Input (32K-128K): ¥1.5
Input (128K-256K): ¥3
Output (32K-128K): ¥15
Output (128K-256K): ¥30
Model: 1.500
Completion: 10.000
Cache: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwq-32b-preview qwq-32b-preview - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwq-32b qwq-32b - - Input: ¥2
Output: ¥6
Model: 1.000
Completion: 3.000
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwq-plus-latest qwq-plus-latest - - Input: ¥1.6
Output: ¥4
Model: 0.800
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
qwq-plus qwq-plus - - Input: ¥1.6
Output: ¥4
Model: 0.800
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
text-embedding-async-v2 text-embedding-async-v2 - - Input: ¥0.7
Output: -
Model: 0.350
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
text-embedding-v1 text-embedding-v1 - - Input: ¥0.7
Output: -
Model: 0.350
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
text-embedding-v2 text-embedding-v2 - - Input: ¥0.7
Output: -
Model: 0.350
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
text-embedding-v3 text-embedding-v3 - - Input: ¥0.5
Output: -
Model: 0.250
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
text-embedding-v4 text-embedding-v4 - - Input: ¥0.5
Output: -
Model: 0.250
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
tongyi-embedding-vision-flash tongyi-embedding-vision-flash - - Text: ¥0.2/1K
Image: ¥0.5/1K
- - - In: text
Out: text
-
tongyi-embedding-vision-plus tongyi-embedding-vision-plus - - Text: ¥0.5/1K
Image: ¥0.5/1K
- - - In: text
Out: text
-
tongyi-intent-detect-v3 tongyi-intent-detect-v3 - - Input: ¥0.4
Output: ¥1
Model: 0.200
Completion: 2.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-animate-mix wan2.2-animate-mix - - Per Second Pro: ¥0.9
Per Second Standard: ¥0.6
Model: 0.600
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-animate-move wan2.2-animate-move - - Per Second Pro: ¥0.6
Per Second Standard: ¥0.4
Model: 0.400
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-i2v-flash wan2.2-i2v-flash - - Per Second 1080p: ¥0.48
Per Second 480p: ¥0.1
Per Second 720p: ¥0.2
Model: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-i2v-plus wan2.2-i2v-plus - - Per Second 1080p: ¥0.7
Per Second 480p: ¥0.14
Model: 0.140
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-kf2v-flash wan2.2-kf2v-flash - - Per Second 1080p: ¥0.48
Per Second 480p: ¥0.1
Per Second 720p: ¥0.2
Model: 0.100
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-s2v wan2.2-s2v - - Per Second 480p: ¥0.5
Per Second 720p: ¥0.9
Model: 0.500
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-t2i-flash wan2.2-t2i-flash - - ¥0.14/img Model: 0.140
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-t2i-plus wan2.2-t2i-plus - - ¥0.2/img Model: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.2-t2v-plus wan2.2-t2v-plus - - Per Second 1080x1920: ¥0.7
Per Second 1248x1632: ¥0.7
Per Second 1440x1440: ¥0.7
Per Second 1632x1248: ¥0.7
Per Second 1920x1080: ¥0.7
Per Second 480x832: ¥0.14
Per Second 624x624: ¥0.14
Per Second 832x480: ¥0.14
Model: 0.140
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.5-i2i-preview wan2.5-i2i-preview - - ¥0.2/img Model: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.5-i2v-preview wan2.5-i2v-preview - - Per Second 1080p: ¥1
Per Second 480p: ¥0.3
Per Second 720p: ¥0.6
Model: 0.300
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.5-t2i-preview wan2.5-t2i-preview - - ¥0.2/img Model: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wan2.5-t2v-preview wan2.5-t2v-preview - - Per Second 1080x1920: ¥1
Per Second 1088x832: ¥0.6
Per Second 1248x1632: ¥1
Per Second 1280x720: ¥0.6
Per Second 1440x1440: ¥1
Per Second 1632x1248: ¥1
Per Second 1920x1080: ¥1
Per Second 480x832: ¥0.3
Per Second 624x624: ¥0.3
Per Second 720x1280: ¥0.6
Per Second 832x1088: ¥0.6
Per Second 832x480: ¥0.3
Per Second 960x960: ¥0.6
Model: 0.300
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx-background-generation-v2 wanx-background-generation-v2 - - ¥0.08/img Model: 0.080
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx-sketch-to-image-lite wanx-sketch-to-image-lite - - ¥0.06/img Model: 0.060
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx-style-repaint-v1 wanx-style-repaint-v1 - - ¥0.12/img Model: 0.120
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx-v1 wanx-v1 - - ¥0.16/img Model: 0.160
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.0-t2i-turbo wanx2.0-t2i-turbo - - ¥0.04/img Model: 0.040
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-i2v-plus wanx2.1-i2v-plus - - Per Second Standard: ¥0.7 Model: 0.700
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-i2v-turbo wanx2.1-i2v-turbo - - Per Second Standard: ¥0.24 Model: 0.240
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-imageedit wanx2.1-imageedit - - ¥0.14/img Model: 0.140
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-kf2v-plus wanx2.1-kf2v-plus - - Per Second Standard: ¥0.7 Model: 0.700
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-t2i-plus wanx2.1-t2i-plus - - ¥0.2/img Model: 0.200
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-t2i-turbo wanx2.1-t2i-turbo - - ¥0.14/img Model: 0.140
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-t2v-plus wanx2.1-t2v-plus - - Per Second 1088x832: ¥0.7
Per Second 1280x720: ¥0.7
Per Second 720x1280: ¥0.7
Per Second 832x1088: ¥0.7
Per Second 960x960: ¥0.7
Model: 0.700
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-t2v-turbo wanx2.1-t2v-turbo - - Per Second 1088x832: ¥0.24
Per Second 1280x720: ¥0.24
Per Second 480x832: ¥0.24
Per Second 624x624: ¥0.24
Per Second 720x1280: ¥0.24
Per Second 832x1088: ¥0.24
Per Second 832x480: ¥0.24
Per Second 960x960: ¥0.24
Model: 0.240
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
wanx2.1-vace-plus wanx2.1-vace-plus - - Per Second Standard: ¥0.7 Model: 0.700
(CNY pricing, multiply by USD/CNY rate for NewAPI)
- - In: text
Out: text
-
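
The note "(CNY pricing, multiply by USD/CNY rate for NewAPI)" on the rows above can be read as a single scaling step. The following is a minimal sketch under that reading, assuming the rate is expressed as USD per 1 CNY. The function name `cny_ratio_to_usd`, the parameter `usd_per_cny`, and the example rate of 0.14 are illustrative assumptions and do not come from this page or from NewAPI.

```python
def cny_ratio_to_usd(ratio_cny: float, usd_per_cny: float) -> float:
    """Scale a CNY-denominated ratio into USD terms.

    Assumption: `usd_per_cny` is the number of US dollars per 1 CNY.
    Both names are hypothetical helpers, not part of any NewAPI interface.
    """
    if usd_per_cny <= 0:
        raise ValueError("usd_per_cny must be positive")
    return ratio_cny * usd_per_cny


# Example: a listed ratio of 0.700 with an assumed rate of 0.14 USD per CNY.
usd_ratio = cny_ratio_to_usd(0.700, usd_per_cny=0.14)
print(f"{usd_ratio:.3f}")
```

Passing the rate as an argument keeps the sketch independent of any particular exchange-rate source; substitute the rate that applies when the ratio is configured.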

Amazon Bedrock

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 4.5 (Global) global.anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Sonnet 4.5 (Global) global.anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Llama 4 Scout 17B Instruct (US) us.meta.llama4-scout-17b-instruct-v1:0 3.5M 16.4K Input: $0.17
Output: $0.66
Model: 0.085
Completion: 3.882
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
MiniMax M2 minimax.minimax-m2 204.6K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
Claude Opus 4.7 anthropic.claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Sonnet 4.6 (EU) eu.anthropic.claude-sonnet-4-6 1M 64K Input: $3.3
Output: $16.5
Cache Read: $0.33
Cache Write: $4.125
Model: 1.650
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Voxtral Small 24B 2507 mistral.voxtral-small-24b-2507 32K 8.2K Input: $0.15
Output: $0.35
Model: 0.075
Completion: 2.333
📎 🔧 🌡️ - In: text, audio
Out: text
Open Weights
Released: 2025-07-01
Ministral 3 3B mistral.ministral-3-3b-instruct 256K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
gpt-oss-20b openai.gpt-oss-20b 128K 16.4K Input: $0.07
Output: $0.3
Model: 0.035
Completion: 4.286
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude Opus 4.6 anthropic.claude-opus-4-6-v1 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GPT OSS Safeguard 20B openai.gpt-oss-safeguard-20b 128K 16.4K Input: $0.07
Output: $0.2
Model: 0.035
Completion: 2.857
🔧 🌡️ - In: text
Out: text
Released: 2025-10-29
Claude Opus 4.5 anthropic.claude-opus-4-5-20251101-v1:0 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
Claude Fable 5 (Global) global.anthropic.claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-06-09
gpt-oss-120b openai.gpt-oss-120b-1:0 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude Sonnet 4.5 anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Nova Pro amazon.nova-pro-v1:0 300K 8.2K Input: $0.8
Output: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Qwen3 Coder Next qwen.qwen3-coder-next 131.1K 65.5K Input: $0.22
Output: $1.8
Model: 0.110
Completion: 8.182
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-06
Claude Opus 4.7 (US) us.anthropic.claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
NVIDIA Nemotron Nano 9B v2 nvidia.nemotron-nano-9b-v2 128K 4.1K Input: $0.06
Output: $0.23
Model: 0.030
Completion: 3.833
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
Qwen3 32B (dense) qwen.qwen3-32b-v1:0 16.4K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2025-09-18
Claude Sonnet 4.6 (JP) jp.anthropic.claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
DeepSeek-R1 deepseek.r1-v1:0 128K 32.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29
Mistral Large 3 mistral.mistral-large-3-675b-instruct 256K 8.2K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Google Gemma 3 27B Instruct google.gemma-3-27b-it 202.8K 8.2K Input: $0.12
Output: $0.2
Model: 0.060
Completion: 1.667
📎 🔧 🌡️ 2025-07 In: text, image
Out: text
Open Weights
Released: 2025-07-27
Claude Sonnet 4.6 anthropic.claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Nova 2 Lite amazon.nova-2-lite-v1:0 128K 4.1K Input: $0.33
Output: $2.75
Model: 0.165
Completion: 8.333
🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2024-12-01
GPT OSS Safeguard 120B openai.gpt-oss-safeguard-120b 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-10-29
Ministral 3 8B mistral.ministral-3-8b-instruct 128K 4.1K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
Claude Opus 4.6 (EU) eu.anthropic.claude-opus-4-6-v1 1M 128K Input: $5.5
Output: $27.5
Cache Read: $0.55
Cache Write: $6.875
Model: 2.750
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
AU Anthropic Claude Opus 4.6 au.anthropic.claude-opus-4-6-v1 1M 128K Input: $16.5
Output: $82.5
Cache Read: $1.65
Cache Write: $20.625
Model: 8.250
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05 In: text, image, pdf
Out: text
Released: 2026-02-05
gpt-oss-120b openai.gpt-oss-120b 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude Opus 4.8 (Global) global.anthropic.claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Llama 4 Maverick 17B Instruct (US) us.meta.llama4-maverick-17b-instruct-v1:0 1M 16.4K Input: $0.24
Output: $0.97
Model: 0.120
Completion: 4.042
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
GPT-5.4 openai.gpt-5.4 272K 128K Input: $2.75
Output: $16.5
Cache Read: $0.275
Model: 1.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Updated: 2026-06-01
Devstral 2 123B mistral.devstral-2-123b 256K 8.2K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-17
GLM-4.7 zai.glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Palmyra X4 writer.palmyra-x4-v1:0 122.9K 8.2K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-28
Magistral Small 1.2 mistral.magistral-small-2509 128K 40K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Qwen3 Coder 480B A35B Instruct qwen.qwen3-coder-480b-a35b-v1:0 131.1K 65.5K Input: $0.22
Output: $1.8
Model: 0.110
Completion: 8.182
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2025-09-18
Nova Micro amazon.nova-micro-v1:0 128K 8.2K Input: $0.035
Output: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Pixtral Large (25.02) mistral.pixtral-large-2502-v1:0 128K 8.2K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ - In: text, image
Out: text
Released: 2025-04-08
Claude Opus 4.6 (US) us.anthropic.claude-opus-4-6-v1 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Opus 4.7 (JP) jp.anthropic.claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Sonnet 4.5 (AU) au.anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
DeepSeek-V3.1 deepseek.v3-v1:0 163.8K 81.9K Input: $0.58
Output: $1.68
Model: 0.290
Completion: 2.897
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-09-18
Claude Opus 4.1 anthropic.claude-opus-4-1-20250805-v1:0 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Sonnet 4.5 (JP) jp.anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Gemma 3 4B IT google.gemma-3-4b-it 128K 4.1K Input: $0.04
Output: $0.08
Model: 0.020
Completion: 2.000
🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-01
Claude Sonnet 4.5 (EU) eu.anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3.3
Output: $16.5
Cache Read: $0.33
Cache Write: $4.125
Model: 1.650
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Qwen/Qwen3-VL-235B-A22B-Instruct qwen.qwen3-vl-235b-a22b 262K 262K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Palmyra X5 writer.palmyra-x5-v1:0 1M 8.2K Input: $0.6
Output: $6
Model: 0.300
Completion: 10.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-28
Claude Sonnet 4.6 (US) us.anthropic.claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Haiku 4.5 (AU) au.anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Llama 3.3 70B Instruct meta.llama3-3-70b-instruct-v1:0 128K 4.1K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
GLM-5 zai.glm-5 202.8K 101.4K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Claude Opus 4.8 (US) us.anthropic.claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4.7 (Global) global.anthropic.claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Fable 5 (US) us.anthropic.claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-06-09
Nova Lite amazon.nova-lite-v1:0 300K 8.2K Input: $0.06
Output: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Claude Haiku 4.5 (US) us.anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Voxtral Mini 3B 2507 mistral.voxtral-mini-3b-2507 128K 4.1K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ - In: audio, text
Out: text
Released: 2024-12-01
Kimi K2 Thinking moonshot.kimi-k2-thinking 262.1K 16K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-02
Llama 3.1 70B Instruct meta.llama3-1-70b-instruct-v1:0 128K 4.1K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
DeepSeek-R1 (US) us.deepseek.r1-v1:0 128K 32.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
Claude Sonnet 4.6 (Global) global.anthropic.claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Haiku 4.5 anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Kimi K2.5 moonshotai.kimi-k2.5 262.1K 16K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-06
Claude Opus 4.8 (AU) au.anthropic.claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
NVIDIA Nemotron Nano 12B v2 VL BF16 nvidia.nemotron-nano-12b-v2 128K 4.1K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-01
GLM-4.7-Flash zai.glm-4.7-flash 200K 131.1K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
Llama 4 Scout 17B Instruct meta.llama4-scout-17b-instruct-v1:0 3.5M 16.4K Input: $0.17
Output: $0.66
Model: 0.085
Completion: 3.882
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Qwen3 235B A22B 2507 qwen.qwen3-235b-a22b-2507-v1:0 262.1K 131.1K Input: $0.22
Output: $0.88
Model: 0.110
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2025-09-18
Claude Haiku 4.5 (EU) eu.anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
gpt-oss-20b openai.gpt-oss-20b-1:0 128K 16.4K Input: $0.07
Output: $0.3
Model: 0.035
Completion: 4.286
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude Opus 4.8 (JP) jp.anthropic.claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4.8 anthropic.claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Qwen3 Coder 30B A3B Instruct qwen.qwen3-coder-30b-a3b-v1:0 262.1K 131.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-09-18
Qwen/Qwen3-Next-80B-A3B-Instruct qwen.qwen3-next-80b-a3b 262K 262K Input: $0.14
Output: $1.4
Model: 0.070
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
GPT-5.5 openai.gpt-5.5 272K 128K Input: $5.5
Output: $33
Cache Read: $0.55
Model: 2.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Updated: 2026-06-01
AU Anthropic Claude Sonnet 4.6 au.anthropic.claude-sonnet-4-6 1M 128K Input: $3.3
Output: $16.5
Cache Read: $0.33
Cache Write: $4.125
Model: 1.650
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08 In: text, image, pdf
Out: text
Released: 2026-02-17
Claude Opus 4.5 (US) us.anthropic.claude-opus-4-5-20251101-v1:0 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
MiniMax M2.5 minimax.minimax-m2.5 196.6K 98.3K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Claude Opus 4.8 (EU) eu.anthropic.claude-opus-4-8 1M 128K Input: $5.5
Output: $27.5
Cache Read: $0.55
Cache Write: $6.875
Model: 2.750
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4.7 (EU) eu.anthropic.claude-opus-4-7 1M 128K Input: $5.5
Output: $27.5
Cache Read: $0.55
Cache Write: $6.875
Model: 2.750
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Llama 3.1 8B Instruct meta.llama3-1-8b-instruct-v1:0 128K 4.1K Input: $0.22
Output: $0.22
Model: 0.110
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Opus 4.1 (US) us.anthropic.claude-opus-4-1-20250805-v1:0 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Llama 4 Maverick 17B Instruct meta.llama4-maverick-17b-instruct-v1:0 1M 16.4K Input: $0.24
Output: $0.97
Model: 0.120
Completion: 4.042
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Claude Opus 4.5 (Global) global.anthropic.claude-opus-4-5-20251101-v1:0 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
NVIDIA Nemotron 3 Super 120B A12B nvidia.nemotron-super-3-120b 262.1K 131.1K Input: $0.15
Output: $0.65
Model: 0.075
Completion: 4.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
NVIDIA Nemotron Nano 3 30B nvidia.nemotron-nano-3-30b 128K 4.1K Input: $0.06
Output: $0.24
Model: 0.030
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
Claude Opus 4.5 (EU) eu.anthropic.claude-opus-4-5-20251101-v1:0 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
Claude Opus 4.6 (Global) global.anthropic.claude-opus-4-6-v1 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.5 (US) us.anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Google Gemma 3 12B google.gemma-3-12b-it 131.1K 8.2K Input: $0.05
Output: $0.1
Model: 0.025
Completion: 2.000
🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-01
MiniMax M2.1 minimax.minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
Claude Fable 5 (EU) eu.anthropic.claude-fable-5 1M 128K Input: $11
Output: $55
Cache Read: $1.1
Cache Write: $13.75
Model: 5.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-06-09
DeepSeek-V3.2 deepseek.v3.2 163.8K 81.9K Input: $0.62
Output: $1.85
Model: 0.310
Completion: 2.984
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2026-02-06
Ministral 14B 3.0 mistral.ministral-3-14b-instruct 128K 4.1K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
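
The NewAPI Ratios in the table above appear consistent with three simple relations to the USD prices: Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price. These relations are inferred from the listed rows and are not a formula documented on this page. The function name `newapi_ratios` below is hypothetical, and the sketch is only a way to cross-check a row.

```python
from typing import Optional


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> dict:
    """Derive NewAPI-style ratios from per-1M-token USD prices.

    Assumption (inferred from the table rows, not from NewAPI docs):
      model      = input / 2
      completion = output / input
      cache      = cache_read / input
    """
    if input_usd <= 0:
        # Free or unpriced models have no meaningful ratios.
        raise ValueError("input_usd must be positive")
    ratios = {
        "model": round(input_usd / 2, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# Cross-check against the Claude Sonnet 4.5 row: Input $3, Output $15, Cache Read $0.3.
print(newapi_ratios(3, 15, 0.3))
```

For the Claude Sonnet 4.5 prices this gives Model 1.5, Completion 5.0, and Cache 0.1, which matches the ratios listed for that row. Rows without a Cache Read price simply omit the cache entry.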

Ambient

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.2
Cache Write: $0
Model: 0.475
Completion: 4.211
Cache: 0.211
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
GLM-5.1 zai-org/GLM-5.1-FP8 202.8K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0
Cache Write: $0
Model: 0.700
Completion: 3.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27

Anthropic

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Opus 4.5 (latest) claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Haiku 3.5 claude-3-5-haiku-20241022 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Opus 4 (latest) claude-opus-4-0 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 3 claude-3-opus-20240229 200K 4.1K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-02-29
Claude Opus 4.1 claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.5 claude-opus-4-5-20251101 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Opus 4.8 claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4 claude-opus-4-20250514 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Sonnet 3.5 claude-3-5-sonnet-20240620 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image, pdf
Out: text
Released: 2024-06-20
Claude Sonnet 4 claude-sonnet-4-20250514 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.1 (latest) claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Haiku 3 claude-3-haiku-20240307 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-13
Claude Fable 5 claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Sonnet 4 (latest) claude-sonnet-4-0 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image, pdf
Out: text
Released: 2025-02-19
Claude Haiku 4.5 (latest) claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 3 claude-3-sonnet-20240229 200K 4.1K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-04
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Haiku 3.5 (latest) claude-3-5-haiku-latest 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22

AnyAPI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4.3 xai/grok-4.3 1M 30K - - 📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3 Pro Preview google/gemini-3-pro-preview 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
o3 openai/o3 200K 100K - - 📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
GPT-5 openai/gpt-5 400K 128K - - 📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o4-mini openai/o4-mini 200K 100K - - 📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o3-mini openai/o3-mini 200K 100K - - 🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 openai/gpt-5.2 400K 128K - - 📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.4 openai/gpt-5.4 1.1M 128K - - 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-4.1 openai/gpt-4.1 1M 32.8K - - 📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K - - 📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K - - 📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5.1 openai/gpt-5.1 400K 128K - - 📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Devstral 2 mistralai/devstral-2512 262.1K 262.1K - - 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Mistral Large 3 mistralai/mistral-large-2512 262.1K 262.1K - - 📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4-5 200K 64K - - 📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 anthropic/claude-opus-4-7 1M 128K - - 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Haiku 4.5 (latest) anthropic/claude-haiku-4-5 200K 64K - - 📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 anthropic/claude-opus-4-6 1M 128K - - 📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 1M 64K - - 📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Command R+ cohere/command-r-plus-08-2024 128K 4K - - 🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Sonar Reasoning Pro perplexity/sonar-reasoning-pro 128K 4.1K - - 📎 🧠 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar Pro perplexity/sonar-pro 200K 8.2K - - 📎 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek Reasoner deepseek/deepseek-r1 1M 384K - - 📎 🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28
DeepSeek Chat deepseek/deepseek-chat 1M 384K - - 📎 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28

Atomic Chat

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemma 4 E4B Instruct (IQ4_XS) gemma-4-E4B-it-IQ4_XS 32.8K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-02
Meta Llama 3.1 8B Instruct (GGUF) Meta-Llama-3_1-8B-Instruct-GGUF 131.1K 4.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Qwen 3.5 9B (MLX 4-bit) Qwen3_5-9B-MLX-4bit 32.8K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-05
Updated: 2026-04-04
Gemma 4 E4B Instruct (MLX 4-bit) gemma-4-E4B-it-MLX-4bit 32.8K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-02
Qwen 3.5 9B (Q4_K_M) Qwen3_5-9B-Q4_K_M 32.8K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-05
Updated: 2026-04-04

Auriko

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Grok 4.3 grok-4.3 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GLM-5.1 glm-5.1 200K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
MiniMax-M2.7-highspeed minimax-m2-7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Write: $0.375
Model: 0.300
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.7 minimax-m2-7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Write: $0.375
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.6 Plus qwen-3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.1
Model: 0.250
Completion: 6.000
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.5
Output: $2.8
Model: 0.250
Completion: 5.600
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
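
Note on NewAPI Ratios: the ratio values in these tables appear to follow directly from the listed prices. The sketch below is inferred from the rows on this page, not taken from any official NewAPI documentation. It assumes a Model ratio of 1.0 corresponds to $2 per 1M input tokens, that Completion is the output price divided by the input price, and that Cache is the cache read price divided by the input price. The helper name `newapi_ratios` and the constant `USD_PER_RATIO_UNIT` are hypothetical.

```python
from typing import Optional

# Assumption inferred from the table rows: ratio 1.0 == $2 per 1M input tokens.
USD_PER_RATIO_UNIT = 2.0


def newapi_ratios(input_usd: float, output_usd: float,
                  cache_read_usd: Optional[float] = None) -> dict:
    """Derive Model / Completion / Cache ratios from per-1M-token prices.

    Hypothetical helper for illustration; rounds to three decimals,
    matching the precision shown in the tables.
    """
    ratios = {
        "model": round(input_usd / USD_PER_RATIO_UNIT, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    # The Cache ratio is only listed for rows that have a Cache Read price.
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# DeepSeek V4 Flash row (Auriko): Input $0.14, Output $0.28, Cache Read $0.0028
print(newapi_ratios(0.14, 0.28, 0.0028))
```

Rows with a Cache Write price but no Cache Read price (for example MiniMax-M2.7) list no Cache ratio, which is why the sketch treats the cache read price as optional.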

Azure

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Codex Mini codex-mini 200K 100K Input: $1.5
Output: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16
Phi-3.5-MoE-instruct phi-3.5-moe-instruct 128K 4.1K Input: $0.16
Output: $0.64
Model: 0.080
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-09-21
DeepSeek-R1-0528 deepseek-r1-0528 163.8K 163.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-V4-Flash deepseek-v4-flash 1M 384K Input: $0.19
Output: $0.51
Model: 0.095
Completion: 2.684
🧠 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
GPT-5.2 Chat gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
o3 o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
DeepSeek-V3-0324 deepseek-v3-0324 131.1K 131.1K Input: $1.14
Output: $4.56
Model: 0.570
Completion: 4.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-03-24
Phi-3-small-instruct (128k) phi-3-small-128k-instruct 128K 4.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Meta-Llama-3-8B-Instruct meta-llama-3-8b-instruct 8.2K 2K Input: $0.3
Output: $0.61
Model: 0.150
Completion: 2.033
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Mistral Small 3.1 mistral-small-2503 128K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-03-01
text-embedding-3-large text-embedding-3-large 8.2K 3.1K Input: $0.13
Output: $0
Model: 0.065 - - In: text
Out: text
Released: 2024-01-25
o1-mini o1-mini 128K 65.5K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
Phi-3.5-mini-instruct phi-3.5-mini-instruct 128K 4.1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Mistral Nemo mistral-nemo 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-18
Embed v4 cohere-embed-v-4-0 128K 1.5K Input: $0.12
Output: $0
Model: 0.060 📎 - In: text, image
Out: text
Open Weights
Released: 2025-04-15
Claude Opus 4.5 claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
GPT-5 gpt-5 272K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Chat gpt-5-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 2024-10-24 In: text, image
Out: text
Released: 2025-08-07
Command A cohere-command-a 256K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🧠 🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-03-13
Llama-3.2-11B-Vision-Instruct llama-3.2-11b-vision-instruct 128K 8.2K Input: $0.37
Output: $0.37
Model: 0.185
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
GPT-5 Pro gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
Command R cohere-command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4 gpt-4 8.2K 8.2K Input: $60
Output: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Phi-3-medium-instruct (128k) phi-3-medium-128k-instruct 128K 4.1K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
GPT-4 32K gpt-4-32k 32.8K 32.8K Input: $60
Output: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
Meta-Llama-3.1-405B-Instruct meta-llama-3.1-405b-instruct 128K 32.8K Input: $5.33
Output: $16
Model: 2.665
Completion: 3.002
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Command R+ cohere-command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Phi-4-mini phi-4-mini 128K 4.1K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 16.4K 16.4K Input: $1
Output: $2
Model: 0.500
Completion: 2.000
🌡️ 2021-08 In: text
Out: text
Released: 2023-11-06
Llama 4 Scout 17B 16E Instruct llama-4-scout-17b-16e-instruct 128K 8.2K Input: $0.2
Output: $0.78
Model: 0.100
Completion: 3.900
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 128K 32.8K Input: $0.71
Output: $0.71
Model: 0.355
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Grok 4.20 (Non-Reasoning) grok-4-20-non-reasoning 262K 8.2K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2025-09 In: text
Out: text
Released: 2026-04-08
GPT-5.1 Chat gpt-5.1-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
DeepSeek-V4-Pro deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Model: 0.870
Completion: 2.000
🧠 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Updated: 2025-12-02
Claude Sonnet 4.5 claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-11-18
DeepSeek-R1 deepseek-r1 163.8K 163.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning 128K 8.2K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-27
GPT-5.4 Nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Phi-3-small-instruct (8k) phi-3-small-8k-instruct 8.2K 2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
GPT-5.3 Chat gpt-5.3-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-03
GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 16.4K 16.4K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🌡️ 2021-08 In: text
Out: text
Released: 2024-01-25
Embed v3 Multilingual cohere-embed-v3-multilingual 512 1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Open Weights
Released: 2023-11-07
GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 16.4K 16.4K Input: $3
Output: $4
Model: 1.500
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-06-13
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
GPT-5.1 Codex Max gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Phi-4-reasoning phi-4-reasoning 32K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Mistral Medium 3 mistral-medium-2505 128K 128K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Meta-Llama-3.1-70B-Instruct meta-llama-3.1-70b-instruct 128K 32.8K Input: $2.68
Output: $3.54
Model: 1.340
Completion: 1.321
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
text-embedding-ada-002 text-embedding-ada-002 8.2K 1.5K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Released: 2022-12-15
o3-mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.125
Model: 0.875
Completion: 8.000
Cache: 0.071
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-4 Turbo Vision gpt-4-turbo-vision 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
Mistral Large 24.11 mistral-large-2411 128K 32.8K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-11-01
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-02-24
text-embedding-3-small text-embedding-3-small 8.2K 1.5K Input: $0.02
Output: $0
Model: 0.010 - - In: text
Out: text
Released: 2024-01-25
DeepSeek-V3.2-Speciale deepseek-v3.2-speciale 128K 128K Input: $0.58
Output: $1.68
Model: 0.290
Completion: 2.897
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-03-01
Meta-Llama-3.1-8B-Instruct meta-llama-3.1-8b-instruct 128K 32.8K Input: $0.3
Output: $0.61
Model: 0.150
Completion: 2.033
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Opus 4.1 claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-18
GPT-5.1 Codex Mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-14
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-02-06
Phi-4 phi-4 128K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Grok 4 Fast (Reasoning) grok-4-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
GPT-4.1 nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Claude Fable 5 claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
o1 o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
DeepSeek-V3.1 deepseek-v3.1 131.1K 131.1K Input: $0.56
Output: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-08-21
Llama-3.2-90B-Vision-Instruct llama-3.2-90b-vision-instruct 128K 8.2K Input: $2.04
Output: $2.04
Model: 1.020
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Claude Haiku 4.5 claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-11-18
Ministral 3B ministral-3b 128K 8.2K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-10-22
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 Mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Model: 0.475
Completion: 4.211
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-22
GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Claude Opus 4.6 claude-opus-4-6 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Grok 4.1 Fast (Reasoning) grok-4-1-fast-reasoning 128K 8.2K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-27
GPT-5 Mini gpt-5-mini 272K 128K Input: $0.25
Output: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4 Turbo gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-5 Nano gpt-5-nano 272K 128K Input: $0.05
Output: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Phi-3-medium-instruct (4k) phi-3-medium-4k-instruct 4.1K 1K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
GPT-5.4 Pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
Embed v3 English cohere-embed-v3-english 512 1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Open Weights
Released: 2023-11-07
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Codestral 25.01 codestral-2501 256K 256K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-03 In: text
Out: text
Released: 2025-01-01
Phi-4-reasoning-plus phi-4-reasoning-plus 32K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-4-multimodal phi-4-multimodal 128K 4.1K Input: $0.08
Output: $0.32
Input Audio: $4
Model: 0.040
Completion: 4.000
📎 🌡️ 2023-10 In: text, image, audio
Out: text
Open Weights
Released: 2024-12-11
GPT-4o mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
Model Router model-router 128K 16.4K Input: $0.14
Output: $0
Model: 0.070 📎 🔧 - In: text, image
Out: text
Released: 2025-05-19
Updated: 2025-11-18
Grok 4.20 (Reasoning) grok-4-20-reasoning 262K 8.2K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Released: 2026-04-08
GPT-5-Codex gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
DeepSeek-V3.2 deepseek-v3.2 128K 128K Input: $0.58
Output: $1.68
Model: 0.290
Completion: 2.897
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
Llama 4 Maverick 17B 128E Instruct FP8 llama-4-maverick-17b-128e-instruct-fp8 128K 8.2K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-01-14
Phi-4-mini-reasoning phi-4-mini-reasoning 128K 4.1K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-mini-instruct (128k) phi-3-mini-128k-instruct 128K 4.1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-mini-instruct (4k) phi-3-mini-4k-instruct 4.1K 1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Meta-Llama-3-70B-Instruct meta-llama-3-70b-instruct 8.2K 2K Input: $2.68
Output: $3.54
Model: 1.340
Completion: 1.321
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
GPT-5.1 gpt-5.1 272K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-24
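
Note on Pricing (1M): prices in these tables are quoted per one million tokens. As a rough illustration, the sketch below estimates the cost of a single request from a row's prices. It assumes cached input tokens are billed at the Cache Read price instead of the Input price, and that a row without a Cache Read price bills all input at the Input price. Actual provider billing may differ. The function name `request_cost_usd` and its parameters are hypothetical.

```python
from typing import Optional

TOKENS_PER_PRICE_UNIT = 1_000_000  # prices are listed per 1M tokens


def request_cost_usd(input_tokens: int, output_tokens: int,
                     input_price: float, output_price: float,
                     cached_tokens: int = 0,
                     cache_read_price: Optional[float] = None) -> float:
    """Estimate one request's cost in USD (hypothetical helper).

    Assumption: cached input tokens are charged at cache_read_price and
    the remaining input tokens at input_price.
    """
    if cache_read_price is None:
        # No Cache Read price listed: treat cached tokens as regular input.
        cache_read_price = input_price
    uncached_tokens = input_tokens - cached_tokens
    total = (uncached_tokens * input_price
             + cached_tokens * cache_read_price
             + output_tokens * output_price)
    return total / TOKENS_PER_PRICE_UNIT


# GPT-5.5 row (Azure): Input $5, Output $30, Cache Read $0.5
cost = request_cost_usd(100_000, 2_000, 5, 30,
                        cached_tokens=40_000, cache_read_price=0.5)
print(f"${cost:.2f}")
```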

Azure Cognitive Services

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Opus 4.5 claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Updated: 2025-08-01
Claude Sonnet 4.5 claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-11-18
GPT-5.4 Nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Claude Opus 4.1 claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-18
Claude Haiku 4.5 claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-11-18
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 Mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Model: 0.475
Completion: 4.211
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-22
Claude Opus 4.6 claude-opus-4-6 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT-5.4 Pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-24
GPT-5.1 gpt-5.1 272K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
Meta-Llama-3-70B-Instruct meta-llama-3-70b-instruct 8.2K 2K Input: $2.68
Output: $3.54
Model: 1.340
Completion: 1.321
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Phi-3-mini-instruct (4k) phi-3-mini-4k-instruct 4.1K 1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-mini-instruct (128k) phi-3-mini-128k-instruct 128K 4.1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-4-mini-reasoning phi-4-mini-reasoning 128K 4.1K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-01-14
Llama 4 Maverick 17B 128E Instruct FP8 llama-4-maverick-17b-128e-instruct-fp8 128K 8.2K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
DeepSeek-V3.2 deepseek-v3.2 128K 128K Input: $0.58
Output: $1.68
Model: 0.290
Completion: 2.897
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
GPT-5-Codex gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
Model Router model-router 128K 16.4K Input: $0.14
Output: $0
Model: 0.070 📎 🔧 - In: text, image
Out: text
Released: 2025-05-19
Updated: 2025-11-18
GPT-4o mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
Phi-4-multimodal phi-4-multimodal 128K 4.1K Input: $0.08
Output: $0.32
Input Audio: $4
Model: 0.040
Completion: 4.000
📎 🌡️ 2023-10 In: text, image, audio
Out: text
Open Weights
Released: 2024-12-11
Phi-4-reasoning-plus phi-4-reasoning-plus 32K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Codestral 25.01 codestral-2501 256K 256K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-03 In: text
Out: text
Released: 2025-01-01
Embed v3 English cohere-embed-v3-english 512 1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Open Weights
Released: 2023-11-07
Phi-3-medium-instruct (4k) phi-3-medium-4k-instruct 4.1K 1K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
GPT-5 Nano gpt-5-nano 272K 128K Input: $0.05
Output: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4 Turbo gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4.1 mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Mini gpt-5-mini 272K 128K Input: $0.25
Output: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Ministral 3B ministral-3b 128K 8.2K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-10-22
Llama-3.2-90B-Vision-Instruct llama-3.2-90b-vision-instruct 128K 8.2K Input: $2.04
Output: $2.04
Model: 1.020
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
DeepSeek-V3.1 deepseek-v3.1 131.1K 131.1K Input: $0.56
Output: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-08-21
o1 o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-4.1 nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Grok 4 Fast (Reasoning) grok-4-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
Phi-4 phi-4 128K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-02-06
GPT-5.1 Codex Mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-14
Meta-Llama-3.1-8B-Instruct meta-llama-3.1-8b-instruct 128K 32.8K Input: $0.3
Output: $0.61
Model: 0.150
Completion: 2.033
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-03-01
DeepSeek-V3.2-Speciale deepseek-v3.2-speciale 128K 128K Input: $0.58
Output: $1.68
Model: 0.290
Completion: 2.897
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
text-embedding-3-small text-embedding-3-small 8.2K 1.5K Input: $0.02
Output: $0
Model: 0.010 - - In: text
Out: text
Released: 2024-01-25
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-02-24
Mistral Large 24.11 mistral-large-2411 128K 32.8K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-11-01
GPT-4 Turbo Vision gpt-4-turbo-vision 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.125
Model: 0.875
Completion: 8.000
Cache: 0.071
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
o3-mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
text-embedding-ada-002 text-embedding-ada-002 8.2K 1.5K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Released: 2022-12-15
Meta-Llama-3.1-70B-Instruct meta-llama-3.1-70b-instruct 128K 32.8K Input: $2.68
Output: $3.54
Model: 1.340
Completion: 1.321
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Mistral Medium 3 mistral-medium-2505 128K 128K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Phi-4-reasoning phi-4-reasoning 32K 4.1K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 16.4K 16.4K Input: $3
Output: $4
Model: 1.500
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-06-13
Embed v3 Multilingual cohere-embed-v3-multilingual 512 1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Open Weights
Released: 2023-11-07
GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 16.4K 16.4K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🌡️ 2021-08 In: text
Out: text
Released: 2024-01-25
Phi-3-small-instruct (8k) phi-3-small-8k-instruct 8.2K 2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
DeepSeek-R1 deepseek-r1 163.8K 163.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Updated: 2025-12-02
GPT-5.1 Chat gpt-5.1-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image, audio
Out: text, image, audio
Released: 2025-11-14
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 128K 32.8K Input: $0.71
Output: $0.71
Model: 0.355
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 4 Scout 17B 16E Instruct llama-4-scout-17b-16e-instruct 128K 8.2K Input: $0.2
Output: $0.78
Model: 0.100
Completion: 3.900
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 16.4K 16.4K Input: $1
Output: $2
Model: 0.500
Completion: 2.000
🌡️ 2021-08 In: text
Out: text
Released: 2023-11-06
Phi-4-mini phi-4-mini 128K 4.1K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Command R+ cohere-command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Meta-Llama-3.1-405B-Instruct meta-llama-3.1-405b-instruct 128K 32.8K Input: $5.33
Output: $16
Model: 2.665
Completion: 3.002
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
GPT-4 32K gpt-4-32k 32.8K 32.8K Input: $60
Output: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
Phi-3-medium-instruct (128k) phi-3-medium-128k-instruct 128K 4.1K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-4 gpt-4 8.2K 8.2K Input: $60
Output: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
Command R cohere-command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
GPT-5 Pro gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
Llama-3.2-11B-Vision-Instruct llama-3.2-11b-vision-instruct 128K 8.2K Input: $0.37
Output: $0.37
Model: 0.185
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Command A cohere-command-a 256K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🧠 🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-03-13
GPT-5 Chat gpt-5-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 2024-10-24 In: text, image
Out: text
Released: 2025-08-07
GPT-5 gpt-5 272K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Embed v4 cohere-embed-v-4-0 128K 1.5K Input: $0.12
Output: $0
Model: 0.060 📎 - In: text, image
Out: text
Open Weights
Released: 2025-04-15
Mistral Nemo mistral-nemo 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-18
Phi-3.5-mini-instruct phi-3.5-mini-instruct 128K 4.1K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
o1-mini o1-mini 128K 65.5K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
text-embedding-3-large text-embedding-3-large 8.2K 3.1K Input: $0.13
Output: $0
Model: 0.065 - - In: text
Out: text
Released: 2024-01-25
Mistral Small 3.1 mistral-small-2503 128K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-03-01
Meta-Llama-3-8B-Instruct meta-llama-3-8b-instruct 8.2K 2K Input: $0.3
Output: $0.61
Model: 0.150
Completion: 2.033
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Phi-3-small-instruct (128k) phi-3-small-128k-instruct 128K 4.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
DeepSeek-V3-0324 deepseek-v3-0324 131.1K 131.1K Input: $1.14
Output: $4.56
Model: 0.570
Completion: 4.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-03-24
o3 o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5.2 Chat gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
DeepSeek-R1-0528 deepseek-r1-0528 163.8K 163.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-05-28
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-09-21
Phi-3.5-MoE-instruct phi-3.5-moe-instruct 128K 4.1K Input: $0.16
Output: $0.64
Model: 0.080
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Codex Mini codex-mini 200K 100K Input: $1.5
Output: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16

Bailing

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Ring-1T Ring-1T 128K 32K Input: $0.57
Output: $2.29
Model: 0.285
Completion: 4.018
🧠 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-10
Ling-1T Ling-1T 128K 32K Input: $0.57
Output: $2.29
Model: 0.285
Completion: 4.018
🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-10

Baseten

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/Kimi-K2.6 262K 262K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 moonshotai/Kimi-K2.5 262K 262K Input: $0.6
Output: $3
Cache Read: $0.12
Model: 0.300
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-12 In: text, image
Out: text
Open Weights
Released: 2026-01-30
Updated: 2026-02-12
OpenAI GPT 120B openai/gpt-oss-120b 128.1K 128.1K Input: $0.1
Output: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-05
Nemotron Super nvidia/Nemotron-120B-A12B 202.8K 202.8K Input: $0.06
Output: $0.75
Cache Read: $0.06
Model: 0.030
Completion: 12.500
Cache: 1.000
🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-03-11
Nemotron Ultra nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B 202.8K 202.8K Input: $0.6
Output: $2.4
Cache Read: $0.12
Model: 0.300
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
GLM 5 zai-org/GLM-5 202.8K 202.8K Input: $0.95
Output: $3.15
Cache Read: $0.2
Model: 0.475
Completion: 3.316
Cache: 0.211
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Open Weights
Released: 2026-02-12
GLM 4.7 zai-org/GLM-4.7 200K 200K Input: $0.12
Output: $2.2
Cache Read: $0.12
Model: 0.060
Completion: 18.333
Cache: 1.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM 5.1 zai-org/GLM-5.1 202.8K 202.8K Input: $1.3
Output: $4.3
Cache Read: $0.26
Model: 0.650
Completion: 3.308
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 164K 131K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-25
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 131K 131K Input: $1.74
Output: $3.48
Cache Read: $0.145
Model: 0.870
Completion: 2.000
Cache: 0.083
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 204K 204K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Open Weights
Released: 2026-02-12
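
Note on the NewAPI Ratios column: the values listed in these tables appear consistent with a simple derivation from the per-1M prices (Model = input price / 2, Completion = output price / input price, Cache = cache-read price / input price, each shown to three decimals). This page does not state that rule, so the following is a minimal sketch under that assumption; the function name `newapi_ratios` and its parameters are hypothetical and not part of any API. Rows with a zero price show no ratio in the tables, so the sketch omits those keys.

```python
from typing import Dict, Optional


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> Dict[str, float]:
    """Derive the displayed ratios from per-1M-token prices (assumed rule).

    Assumption inferred from the table rows, not from documentation:
      model      = input / 2
      completion = output / input
      cache      = cache_read / input
    """
    if input_usd <= 0:
        # Free or unpriced rows show "-" instead of ratios.
        return {}

    ratios = {"model": round(input_usd / 2, 3)}
    if output_usd > 0:
        ratios["completion"] = round(output_usd / input_usd, 3)
    if cache_read_usd:
        # None or 0 means no cache ratio is displayed.
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# Ring-1T (Bailing): Input $0.57, Output $2.29
ring = newapi_ratios(0.57, 2.29)

# Kimi K2.6 (Baseten): Input $0.95, Output $4, Cache Read $0.16
kimi = newapi_ratios(0.95, 4.0, 0.16)
```

With the Ring-1T and Kimi K2.6 prices listed above, this reproduces the ratios shown in their rows. Python's `round` can differ from the page's rounding on exact ties in the last digit, so a value may occasionally differ by 0.001.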

Berget.AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct 128K 8.2K Input: $0.99
Output: $0.99
Model: 0.495
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-04-27
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 262.1K Input: $0.83
Output: $3.85
Cache Read: $0.16
Model: 0.415
Completion: 4.639
Cache: 0.193
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-07
Gemma 4 31B Instruct google/gemma-4-31B-it 128K 8.2K Input: $0.275
Output: $0.55
Model: 0.138
Completion: 2.000
📎 🧠 🔧 🌡️ 2025-12 In: audio, image, text, video
Out: text
Open Weights
Released: 2026-04-02
GPT-OSS-120B openai/gpt-oss-120b 128K 8.2K Input: $0.44
Output: $0.99
Model: 0.220
Completion: 2.250
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-05
Mistral Medium 3.5 128B mistralai/Mistral-Medium-3.5-128B 262.1K 131.1K Input: $1.65
Output: $5.5
Model: 0.825
Completion: 3.333
📎 🧠 🔧 🌡️ 2026-04 In: image, text
Out: text
Open Weights
Released: 2026-04-29
Mistral Small 3.2 24B Instruct 2506 mistralai/Mistral-Small-3.2-24B-Instruct-2506 32K 8.2K Input: $0.33
Output: $0.33
Model: 0.165
Completion: 1.000
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-10-01
GLM 4.7 zai-org/GLM-4.7 128K 8.2K Input: $0.77
Output: $2.75
Model: 0.385
Completion: 3.571
🧠 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-01-19

Cerebras

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT OSS 120B gpt-oss-120b 131.1K 41K Input: $0.35
Output: $0.75
Model: 0.175
Completion: 2.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-06-10
Z.AI GLM-4.7 zai-glm-4.7 131.1K 41K Input: $2.25
Output: $2.75
Cache Read: $0
Cache Write: $0
Model: 1.125
Completion: 1.222
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-07
Updated: 2026-06-10

Chutes

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 TEE moonshotai/Kimi-K2.6-TEE 262.1K 65.5K Input: $0.95
Output: $4
Cache Read: $0.475
Model: 0.475
Completion: 4.211
Cache: 0.500
📎 🧠 🔧 🌡️ 2025-12 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 TEE moonshotai/Kimi-K2.5-TEE 262.1K 65.5K Input: $0.44
Output: $2
Cache Read: $0.22
Model: 0.220
Completion: 4.545
Cache: 0.500
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-01
gemma 4 31B turbo TEE google/gemma-4-31B-turbo-TEE 131.1K 65.5K Input: $0.13
Output: $0.38
Cache Read: $0.065
Model: 0.065
Completion: 2.923
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Qwen3 32B TEE Qwen/Qwen3-32B-TEE 41K 41K Input: $0.08
Output: $0.24
Cache Read: $0.04
Model: 0.040
Completion: 3.000
Cache: 0.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct 16.4K 16.4K Input: $0.0543
Output: $0.2174
Cache Read: $0.02715
Model: 0.027
Completion: 4.004
Cache: 0.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen3 Next 80B A3B Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262.1K 262.1K Input: $0.1
Output: $0.8
Cache Read: $0.05
Model: 0.050
Completion: 8.000
Cache: 0.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen3 235B A22B Instruct 2507 TEE Qwen/Qwen3-235B-A22B-Instruct-2507-TEE 262.1K 65.5K Input: $0.1
Output: $0.6
Cache Read: $0.05
Model: 0.050
Completion: 6.000
Cache: 0.500
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 262.1K Input: $0.11
Output: $0.6
Cache Read: $0.055
Model: 0.055
Completion: 5.455
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen3.6 27B TEE Qwen/Qwen3.6-27B-TEE 262.1K 65.5K Input: $0.195
Output: $1.56
Cache Read: $0.0975
Model: 0.098
Completion: 8.000
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-22
Qwen3Guard Gen 0.6B Qwen/Qwen3Guard-Gen-0.6B 32.8K 8.2K Input: $0.01
Output: $0.0109
Cache Read: $0.005
Model: 0.005
Completion: 1.090
Cache: 0.500
🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen3 Coder Next TEE Qwen/Qwen3-Coder-Next-TEE 262.1K 65.5K Input: $0.12
Output: $0.75
Cache Read: $0.06
Model: 0.060
Completion: 6.250
Cache: 0.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-25
Qwen2.5 Coder 32B Instruct Qwen/Qwen2.5-Coder-32B-Instruct 32.8K 32.8K Input: $0.0272
Output: $0.1087
Cache Read: $0.0136
Model: 0.014
Completion: 3.996
Cache: 0.500
🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct 32.8K 32.8K Input: $0.2989
Output: $1.1957
Cache Read: $0.14945
Model: 0.149
Completion: 4.000
Cache: 0.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Qwen3.5 397B A17B TEE Qwen/Qwen3.5-397B-A17B-TEE 262.1K 65.5K Input: $0.39
Output: $2.34
Cache Read: $0.195
Model: 0.195
Completion: 6.000
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-15
Qwen3 30B A3B Qwen/Qwen3-30B-A3B 41K 41K Input: $0.06
Output: $0.22
Cache Read: $0.03
Model: 0.030
Completion: 3.667
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
gpt oss 120b TEE openai/gpt-oss-120b-TEE 131.1K 65.5K Input: $0.09
Output: $0.36
Cache Read: $0.045
Model: 0.045
Completion: 4.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
MiMo V2 Flash TEE XiaomiMiMo/MiMo-V2-Flash-TEE 262.1K 65.5K Input: $0.09
Output: $0.29
Cache Read: $0.045
Model: 0.045
Completion: 3.222
Cache: 0.500
🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
Hermes 4 14B NousResearch/Hermes-4-14B 41K 41K Input: $0.0136
Output: $0.0543
Cache Read: $0.0068
Model: 0.007
Completion: 3.993
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
DeepHermes 3 Mistral 24B Preview NousResearch/DeepHermes-3-Mistral-24B-Preview 32.8K 32.8K Input: $0.0245
Output: $0.0978
Cache Read: $0.01225
Model: 0.012
Completion: 3.992
Cache: 0.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Llama 3.2 1B Instruct unsloth/Llama-3.2-1B-Instruct 16.4K 8.2K Input: $0.01
Output: $0.0109
Cache Read: $0.005
Model: 0.005
Completion: 1.090
Cache: 0.500
🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-04-25
gemma 3 12b it unsloth/gemma-3-12b-it 131.1K 131.1K Input: $0.03
Output: $0.1
Cache Read: $0.015
Model: 0.015
Completion: 3.333
Cache: 0.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
gemma 3 4b it unsloth/gemma-3-4b-it 96K 96K Input: $0.01
Output: $0.0272
Cache Read: $0.005
Model: 0.005
Completion: 2.720
Cache: 0.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
gemma 3 27b it unsloth/gemma-3-27b-it 128K 65.5K Input: $0.0272
Output: $0.1087
Cache Read: $0.0136
Model: 0.014
Completion: 3.996
Cache: 0.500
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Mistral Nemo Instruct 2407 unsloth/Mistral-Nemo-Instruct-2407 131.1K 131.1K Input: $0.02
Output: $0.04
Cache Read: $0.01
Model: 0.010
Completion: 2.000
Cache: 0.500
🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
Llama 3.2 3B Instruct unsloth/Llama-3.2-3B-Instruct 16.4K 16.4K Input: $0.01
Output: $0.0136
Cache Read: $0.005
Model: 0.005
Completion: 1.360
Cache: 0.500
🌡️ - In: text
Out: text
Open Weights
Released: 2025-02-12
Updated: 2026-04-25
DeepSeek TNG R1T2 Chimera TEE tngtech/DeepSeek-TNG-R1T2-Chimera-TEE 163.8K 163.8K Input: $0.3
Output: $1.1
Cache Read: $0.15
Model: 0.150
Completion: 3.667
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-25
GLM 4.7 TEE zai-org/GLM-4.7-TEE 202.8K 65.5K Input: $0.39
Output: $1.75
Cache Read: $0.195
Model: 0.195
Completion: 4.487
Cache: 0.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM 5 TEE zai-org/GLM-5-TEE 202.8K 65.5K Input: $0.95
Output: $2.55
Cache Read: $0.475
Model: 0.475
Completion: 2.684
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM 5.1 TEE zai-org/GLM-5.1-TEE 202.8K 65.5K Input: $1.05
Output: $3.5
Cache Read: $0.525
Model: 0.525
Completion: 3.333
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM 4.6V zai-org/GLM-4.6V 131.1K 65.5K Input: $0.3
Output: $0.9
Cache Read: $0.15
Model: 0.150
Completion: 3.000
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
GLM 5 Turbo zai-org/GLM-5-Turbo 202.8K 65.5K Input: $0.4891
Output: $1.9565
Cache Read: $0.24455
Model: 0.245
Completion: 4.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-04-25
GLM 4.7 FP8 zai-org/GLM-4.7-FP8 202.8K 65.5K Input: $0.2989
Output: $1.1957
Cache Read: $0.14945
Model: 0.149
Completion: 4.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-04-25
DeepSeek V3.2 TEE deepseek-ai/DeepSeek-V3.2-TEE 131.1K 65.5K Input: $0.28
Output: $0.42
Cache Read: $0.14
Model: 0.140
Completion: 1.500
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
DeepSeek V3 0324 TEE deepseek-ai/DeepSeek-V3-0324-TEE 163.8K 65.5K Input: $0.25
Output: $1
Cache Read: $0.125
Model: 0.125
Completion: 4.000
Cache: 0.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
DeepSeek V3.1 TEE deepseek-ai/DeepSeek-V3.1-TEE 163.8K 65.5K Input: $0.27
Output: $1
Cache Read: $0.135
Model: 0.135
Completion: 3.704
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B 131.1K 131.1K Input: $0.0272
Output: $0.1087
Cache Read: $0.0136
Model: 0.014
Completion: 3.996
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25
DeepSeek R1 0528 TEE deepseek-ai/DeepSeek-R1-0528-TEE 163.8K 65.5K Input: $0.45
Output: $2.15
Cache Read: $0.225
Model: 0.225
Completion: 4.778
Cache: 0.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
MiniMax M2.5 TEE MiniMaxAI/MiniMax-M2.5-TEE 196.6K 65.5K Input: $0.15
Output: $1.2
Cache Read: $0.075
Model: 0.075
Completion: 8.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
dots.ocr rednote-hilab/dots.ocr 131.1K 131.1K Input: $0.01
Output: $0.0109
Cache Read: $0.005
Model: 0.005
Completion: 1.090
Cache: 0.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-04-25

Clarifai

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/chat-completion/models/Kimi-K2_6 262.1K 262.1K Input: $0.95
Output: $4
Model: 0.475
Completion: 4.211
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
MiniMax-M2.5 High Throughput minimaxai/chat-completion/models/MiniMax-M2_5-high-throughput 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-02-25
GPT OSS 120B High Throughput openai/chat-completion/models/gpt-oss-120b-high-throughput 131.1K 16.4K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-02-25
GPT OSS 20B openai/chat-completion/models/gpt-oss-20b 131.1K 16.4K Input: $0.045
Output: $0.18
Model: 0.022
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2025-12-12
Ministral 3 14B Reasoning 2512 mistralai/completion/models/Ministral-3-14B-Reasoning-2512 262.1K 262.1K Input: $2.5
Output: $1.7
Model: 1.250
Completion: 0.680
📎 🧠 🔧 🌡️ 2025-12 In: text, image
Out: text
Open Weights
Released: 2025-12-01
Updated: 2025-12-12
Ministral 3 3B Reasoning 2512 mistralai/completion/models/Ministral-3-3B-Reasoning-2512 262.1K 262.1K Input: $1.039
Output: $0.54825
Model: 0.519
Completion: 0.528
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12
Updated: 2026-02-25
DeepSeek OCR deepseek-ai/deepseek-ocr/models/DeepSeek-OCR 8.2K 8.2K Input: $0.2
Output: $0.7
Model: 0.100
Completion: 3.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-20
Updated: 2026-02-25
Qwen3 30B A3B Thinking 2507 qwen/qwenLM/models/Qwen3-30B-A3B-Thinking-2507 262.1K 131.1K Input: $0.36
Output: $1.3
Model: 0.180
Completion: 3.611
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-31
Updated: 2026-02-25
Qwen3 30B A3B Instruct 2507 qwen/qwenLM/models/Qwen3-30B-A3B-Instruct-2507 262.1K 262.1K Input: $0.3
Output: $0.5
Model: 0.150
Completion: 1.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-30
Updated: 2026-02-25
Qwen3 Coder 30B A3B Instruct qwen/qwenCoder/models/Qwen3-Coder-30B-A3B-Instruct 262.1K 65.5K Input: $0.11458
Output: $0.74812
Model: 0.057
Completion: 6.529
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-31
Updated: 2026-02-12
Trinity Mini arcee_ai/AFM/models/trinity-mini 131.1K 131.1K Input: $0.045
Output: $0.15
Model: 0.022
Completion: 3.333
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-12
Updated: 2026-02-25
MM Poly 8B clarifai/main/models/mm-poly-8b 32.8K 4.1K Input: $0.658
Output: $1.11
Model: 0.329
Completion: 1.687
📎 🌡️ - In: text, image, video
Out: text
Released: 2025-06
Updated: 2026-02-25

Claudinio

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claudinio claudinio 256K 64K Input: $0.5
Output: $2
Cache Read: $0.15
Model: 0.250
Completion: 4.000
Cache: 0.300
📎 🧠 🔧 2026-05 In: text, image, audio, video
Out: text
Released: 2026-05-12
Updated: 2026-06-02

CloudFerro Sherlock

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct 70K 70K Input: $2.92
Output: $2.92
Model: 1.460
Completion: 1.000
🔧 🌡️ 2024-10-09 In: text
Out: text
Open Weights
Released: 2024-12-06
OpenAI GPT OSS 120B openai/gpt-oss-120b 131K 131K Input: $2.92
Output: $2.92
Model: 1.460
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-28
Bielik 11B v3.0 Instruct speakleash/Bielik-11B-v3.0-Instruct 32K 32K Input: $0.67
Output: $0.67
Model: 0.335
Completion: 1.000
🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-13
Bielik 11B v2.6 Instruct speakleash/Bielik-11B-v2.6-Instruct 32K 32K Input: $0.67
Output: $0.67
Model: 0.335
Completion: 1.000
🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-13
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 196K 16K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Open Weights
Released: 2026-03-05

Cloudflare AI Gateway

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
BGE M3 workers-ai/cf/baai/bge-m3 128K 16.4K Input: $0.012
Output: $0
Model: 0.006 🌡️ - In: text
Out: text
Released: 2025-04-03
BGE Small EN v1.5 workers-ai/cf/baai/bge-small-en-v1.5 128K 16.4K Input: $0.02
Output: $0
Model: 0.010 🌡️ - In: text
Out: text
Released: 2025-04-03
BGE Reranker Base workers-ai/cf/baai/bge-reranker-base 128K 16.4K Input: $0.0031
Output: $0
Model: 0.002 🌡️ - In: text
Out: text
Released: 2025-04-09
BGE Base EN v1.5 workers-ai/cf/baai/bge-base-en-v1.5 128K 16.4K Input: $0.067
Output: $0
Model: 0.034 🌡️ - In: text
Out: text
Released: 2025-04-03
BGE Large EN v1.5 workers-ai/cf/baai/bge-large-en-v1.5 128K 16.4K Input: $0.2
Output: $0
Model: 0.100 🌡️ - In: text
Out: text
Released: 2025-04-03
IndicTrans2 EN-Indic 1B workers-ai/cf/ai4bharat/indictrans2-en-indic-1B 128K 16.4K Input: $0.34
Output: $0.34
Model: 0.170
Completion: 1.000
🌡️ - In: text
Out: text
Released: 2025-09-25
IBM Granite 4.0 H Micro workers-ai/cf/ibm-granite/granite-4.0-h-micro 128K 16.4K Input: $0.017
Output: $0.11
Model: 0.009
Completion: 6.471
🌡️ - In: text
Out: text
Released: 2025-10-15
DistilBERT SST-2 INT8 workers-ai/cf/huggingface/distilbert-sst-2-int8 128K 16.4K Input: $0.026
Output: $0
Model: 0.013 🌡️ - In: text
Out: text
Released: 2025-04-03
Kimi K2.5 workers-ai/cf/moonshotai/kimi-k2.5 256K 256K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01-27
Kimi K2.6 workers-ai/cf/moonshotai/kimi-k2.6 256K 256K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-20
Mistral 7B Instruct v0.1 workers-ai/cf/mistral/mistral-7b-instruct-v0.1 128K 16.4K Input: $0.11
Output: $0.19
Model: 0.055
Completion: 1.727
🌡️ - In: text
Out: text
Released: 2025-04-03
Gemma 3 12B IT workers-ai/cf/google/gemma-3-12b-it 128K 16.4K Input: $0.35
Output: $0.56
Model: 0.175
Completion: 1.600
🌡️ - In: text
Out: text
Released: 2025-04-11
MyShell MeloTTS workers-ai/cf/myshell-ai/melotts 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-11-14
GPT OSS 120B workers-ai/cf/openai/gpt-oss-120b 128K 16.4K Input: $0.35
Output: $0.75
Model: 0.175
Completion: 2.143
🌡️ - In: text
Out: text
Released: 2025-08-05
GPT OSS 20B workers-ai/cf/openai/gpt-oss-20b 128K 16.4K Input: $0.2
Output: $0.3
Model: 0.100
Completion: 1.500
🌡️ - In: text
Out: text
Released: 2025-08-05
Mistral Small 3.1 24B Instruct workers-ai/cf/mistralai/mistral-small-3.1-24b-instruct 128K 16.4K Input: $0.35
Output: $0.56
Model: 0.175
Completion: 1.600
🌡️ - In: text
Out: text
Released: 2025-04-11
Nemotron 3 Super 120B workers-ai/cf/nvidia/nemotron-3-120b-a12b 256K 256K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
PLaMo Embedding 1B workers-ai/cf/pfnet/plamo-embedding-1b 128K 16.4K Input: $0.019
Output: $0
Model: 0.009 🌡️ - In: text
Out: text
Released: 2025-09-25
Deepgram Aura 2 (ES) workers-ai/cf/deepgram/aura-2-es 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-11-14
Deepgram Aura 2 (EN) workers-ai/cf/deepgram/aura-2-en 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-11-14
Deepgram Nova 3 workers-ai/cf/deepgram/nova-3 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-11-14
GLM-4.7-Flash workers-ai/cf/zai-org/glm-4.7-flash 131.1K 131.1K Input: $0.06
Output: $0.4
Model: 0.030
Completion: 6.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
DeepSeek R1 Distill Qwen 32B workers-ai/cf/deepseek-ai/deepseek-r1-distill-qwen-32b 128K 16.4K Input: $0.5
Output: $4.88
Model: 0.250
Completion: 9.760
🌡️ - In: text
Out: text
Released: 2025-04-03
Qwen3 Embedding 0.6B workers-ai/cf/qwen/qwen3-embedding-0.6b 128K 16.4K Input: $0.012
Output: $0
Model: 0.006 🌡️ - In: text
Out: text
Released: 2025-11-14
Qwen3 30B A3B FP8 workers-ai/cf/qwen/qwen3-30b-a3b-fp8 128K 16.4K Input: $0.051
Output: $0.34
Model: 0.025
Completion: 6.667
🌡️ - In: text
Out: text
Released: 2025-11-14
Qwen 2.5 Coder 32B Instruct workers-ai/cf/qwen/qwen2.5-coder-32b-instruct 128K 16.4K Input: $0.66
Output: $1
Model: 0.330
Completion: 1.515
🌡️ - In: text
Out: text
Released: 2025-04-11
QwQ 32B workers-ai/cf/qwen/qwq-32b 128K 16.4K Input: $0.66
Output: $1
Model: 0.330
Completion: 1.515
🌡️ - In: text
Out: text
Released: 2025-04-11
Pipecat Smart Turn v2 workers-ai/cf/pipecat-ai/smart-turn-v2 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-11-14
Llama 3.1 8B Instruct workers-ai/cf/meta/llama-3.1-8b-instruct 128K 16.4K Input: $0.28
Output: $0.83
Model: 0.140
Completion: 2.964
🌡️ - In: text
Out: text
Released: 2025-04-03
M2M100 1.2B workers-ai/cf/meta/m2m100-1.2b 128K 16.4K Input: $0.34
Output: $0.34
Model: 0.170
Completion: 1.000
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.2 1B Instruct workers-ai/cf/meta/llama-3.2-1b-instruct 128K 16.4K Input: $0.027
Output: $0.2
Model: 0.013
Completion: 7.407
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.2 11B Vision Instruct workers-ai/cf/meta/llama-3.2-11b-vision-instruct 128K 16.4K Input: $0.049
Output: $0.68
Model: 0.025
Completion: 13.878
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 4 Scout 17B 16E Instruct workers-ai/cf/meta/llama-4-scout-17b-16e-instruct 128K 16.4K Input: $0.27
Output: $0.85
Model: 0.135
Completion: 3.148
🌡️ - In: text
Out: text
Released: 2025-04-16
Llama Guard 3 8B workers-ai/cf/meta/llama-guard-3-8b 128K 16.4K Input: $0.48
Output: $0.03
Model: 0.240
Completion: 0.063
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3 8B Instruct AWQ workers-ai/cf/meta/llama-3-8b-instruct-awq 128K 16.4K Input: $0.12
Output: $0.27
Model: 0.060
Completion: 2.250
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.1 8B Instruct AWQ workers-ai/cf/meta/llama-3.1-8b-instruct-awq 128K 16.4K Input: $0.12
Output: $0.27
Model: 0.060
Completion: 2.250
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.3 70B Instruct FP8 Fast workers-ai/cf/meta/llama-3.3-70b-instruct-fp8-fast 128K 16.4K Input: $0.29
Output: $2.25
Model: 0.145
Completion: 7.759
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3 8B Instruct workers-ai/cf/meta/llama-3-8b-instruct 128K 16.4K Input: $0.28
Output: $0.83
Model: 0.140
Completion: 2.964
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.1 8B Instruct FP8 workers-ai/cf/meta/llama-3.1-8b-instruct-fp8 128K 16.4K Input: $0.15
Output: $0.29
Model: 0.075
Completion: 1.933
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 2 7B Chat FP16 workers-ai/cf/meta/llama-2-7b-chat-fp16 128K 16.4K Input: $0.56
Output: $6.67
Model: 0.280
Completion: 11.911
🌡️ - In: text
Out: text
Released: 2025-04-03
Llama 3.2 3B Instruct workers-ai/cf/meta/llama-3.2-3b-instruct 128K 16.4K Input: $0.051
Output: $0.34
Model: 0.025
Completion: 6.667
🌡️ - In: text
Out: text
Released: 2025-04-03
BART Large CNN workers-ai/cf/facebook/bart-large-cnn 128K 16.4K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2025-04-09
Gemma SEA-LION v4 27B IT workers-ai/cf/aisingapore/gemma-sea-lion-v4-27b-it 128K 16.4K Input: $0.35
Output: $0.56
Model: 0.175
Completion: 1.600
🌡️ - In: text
Out: text
Released: 2025-09-25
o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-3.5-turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Cache Read: $1.25
Model: 0.250
Completion: 3.000
Cache: 2.500
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4 openai/gpt-4 8.2K 8.2K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o3-pro openai/o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-06-10
GPT-5.1 Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
o1 openai/o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-5.2 Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Claude Haiku 3.5 (latest) anthropic/claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Sonnet 3.5 v2 anthropic/claude-3.5-sonnet 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Sonnet 4 (latest) anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.5 (latest) anthropic/claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 anthropic/claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Sonnet 3 anthropic/claude-3-sonnet 200K 4.1K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-04
Claude Opus 4.8 anthropic/claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 3 anthropic/claude-3-opus 200K 4.1K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-02-29
Claude Haiku 3.5 (latest) anthropic/claude-3-5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Opus 4.1 (latest) anthropic/claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Fable 5 anthropic/claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Haiku 4.5 (latest) anthropic/claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 (latest) anthropic/claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Haiku 3 anthropic/claude-3-haiku 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-13
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Claude Opus 4 (latest) anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
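
The NewAPI Ratios column in the tables above appears to follow directly from the per-1M prices: in most rows Model equals the input price divided by 2, Completion equals the output price divided by the input price, and Cache equals the cache-read price divided by the input price. The sketch below reproduces that arithmetic. The $2 base price and the function name `newapi_ratios` are assumptions inferred from the listed rows, not taken from any official specification. Rows that also list an Input Audio price do not always fit this pattern, and the last rounded digit can differ from the table.

```python
from typing import Optional

# Assumption inferred from the listed rows: a Model ratio of 1.0
# corresponds to $2 per 1M input tokens.
BASE_INPUT_PRICE_PER_1M = 2.0


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive Model / Completion / Cache ratios from per-1M-token USD prices."""
    if input_price <= 0:
        # Free models are listed with "-" instead of ratios.
        return {}
    ratios = {
        # Input price relative to the assumed $2 base.
        "model": round(input_price / BASE_INPUT_PRICE_PER_1M, 3),
        # Output price relative to the input price.
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        # Cache-read price relative to the input price.
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# openai/o3 row: Input $2, Output $8, Cache Read $0.5
print(newapi_ratios(2.0, 8.0, 0.5))
```

For the `openai/o3` row this yields a Model ratio of 1.0, a Completion ratio of 4.0, and a Cache ratio of 0.25, matching the listed values.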

Cloudflare Workers AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Granite 4.0 H Micro cf/ibm-granite/granite-4.0-h-micro 131K 131K Input: $0.017
Output: $0.112
Model: 0.009
Completion: 6.588
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-07
Kimi K2.7 Code cf/moonshotai/kimi-k2.7-code 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-06-12
Kimi K2.6 cf/moonshotai/kimi-k2.6 262.1K 256K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Gemma 4 26B A4B IT cf/google/gemma-4-26b-a4b-it 256K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
GPT OSS 120B cf/openai/gpt-oss-120b 128K 16.4K Input: $0.35
Output: $0.75
Model: 0.175
Completion: 2.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 20B cf/openai/gpt-oss-20b 128K 16.4K Input: $0.2
Output: $0.3
Model: 0.100
Completion: 1.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Mistral Small 3.1 24B Instruct cf/mistralai/mistral-small-3.1-24b-instruct 128K 128K Input: $0.351
Output: $0.555
Model: 0.175
Completion: 1.581
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-18
Nemotron 3 Super 120B cf/nvidia/nemotron-3-120b-a12b 256K 256K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
GLM-4.7-Flash cf/zai-org/glm-4.7-flash 131.1K 131.1K Input: $0.0605
Output: $0.4
Model: 0.030
Completion: 6.612
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
DeepSeek R1 Distill Qwen 32B cf/deepseek-ai/deepseek-r1-distill-qwen-32b 80K 80K Input: $0.497
Output: $4.881
Model: 0.248
Completion: 9.821
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
Qwen3 30B A3B FP8 cf/qwen/qwen3-30b-a3b-fp8 32.8K 32.8K Input: $0.0509
Output: $0.335
Model: 0.025
Completion: 6.582
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-30
Qwen2.5 Coder 32B Instruct cf/qwen/qwen2.5-coder-32b-instruct 32.8K 32.8K Input: $0.66
Output: $1
Model: 0.330
Completion: 1.515
🌡️ - In: text
Out: text
Open Weights
Released: 2025-02-27
QwQ 32B cf/qwen/qwq-32b 24K 24K Input: $0.66
Output: $1
Model: 0.330
Completion: 1.515
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-05
Llama 3.2 1B Instruct cf/meta/llama-3.2-1b-instruct 60K 60K Input: $0.027
Output: $0.201
Model: 0.013
Completion: 7.444
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-25
Llama 3.2 11B Vision Instruct cf/meta/llama-3.2-11b-vision-instruct 128K 128K Input: $0.0485
Output: $0.676
Model: 0.024
Completion: 13.938
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 4 Scout 17B 16E Instruct cf/meta/llama-4-scout-17b-16e-instruct 131K 16.4K Input: $0.27
Output: $0.85
Model: 0.135
Completion: 3.148
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama Guard 3 8B cf/meta/llama-guard-3-8b 131.1K 131.1K Input: $0.484
Output: $0.03
Model: 0.242
Completion: 0.062
🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-22
Llama 3.3 70B Instruct FP8 Fast cf/meta/llama-3.3-70b-instruct-fp8-fast 24K 24K Input: $0.293
Output: $2.253
Model: 0.146
Completion: 7.689
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 3.1 8B Instruct FP8 cf/meta/llama-3.1-8b-instruct-fp8 32K 32K Input: $0.152
Output: $0.287
Model: 0.076
Completion: 1.888
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
Llama 3.2 3B Instruct cf/meta/llama-3.2-3b-instruct 80K 80K Input: $0.0509
Output: $0.335
Model: 0.025
Completion: 6.582
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-25
Gemma SEA-LION v4 27B IT cf/aisingapore/gemma-sea-lion-v4-27b-it 128K 128K Input: $0.351
Output: $0.555
Model: 0.175
Completion: 1.581
🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-23

Cohere

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Aya Expanse 32B c4ai-aya-expanse-32b 128K 4K - - 🌡️ - In: text
Out: text
Open Weights
Released: 2024-10-24
Command A command-a-03-2025 256K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-03-13
Aya Vision 32B c4ai-aya-vision-32b 16K 4K - - 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-04
Updated: 2025-05-14
Command R7B Arabic command-r7b-arabic-02-2025 128K 4K Input: $0.0375
Output: $0.15
Model: 0.019
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-02-27
Aya Vision 8B c4ai-aya-vision-8b 16K 4K - - 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-04
Updated: 2025-05-14
Command R command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Command R7B command-r7b-12-2024 128K 4K Input: $0.0375
Output: $0.15
Model: 0.019
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-12-02
Command A Vision command-a-vision-07-2025 128K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ 2024-06-01 In: text, image
Out: text
Open Weights
Released: 2025-07-31
Command A Plus command-a-plus-05-2026 128K 64K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-04-01 In: text, image
Out: text
Open Weights
Released: 2026-05-20
Updated: 2026-06-09
Command A Translate command-a-translate-08-2025 8K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-08-28
Command R+ command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Command A Reasoning command-a-reasoning-08-2025 256K 32K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🧠 🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-08-21
North Mini Code north-mini-code-1-0 256K 64K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-09-23 In: text
Out: text
Open Weights
Released: 2026-06-09
Aya Expanse 8B c4ai-aya-expanse-8b 8K 4K - - 🌡️ - In: text
Out: text
Open Weights
Released: 2024-10-24

Cortecs

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek R1 0528 deepseek-r1-0528 164K 164K Input: $0.585
Output: $2.307
Model: 0.292
Completion: 3.944
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.133
Output: $0.266
Cache Read: $0.0028
Model: 0.067
Completion: 2.000
Cache: 0.021
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.5 minimax-m2.5 196.6K 196.6K Input: $0.32
Output: $1.18
Model: 0.160
Completion: 3.687
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
DeepSeek V3 0324 deepseek-v3-0324 128K 128K Input: $0.551
Output: $1.654
Model: 0.276
Completion: 3.002
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-03-24
Claude Opus 4.7 claude-opus4-7 1M 128K Input: $5.6
Output: $27.99
Cache Read: $0.56
Cache Write: $6.99
Model: 2.800
Completion: 4.998
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
GLM 4.7 glm-4.7 198K 198K Input: $0.45
Output: $2.23
Model: 0.225
Completion: 4.956
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Qwen3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 131K 131K Input: $0.062
Output: $0.408
Model: 0.031
Completion: 6.581
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct 262K 262K Input: $0.053
Output: $0.222
Model: 0.026
Completion: 4.189
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-31
MiniMax-M2.1 minimax-m2.1 196K 196K Input: $0.34
Output: $1.34
Model: 0.170
Completion: 3.941
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
Qwen3 32B qwen3-32b 16.4K 16.4K Input: $0.099
Output: $0.33
Model: 0.050
Completion: 3.333
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-04-29
Claude Sonnet 4.6 claude-4-6-sonnet 1M 1M Input: $3.59
Output: $17.92
Model: 1.795
Completion: 4.992
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Sonnet 4 claude-sonnet-4 200K 64K Input: $3.307
Output: $16.536
Model: 1.653
Completion: 5.000
🔧 🌡️ 2025-03 In: text, image, pdf
Out: text
Released: 2025-05-22
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.654
Output: $11.024
Model: 0.827
Completion: 6.665
🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-03-20
Updated: 2025-06-17
Claude 4.5 Sonnet claude-4-5-sonnet 200K 200K Input: $3.259
Output: $16.296
Model: 1.629
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Mixtral 8x7B Instruct v0.1 mixtral-8x7B-instruct-v0.1 32K 32K Input: $0.438
Output: $0.68
Model: 0.219
Completion: 1.553
🧠 🌡️ 2023-09 In: text
Out: text
Open Weights
Released: 2023-12-11
GLM 4.5 glm-4.5 131.1K 131.1K Input: $0.67
Output: $2.46
Model: 0.335
Completion: 3.672
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
Kimi K2 Instruct kimi-k2-instruct 131K 131K Input: $0.551
Output: $2.646
Model: 0.276
Completion: 4.802
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-07-11
Updated: 2025-09-05
Llama 3.3 70B Instruct llama-3.3-70b-instruct 131K 131K Input: $0.089
Output: $0.275
Model: 0.044
Completion: 3.090
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Claude Opus 4.8 claude-opus4-8 1M 128K Input: $5.64
Output: $28.198
Cache Read: $0.563
Cache Write: $7.049
Model: 2.820
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Devstral Small 2 2512 devstral-small-2512 262K 262K Input: $0
Output: $0
- 🔧 🌡️ 2025-12 In: text, image
Out: text
Open Weights
Released: 2025-12-09
GLM-5.1 glm-5.1 204.8K 131.1K Input: $1.31
Output: $4.1
Cache Read: $0.24
Model: 0.655
Completion: 3.130
Cache: 0.183
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-14
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $1.553
Output: $3.106
Cache Read: $0.003625
Model: 0.776
Completion: 2.000
Cache: 0.002
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3 Next 80B A3B Thinking qwen3-next-80b-a3b-thinking 128K 128K Input: $0.164
Output: $1.311
Model: 0.082
Completion: 7.994
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Kimi K2 Thinking kimi-k2-thinking 262K 262K Input: $0.656
Output: $2.731
Model: 0.328
Completion: 4.163
📎 🧠 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-08
MiniMax-M2 minimax-m2 400K 400K Input: $0.39
Output: $1.57
Model: 0.195
Completion: 4.026
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.7 minimax-m2.7 202.8K 196.1K Input: $0.47
Output: $1.4
Model: 0.235
Completion: 2.979
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Kimi K2.5 kimi-k2.5 256K 256K Input: $0.55
Output: $2.76
Model: 0.275
Completion: 5.018
🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
GPT OSS 120B gpt-oss-120b 128K 128K Input: $0
Output: $0
- 🔧 🌡️ 2024-01 In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen2.5 72B Instruct qwen-2.5-72b-instruct 33K 33K Input: $0.062
Output: $0.231
Model: 0.031
Completion: 3.726
🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2024-09-19
Hermes 4 70B hermes-4-70b 128K 128K Input: $0.116
Output: $0.358
Model: 0.058
Completion: 3.086
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-08-26
Claude Haiku 4.5 claude-haiku-4-5 200K 200K Input: $1.09
Output: $5.43
Model: 0.545
Completion: 4.982
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
GPT-5.4 gpt-5.4 1.1M 128K Input: $3
Output: $16.13
Cache Read: $0.25
Model: 1.500
Completion: 5.377
Cache: 0.083
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Kimi K2.6 kimi-k2.6 256K 256K Input: $0.81
Output: $3.54
Cache Read: $0.2
Model: 0.405
Completion: 4.370
Cache: 0.247
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-17
GPT 4.1 gpt-4.1 1M 32.8K Input: $2.354
Output: $9.417
Model: 1.177
Completion: 4.000
🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-14
Nova Pro 1.0 nova-pro-v1 300K 5K Input: $1.016
Output: $4.061
Model: 0.508
Completion: 3.997
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-12-03
Qwen3 Coder Next 80B qwen3-coder-next 256K 65.5K Input: $0.158
Output: $0.84
Model: 0.079
Completion: 5.316
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-02-04
Claude Opus 4.5 claude-opus4-5 200K 200K Input: $5.98
Output: $29.89
Model: 2.990
Completion: 4.998
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct 262K 262K Input: $0.441
Output: $1.984
Model: 0.221
Completion: 4.499
🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-07-25
Devstral 2 2512 devstral-2512 262K 262K Input: $0
Output: $0
- 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Qwen3.5 397B A17B qwen3.5-397b-a17b 250K 250K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Open Weights
Released: 2026-02-16
GLM 4.5 Air glm-4.5-air 131.1K 131.1K Input: $0.22
Output: $1.34
Model: 0.110
Completion: 6.091
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-08-01
GLM-4.7-Flash glm-4.7-flash 203K 203K Input: $0.09
Output: $0.53
Model: 0.045
Completion: 5.889
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-08-08
INTELLECT 3 intellect-3 128K 128K Input: $0.219
Output: $1.202
Model: 0.110
Completion: 5.489
📎 🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2025-11-26
Llama 3.1 405B Instruct llama-3.1-405b-instruct 128K 128K Input: $0
Output: $0
- 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
GLM 5 glm-5 202.8K 202.8K Input: $1.08
Output: $3.44
Model: 0.540
Completion: 3.185
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
Mistral Large 3 2512 mistral-large-2512 256K 256K Input: $0.5
Output: $1.5
Cache Read: $0.05
Model: 0.250
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2025-12 In: text, image
Out: text
Open Weights
Released: 2025-12-01
DeepSeek V3.2 deepseek-v3.2 163.8K 163.8K Input: $0.266
Output: $0.444
Model: 0.133
Completion: 1.669
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
Codestral 2508 codestral-2508 256K 256K Input: $0.3
Output: $0.9
Cache Read: $0.03
Model: 0.150
Completion: 3.000
Cache: 0.100
🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3.5 122B A10B qwen3.5-122b-a10b 262.1K 262.1K Input: $0.444
Output: $3.106
Model: 0.222
Completion: 6.995
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Open Weights
Released: 2026-02-24
Nemotron 3 Super 120B A12B nemotron-3-super-120b-a12b 262.1K 262.1K Input: $0.266
Output: $0.799
Model: 0.133
Completion: 3.004
🧠 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-03-11
Claude Opus 4.6 claude-opus4-6 1M 1M Input: $5.98
Output: $29.89
Model: 2.990
Completion: 4.998
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13

CrofAI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 131.1K Input: $0.12
Output: $0.21
Cache Read: $0.003
Model: 0.060
Completion: 1.750
Cache: 0.025
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.5 minimax-m2.5 204.8K 131.1K Input: $0.11
Output: $0.95
Cache Read: $0.02
Cache Write: $0.375
Model: 0.055
Completion: 8.636
Cache: 0.182
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
GLM-4.7 glm-4.7 202.8K 202.8K Input: $0.25
Output: $1.1
Cache Read: $0.05
Cache Write: $0
Model: 0.125
Completion: 4.400
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Greg 1 Super greg-1-super 229.4K 229.4K Input: $1
Output: $5
Cache Read: $0.2
Model: 0.500
Completion: 5.000
Cache: 0.200
🌡️ - In: text
Out: text
Released: 2026-01-27
DeepSeek V4 Pro deepseek-v4-pro-lightning 1M 131.1K Input: $0.8
Output: $1.6
Cache Read: $0.02
Model: 0.400
Completion: 2.000
Cache: 0.025
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Greg (Roleplay) greg-rp 229.4K 229.4K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🌡️ - In: text
Out: text
Released: 2026-01-27
Gemma 4 31B IT gemma-4-31b-it 262.1K 262.1K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Greg 1 Normal greg-1 229.4K 229.4K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🌡️ - In: text
Out: text
Released: 2026-01-27
GLM-5.1 glm-5.1 202.8K 202.8K Input: $0.45
Output: $2.15
Cache Read: $0.08
Cache Write: $0
Model: 0.225
Completion: 4.778
Cache: 0.178
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro deepseek-v4-pro 1M 131.1K Input: $0.35
Output: $0.8
Cache Read: $0.003
Model: 0.175
Completion: 2.286
Cache: 0.009
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Kimi K2.5 (Lightning) kimi-k2.5-lightning 131.1K 32.8K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 - In: text, image, video
Out: text
Open Weights
Released: 2026-02-06
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.35
Output: $1.7
Cache Read: $0.07
Model: 0.175
Completion: 4.857
Cache: 0.200
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.5
Output: $1.99
Cache Read: $0.05
Model: 0.250
Completion: 3.980
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Qwen3.6 27B qwen3.6-27b 262.1K 262.1K Input: $0.2
Output: $1.5
Cache Read: $0.04
Model: 0.100
Completion: 7.500
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-22
Qwen3.5 9B qwen3.5-9b 262.1K 262.1K Input: $0.04
Output: $0.15
Cache Read: $0.008
Model: 0.020
Completion: 3.750
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-03-13
Qwen3.5 397B-A17B qwen3.5-397b-a17b 262.1K 262.1K Input: $0.35
Output: $1.75
Cache Read: $0.07
Model: 0.175
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-15
GLM-4.7-Flash glm-4.7-flash 202.8K 131.1K Input: $0.04
Output: $0.3
Cache Read: $0.008
Cache Write: $0
Model: 0.020
Completion: 7.500
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $0.4
Output: $0.8
Cache Read: $0.003
Model: 0.200
Completion: 2.000
Cache: 0.007
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
GLM-5 glm-5 202.8K 202.8K Input: $0.48
Output: $1.9
Cache Read: $0.1
Cache Write: $0
Model: 0.240
Completion: 3.958
Cache: 0.208
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
DeepSeek V3.2 deepseek-v3.2 163.8K 163.8K Input: $0.18
Output: $0.35
Cache Read: $0.04
Model: 0.090
Completion: 1.944
Cache: 0.222
🔧 🌡️ - In: text
Out: text
Released: 2025-07-22
Greg 1 Mini greg-1-mini 229.4K 229.4K Input: $0.07
Output: $0.15
Cache Read: $0.01
Model: 0.035
Completion: 2.143
Cache: 0.143
🌡️ - In: text
Out: text
Released: 2026-01-27

Databricks

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Opus 4.7 databricks-claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
GPT-5.4 databricks-gpt-5-4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Gemini 3 Flash Preview databricks-gemini-3-flash 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Claude Opus 4.5 (latest) databricks-claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
GPT-5 Nano databricks-gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Mini databricks-gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 databricks-gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Gemini 2.5 Pro databricks-gemini-2-5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.1 Pro Preview Custom Tools databricks-gemini-3-1-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini 2.5 Flash databricks-gemini-2-5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Claude Sonnet 4.5 databricks-claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Haiku 4.5 (latest) databricks-claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
GPT-5.4 nano databricks-gpt-5-4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Claude Opus 4.6 databricks-claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GPT-5.1 databricks-gpt-5-1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Claude Sonnet 4.5 (latest) databricks-claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.1 (latest) databricks-claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
GPT-5.5 databricks-gpt-5-5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Gemini 3.1 Flash Lite Preview databricks-gemini-3-1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
Gemini 3 Pro Preview databricks-gemini-3-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Claude Sonnet 4.6 databricks-claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GPT OSS 20B databricks-gpt-oss-20b 131.1K 32.8K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B databricks-gpt-oss-120b 131.1K 32.8K Input: $0.072
Output: $0.28
Model: 0.036
Completion: 3.889
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.2 databricks-gpt-5-2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.4 mini databricks-gpt-5-4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17

Deep Infra

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 4 Maverick 17B FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 1M 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
- - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 4 Scout 17B meta-llama/Llama-4-Scout-17B-16E-Instruct 327.7K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 3.3 70B Turbo meta-llama/Llama-3.3-70B-Instruct-Turbo 131.1K 16.4K Input: $0.1
Output: $0.32
Model: 0.050
Completion: 3.200
🔧 - In: text
Out: text
Open Weights
Released: 2024-12-06
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 16.4K Input: $0.75
Output: $3.5
Cache Read: $0.15
Model: 0.375
Completion: 4.667
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-04 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 32.8K Input: $0.45
Output: $2.25
Cache Read: $0.07
Model: 0.225
Completion: 5.000
Cache: 0.156
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Gemma 4 31B IT google/gemma-4-31B-it 262.1K 32.8K Input: $0.13
Output: $0.38
Model: 0.065
Completion: 2.923
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemma 4 26B A4B IT google/gemma-4-26B-A4B-it 262.1K 32.8K Input: $0.07
Output: $0.34
Model: 0.035
Completion: 4.857
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Qwen3.6 35B A3B Qwen/Qwen3.6-35B-A3B 262.1K 81.9K Input: $0.15
Output: $0.95
Model: 0.075
Completion: 6.333
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-01
Qwen 3.5 397B A17B Qwen/Qwen3.5-397B-A17B 262.1K 81.9K Input: $0.45
Output: $3
Cache Read: $0.22
Model: 0.225
Completion: 6.667
Cache: 0.489
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-02-01
Updated: 2026-04-20
Qwen3 Coder 480B A35B Instruct Turbo Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo 262.1K 66.5K Input: $0.3
Output: $1
Model: 0.150
Completion: 3.333
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen 3.5 35B A3B Qwen/Qwen3.5-35B-A3B 262.1K 81.9K Input: $0.14
Output: $1
Cache Read: $0.05
Model: 0.070
Completion: 7.143
Cache: 0.357
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-02-01
Updated: 2026-04-20
GPT OSS 120B openai/gpt-oss-120b 131.1K 16.4K Input: $0.039
Output: $0.19
Model: 0.019
Completion: 4.872
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 20B openai/gpt-oss-20b 131.1K 16.4K Input: $0.03
Output: $0.14
Model: 0.015
Completion: 4.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
MiMo-V2.5 XiaomiMiMo/MiMo-V2.5 262.1K 16.4K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2.5-Pro XiaomiMiMo/MiMo-V2.5-Pro 1M 16.4K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
GLM-4.7-Flash zai-org/GLM-4.7-Flash 202.8K 16.4K Input: $0.06
Output: $0.4
Model: 0.030
Completion: 6.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-4.6 zai-org/GLM-4.6 202.8K 131.1K Input: $0.43
Output: $1.74
Cache Read: $0.08
Model: 0.215
Completion: 4.047
Cache: 0.186
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-5 zai-org/GLM-5 202.8K 16.4K Input: $0.6
Output: $2.08
Cache Read: $0.12
Model: 0.300
Completion: 3.467
Cache: 0.200
🧠 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-02-12
GLM-4.7 zai-org/GLM-4.7 202.8K 16.4K Input: $0.4
Output: $1.75
Cache Read: $0.08
Model: 0.200
Completion: 4.375
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-5.1 zai-org/GLM-5.1 202.8K 16.4K Input: $1.05
Output: $3.5
Cache Read: $0.205
Model: 0.525
Completion: 3.333
Cache: 0.195
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-04-07
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 163.8K 64K Input: $0.5
Output: $2.15
Cache Read: $0.35
Model: 0.250
Completion: 4.300
Cache: 0.700
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-05-28
DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash 1M 16.4K Input: $0.1
Output: $0.2
Cache Read: $0.02
Model: 0.050
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 1M 16.4K Input: $1.3
Output: $2.6
Cache Read: $0.1
Model: 0.650
Completion: 2.000
Cache: 0.077
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 163.8K 64K Input: $0.26
Output: $0.38
Cache Read: $0.13
Model: 0.130
Completion: 1.462
Cache: 0.500
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-12-02
MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 131.1K Input: $0.15
Output: $1.15
Cache Read: $0.03
Cache Write: $0.375
Model: 0.075
Completion: 7.667
Cache: 0.200
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-02-12
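
The NewAPI Ratios in the rows above appear to follow directly from the per-1M prices: Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price. This relationship is inferred from the listed values, not taken from any provider documentation. The function name `newapi_ratios` and the `Ratios` type are hypothetical. A minimal sketch under those assumptions:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Ratios:
    model: float
    completion: float
    cache: Optional[float]  # None when the row lists no Cache Read price


def newapi_ratios(input_usd: float, output_usd: float,
                  cache_read_usd: Optional[float] = None) -> Ratios:
    """Derive the three ratios from per-1M-token USD prices.

    Assumption (inferred from the table, not documented here):
    a model ratio of 1.0 corresponds to $2 per 1M input tokens.
    """
    if input_usd <= 0:
        raise ValueError("input price must be positive to form ratios")
    model = round(input_usd / 2, 3)
    completion = round(output_usd / input_usd, 3)
    cache = None
    if cache_read_usd is not None:
        cache = round(cache_read_usd / input_usd, 3)
    return Ratios(model=model, completion=completion, cache=cache)


if __name__ == "__main__":
    # Kimi K2.6 row above: Input $0.75, Output $3.5, Cache Read $0.15
    # (the table lists Model 0.375, Completion 4.667, Cache 0.200)
    print(newapi_ratios(0.75, 3.5, 0.15))
    # Llama 4 Scout row above: no cache pricing listed
    print(newapi_ratios(0.1, 0.3))
```

Rows that list a separate audio input price may not fit this pattern, so treat the sketch as a consistency check for text-priced rows only.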

DeepSeek

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek Reasoner deepseek-reasoner 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28
DeepSeek Chat deepseek-chat 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28

DigitalOcean

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 4.5 anthropic-claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 1.000
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
GPT Image 1 openai-gpt-image-1 - - Input: $5
Output: $40
Cache Read: $1.25
Model: 2.500
Completion: 8.000
Cache: 0.250
📎 - In: text, image
Out: image
Released: 2025-04-24
E5 Large v2 e5-large-v2 512 1K Input: $0.02
Output: $0
Model: 0.010 - - In: text
Out: text
Open Weights
Released: 2023-05-19
Updated: 2026-04-30
BGE M3 bge-m3 8.2K 1K Input: $0.02
Output: $0
Model: 0.010 - - In: text
Out: text
Open Weights
Released: 2024-01-30
Updated: 2026-04-30
Ministral 3 14B Instruct mistral-3-14B 262.1K 128K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-15
Updated: 2026-04-30
Nemotron 3 Ultra nemotron-3-ultra-550b 131.1K 8.2K - - 🔧 🌡️ - In: text
Out: text
Released: 2026-06-04
Updated: 2026-06-12
MiniMax M2.5 minimax-m2.5 204.8K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-04-16
GPT-5.4 nano openai-gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
DeepSeek V3 deepseek-v3 163.8K 131.1K - - 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-12-26
Updated: 2025-03-24
GPT Image 2 openai-gpt-image-2 - - - - 📎 - In: text, image
Out: image
Released: 2025-04-24
GPT-5.2 openai-gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 131.1K 32.8K Input: $0.99
Output: $0.99
Model: 0.495
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-30
Qwen3 Embedding 0.6B qwen3-embedding-0.6b 8K 1K Input: $0.04
Output: $0
Model: 0.020 - - In: text
Out: text
Open Weights
Released: 2025-06-03
Updated: 2026-04-16
Gemma 4 31B gemma-4-31B-it 256K 8.2K Input: $0.18
Output: $0.5
Model: 0.090
Completion: 2.778
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-04-30
Llama 4 Maverick 17B 128E Instruct llama-4-maverick 1M 16.4K Input: $0.25
Output: $0.87
Model: 0.125
Completion: 3.480
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Updated: 2026-04-30
Claude 3.7 Sonnet anthropic-claude-3.7-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-11 In: text, image
Out: text
Released: 2025-02-24
GPT-4o mini openai-gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
Claude Opus 4.7 anthropic-claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
DeepSeek V4 Flash deepseek-4-flash 262.1K 8.2K - - 🔧 🌡️ - In: text
Out: text
Released: 2026-05-27
Updated: 2026-05-29
Claude Haiku 4.5 anthropic-claude-4.5-haiku 200K 64K Input: $1
Output: $5
Cache Read: $1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 1.000
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image
Out: text
Released: 2025-10-15
Claude Sonnet 4.6 anthropic-claude-4.6-sonnet 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Sonnet 4 anthropic-claude-sonnet-4 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Ministral 3 8B ministral-3-8b-instruct-2512 262.1K 262.1K - - 📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-15
Claude Sonnet 4.5 anthropic-claude-4.5-sonnet 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.6 anthropic-claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
DeepSeek V4 Pro deepseek-v4-pro 1M 393.2K Input: $1.74
Output: $3.48
Model: 0.870
Completion: 2.000
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
gpt-oss-20b openai-gpt-oss-20b 131.1K 131.1K Input: $0.05
Output: $0.45
Model: 0.025
Completion: 9.000
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-04-16
Qwen 2.5 14B Instruct qwen-2.5-14b-instruct 131.1K 131.1K - - 🔧 🌡️ 2024-09 In: text
Out: text
Open Weights
Released: 2024-09-19
Claude Opus 4.8 anthropic-claude-opus-4.8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-28
Updated: 2026-05-29
Claude Opus 4.5 anthropic-claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Opus 4.1 anthropic-claude-4.1-opus 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Llama 3.1 Instruct (8B) llama3-8b-instruct 131.1K 131.1K Input: $0.198
Output: $0.198
Model: 0.099
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Stable Diffusion 3.5 Large stable-diffusion-3.5-large 256 1 Input: $0.08
Output: $0
Model: 0.040 - - In: text
Out: image
Open Weights
Released: 2024-10-22
Updated: 2026-04-30
GPT-5.4 pro openai-gpt-5.4-pro 400K 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
All-MiniLM-L6-v2 all-mini-lm-l6-v2 256 384 Input: $0.009
Output: $0
Model: 0.004 - - In: text
Out: text
Open Weights
Released: 2021-08-30
Updated: 2026-04-16
GPT Image 1.5 openai-gpt-image-1.5 - - Input: $5
Output: $10
Cache Read: $1
Model: 2.500
Completion: 2.000
Cache: 0.200
📎 - In: text, image
Out: image
Released: 2025-11-25
GPT-4o openai-gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
Claude Opus 4 anthropic-claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
BGE Reranker v2 M3 bge-reranker-v2-m3 8.2K 1 Input: $0.01
Output: $0
Model: 0.005 - - In: text
Out: text
Open Weights
Released: 2024-03-12
Updated: 2026-04-30
GPT-5 openai-gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o3-mini openai-o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
Multi-QA-mpnet-base-dot-v1 multi-qa-mpnet-base-dot-v1 512 768 Input: $0.009
Output: $0
Model: 0.004 - - In: text
Out: text
Open Weights
Released: 2021-08-30
Updated: 2026-04-16
Kimi K2.5 kimi-k2.5 262.1K 32.8K Input: $0.5
Output: $2.7
Model: 0.250
Completion: 5.400
🧠 🔧 2025-01 In: text
Out: text
Open Weights
Released: 2026-01
Updated: 2026-04-16
Llama 3.3 Instruct 70B llama3.3-70b-instruct 128K 128K Input: $0.65
Output: $0.65
Model: 0.325
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
GTE Large (v1.5) gte-large-en-v1.5 8.2K 1K Input: $0.09
Output: $0
Model: 0.045 - - In: text
Out: text
Open Weights
Released: 2024-03-27
Updated: 2026-04-16
GPT-5.4 mini openai-gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-5.5 openai-gpt-5.5 1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Updated: 2026-04-30
Nemotron 3 Nano 30B A3B nemotron-3-nano-30b 262.1K 262.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-14
gpt-oss-120b openai-gpt-oss-120b 131.1K 131.1K Input: $0.1
Output: $0.7
Model: 0.050
Completion: 7.000
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-04-16
GPT-5 nano openai-gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Wan2.2-T2V-A14B wan2-2-t2v-a14b 100 1 Input: $0.6
Output: $0
Model: 0.300 - - In: text
Out: video
Open Weights
Released: 2025-07-28
Updated: 2026-04-30
Mistral 7B Instruct v0.3 mistral-7b-instruct-v0.3 32.8K 32.8K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-05-22
Qwen3 TTS VoiceDesign qwen3-tts-voicedesign 32.8K 1 - - - - In: text
Out: audio
Open Weights
Released: 2026-04-21
Updated: 2026-04-30
Mistral Nemo Instruct mistral-nemo-instruct-2407 128K 16.4K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Model: 0.475
Completion: 4.211
📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
DeepSeek V3.2 deepseek-3.2 128K 64K Input: $0.5
Output: $1.6
Model: 0.250
Completion: 3.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-12-02
Updated: 2026-04-30
o1 openai-o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
Qwen 3.5 397B A17B qwen3.5-397b-a17b 262.1K 81.9K Input: $0.55
Output: $3.5
Model: 0.275
Completion: 6.364
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-15
Updated: 2026-04-30
Qwen3 Coder Flash qwen3-coder-flash 262.1K 65.5K Input: $0.45
Output: $1.7
Model: 0.225
Completion: 3.778
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Updated: 2026-04-30
GPT-5.3 Codex openai-gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT-4.1 openai-gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
Claude 3.5 Haiku anthropic-claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-11-05
GPT-5.4 openai-gpt-5.4 1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Trinity Large Thinking arcee-trinity-large-thinking 256K 128K Input: $0.25
Output: $0.9
Cache Read: $0.06
Model: 0.125
Completion: 3.600
Cache: 0.240
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-02
Updated: 2026-04-16
o3 openai-o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
Claude 3.5 Sonnet anthropic-claude-3.5-sonnet 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-06-20
Updated: 2024-10-22
GPT-5 mini openai-gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Nemotron-3-Super-120B nvidia-nemotron-3-super-120b 256K 32.8K Input: $0.3
Output: $0.65
Model: 0.150
Completion: 2.167
🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-04-16
GLM 5 glm-5 202.8K 128K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-11
Updated: 2026-04-16
GPT-5.2 pro openai-gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
Claude 3 Opus anthropic-claude-3-opus 200K 4.1K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08 In: text, image
Out: text
Released: 2024-02-29
Anthropic Claude Fable 5 anthropic-claude-fable-5 1M 128K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-06-09
Updated: 2026-06-12
Nemotron Nano 3 Omni nemotron-3-nano-omni 65.5K 65.5K Input: $0.5
Output: $0.9
Model: 0.250
Completion: 1.800
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-28
Updated: 2026-04-30
Qwen3-32B alibaba-qwen3-32b 131K 41K Input: $0.25
Output: $0.55
Model: 0.125
Completion: 2.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-30
Updated: 2026-04-16
Nemotron Nano 12B v2 VL nemotron-nano-12b-v2-vl 128K 16.4K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-04-30
GPT-5.1 Codex Max openai-gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Fast SDXL fal-ai/fast-sdxl - - - - - - In: text
Out: image
Open Weights
Released: 2023-07-26
Updated: 2026-04-16
ElevenLabs Multilingual TTS v2 fal-ai/elevenlabs/tts/multilingual-v2 - - - - - - In: text
Out: audio
Released: 2023-08-22
Updated: 2026-04-16
FLUX.1 [schnell] fal-ai/flux/schnell - - - - - - In: text
Out: image
Open Weights
Released: 2024-08-01
Updated: 2026-04-16
Stable Audio 2.5 (Text-to-Audio) fal-ai/stable-audio-25/text-to-audio - - - - - - In: text
Out: audio
Released: 2025-10-08
Updated: 2026-04-16

DInference

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.5 minimax-m2.5 200K 32K Input: $0.22
Output: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
GLM-4.7 glm-4.7 200K 128K Input: $0.45
Output: $1.65
Model: 0.225
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-5.1 glm-5.1 200K 128K Input: $1.25
Output: $3.89
Model: 0.625
Completion: 3.112
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GPT OSS 120B gpt-oss-120b 131.1K 32.8K Input: $0.0675
Output: $0.27
Model: 0.034
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08
GLM-5 glm-5 200K 128K Input: $0.75
Output: $2.4
Model: 0.375
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11

doubao

📖 API Address

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
doubao-seed-1-6-flash doubao-seed-1-6-flash 256K 32K - - 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-07-15
doubao-seed-1-6-thinking doubao-seed-1-6-thinking 256K 32K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-07-15
doubao-seed-1-6 doubao-seed-1-6 256K 32K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-06-15

D.Run (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V3 public/deepseek-v3 131.1K 8.2K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-12-26
DeepSeek R1 public/deepseek-r1 131.1K 32K Input: $0.55
Output: $2.2
Model: 0.275
Completion: 4.000
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-20
MiniMax M2.5 public/minimax-m25 204.8K 131.1K Input: $0.29
Output: $1.16
Model: 0.145
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-03-01

evroc

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $1.47
Output: $5.9
Model: 0.735
Completion: 4.014
🧠 🔧 - In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Phi-4 15B microsoft/Phi-4-multimodal-instruct 32K 32K Input: $0.24
Output: $0.47
Model: 0.120
Completion: 1.958
- - In: text, image
Out: text
Open Weights
Released: 2025-01-01
Qwen3 Embedding 8B Qwen/Qwen3-Embedding-8B 41K 4.1K Input: $0.12
Output: $0.12
Model: 0.060
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 30B 2507 Qwen/Qwen3-30B-A3B-Instruct-2507-FP8 64K 64K Input: $0.35
Output: $1.42
Model: 0.175
Completion: 4.057
🔧 - In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 VL 30B Qwen/Qwen3-VL-30B-A3B-Instruct 100K 100K Input: $0.24
Output: $0.94
Model: 0.120
Completion: 3.917
🔧 - In: text, image, video
Out: text
Open Weights
Released: 2025-07-30
GPT OSS 120B openai/gpt-oss-120b 65.5K 65.5K Input: $0.24
Output: $0.94
Model: 0.120
Completion: 3.917
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-05
Whisper 3 Large openai/whisper-large-v3 448 4.1K Input: $0.00236
Output: $0.00236
Output Audio: $2.36
Model: 0.001
Completion: 1000.000
- - In: audio
Out: text
Open Weights
Released: 2024-10-01
Devstral Small 2 24B Instruct 2512 mistralai/devstral-small-2-24b-instruct-2512 32.8K 32.8K Input: $0.12
Output: $0.47
Model: 0.060
Completion: 3.917
🔧 - In: text
Out: text
Open Weights
Released: 2025-12-01
Magistral Small 1.2 24B mistralai/Magistral-Small-2509 131.1K 131.1K Input: $0.59
Output: $2.36
Model: 0.295
Completion: 4.000
- - In: text
Out: text
Open Weights
Released: 2025-06-01
Voxtral Small 24B mistralai/Voxtral-Small-24B-2507 32K 32K Input: $0.00236
Output: $0.00236
Output Audio: $2.36
Model: 0.001
Completion: 1000.000
- - In: audio, text
Out: text
Open Weights
Released: 2025-03-01
Llama 3.3 70B nvidia/Llama-3.3-70B-Instruct-FP8 131.1K 32.8K Input: $1.18
Output: $1.18
Model: 0.590
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2024-12-01
KB Whisper KBLab/kb-whisper-large 448 448 Input: $0.00236
Output: $0.00236
Output Audio: $2.36
Model: 0.001
Completion: 1000.000
- - In: audio
Out: text
Open Weights
Released: 2024-10-01
E5 Multi-Lingual Large Embeddings 0.6B intfloat/multilingual-e5-large-instruct 512 512 Input: $0.12
Output: $0.12
Model: 0.060
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2024-06-01

ExampleCorp AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Novus 1 novus-1 128K 4.1K Input: $5
Output: $15
Cache Read: $0.075
Cache Write: $0.5
Model: 2.500
Completion: 3.000
Cache: 0.015
📎 🧠 🔧 🌡️ 2024-07 In: text, image, audio, video, pdf
Out: text, image, audio, video, pdf
Released: 2025-01-20
Updated: 2025-08-21

FastRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Wan 2.6 wanx/wan-v2-6 400K - - - 📎 - In: text, image
Out: video
Open Weights
Released: 2025-12-01
Kimi K2 moonshotai/kimi-k2 131.1K 32.8K Input: $0.55
Output: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0.75
Output: $3.5
Model: 0.375
Completion: 4.667
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Imagen 4 Fast google/imagen-4.0-fast 480 - - - - - In: text
Out: image
Released: 2025-05-20
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.0375
Model: 0.150
Completion: 8.333
Cache: 0.125
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Model: 0.750
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Veo 3.1 Lite google/veo3.1-lite 400K - - - 📎 - In: text, image
Out: video
Released: 2026-05-01
Gemma 4 31B IT google/gemma-4-31b-it 262.1K 32.8K Input: $0.13
Output: $0.38
Model: 0.065
Completion: 2.923
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Veo 3.1 google/veo3.1 400K - - - 📎 - In: text, image
Out: video
Released: 2026-05-01
Imagen 4 Ultra google/imagen-4.0-ultra 480 - - - - - In: text
Out: image
Released: 2025-05-20
Nano Banana Pro google/gemini-3-pro-image-preview 65.5K 32.8K Input: $2
Output: $12
Model: 1.000
Completion: 6.000
📎 🧠 🌡️ 2025-01 In: text, image
Out: text, image
Released: 2025-11-20
Nano Banana 2 google/gemini-3.1-flash-image-preview 65.5K 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 🌡️ 2025-01 In: text, image, pdf
Out: text, image
Released: 2026-02-26
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Model: 1.000
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Veo 3.1 Fast google/veo3.1-fast 400K - - - 📎 - In: text, image
Out: video
Released: 2026-05-01
Grok 4.3 x-ai/grok-4.3 1M 30K Input: $1.25
Output: $2.5
Model: 0.625
Completion: 2.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Grok 4 x-ai/grok-4 256K 64K Input: $3
Output: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok Build 0.1 x-ai/grok-build-0.1 256K 256K Input: $1
Output: $2
Model: 0.500
Completion: 2.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-16
GLM-5.1 z-ai/glm-5.1 200K 131.1K Input: $1.05
Output: $3.5
Model: 0.525
Completion: 3.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM-5 z-ai/glm-5 204.8K 131.1K Input: $0.95
Output: $3.15
Model: 0.475
Completion: 3.316
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5.4 nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Model: 0.100
Completion: 6.250
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT Realtime 1.5 openai/gpt-realtime-1.5 32K 4.1K Input: $4
Output: $16
Model: 2.000
Completion: 4.000
📎 🔧 🌡️ - In: text, audio, image
Out: text, audio
Released: 2025-06-01
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.4 mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Model: 0.375
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5.5 Pro openai/gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT OSS 20B openai/gpt-oss-20b 131.1K 65.5K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT Image 2 openai/gpt-image-2 128K - - - 📎 🌡️ - In: text, image
Out: image
Released: 2026-04-21
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Model: 2.500
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Seedance 2 bytedance/seedance-2 4.1K - - - 📎 - In: text, image
Out: video
Released: 2026-04-01
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.8 anthropic/claude-opus-4.8 1M 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Sarvam 105B sarvam/sarvam-105b 131.1K 131.1K Input: $0.04
Output: $0.16
Model: 0.020
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-01
Sarvam 30B sarvam/sarvam-30b 128K 128K Input: $0.02
Output: $0.1
Model: 0.010
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-18
DeepSeek R1 Distill Llama 70B deepseek-ai/deepseek-r1-distill-llama-70b 131.1K 131.1K Input: $0.03
Output: $0.14
Model: 0.015
Completion: 4.667
🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
Qwen3 Coder qwen/qwen3-coder 262.1K 66.5K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Model: 0.870
Completion: 2.000
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.7-highspeed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Lucid Origin leonardo-ai/lucid-origin 4.1K - - - 📎 - In: text, image
Out: image
Released: 2025-06-01
Lucid Realism leonardo-ai/lucid-realism 4.1K - - - 📎 - In: text, image
Out: image
Released: 2025-06-01

Fireworks (Firepass)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 Turbo accounts/fireworks/routers/kimi-k2p6-turbo 262K 262K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-17

Fireworks AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 Turbo accounts/fireworks/routers/kimi-k2p6-turbo 262K 262K Input: $2
Output: $8
Cache Read: $0.3
Model: 1.000
Completion: 4.000
Cache: 0.150
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-17
Kimi K2.7 Code Fast accounts/fireworks/routers/kimi-k2p7-code-fast 262K 262K Input: $2
Output: $8
Cache Read: $0.38
Model: 1.000
Completion: 4.000
Cache: 0.190
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-06-12
GLM 5.1 Fast accounts/fireworks/routers/glm-5p1-fast 202.8K 131.1K Input: $2.8
Output: $8.8
Cache Read: $0.52
Model: 1.400
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-01
Kimi K2.6 Fast accounts/fireworks/routers/kimi-k2p6-fast 262K 262K Input: $2
Output: $8
Cache Read: $0.3
Model: 1.000
Completion: 4.000
Cache: 0.150
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-17
Updated: 2026-06-05
DeepSeek V4 Flash accounts/fireworks/models/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.03
Model: 0.070
Completion: 2.000
Cache: 0.214
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro accounts/fireworks/models/deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Cache Read: $0.145
Model: 0.870
Completion: 2.000
Cache: 0.083
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.7 accounts/fireworks/models/minimax-m2p7 196.6K 196.6K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-12
MiniMax-M3 accounts/fireworks/models/minimax-m3 512K 512K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-12
Kimi K2.6 accounts/fireworks/models/kimi-k2p6 262K 262K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-17
Qwen 3.7 Plus accounts/fireworks/models/qwen3p7-plus 262.1K 65.5K Input: $0.4
Output: $1.6
Cache Read: $0.08
Model: 0.200
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-06-12
GLM 5.1 accounts/fireworks/models/glm-5p1 202.8K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-01
GPT OSS 120B accounts/fireworks/models/gpt-oss-120b 131.1K 32.8K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 20B accounts/fireworks/models/gpt-oss-20b 131.1K 32.8K Input: $0.07
Output: $0.3
Cache Read: $0.035
Model: 0.035
Completion: 4.286
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Kimi K2.7 Code accounts/fireworks/models/kimi-k2p7-code 262K 262K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-06-12
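
Note on NewAPI Ratios: in the rows above, the ratios appear to follow from the listed prices. Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price. The Python sketch below reproduces the DeepSeek V4 Flash values under that assumption. The helper name `newapi_ratios` and the base price of $2 per 1M tokens are inferred from the table data, not taken from a documented API.

```python
from typing import Optional

# Assumed base price (USD per 1M input tokens) that maps to a Model ratio of 1.0.
# This constant is inferred from the table rows, not from official documentation.
BASE_INPUT_PRICE = 2.0


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive NewAPI-style ratios from per-1M-token prices (hypothetical helper)."""
    if input_price <= 0:
        # Free models show "-" in the ratios column; there is nothing to compute.
        return {}
    ratios = {
        "model": round(input_price / BASE_INPUT_PRICE, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# DeepSeek V4 Flash: Input $0.14, Output $0.28, Cache Read $0.03
print(newapi_ratios(0.14, 0.28, 0.03))
```

A few rows on this page do not fit this pattern, so treat the relationship as an approximation rather than a rule.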

FreeModel

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.8 claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Cache Write: $1.75
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Fable 5 claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Cache Write: $2.5
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Cache Write: $0.75
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Cache Write: $5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
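
Note on Pricing (1M): prices are read here as US dollars per one million tokens. Under that assumption, the cost of a single request can be estimated as in the Python sketch below. The helper name `estimate_cost`, the token counts, and the idea that cached input tokens are billed at the Cache Read price are illustrative assumptions, not provider documentation.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float,
                  cached_tokens: int = 0, cache_read_price: float = 0.0) -> float:
    """Estimate request cost in USD from per-1M-token prices (hypothetical helper).

    cached_tokens is the part of input_tokens assumed to be served from cache
    and billed at cache_read_price instead of input_price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_price
             + cached_tokens * cache_read_price
             + output_tokens * output_price)
    # Prices are quoted per 1,000,000 tokens.
    return total / 1_000_000


# Claude Haiku 4.5 in the table above: Input $1, Output $5, Cache Read $0.1
cost = estimate_cost(10_000, 2_000, 1.0, 5.0,
                     cached_tokens=4_000, cache_read_price=0.1)
print(f"${cost:.4f}")
```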

Friendli

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct 131.1K 131.1K Input: $0.6
Output: $0.6
Model: 0.300
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-01
Updated: 2025-12-23
Llama 3.1 8B Instruct meta-llama/Llama-3.1-8B-Instruct 131.1K 8K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-01
Updated: 2025-12-23
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 262.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-29
Updated: 2026-01-29
GLM-5 zai-org/GLM-5 202.8K 202.8K Input: $1
Output: $3.2
Cache Read: $0.5
Model: 0.500
Completion: 3.200
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
GLM-5.1 zai-org/GLM-5.1 202.8K 202.8K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-07
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 196.6K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12

FrogBot

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.5 minimax-m2-5 192K 8.2K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
📎 🔧 🌡️ 2024-09 In: text
Out: text
Released: 2025-01-15
Updated: 2025-02-22
Kimi-K2.6 kimi-k2-6 256K 128K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
🧠 🔧 🌡️ - In: text, image
Out: text
Released: -
Z.AI GLM-5.1 zai-glm-5-1 198K 8.2K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
📎 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-02-22
Grok Code Fast 1 grok-code-fast-1 256K 128K Input: $0.2
Output: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Released: 2025-08-28
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-07-17
GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
Qwen 3.6 Plus qwen-3-6-plus 1M 64K Input: $0.5
Output: $3
Cache Read: $0.1
Model: 0.250
Completion: 6.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-02
Updated: 2026-04-03
Grok 4.3 grok-4-3 1M 128K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ 2024-11 In: text, image, video
Out: text
Released: 2026-04-30
DeepSeek v4 Pro deepseek-v4-pro 128K 8.2K Input: $1.74
Output: $3.48
Cache Read: $0.14
Model: 0.870
Completion: 2.000
Cache: 0.080
📎 🔧 🌡️ 2026-01 In: text
Out: text
Released: 2026-04-24
Claude Opus 4.7 claude-opus-4-7 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Grok 4.1 Fast (Non-Reasoning) grok-4-1-fast-non-reasoning 2M 128K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ 2025-11 In: text, image
Out: text
Released: 2025-11-25
MiniMax-M2.7 minimax-m2-7 192K 8.2K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🔧 🌡️ 2024-09 In: text
Out: text
Released: 2026-03-18
GPT-5.3 Codex gpt-5-3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-02-15
GPT-5.4 Nano gpt-5-4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.4 Mini gpt-5-4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Kimi-K2.5 kimi-k2.5 256K 128K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 🌡️ - In: text, image
Out: text
Released: -
GPT OSS 120B gpt-oss-120b 131.1K 32.8K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: -
GPT-5.5 gpt-5-5 272K 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
Claude Haiku 4.5 claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 claude-opus-4-6 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Gemini 3.1 Pro Preview gemini-3-1-pro-preview 1M 64K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2026-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-18
Grok 4.1 Fast (Reasoning) grok-4-1-fast-reasoning 2M 128K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-11 In: text, image
Out: text
Released: 2025-11-25
Claude Sonnet 4.6 claude-sonnet-4-6 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
GPT OSS 20B gpt-oss-20b 131.1K 32.8K Input: $0.07
Output: $0.2
Model: 0.035
Completion: 2.857
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: -

GitHub Copilot

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Raptor mini raptor-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Claude Sonnet 4.5 (latest) claude-sonnet-4.5 200K 32K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 4 (latest) claude-sonnet-4 216K 16K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Gemini 2.5 Pro gemini-2.5-pro 128K 64K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Claude Haiku 4.5 (latest) claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Gemini 3.5 Flash gemini-3.5-flash 200K 64K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
GPT-5.4 nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Claude Opus 4.7 claude-opus-4.7 200K 32K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Opus 4.8 claude-opus-4.8 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Fable 5 claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Opus 4.5 (latest) claude-opus-4.5 200K 32K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
GPT-5.4 gpt-5.4 400K 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 gpt-4.1 128K 16.4K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 200K 64K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Sonnet 4.6 claude-sonnet-4.6 200K 32K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GPT-5 Mini gpt-5-mini 264K 64K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Gemini 3 Flash Preview gemini-3-flash-preview 128K 64K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Claude Opus 4.6 claude-opus-4.6 200K 32K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5.5 gpt-5.5 400K 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23

GitHub Models

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
AI21 Jamba 1.5 Mini ai21-labs/ai21-jamba-1.5-mini 256K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29
AI21 Jamba 1.5 Large ai21-labs/ai21-jamba-1.5-large 256K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29
JAIS 30b Chat core42/jais-30b-chat 8.2K 2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2023-08-30
Grok 3 Mini xai/grok-3-mini 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
Grok 3 xai/grok-3 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
Phi-3.5-MoE instruct (128k) microsoft/phi-3.5-moe-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-3-small instruct (128k) microsoft/phi-3-small-128k-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3.5-mini instruct (128k) microsoft/phi-3.5-mini-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-3-medium instruct (128k) microsoft/phi-3-medium-128k-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-small instruct (8k) microsoft/phi-3-small-8k-instruct 8.2K 2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-4-Reasoning microsoft/phi-4-reasoning 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
MAI-DS-R1 microsoft/mai-ds-r1 65.5K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2025-01-20
Phi-4-mini-instruct microsoft/phi-4-mini-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-4 microsoft/phi-4 16K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3.5-vision instruct (128k) microsoft/phi-3.5-vision-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2024-08-20
Phi-3-medium instruct (4k) microsoft/phi-3-medium-4k-instruct 4.1K 1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-4-multimodal-instruct microsoft/phi-4-multimodal-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Open Weights
Released: 2024-12-11
Phi-4-mini-reasoning microsoft/phi-4-mini-reasoning 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-mini instruct (128k) microsoft/phi-3-mini-128k-instruct 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-mini instruct (4k) microsoft/phi-3-mini-4k-instruct 4.1K 1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
OpenAI o3 openai/o3 200K 100K Input: $0
Output: $0
- 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
OpenAI o1-mini openai/o1-mini 128K 65.5K Input: $0
Output: $0
- 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
Updated: 2024-12-17
GPT-4o openai/gpt-4o 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-05-13
OpenAI o4-mini openai/o4-mini 200K 100K Input: $0
Output: $0
- 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
OpenAI o1-preview openai/o1-preview 128K 32.8K Input: $0
Output: $0
- 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
OpenAI o3-mini openai/o3-mini 200K 100K Input: $0
Output: $0
- 🧠 2024-04 In: text
Out: text
Released: 2025-01-31
GPT-4.1-nano openai/gpt-4.1-nano 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI o1 openai/o1 200K 100K Input: $0
Output: $0
- 🧠 2023-10 In: text, image
Out: text
Released: 2024-09-12
Updated: 2024-12-17
GPT-4.1 openai/gpt-4.1 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1-mini openai/gpt-4.1-mini 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-07-18
Mistral Small 3.1 mistral-ai/mistral-small-2503 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-03-01
Mistral Nemo mistral-ai/mistral-nemo 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-07-18
Mistral Medium 3 (25.05) mistral-ai/mistral-medium-2505 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-05-01
Mistral Large 24.11 mistral-ai/mistral-large-2411 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-11-01
Ministral 3B mistral-ai/ministral-3b 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-10-22
Codestral 25.01 mistral-ai/codestral-2501 32K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2025-01-01
Cohere Command A cohere/cohere-command-a 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-11-01
Cohere Command R 08-2024 cohere/cohere-command-r-08-2024 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command R+ cohere/cohere-command-r-plus 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-04-04
Updated: 2024-08-01
Cohere Command R+ 08-2024 cohere/cohere-command-r-plus-08-2024 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command R cohere/cohere-command-r 128K 4.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-03-11
Updated: 2024-08-01
Meta-Llama-3-8B-Instruct meta/meta-llama-3-8b-instruct 8.2K 2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Llama-3.2-11B-Vision-Instruct meta/llama-3.2-11b-vision-instruct 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Meta-Llama-3.1-405B-Instruct meta/meta-llama-3.1-405b-instruct 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 4 Scout 17B 16E Instruct meta/llama-4-scout-17b-16e-instruct 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Llama-3.3-70B-Instruct meta/llama-3.3-70b-instruct 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Meta-Llama-3.1-70B-Instruct meta/meta-llama-3.1-70b-instruct 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Meta-Llama-3.1-8B-Instruct meta/meta-llama-3.1-8b-instruct 128K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Llama 4 Maverick 17B 128E Instruct FP8 meta/llama-4-maverick-17b-128e-instruct-fp8 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Meta-Llama-3-70B-Instruct meta/meta-llama-3-70b-instruct 8.2K 2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
DeepSeek-R1-0528 deepseek/deepseek-r1-0528 65.5K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-V3-0324 deepseek/deepseek-v3-0324 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek-R1 deepseek/deepseek-r1 65.5K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-01-20

GitLab Duo

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Agentic Chat (Claude Opus 4.5) duo-chat-opus-4-5 200K 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2026-01-08
Agentic Chat (Claude Opus 4.8) duo-chat-opus-4-8 1M 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-05-28
Agentic Chat (Claude Opus 4.7) duo-chat-opus-4-7 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Agentic Chat (GPT-5.2 Codex) duo-chat-gpt-5-2-codex 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-01-22
Agentic Chat (Claude Fable 5) duo-chat-fable-5 1M 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Agentic Chat (GPT-5.5) duo-chat-gpt-5-5 1.1M 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-04-23
Agentic Chat (Claude Opus 4.6) duo-chat-opus-4-6 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Agentic Chat (GPT-5.4) duo-chat-gpt-5-4 1.1M 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Agentic Chat (GPT-5 Codex) duo-chat-gpt-5-codex 400K 128K Input: $0
Output: $0
- 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2026-01-22
Agentic Chat (GPT-5.4 Nano) duo-chat-gpt-5-4-nano 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Agentic Chat (Claude Sonnet 4.6) duo-chat-sonnet-4-6 1M 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Agentic Chat (GPT-5 Mini) duo-chat-gpt-5-mini 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2026-01-22
Agentic Chat (GPT-5.4 Mini) duo-chat-gpt-5-4-mini 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Agentic Chat (GPT-5.3 Codex) duo-chat-gpt-5-3-codex 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Agentic Chat (Claude Haiku 4.5) duo-chat-haiku-4-5 200K 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2026-01-08
Agentic Chat (GPT-5.2) duo-chat-gpt-5-2 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-01-23
Agentic Chat (Claude Sonnet 4.5) duo-chat-sonnet-4-5 200K 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2026-01-08
Agentic Chat (GPT-5.1) duo-chat-gpt-5-1 400K 128K Input: $0
Output: $0
- 📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2026-01-22

GMI Cloud

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/Kimi-K2.6 65.5K 65.5K Input: $0.855
Output: $3.6
Cache Read: $0.144
Model: 0.427
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-04-21
Claude Opus 4.7 anthropic/claude-opus-4.7 409.6K 128K Input: $4.5
Output: $22.5
Cache Read: $0.45
Model: 2.250
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text
Out: text
Released: 2026-04-16
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 409.6K 64K Input: $3
Output: $15
Cache Read: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Opus 4.6 anthropic/claude-opus-4.6 409.6K 128K Input: $5
Output: $25
Cache Read: $0.5
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GLM-5.1 zai-org/GLM-5.1-FP8 202.8K 131.1K Input: $0.98
Output: $3.08
Cache Read: $0.182
Model: 0.490
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM-5 zai-org/GLM-5-FP8 202.8K 131.1K Input: $0.6
Output: $1.92
Cache Read: $0.12
Model: 0.300
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash 1M 384K Input: $0.112
Output: $0.224
Cache Read: $0.022
Model: 0.056
Completion: 2.000
Cache: 0.196
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 1M 384K Input: $1.392
Output: $2.784
Cache Read: $0.116
Model: 0.696
Completion: 2.000
Cache: 0.083
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
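
The NewAPI Ratios in the rows above appear to follow directly from the listed prices: Model is the input price divided by $2 per 1M tokens, Completion is output price over input price, and Cache is cache-read price over input price. The sketch below reproduces that arithmetic. The function name `newapi_ratios` and the constant `BASE_PRICE_PER_1M` are hypothetical, and the $2 base is an assumption inferred from the table values, not something this page states. Rows that also list an Input Audio price do not always match this pattern.

```python
# Assumption: one unit of model ratio corresponds to $2 per 1M input tokens.
# This is inferred from the table rows, not stated by the page.
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(input_price, output_price, cache_read_price=None):
    """Return (model, completion, cache) ratios from USD prices per 1M tokens.

    cache is None when no cache-read price is listed for the model.
    """
    if input_price <= 0:
        # Free or unpriced models show "-" in the ratios column.
        raise ValueError("input price must be positive to derive ratios")
    model = input_price / BASE_PRICE_PER_1M
    completion = output_price / input_price
    cache = None if cache_read_price is None else cache_read_price / input_price
    return model, completion, cache


# Claude Opus 4.7 row from the GMI Cloud table:
# Input $4.5, Output $22.5, Cache Read $0.45.
model, completion, cache = newapi_ratios(4.5, 22.5, 0.45)
print(f"Model: {model:.3f}  Completion: {completion:.3f}  Cache: {cache:.3f}")
# prints "Model: 2.250  Completion: 5.000  Cache: 0.100"
```

The printed values match the Model, Completion, and Cache entries shown for that row.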

Google

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 3.1 Flash Lite gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemini 2.5 Flash Preview TTS gemini-2.5-flash-preview-tts 8.2K 16.4K Input: $0.5
Output: $10
Model: 0.250
Completion: 20.000
🌡️ 2025-01 In: text
Out: audio
Released: 2025-05-01
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 3.5 Flash gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Gemma 4 31B IT gemma-4-31b-it 262.1K 32.8K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemma 4 E4B IT gemma-4-E4B-it 131.1K 8.2K - - 📎 🧠 🔧 🌡️ - In: text, image, audio
Out: text
Open Weights
Released: 2026-04-02
gemini-embedding-001 gemini-embedding-001 2K 3.1K Input: $0.15
Output: $0
Cache Read: $0
Cache Write: $0
Model: 0.075 🔧 2025-06 In: text
Out: text
Released: 2025-06-01
Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini Flash-Lite Latest gemini-flash-lite-latest 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemma 4 E2B IT gemma-4-E2B-it 131.1K 8.2K - - 📎 🧠 🔧 🌡️ - In: text, image, audio
Out: text
Open Weights
Released: 2026-04-02
Nano Banana Pro gemini-3-pro-image-preview 131.1K 32.8K Input: $2
Output: $120
Model: 1.000
Completion: 60.000
📎 🧠 🌡️ 2025-01 In: text, image
Out: text, image
Released: 2025-11-20
Nano Banana gemini-2.5-flash-image 32.8K 32.8K Input: $0.3
Output: $30
Cache Read: $0.075
Model: 0.150
Completion: 100.000
Cache: 0.250
📎 🧠 🌡️ 2025-06 In: text, image
Out: text, image
Released: 2025-08-26
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Nano Banana 2 gemini-3.1-flash-image-preview 65.5K 65.5K Input: $0.5
Output: $60
Model: 0.250
Completion: 120.000
📎 🧠 🌡️ 2025-01 In: text, image, pdf
Out: text, image
Released: 2026-02-26
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemma 4 26B A4B IT gemma-4-26b-a4b-it 262.1K 32.8K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 3 Pro Preview gemini-3-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Gemini 2.5 Pro Preview TTS gemini-2.5-pro-preview-tts 8.2K 16.4K Input: $1
Output: $20
Model: 0.500
Completion: 20.000
🌡️ 2025-01 In: text
Out: audio
Released: 2025-05-01
Gemini Flash Latest gemini-flash-latest 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.075
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
Gemini 2.0 Flash-Lite gemini-2.0-flash-lite 1M 8.2K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11

Vertex

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Pro TTS gemini-2.5-pro-tts 32.8K 16.4K Input: $1
Output: $20
Model: 0.500
Completion: 20.000
- 2025-01 In: text
Out: audio
Released: 2025-09-30
Updated: 2025-12-10
Claude Haiku 4.5 claude-haiku-4-5@20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Gemini 3.1 Flash Lite gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash TTS gemini-2.5-flash-tts 32.8K 16.4K Input: $0.5
Output: $10
Model: 0.250
Completion: 20.000
- 2025-01 In: text
Out: audio
Released: 2025-09-30
Updated: 2025-12-10
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.5 Flash gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Claude Opus 4 claude-opus-4@20250514 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.1 claude-opus-4-1@20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Gemini Embedding 001 gemini-embedding-001 2K 1 Input: $0.15
Output: $0
Model: 0.075 - 2025-05 In: text
Out: text
Released: 2025-05-20
Claude Opus 4.5 claude-opus-4-5@20251101 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
Claude Haiku 3.5 claude-3-5-haiku@20241022 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini Flash-Lite Latest gemini-flash-lite-latest 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Claude Sonnet 4 claude-sonnet-4@20250514 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Claude Opus 4.7 claude-opus-4-7@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Sonnet 4.5 claude-sonnet-4-5@20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Claude Opus 4.6 claude-opus-4-6@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Gemini Flash Latest gemini-flash-latest 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Claude Opus 4.8 claude-opus-4-8@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Sonnet 4.6 claude-sonnet-4-6@default 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
Kimi K2 Thinking moonshotai/kimi-k2-thinking-maas 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-13
GPT OSS 120B openai/gpt-oss-120b-maas 131.1K 32.8K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 20B openai/gpt-oss-20b-maas 131.1K 32.8K Input: $0.07
Output: $0.25
Model: 0.035
Completion: 3.571
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GLM-4.7 zai-org/glm-4.7-maas 200K 128K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text, pdf
Out: text
Open Weights
Released: 2026-01-06
GLM-5 zai-org/glm-5-maas 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.1
Model: 0.500
Completion: 3.200
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
DeepSeek V3.1 deepseek-ai/deepseek-v3.1-maas 163.8K 32.8K Input: $0.6
Output: $1.7
Model: 0.300
Completion: 2.833
🧠 🔧 🌡️ - In: text, pdf
Out: text
Open Weights
Released: 2025-08-28
DeepSeek V3.2 deepseek-ai/deepseek-v3.2-maas 163.8K 65.5K Input: $0.56
Output: $1.68
Cache Read: $0.056
Model: 0.280
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text, pdf
Out: text
Open Weights
Released: 2025-12-17
Updated: 2026-04-04
Qwen3 235B A22B Instruct qwen/qwen3-235b-a22b-instruct-2507-maas 262.1K 16.4K Input: $0.22
Output: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-13
Llama 3.3 70B Instruct meta/llama-3.3-70b-instruct-maas 128K 8.2K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-04-29
Llama 4 Maverick 17B 128E Instruct meta/llama-4-maverick-17b-128e-instruct-maas 524.3K 8.2K Input: $0.35
Output: $1.15
Model: 0.175
Completion: 3.286
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-29

Vertex (Anthropic)

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 4.5 claude-haiku-4-5@20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4 claude-opus-4@20250514 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.1 claude-opus-4-1@20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Opus 4.5 claude-opus-4-5@20251101 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
Claude Haiku 3.5 claude-3-5-haiku@20241022 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Sonnet 4 claude-sonnet-4@20250514 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.7 claude-opus-4-7@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Sonnet 4.5 claude-sonnet-4-5@20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.6 claude-opus-4-6@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Opus 4.8 claude-opus-4-8@default 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Sonnet 4.6 claude-sonnet-4-6@default 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13

Groq

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.3 70B llama-3.3-70b-versatile 131.1K 32.8K Input: $0.59
Output: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 3.1 8B llama-3.1-8b-instant 131.1K 131.1K Input: $0.05
Output: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Whisper Large V3 Turbo whisper-large-v3-turbo - - - - 🌡️ - In: audio
Out: text
Open Weights
Released: 2024-10-01
Whisper whisper-large-v3 - - - - 🌡️ - In: audio
Out: text
Open Weights
Released: 2023-09-01
Updated: 2025-09-05
Prompt Guard 2 86M meta-llama/llama-prompt-guard-2-86m 512 512 Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2025-05-29
Llama Prompt Guard 2 22M meta-llama/llama-prompt-guard-2-22m 512 512 Input: $0.03
Output: $0.03
Model: 0.015
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2025-05-29
Llama 4 Scout 17B 16E meta-llama/llama-4-scout-17b-16e-instruct 131.1K 8.2K Input: $0.11
Output: $0.34
Model: 0.055
Completion: 3.091
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Safety GPT OSS 20B openai/gpt-oss-safeguard-20b 131.1K 65.5K Input: $0.075
Output: $0.3
Cache Read: $0.037
Model: 0.037
Completion: 4.000
Cache: 0.493
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-29
GPT OSS 120B openai/gpt-oss-120b 131.1K 65.5K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2025-10-21
GPT OSS 20B openai/gpt-oss-20b 131.1K 65.5K Input: $0.075
Output: $0.3
Cache Read: $0.0375
Model: 0.037
Completion: 4.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2025-09-25
Canopy Labs Orpheus V1 English canopylabs/orpheus-v1-english 4K 50K - - - - In: text
Out: audio
Released: 2025-12-19
Canopy Labs Orpheus Arabic Saudi canopylabs/orpheus-arabic-saudi 4K 50K - - - - In: text
Out: audio
Released: 2025-12-16
Compound groq/compound 131.1K 8.2K - - 🌡️ - In: text
Out: text
Released: 2025-09-04
Compound Mini groq/compound-mini 131.1K 8.2K - - 🌡️ - In: text
Out: text
Released: 2025-09-04
Qwen3-32B qwen/qwen3-32b 131.1K 41K Input: $0.29
Output: $0.59
Model: 0.145
Completion: 2.034
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-11
Updated: 2025-06-12

Helicone

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
OpenAI ChatGPT-4o chatgpt-4o-latest 128K 16.4K Input: $5
Output: $20
Cache Read: $2.5
Model: 2.500
Completion: 4.000
Cache: 0.500
🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-14
OpenAI GPT-4.1 Mini gpt-4.1-mini-2025-04-14 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-04-14
DeepSeek V3.1 Terminus deepseek-v3.1-terminus 128K 16.4K Input: $0.27
Output: $1
Cache Read: $0.216
Model: 0.135
Completion: 3.704
Cache: 0.800
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Released: 2025-09-22
Anthropic: Claude 3.5 Haiku claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-10-22
Meta Llama 3.1 8B Instruct llama-3.1-8b-instruct 16.4K 16.4K Input: $0.02
Output: $0.05
Model: 0.010
Completion: 2.500
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-23
OpenAI o3 o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
🔧 2024-06 In: text, image
Out: text
Released: 2024-06-01
Meta Llama Prompt Guard 2 86M llama-prompt-guard-2-86m 512 2 Input: $0.01
Output: $0.01
Model: 0.005
Completion: 1.000
🌡️ 2024-10 In: text
Out: text
Released: 2024-10-01
Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 262.1K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-31
Hermes 2 Pro Llama 3 8B hermes-2-pro-llama-3-8b 131.1K 131.1K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🔧 🌡️ 2024-05 In: text
Out: text
Released: 2024-05-27
DeepSeek V3 deepseek-v3 128K 8.2K Input: $0.56
Output: $1.68
Cache Read: $0.07
Model: 0.280
Completion: 3.000
Cache: 0.125
🔧 🌡️ 2024-12 In: text
Out: text
Released: 2024-12-26
xAI Grok Code Fast 1 grok-code-fast-1 256K 10K Input: $0.2
Output: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-25
OpenAI: o1-mini o1-mini 128K 65.5K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
- 2025-01 In: text
Out: text
Released: 2025-01-01
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 128K 4.1K Input: $0.03
Output: $0.13
Model: 0.015
Completion: 4.333
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-01-20
Qwen3 32B qwen3-32b 131.1K 41K Input: $0.29
Output: $0.59
Model: 0.145
Completion: 2.034
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-28
Anthropic: Claude Sonnet 4 claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-14
Qwen3 Next 80B A3B Instruct qwen3-next-80b-a3b-instruct 262K 16.4K Input: $0.14
Output: $1.4
Model: 0.070
Completion: 10.000
🔧 🌡️ 2025-01 In: text, image, video
Out: text
Released: 2025-01-01
Meta Llama 4 Maverick 17B 128E llama-4-maverick 131.1K 8.2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-01-01
Mistral Nemo mistral-nemo 128K 16.4K Input: $20
Output: $40
Model: 10.000
Completion: 2.000
🌡️ 2024-07 In: text, image
Out: text
Released: 2024-07-18
Meta Llama 3.3 70B Versatile llama-3.3-70b-versatile 131.1K 32.7K Input: $0.59
Output: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2024-12 In: text
Out: text
Released: 2024-12-06
Google Gemma 2 gemma2-9b-it 8.2K 8.2K Input: $0.01
Output: $0.03
Model: 0.005
Completion: 3.000
🌡️ 2024-06 In: text
Out: text
Released: 2024-06-25
Google Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.3125
Cache Write: $1.25
Model: 0.625
Completion: 8.000
Cache: 0.250
🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-06-17
OpenAI GPT-5 gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text
Released: 2025-01-01
Anthropic: Claude 4.5 Haiku (20251001) claude-haiku-4-5-20251001 200K 8.2K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2025-10 In: text, image
Out: text
Released: 2025-10-01
Anthropic: Claude 4.5 Haiku claude-4.5-haiku 200K 8.2K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2025-10 In: text, image
Out: text
Released: 2025-10-01
OpenAI: GPT-5 Pro gpt-5-pro 128K 32.8K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
- 2025-01 In: text
Out: text
Released: 2025-01-01
Google Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Cache Write: $0.3
Model: 0.150
Completion: 8.333
Cache: 0.250
🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-06-17
OpenAI GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2024-05-13
OpenAI o4 Mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
🔧 2024-06 In: text, image
Out: text
Released: 2024-06-01
Meta Llama 3.1 8B Instant llama-3.1-8b-instant 131.1K 32.7K Input: $0.05
Output: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-01
Meta Llama Prompt Guard 2 22M llama-prompt-guard-2-22m 512 2 Input: $0.01
Output: $0.01
Model: 0.005
Completion: 1.000
🌡️ 2024-10 In: text
Out: text
Released: 2024-10-01
OpenAI o3 Pro o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
🔧 2024-06 In: text, image
Out: text
Released: 2024-06-01
xAI Grok 3 Mini grok-3-mini 131.1K 131.1K Input: $0.3
Output: $0.5
Cache Read: $0.075
Model: 0.150
Completion: 1.667
Cache: 0.250
🔧 🌡️ 2024-06 In: text
Out: text
Released: 2024-06-01
Anthropic: Claude Opus 4.1 (20250805) claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-08 In: text, image
Out: text
Released: 2025-08-05
DeepSeek TNG R1T2 Chimera deepseek-tng-r1t2-chimera 130K 163.8K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-02
Meta Llama 3.3 70B Instruct llama-3.3-70b-instruct 128K 16.4K Input: $0.13
Output: $0.39
Model: 0.065
Completion: 3.000
🔧 🌡️ 2024-12 In: text
Out: text
Released: 2024-12-06
Perplexity Sonar Reasoning Pro sonar-reasoning-pro 127K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
🧠 🌡️ 2025-01 In: text
Out: text
Released: 2025-01-27
xAI Grok 3 grok-3 131.1K 131.1K Input: $3
Output: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-06 In: text
Out: text
Released: 2024-06-01
Zai GLM-4.6 glm-4.6 204.8K 131.1K Input: $0.45
Output: $1.5
Model: 0.225
Completion: 3.333
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-18
Kimi K2 Thinking kimi-k2-thinking 256K 262.1K Input: $0.48
Output: $2
Model: 0.240
Completion: 4.167
🔧 🌡️ 2025-11 In: text
Out: text
Released: 2025-11-06
xAI Grok 4.1 Fast Non-Reasoning grok-4-1-fast-non-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🔧 🌡️ 2025-11 In: text, image
Out: text, image
Released: 2025-11-17
Qwen3 Coder 480B A35B Instruct Turbo qwen3-coder 262.1K 16.4K Input: $0.22
Output: $0.95
Model: 0.110
Completion: 4.318
🔧 🌡️ 2025-07 In: text, image, audio, video
Out: text
Released: 2025-07-23
OpenAI GPT-5 Chat Latest gpt-5-chat-latest 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2024-09 In: text, image
Out: text
Released: 2024-09-30
OpenAI: GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text, image
Released: 2025-01-01
OpenAI o3 Mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🔧 2023-10 In: text
Out: text
Released: 2023-10-01
Mistral-Large mistral-large-2411 128K 32.8K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-24
Google Gemini 2.5 Flash Lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.025
Cache Write: $0.1
Model: 0.050
Completion: 4.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-07-22
Meta Llama Guard 4 12B llama-guard-4 131.1K 1K Input: $0.21
Output: $0.21
Model: 0.105
Completion: 1.000
🌡️ 2025-01 In: text, image
Out: text
Released: 2025-01-01
Anthropic: Claude Opus 4.1 claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-08 In: text, image
Out: text
Released: 2025-08-05
Anthropic: Claude 3.7 Sonnet claude-3.7-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2025-02 In: text, image
Out: text
Released: 2025-02-19
OpenAI: GPT-5.1 Codex Mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text, image
Released: 2025-01-01
OpenAI GPT-5.1 Chat gpt-5.1-chat-latest 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text, image
Released: 2025-01-01
Anthropic: Claude 3 Haiku claude-3-haiku-20240307 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
🔧 🌡️ 2024-03 In: text, image
Out: text
Released: 2024-03-07
xAI: Grok 4 Fast Reasoning grok-4-fast-reasoning 2M 2M Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🧠 🔧 🌡️ 2025-09 In: text, image
Out: text
Released: 2025-09-01
OpenAI GPT-4.1 Nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI GPT-OSS 120b gpt-oss-120b 131.1K 131.1K Input: $0.04
Output: $0.16
Model: 0.020
Completion: 4.000
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2024-06-01
Perplexity Sonar sonar 127K 4.1K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-01 In: text
Out: text
Released: 2025-01-27
Qwen2.5 Coder 7B fast qwen2.5-coder-7b-fast 32K 8.2K Input: $0.03
Output: $0.09
Model: 0.015
Completion: 3.000
🌡️ 2024-09 In: text
Out: text
Released: 2024-09-15
OpenAI: o1 o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
- 2025-01 In: text
Out: text
Released: 2025-01-01
Baidu Ernie 4.5 21B A3B Thinking ernie-4.5-21b-a3b-thinking 128K 8K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🧠 🌡️ 2025-03 In: text
Out: text
Released: 2025-03-16
Meta Llama 4 Scout 17B 16E llama-4-scout 131.1K 8.2K Input: $0.08
Output: $0.3
Model: 0.040
Completion: 3.750
🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-01-01
Perplexity Sonar Pro sonar-pro 200K 4.1K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
🌡️ 2025-01 In: text
Out: text
Released: 2025-01-27
OpenAI GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-04-14
Anthropic: Claude Sonnet 4.5 (20250929) claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-09 In: text, image
Out: text
Released: 2025-09-29
DeepSeek Reasoner deepseek-reasoner 128K 64K Input: $0.56
Output: $1.68
Cache Read: $0.07
Model: 0.280
Completion: 3.000
Cache: 0.125
🌡️ 2025-01 In: text
Out: text
Released: 2025-01-20
xAI Grok 4.1 Fast Reasoning grok-4-1-fast-reasoning 2M 2M Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🧠 🔧 🌡️ 2025-11 In: text, image
Out: text
Released: 2025-11-17
Google Gemini 3 Pro Preview gemini-3-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-11 In: text, image, audio, video
Out: text
Released: 2025-11-18
OpenAI GPT-5 Mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text
Released: 2025-01-01
OpenAI GPT-4.1 Mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-04-14
Perplexity Sonar Reasoning sonar-reasoning 127K 4.1K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
🧠 🌡️ 2025-01 In: text
Out: text
Released: 2025-01-27
Perplexity Sonar Deep Research sonar-deep-research 127K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
🧠 🌡️ 2025-01 In: text
Out: text
Released: 2025-01-27
Kimi K2 (09/05) kimi-k2-0905 262.1K 16.4K Input: $0.5
Output: $2
Cache Read: $0.4
Model: 0.250
Completion: 4.000
Cache: 0.800
🔧 🌡️ 2025-09 In: text
Out: text
Released: 2025-09-05
OpenAI GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text
Released: 2025-01-01
xAI Grok 4 grok-4 256K 256K Input: $3
Output: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-09
Qwen3 235B A22B Thinking qwen3-235b-a22b-thinking 262.1K 81.9K Input: $0.3
Output: $2.9
Model: 0.150
Completion: 9.667
🧠 🌡️ 2025-07 In: text, image, video
Out: text
Released: 2025-07-25
Anthropic: Claude Opus 4 claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-14
xAI Grok 4 Fast Non-Reasoning grok-4-fast-non-reasoning 2M 2M Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🔧 🌡️ 2025-09 In: text, image, audio
Out: text
Released: 2025-09-19
Anthropic: Claude Opus 4.5 claude-4.5-opus 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-11 In: text, image
Out: text
Released: 2025-11-24
Qwen3 VL 235B A22B Instruct qwen3-vl-235b-a22b-instruct 256K 16.4K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
🔧 🌡️ 2025-09 In: text, image, video
Out: text
Released: 2025-09-23
Kimi K2 (07/11) kimi-k2-0711 131.1K 16.4K Input: $0.57
Output: $2.3
Model: 0.285
Completion: 4.035
🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-01-01
Google Gemma 3 12B gemma-3-12b-it 131.1K 8.2K Input: $0.05
Output: $0.1
Model: 0.025
Completion: 2.000
🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-01
OpenAI GPT-4o-mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
🔧 🌡️ 2024-07 In: text, image
Out: text
Released: 2024-07-18
OpenAI GPT-OSS 20b gpt-oss-20b 131.1K 131.1K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2024-06-01
Anthropic: Claude 3.5 Sonnet v2 claude-3.5-sonnet-v2 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-10-22
Qwen3 30B A3B qwen3-30b-a3b 41K 41K Input: $0.08
Output: $0.29
Model: 0.040
Completion: 3.625
🔧 🌡️ 2025-06 In: text, image
Out: text
Released: 2025-06-01
OpenAI: GPT-5 Codex gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text
Out: text
Released: 2025-01-01
DeepSeek V3.2 deepseek-v3.2 163.8K 65.5K Input: $0.27
Output: $0.41
Model: 0.135
Completion: 1.519
🔧 🌡️ 2025-09 In: text
Out: text
Released: 2025-09-22
Anthropic: Claude Sonnet 4.5 claude-4.5-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-09 In: text, image
Out: text
Released: 2025-09-29
Mistral Small 3.2 mistral-small 128K 16.4K Input: $0.075
Output: $0.2
Model: 0.037
Completion: 2.667
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Meta Llama 3.1 8B Instruct Turbo llama-3.1-8b-instruct-turbo 128K 128K Input: $0.02
Output: $0.03
Model: 0.010
Completion: 1.500
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-23
OpenAI GPT-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🔧 2025-01 In: text, image
Out: text, image
Released: 2025-01-01

HPC-AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.5 moonshotai/kimi-k2.5 262.1K 262.1K Input: $0.3
Output: $1.5
Cache Read: $0.05
Model: 0.150
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-01
Updated: 2026-06-01
GLM 5.1 zai-org/glm-5.1 202K 202K Input: $0.615
Output: $2.46
Cache Read: $0.133
Model: 0.307
Completion: 4.000
Cache: 0.216
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-08
Updated: 2026-06-01
MiniMax M2.5 minimax/minimax-m2.5 1M 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-06-01

Hugging Face

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 262.1K 16.4K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-04
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 131.1K 16.4K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi-K2.6 moonshotai/Kimi-K2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-20
Kimi-K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-01
Qwen3-Coder-Next Qwen/Qwen3-Coder-Next 262.1K 65.5K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-02-03
Qwen 3 Embedding 8B Qwen/Qwen3-Embedding-8B 32K 4.1K Input: $0.01
Output: $0
Model: 0.005 - 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262.1K 66.5K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B 262.1K 32.8K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2026-02-01
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K Input: $0.3
Output: $3
Model: 0.150
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen 3 Embedding 4B Qwen/Qwen3-Embedding-4B 32K 2K Input: $0.01
Output: $0
Model: 0.005 - 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K Input: $2
Output: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 262.1K 131.1K Input: $0.3
Output: $2
Model: 0.150
Completion: 6.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
MiMo-V2-Flash XiaomiMiMo/MiMo-V2-Flash 262.1K 4.1K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-12-16
GLM-4.7-Flash zai-org/GLM-4.7-Flash 200K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-08-08
GLM-5 zai-org/GLM-5 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-4.7 zai-org/GLM-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-5.1 zai-org/GLM-5.1 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-03
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 163.8K 163.8K Input: $3
Output: $5
Model: 1.500
Completion: 1.667
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 1M 393.2K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 163.8K 65.5K Input: $0.28
Output: $0.4
Model: 0.140
Completion: 1.429
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
MiniMax-M2.1 MiniMaxAI/MiniMax-M2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ 2025-10 In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

iFlow

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3-Coder-Plus qwen3-coder-plus 256K 64K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-01
DeepSeek-V3 deepseek-v3 128K 32K Input: $0
Output: $0
- 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-12-26
Kimi-K2 kimi-k2 128K 64K Input: $0
Output: $0
- 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-01
Qwen3-32B qwen3-32b 128K 32K Input: $0
Output: $0
- 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-12-01
Qwen3-Max-Preview qwen3-max-preview 256K 32K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-01-01
Qwen3-Max qwen3-max 256K 32K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-01-01
Qwen3-235B-A22B qwen3-235b 128K 32K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-12-01
GLM-4.6 glm-4.6 200K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-01
Updated: 2025-11-13
Qwen3-235B-A22B-Thinking qwen3-235b-a22b-thinking-2507 256K 64K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-01
DeepSeek-R1 deepseek-r1 128K 32K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-20
Qwen3-VL-Plus qwen3-vl-plus 256K 32K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2025-01-01
Qwen3-235B-A22B-Instruct qwen3-235b-a22b-instruct 256K 64K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-01
Kimi-K2-0905 kimi-k2-0905 256K 64K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-09-05
DeepSeek-V3.2-Exp deepseek-v3.2 128K 64K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01

Inception

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Mercury Edit 2 mercury-edit-2 128K 8.2K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🌡️ - In: text
Out: text
Released: 2026-03-30
Mercury 2 mercury-2 128K 50K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-02-24

Inceptron

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 262.1K Input: $0.78
Output: $3.5
Cache Read: $0.2
Cache Write: $0
Model: 0.390
Completion: 4.487
Cache: 0.256
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Llama 3.3 70B Instruct nvidia/llama-3.3-70b-instruct-fp8 131.1K 131.1K Input: $0.12
Output: $0.38
Cache Read: $0
Cache Write: $0
Model: 0.060
Completion: 3.167
📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
GLM 5.1 zai-org/GLM-5.1-FP8 202.8K 202.8K Input: $1.4
Output: $4.4
Cache Read: $0.26
Cache Write: $0
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 196.6K Input: $0.24
Output: $0.9
Cache Read: $0.03
Cache Write: $0
Model: 0.120
Completion: 3.750
Cache: 0.125
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12

Inference

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Mistral Nemo 12B Instruct mistral/mistral-nemo-12b-instruct 16K 4.1K Input: $0.038
Output: $0.1
Model: 0.019
Completion: 2.632
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Google Gemma 3 google/gemma-3 125K 4.1K Input: $0.15
Output: $0.3
Model: 0.075
Completion: 2.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Osmosis Structure 0.6B osmosis/osmosis-structure-0.6b 4K 2K Input: $0.1
Output: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen 3 Embedding 4B qwen/qwen3-embedding-4b 32K 2K Input: $0.01
Output: $0
Model: 0.005 - 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen 2.5 7B Vision Instruct qwen/qwen-2.5-7b-vision-instruct 125K 4.1K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct 16K 4.1K Input: $0.025
Output: $0.025
Model: 0.013
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 1B Instruct meta/llama-3.2-1b-instruct 16K 4.1K Input: $0.01
Output: $0.01
Model: 0.005
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 11B Vision Instruct meta/llama-3.2-11b-vision-instruct 16K 4.1K Input: $0.055
Output: $0.055
Model: 0.028
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct 16K 4.1K Input: $0.02
Output: $0.02
Model: 0.010
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01

IO.NET

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 4 Maverick 17B 128E Instruct meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 430K 4.1K Input: $0.15
Output: $0.6
Cache Read: $0.075
Cache Write: $0.3
Model: 0.075
Completion: 4.000
Cache: 0.500
🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-15
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct 128K 4.1K Input: $0.13
Output: $0.38
Cache Read: $0.065
Cache Write: $0.26
Model: 0.065
Completion: 2.923
Cache: 0.500
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 3.2 90B Vision Instruct meta-llama/Llama-3.2-90B-Vision-Instruct 16K 4.1K Input: $0.35
Output: $0.4
Cache Read: $0.175
Cache Write: $0.7
Model: 0.175
Completion: 1.143
Cache: 0.500
🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Kimi K2 Thinking moonshotai/Kimi-K2-Thinking 32.8K 4.1K Input: $0.55
Output: $2.25
Cache Read: $0.275
Cache Write: $1.1
Model: 0.275
Completion: 4.091
Cache: 0.500
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-11-01
Kimi K2 Instruct moonshotai/Kimi-K2-Instruct-0905 32.8K 4.1K Input: $0.39
Output: $1.9
Cache Read: $0.195
Cache Write: $0.78
Model: 0.195
Completion: 4.872
Cache: 0.500
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-09-05
Qwen 2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct 32K 4.1K Input: $0.05
Output: $0.22
Cache Read: $0.025
Cache Write: $0.1
Model: 0.025
Completion: 4.400
Cache: 0.500
🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Qwen 3 Next 80B Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262.1K 4.1K Input: $0.1
Output: $0.8
Cache Read: $0.05
Cache Write: $0.2
Model: 0.050
Completion: 8.000
Cache: 0.500
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-10
Qwen 3 235B Thinking Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 4.1K Input: $0.11
Output: $0.6
Cache Read: $0.055
Cache Write: $0.22
Model: 0.055
Completion: 5.455
Cache: 0.500
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-07-01
GPT-OSS 120B openai/gpt-oss-120b 131.1K 4.1K Input: $0.04
Output: $0.4
Cache Read: $0.02
Cache Write: $0.08
Model: 0.020
Completion: 10.000
Cache: 0.500
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-12-01
GPT-OSS 20B openai/gpt-oss-20b 64K 4.1K Input: $0.03
Output: $0.14
Cache Read: $0.015
Cache Write: $0.06
Model: 0.015
Completion: 4.667
Cache: 0.500
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-12-01
Devstral Small 2505 mistralai/Devstral-Small-2505 128K 4.1K Input: $0.05
Output: $0.22
Cache Read: $0.025
Cache Write: $0.1
Model: 0.025
Completion: 4.400
Cache: 0.500
🔧 🌡️ 2024-12 In: text
Out: text
Released: 2025-05-01
Magistral Small 2506 mistralai/Magistral-Small-2506 128K 4.1K Input: $0.5
Output: $1.5
Cache Read: $0.25
Cache Write: $1
Model: 0.250
Completion: 3.000
Cache: 0.500
🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-06-01
Mistral Large Instruct 2411 mistralai/Mistral-Large-Instruct-2411 128K 4.1K Input: $2
Output: $6
Cache Read: $1
Cache Write: $4
Model: 1.000
Completion: 3.000
Cache: 0.500
🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-11-01
Mistral Nemo Instruct 2407 mistralai/Mistral-Nemo-Instruct-2407 128K 4.1K Input: $0.02
Output: $0.04
Cache Read: $0.01
Cache Write: $0.04
Model: 0.010
Completion: 2.000
Cache: 0.500
🔧 🌡️ 2024-05 In: text
Out: text
Open Weights
Released: 2024-07-01
Qwen 3 Coder 480B Intel/Qwen3-Coder-480B-A35B-Instruct-int4-mixed-ar 106K 4.1K Input: $0.22
Output: $0.95
Cache Read: $0.11
Cache Write: $0.44
Model: 0.110
Completion: 4.318
Cache: 0.500
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-15
GLM 4.6 zai-org/GLM-4.6 200K 4.1K Input: $0.4
Output: $1.75
Cache Read: $0.2
Cache Write: $0.8
Model: 0.200
Completion: 4.375
Cache: 0.500
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-11-15
DeepSeek R1 deepseek-ai/DeepSeek-R1-0528 128K 4.1K Input: $2
Output: $8.75
Cache Read: $1
Cache Write: $4
Model: 1.000
Completion: 4.375
Cache: 0.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-28

Jiekou.AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
o3 o3 131.1K 131.1K Input: $10
Output: $40
Model: 5.000
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-code-fast-1 grok-code-fast-1 256K 256K Input: $0.18
Output: $1.35
Model: 0.090
Completion: 7.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.2-pro gpt-5.2-pro 400K 128K Input: $18.9
Output: $151.2
Model: 9.450
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-2.5-pro gemini-2.5-pro 1M 65.5K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
claude-haiku-4-5-20251001 claude-haiku-4-5-20251001 20K 64K Input: $0.9
Output: $4.5
Model: 0.450
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5-pro gpt-5-pro 400K 272K Input: $13.5
Output: $108
Model: 6.750
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-2.5-flash gemini-2.5-flash 1M 65.5K Input: $0.27
Output: $2.25
Model: 0.135
Completion: 8.333
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-2.5-flash-lite-preview-09-2025 gemini-2.5-flash-lite-preview-09-2025 1M 65.5K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
claude-opus-4-1-20250805 claude-opus-4-1-20250805 200K 32K Input: $13.5
Output: $67.5
Model: 6.750
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-4-1-fast-non-reasoning grok-4-1-fast-non-reasoning 2M 2M Input: $0.18
Output: $0.45
Model: 0.090
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5-chat-latest gpt-5-chat-latest 400K 128K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
claude-opus-4-5-20251101 claude-opus-4-5-20251101 200K 65.5K Input: $4.5
Output: $22.5
Model: 2.250
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.1-codex gpt-5.1-codex 400K 128K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.1-codex-max gpt-5.1-codex-max 400K 128K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
claude-opus-4-20250514 claude-opus-4-20250514 200K 32K Input: $13.5
Output: $67.5
Model: 6.750
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
o3-mini o3-mini 131.1K 131.1K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.2 gpt-5.2 400K 128K Input: $1.575
Output: $12.6
Model: 0.787
Completion: 8.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-4-0709 grok-4-0709 256K 8.2K Input: $2.7
Output: $13.5
Model: 1.350
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-2.5-flash-lite gemini-2.5-flash-lite 1M 65.5K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
claude-sonnet-4-20250514 claude-sonnet-4-20250514 200K 64K Input: $2.7
Output: $13.5
Model: 1.350
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.1-codex-mini gpt-5.1-codex-mini 400K 128K Input: $0.225
Output: $1.8
Model: 0.113
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-4-fast-reasoning grok-4-fast-reasoning 2M 2M Input: $0.18
Output: $0.45
Model: 0.090
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
claude-opus-4-6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image
Out: text
Released: 2026-02
claude-sonnet-4-5-20250929 claude-sonnet-4-5-20250929 200K 64K Input: $2.7
Output: $13.5
Model: 1.350
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-4-1-fast-reasoning grok-4-1-fast-reasoning 2M 2M Input: $0.18
Output: $0.45
Model: 0.090
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-3-pro-preview gemini-3-pro-preview 1M 65.5K Input: $1.8
Output: $10.8
Model: 0.900
Completion: 6.000
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
gpt-5-mini gpt-5-mini 400K 128K Input: $0.225
Output: $1.8
Model: 0.113
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5-nano gpt-5-nano 400K 128K Input: $0.045
Output: $0.36
Model: 0.022
Completion: 8.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
grok-4-fast-non-reasoning grok-4-fast-non-reasoning 2M 2M Input: $0.18
Output: $0.45
Model: 0.090
Completion: 2.500
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gemini-3-flash-preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
gemini-2.5-flash-preview-05-20 gemini-2.5-flash-preview-05-20 1M 200K Input: $0.135
Output: $3.15
Model: 0.068
Completion: 23.333
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
gemini-2.5-flash-lite-preview-06-17 gemini-2.5-flash-lite-preview-06-17 1M 65.5K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
📎 🔧 🌡️ - In: text, video, image, audio
Out: text
Released: 2026-01
gemini-2.5-pro-preview-06-05 gemini-2.5-pro-preview-06-05 1M 200K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2026-01
gpt-5-codex gpt-5-codex 400K 128K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.2-codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01
gpt-5.1 gpt-5.1 400K 128K Input: $1.125
Output: $9
Model: 0.563
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-02
Kimi K2 Instruct moonshotai/kimi-k2-instruct 131.1K 131.1K Input: $0.57
Output: $2.3
Model: 0.285
Completion: 4.035
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Kimi K2.5 moonshotai/kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2 0905 moonshotai/kimi-k2-0905 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
MiniMax M1 minimaxai/minimax-m1-80k 1M 40K Input: $0.55
Output: $2.2
Model: 0.275
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b 123K 16K Input: $0.42
Output: $1.25
Model: 0.210
Completion: 2.976
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01
ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b-paddle 123K 12K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
XiaomiMiMo/MiMo-V2-Flash xiaomimimo/mimo-v2-flash 262.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
GLM-4.7 zai-org/glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
GLM 4.5V zai-org/glm-4.5v 65.5K 16.4K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-01
GLM-4.5 zai-org/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
GLM-4.7-Flash zai-org/glm-4.7-flash 200K 128K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-instruct-2507 131.1K 16.4K Input: $0.15
Output: $0.8
Model: 0.075
Completion: 5.333
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct 65.5K 65.5K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking 65.5K 65.5K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 235B A22b Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 131.1K 131.1K Input: $0.3
Output: $3
Model: 0.150
Completion: 10.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 32B qwen/qwen3-32b-fp8 41K 20K Input: $0.1
Output: $0.45
Model: 0.050
Completion: 4.500
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 30B A3B qwen/qwen3-30b-a3b-fp8 41K 20K Input: $0.09
Output: $0.45
Model: 0.045
Completion: 5.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
qwen/qwen3-coder-next qwen/qwen3-coder-next 262.1K 65.5K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02
Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $0.29
Output: $1.2
Model: 0.145
Completion: 4.138
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Qwen3 235B A22B qwen/qwen3-235b-a22b-fp8 41K 20K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
DeepSeek R1 0528 deepseek/deepseek-r1-0528 163.8K 32.8K Input: $0.7
Output: $2.5
Model: 0.350
Completion: 3.571
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
DeepSeek V3 0324 deepseek/deepseek-v3-0324 163.8K 163.8K Input: $0.28
Output: $1.14
Model: 0.140
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
DeepSeek V3.1 deepseek/deepseek-v3.1 163.8K 32.8K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01
Minimax M2.1 minimax/minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01

Kilo Gateway

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
inclusionAI: Ling-2.6-1T inclusionai/ling-2.6-1t 262.1K 32.8K Input: $0.3
Output: $2.5
Cache Read: $0.06
Model: 0.150
Completion: 8.333
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-04-23
Updated: 2026-05-16
inclusionAI: Ring-2.6-1T inclusionai/ring-2.6-1t 262.1K 65.5K Input: $0.075
Output: $0.625
Cache Read: $0.015
Model: 0.037
Completion: 8.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-08
Updated: 2026-05-16
inclusionAI: Ling-2.6 Flash inclusionai/ling-2.6-flash 262.1K 32.8K Input: $0.08
Output: $0.24
Cache Read: $0.016
Model: 0.040
Completion: 3.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-04-21
Updated: 2026-05-01
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro 131K 32.8K Input: $0.017
Output: $0.11
Model: 0.009
Completion: 6.471
🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-20
Updated: 2026-03-15
IBM: Granite 4.1 8B ibm-granite/granite-4.1-8b 131.1K 131.1K Input: $0.05
Output: $0.1
Cache Read: $0.05
Model: 0.025
Completion: 2.000
Cache: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2026-04-30
Updated: 2026-05-01
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct 16.4K 16.4K Input: $0.02
Output: $0.05
Model: 0.010
Completion: 2.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Updated: 2025-12-23
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct 8.2K 8K Input: $0.51
Output: $0.74
Model: 0.255
Completion: 1.451
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct 131.1K 26.2K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-16
Updated: 2024-07-23
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct 60K 12K Input: $0.027
Output: $0.2
Model: 0.013
Completion: 7.407
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2026-01-27
Meta: Llama 4 Maverick meta-llama/llama-4-maverick 1M 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Updated: 2025-12-24
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct 131.1K 16.4K Input: $0.049
Output: $0.049
Model: 0.025
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-09-25
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct 131.1K 16.4K Input: $0.1
Output: $0.32
Model: 0.050
Completion: 3.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-01
Updated: 2026-02-04
Llama Guard 3 8B meta-llama/llama-guard-3-8b 131.1K 26.2K Input: $0.02
Output: $0.06
Model: 0.010
Completion: 3.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-18
Updated: 2026-02-04
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b 163.8K 32.8K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-04-05
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct 8.2K 16.4K Input: $0.03
Output: $0.04
Model: 0.015
Completion: 1.333
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-25
Updated: 2025-04-03
Meta: Llama 4 Scout meta-llama/llama-4-scout 327.7K 16.4K Input: $0.08
Output: $0.3
Model: 0.040
Completion: 3.750
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct 80K 16.4K Input: $0.051
Output: $0.34
Model: 0.025
Completion: 6.667
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2026-03-15
Anthropic: Claude Haiku Latest ~anthropic/claude-haiku-latest 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Anthropic: Claude Sonnet Latest ~anthropic/claude-sonnet-latest 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Anthropic: Claude Opus Latest ~anthropic/claude-opus-latest 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-16
Updated: 2026-05-01
Kilo Auto Balanced kilo-auto/balanced 204.8K 131.1K Input: $0.6
Output: $3
Model: 0.300
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-15
Kilo Auto Small kilo-auto/small 400K 128K Input: $0.05
Output: $0.4
Model: 0.025
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2026-03-15
Kilo Auto Free kilo-auto/free 204.8K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-15
Kilo Auto Frontier kilo-auto/frontier 1M 128K Input: $5
Output: $25
Model: 2.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2026-03-15
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 131K 26.2K Input: $0.55
Output: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-11
Updated: 2026-03-15
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking 131.1K 65.5K Input: $0.47
Output: $2
Cache Read: $0.2
Model: 0.235
Completion: 4.255
Cache: 0.426
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-06
Updated: 2026-03-15
MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5 262.1K 65.5K Input: $0.45
Output: $2.2
Model: 0.225
Completion: 4.889
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-03-15
MoonshotAI: Kimi K2.6 moonshotai/kimi-k2.6 262.1K 65.5K Input: $0.75
Output: $3.5
Cache Read: $0.375
Model: 0.375
Completion: 4.667
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-20
Updated: 2026-05-12
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 131.1K 26.2K Input: $0.4
Output: $2
Cache Read: $0.15
Model: 0.200
Completion: 5.000
Cache: 0.375
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-05
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b 123K 12K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-30
Updated: 2026-01
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b 30K 8K Input: $0.14
Output: $0.56
Model: 0.070
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-06-30
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b 123K 16K Input: $0.42
Output: $1.25
Model: 0.210
Completion: 2.976
📎 🧠 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-06-30
Updated: 2026-01
Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking 131.1K 65.5K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-19
Baidu: CoBuddy (free) baidu/cobuddy:free 131.1K 65.5K Input: $0
Output: $0
- 🧠 🔧 - In: text
Out: text
Released: 2026-05-06
Updated: 2026-05-07
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b 120K 8K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-30
Baidu: Qianfan-OCR-Fast baidu/qianfan-ocr-fast 65.5K 28.7K Input: $0.68
Output: $2.81
Model: 0.340
Completion: 4.132
📎 🧠 🌡️ - In: image, text
Out: text
Released: 2026-04-20
Updated: 2026-05-16
Perceptron: Perceptron Mk1 perceptron/perceptron-mk1 32.8K 8.2K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
📎 🧠 🌡️ - In: image, text, video
Out: text
Released: 2026-05-12
Updated: 2026-05-16
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity 4.1K 4.1K Input: $0.8
Output: $1.2
Model: 0.400
Completion: 1.500
🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-14
Updated: 2026-03-15
Google: Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $0.08333
Reasoning: $1.5
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2026-05-07
Updated: 2026-05-16
Google: Gemma 3n 4B google/gemma-3n-e4b-it 32.8K 6.6K Input: $0.02
Output: $0.04
Model: 0.010
Completion: 2.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-20
Google: Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-03-20
Updated: 2026-03-15
Google: Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Cache Write: $0.083333
Reasoning: $2.5
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-07-17
Updated: 2026-03-15
Google: Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Cache Write: $0.08333
Reasoning: $9
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2026-05-19
Updated: 2026-05-27
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 1M 8.2K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2024-12-11
Updated: 2026-03-15
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Cache Write: $0.083333
Reasoning: $0.4
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-09-25
Updated: 2026-03-15
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 1M 8.2K Input: $0.1
Output: $0.4
Cache Read: $0.025
Cache Write: $0.083333
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2024-12-11
Updated: 2026-03-15
Google: Gemma 4 31B google/gemma-4-31b-it 262.1K 131.1K Input: $0.14
Output: $0.4
Model: 0.070
Completion: 2.857
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-02
Updated: 2026-04-11
Google: Lyria 3 Clip Preview google/lyria-3-clip-preview 1M 65.5K Input: $0
Output: $0
- 📎 🌡️ - In: image, text
Out: audio, text
Released: 2026-03-30
Updated: 2026-04-11
Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Reasoning: $12
Model: 1.000
Completion: 6.000
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2026-02-26
Updated: 2026-03-15
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview 65.5K 32.8K Input: $2
Output: $12
Reasoning: $12
Model: 1.000
Completion: 6.000
📎 🧠 🌡️ - In: image, text
Out: image, text
Released: 2025-11-20
Updated: 2026-03-15
Google: Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image 32.8K 32.8K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🌡️ - In: image, text
Out: image, text
Released: 2025-10-08
Updated: 2026-03-15
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Cache Write: $0.083333
Reasoning: $0.4
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-06-17
Updated: 2026-03-15
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview 65.5K 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 🌡️ - In: image, text
Out: image, text
Released: 2026-02-26
Updated: 2026-03-15
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-05-06
Updated: 2026-03-15
Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Reasoning: $12
Model: 1.000
Completion: 6.000
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2026-02-19
Updated: 2026-03-15
Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it 262.1K 262.1K Input: $0.12
Output: $0.4
Model: 0.060
Completion: 3.333
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-03
Updated: 2026-04-11
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text
Out: text
Released: 2025-06-05
Updated: 2026-03-15
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.083333
Reasoning: $3
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2025-12-17
Updated: 2026-03-15
Google: Gemma 3 12B google/gemma-3-12b-it 131.1K 131.1K Input: $0.04
Output: $0.13
Cache Read: $0.015
Model: 0.020
Completion: 3.250
Cache: 0.375
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-03-13
Updated: 2026-03-15
Google: Gemma 3 4B google/gemma-3-4b-it 131.1K 19.2K Input: $0.04
Output: $0.08
Model: 0.020
Completion: 2.000
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-03-13
Updated: 2026-03-15
Google: Gemma 3 27B google/gemma-3-27b-it 128K 65.5K Input: $0.03
Output: $0.11
Cache Read: $0.02
Model: 0.015
Completion: 3.667
Cache: 0.667
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-03-12
Updated: 2026-03-15
Google: Lyria 3 Pro Preview google/lyria-3-pro-preview 1M 65.5K Input: $0
Output: $0
- 📎 🌡️ - In: image, text
Out: audio, text
Released: 2026-03-30
Updated: 2026-04-11
Google: Gemma 2 27B google/gemma-2-27b-it 8.2K 2K Input: $0.65
Output: $0.65
Model: 0.325
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-24
Google: Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Reasoning: $1.5
Model: 0.125
Completion: 6.000
📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: text
Released: 2026-03-03
Updated: 2026-03-15
LiquidAI: LFM2-24B-A2B liquid/lfm-2-24b-a2b 32.8K 32.8K Input: $0.03
Output: $0.12
Model: 0.015
Completion: 4.000
🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-26
Updated: 2026-03-15
xAI: Grok 4.20 x-ai/grok-4.20 2M 2M Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2026-03-31
Updated: 2026-04-11
xAI: Grok 4.3 x-ai/grok-4.3 1M 4.1K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-05-01
xAI: Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent 2M 2M Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🧠 🌡️ - In: image, pdf, text
Out: text
Released: 2026-03-31
Updated: 2026-04-11
xAI: Grok Build 0.1 x-ai/grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2026-05-20
Updated: 2026-05-27
Google: Gemini Pro Latest ~google/gemini-pro-latest 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $0.375
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, audio, video, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Google: Gemini Flash Latest ~google/gemini-flash-latest 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.083333
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, audio, video, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Microsoft: Phi 4 Mini Instruct microsoft/phi-4-mini-instruct 128K 128K Input: $0.08
Output: $0.35
Cache Read: $0.08
Model: 0.040
Completion: 4.375
Cache: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-17
Updated: 2026-05-07
Microsoft: Phi 4 microsoft/phi-4 16.4K 16.4K Input: $0.06
Output: $0.14
Model: 0.030
Completion: 2.333
🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-11
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b 65.5K 8K Input: $0.62
Output: $0.62
Model: 0.310
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-24
Poolside: Laguna XS.2 (free) poolside/laguna-xs.2:free 131.1K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28
Updated: 2026-05-01
Poolside: Laguna M.1 (free) poolside/laguna-m.1:free 131.1K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28
Updated: 2026-05-01
Writer: Palmyra X5 writer/palmyra-x5 1M 8.2K Input: $0.6
Output: $6
Model: 0.300
Completion: 10.000
🌡️ - In: text
Out: text
Released: 2025-04-28
Z.ai: GLM 4.7 z-ai/glm-4.7 202.8K 65.5K Input: $0.38
Output: $1.98
Cache Read: $0.2
Model: 0.190
Completion: 5.211
Cache: 0.526
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-22
Updated: 2026-03-15
Z.ai: GLM 4.5V z-ai/glm-4.5v 65.5K 16.4K Input: $0.6
Output: $1.8
Cache Read: $0.11
Model: 0.300
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-11
Z.ai: GLM 4.5 z-ai/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.175
Model: 0.300
Completion: 3.667
Cache: 0.292
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
Updated: 2026-03-15
Z.ai: GLM 5.1 z-ai/glm-5.1 202.8K 131.1K Input: $1.26
Output: $3.96
Model: 0.630
Completion: 3.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
Z.ai: GLM 4.6 z-ai/glm-4.6 204.8K 204.8K Input: $0.39
Output: $1.9
Cache Read: $0.175
Model: 0.195
Completion: 4.872
Cache: 0.449
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-30
Updated: 2026-03-15
Z.ai: GLM 4 32B z-ai/glm-4-32b 128K 32.8K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-25
Updated: 2026-03-15
Z.ai: GLM 4.6V z-ai/glm-4.6v 131.1K 131.1K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2025-09-30
Updated: 2026-01-10
Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-01
Updated: 2026-04-11
Z.ai: GLM 4.5 Air z-ai/glm-4.5-air 131.1K 98.3K Input: $0.13
Output: $0.85
Cache Read: $0.025
Model: 0.065
Completion: 6.538
Cache: 0.192
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash 202.8K 40.6K Input: $0.06
Output: $0.4
Cache Read: $0.01
Model: 0.030
Completion: 6.667
Cache: 0.167
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-19
Z.ai: GLM 5 z-ai/glm-5 202.8K 131.1K Input: $0.72
Output: $2.3
Model: 0.360
Completion: 3.194
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-03-15
Z.ai: GLM 5 Turbo z-ai/glm-5-turbo 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-15
Updated: 2026-04-11
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-07-18
Updated: 2026-03-15
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b 131.1K 65.5K Input: $0.075
Output: $0.3
Cache Read: $0.037
Model: 0.037
Completion: 4.000
Cache: 0.493
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-10-29
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ - In: text
Out: text
Released: 2023-03-01
Updated: 2023-09-21
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 - In: image, pdf, text
Out: text
Released: 2025-12-11
Updated: 2026-03-15
OpenAI: o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-04-16
Updated: 2026-03-15
OpenAI: o4 Mini High openai/o4-mini-high 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-04-17
Updated: 2026-03-15
OpenAI: GPT Audio openai/gpt-audio 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ - In: audio, text
Out: audio, text
Released: 2026-01-20
Updated: 2026-03-15
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-12-11
Updated: 2026-03-15
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
- - In: text
Out: text
Released: 2025-01
OpenAI: GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-08-07
Updated: 2026-03-15
OpenAI: GPT-5 Chat openai/gpt-5-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 - In: image, pdf, text
Out: text
Released: 2025-08-07
Updated: 2026-03-15
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
OpenAI: GPT-5 Pro openai/gpt-5-pro 400K 128K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-10-06
Updated: 2026-03-15
OpenAI: GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-05-13
Updated: 2026-03-15
OpenAI: GPT-4 openai/gpt-4 8.2K 4.1K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Released: 2023-03-14
Updated: 2024-04-09
OpenAI: o4 Mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-04-16
Updated: 2026-03-15
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k 16.4K 4.1K Input: $3
Output: $4
Model: 1.500
Completion: 1.333
🔧 🌡️ - In: text
Out: text
Released: 2023-08-28
Updated: 2026-03-15
OpenAI: o3 Pro openai/o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-04-16
Updated: 2026-03-15
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🔧 - In: image, pdf, text
Out: text
Released: 2025-11-13
Updated: 2026-03-15
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 128K 4.1K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-05-13
Updated: 2026-03-15
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 8.2K 4.1K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Released: 2023-05-28
Updated: 2026-03-15
OpenAI: GPT-5.4 Nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-03-17
Updated: 2026-04-11
OpenAI: GPT-5.3 Chat openai/gpt-5.3-chat 128K 16.4K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🔧 - In: image, pdf, text
Out: text
Released: 2026-03-04
Updated: 2026-03-15
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 4.1K 4.1K Input: $1
Output: $2
Model: 0.500
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Released: 2023-06-13
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini 400K 128K Input: $2.5
Output: $2
Model: 1.250
Completion: 0.800
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: image, text
Released: 2025-10-16
Updated: 2026-03-15
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-13
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-13
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-08-06
Updated: 2026-03-15
OpenAI: o3 Mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
📎 🔧 - In: pdf, text
Out: text
Released: 2024-12-20
Updated: 2026-03-15
OpenAI: GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-12-11
Updated: 2026-03-15
OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 - In: image, text
Out: text
Released: 2026-02-25
Updated: 2026-03-15
OpenAI: GPT Audio Mini openai/gpt-audio-mini 128K 16.4K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
🌡️ - In: audio, text
Out: audio, text
Released: 2026-01-20
Updated: 2026-03-15
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini 400K 100K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, text
Out: text
Released: 2025-11-13
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-06-26
Updated: 2026-03-15
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-04-14
Updated: 2026-03-15
OpenAI: gpt-oss-120b openai/gpt-oss-120b 131.1K 26.2K Input: $0.039
Output: $0.19
Model: 0.019
Completion: 4.872
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-11-20
Updated: 2026-03-15
OpenAI: o1 openai/o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🔧 - In: image, pdf, text
Out: text
Released: 2024-12-05
Updated: 2026-03-15
OpenAI: o1-pro openai/o1-pro 200K 100K Input: $150
Output: $600
Model: 75.000
Completion: 4.000
📎 🧠 - In: image, pdf, text
Out: text
Released: 2025-03-19
Updated: 2026-03-15
OpenAI: GPT Chat Latest openai/gpt-chat-latest 400K 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-05-05
Updated: 2026-05-07
OpenAI: GPT-5 Image openai/gpt-5-image 400K 128K Input: $10
Output: $10
Model: 5.000
Completion: 1.000
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: image, text
Released: 2025-10-14
Updated: 2026-03-15
OpenAI: GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Model: 1.250
Completion: 6.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-03-06
Updated: 2026-03-15
OpenAI: GPT-5.4 Mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-03-17
Updated: 2026-04-11
OpenAI: GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-04-14
Updated: 2026-03-15
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ - In: audio, text
Out: audio, text
Released: 2025-08-15
Updated: 2026-03-15
OpenAI: o3 Deep Research openai/o3-deep-research 200K 100K Input: $10
Output: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-06-26
Updated: 2026-03-15
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2024-01-25
Updated: 2026-03-15
OpenAI: GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-08-07
Updated: 2026-03-15
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-04-14
Updated: 2026-03-15
OpenAI: GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2023-09-13
Updated: 2024-04-09
OpenAI: GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-08-07
Updated: 2026-03-15
OpenAI: GPT-5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-03-06
Updated: 2026-03-15
OpenAI: o3 Mini High openai/o3-mini-high 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
📎 🔧 - In: pdf, text
Out: text
Released: 2025-01-31
Updated: 2026-03-15
OpenAI: GPT-5.4 Image 2 openai/gpt-5.4-image-2 272K 128K Input: $8
Output: $15
Cache Read: $2
Model: 4.000
Completion: 1.875
Cache: 0.250
📎 🧠 - In: image, text, pdf
Out: image, text
Released: 2026-04-21
Updated: 2026-05-01
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
- - In: text
Out: text
Released: 2025-03-13
Updated: 2026-03-15
OpenAI: GPT-5.5 Pro openai/gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-24
Updated: 2026-05-01
OpenAI: GPT-4o-mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2024-07-18
Updated: 2026-03-15
OpenAI: gpt-oss-20b openai/gpt-oss-20b 131.1K 26.2K Input: $0.03
Output: $0.14
Model: 0.015
Completion: 4.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2023-11-06
Updated: 2026-03-15
OpenAI: GPT-5 Codex openai/gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-09-15
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-01-14
OpenAI: GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2025-11-13
Updated: 2026-03-15
OpenAI: GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-24
Updated: 2026-05-01
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 131.1K 131.1K Input: $0.3
Output: $0.5
Model: 0.150
Completion: 1.667
🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-27
Updated: 2026-03-15
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 32.8K 32.8K Input: $0.55
Output: $0.8
Model: 0.275
Completion: 1.455
🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-11
Updated: 2026-03-15
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b 32.8K 32.8K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-09
Updated: 2026-03-15
TheDrummer: Rocinante 12B thedrummer/rocinante-12b 32.8K 32.8K Input: $0.17
Output: $0.43
Model: 0.085
Completion: 2.529
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-30
Updated: 2026-03-15
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b 128K 2K Input: $0.1
Output: $0.2
Model: 0.050
Completion: 2.000
📎 🌡️ - In: image, text
Out: text
Released: 2025-07-23
Updated: 2026-03-15
Reka Flash 3 rekaai/reka-flash-3 65.5K 65.5K Input: $0.1
Output: $0.2
Model: 0.050
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-12
Updated: 2026-04-11
Reka Edge rekaai/reka-edge 16.4K 16.4K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
📎 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-03-20
Updated: 2026-04-11
Mistral Large 2407 mistralai/mistral-large-2407 131.1K 32.8K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-19
Updated: 2026-03-15
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct 131.1K 131.1K Input: $0.06
Output: $0.18
Cache Read: $0.03
Model: 0.030
Completion: 3.000
Cache: 0.500
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-06-20
Mistral: Mistral Nemo mistralai/mistral-nemo 131.1K 16.4K Input: $0.02
Output: $0.04
Model: 0.010
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-01
Updated: 2024-07-30
Mistral: Mistral Medium 3.5 mistralai/mistral-medium-3-5 262.1K 262.1K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2026-04-30
Updated: 2026-05-07
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512 262.1K 32.8K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-12-02
Updated: 2026-03-15
Mistral: Devstral Small 1.1 mistralai/devstral-small 131.1K 26.2K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-07
Updated: 2025-07-10
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct 128K 131.1K Input: $0.35
Output: $0.56
Cache Read: $0.015
Model: 0.175
Completion: 1.600
Cache: 0.043
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2026-03-15
Mistral: Saba mistralai/mistral-saba 32.8K 32.8K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-02-17
Updated: 2026-03-15
Mistral Large mistralai/mistral-large 128K 25.6K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-24
Updated: 2025-12-02
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 131.1K 26.2K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-12
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 131.1K 32.8K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2024-11-19
Updated: 2026-03-15
Mistral: Devstral Medium mistralai/devstral-medium 131.1K 26.2K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-10
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 32.8K 16.4K Input: $0.05
Output: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-29
Updated: 2026-01-10
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512 131.1K 32.8K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-12-02
Updated: 2026-03-15
Mistral: Mistral Small 4 mistralai/mistral-small-2603 262.1K 262.1K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2026-03-16
Updated: 2026-04-11
Mistral Large 2411 mistralai/mistral-large-2411 131.1K 26.2K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-24
Updated: 2024-11-04
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 2.8K 565 Input: $0.11
Output: $0.19
Model: 0.055
Completion: 1.727
🌡️ - In: text
Out: text
Released: 2025-04-03
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512 262.1K 52.4K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-16
Mistral: Devstral 2 2512 mistralai/devstral-2512 262.1K 65.5K Input: $0.4
Output: $2
Cache Read: $0.025
Model: 0.200
Completion: 5.000
Cache: 0.063
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-12
Updated: 2026-03-15
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct 65.5K 13.1K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-17
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 131.1K 26.2K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-07
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 32K 6.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ - In: text, audio
Out: text
Open Weights
Released: 2025-07-01
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512 262.1K 52.4K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-16
Mistral: Codestral 2508 mistralai/codestral-2508 256K 51.2K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-01
Morph: Morph V3 Fast morph/morph-v3-fast 81.9K 38K Input: $0.8
Output: $1.2
Model: 0.400
Completion: 1.500
🌡️ - In: text
Out: text
Released: 2024-08-15
Morph: Morph V3 Large morph/morph-v3-large 262.1K 131.1K Input: $0.9
Output: $1.9
Model: 0.450
Completion: 2.111
🌡️ - In: text
Out: text
Released: 2024-08-15
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash 262.1K 32.8K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2025-12-23
Updated: 2026-03-15
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6 262.1K 32.8K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2025-09
ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini 262.1K 131.1K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-02-27
Updated: 2026-03-15
ByteDance Seed: Seed-2.0-Lite bytedance-seed/seed-2.0-lite 262.1K 131.1K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-03-10
Updated: 2026-03-15
Magnum v4 72B anthracite-org/magnum-v4-72b 16.4K 2K Input: $3
Output: $5
Model: 1.500
Completion: 1.667
🌡️ - In: text
Out: text
Open Weights
Released: 2024-10-22
Updated: 2026-03-15
NVIDIA: Nemotron 3 Nano Omni (free) nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free 256K 65.5K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, audio, image, video
Out: text
Released: 2026-04-28
Updated: 2026-05-01
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b 262.1K 52.4K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12
Updated: 2026-02-04
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 131.1K 26.2K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-03-16
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 131.1K 26.2K Input: $0.04
Output: $0.16
Model: 0.020
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-18
NVIDIA: Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free 262.1K 262.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-12
Updated: 2026-03-15
NVIDIA: Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b 262.1K 262.1K Input: $0.1
Output: $0.5
Cache Read: $0.1
Model: 0.050
Completion: 5.000
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-04-11
Xiaomi: MiMo-V2.5 xiaomi/mimo-v2.5 1M 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
Xiaomi: MiMo-V2-Omni xiaomi/mimo-v2-omni 262.1K 65.5K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
Xiaomi: MiMo-V2-Flash xiaomi/mimo-v2-flash 262.1K 65.5K Input: $0.09
Output: $0.29
Cache Read: $0.045
Model: 0.045
Completion: 3.222
Cache: 0.500
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
Xiaomi: MiMo-V2-Pro xiaomi/mimo-v2-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
Xiaomi: MiMo V2.5 Pro xiaomi/mimo-v2.5-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Inception: Mercury 2 inception/mercury-2 128K 50K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-24
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-10-22
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-09-29
Updated: 2026-03-15
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-05-22
Updated: 2026-03-15
Anthropic: Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast 1M 128K Input: $30
Output: $150
Cache Read: $3
Cache Write: $37.5
Model: 15.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: image, text
Out: text
Released: 2026-04-07
Updated: 2026-04-11
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2025-10-15
Anthropic: Claude Opus 4.7 (Fast) anthropic/claude-opus-4.7-fast 1M 128K Input: $30
Output: $150
Cache Read: $3
Cache Write: $37.5
Model: 15.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-05-12
Updated: 2026-05-16
Anthropic: Claude Opus 4.7 anthropic/claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-16
Updated: 2026-05-01
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-08-05
Updated: 2026-03-15
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-11-24
Updated: 2026-03-15
Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 128K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: image, text
Out: text
Released: 2026-02-17
Updated: 2026-03-15
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-03-07
Anthropic: Claude Opus 4 anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2025-05-22
Updated: 2026-03-15
Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image
Out: text
Released: 2026-02-05
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct 131.1K 131.1K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🌡️ - In: text
Out: text
Released: 2025-06-30
Updated: 2025-11-25
Tencent: Hy3 Preview tencent/hy3-preview 262.1K 262.1K Input: $0.066
Output: $0.26
Cache Read: $0.029
Model: 0.033
Completion: 3.939
Cache: 0.439
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-22
Updated: 2026-05-16
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b 128K 32.8K Input: $1.25
Output: $1.25
Model: 0.625
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-14
Updated: 2026-03-15
Cohere: Command A cohere/command-a 256K 8.2K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-13
Cohere: Command R (08-2024) cohere/command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-30
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 128K 4K Input: $0.0375
Output: $0.15
Model: 0.019
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-02
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-30
MythoMax 13B gryphe/mythomax-l2-13b 4.1K 4.1K Input: $0.06
Output: $0.06
Model: 0.030
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-25
StepFun: Step 3.5 Flash stepfun/step-3.5-flash 256K 256K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-29
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 131.1K 131.1K Input: $0.2
Output: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-26
Updated: 2026-02-04
Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 131.1K 163.8K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Updated: 2025-11-25
ReMM SLERP 13B undi95/remm-slerp-l2-13b 6.1K 4.1K Input: $0.45
Output: $0.65
Model: 0.225
Completion: 1.444
🌡️ - In: text
Out: text
Open Weights
Released: 2023-07-22
Updated: 2026-03-15
OpenAI: GPT Mini Latest ~openai/gpt-mini-latest 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
OpenAI: GPT Latest ~openai/gpt-latest 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-27
Updated: 2026-05-01
MoonshotAI: Kimi Latest ~moonshotai/kimi-latest 262.1K 262.1K Input: $0.74
Output: $3.49
Cache Read: $0.14
Model: 0.370
Completion: 4.716
Cache: 0.189
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Relace: Relace Search relace/relace-search 256K 128K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2025-12-09
Updated: 2026-03-15
Relace: Relace Apply 3 relace/relace-apply-3 256K 128K Input: $0.85
Output: $1.25
Model: 0.425
Completion: 1.471
- - In: text
Out: text
Released: 2025-09-26
Updated: 2026-03-15
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 256K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-09
Updated: 2026-03-15
Arcee AI: Coder Large arcee-ai/coder-large 32.8K 32.8K Input: $0.5
Output: $0.8
Model: 0.250
Completion: 1.600
🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-06
Updated: 2026-03-15
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large 131.1K 64K Input: $0.75
Output: $1.2
Model: 0.375
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-06
Updated: 2026-03-15
Arcee AI: Spotlight arcee-ai/spotlight 131.1K 65.5K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-05-06
Updated: 2026-03-15
Arcee AI: Trinity Large Thinking arcee-ai/trinity-large-thinking 262.1K 262.1K Input: $0.22
Output: $0.85
Model: 0.110
Completion: 3.864
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-01
Updated: 2026-04-11
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning 131.1K 32K Input: $0.9
Output: $3.3
Model: 0.450
Completion: 3.667
🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-06
Updated: 2026-03-15
Arcee AI: Trinity Mini arcee-ai/trinity-mini 131.1K 131.1K Input: $0.045
Output: $0.15
Model: 0.022
Completion: 3.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12
Updated: 2026-01-28
Mancer: Weaver (alpha) mancer/weaver 8K 2K Input: $0.75
Output: $1
Model: 0.375
Completion: 1.333
🌡️ - In: text
Out: text
Released: 2023-08-02
Updated: 2026-03-15
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro 128K 25.6K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🧠 🌡️ - In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Perplexity: Sonar perplexity/sonar 127.1K 25.4K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Perplexity: Sonar Pro perplexity/sonar-pro 200K 8K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🌡️ - In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Perplexity: Sonar Pro Search perplexity/sonar-pro-search 200K 8K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🌡️ - In: image, text
Out: text
Released: 2025-10-31
Updated: 2026-03-15
Perplexity: Sonar Deep Research perplexity/sonar-deep-research 128K 25.6K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Released: 2025-01-27
Switchpoint Router switchpoint/router 131.1K 32.8K Input: $0.85
Output: $3.4
Model: 0.425
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Released: 2025-07-12
Updated: 2026-03-15
Body Builder (beta) openrouter/bodybuilder 128K 32.8K Input: $0
Output: $0
- - - In: text
Out: text
Released: 2026-03-15
Free Models Router openrouter/free 200K 32.8K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2026-02-01
Updated: 2026-03-15
Owl Alpha openrouter/owl-alpha 1M 262.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28
Updated: 2026-04-30
Pareto Code Router openrouter/pareto-code 200K 65.5K Input: $0
Output: $0
- - - In: text
Out: text
Released: 2026-04-21
Updated: 2026-05-01
Auto Router openrouter/auto 2M 32.8K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: audio, image, pdf, text, video
Out: image, text
Released: 2026-03-15
Qwen: Qwen3.5 Plus 2026-04-20 qwen/qwen3.5-plus-20260420 1M 65.5K Input: $0.4
Output: $2.4
Model: 0.200
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking 131.1K 32.8K Input: $0.26
Output: $2.6
Model: 0.130
Completion: 10.000
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-09-24
Updated: 2026-03-15
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking 131.1K 32.8K Input: $0.13
Output: $1.56
Model: 0.065
Completion: 12.000
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-10-11
Updated: 2026-03-15
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus 1M 65.5K Input: $0.65
Output: $3.25
Cache Read: $0.2
Model: 0.325
Completion: 5.000
Cache: 0.308
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-01
Updated: 2026-03-15
Qwen: Qwen-Plus qwen/qwen-plus 1M 32.8K Input: $0.4
Output: $1.2
Cache Read: $0.08
Model: 0.200
Completion: 3.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct 160K 32.8K Input: $0.07
Output: $0.27
Model: 0.035
Completion: 3.857
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-31
Qwen: Qwen3 32B qwen/qwen3-32b 41K 41K Input: $0.08
Output: $0.24
Cache Read: $0.04
Model: 0.040
Completion: 3.000
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-02-04
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct 131.1K 52.4K Input: $0.09
Output: $1.1
Model: 0.045
Completion: 12.222
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-11
Updated: 2026-03-15
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct 131.1K 32.8K Input: $0.08
Output: $0.5
Model: 0.040
Completion: 6.250
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-10-15
Updated: 2025-11-25
Qwen: Qwen3.6 35B A3B qwen/qwen3.6-35b-a3b 262.1K 65.5K Input: $0.1612
Output: $0.96525
Cache Read: $0.1612
Model: 0.081
Completion: 5.988
Cache: 1.000
📎 🧠 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Qwen: Qwen3.7 Max qwen/qwen3.7-max 1M 65.5K Input: $1.625
Output: $4.875
Cache Read: $0.1625
Cache Write: $2.03125
Model: 0.813
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-26
Updated: 2026-05-27
Qwen: Qwen3 Max qwen/qwen3-max 262.1K 32.8K Input: $1.2
Output: $6
Cache Read: $0.24
Model: 0.600
Completion: 5.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2025-09-05
Updated: 2026-03-15
Qwen: Qwen3 8B qwen/qwen3-8b 41K 8.2K Input: $0.05
Output: $0.4
Cache Read: $0.05
Model: 0.025
Completion: 8.000
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04
Updated: 2026-03-15
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 1M 32.8K Input: $0.26
Output: $0.78
Model: 0.130
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-09
Updated: 2026-03-15
Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23 1M 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-02-26
Updated: 2026-03-15
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 262.1K 262.1K Input: $0.09
Output: $0.3
Cache Read: $0.04
Model: 0.045
Completion: 3.333
Cache: 0.444
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-29
Updated: 2026-03-15
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct 32.8K 8.2K Input: $0.2
Output: $0.2
Cache Read: $0.015
Model: 0.100
Completion: 1.000
Cache: 0.075
🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-11
Updated: 2026-03-15
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.0975
Output: $0.78
Model: 0.049
Completion: 8.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-11
Updated: 2026-03-15
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 262.1K 262.1K Input: $0.11
Output: $0.6
Model: 0.055
Completion: 5.455
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-25
Updated: 2026-03-15
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct 131.1K 32.8K Input: $0.104
Output: $0.416
Model: 0.052
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-21
Updated: 2025-11-25
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder 262.1K 52.4K Input: $0.22
Output: $1
Cache Read: $0.022
Model: 0.110
Completion: 4.545
Cache: 0.100
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen: Qwen3.6 Flash qwen/qwen3.6-flash 1M 65.5K Input: $0.25
Output: $1.5
Cache Write: $0.3125
Model: 0.125
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 1M 65.5K Input: $0.26
Output: $1.56
Model: 0.130
Completion: 6.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2026-02-15
Updated: 2026-03-15
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct 32.8K 6.6K Input: $0.04
Output: $0.1
Model: 0.020
Completion: 2.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09
Updated: 2025-04-16
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking 131.1K 32.8K Input: $0.117
Output: $1.365
Model: 0.059
Completion: 11.667
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2025-10-15
Updated: 2025-11-25
Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking 262.1K 32.8K Input: $0.78
Output: $3.9
Model: 0.390
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-01-23
Updated: 2026-03-15
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 32.8K 6.6K Input: $0.051
Output: $0.34
Model: 0.025
Completion: 6.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct 32.8K 32.8K Input: $0.8
Output: $0.8
Cache Read: $0.075
Model: 0.400
Completion: 1.000
Cache: 0.094
📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-02-01
Updated: 2026-03-15
Qwen: Qwen3.5-27B qwen/qwen3.5-27b 262.1K 65.5K Input: $0.195
Output: $1.56
Model: 0.098
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-02-26
Updated: 2026-03-15
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b 131.1K 8.2K Input: $0.455
Output: $1.82
Cache Read: $0.15
Model: 0.228
Completion: 4.000
Cache: 0.330
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-03-15
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct 32.8K 16.4K Input: $0.12
Output: $0.39
Model: 0.060
Completion: 3.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09
Updated: 2026-01-10
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking 1M 32.8K Input: $0.26
Output: $0.78
Model: 0.130
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-09
Updated: 2026-03-15
Qwen: Qwen3 Coder Next qwen/qwen3-coder-next 262.1K 65.5K Input: $0.12
Output: $0.75
Cache Read: $0.035
Model: 0.060
Completion: 6.250
Cache: 0.292
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-02
Updated: 2026-03-15
Qwen: Qwen3.6 27B qwen/qwen3.6-27b 256K 65.5K Input: $0.325
Output: $3.25
Model: 0.163
Completion: 10.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b 262.1K 65.5K Input: $0.1625
Output: $1.3
Model: 0.081
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-02-26
Updated: 2026-03-15
Qwen: Qwen3.5-9B qwen/qwen3.5-9b 256K 32.8K Input: $0.05
Output: $0.15
Model: 0.025
Completion: 3.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-03-10
Updated: 2026-03-15
Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b 262.1K 65.5K Input: $0.39
Output: $2.34
Model: 0.195
Completion: 6.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2026-02-15
Updated: 2026-03-15
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct 131.1K 32.8K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-05
Updated: 2025-11-25
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 262.1K 52.4K Input: $0.071
Output: $0.1
Model: 0.035
Completion: 1.408
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04
Updated: 2026-01
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash 1M 65.5K Input: $0.195
Output: $0.975
Cache Read: $0.06
Model: 0.098
Completion: 5.000
Cache: 0.308
🔧 🌡️ - In: text
Out: text
Released: 2025-07-23
Updated: 2026-03-15
Qwen: Qwen3 14B qwen/qwen3-14b 41K 41K Input: $0.06
Output: $0.24
Cache Read: $0.025
Model: 0.030
Completion: 4.000
Cache: 0.417
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04
Updated: 2026-03-15
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct 262.1K 52.4K Input: $0.2
Output: $0.88
Cache Read: $0.11
Model: 0.100
Completion: 4.400
Cache: 0.550
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-23
Updated: 2026-01-10
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b 41K 41K Input: $0.08
Output: $0.28
Cache Read: $0.03
Model: 0.040
Completion: 3.500
Cache: 0.375
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04
Updated: 2026-03-15
Qwen: Qwen3.6 Max Preview qwen/qwen3.6-max-preview 262.1K 65.5K Input: $1.04
Output: $6.24
Cache Write: $1.3
Model: 0.520
Completion: 6.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-27
Updated: 2026-05-01
Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b 262.1K 65.5K Input: $0.26
Output: $2.08
Model: 0.130
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-02-26
Updated: 2026-03-15
Qwen: Qwen3.6 Plus qwen/qwen3.6-plus 1M 65.5K Input: $0.325
Output: $1.95
Cache Read: $0.0325
Cache Write: $0.40625
Model: 0.163
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Released: 2025-08-26
Updated: 2026-04-11
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 300K 5.1K Input: $0.06
Output: $0.24
Model: 0.030
Completion: 4.000
📎 🔧 🌡️ - In: image, text
Out: text
Released: 2024-12-06
Updated: 2026-03-15
Amazon: Nova Premier 1.0 amazon/nova-premier-v1 1M 32K Input: $2.5
Output: $12.5
Model: 1.250
Completion: 5.000
📎 🔧 🌡️ - In: image, text
Out: text
Released: 2025-11-01
Updated: 2026-03-15
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 300K 5.1K Input: $0.8
Output: $3.2
Model: 0.400
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-03
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 128K 5.1K Input: $0.035
Output: $0.14
Model: 0.018
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-06
Updated: 2026-03-15
Amazon: Nova 2 Lite amazon/nova-2-lite-v1 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 🔧 🌡️ - In: image, pdf, text, video
Out: text
Released: 2024-12-01
Updated: 2026-03-15
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b 32.8K 32.8K Input: $0.8
Output: $1.6
Model: 0.400
Completion: 2.000
🌡️ - In: text
Out: text
Released: 2025-02-05
Updated: 2026-03-15
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini 131.1K 32.8K Input: $0.7
Output: $1.4
Model: 0.350
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Released: 2025-02-05
Updated: 2026-03-15
AionLabs: Aion-2.0 aion-labs/aion-2.0 131.1K 32.8K Input: $0.8
Output: $1.6
Model: 0.400
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Released: 2026-02-24
Updated: 2026-03-15
AionLabs: Aion-1.0 aion-labs/aion-1.0 131.1K 32.8K Input: $4
Output: $8
Model: 2.000
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Released: 2025-02-05
Updated: 2026-03-15
Inflection: Inflection 3 Pi inflection/inflection-3-pi 8K 1K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ - In: text
Out: text
Released: 2024-10-11
Updated: 2026-03-15
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity 8K 1K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ - In: text
Out: text
Released: 2024-10-11
Updated: 2026-03-15
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b 131.1K 16.4K Input: $0.85
Output: $0.85
Model: 0.425
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-28
Updated: 2026-03-15
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b 131.1K 16.4K Input: $0.65
Output: $0.75
Model: 0.325
Completion: 1.154
🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-18
Updated: 2026-03-15
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b 8.2K 8.2K Input: $0.04
Output: $0.05
Model: 0.020
Completion: 1.250
🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-13
Updated: 2026-03-15
Sao10K: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b 8.2K 8.2K Input: $1.48
Output: $1.48
Model: 0.740
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-18
Updated: 2026-03-15
Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 16K 16K Input: $3
Output: $3
Model: 1.500
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-08
Updated: 2026-03-15
Upstage: Solar Pro 3 upstage/solar-pro-3 128K 32.8K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-01-27
Updated: 2026-03-15
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think 65.5K 65.5K Input: $0.15
Output: $0.5
Model: 0.075
Completion: 3.333
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-22
Updated: 2026-03-15
EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct 32.8K 6.6K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-05
Updated: 2026-03-15
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 163.8K 65.5K Input: $0.45
Output: $2.15
Cache Read: $0.2
Model: 0.225
Completion: 4.778
Cache: 0.444
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-28
Updated: 2026-03-15
DeepSeek: DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-24
Updated: 2026-05-01
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 163.8K 32.8K Input: $0.21
Output: $0.79
Cache Read: $0.13
Model: 0.105
Completion: 3.762
Cache: 0.619
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-22
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 131.1K 16.4K Input: $0.7
Output: $0.8
Cache Read: $0.015
Model: 0.350
Completion: 1.143
Cache: 0.021
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-23
Updated: 2026-03-15
DeepSeek: DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-24
Updated: 2026-05-01
DeepSeek: R1 deepseek/deepseek-r1 64K 16K Input: $0.7
Output: $2.5
Model: 0.350
Completion: 3.571
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek: DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale 163.8K 163.8K Input: $0.4
Output: $1.2
Cache Read: $0.135
Model: 0.200
Completion: 3.000
Cache: 0.338
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-03-15
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp 163.8K 65.5K Input: $0.27
Output: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-01
Updated: 2025-09-29
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 163.8K 65.5K Input: $0.2
Output: $0.77
Cache Read: $0.095
Model: 0.100
Completion: 3.850
Cache: 0.475
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-24
Updated: 2026-03-15
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b 32.8K 32.8K Input: $0.29
Output: $0.29
Model: 0.145
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-01
Updated: 2025-11-25
DeepSeek: DeepSeek V3 deepseek/deepseek-chat 163.8K 163.8K Input: $0.32
Output: $0.89
Cache Read: $0.15
Model: 0.160
Completion: 2.781
Cache: 0.469
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-03-15
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2 163.8K 65.5K Input: $0.26
Output: $0.38
Cache Read: $0.125
Model: 0.130
Completion: 1.462
Cache: 0.481
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-03-15
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 32.8K 7.2K Input: $0.15
Output: $0.75
Model: 0.075
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-21
MiniMax: MiniMax M2.5 minimax/minimax-m2.5 196.6K 196.6K Input: $0.25
Output: $1.2
Cache Read: $0.029
Model: 0.125
Completion: 4.800
Cache: 0.116
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-03-15
MiniMax: MiniMax M2.1 minimax/minimax-m2.1 196.6K 39.3K Input: $0.27
Output: $0.95
Cache Read: $0.03
Model: 0.135
Completion: 3.519
Cache: 0.111
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax: MiniMax-01 minimax/minimax-01 1M 1M Input: $0.2
Output: $1.1
Model: 0.100
Completion: 5.500
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-01-15
MiniMax: MiniMax M2 minimax/minimax-m2 196.6K 196.6K Input: $0.255
Output: $1
Cache Read: $0.03
Model: 0.128
Completion: 3.922
Cache: 0.118
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-23
Updated: 2026-03-15
MiniMax: MiniMax M2-her minimax/minimax-m2-her 65.5K 2K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-23
Updated: 2026-03-15
MiniMax: MiniMax M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax: MiniMax M1 minimax/minimax-m1 1M 40K Input: $0.4
Output: $2.2
Model: 0.200
Completion: 5.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-17
Stealth: Claude Opus 4.7 (20% off) stealth/claude-opus-4.7 1M 128K Input: $4
Output: $20
Cache Read: $0.4
Cache Write: $5
Model: 2.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: image, pdf, text
Out: text
Released: 2026-04-16
Updated: 2026-05-27
Stealth: Claude Sonnet 4.6 (20% off) stealth/claude-sonnet-4.6 1M 64K Input: $2.4
Output: $12
Cache Read: $0.24
Cache Write: $3
Model: 1.200
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2026-02-17
Updated: 2026-05-27
Stealth: Claude Opus 4.6 (20% off) stealth/claude-opus-4.6 1M 128K Input: $4
Output: $20
Cache Read: $0.4
Cache Write: $5
Model: 2.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: image, pdf, text
Out: text
Released: 2026-02-05
Updated: 2026-05-27
Kwaipilot: KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 256K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
Updated: 2026-04-11
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b 8.2K 8.2K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-05-27
Updated: 2024-06-27
Nous: Hermes 4 405B nousresearch/hermes-4-405b 131.1K 26.2K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-25
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b 131.1K 32.8K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-18
Updated: 2026-03-15
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b 131.1K 16.4K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-16
Nous: Hermes 4 70B nousresearch/hermes-4-70b 131.1K 131.1K Input: $0.13
Output: $0.4
Cache Read: $0.055
Model: 0.065
Completion: 3.077
Cache: 0.423
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-25
Updated: 2026-03-15
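
The NewAPI Ratios in the rows above appear to follow directly from the per-1M prices: Model looks like the input price divided by 2, Completion like output price over input price, and Cache like cache-read price over input price. The sketch below reproduces that arithmetic. The $2-per-1M base, the function name `newapi_ratios`, and the three-decimal rounding are assumptions inferred from the listed values, not a documented formula, and rows that also list audio pricing do not seem to follow this pattern.

```python
from typing import Optional

# Assumption: a Model ratio of 1.000 corresponds to $2 per 1M input tokens.
BASE_PRICE_PER_M = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read: Optional[float] = None,
) -> Optional[dict]:
    """Derive NewAPI-style ratios from per-1M-token prices (hypothetical helper)."""
    if input_price <= 0:
        # Free rows show "-" in the ratios column, so no ratios are derived.
        return None
    ratios = {
        "model": round(input_price / BASE_PRICE_PER_M, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read is not None:
        ratios["cache"] = round(cache_read / input_price, 3)
    return ratios


# Prices taken from the DeepSeek: R1 0528 row above.
print(newapi_ratios(0.45, 2.15, cache_read=0.2))
```

Comparing the printed values against the Model, Completion, and Cache figures in a row is a quick way to spot entries whose ratios were computed from a different base price.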

Kimi For Coding

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing (1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2.7 Code | k2p7 | 262.1K | 32.8K | Input: $0; Output: $0; Cache Read: $0; Cache Write: $0 | - | 📎 🧠 🔧 | 2025-01 | In: text, image, video; Out: text | Open Weights; Released: 2026-06-12 |
| Kimi K2 Thinking | kimi-k2-thinking | 262.1K | 32.8K | Input: $0; Output: $0; Cache Read: $0; Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-07 | In: text; Out: text | Open Weights; Released: 2025-11; Updated: 2025-12 |
| Kimi K2.5 | k2p5 | 262.1K | 32.8K | Input: $0; Output: $0; Cache Read: $0; Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-01 | In: text, image, video; Out: text | Open Weights; Released: 2026-01 |
| Kimi K2.6 | k2p6 | 262.1K | 32.8K | Input: $0; Output: $0; Cache Read: $0; Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-01 | In: text, image, video; Out: text | Open Weights; Released: 2026-04 |

KUAE Cloud Coding Plan

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing (1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.7 | GLM-4.7 | 204.8K | 131.1K | Input: $0; Output: $0; Cache Read: $0; Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text; Out: text | Open Weights; Released: 2025-12-22 |

Lilac

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing (1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2.6 | moonshotai/kimi-k2.6 | 262.1K | 262.1K | Input: $0.7; Output: $3.5; Cache Read: $0.2 | Model: 0.350; Completion: 5.000; Cache: 0.286 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image; Out: text | Open Weights; Released: 2026-04-21 |
| MiniMax M2.7 | minimaxai/minimax-m2.7 | 204.8K | 204.8K | Input: $0.3; Output: $1.2; Cache Read: $0.055 | Model: 0.150; Completion: 4.000; Cache: 0.183 | 🧠 🔧 🌡️ | 2025-01 | In: text; Out: text | Open Weights; Released: 2026-03-18 |
| Gemma 4 31B IT | google/gemma-4-31b-it | 262.1K | 262.1K | Input: $0.11; Output: $0.35 | Model: 0.055; Completion: 3.182 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, video; Out: text | Open Weights; Released: 2026-04-02 |
| GLM 5.1 | zai-org/glm-5.1 | 202.8K | 131.1K | Input: $0.9; Output: $3; Cache Read: $0.27 | Model: 0.450; Completion: 3.333; Cache: 0.300 | 🧠 🔧 🌡️ | 2025-04 | In: text; Out: text | Open Weights; Released: 2026-03-27 |

Llama

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing (1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
|---|---|---|---|---|---|---|---|---|---|
| Llama-4-Scout-17B-16E-Instruct-FP8 | llama-4-scout-17b-16e-instruct-fp8 | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image; Out: text | Open Weights; Released: 2025-04-05 |
| Cerebras-Llama-4-Maverick-17B-128E-Instruct | cerebras-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text; Out: text | Open Weights; Released: 2025-04-05 |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2023-12 | In: text; Out: text | Open Weights; Released: 2024-12-06 |
| Groq-Llama-4-Maverick-17B-128E-Instruct | groq-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text; Out: text | Open Weights; Released: 2025-04-05 |
| Cerebras-Llama-4-Scout-17B-16E-Instruct | cerebras-llama-4-scout-17b-16e-instruct | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text; Out: text | Open Weights; Released: 2025-04-05 |
| Llama-3.3-8B-Instruct | llama-3.3-8b-instruct | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2023-12 | In: text; Out: text | Open Weights; Released: 2024-12-06 |
| Llama-4-Maverick-17B-128E-Instruct-FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 128K | 4.1K | Input: $0; Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image; Out: text | Open Weights; Released: 2025-04-05 |

LLM Gateway

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude 3.7 Sonnet claude-3-7-sonnet 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-02-24
Qwen Coder Plus qwen-coder-plus 131.1K 8.2K Input: $0.502
Output: $1.004
Model: 0.251
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Mistral Large (latest) mistral-large-latest 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Qwen3 VL 235B A22B Thinking qwen3-vl-235b-a22b-thinking 131.1K 8.2K Input: $0.5
Output: $2
Model: 0.250
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-15
DeepSeek R1 (0528) deepseek-r1-0528 64K 16.4K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-28
Devstral Small devstral-small-2507 128K 128K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Qwen3 VL 30B A3B Thinking qwen3-vl-30b-a3b-thinking 131.1K 8.2K Input: $0.2
Output: $1
Model: 0.100
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-02
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
Qwen2.5 Coder 7B qwen25-coder-7b 131.1K 8.2K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-19
MiniMax-M2.7-highspeed minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Llama 3.1 8B Instruct llama-3.1-8b-instruct 128K 2K Input: $0.22
Output: $0.22
Model: 0.110
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Qwen Plus qwen-plus 1M 32.8K Input: $0.4
Output: $1.2
Reasoning: $4
Model: 0.200
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
o3 o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
Nemotron 3 Ultra 550B A55B nemotron-3-ultra-550b 1M 128K Input: $0.5
Output: $2.5
Cache Read: $0.15
Model: 0.250
Completion: 5.000
Cache: 0.300
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
MiniMax-M2.5 minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Grok 4.20 (Non-Reasoning) grok-4-20-beta-0309-non-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Gemini 3.1 Flash Lite gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Qwen3 235B A22B Instruct (2507) qwen3-235b-a22b-instruct-2507 131.1K 8.2K Input: $0.09
Output: $0.58
Model: 0.045
Completion: 6.444
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-08
Llama 3 70B Instruct llama-3-70b-instruct 8.2K 8K Input: $0.51
Output: $0.74
Model: 0.255
Completion: 1.451
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-18
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 65.5K Input: $0.45
Output: $2.25
Model: 0.225
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Seed 1.8 (251228) seed-1-8-251228 256K 8.2K Input: $0.25
Output: $2
Cache Read: $0.05
Model: 0.125
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-18
Hermes 2 Pro Llama 3 8B hermes-2-pro-llama-3-8b 8.2K 8.2K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-05-27
Kimi K2 kimi-k2 131.1K 16.4K Input: $0.6
Output: $2.5
Cache Read: $0.12
Model: 0.300
Completion: 4.167
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-11
Llama 3.1 70B Instruct llama-3.1-70b-instruct 128K 2K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
GPT-5.2 Pro gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
MiniMax-M2.1 minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
Qwen3 32B qwen3-32b 131.1K 16.4K Input: $0.7
Output: $2.8
Reasoning: $8.4
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Pixtral Large (latest) pixtral-large-latest 128K 128K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
GLM-4 32B (0414-128k) glm-4-32b-0414-128k 128K 16.4K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-14
Seed 1.6 Flash (250715) seed-1-6-flash-250715 256K 8.2K Input: $0.07
Output: $0.3
Cache Read: $0.015
Model: 0.035
Completion: 4.286
Cache: 0.214
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-07-26
Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K Input: $0.5
Output: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3 VL 8B Instruct qwen3-vl-8b-instruct 131.1K 8.2K Input: $0.08
Output: $0.5
Model: 0.040
Completion: 6.250
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-19
GPT-4o Mini Search Preview gpt-4o-mini-search-preview 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🌡️ - In: text, image
Out: text
Released: 2024-10-01
GLM-4.5V glm-4.5v 64K 16.4K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
Qwen3.7 Plus qwen3.7-plus 1M 64K Input: $0.4
Output: $1.6
Cache Read: $0.08
Cache Write: $0.5
Model: 0.200
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GPT-5 gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Kimi K2 Thinking Turbo kimi-k2-thinking-turbo 262.1K 262.1K Input: $1.15
Output: $8
Cache Read: $0.15
Model: 0.575
Completion: 6.957
Cache: 0.130
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Qwen3.6 35B-A3B qwen3.6-35b-a3b 262.1K 65.5K Input: $0.248
Output: $1.485
Model: 0.124
Completion: 5.988
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-17
GPT-3.5-turbo gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Cache Read: $0
Model: 0.250
Completion: 3.000
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
Seed 1.6 (250615) seed-1-6-250615 256K 8.2K Input: $0.25
Output: $2
Cache Read: $0.05
Model: 0.125
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-06-25
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
GLM-4.5 glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GPT-5 Pro gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
Ministral 8B ministral-8b-2512 262.1K 8.2K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
GPT-4 gpt-4 8.2K 8.2K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Qwen3 Max qwen3-max 262.1K 65.5K Input: $1.2
Output: $6
Model: 0.600
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Mistral Small 3.2 mistral-small-2506 128K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Gemini 3.5 Flash gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Qwen Plus Latest qwen-plus-latest 131.1K 8.2K Input: $0.115
Output: $0.287
Model: 0.058
Completion: 2.496
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-25
Claude Opus 4.1 claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Grok 4.3 grok-4-3 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Llama 4 Scout 17B Instruct llama-4-scout-17b-instruct 8.2K 2K Input: $0.17
Output: $0.66
Model: 0.085
Completion: 3.882
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Sonar Reasoning Pro sonar-reasoning-pro 128K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🧠 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Grok 4.20 (Non-Reasoning) grok-4-20-non-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
GLM-4.7-FlashX glm-4.7-flashx 200K 131.1K Input: $0.07
Output: $0.4
Cache Read: $0.01
Cache Write: $0
Model: 0.035
Completion: 5.714
Cache: 0.143
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
Qwen3 30B A3B Instruct (2507) qwen3-30b-a3b-instruct-2507 131.1K 8.2K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-08
Kimi K2.7 Code kimi-k2.7-code 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
GLM-5.1 glm-5.1 200K 131.1K Input: $6
Output: $24
Cache Read: $1.3
Cache Write: $0
Model: 3.000
Completion: 4.000
Cache: 0.217
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.5
Output: $6
Model: 0.250
Completion: 12.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
GLM-4.6 glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
Qwen3.5 397B-A17B qwen35-397b-a17b 262.1K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-15
Qwen3 235B A22B Thinking (2507) qwen3-235b-a22b-thinking-2507 131.1K 8.2K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-08
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Gemini Pro Latest gemini-pro-latest 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-02-27
Gemma 3 1B IT gemma-3-1b-it 1M 16.4K Input: $0.08
Output: $0.3
Model: 0.040
Completion: 3.750
🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-12
GLM-4.5 X glm-4.5-x 128K 16.4K Input: $2.2
Output: $8.9
Cache Read: $0.45
Model: 1.100
Completion: 4.045
Cache: 0.205
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
GPT-5.4 nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Grok 4.20 (Reasoning) grok-4-20-beta-0309-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
GLM-4.5 AirX glm-4.5-airx 128K 16.4K Input: $1.1
Output: $4.5
Cache Read: $0.22
Model: 0.550
Completion: 4.091
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
GPT-5 Chat (latest) gpt-5-chat-latest 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Ministral 3B ministral-3b-2512 131.1K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct 131.1K 8.2K Input: $2.8
Output: $8.4
Model: 1.400
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
GLM-4.6V FlashX glm-4.6v-flashx 128K 16K Input: $0.04
Output: $0.4
Cache Read: $0.004
Model: 0.020
Completion: 10.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-08
MiniMax-M3 minimax-m3 512K 128K Input: $0.6
Output: $2.4
Cache Read: $0.12
Model: 0.300
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
Qwen3-VL Plus qwen3-vl-plus 262.1K 32.8K Input: $0.2
Output: $1.6
Reasoning: $4.8
Model: 0.100
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-09-23
Claude Opus 4.5 claude-opus-4-5-20251101 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
MiniMax-M2 minimax-m2 196.6K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image, pdf
Out: text
Released: 2024-10-22
Claude Opus 4.8 claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4 claude-opus-4-20250514 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Qwen3 Max (2026-01-23) qwen3-max-2026-01-23 256K 32.8K Input: $0.359
Output: $1.434
Cache Read: $0.072
Model: 0.179
Completion: 3.994
Cache: 0.201
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-01-23
GPT-5.3 Chat (latest) gpt-5.3-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-03
Claude 3 Opus claude-3-opus 200K 4.1K Input: $15
Output: $75
Cache Read: $1.5
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-03-04
Qwen-Omni Turbo qwen-omni-turbo 32.8K 2K Input: $0.07
Output: $0.27
Input Audio: $4.44
Output Audio: $8.89
Model: 2.220
Completion: 2.002
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-01-19
Updated: 2025-03-26
o3-mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
Qwen3 32B FP8 qwen3-32b-fp8 131.1K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-28
Claude 3.5 Haiku claude-3-5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Model: 0.400
Completion: 5.000
Cache: 0.100
🔧 🌡️ - In: text
Out: text
Released: 2024-10-22
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
Gemma 3 27B gemma-3-27b 128K 16.4K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-12
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Grok 4 (0709) grok-4-0709 256K 256K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-07-09
MiniMax-M2.7 minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen Flash qwen-flash 1M 32.8K Input: $0.05
Output: $0.4
Model: 0.025
Completion: 8.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-07-28
Qwen3 4B FP8 qwen3-4b-fp8 131.1K 8.2K Input: $0.03
Output: $0.03
Model: 0.015
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-28
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Qwen2.5 VL 32B Instruct qwen2-5-vl-32b-instruct 131.1K 8.2K Input: $1.4
Output: $4.2
Model: 0.700
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-15
Claude Sonnet 4 claude-sonnet-4-20250514 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
GPT-5.1 Codex mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Qwen3 30B A3B Thinking (2507) qwen3-30b-a3b-thinking-2507 131.1K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-08
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Seed 1.6 (250915) seed-1-6-250915 256K 8.2K Input: $0.25
Output: $2
Cache Read: $0.05
Model: 0.125
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-15
GPT-5.2 Chat gpt-5.2-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
MiniMax Text 01 minimax-text-01 1M 131.1K Input: $0.2
Output: $1.1
Model: 0.100
Completion: 5.500
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-15
Grok 4 Fast Reasoning grok-4-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-07-09
GPT-4.1 nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT OSS 120B gpt-oss-120b 131.1K 32.8K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen-VL Max qwen-vl-max 131.1K 8.2K Input: $0.8
Output: $3.2
Model: 0.400
Completion: 4.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-04-08
Updated: 2025-08-13
Llama 3 8B Instruct llama-3-8b-instruct 8.2K 8.2K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-03
Qwen3 30B A3B FP8 qwen3-30b-a3b-fp8 131.1K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-28
Sonar sonar 128K 4.1K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Qwen Max qwen-max 32.8K 8.2K Input: $1.6
Output: $6.4
Model: 0.800
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image, pdf
Out: text
Released: 2025-02-19
o1 o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
MiniMax-M2.5-highspeed minimax-m2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
DeepSeek V3.1 deepseek-v3.1 128K 32.8K Input: $0.56
Output: $1.68
Cache Read: $0.07
Model: 0.280
Completion: 3.000
Cache: 0.125
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-21
Llama 4 Scout llama-4-scout 32.8K 16.4K Input: $0.18
Output: $0.59
Model: 0.090
Completion: 3.278
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-05
Ministral 14B ministral-14b-2512 262.1K 8.2K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Sonar Pro sonar-pro 200K 8.2K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
GLM-4.6V glm-4.6v 128K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
Claude Haiku 4.5 (latest) claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
MiniMax M2.1 Lightning minimax-m2.1-lightning 196.6K 131.1K Input: $0.12
Output: $0.48
Model: 0.060
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
MiMo-V2.5 mimo-v2.5 1M 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Omni mimo-v2-omni 262.1K 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
GPT-5.4 mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Qwen3 Coder Next qwen3-coder-next 262.1K 65.5K Input: $0.108
Output: $0.675
Model: 0.054
Completion: 6.250
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-10-15
Llama 4 Maverick 17B Instruct llama-4-maverick-17b-instruct 8.2K 2K Input: $0.24
Output: $0.97
Model: 0.120
Completion: 4.042
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Devstral 2 devstral-2512 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
QwQ Plus qwq-plus 131.1K 8.2K Input: $0.8
Output: $2.4
Model: 0.400
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-03-05
Grok 4.1 Fast Reasoning grok-4-1-fast-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-11-19
Qwen3 VL 30B A3B Instruct qwen3-vl-30b-a3b-instruct 131.1K 8.2K Input: $0.2
Output: $0.7
Model: 0.100
Completion: 3.500
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-02
Qwen3 VL Flash qwen3-vl-flash 1M 32K Input: $0.022
Output: $0.215
Cache Read: $0.0044
Model: 0.011
Completion: 9.773
Cache: 0.200
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-09
GPT-5 Mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
MiMo-V2-Flash mimo-v2-flash 262.1K 65.5K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
GPT-4.1 mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
Llama 3.1 Nemotron Ultra 253B llama-3.1-nemotron-ultra-253b 128K 8.2K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-07
GPT-4 Turbo gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
Qwen3 Coder Flash qwen3-coder-flash 1M 65.5K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
Gemma 2 27B IT gemma-2-27b-it-together 8.2K 16.4K Input: $0.08
Output: $0.08
Model: 0.040
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-27
Grok Build 0.1 grok-build-0-1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-16
GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Llama 3.2 11B Instruct llama-3.2-11b-instruct 128K 8.2K Input: $0.07
Output: $0.33
Model: 0.035
Completion: 4.714
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-25
GPT-5.4 Pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GLM-4.6V Flash glm-4.6v-flash 128K 16K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-08
GLM-4.5-Air glm-4.5-air 131.1K 98.3K Input: $0.2
Output: $1.1
Cache Read: $0.03
Cache Write: $0
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
GLM-4.7-Flash glm-4.7-flash 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
Qwen3 VL 235B A22B Instruct qwen3-vl-235b-a22b-instruct 131.1K 8.2K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-15
MiMo-V2-Pro mimo-v2-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
GPT-4o Search Preview gpt-4o-search-preview 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🌡️ - In: text, image
Out: text
Released: 2024-10-01
Llama 3.2 3B Instruct llama-3.2-3b-instruct 32.8K 32K Input: $0.03
Output: $0.05
Model: 0.015
Completion: 1.667
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
GPT-5.5 Pro gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT-4o mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
GPT OSS 20B gpt-oss-20b 131.1K 32.8K Input: $0.04
Output: $0.15
Model: 0.020
Completion: 3.750
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
GLM-5 glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
Qwen Max Latest qwen-max-latest 32.8K 8.2K Input: $0.345
Output: $1.377
Model: 0.172
Completion: 3.991
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-25
Mistral Large 3 mistral-large-2512 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Qwen Turbo qwen-turbo 1M 16.4K Input: $0.05
Output: $0.2
Reasoning: $0.5
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-11-01
Updated: 2025-04-28
Custom Model custom 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-01-01
Grok 4.20 (Reasoning) grok-4-20-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
DeepSeek V3.2 deepseek-v3.2 163.8K 16.4K Input: $0.28
Output: $0.42
Cache Read: $0.056
Model: 0.140
Completion: 1.500
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-29
Qwen3.6 Max Preview qwen3.6-max-preview 262.1K 65.5K Input: $1.3
Output: $7.8
Cache Read: $0.13
Cache Write: $1.625
Model: 0.650
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-04-20
Auto Route auto 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-01-01
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
Codestral codestral-2508 256K 16.4K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 235B A22B FP8 qwen3-235b-a22b-fp8 131.1K 8.2K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-28
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
Qwen-VL Plus qwen-vl-plus 131.1K 8.2K Input: $0.21
Output: $0.63
Model: 0.105
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-01-25
Updated: 2025-08-15
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
Gemini 2.0 Flash-Lite gemini-2.0-flash-lite 1M 8.2K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
GPT-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
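
The NewAPI Ratios column in the rows above appears to follow from the listed prices: Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price. Rows that list an Input Audio price appear to use that price as the base instead, and free rows show "-". The sketch below is inferred from the table values only, not from documented NewAPI behavior; the function name `newapi_ratios`, its parameters, and the $2 per 1M baseline are assumptions made for illustration.

```python
from typing import Optional


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
    input_audio_usd: Optional[float] = None,
) -> Optional[dict]:
    """Derive the ratio columns from per-1M-token prices (hypothetical helper).

    Assumption: a Model ratio of 1.0 corresponds to $2 per 1M input tokens,
    which matches the rows in the table above.
    """
    # Assumption: rows with an Input Audio price use it as the base price.
    base = input_audio_usd if input_audio_usd is not None else input_usd
    if base == 0:
        # Free models show "-" in the ratios column.
        return None
    ratios = {
        "model": round(base / 2, 3),
        "completion": round(output_usd / base, 3),
    }
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / base, 3)
    return ratios


# GPT-5.5 row: Input $5, Output $30, Cache Read $0.5
print(newapi_ratios(5, 30, 0.5))
# Kimi K2.6 row: Input $0.95, Output $4, Cache Read $0.16
print(newapi_ratios(0.95, 4, 0.16))
```

Displayed values can differ from this sketch in the last digit where a ratio falls exactly on a rounding boundary, so treat the table as authoritative.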

LLMTR

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Sincap sincap 128K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2026-05-05
Magibu 11B v8 magibu-11b-v8 8.2K 4.1K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2026-06-05
Gemma 4 gemma-4 32.8K 8.2K Input: $5
Output: $10
Model: 2.500
Completion: 2.000
🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-22
MedGemma 4B medgemma-4b 8.2K 4.1K Input: $3
Output: $5
Model: 1.500
Completion: 1.667
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-26
Qwen3.6 35B-A3B qwen3-6-35b 16.4K 65.5K Input: $5
Output: $10
Model: 2.500
Completion: 2.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-17
Trendyol 7B trendyol-7b 32.8K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-06

LMStudio

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 30B A3B 2507 qwen/qwen3-30b-a3b-2507 262.1K 16.4K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 Coder 30B qwen/qwen3-coder-30b 262.1K 65.5K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

LucidQuery AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
LucidNova RF1 100B lucidnova-rf1-100b 120K 8K Input: $2
Output: $5
Model: 1.000
Completion: 2.500
📎 🧠 🔧 2025-09-16 In: text
Out: text
Released: 2024-12-28
Updated: 2025-09-10
LucidQuery Nexus Coder lucidquery-nexus-coder 250K 60K Input: $2
Output: $5
Model: 1.000
Completion: 2.500
📎 🧠 🔧 2025-08-01 In: text
Out: text
Released: 2025-09-01

Meganova

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct 131.1K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-06
Kimi K2 Thinking moonshotai/Kimi-K2-Thinking 262.1K 262.1K Input: $0.6
Output: $2.6
Model: 0.300
Completion: 4.333
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $0.45
Output: $2.8
Model: 0.225
Completion: 6.222
🧠 🔧 🌡️ 2026-01 In: text, image
Out: text
Open Weights
Released: 2026-01-27
Qwen2.5 VL 32B Instruct Qwen/Qwen2.5-VL-32B-Instruct 16.4K 16.4K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-24
Qwen3.5 Plus Qwen/Qwen3.5-Plus 1M 65.5K Input: $0.4
Output: $2.4
Reasoning: $2.4
Model: 0.200
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262K 262K Input: $0.09
Output: $0.6
Model: 0.045
Completion: 6.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-23
MiMo V2 Flash XiaomiMiMo/MiMo-V2-Flash 262.1K 32K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-17
Mistral Small 3.2 24B Instruct mistralai/Mistral-Small-3.2-24B-Instruct-2506 32.8K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Mistral Nemo Instruct 2407 mistralai/Mistral-Nemo-Instruct-2407 131.1K 65.5K Input: $0.02
Output: $0.04
Model: 0.010
Completion: 2.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
GLM-4.6 zai-org/GLM-4.6 202.8K 131.1K Input: $0.45
Output: $1.9
Model: 0.225
Completion: 4.222
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-5 zai-org/GLM-5 202.8K 131.1K Input: $0.8
Output: $2.56
Model: 0.400
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-4.7 zai-org/GLM-4.7 202.8K 131.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 163.8K 163.8K Input: $0.25
Output: $0.88
Model: 0.125
Completion: 3.520
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 163.8K 64K Input: $0.5
Output: $2.15
Model: 0.250
Completion: 4.300
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 164K 164K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-25
DeepSeek V3.2 Exp deepseek-ai/DeepSeek-V3.2-Exp 164K 164K Input: $0.27
Output: $0.4
Model: 0.135
Completion: 1.481
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-10
DeepSeek V3.2 deepseek-ai/DeepSeek-V3.2 164K 164K Input: $0.26
Output: $0.38
Model: 0.130
Completion: 1.462
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-03
MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 196.6K 131.1K Input: $0.28
Output: $1.2
Model: 0.140
Completion: 4.286
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12

Merge Gateway

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4.3 xai/grok-4.3 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Grok 4.20 (Reasoning) xai/grok-4.20-0309-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Codestral (latest) mistral/codestral-latest 256K 4.1K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Mistral Large (latest) mistral/mistral-large-latest 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Devstral Small mistral/devstral-small-2507 128K 128K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Pixtral Large (latest) mistral/pixtral-large-latest 128K 128K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mistral Medium (latest) mistral/mistral-medium-latest 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Mistral Small (latest) mistral/mistral-small-latest 256K 256K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Mistral Medium 3 mistral/mistral-medium-2505 131.1K 131.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Mistral Large 2.1 mistral/mistral-large-2411 131.1K 16.4K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-18
Magistral Medium (latest) mistral/magistral-medium-latest 128K 16.4K Input: $2
Output: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Released: 2025-03-17
Updated: 2025-03-20
Devstral 2 (latest) mistral/devstral-medium-latest 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-02
Devstral 2 mistral/devstral-2512 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Mistral Large 3 mistral/mistral-large-2512 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Devstral Medium mistral/devstral-medium-2507 128K 128K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Released: 2025-07-10
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Gemma 4 31B IT google/gemma-4-31b-it 262.1K 32.8K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini Flash-Lite Latest google/gemini-flash-lite-latest 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it 262.1K 32.8K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 3 Pro Preview google/gemini-3-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Gemini Flash Latest google/gemini-flash-latest 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.075
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 128K 4.1K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-5.4 nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-5 Chat (latest) openai/gpt-5-chat-latest 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.3 Chat (latest) openai/gpt-5.3-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-03
GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-08-06
o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.1 Chat openai/gpt-5.1-chat-latest 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 Chat openai/gpt-5.2-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-11-20
o1 openai/o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GLM-4.7 zai/glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-4.5 zai/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.7-FlashX zai/glm-4.7-flashx 200K 131.1K Input: $0.07
Output: $0.4
Cache Read: $0.01
Cache Write: $0
Model: 0.035
Completion: 5.714
Cache: 0.143
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-5.1 zai/glm-5.1 200K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Cache Write: $0
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM-4.6 zai/glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.5-Air zai/glm-4.5-air 131.1K 98.3K Input: $0.2
Output: $1.1
Cache Read: $0.03
Cache Write: $0
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5 zai/glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-5-Turbo zai/glm-5-turbo 200K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Cache Write: $0
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16
Claude Haiku 4.5 anthropic/claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.1 anthropic/claude-opus-4-1-20250805 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Opus 4.7 anthropic/claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.5 anthropic/claude-opus-4-5-20251101 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-01
Claude Opus 4 anthropic/claude-opus-4-20250514 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Sonnet 4 anthropic/claude-sonnet-4-20250514 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.6 anthropic/claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.5 anthropic/claude-sonnet-4-5-20250929 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Command A cohere/command-a-03-2025 256K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2025-03-13
Command R cohere/command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Command R7B cohere/command-r7b-12-2024 128K 4K Input: $0.0375
Output: $0.15
Model: 0.019
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-12-02
Command R+ cohere/command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax-M2.7-highspeed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.5 minimax/minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M2.1 minimax/minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2 minimax/minimax-m2 196.6K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.5-highspeed minimax/minimax-m2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
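
The NewAPI Ratios column appears to follow directly from the Pricing (1M) column. In the rows above, Model equals the input price divided by 2, Completion equals the output price divided by the input price, and Cache equals the cache-read price divided by the input price. The Python sketch below reproduces that arithmetic. The function name `newapi_ratios`, the $2-per-1M base price, and the handling of $0 rows are assumptions inferred from the listed values, not taken from the generator that builds this page.

```python
from typing import Dict, Optional

# Assumption: a Model ratio of 1.0 corresponds to $2 per 1M input tokens,
# which is what the listed rows imply (e.g. Input $5 -> Model 2.500).
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Dict[str, float]:
    """Derive ratio values from per-1M-token USD prices (hypothetical helper)."""
    # Rows priced at $0 show "-" instead of ratios, so return nothing for them.
    if input_price <= 0:
        return {}

    ratios = {"model": round(input_price / BASE_PRICE_PER_1M, 3)}

    # Rows with a $0 output price (e.g. embedding models) list no Completion ratio.
    if output_price > 0:
        ratios["completion"] = round(output_price / input_price, 3)

    # Cache is only listed when a cache-read price is published.
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)

    return ratios


if __name__ == "__main__":
    # GLM-4.7 row: Input $0.6, Output $2.2, Cache Read $0.11
    print(newapi_ratios(0.6, 2.2, 0.11))
```

For the GLM-4.7 row above (Input $0.6, Output $2.2, Cache Read $0.11) this yields Model 0.3, Completion 3.667, and Cache 0.183, which matches the listed ratios.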

MiniMax (minimax.io)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.1 MiniMax-M2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2 MiniMax-M2 196.6K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.5 MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M3 MiniMax-M3 512K 128K Input: $0.6
Output: $2.4
Cache Read: $0.12
Model: 0.300
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax-M2.7 MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

MiniMax (minimaxi.com)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.1 MiniMax-M2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2 MiniMax-M2 196.6K 128K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.5 MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M3 MiniMax-M3 512K 128K Input: $0.6
Output: $2.4
Cache Read: $0.12
Model: 0.300
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax-M2.7 MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

MiniMax Token Plan (minimaxi.com)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.1 MiniMax-M2.1 204.8K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2 MiniMax-M2 196.6K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.5 MiniMax-M2.5 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M3 MiniMax-M3 512K 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax-M2.7 MiniMax-M2.7 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

MiniMax Token Plan (minimax.io)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.1 MiniMax-M2.1 204.8K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2.5-highspeed MiniMax-M2.5-highspeed 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2 MiniMax-M2 196.6K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.5 MiniMax-M2.5 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M3 MiniMax-M3 512K 128K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax-M2.7 MiniMax-M2.7 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

Mistral

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Codestral (latest) codestral-latest 256K 4.1K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Mistral Large (latest) mistral-large-latest 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Mistral 7B open-mistral-7b 8K 8K Input: $0.25
Output: $0.25
Model: 0.125
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2023-09-27
Devstral Small devstral-small-2507 128K 128K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Ministral 3B (latest) ministral-3b-latest 128K 128K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Pixtral Large (latest) pixtral-large-latest 128K 128K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mistral Nemo mistral-nemo 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-01
Mistral Embed mistral-embed 8K 3.1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Released: 2023-12-11
Mistral Small 3.2 mistral-small-2506 128K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Ministral 8B (latest) ministral-8b-latest 128K 128K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Mixtral 8x22B open-mixtral-8x22b 64K 64K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-17
Mistral Medium (latest) mistral-medium-latest 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Devstral Small 2505 devstral-small-2505 128K 128K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-07
Magistral Small magistral-small 128K 128K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Mistral Medium 3.5 mistral-medium-2604 262.1K 262.1K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-29
Mistral Small (latest) mistral-small-latest 256K 256K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Mixtral 8x7B open-mixtral-8x7b 32K 32K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
🔧 🌡️ 2024-01 In: text
Out: text
Open Weights
Released: 2023-12-11
Devstral 2 devstral-latest 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Mistral Small 4 mistral-small-2603 256K 256K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Mistral Medium 3 mistral-medium-2505 131.1K 131.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Mistral Large 2.1 mistral-large-2411 131.1K 16.4K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mistral Medium 3.1 mistral-medium-2508 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Open Mistral Nemo open-mistral-nemo 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-01
Magistral Medium (latest) magistral-medium-latest 128K 16.4K Input: $2
Output: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20
Devstral 2 (latest) devstral-medium-latest 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-02
Devstral 2 devstral-2512 262.1K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2025-12-09
Devstral Small 2 labs-devstral-small-2512 256K 256K Input: $0
Output: $0
- 🔧 🌡️ 2025-12 In: text, image
Out: text
Open Weights
Released: 2025-12-09
Pixtral 12B pixtral-12b 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Mistral Large 3 mistral-large-2512 262.1K 262.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Devstral Medium devstral-medium-2507 128K 128K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10

Mixlayer

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3.5 27B qwen/qwen3.5-27b 262.1K 262.1K Input: $0.3
Output: $2.4
Model: 0.150
Completion: 8.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.5 35B A3B qwen/qwen3.5-35b-a3b 262.1K 262.1K Input: $0.25
Output: $1.3
Model: 0.125
Completion: 5.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.5 9B qwen/qwen3.5-9b 262.1K 262.1K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b 262.1K 262.1K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.5 122B A10B qwen/qwen3.5-122b-a10b 262.1K 262.1K Input: $0.4
Output: $3.2
Model: 0.200
Completion: 8.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

Moark

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.1 MiniMax-M2.1 204.8K 131.1K Input: $2.1
Output: $8.4
Model: 1.050
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
GLM-4.7 GLM-4.7 204.8K 131.1K Input: $3.5
Output: $14
Model: 1.750
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22

ModelScope

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 30B A3B Thinking 2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262.1K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262.1K 65.5K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-31
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 16.4K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
GLM-4.6 ZhipuAI/GLM-4.6 202.8K 98.3K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.5 ZhipuAI/GLM-4.5 131.1K 98.3K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28

Moonshot AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 Thinking Turbo kimi-k2-thinking-turbo 262.1K 262.1K Input: $1.15
Output: $8
Cache Read: $0.15
Model: 0.575
Completion: 6.957
Cache: 0.130
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2.7 Code kimi-k2.7-code 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 Turbo kimi-k2-turbo-preview 262.1K 262.1K Input: $2.4
Output: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21

Moonshot AI (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2 Turbo kimi-k2-turbo-preview 262.1K 262.1K Input: $2.4
Output: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2 Thinking Turbo kimi-k2-thinking-turbo 262.1K 262.1K Input: $1.15
Output: $8
Cache Read: $0.15
Model: 0.575
Completion: 6.957
Cache: 0.130
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05

Morph

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Morph v3 Fast morph-v3-fast 16K 16K Input: $0.8
Output: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15
Morph v3 Large morph-v3-large 32K 32K Input: $0.9
Output: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15
Auto auto 32K 32K Input: $0.85
Output: $1.55
Model: 0.425
Completion: 1.824
- - In: text
Out: text
Released: 2024-06-01

NanoGPT

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Step-3 step-3 65.5K 8.2K Input: $0.2499
Output: $0.6494
Model: 0.125
Completion: 2.599
📎 - In: text, image
Out: text
Released: 2025-07-31
Qwen3.5 35B A3B Thinking qwen3.5-35b-a3b:thinking 260.1K 65.5K Input: $0.225
Output: $1.8
Model: 0.113
Completion: 8.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-24
GLM 4.1V Thinking FlashX glm-4.1v-thinking-flashx 64K 8.2K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
📎 - In: text, image
Out: text
Released: 2025-07-09
ERNIE X1.1 ernie-x1.1-preview 64K 8.2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 - In: text, pdf
Out: text
Released: 2025-09-10
Qwen2.5 VL 72B qwen25-vl-72b-instruct 32K 32.8K Input: $0.69989
Output: $0.69989
Model: 0.350
Completion: 1.000
📎 - In: text, image
Out: text
Released: 2025-05-10
Gemini 2.0 Pro 0205 gemini-2.0-pro-exp-02-05 2.1M 8.2K Input: $1.989
Output: $7.956
Model: 0.995
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-02-05
Doubao Seed 2.0 Lite doubao-seed-2-0-lite-260215 256K 32K Input: $0.1462
Output: $0.8738
Model: 0.073
Completion: 5.977
- - In: text
Out: text
Released: 2026-02-14
Qwen3.5 27B Writer V2 Derestricted Qwen3.5-27B-Writer-V2-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Claude 4 Opus Thinking (8K) claude-opus-4-thinking:8192 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Qwen3 VL 235B A22B Thinking qwen3-vl-235b-a22b-thinking 32.8K 32.8K Input: $0.5
Output: $6
Model: 0.250
Completion: 12.000
📎 🧠 - In: text, image
Out: text
Released: 2025-08-26
GLM 4 Air 0111 glm-4-air-0111 128K 4.1K Input: $0.1394
Output: $0.1394
Model: 0.070
Completion: 1.000
- - In: text
Out: text
Released: 2025-01-11
Qwen3.5 27B Queen Derestricted Qwen3.5-27B-Queen-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Gemini 2.5 Pro Preview 0325 gemini-2.5-pro-preview-03-25 1M 65.5K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-03-25
Brave (Research) brave-research 16.4K 16.4K Input: $5
Output: $5
Model: 2.500
Completion: 1.000
- - In: text
Out: text
Released: 2023-03-02
Updated: 2024-01-01
Qwen Plus qwen-plus 995.9K 32.8K Input: $0.3995
Output: $1.2002
Model: 0.200
Completion: 3.004
🧠 - In: text
Out: text
Released: 2024-01-25
Doubao Seed 2.0 Mini doubao-seed-2-0-mini-260215 256K 32K Input: $0.0493
Output: $0.4845
Model: 0.025
Completion: 9.828
- - In: text
Out: text
Released: 2026-02-14
Qwen3.5 27B BlueStar v3 Derestricted Lite Qwen3.5-27B-BlueStar-v3-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
DeepSeek Chat 0324 deepseek-v3-0324 128K 8.2K Input: $0.25
Output: $0.7
Model: 0.125
Completion: 2.800
🔧 - In: text
Out: text
Released: 2025-03-24
Gemma 4 31B Musica v1 Gemma-4-31B-Musica-v1 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
MiroThinker 1.7 Deep Research Mini mirothinker-1-7-deepresearch-mini 262.1K 16.4K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
🧠 - In: text
Out: text
Released: 2026-05-11
Baichuan 4 Turbo Baichuan4-Turbo 128K 32.8K Input: $2.42
Output: $2.42
Model: 1.210
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-19
Doubao Seed 1.6 Flash doubao-seed-1-6-flash-250615 256K 16.4K Input: $0.0374
Output: $0.374
Model: 0.019
Completion: 10.000
- - In: text
Out: text
Released: 2025-06-15
Kimi K2 0711 Fast kimi-k2-instruct-fast 131.1K 16.4K Input: $0.1
Output: $2
Model: 0.050
Completion: 20.000
📎 - In: text, pdf
Out: text
Released: 2025-07-15
GLM-4 Plus glm-4-plus 128K 4.1K Input: $7.497
Output: $7.497
Model: 3.748
Completion: 1.000
- - In: text
Out: text
Released: 2024-08-01
v0 1.0 MD v0-1.0-md 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
- - In: text
Out: text
Released: 2025-07-04
Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct 128K 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
🔧 - In: text
Out: text
Released: 2025-08-05
Gemini 2.5 Flash Preview (09/2025) – Thinking gemini-2.5-flash-preview-09-2025-thinking 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-25
v0 1.5 LG v0-1.5-lg 1M 64K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
- - In: text
Out: text
Released: 2025-07-04
Qwen3.5 Flash Thinking qwen3.5-flash:thinking 991.8K 65.5K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-24
Jamba Large jamba-large 256K 4.1K Input: $1.989
Output: $7.99
Model: 0.995
Completion: 4.017
- - In: text
Out: text
Released: 2025-07-09
Kimi Thinking Preview kimi-thinking-preview 128K 16.4K Input: $31.46
Output: $31.46
Model: 15.730
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-05-07
Gemini 2.0 Flash Thinking 0121 gemini-2.0-flash-thinking-exp-01-21 1M 8.2K Input: $0.306
Output: $1.003
Model: 0.153
Completion: 3.278
📎 🧠 - In: text, image
Out: text
Released: 2025-01-21
Claude 4.1 Opus Thinking (1K) claude-opus-4-1-thinking:1024 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Qwen3.5 27B NaNovel Derestricted Lite Qwen3.5-27B-NaNovel-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Doubao Seed 1.6 doubao-seed-1-6-250615 256K 16.4K Input: $0.204
Output: $0.51
Model: 0.102
Completion: 2.500
- - In: text
Out: text
Released: 2025-06-15
Ernie 5.0 Thinking Preview ernie-5.0-thinking-preview 128K 16.4K Input: $1.1
Output: $2
Model: 0.550
Completion: 1.818
📎 🧠 - In: text, image
Out: text
Released: 2025-11-18
Gemini 2.5 Flash 0520 Thinking gemini-2.5-flash-preview-05-20:thinking 1M 65.5K Input: $0.15
Output: $3.5
Model: 0.075
Completion: 23.333
📎 🧠 - In: text, image
Out: text
Released: 2025-05-20
Qwen3.5 27B Omega Evolution v2.2 Derestricted Qwen3.5-27B-Omega-Evolution-v2.2-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Azure o1 azure-o1 200K 100K Input: $14.994
Output: $59.993
Model: 7.497
Completion: 4.001
- - In: text
Out: text
Released: 2024-12-17
Qwen3.7 Plus qwen3.7-plus 991.8K 65.5K Input: $0.4
Output: $1.6
Cache Read: $0.04
Model: 0.200
Completion: 4.000
Cache: 0.100
📎 - In: text, image, video
Out: text
Released: 2026-06-01
GLM-4 AirX glm-4-airx 8K 4.1K Input: $2.006
Output: $2.006
Model: 1.003
Completion: 1.000
- - In: text
Out: text
Released: 2024-06-05
Step R1 V Mini step-r1-v-mini 128K 65.5K Input: $2.5
Output: $11
Model: 1.250
Completion: 4.400
- - In: text
Out: text
Released: 2025-04-08
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-06-05
Claude Haiku 4.5 Thinking claude-haiku-4-5-20251001-thinking 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-10-15
Jamba Mini 1.6 jamba-mini-1.6 256K 4.1K Input: $0.1989
Output: $0.408
Model: 0.099
Completion: 2.051
- - In: text
Out: text
Released: 2025-03-01
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-10-15
Claude 3.5 Haiku claude-3-5-haiku-20241022 200K 8.2K Input: $0.8
Output: $4
Model: 0.400
Completion: 5.000
📎 🔧 - In: text, image, pdf
Out: text
Released: 2024-10-22
Claude 4 Sonnet Thinking (32K) claude-sonnet-4-thinking:32768 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Hermes Low hermes-low 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-11
Claude 4 Sonnet Thinking claude-sonnet-4-thinking 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-02-24
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.25
Model: 1.250
Completion: 3.000
Cache: 0.100
- - In: text
Out: text
Released: 2026-05-21
Yi Medium 200k yi-medium-200k 200K 4.1K Input: $2.499
Output: $2.499
Model: 1.250
Completion: 1.000
- - In: text
Out: text
Released: 2024-03-01
Qwen3.5 27B BlueStar v2 Derestricted Lite Qwen3.5-27B-BlueStar-v2-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Gemma 4 31B Claude 4.6 Opus Reasoning Distilled Gemma-4-31B-Claude-4.6-Opus-Reasoning-Distilled 262.1K 16.4K Input: $0.306
Output: $0.306
Cache Read: $0.0306
Model: 0.153
Completion: 1.000
Cache: 0.100
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Doubao 1.5 Thinking Vision Pro doubao-1-5-thinking-vision-pro-250428 128K 16.4K Input: $0.55
Output: $1.43
Model: 0.275
Completion: 2.600
📎 - In: text, image
Out: text
Released: 2025-05-15
Qwen3.5 27B BlueStar Derestricted Lite Qwen3.5-27B-BlueStar-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Qwen 2.5 32b EVA Qwen2.5-32B-EVA-v0.2 24.6K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-09-01
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 - In: text, image
Out: text
Released: 2025-06-05
Qwen Long 10M qwen-long 10M 8.2K Input: $0.1003
Output: $0.408
Model: 0.050
Completion: 4.068
📎 - In: text, pdf
Out: text
Released: 2025-01-25
Gemma 4 31B DarkIdol Gemma-4-31B-DarkIdol 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Step-2 16k Exp step-2-16k-exp 16K 8.2K Input: $7.004
Output: $19.992
Model: 3.502
Completion: 2.854
- - In: text
Out: text
Released: 2024-07-05
Qwen3.5 27B Marvin V2 Derestricted Lite Qwen3.5-27B-Marvin-V2-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Qwen3.5 27B NaNovel Derestricted Qwen3.5-27B-NaNovel-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Ernie 4.5 8k Preview ernie-4.5-8k-preview 8K 16.4K Input: $0.66
Output: $2.6
Model: 0.330
Completion: 3.939
- - In: text
Out: text
Released: 2025-03-25
Gemini Text + Image gemini-2.0-flash-exp-image-generation 32.8K 8.2K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
- - In: text
Out: text
Released: 2025-02-19
Gemini 2.5 Flash Lite Preview (09/2025) gemini-2.5-flash-lite-preview-09-2025 1M 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-25
Deepseek R1 Cheaper deepseek-reasoner-cheaper 128K 65.5K Input: $0.4
Output: $1.7
Model: 0.200
Completion: 4.250
- - In: text
Out: text
Released: 2025-01-20
Venice Uncensored venice-uncensored 128K 16.4K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-24
Gemini 2.0 Flash gemini-2.0-flash-001 1M 8.2K Input: $0.1003
Output: $0.408
Model: 0.050
Completion: 4.068
📎 🔧 - In: text, image
Out: text
Released: 2024-12-11
Gemma 4 31B Larkspur v0.5 gemma-4-31B-Larkspur-v0.5 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Claude 4.1 Opus claude-opus-4-1-20250805 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-08-05
Holo3-35B-A3B holo3-35b-a3b 65.5K 65.5K Input: $0.25
Output: $1.8
Model: 0.125
Completion: 7.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2024-01-01
Qwen3.5 Omni Plus qwen3.5-omni-plus 983.6K 65.5K Input: $0
Output: $0
- 📎 - In: text, image, video, audio
Out: text
Released: 2026-03-30
Gemma 4 31B IT Gemma-4-31B-it 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-09
Qwen3.5 27B Derestricted Qwen3.5-27B-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-03-17
Perplexity Reasoning Pro sonar-reasoning-pro 127K 128K Input: $2.006
Output: $7.9985
Model: 1.003
Completion: 3.987
🧠 - In: text
Out: text
Released: 2025-02-19
Venice Uncensored Web venice-uncensored:web 80K 16.4K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
- - In: text
Out: text
Released: 2024-05-01
Doubao 1.5 Pro 32k doubao-1.5-pro-32k 32K 8.2K Input: $0.1343
Output: $0.3349
Model: 0.067
Completion: 2.494
- - In: text
Out: text
Released: 2025-01-22
Qwen3 30B A3B Instruct 2507 qwen3-30b-a3b-instruct-2507 256K 32.8K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
- - In: text
Out: text
Released: 2025-02-20
ERNIE 5.1 Thinking ernie-5.1:thinking 119K 64K Input: $0.75
Output: $3
Cache Read: $0.75
Model: 0.375
Completion: 4.000
Cache: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-10
MiroThinker 1.7 Deep Research mirothinker-1-7-deepresearch 262.1K 16.4K Input: $4
Output: $25
Model: 2.000
Completion: 6.250
🧠 - In: text
Out: text
Released: 2026-05-11
Hermes High hermes-high 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-11
Jamba Large 1.6 jamba-large-1.6 256K 4.1K Input: $1.989
Output: $7.99
Model: 0.995
Completion: 4.017
- - In: text
Out: text
Released: 2025-03-12
GLM-4 Long glm-4-long 1M 4.1K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2024-08-01
Claude 4 Opus Thinking (32K) claude-opus-4-thinking:32768 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Claude 4 Sonnet Thinking (8K) claude-sonnet-4-thinking:8192 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Azure o3-mini azure-o3-mini 200K 65.5K Input: $1.088
Output: $4.3996
Model: 0.544
Completion: 4.044
- - In: text
Out: text
Released: 2025-01-31
Qwen: QvQ Max qvq-max 128K 8.2K Input: $1.4
Output: $5.3
Model: 0.700
Completion: 3.786
📎 - In: text, image
Out: text
Released: 2025-03-28
Qwen3.5 27B BlueStar v2 Derestricted Qwen3.5-27B-BlueStar-v2-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Qwen3.5 27B Vivid Durian Qwen3.5-27B-Vivid-Durian 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-03-18
Jamba Large 1.7 jamba-large-1.7 256K 4.1K Input: $1.989
Output: $7.99
Model: 0.995
Completion: 4.017
- - In: text
Out: text
Released: 2025-07-09
Baichuan M2 32B Medical Baichuan-M2 32.8K 32.8K Input: $15.73
Output: $15.73
Model: 7.865
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-19
Magistral Small 2506 Magistral-Small-2506 32.8K 32.8K Input: $0.4
Output: $1.4
Model: 0.200
Completion: 3.500
- - In: text
Out: text
Released: 2025-09-25
DeepSeek R1 deepseek-r1 128K 8.2K Input: $0.4
Output: $1.7
Model: 0.200
Completion: 4.250
🧠 - In: text
Out: text
Released: 2025-01-20
Qwen3.5 27B earica Derestricted Lite Qwen3.5-27B-earica-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Doubao Seed 2.0 Pro doubao-seed-2-0-pro-260215 256K 128K Input: $0.782
Output: $3.876
Model: 0.391
Completion: 4.957
- - In: text
Out: text
Released: 2026-02-14
Step-2 Mini step-2-mini 8K 4.1K Input: $0.2006
Output: $0.408
Model: 0.100
Completion: 2.034
- - In: text
Out: text
Released: 2024-07-05
Gemini 2.0 Pro 1206 gemini-exp-1206 2.1M 8.2K Input: $1.258
Output: $4.998
Model: 0.629
Completion: 3.973
📎 - In: text, image
Out: text
Released: 2024-12-06
Claude 4.5 Opus Thinking claude-opus-4-5-20251101:thinking 200K 32K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-11-01
Qwen3.5 27B BlueStar v3 Derestricted Qwen3.5-27B-BlueStar-v3-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Claude 4 Opus Thinking claude-opus-4-thinking 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-07-15
Qwen3.5 Flash qwen3.5-flash 991.8K 65.5K Input: $0.09
Output: $0.36
Model: 0.045
Completion: 4.000
📎 - In: text, image, video
Out: text
Released: 2026-02-24
Exa (Research Pro) exa-research-pro 16.4K 16.4K Input: $2.5
Output: $2.5
Model: 1.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-06-04
Jamba Mini jamba-mini 256K 4.1K Input: $0.1989
Output: $0.408
Model: 0.099
Completion: 2.051
- - In: text
Out: text
Released: 2025-07-09
Claude 4.1 Opus Thinking (32K) claude-opus-4-1-thinking:32000 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
KAT Coder Air V1 KAT-Coder-Air-V1 128K 32.8K Input: $0.1
Output: $0.2
Model: 0.050
Completion: 2.000
- - In: text
Out: text
Released: 2025-10-28
Exa (Research) exa-research 8.2K 8.2K Input: $2.5
Output: $2.5
Model: 1.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-06-04
DeepClaude deepclaude 128K 8.2K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 - In: text, pdf
Out: text
Released: 2025-02-01
Gemini 2.5 Flash Preview (09/2025) gemini-2.5-flash-preview-09-2025 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-25
Claude 4.5 Opus claude-opus-4-5-20251101 200K 32K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-11-01
GLM 4.6 Derestricted v5 GLM-4.6-Derestricted-v5 131.1K 8.2K Input: $0.4
Output: $1.5
Model: 0.200
Completion: 3.750
- - In: text
Out: text
Released: 2025-12-23
GLM Z1 AirX glm-z1-airx 32K 16.4K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
🔧 - In: text
Out: text
Released: 2025-04-15
Claude 4 Sonnet Thinking (1K) claude-sonnet-4-thinking:1024 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Holo3-35B-A3B Thinking holo3-35b-a3b:thinking 65.5K 65.5K Input: $0.25
Output: $1.8
Model: 0.125
Completion: 7.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2024-01-01
OWL owl 1M 262.1K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 - In: text
Out: text
Released: 2026-05-01
Gemini 2.0 Pro Reasoner gemini-2.0-pro-reasoner 128K 65.5K Input: $1.292
Output: $4.998
Model: 0.646
Completion: 3.868
- - In: text
Out: text
Released: 2025-02-05
Gemma 4 31B Cognitive Unshackled Gemma-4-31B-Cognitive-Unshackled 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Qwen3.5 27B earica Derestricted Qwen3.5-27B-earica-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Cohere Command A+ (05/2026) command-a-plus-05-2026 128K 64K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2026-05-22
Auto model (Premium) auto-model-premium 1M 1M Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
- - In: text
Out: text
Released: 2024-06-01
Gemini LearnLM Experimental learnlm-1.5-pro-experimental 32.8K 8.2K Input: $3.502
Output: $10.506
Model: 1.751
Completion: 3.000
- - In: text
Out: text
Released: 2024-05-14
DeepSeek R1 Fast deepseek-r1-sambanova 128K 4.1K Input: $4.998
Output: $6.987
Model: 2.499
Completion: 1.398
- - In: text
Out: text
Released: 2025-02-20
Claw Medium claw-medium 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2026-05-11
Claude 4 Opus claude-opus-4-20250514 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-14
Yi Large yi-large 32K 4.1K Input: $3.196
Output: $3.196
Model: 1.598
Completion: 1.000
- - In: text
Out: text
Released: 2024-05-13
Qwen3 Max 2026-01-23 qwen3-max-2026-01-23 256K 32.8K Input: $1.2002
Output: $6.001
Model: 0.600
Completion: 5.000
- - In: text
Out: text
Released: 2026-01-26
Phi 4 Mini phi-4-mini-instruct 128K 16.4K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
- - In: text
Out: text
Released: 2025-07-26
Ernie X1 Turbo 32k ernie-x1-turbo-32k 32K 16.4K Input: $0.165
Output: $0.66
Model: 0.083
Completion: 4.000
📎 - In: text, image, pdf
Out: text
Released: 2025-05-08
Claw Low claw-low 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-11
Gemini 3 Pro Image gemini-3-pro-image-preview 1M 65.5K Input: $2
Output: $12
Model: 1.000
Completion: 6.000
📎 - In: text, image
Out: text
Released: 2025-11-18
Gemma 4 31B Garnet gemma-4-31B-Garnet 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Doubao 1.5 Vision Pro 32k doubao-1.5-vision-pro-32k 32K 8.2K Input: $0.459
Output: $1.377
Model: 0.230
Completion: 3.000
📎 - In: text, image
Out: text
Released: 2025-01-22
Auto model (Standard) auto-model-standard 1M 1M Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
- - In: text
Out: text
Released: 2024-06-01
Qwen3.5 27B Marvin DPO V2 Derestricted Lite Qwen3.5-27B-Marvin-DPO-V2-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
GLM-4 glm-4 128K 4.1K Input: $14.994
Output: $14.994
Model: 7.497
Completion: 1.000
- - In: text
Out: text
Released: 2024-01-16
Qwen3.5 27B Writer Derestricted Lite Qwen3.5-27B-Writer-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Qwen 3.6 Plus qwen-3.6-plus 991.8K 65.5K Input: $0.45
Output: $2.7
Model: 0.225
Completion: 6.000
- - In: text, image, video
Out: text
Released: 2026-04-02
Brave (Pro) brave-pro 8.2K 8.2K Input: $5
Output: $5
Model: 2.500
Completion: 1.000
- - In: text
Out: text
Released: 2023-03-02
Updated: 2024-01-01
DeepSeek V3/Chat Cheaper deepseek-chat-cheaper 128K 8.2K Input: $0.25
Output: $0.7
Model: 0.125
Completion: 2.800
📎 🔧 - In: text, pdf
Out: text
Released: 2025-04-15
Gemma 4 31B Queen Gemma-4-31B-Queen 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Gemma 4 31B Gemopus Gemma-4-31B-Gemopus 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Mistral Code Latest mistral-code-latest 256K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 - In: text
Out: text
Released: 2026-06-02
Gemini 2.5 Flash Lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-06-17
Jamba Mini 1.7 jamba-mini-1.7 256K 4.1K Input: $0.1989
Output: $0.408
Model: 0.099
Completion: 2.051
- - In: text
Out: text
Released: 2025-07-09
Universal Summarizer universal-summarizer 32.8K 32.8K Input: $30
Output: $30
Model: 15.000
Completion: 1.000
- - In: text
Out: text
Released: 2023-05-01
Updated: 2024-01-01
Qwen3.5 27B Anko Qwen3.5-27B-Anko 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Claude 4 Sonnet claude-sonnet-4-20250514 200K 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-29
Ernie X1 32k ernie-x1-32k-preview 32K 16.4K Input: $0.33
Output: $1.32
Model: 0.165
Completion: 4.000
- - In: text
Out: text
Released: 2025-04-03
Azure gpt-4-turbo azure-gpt-4-turbo 128K 4.1K Input: $9.996
Output: $30.005
Model: 4.998
Completion: 3.002
- - In: text
Out: text
Released: 2023-11-06
Updated: 2024-01-01
Llama 3.1 8B (decentralized) Meta-Llama-3-1-8B-Instruct-FP8 128K 16.4K Input: $0.02
Output: $0.03
Model: 0.010
Completion: 1.500
- - In: text
Out: text
Released: 2024-07-23
Claude Sonnet 4.5 Thinking claude-sonnet-4-5-20250929-thinking 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-29
ASI1 Mini asi1-mini 128K 16.4K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-03-25
Qwen3.5 27B qwen3.5-27b 260.1K 65.5K Input: $0.27
Output: $2.16
Model: 0.135
Completion: 8.000
📎 - In: text, image, video
Out: text
Released: 2026-02-24
Qwen3.5 27B Omega Evolution v2.2 Derestricted Lite Qwen3.5-27B-Omega-Evolution-v2.2-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Claude 4.1 Opus Thinking (32K) claude-opus-4-1-thinking:32768 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Gemma 4 31B K1 v5 gemma-4-31B-K1-v5 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Perplexity Simple sonar 127K 128K Input: $1.003
Output: $1.003
Model: 0.501
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-19
MiniMax M2 MiniMax-M2 200K 131.1K Input: $0.17
Output: $1.53
Model: 0.085
Completion: 9.000
🧠 - In: text
Out: text
Released: 2025-10-25
Sarvam 105B sarvam-105b 131.1K 4.1K Input: $0.045
Output: $0.177
Cache Read: $0.028
Model: 0.022
Completion: 3.933
Cache: 0.622
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Qwen3.5 27B Writer V2 Derestricted Lite Qwen3.5-27B-Writer-V2-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Qwen3.5 27B Marvin V2 Derestricted Qwen3.5-27B-Marvin-V2-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Claw High claw-high 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-11
Qwen 2.5 Max qwen-max 32K 8.2K Input: $1.5997
Output: $6.392
Model: 0.800
Completion: 3.996
- - In: text
Out: text
Released: 2024-04-03
Gemma 4 31B Fabled gemma-4-31B-Fabled 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
v0 1.5 MD v0-1.5-md 200K 64K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
- - In: text
Out: text
Released: 2025-07-04
Claude 4 Opus Thinking (1K) claude-opus-4-thinking:1024 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Gemini 2.5 Flash Preview gemini-2.5-flash-preview-04-17 1M 65.5K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-04-17
Gemini 2.5 Pro Experimental 0325 gemini-2.5-pro-exp-03-25 1M 65.5K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-03-25
Hermes Medium hermes-medium 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2026-05-11
Perplexity Pro sonar-pro 200K 128K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
- - In: text
Out: text
Released: 2025-02-19
Doubao 1.5 Thinking Pro doubao-1-5-thinking-pro-250415 128K 16.4K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
📎 - In: text, pdf
Out: text
Released: 2025-04-17
Qwen3.7 Max Thinking qwen3.7-max:thinking 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.25
Model: 1.250
Completion: 3.000
Cache: 0.100
🧠 - In: text
Out: text
Released: 2026-05-21
Ernie 4.5 Turbo VL 32k ernie-4.5-turbo-vl-32k 32K 16.4K Input: $0.495
Output: $1.43
Model: 0.247
Completion: 2.889
📎 - In: text, image
Out: text
Released: 2025-05-08
Gemini 2.5 Pro Preview 0506 gemini-2.5-pro-preview-05-06 1M 65.5K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-05-06
Hunyuan Turbo S hunyuan-turbos-20250226 24K 8.2K Input: $0.187
Output: $0.374
Model: 0.093
Completion: 2.000
- - In: text
Out: text
Released: 2025-02-27
Claude 4 Sonnet Thinking (64K) claude-sonnet-4-thinking:64000 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Ernie X1 32k ernie-x1-32k 32K 16.4K Input: $0.33
Output: $1.32
Model: 0.165
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-05-08
Cohere Command A (08/2025) command-a-reasoning-08-2025 256K 8.2K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
- - In: text
Out: text
Released: 2025-08-22
Doubao Seed 2.0 Code Preview doubao-seed-2-0-code-preview-260215 256K 128K Input: $0.782
Output: $3.893
Model: 0.391
Completion: 4.978
- - In: text
Out: text
Released: 2026-02-14
Doubao 1.5 Pro 256k doubao-1.5-pro-256k 256K 16.4K Input: $0.799
Output: $1.445
Model: 0.400
Completion: 1.809
- - In: text
Out: text
Released: 2025-03-12
Qwen3.5 35B A3B qwen3.5-35b-a3b 260.1K 65.5K Input: $0.225
Output: $1.8
Model: 0.113
Completion: 8.000
📎 - In: text, image, video
Out: text
Released: 2026-02-24
Qwen3.5 122B A10B Thinking qwen3.5-122b-a10b:thinking 260.1K 65.5K Input: $0.36
Output: $2.88
Model: 0.180
Completion: 8.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-24
Yi Lightning yi-lightning 12K 4.1K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2024-10-16
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 1M 64K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-29
DeepSeek Math V2 deepseek-math-v2 128K 65.5K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
- - In: text
Out: text
Released: 2025-12-03
Claude 4.1 Opus Thinking (8K) claude-opus-4-1-thinking:8192 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
DeepSeek Reasoner deepseek-reasoner 64K 65.5K Input: $0.4
Output: $1.7
Model: 0.200
Completion: 4.250
- - In: text
Out: text
Released: 2025-01-20
Gemini 2.5 Flash (No Thinking) gemini-2.5-flash-nothinking 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 - In: text, image, pdf
Out: text
Released: 2025-06-05
Qwen3.5 27B Omega Evolution v2.0 Derestricted Lite Qwen3.5-27B-Omega-Evolution-v2.0-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Web Answer fastgpt 32.8K 32.8K Input: $7.5
Output: $7.5
Model: 3.750
Completion: 1.000
- - In: text
Out: text
Released: 2023-08-01
Updated: 2024-01-01
Qwen3.5 27B Infracelestial Qwen3.5-27B-Infracelestial 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
GLM-4 Flash glm-4-flash 128K 4.1K Input: $0.1003
Output: $0.1003
Model: 0.050
Completion: 1.000
- - In: text
Out: text
Released: 2024-08-01
Azure gpt-4o-mini azure-gpt-4o-mini 128K 16.4K Input: $0.1496
Output: $0.595
Model: 0.075
Completion: 3.977
📎 🔧 - In: text, image
Out: text
Released: 2024-07-18
Perplexity Deep Research sonar-deep-research 60K 128K Input: $3.4
Output: $13.6
Model: 1.700
Completion: 4.000
- - In: text
Out: text
Released: 2025-02-25
Qwen: QwQ 32B qwq-32b 128K 32.8K Input: $0.256
Output: $0.305
Model: 0.128
Completion: 1.191
- - In: text
Out: text
Released: 2025-04-15
Mistral Code Agent Latest mistral-code-agent-latest 262.1K 32.8K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 - In: text
Out: text
Released: 2026-06-02
GLM 4.1V Thinking Flash glm-4.1v-thinking-flash 64K 8.2K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
📎 - In: text, image
Out: text
Released: 2025-07-09
Qwen3.5 Omni Flash qwen3.5-omni-flash 49.2K 16.4K Input: $0
Output: $0
- 📎 - In: text, image, video, audio
Out: text
Released: 2026-03-30
Gemma 4 31B MeroMero gemma-4-31B-MeroMero 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-02
Gemini 2.5 Flash Preview Thinking gemini-2.5-flash-preview-04-17:thinking 1M 65.5K Input: $0.15
Output: $3.5
Model: 0.075
Completion: 23.333
📎 🧠 - In: text, image
Out: text
Released: 2025-04-17
GLM-4 Air glm-4-air 128K 4.1K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2024-06-05
Doubao Seed 1.6 Thinking doubao-seed-1-6-thinking-250615 256K 16.4K Input: $0.204
Output: $2.04
Model: 0.102
Completion: 10.000
- - In: text
Out: text
Released: 2025-06-15
Auto model (Basic) auto-model-basic 1M 1M Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
- - In: text
Out: text
Released: 2024-06-01
Qwen3.5 27B RpRMax v1 Qwen3.5-27B-RpRMax-v1 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
ERNIE 5.1 ernie-5.1 119K 64K Input: $0.75
Output: $3
Cache Read: $0.75
Model: 0.375
Completion: 4.000
Cache: 1.000
📎 - In: text, image, video
Out: text
Released: 2026-05-10
Qwen3.5 27B Marvin DPO V2 Derestricted Qwen3.5-27B-Marvin-DPO-V2-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Azure gpt-4o azure-gpt-4o 128K 16.4K Input: $2.499
Output: $9.996
Model: 1.250
Completion: 4.000
📎 🔧 - In: text, image
Out: text
Released: 2024-05-13
DeepSeek V3/Deepseek Chat deepseek-chat 128K 8.2K Input: $0.25
Output: $0.7
Model: 0.125
Completion: 2.800
📎 🔧 - In: text, pdf
Out: text
Released: 2025-02-27
Gemini 2.5 Flash 0520 gemini-2.5-flash-preview-05-20 1M 65.5K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-05-20
Mercury 2 mercury-2 128K 50K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2024-01-01
Qwen3.7 Plus Thinking qwen3.7-plus:thinking 983.6K 65.5K Input: $0.4
Output: $1.6
Cache Read: $0.04
Model: 0.200
Completion: 4.000
Cache: 0.100
📎 🧠 - In: text, image, video
Out: text
Released: 2026-06-01
Gemini 2.0 Flash Thinking 1219 gemini-2.0-flash-thinking-exp-1219 32.8K 8.2K Input: $0.1003
Output: $0.408
Model: 0.050
Completion: 4.068
- - In: text
Out: text
Released: 2024-12-19
GLM 4 Plus 0111 glm-4-plus-0111 128K 4.1K Input: $9.996
Output: $9.996
Model: 4.998
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-19
Brave (Answers) brave 8.2K 8.2K Input: $5
Output: $5
Model: 2.500
Completion: 1.000
- - In: text
Out: text
Released: 2023-03-02
Updated: 2024-01-01
GLM Zero Preview glm-zero-preview 8K 4.1K Input: $1.802
Output: $1.802
Model: 0.901
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-01
Gemini 2.5 Flash Lite Preview gemini-2.5-flash-lite-preview-06-17 1M 65.5K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-06-17
Qwen3.5 27B Writer Derestricted Qwen3.5-27B-Writer-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
KAT Coder Exp 72B 1010 KAT-Coder-Exp-72B-1010 128K 32.8K Input: $0.1
Output: $0.2
Model: 0.050
Completion: 2.000
- - In: text
Out: text
Released: 2025-10-28
Gemini 2.5 Pro Preview 0605 gemini-2.5-pro-preview-06-05 1M 65.5K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🧠 - In: text, image
Out: text
Released: 2025-06-05
Qwen3.5 27B Musica v1 Qwen3.5-27B-Musica-v1 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-03-27
MiniMax M1 MiniMax-M1 1M 131.1K Input: $0.1394
Output: $1.3328
Model: 0.070
Completion: 9.561
- - In: text
Out: text
Released: 2025-06-16
Doubao 1.5 Thinking Pro Vision doubao-1-5-thinking-pro-vision-250415 128K 16.4K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-04-15
Qwen3.5 27B Thinking qwen3.5-27b:thinking 260.1K 65.5K Input: $0.27
Output: $2.16
Model: 0.135
Completion: 8.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-24
Qwen3.5 27B Omega Evolution v2.0 Derestricted Qwen3.5-27B-Omega-Evolution-v2.0-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Gemma 4 31B Garnet V2 Gemma-4-31B-GarnetV2 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-01
Qwen Turbo qwen-turbo 1M 8.2K Input: $0.04998
Output: $0.2006
Model: 0.025
Completion: 4.014
- - In: text
Out: text
Released: 2024-11-01
Phi 4 Multimodal phi-4-multimodal-instruct 128K 16.4K Input: $0.07
Output: $0.11
Model: 0.035
Completion: 1.571
- - In: text
Out: text
Released: 2025-07-26
Mistral Small 31 24b Instruct mistral-small-31-24b-instruct 128K 131.1K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 - In: text, image
Out: text
Released: 2025-04-15
Ernie 4.5 Turbo 128k ernie-4.5-turbo-128k 128K 16.4K Input: $0.132
Output: $0.55
Model: 0.066
Completion: 4.167
📎 - In: text, image
Out: text
Released: 2025-05-08
Qwen3.5 27B BlueStar Derestricted Qwen3.5-27B-BlueStar-Derestricted 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-06
Gemini 2.5 Flash Lite Preview (09/2025) – Thinking gemini-2.5-flash-lite-preview-09-2025-thinking 1M 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-25
Doubao Seed 1.8 doubao-seed-1-8-251215 128K 8.2K Input: $0.612
Output: $6.12
Model: 0.306
Completion: 10.000
- - In: text
Out: text
Released: 2025-12-15
Qwen3.6 Max Preview qwen3.6-max-preview 245.8K 65.5K Input: $1.3
Output: $7.8
Model: 0.650
Completion: 6.000
- - In: text
Out: text
Released: 2026-04-20
Updated: 2026-04-21
Exa (Answer) exa-answer 4.1K 4.1K Input: $2.5
Output: $2.5
Model: 1.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-06-04
Baichuan 4 Air Baichuan4-Air 32.8K 32.8K Input: $0.157
Output: $0.157
Model: 0.079
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-19
Qwen3.5 122B A10B qwen3.5-122b-a10b 260.1K 65.5K Input: $0.36
Output: $2.88
Model: 0.180
Completion: 8.000
📎 - In: text, image, video
Out: text
Released: 2026-02-24
Sarvam 30B sarvam-30b 65.5K 4.1K Input: $0.028
Output: $0.111
Cache Read: $0.017
Model: 0.014
Completion: 3.964
Cache: 0.607
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Claude 4.1 Opus Thinking claude-opus-4-1-thinking 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Claude 4 Opus Thinking (32K) claude-opus-4-thinking:32000 200K 32K Input: $14.994
Output: $75.004
Model: 7.497
Completion: 5.002
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-22
Auto model auto-model 1M 1M Input: $0
Output: $0
- - - In: text
Out: text
Released: 2024-06-01
Qwen3 VL 235B A22B Instruct Original qwen3-vl-235b-a22b-instruct-original 32.8K 32.8K Input: $0.5
Output: $1.2
Model: 0.250
Completion: 2.400
📎 - In: text, image
Out: text
Released: 2025-09-25
GLM Z1 Air glm-z1-air 32K 16.4K Input: $0.07
Output: $0.07
Model: 0.035
Completion: 1.000
🔧 - In: text
Out: text
Released: 2025-04-15
Qwen3.5 27B Queen Derestricted Lite Qwen3.5-27B-Queen-Derestricted-Lite 262.1K 16.4K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-30
Ling 2.6 1T inclusionai/ling-2.6-1t 262.1K 32.8K Input: $0.3
Output: $2.5
Cache Read: $0.06
Model: 0.150
Completion: 8.333
Cache: 0.200
🔧 - In: text
Out: text
Released: 2026-04-23
Ring 2.6 1T inclusionai/ring-2.6-1t 262.1K 65.5K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🧠 🔧 - In: text
Out: text
Released: 2026-05-08
Ling 2.6 Flash inclusionai/ling-2.6-flash 262.1K 32.8K Input: $0.08
Output: $0.24
Model: 0.040
Completion: 3.000
🔧 - In: text
Out: text
Released: 2026-04-21
Tongyi DeepResearch 30B A3B Alibaba-NLP/Tongyi-DeepResearch-30B-A3B 128K 65.5K Input: $0.08
Output: $0.24
Model: 0.040
Completion: 3.000
- - In: text
Out: text
Released: 2025-08-26
Granite 4.1 8B ibm-granite/granite-4.1-8b 131.1K 131.1K Input: $0.05
Output: $0.1
Cache Read: $0.05
Model: 0.025
Completion: 2.000
Cache: 1.000
🔧 - In: text
Out: text
Released: 2026-04-29
Llama-xLAM-2 70B fc-r Salesforce/Llama-xLAM-2-70b-fc-r 128K 16.4K Input: $2.5
Output: $2.5
Model: 1.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-13
GLM Z1 32B 0414 THUDM/GLM-Z1-32B-0414 128K 65.5K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-15
GLM 4 32B 0414 THUDM/GLM-4-32B-0414 128K 65.5K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-14
GLM 4 9B 0414 THUDM/GLM-4-9B-0414 32K 8K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-14
GLM Z1 9B 0414 THUDM/GLM-Z1-9B-0414 32K 8K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-14
Llama 3.1 8b Instruct meta-llama/llama-3.1-8b-instruct 131.1K 16.4K Input: $0.0544
Output: $0.0544
Model: 0.027
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-23
Llama 4 Maverick meta-llama/llama-4-maverick 1M 65.5K Input: $0.18
Output: $0.8
Model: 0.090
Completion: 4.444
📎 🔧 - In: text, image
Out: text
Released: 2025-09-05
Llama 3.3 70b Instruct meta-llama/llama-3.3-70b-instruct 131.1K 16.4K Input: $0.05
Output: $0.23
Model: 0.025
Completion: 4.600
🔧 - In: text
Out: text
Released: 2025-02-27
Llama 4 Scout meta-llama/llama-4-scout 328K 65.5K Input: $0.085
Output: $0.46
Model: 0.043
Completion: 5.412
📎 🔧 - In: text, image
Out: text
Released: 2025-09-05
Llama 3.2 3b Instruct meta-llama/llama-3.2-3b-instruct 131.1K 8.2K Input: $0.0306
Output: $0.0493
Model: 0.015
Completion: 1.611
📎 - In: text, pdf
Out: text
Released: 2024-09-25
Qwerky 72B featherless-ai/Qwerky-72B 32K 8.2K Input: $0.5
Output: $0.5
Model: 0.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-03-20
Kimi K2 0711 moonshotai/kimi-k2-instruct-0711 128K 8.2K Input: $0.1
Output: $2
Model: 0.050
Completion: 20.000
🔧 - In: text
Out: text
Released: 2025-07-11
Kimi K2 0905 moonshotai/Kimi-K2-Instruct-0905 256K 262.1K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 - In: text
Out: text
Released: 2025-09-25
Kimi K2 Thinking Original moonshotai/kimi-k2-thinking-original 256K 16.4K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🧠 - In: text
Out: text
Released: 2025-11-06
Kimi K2.5 Thinking moonshotai/kimi-k2.5:thinking 256K 65.5K Input: $0.3
Output: $1.9
Model: 0.150
Completion: 6.333
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-01-26
Kimi K2 Instruct moonshotai/kimi-k2-instruct 256K 8.2K Input: $0.1
Output: $2
Model: 0.050
Completion: 20.000
🔧 - In: text
Out: text
Released: 2025-07-01
Kimi K2 Thinking moonshotai/kimi-k2-thinking 256K 262.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 - In: text
Out: text
Released: 2025-11-06
Kimi K2.6 Thinking moonshotai/kimi-k2.6:thinking 256K 65.5K Input: $0.53
Output: $2.73
Model: 0.265
Completion: 5.151
📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-16
Updated: 2026-04-21
Kimi K2.5 moonshotai/kimi-k2.5 256K 65.5K Input: $0.3
Output: $1.9
Model: 0.150
Completion: 6.333
📎 🔧 - In: text, image
Out: text
Released: 2026-01-26
Kimi K2.6 moonshotai/kimi-k2.6 256K 65.5K Input: $0.53
Output: $2.73
Model: 0.265
Completion: 5.151
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-16
Updated: 2026-04-21
Kimi Latest moonshotai/kimi-latest 256K 65.5K Input: $0.5
Output: $2.6
Cache Read: $0.125
Model: 0.250
Completion: 5.200
Cache: 0.250
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-03
Kimi K2 Thinking Turbo Original moonshotai/kimi-k2-thinking-turbo-original 256K 16.4K Input: $1.15
Output: $8
Model: 0.575
Completion: 6.957
🧠 - In: text
Out: text
Released: 2025-11-06
ERNIE 4.5 VL 28B baidu/ernie-4.5-vl-28b-a3b 32.8K 16.4K Input: $0.14
Output: $0.56
Model: 0.070
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-06-30
Perceptron Mk1 perceptron/perceptron-mk1 32.8K 8.2K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-05-12
Llama 3 70B abliterated failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5 8.2K 8.2K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-26
Coding Router Max nanogpt/coding-router:max 1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Coding Router High nanogpt/coding-router:high 1M 128K Input: $1.1
Output: $2.2
Cache Read: $0.11
Model: 0.550
Completion: 2.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Coding Router Low nanogpt/coding-router:low 1M 128K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Coding Router Medium nanogpt/coding-router:medium 1M 128K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
Coding Router nanogpt/coding-router 1M 128K Input: $1.1
Output: $2.2
Cache Read: $0.11
Model: 0.550
Completion: 2.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-05-12
MN-LooseCannon-12B-v1 GalrionSoftworks/MN-LooseCannon-12B-v1 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-01
The Drummer Cydonia 24B v4.3 TheDrummer/Cydonia-24B-v4.3 32.8K 32.8K Input: $0.1003
Output: $0.1207
Model: 0.050
Completion: 1.203
- - In: text
Out: text
Released: 2025-12-25
The Drummer Cydonia 24B v4.1 TheDrummer/Cydonia-24B-v4.1 16.4K 32.8K Input: $0.1003
Output: $0.1207
Model: 0.050
Completion: 1.203
- - In: text
Out: text
Released: 2025-08-19
The Drummer Cydonia 24B v2 TheDrummer/Cydonia-24B-v2 16.4K 32.8K Input: $0.1003
Output: $0.1207
Model: 0.050
Completion: 1.203
- - In: text
Out: text
Released: 2025-02-17
Anubis 70B v1.1 TheDrummer/Anubis-70B-v1.1 131.1K 16.4K Input: $0.31
Output: $0.31
Model: 0.155
Completion: 1.000
- - In: text
Out: text
Released: 2024-01-01
UnslopNemo 12b v4 TheDrummer/UnslopNemo-12B-v4.1 32.8K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2024-01-01
TheDrummer Skyfall 36B V2 TheDrummer/skyfall-36b-v2 64K 32.8K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-03-10
Rocinante 12b TheDrummer/Rocinante-12B-v1.1 16.4K 8.2K Input: $0.408
Output: $0.595
Model: 0.204
Completion: 1.458
- - In: text
Out: text
Released: 2024-01-01
The Drummer Magidonia 24B v4.3 TheDrummer/Magidonia-24B-v4.3 32.8K 32.8K Input: $0.1003
Output: $0.1207
Model: 0.050
Completion: 1.203
- - In: text
Out: text
Released: 2025-12-25
The Drummer Cydonia 24B v4 TheDrummer/Cydonia-24B-v4 16.4K 32.8K Input: $0.2006
Output: $0.2414
Model: 0.100
Completion: 1.203
- - In: text
Out: text
Released: 2025-07-22
TheDrummer Skyfall 31B v4.2 TheDrummer/Skyfall-31B-v4.2 131.1K 16.4K Input: $0.55
Output: $0.8
Model: 0.275
Completion: 1.455
📎 - In: text, pdf
Out: text
Released: 2026-03-26
Anubis 70B v1 TheDrummer/Anubis-70B-v1 65.5K 16.4K Input: $0.31
Output: $0.31
Model: 0.155
Completion: 1.000
- - In: text
Out: text
Released: 2024-01-01
Qwen 2.5 32B Abliterated huihui-ai/Qwen2.5-32B-Instruct-abliterated 32.8K 8.2K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
- - In: text
Out: text
Released: 2025-01-06
Llama 3.3 70B Instruct abliterated huihui-ai/Llama-3.3-70B-Instruct-abliterated 16.4K 16.4K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-08
DeepSeek R1 Qwen Abliterated huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated 16.4K 8.2K Input: $1.4
Output: $1.4
Model: 0.700
Completion: 1.000
🧠 - In: text
Out: text
Released: 2025-01-20
DeepSeek R1 Llama 70B Abliterated huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated 16.4K 8.2K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
🧠 - In: text
Out: text
Released: 2025-01-20
Llama 3.05 Storybreaker Ministral 70b Envoid/Llama-3.05-NT-Storybreaker-Ministral-70B 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-01
Nemotron Tenyxchat Storybreaker 70b Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-01
Step 3.5 Flash 2603 stepfun-ai/step-3.5-flash-2603 256K 256K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🧠 - In: text
Out: text
Released: 2026-04-14
Step 3.5 Flash stepfun-ai/step-3.5-flash 256K 256K Input: $0.2
Output: $0.5
Model: 0.100
Completion: 2.500
🧠 - In: text
Out: text
Released: 2026-02-02
Mistral Medium 3.5 mistral/mistral-medium-3.5 256K 32.8K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-04-29
Mistral Medium 3.5 Thinking mistral/mistral-medium-3.5:thinking 256K 32.8K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-04-30
QwenLong L1 32B Tongyi-Zhiwen/QwenLong-L1-32B 128K 41K Input: $0.14
Output: $0.6
Model: 0.070
Completion: 4.286
- - In: text
Out: text
Released: 2025-01-25
Gemini 1.5 Flash google/gemini-flash-1.5 2M 8.2K Input: $0.0748
Output: $0.306
Model: 0.037
Completion: 4.091
- - In: text
Out: text
Released: 2024-05-14
Gemini 3.1 Pro (Preview High) google/gemini-3.1-pro-preview-high 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-02-21
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-03
Gemini 3 Flash Thinking google/gemini-3-flash-preview-thinking 1M 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 - In: text, image
Out: text
Released: 2025-12-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, audio
Out: text
Released: 2026-05-19
Gemma 4 31B google/gemma-4-31b-it 262.1K 131.1K Input: $0.1
Output: $0.35
Model: 0.050
Completion: 3.500
📎 🧠 - In: text, image
Out: text
Released: 2026-04-02
Gemini Pro Latest google/gemini-pro-latest 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-29
Gemini 3.1 Pro (Preview Custom Tools) google/gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-02-27
Gemini Flash Lite Latest google/gemini-flash-lite-latest 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-29
Gemma 4 26B A4B Thinking google/gemma-4-26b-a4b-it:thinking 262.1K 131.1K Input: $0.13
Output: $0.4
Model: 0.065
Completion: 3.077
📎 🧠 - In: text, image
Out: text
Released: 2026-04-02
Gemini 3.1 Pro (Preview) google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-02-19
Gemma 4 26B A4B google/gemma-4-26b-a4b-it 262.1K 131.1K Input: $0.13
Output: $0.4
Model: 0.065
Completion: 3.077
📎 🧠 - In: text, image
Out: text
Released: 2026-04-02
Gemini 3.1 Pro (Preview Low) google/gemini-3.1-pro-preview-low 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-02-21
Gemma 4 31B Thinking google/gemma-4-31b-it:thinking 262.1K 131.1K Input: $0.1
Output: $0.35
Model: 0.050
Completion: 3.500
📎 🧠 - In: text, image
Out: text
Released: 2026-04-02
Gemini 3.5 Flash Thinking google/gemini-3.5-flash-thinking 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 - In: text, image
Out: text
Released: 2026-05-19
Gemini 3 Flash (Preview) google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-12-17
Gemini Flash Latest google/gemini-flash-latest 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 - In: text, image
Out: text
Released: 2026-03-29
LFM2 24B A2B liquid/lfm-2-24b-a2b 32.8K 32.8K Input: $0.03
Output: $0.12
Model: 0.015
Completion: 4.000
- - In: text
Out: text
Released: 2025-12-20
Grok 4.20 x-ai/grok-4.20 2M 131.1K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-31
Grok 4.3 x-ai/grok-4.3 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-04-30
Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent 2M 131.1K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-31
Grok Latest x-ai/grok-latest 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-03
Grok Build 0.1 x-ai/grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-20
EVA Llama 3.33 70B EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0 16.4K 16.4K Input: $2.006
Output: $2.006
Model: 1.003
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-26
EVA-Qwen2.5-72B-v0.2 EVA-UNIT-01/EVA-Qwen2.5-72B-v0.2 16.4K 8.2K Input: $0.799
Output: $0.799
Model: 0.399
Completion: 1.000
- - In: text
Out: text
Released: 2025-09-25
EVA-LLaMA-3.33-70B-v0.1 EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1 16.4K 16.4K Input: $2.006
Output: $2.006
Model: 1.003
Completion: 1.000
- - In: text
Out: text
Released: 2025-09-25
EVA-Qwen2.5-32B-v0.2 EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2 16.4K 8.2K Input: $0.799
Output: $0.799
Model: 0.399
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-26
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b 65.5K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-04-15
MS3.2 24B Magnum Diamond Doctor-Shotgun/MS3.2-24B-Magnum-Diamond 16.4K 32.8K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2025-11-24
Laguna XS.2 poolside/laguna-xs.2 128K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
- - In: text
Out: text
Released: 2026-04-29
Laguna M.1 poolside/laguna-m.1 128K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
- - In: text
Out: text
Released: 2026-04-29
GLM 4.5V Thinking z-ai/glm-4.5v:thinking 64K 96K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 - In: text, image
Out: text
Released: 2025-11-22
GLM 4.6 Thinking z-ai/glm-4.6:thinking 200K 65.5K Input: $0.4
Output: $1.5
Model: 0.200
Completion: 3.750
🧠 🔧 - In: text
Out: text
Released: 2025-09-29
GLM 4.5V z-ai/glm-4.5v 64K 96K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 - In: text, image
Out: text
Released: 2025-11-22
GLM 4.6 z-ai/glm-4.6 200K 65.5K Input: $0.4
Output: $1.5
Model: 0.200
Completion: 3.750
🧠 🔧 - In: text
Out: text
Released: 2025-09-30
GLM 5V Turbo Thinking z-ai/glm-5v-turbo:thinking 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-04-02
GLM 5V Turbo z-ai/glm-5v-turbo 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
📎 🔧 - In: text, image
Out: text
Released: 2026-04-01
GLM 5 Turbo z-ai/glm-5-turbo 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
🔧 - In: text
Out: text
Released: 2026-03-15
OpenAI o3-mini (Low) openai/o3-mini-low 200K 100K Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
🧠 🔧 - In: text
Out: text
Released: 2025-01-31
GPT OSS Safeguard 20B openai/gpt-oss-safeguard-20b 128K 16.4K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🧠 - In: text
Out: text
Released: 2025-10-29
OpenAI o3 openai/o3 200K 100K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
- - In: text
Out: text
Released: 2025-04-16
OpenAI o4-mini high openai/o4-mini-high 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2025-04-16
OpenAI o3-pro (2025-06-10) openai/o3-pro-2025-06-10 200K 100K Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
🧠 🔧 - In: text
Out: text
Released: 2025-06-10
GPT 5.2 Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-01-01
GPT-4o mini Search Preview openai/gpt-4o-mini-search-preview 128K 16.4K Input: $0.088
Output: $0.35
Model: 0.044
Completion: 3.977
- - In: text
Out: text
Released: 2024-07-18
GPT 5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-08-07
GPT-3.5 Turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
- - In: text
Out: text
Released: 2022-11-30
Updated: 2024-01-01
GPT 5 Pro openai/gpt-5-pro 400K 128K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 - In: text, image
Out: text
Released: 2025-08-07
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.499
Output: $9.996
Model: 1.250
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2024-05-13
OpenAI o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2025-04-16
GPT 5.4 Nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-17
GPT 5.1 Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 - In: text, image
Out: text
Released: 2025-11-13
GPT 5.1 Codex Max openai/gpt-5.1-codex-max 400K 128K Input: $2.5
Output: $20
Model: 1.250
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-11-13
GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 128K 16.4K Input: $2.499
Output: $9.996
Model: 1.250
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2024-08-06
OpenAI o1-preview openai/o1-preview 128K 32.8K Input: $14.994
Output: $59.993
Model: 7.497
Completion: 4.001
🧠 - In: text
Out: text
Released: 2024-09-12
OpenAI o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Model: 0.550
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2025-01-31
GPT 5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-01-01
GPT 5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-24
GPT Latest openai/gpt-latest 1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-29
GPT 5.1 Codex Mini openai/gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 - In: text, image
Out: text
Released: 2025-11-13
OpenAI o4-mini Deep Research openai/o4-mini-deep-research 200K 100K Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
🧠 - In: text
Out: text
Released: 2025-04-16
GPT 4.1 Nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 - In: text, image, pdf
Out: text
Released: 2025-04-14
GPT OSS 120B openai/gpt-oss-120b 128K 16.4K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🧠 🔧 - In: text
Out: text
Released: 2025-08-05
GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2024-11-20
OpenAI o1 openai/o1 200K 100K Input: $14.994
Output: $59.993
Model: 7.497
Completion: 4.001
🧠 - In: text
Out: text
Released: 2024-12-17
OpenAI o1 Pro openai/o1-pro 200K 100K Input: $150
Output: $600
Model: 75.000
Completion: 4.000
📎 - In: text, image, pdf
Out: text
Released: 2025-01-25
GPT Chat Latest openai/gpt-chat-latest 400K 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-03
GPT 5.4 openai/gpt-5.4 922K 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-05
GPT 5.4 Mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-17
GPT 4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-10
OpenAI o3 Deep Research openai/o3-deep-research 200K 100K Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
🧠 - In: text
Out: text
Released: 2025-04-16
GPT-4 Turbo Preview openai/gpt-4-turbo-preview 128K 4.1K Input: $9.996
Output: $30.005
Model: 4.998
Completion: 3.002
- - In: text
Out: text
Released: 2023-11-06
Updated: 2024-01-01
GPT 5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 - In: text, image
Out: text
Released: 2025-08-07
GPT 4.1 Mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Model: 0.200
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-04-14
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 - In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-01-01
GPT 5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Model: 0.025
Completion: 8.000
📎 🧠 - In: text, image
Out: text
Released: 2025-08-07
GPT 5.4 Pro openai/gpt-5.4-pro 922K 128K Input: $30
Output: $180
Cache Read: $3
Model: 15.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-05
OpenAI o3-mini (High) openai/o3-mini-high 200K 100K Input: $0.64
Output: $2.588
Model: 0.320
Completion: 4.044
🧠 🔧 - In: text
Out: text
Released: 2025-01-31
GPT-4o Search Preview openai/gpt-4o-search-preview 128K 16.4K Input: $1.47
Output: $5.88
Model: 0.735
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2024-05-13
GPT-5.1 (2025-11-13) openai/gpt-5.1-2025-11-13 1M 32.8K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
- - In: text
Out: text
Released: 2025-11-13
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.1496
Output: $0.595
Model: 0.075
Completion: 3.977
📎 - In: text, image
Out: text
Released: 2024-07-18
GPT OSS 20B openai/gpt-oss-20b 128K 16.4K Input: $0.04
Output: $0.15
Model: 0.020
Completion: 3.750
🧠 - In: text
Out: text
Released: 2025-08-05
GPT-5 Codex openai/gpt-5-codex 256K 32.8K Input: $9.996
Output: $19.992
Model: 4.998
Completion: 2.000
- - In: text
Out: text
Released: 2025-09-15
GPT 5.2 Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-01-14
GPT 5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-11-13
GPT 5.5 openai/gpt-5.5 1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-23
MythoMax 13B Gryphe/MythoMax-L2-13b 4K 4.1K Input: $0.1003
Output: $0.1003
Model: 0.050
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-08
M-Prometheus 14B Unbabel/M-Prometheus-14B 32.8K 8.2K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2026-05-29
K2-Think LLM360/K2-Think 128K 32.8K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
- - In: text
Out: text
Released: 2025-07-26
Hermes 4 Large NousResearch/hermes-4-405b 128K 8.2K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
- - In: text
Out: text
Released: 2025-08-26
Hermes 3 70B NousResearch/hermes-3-llama-3.1-70b 65.5K 8.2K Input: $0.408
Output: $0.408
Model: 0.204
Completion: 1.000
- - In: text
Out: text
Released: 2026-01-07
Hermes 4 (Thinking) NousResearch/Hermes-4-70B:thinking 128K 8.2K Input: $0.2006
Output: $0.3995
Model: 0.100
Completion: 1.992
- - In: text
Out: text
Released: 2025-09-17
Hermes 4 Medium NousResearch/hermes-4-70b 128K 8.2K Input: $0.2006
Output: $0.3995
Model: 0.100
Completion: 1.992
- - In: text
Out: text
Released: 2025-07-03
DeepHermes-3 Mistral 24B (Preview) NousResearch/DeepHermes-3-Mistral-24B-Preview 128K 32.8K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
- - In: text
Out: text
Released: 2025-05-10
Hermes 4 Large (Thinking) NousResearch/hermes-4-405b:thinking 128K 8.2K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
- - In: text
Out: text
Released: 2024-01-01
Gemma 3 12B IT unsloth/gemma-3-12b-it 128K 131.1K Input: $0.272
Output: $0.272
Model: 0.136
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-03-10
Gemma 3 4B IT unsloth/gemma-3-4b-it 128K 8.2K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-03-10
Gemma 3 27B IT unsloth/gemma-3-27b-it 128K 96K Input: $0.2992
Output: $0.2992
Model: 0.150
Completion: 1.000
📎 - In: text, pdf
Out: text
Released: 2025-03-10
Lumimaid v0.2 NeverSleep/Lumimaid-v0.2-70B 16.4K 8.2K Input: $1
Output: $1.5
Model: 0.500
Completion: 1.500
- - In: text
Out: text
Released: 2024-07-01
Mixtral 8x7B mistralai/mixtral-8x7b-instruct-v0.1 32.8K 32.8K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-11
Mistral Small 4 119B Thinking mistralai/mistral-small-4-119b-2603:thinking 262.1K 16.4K Input: $0.4
Output: $1.4
Model: 0.200
Completion: 3.500
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-17
Mixtral 8x22B mistralai/mixtral-8x22b-instruct-v0.1 65.5K 32.8K Input: $0.9
Output: $0.9
Model: 0.450
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-11
Mistral Devstral Small 2505 mistralai/Devstral-Small-2505 32.8K 8.2K Input: $0.06
Output: $0.06
Model: 0.030
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-02
Ministral 8B mistralai/ministral-8b-2512 262.1K 32.8K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-04
Mistral Saba mistralai/mistral-saba 32K 32.8K Input: $0.1989
Output: $0.595
Model: 0.099
Completion: 2.991
- - In: text
Out: text
Released: 2025-02-17
Mistral Large 2411 mistralai/mistral-large 128K 256K Input: $2.006
Output: $6.001
Model: 1.003
Completion: 2.992
- - In: text
Out: text
Released: 2024-02-26
Mistral Medium 3.1 mistralai/mistral-medium-3.1 131.1K 32.8K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
- - In: text
Out: text
Released: 2025-09-05
Ministral 3B mistralai/ministral-3b-2512 131.1K 32.8K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-04
Ministral 3 14B mistralai/ministral-14b-instruct-2512 262.1K 32.8K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2025-12-02
Mistral Small 4 119B mistralai/mistral-small-4-119b-2603 262.1K 16.4K Input: $0.4
Output: $1.4
Model: 0.200
Completion: 3.500
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-16
Ministral 14B mistralai/ministral-14b-2512 262.1K 32.8K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-04
Mistral Large 3 675B mistralai/mistral-large-3-675b-instruct-2512 262.1K 256K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
📎 - In: text, image
Out: text
Released: 2025-12-02
Mistral Medium 3 mistralai/mistral-medium-3 131.1K 32.8K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 - In: text, image
Out: text
Released: 2025-09-25
Devstral 2 123B mistralai/devstral-2-123b-instruct-2512 262.1K 65.5K Input: $0.4
Output: $1.4
Model: 0.200
Completion: 3.500
- - In: text
Out: text
Released: 2025-12-09
Codestral 2508 mistralai/codestral-2508 256K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
- - In: text
Out: text
Released: 2025-08-01
Mistral Nemo mistralai/Mistral-Nemo-Instruct-2407 16.4K 8.2K Input: $0.1003
Output: $0.1207
Model: 0.050
Completion: 1.203
- - In: text
Out: text
Released: 2024-07-18
ByteDance Seed 2.0 Lite bytedance-seed/seed-2.0-lite 262.1K 131.1K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
- - In: text
Out: text
Released: 2026-03-10
Magnum V2 72B anthracite-org/magnum-v2-72b 16.4K 8.2K Input: $2.006
Output: $2.992
Model: 1.003
Completion: 1.492
- - In: text
Out: text
Released: 2024-07-01
Magnum v4 72B anthracite-org/magnum-v4-72b 16.4K 8.2K Input: $2.006
Output: $2.992
Model: 1.003
Completion: 1.492
📎 - In: text, pdf
Out: text
Released: 2025-01-01
Mag Mell R1 inflatebot/MN-12B-Mag-Mell-R1 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-01
Nvidia Nemotron 3 Nano Omni nvidia/nemotron-3-nano-omni-30b-a3b-reasoning 256K 65.5K Input: $0.105
Output: $0.42
Model: 0.052
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-28
Nvidia Nemotron 70b nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 16.4K 8.2K Input: $0.357
Output: $0.408
Model: 0.178
Completion: 1.143
🌡️ - In: text
Out: text
Released: 2025-04-15
Nvidia Nemotron Nano 9B v2 nvidia/nvidia-nemotron-nano-9b-v2 128K 16.4K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ - In: text
Out: text
Released: 2025-08-18
Nvidia Nemotron Super 49B v1.5 nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 128K 16.4K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🌡️ - In: text
Out: text
Released: 2025-08-08
Nvidia Nemotron 3 Nano 30B nvidia/nemotron-3-nano-30b-a3b 256K 262.1K Input: $0.17
Output: $0.68
Model: 0.085
Completion: 4.000
🌡️ - In: text
Out: text
Released: 2025-12-15
Nvidia Nemotron Super 49B nvidia/Llama-3.3-Nemotron-Super-49B-v1 128K 16.4K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🌡️ - In: text
Out: text
Released: 2025-08-08
Nvidia Nemotron 3 Super 120B Thinking nvidia/nemotron-3-super-120b-a12b:thinking 262.1K 16.4K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-01
Nvidia Nemotron 3 Super 120B nvidia/nemotron-3-super-120b-a12b 262.1K 16.4K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-01
Dolphin 72b cognitivecomputations/dolphin-2.9.2-qwen2-72b 8.2K 4.1K Input: $0.306
Output: $0.306
Model: 0.153
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-27
MiMo V2 Flash Original xiaomi/mimo-v2-flash-original 256K 32.8K Input: $0.102
Output: $0.306
Model: 0.051
Completion: 3.000
🧠 - In: text
Out: text
Released: 2025-12-17
MiMo V2 Flash (Thinking) Original xiaomi/mimo-v2-flash-thinking-original 256K 32.8K Input: $0.102
Output: $0.306
Model: 0.051
Completion: 3.000
🧠 - In: text
Out: text
Released: 2025-12-17
MiMo V2.5 xiaomi/mimo-v2.5 1M 131.1K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🧠 🔧 - In: text, image, video
Out: text
Released: 2026-04-22
MiMo V2 Omni xiaomi/mimo-v2-omni 262.1K 65.5K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2026-03-19
MiMo V2 Flash xiaomi/mimo-v2-flash 256K 32.8K Input: $0.102
Output: $0.306
Model: 0.051
Completion: 3.000
🧠 - In: text
Out: text
Released: 2025-12-17
MiMo V2 Pro xiaomi/mimo-v2-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-03-19
MiMo V2.5 Pro xiaomi/mimo-v2.5-pro 1M 131.1K Input: $0.435
Output: $0.87
Cache Read: $0.0036
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 - In: text
Out: text
Released: 2026-04-22
MiMo V2 Flash (Thinking) xiaomi/mimo-v2-flash-thinking 256K 32.8K Input: $0.102
Output: $0.306
Model: 0.051
Completion: 3.000
🧠 - In: text
Out: text
Released: 2025-12-17
Claude Haiku Latest anthropic/claude-haiku-latest 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-29
Claude 4.7 Opus Thinking anthropic/claude-opus-4.7:thinking 1M 128K Input: $4.998
Output: $25.007
Cache Read: $0.4998
Model: 2.499
Completion: 5.003
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-16
Claude 4.6 Opus Thinking Max anthropic/claude-opus-4.6:thinking:max 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Claude 4.6 Opus Thinking Low anthropic/claude-opus-4.6:thinking:low 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Sonnet 4.6 Thinking anthropic/claude-sonnet-4.6:thinking 1M 128K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Claude 4.7 Opus anthropic/claude-opus-4.7 1M 128K Input: $4.998
Output: $25.007
Cache Read: $0.4998
Model: 2.499
Completion: 5.003
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-16
Claude 4.6 Opus Thinking Medium anthropic/claude-opus-4.6:thinking:medium 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Opus 4.8 Thinking anthropic/claude-opus-4.8:thinking 1M 128K Input: $4.998
Output: $25.007
Cache Read: $0.4998
Model: 2.499
Completion: 5.003
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Sonnet Latest anthropic/claude-sonnet-latest 1M 128K Input: $2.992
Output: $14.994
Cache Read: $0.2992
Model: 1.496
Completion: 5.011
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-01
Claude Opus 4.8 anthropic/claude-opus-4.8 1M 128K Input: $4.998
Output: $25.007
Cache Read: $0.4998
Model: 2.499
Completion: 5.003
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus Latest anthropic/claude-opus-latest 1M 128K Input: $4.998
Output: $25.007
Cache Read: $0.4998
Model: 2.499
Completion: 5.003
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-29
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 128K Input: $2.992
Output: $14.994
Model: 1.496
Completion: 5.011
📎 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Claude 4.6 Opus Thinking anthropic/claude-opus-4.6:thinking 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Claude 4.6 Opus anthropic/claude-opus-4.6 1M 128K Input: $4.998
Output: $25.007
Model: 2.499
Completion: 5.003
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Hunyuan MT 7B tencent/Hunyuan-MT-7B 8.2K 8.2K Input: $10
Output: $20
Model: 5.000
Completion: 2.000
- - In: text
Out: text
Released: 2025-09-18
Tencent: Hy3 preview tencent/hy3-preview 262.1K 262.1K Input: $0.066
Output: $0.26
Cache Read: $0.029
Model: 0.033
Completion: 3.939
Cache: 0.439
- - In: text
Out: text
Released: 2026-04-23
Mistral Small 3.2 24b Instruct chutesai/Mistral-Small-3.2-24B-Instruct-2506 128K 131.1K Input: $0.2
Output: $0.4
Model: 0.100
Completion: 2.000
- - In: text
Out: text
Released: 2025-04-15
DMind-1-Mini dmind/dmind-1-mini 32.8K 8.2K Input: $0.2
Output: $0.4
Model: 0.100
Completion: 2.000
- - In: text
Out: text
Released: 2025-06-01
DMind-1 dmind/dmind-1 32.8K 8.2K Input: $0.3
Output: $0.6
Model: 0.150
Completion: 2.000
- - In: text
Out: text
Released: 2025-06-01
NemoMix 12B Unleashed MarinaraSpaghetti/NemoMix-Unleashed-12B 32.8K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-01
Cogito v1 Preview Qwen 32B deepcogito/cogito-v1-preview-qwen-32B 128K 32.8K Input: $1.8
Output: $1.8
Model: 0.900
Completion: 1.000
- - In: text
Out: text
Released: 2025-05-10
Cohere: Command R cohere/command-r 128K 4.1K Input: $0.476
Output: $1.428
Model: 0.238
Completion: 3.000
- - In: text
Out: text
Released: 2024-03-11
Cohere: Command R+ cohere/command-r-plus-08-2024 128K 4.1K Input: $2.856
Output: $14.246
Model: 1.428
Completion: 4.988
🔧 - In: text
Out: text
Released: 2024-08-30
Step 3.7 Flash Thinking stepfun/step-3.7-flash:thinking 256K 256K Input: $0.2
Output: $1.15
Cache Read: $0.04
Model: 0.100
Completion: 5.750
Cache: 0.200
📎 🧠 🔧 - In: text, image, video
Out: text
Released: 2026-05-29
DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 128K 8.2K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
- - In: text
Out: text
Released: 2025-12-10
ReMM SLERP 13B undi95/remm-slerp-l2-13b 6.1K 4.1K Input: $0.799
Output: $1.207
Model: 0.399
Completion: 1.511
📎 - In: text, pdf
Out: text
Released: 2025-01-01
Llama 3.1 70B Celeste v0.1 nothingiisreal/L3.1-70B-Celeste-V0.1-BF16 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-23
GLM 4.7 Flash Original Thinking zai-org/glm-4.7-flash-original:thinking 200K 128K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 - In: text
Out: text
Released: 2026-01-19
GLM 4.7 zai-org/glm-4.7 200K 128K Input: $0.15
Output: $0.8
Model: 0.075
Completion: 5.333
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-01-29
GLM 4.5 Air (Thinking) zai-org/GLM-4.5-Air:thinking 128K 98.3K Input: $0.12
Output: $0.8
Model: 0.060
Completion: 6.667
🧠 🔧 - In: text
Out: text
Released: 2024-01-01
GLM 4.5 (Thinking) zai-org/GLM-4.5:thinking 128K 65.5K Input: $0.3
Output: $1.3
Model: 0.150
Completion: 4.333
🧠 - In: text
Out: text
Released: 2024-01-01
GLM 4.5 zai-org/glm-4.5 128K 65.5K Input: $0.3
Output: $1.3
Model: 0.150
Completion: 4.333
- - In: text
Out: text
Released: 2025-04-15
GLM 5 Original zai-org/glm-5-original 200K 128K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-02-11
GLM 5.1 zai-org/glm-5.1 200K 131.1K Input: $0.3
Output: $2.55
Model: 0.150
Completion: 8.500
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM 4.7 Original zai-org/glm-4.7-original 200K 65.5K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 - In: text
Out: text
Released: 2025-12-22
GLM 4.7 Original Thinking zai-org/glm-4.7-original:thinking 200K 65.5K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 - In: text
Out: text
Released: 2025-12-22
GLM 4.7 Flash Original zai-org/glm-4.7-flash-original 200K 128K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 🔧 - In: text
Out: text
Released: 2026-01-19
GLM 4.7 Thinking zai-org/glm-4.7:thinking 200K 65.5K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2025-12-22
GLM 5.1 Thinking zai-org/glm-5.1:thinking 200K 131.1K Input: $0.3
Output: $2.55
Model: 0.150
Completion: 8.500
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM 5 Thinking zai-org/glm-5:thinking 200K 128K Input: $0.3
Output: $2.55
Model: 0.150
Completion: 8.500
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM 4.6V zai-org/glm-4.6v 128K 24K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 - In: text, image
Out: text
Released: 2025-12-11
GLM 4.6 Original zai-org/glm-4.6-original 256K 65.5K Input: $0.35
Output: $1.4
Model: 0.175
Completion: 4.000
🧠 - In: text
Out: text
Released: 2025-12-11
GLM 5 Original Thinking zai-org/glm-5-original:thinking 200K 128K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-02-11
GLM 4.7 Flash Thinking zai-org/glm-4.7-flash:thinking 200K 128K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 - In: text
Out: text
Released: 2026-01-19
GLM 4.6V Flash zai-org/glm-4.6v-flash-original 128K 24K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 - In: text, image, video
Out: text
Released: 2025-12-08
GLM 4.6V Original zai-org/glm-4.6v-original 128K 24K Input: $0.6
Output: $0.9
Model: 0.300
Completion: 1.500
📎 - In: text, image
Out: text
Released: 2025-12-08
GLM Latest zai-org/glm-latest 200K 131.1K Input: $0.75
Output: $2.6
Cache Read: $0.15
Model: 0.375
Completion: 3.467
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-05-03
GLM 4.7 Flash zai-org/glm-4.7-flash 200K 128K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-01-19
GLM 4.6 Turbo (Thinking) zai-org/GLM-4.6-turbo:thinking 200K 204.8K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🧠 - In: text
Out: text
Released: 2025-10-02
GLM 4.5 Air zai-org/GLM-4.5-Air 128K 98.3K Input: $0.12
Output: $0.8
Model: 0.060
Completion: 6.667
🔧 - In: text
Out: text
Released: 2025-04-15
GLM 4.6 Turbo zai-org/GLM-4.6-turbo 200K 204.8K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
- - In: text
Out: text
Released: 2025-10-02
GLM 5 zai-org/glm-5 200K 128K Input: $0.3
Output: $2.55
Model: 0.150
Completion: 8.500
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-11
Mistral Nemo Inferor 12B Infermatic/MN-12B-Inferor-v0.0 16.4K 8.2K Input: $0.255
Output: $0.493
Model: 0.127
Completion: 1.933
- - In: text
Out: text
Released: 2024-07-01
Shisa V2.1 Llama 3.3 70B shisa-ai/shisa-v2.1-llama3.3-70b 32.8K 4.1K Input: $0.5
Output: $0.5
Model: 0.250
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Shisa V2 Llama 3.3 70B shisa-ai/shisa-v2-llama3.3-70b 128K 16.4K Input: $0.5
Output: $0.5
Model: 0.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-26
Llama 3.1 70B Dracarys 2 abacusai/Dracarys-72B-Instruct 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-02
DeepSeek V3.1 Thinking deepseek-ai/DeepSeek-V3.1:thinking 128K 65.5K Input: $0.2
Output: $0.7
Model: 0.100
Completion: 3.500
- - In: text
Out: text
Released: 2025-08-21
DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus 128K 65.5K Input: $0.25
Output: $0.7
Model: 0.125
Completion: 2.800
🔧 - In: text
Out: text
Released: 2025-08-02
DeepSeek V3.2 Exp Thinking deepseek-ai/deepseek-v3.2-exp-thinking 163.8K 65.5K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
🧠 - In: text
Out: text
Released: 2025-09-29
DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 128K 163.8K Input: $0.4
Output: $1.7
Model: 0.200
Completion: 4.250
🧠 - In: text
Out: text
Released: 2025-05-28
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 128K 65.5K Input: $0.2
Output: $0.7
Model: 0.100
Completion: 3.500
📎 - In: text, pdf
Out: text
Released: 2025-07-26
DeepSeek V3.2 Exp deepseek-ai/deepseek-v3.2-exp 163.8K 65.5K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
- - In: text
Out: text
Released: 2025-09-29
DeepSeek V3.1 Terminus (Thinking) deepseek-ai/DeepSeek-V3.1-Terminus:thinking 128K 65.5K Input: $0.25
Output: $0.7
Model: 0.125
Completion: 2.800
🔧 - In: text
Out: text
Released: 2025-09-22
Trinity Large Thinking arcee-ai/trinity-large-thinking 262.1K 80K Input: $0.25
Output: $0.9
Model: 0.125
Completion: 3.600
🧠 🔧 - In: text
Out: text
Released: 2026-04-01
Trinity Mini arcee-ai/trinity-mini 131.1K 8.2K Input: $0.045
Output: $0.15
Model: 0.023
Completion: 3.333
- - In: text
Out: text
Released: 2025-12-01
Grayline Qwen3 8B soob3123/GrayLine-Qwen3-8B 16.4K 32.8K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
- - In: text
Out: text
Released: 2025-09-25
Veiled Calla 12B soob3123/Veiled-Calla-12B 32.8K 8.2K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
- - In: text
Out: text
Released: 2025-04-13
Amoral Gemma3 27B v2 soob3123/amoral-gemma3-27B-v2 32.8K 8.2K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
- - In: text
Out: text
Released: 2025-05-23
Manta Mini 1.0 meganova-ai/manta-mini-1.0 8.2K 8.2K Input: $0.02
Output: $0.16
Model: 0.010
Completion: 8.000
- - In: text
Out: text
Released: 2025-12-20
Manta Flash 1.0 meganova-ai/manta-flash-1.0 16.4K 16.4K Input: $0.02
Output: $0.16
Model: 0.010
Completion: 8.000
- - In: text
Out: text
Released: 2025-12-20
Manta Pro 1.0 meganova-ai/manta-pro-1.0 32.8K 32.8K Input: $0.06
Output: $0.5
Model: 0.030
Completion: 8.333
- - In: text
Out: text
Released: 2025-12-20
Qwen3.6 35B A3B Thinking qwen/Qwen3.6-35B-A3B:thinking 262.1K 16.4K Input: $0.112
Output: $0.8
Model: 0.056
Completion: 7.143
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-19
Qwen3 Coder Plus qwen/qwen3-coder-plus 128K 65.5K Input: $1
Output: $5
Model: 0.500
Completion: 5.000
- - In: text
Out: text
Released: 2025-09-17
Qwen3.6 35B A3B qwen/Qwen3.6-35B-A3B 262.1K 16.4K Input: $0.112
Output: $0.8
Model: 0.056
Completion: 7.143
📎 - In: text, image, video
Out: text
Released: 2026-04-17
Qwen 3 32b qwen/qwen3-32b 41K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 - In: text, pdf
Out: text
Released: 2024-01-01
Qwen3 VL 235B A22B Instruct qwen/Qwen3-VL-235B-A22B-Instruct 128K 262.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
📎 - In: text, image
Out: text
Released: 2024-01-01
Qwen3 Max qwen/qwen3-max 256K 32.8K Input: $1.08018
Output: $5.4009
Model: 0.540
Completion: 5.000
- - In: text
Out: text
Released: 2025-09-05
Qwen3.5 397B A17B Thinking qwen/qwen3.5-397b-a17b-thinking 258K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-16
Qwen3 Next 80B A3B (Instruct) qwen/Qwen3-Next-80B-A3B-Instruct 256K 262.1K Input: $0.15
Output: $0.65
Model: 0.075
Completion: 4.333
🔧 - In: text
Out: text
Released: 2025-09-11
Qwen 3 235b A22B 2507 (TEE) qwen/Qwen3-235B-A22B-Instruct-2507-TEE 256K 262.1K Input: $0.13
Output: $0.5
Model: 0.065
Completion: 3.846
🔧 - In: text
Out: text
Released: 2025-07-25
Qwen QwQ 32B Preview qwen/qwq-32b-preview 32.8K 32.8K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-27
Qwen3 Next 80B A3B (Thinking) qwen/qwen3-next-80b-a3b-thinking 256K 32.8K Input: $0.15
Output: $0.65
Model: 0.075
Completion: 4.333
- - In: text
Out: text
Released: 2024-01-01
Qwen 3 Coder 480B qwen/qwen3-coder 262K 65.5K Input: $0.13
Output: $0.5
Model: 0.065
Completion: 3.846
🔧 - In: text
Out: text
Released: 2026-03-17
Qwen 3 235b A22B 2507 Thinking qwen/Qwen3-235B-A22B-Thinking-2507 256K 262.1K Input: $0.3
Output: $0.5
Model: 0.150
Completion: 1.667
- - In: text
Out: text
Released: 2025-09-11
Qwen3.5 Plus qwen/qwen3.5-plus 983.6K 65.5K Input: $0.4
Output: $2.4
Cache Read: $0.04
Model: 0.200
Completion: 6.000
Cache: 0.100
📎 - In: text, image, video
Out: text
Released: 2026-02-16
Qwen3.5 Plus Thinking qwen/qwen3.5-plus-thinking 983.6K 65.5K Input: $0.4
Output: $2.4
Cache Read: $0.04
Model: 0.200
Completion: 6.000
Cache: 0.100
📎 🧠 - In: text, image, video
Out: text
Released: 2026-02-16
Qwen 3 235b A22B qwen/qwen3-235b-a22b 41K 32.8K Input: $0.3
Output: $0.5
Model: 0.150
Completion: 1.667
📎 🔧 - In: text, pdf
Out: text
Released: 2025-04-29
Qwen2.5 72B qwen/qwen-2.5-72b-instruct 131.1K 8.2K Input: $0.357
Output: $0.408
Model: 0.178
Completion: 1.143
- - In: text
Out: text
Released: 2025-07-03
Qwen3 Coder Next qwen/qwen3-coder-next 262.1K 65.5K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
🔧 - In: text
Out: text
Released: 2025-12-08
Qwen3.5 9B qwen/qwen3.5-9b 256K 65.5K Input: $0.05
Output: $0.15
Model: 0.025
Completion: 3.000
📎 🧠 - In: text, image
Out: text
Released: 2026-03-10
Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b 258K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
- - In: text, image, video
Out: text
Open Weights
Released: 2026-02-16
Qwen3 Coder Flash qwen/qwen3-coder-flash 128K 65.5K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
- - In: text
Out: text
Released: 2025-09-17
Qwen 2.5 Coder 32b qwen/Qwen2.5-Coder-32B-Instruct 32K 8.2K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-03
Qwen 3 8B qwen/Qwen3-8B 41K 32.8K Input: $0.47
Output: $0.47
Model: 0.235
Completion: 1.000
- - In: text
Out: text
Released: 2024-01-01
Qwen 3 14b qwen/qwen3-14b 41K 32.8K Input: $0.08
Output: $0.24
Model: 0.040
Completion: 3.000
- - In: text
Out: text
Released: 2024-01-01
Qwen 3 235b A22B 2507 qwen/Qwen3-235B-A22B-Instruct-2507 256K 262.1K Input: $0.13
Output: $0.5
Model: 0.065
Completion: 3.846
🔧 - In: text
Out: text
Released: 2025-07-25
Qwen3 30B A3B qwen/qwen3-30b-a3b 41K 32.8K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
- - In: text
Out: text
Released: 2025-02-27
Amazon Nova Lite 1.0 amazon/nova-lite-v1 300K 5.1K Input: $0.0595
Output: $0.238
Model: 0.030
Completion: 4.000
- - In: text
Out: text
Released: 2024-12-03
Amazon Nova Pro 1.0 amazon/nova-pro-v1 300K 32K Input: $0.799
Output: $3.196
Model: 0.399
Completion: 4.000
- - In: text
Out: text
Released: 2024-12-03
Amazon Nova Micro 1.0 amazon/nova-micro-v1 128K 5.1K Input: $0.0357
Output: $0.1394
Model: 0.018
Completion: 3.905
- - In: text
Out: text
Released: 2024-12-03
Amazon Nova 2 Lite amazon/nova-2-lite-v1 1M 65.5K Input: $0.51
Output: $4.25
Model: 0.255
Completion: 8.333
- - In: text
Out: text
Released: 2024-12-03
Qwen3.6 Flash alibaba/qwen3.6-flash 991.8K 65.5K Input: $0.19
Output: $1.16
Model: 0.095
Completion: 6.105
- - In: text, image, video
Out: text
Released: 2026-04-17
Qwen3.6 27B alibaba/qwen3.6-27b 260.1K 65.5K Input: $0.203
Output: $2.24
Model: 0.102
Completion: 11.034
📎 - In: text, image, video
Out: text
Released: 2026-04-23
Qwen3.6 27B Thinking alibaba/qwen3.6-27b:thinking 260.1K 65.5K Input: $0.203
Output: $2.24
Model: 0.102
Completion: 11.034
📎 🧠 - In: text, image, video
Out: text
Released: 2026-04-23
Llama 3.1 8b (uncensored) aion-labs/aion-rp-llama-3.1-8b 32.8K 16.4K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-23
AionLabs: Aion-2.5 aion-labs/aion-2.5 131.1K 32.8K Input: $1
Output: $3
Cache Read: $0.35
Model: 0.500
Completion: 3.000
Cache: 0.350
- - In: text
Out: text
Released: 2026-03-20
Aion 1.0 mini (DeepSeek) aion-labs/aion-1.0-mini 131.1K 8.2K Input: $0.799
Output: $1.394
Model: 0.399
Completion: 1.745
- - In: text
Out: text
Released: 2025-02-20
AionLabs: Aion-2.0 aion-labs/aion-2.0 131.1K 32.8K Input: $0.8
Output: $1.6
Model: 0.400
Completion: 2.000
- - In: text
Out: text
Released: 2026-02-23
Aion 1.0 aion-labs/aion-1.0 65.5K 8.2K Input: $3.995
Output: $7.99
Model: 1.998
Completion: 2.000
- - In: text
Out: text
Released: 2025-02-01
OpenReasoning Nemotron 32B pamanseau/OpenReasoning-Nemotron-32B 32.8K 65.5K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
🧠 - In: text
Out: text
Released: 2025-08-21
Llama 3.3 70B Wayfarer LatitudeGames/Wayfarer-Large-70B-Llama-3.3 16.4K 16.4K Input: $0.700000007
Output: $0.700000007
Model: 0.350
Completion: 1.000
- - In: text
Out: text
Released: 2025-02-20
Kimi K2 0711 Instruct FP4 baseten/Kimi-K2-Instruct-FP4 128K 131.1K Input: $0.1
Output: $2
Model: 0.050
Completion: 20.000
- - In: text
Out: text
Released: 2025-07-11
Inflection 3 Pi inflection/inflection-3-pi 8K 4.1K Input: $2.499
Output: $9.996
Model: 1.250
Completion: 4.000
- - In: text
Out: text
Released: 2024-10-11
Inflection 3 Productivity inflection/inflection-3-productivity 8K 4.1K Input: $2.499
Output: $9.996
Model: 1.250
Completion: 4.000
- - In: text
Out: text
Released: 2024-10-11
Omega Directive 24B Unslop v2.0 ReadyArt/MS3.2-The-Omega-Directive-24B-Unslop-v2.0 16.4K 32.8K Input: $0.5
Output: $0.5
Model: 0.250
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-08
MiniMax M1 80K MiniMaxAI/MiniMax-M1-80k 1M 131.1K Input: $0.6052
Output: $2.4225
Model: 0.303
Completion: 4.003
- - In: text
Out: text
Released: 2025-06-16
Solar Pro 3 upstage/solar-pro-3 128K 128K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
- - In: text
Out: text
Released: 2026-03-03
Olmo 3 32B Think allenai/olmo-3-32b-think 128K 8.2K Input: $0.3
Output: $0.45
Model: 0.150
Completion: 1.500
🧠 - In: text
Out: text
Released: 2025-11-01
RNJ-1 Instruct 8B essentialai/rnj-1-instruct 128K 8.2K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
- - In: text
Out: text
Released: 2025-12-13
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-04-24
DeepSeek V4 Flash (Thinking) deepseek/deepseek-v4-flash:thinking 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-04-24
DeepSeek V4 Pro Cheaper (Thinking) deepseek/deepseek-v4-pro-cheaper:thinking 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 - In: text
Out: text
Released: 2026-04-25
DeepSeek Prover v2 671B deepseek/deepseek-prover-v2-671b 160K 16.4K Input: $1
Output: $2.5
Model: 0.500
Completion: 2.500
- - In: text
Out: text
Released: 2025-04-30
DeepSeek Latest deepseek/deepseek-latest 1M 384K Input: $1.1
Output: $2.2
Cache Read: $0.11
Model: 0.550
Completion: 2.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-05-03
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $1.1
Output: $2.2
Cache Read: $0.11
Model: 0.550
Completion: 2.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-04-24
DeepSeek V4 Pro (Thinking) deepseek/deepseek-v4-pro:thinking 1M 384K Input: $1.1
Output: $2.2
Cache Read: $0.11
Model: 0.550
Completion: 2.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-04-24
DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale 163K 65.5K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
📎 🧠 - In: text, pdf
Out: text
Released: 2025-12-02
DeepSeek V4 Pro Cheaper deepseek/deepseek-v4-pro-cheaper 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 - In: text
Out: text
Released: 2026-04-25
DeepSeek V3.2 Thinking deepseek/deepseek-v3.2:thinking 163K 65.5K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
📎 🧠 🔧 - In: text, pdf
Out: text
Released: 2025-12-01
DeepSeek V3.2 deepseek/deepseek-v3.2 163K 65.5K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
📎 🔧 - In: text, pdf
Out: text
Released: 2025-12-01
MiniMax M3 Thinking minimax/minimax-m3:thinking 512K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-06-01
MiniMax M2.5 minimax/minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2026-02-12
MiniMax M2.1 minimax/minimax-m2.1 200K 131.1K Input: $0.33
Output: $1.32
Model: 0.165
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2025-12-19
MiniMax M2.7 Turbo minimax/minimax-m2.7-turbo 204.8K 131.1K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2026-03-18
MiniMax 01 minimax/minimax-01 1M 16.4K Input: $0.1394
Output: $1.122
Model: 0.070
Completion: 8.049
📎 - In: text, pdf
Out: text
Released: 2025-01-15
MiniMax M3 minimax/minimax-m3 512K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🔧 - In: text, image
Out: text
Released: 2026-06-01
MiniMax M2-her minimax/minimax-m2-her 65.5K 2K Input: $0.302
Output: $1.207
Model: 0.151
Completion: 3.997
- - In: text
Out: text
Released: 2026-01-24
MiniMax Latest minimax/minimax-latest 512K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-03
MiniMax M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 - In: text
Out: text
Released: 2026-03-18
KAT Coder Pro V2 kwaipilot/kat-coder-pro-v2 256K 80K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
- - In: text
Out: text
Released: 2026-03-28
Neural Daredevil 8B abliterated mlabonne/NeuralDaredevil-8B-abliterated 8.2K 8.2K Input: $0.44
Output: $0.44
Model: 0.220
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-01
Mistral Nemo Starcannon 12b v1 VongolaChouko/Starcannon-Unleashed-12B-v1.0 16.4K 8.2K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-01
MiniMax M2.5 TEE TEE/minimax-m2.5 196.6K 131.1K Input: $0.2
Output: $1.38
Model: 0.100
Completion: 6.900
🧠 🔧 - In: text
Out: text
Released: 2026-04-20
GLM 4.7 TEE TEE/glm-4.7 131K 65.5K Input: $0.85
Output: $3.3
Model: 0.425
Completion: 3.882
- - In: text
Out: text
Released: 2026-01-29
Llama 3.3 70B TEE/llama3-3-70b 128K 16.4K Input: $2
Output: $2
Model: 1.000
Completion: 1.000
- - In: text
Out: text
Released: 2025-07-03
Gemma 4 31B IT TEE TEE/gemma-4-31b-it 262.1K 262.1K Input: $0.15
Output: $0.46
Model: 0.075
Completion: 3.067
🧠 🔧 - In: text
Out: text
Released: 2026-05-26
Qwen3 30B A3B Instruct 2507 TEE TEE/qwen3-30b-a3b-instruct-2507 262K 32.8K Input: $0.15
Output: $0.45
Model: 0.075
Completion: 3.000
- - In: text
Out: text
Released: 2025-07-29
GLM 5.1 TEE TEE/glm-5.1 202.8K 65.5K Input: $1.5
Output: $5.25
Cache Read: $0.3
Model: 0.750
Completion: 3.500
Cache: 0.200
- - In: text
Out: text
Released: 2026-04-20
DeepSeek V4 Pro TEE TEE/deepseek-v4-pro 800K 65.5K Input: $1.5
Output: $5.25
Cache Read: $0.15
Model: 0.750
Completion: 3.500
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-04-25
DeepSeek V4 Pro Thinking TEE TEE/deepseek-v4-pro:thinking 800K 65.5K Input: $1.5
Output: $5.25
Cache Read: $0.15
Model: 0.750
Completion: 3.500
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-04-29
Gemma 4 31B Thinking TEE TEE/gemma4-31b:thinking 262.1K 131.1K Input: $0.45
Output: $1
Model: 0.225
Completion: 2.222
🧠 - In: text
Out: text
Released: 2026-05-02
Kimi K2.5 TEE TEE/kimi-k2.5 128K 65.5K Input: $0.3
Output: $1.9
Model: 0.150
Completion: 6.333
- - In: text
Out: text
Released: 2026-01-29
Qwen2.5 VL 72B TEE TEE/qwen2.5-vl-72b-instruct 65.5K 8.2K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
📎 - In: text, image
Out: text
Released: 2025-02-01
GPT-OSS 120B TEE TEE/gpt-oss-120b 131.1K 16.4K Input: $2
Output: $2
Model: 1.000
Completion: 1.000
- - In: text
Out: text
Released: 2025-08-05
Qwen3.5 27B TEE TEE/qwen3.5-27b 262.1K 65.5K Input: $0.3
Output: $2.4
Model: 0.150
Completion: 8.000
📎 - In: text, image, video
Out: text
Released: 2026-03-13
Gemma 4 26B A4B Uncensored TEE TEE/gemma-4-26b-a4b-uncensored 65.5K 65.5K Input: $0.15
Output: $0.7
Model: 0.075
Completion: 4.667
📎 🔧 - In: text, image
Out: text
Released: 2026-05-23
DeepSeek V3.1 TEE TEE/deepseek-v3.1 164K 8.2K Input: $1
Output: $2.5
Model: 0.500
Completion: 2.500
- - In: text
Out: text
Released: 2025-08-21
Kimi K2.6 TEE TEE/kimi-k2.6 262.1K 65.5K Input: $1.5
Output: $5.25
Cache Read: $0.375
Model: 0.750
Completion: 3.500
Cache: 0.250
📎 🔧 - In: text, image
Out: text
Released: 2026-04-21
Qwen3.5 397B A17B TEE TEE/qwen3.5-397b-a17b 258K 65.5K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
- - In: text
Out: text
Released: 2026-02-28
Kimi K2.5 Thinking TEE TEE/kimi-k2.5-thinking 128K 65.5K Input: $0.3
Output: $1.9
Model: 0.150
Completion: 6.333
🧠 - In: text
Out: text
Released: 2026-01-29
Qwen3.6 35B A3B Uncensored TEE TEE/qwen3.6-35b-a3b-uncensored 131.1K 131.1K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-23
GLM 4.7 Flash TEE TEE/glm-4.7-flash 203K 65.5K Input: $0.15
Output: $0.5
Model: 0.075
Completion: 3.333
- - In: text
Out: text
Released: 2026-01-19
Gemma 4 31B TEE/gemma4-31b 262.1K 131.1K Input: $0.45
Output: $1
Model: 0.225
Completion: 2.222
- - In: text
Out: text
Released: 2026-04-04
Gemma 3 27B TEE TEE/gemma-3-27b-it 131.1K 8.2K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
- - In: text
Out: text
Released: 2025-03-10
GPT-OSS 20B TEE TEE/gpt-oss-20b 131.1K 8.2K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
- - In: text
Out: text
Released: 2025-08-05
GLM 5 TEE TEE/glm-5 203K 65.5K Input: $1.2
Output: $3.5
Model: 0.600
Completion: 2.917
- - In: text
Out: text
Released: 2026-02-11
GLM 5.1 Thinking TEE TEE/glm-5.1-thinking 202.8K 65.5K Input: $1.5
Output: $5.25
Cache Read: $0.3
Model: 0.750
Completion: 3.500
Cache: 0.200
🧠 🔧 - In: text
Out: text
Released: 2026-04-20
DeepSeek V3.2 TEE TEE/deepseek-v3.2 164K 65.5K Input: $0.5
Output: $1
Model: 0.250
Completion: 2.000
- - In: text
Out: text
Released: 2025-12-01
Qwen3.5 122B A10B TEE TEE/qwen3.5-122b-a10b 262.1K 262.1K Input: $0.46
Output: $3.68
Model: 0.230
Completion: 8.000
🧠 🔧 - In: text
Out: text
Released: 2026-05-26
Llama 3.1 70B Hanami Sao10K/L3.1-70B-Hanami-x1 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-07-23
Sao10K Stheno 8b Sao10K/L3-8B-Stheno-v3.2 16.4K 8.2K Input: $0.2006
Output: $0.2006
Model: 0.100
Completion: 1.000
- - In: text
Out: text
Released: 2024-11-29
Llama 3.3 70B Euryale Sao10K/L3.3-70B-Euryale-v2.3 20.5K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Llama 3.1 70B Euryale Sao10K/L3.1-70B-Euryale-v2.2 20.5K 16.4K Input: $0.306
Output: $0.357
Model: 0.153
Completion: 1.167
- - In: text
Out: text
Released: 2024-07-23
MS Evalebis 70b Steelskull/L3.3-MS-Evalebis-70b 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Steelskull Nevoria 70b Steelskull/L3.3-MS-Nevoria-70b 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Llama 3.3 70B Cu Mai Steelskull/L3.3-Cu-Mai-R1-70b 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Evayale 70b Steelskull/L3.3-MS-Evayale-70B 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Steelskull Nevoria R1 70b Steelskull/L3.3-Nevoria-R1-70b 16.4K 16.4K Input: $0.493
Output: $0.493
Model: 0.246
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
Steelskull Electra R1 70b Steelskull/L3.3-Electra-R1-70b 16.4K 16.4K Input: $0.69989
Output: $0.69989
Model: 0.350
Completion: 1.000
- - In: text
Out: text
Released: 2024-12-06
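
The NewAPI Ratios values in the rows above appear to follow directly from the listed prices: Model matches the input price divided by 2, Completion matches the output price divided by the input price, and Cache matches the cache-read price divided by the input price. The Python sketch below reproduces that arithmetic as a cross-check. The function name `newapi_ratios` and the base of $2 per 1M tokens are assumptions inferred from these rows, not a documented API, and rows that also list an Input Audio price may be computed from a different base.

```python
from typing import Optional


def newapi_ratios(input_usd: float, output_usd: float,
                  cache_read_usd: Optional[float] = None) -> dict:
    """Derive ratio columns from per-1M-token USD prices.

    Assumption (inferred from the table, not from official docs):
    a model ratio of 1.0 corresponds to $2 per 1M input tokens.
    """
    ratios = {"model": round(input_usd / 2, 3)}
    if input_usd > 0:
        # Completion ratio: how much more an output token costs than an input token.
        ratios["completion"] = round(output_usd / input_usd, 3)
        if cache_read_usd is not None:
            # Cache ratio: cached-input price relative to the normal input price.
            ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# Prices taken from the MiMo V2.5 row: Input $0.14, Output $0.28, Cache Read $0.0028.
print(newapi_ratios(0.14, 0.28, 0.0028))
```

The printed values can be compared with the Model, Completion, and Cache entries of the same row. Small differences in the third decimal place are possible for other rows, since the rounding rule used by the page is not stated.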

NEAR AI Cloud

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemma 4 31B IT google/gemma-4-31B-it 262.1K 32.8K Input: $0.13
Output: $0.4
Cache Read: $0.026
Model: 0.065
Completion: 3.077
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-02
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Gemini 3 Pro Preview google/gemini-3-pro 1M 65.5K Input: $1.25
Output: $15
Cache Read: $0
Model: 0.625
Completion: 12.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Qwen3 Embedding 0.6B Qwen/Qwen3-Embedding-0.6B 41K 1K Input: $0.01
Output: $0
Model: 0.005 - - In: text
Out: text
Open Weights
Released: 2025-06-03
Qwen3 Reranker 0.6B Qwen/Qwen3-Reranker-0.6B 41K 1K Input: $0.01
Output: $0.01
Model: 0.005
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2025-06-03
Qwen 3.6 35B A3B FP8 Qwen/Qwen3.6-35B-A3B-FP8 262.1K 32.8K Input: $0.17
Output: $1.1
Cache Read: $0.056
Model: 0.085
Completion: 6.471
Cache: 0.329
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-17
Qwen3.5 122B-A10B Qwen/Qwen3.5-122B-A10B 131.1K 32.8K Input: $0.4
Output: $3.2
Model: 0.200
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-23
Qwen3 30B-A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 32.8K Input: $0.15
Output: $0.55
Model: 0.075
Completion: 3.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen3-VL 30B-A3B Instruct Qwen/Qwen3-VL-30B-A3B-Instruct 256K 32.8K Input: $0.15
Output: $0.55
Model: 0.075
Completion: 3.667
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-09-23
o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5.4 nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.8
Output: $15.5
Cache Read: $0.18
Model: 0.900
Completion: 8.611
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-OSS 120B openai/gpt-oss-120b 131K 32.8K Input: $0.15
Output: $0.55
Model: 0.075
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Whisper Large v3 openai/whisper-large-v3 448 448 Input: $0.01
Output: $0
Model: 0.005 - - In: audio
Out: text
Open Weights
Released: 2023-11-06
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4-5 200K 64K Input: $3
Output: $15.5
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.167
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 anthropic/claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Haiku 4.5 (latest) anthropic/claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 anthropic/claude-opus-4-6 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GLM-5.1 FP8 zai-org/GLM-5.1-FP8 202.8K 131.1K Input: $0.85
Output: $3.3
Model: 0.425
Completion: 3.882
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
FLUX.2 Klein 4B black-forest-labs/FLUX.2-klein-4B 128K 128K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ - In: text, image
Out: image
Open Weights
Released: 2026-01-14

Nebius Token Factory

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct 128K 8.2K Input: $0.13
Output: $0.4
Cache Read: $0.013
Cache Write: $0.16
Model: 0.065
Completion: 3.077
Cache: 0.100
🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-12-05
Updated: 2026-02-04
Kimi-K2.5-fast moonshotai/Kimi-K2.5-fast 256K 8.2K Input: $0.5
Output: $2.5
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2025-12-15
Updated: 2026-02-04
Kimi-K2.5 moonshotai/Kimi-K2.5 256K 8.2K Input: $0.5
Output: $2.5
Cache Read: $0.05
Cache Write: $0.625
Reasoning: $2.5
Model: 0.250
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2025-12-15
Updated: 2026-02-04
Gemma-3-27b-it google/gemma-3-27b-it 110K 8.2K Input: $0.1
Output: $0.3
Cache Read: $0.01
Cache Write: $0.125
Model: 0.050
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2025-10 In: text, image
Out: text
Open Weights
Released: 2026-01-20
Updated: 2026-02-04
Qwen3-Next-80B-A3B-Thinking-fast Qwen/Qwen3-Next-80B-A3B-Thinking-fast 8K 8.2K Input: $0.15
Output: $1.2
Cache Read: $0.015
Cache Write: $0.1875
Model: 0.075
Completion: 8.000
Cache: 0.100
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-25
Updated: 2026-05-07
Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct 128K 8.2K Input: $0.25
Output: $0.75
Cache Read: $0.025
Cache Write: $0.31
Model: 0.125
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-20
Updated: 2026-02-04
Qwen3-Embedding-8B Qwen/Qwen3-Embedding-8B 32.8K - Input: $0.01
Output: $0
Model: 0.005 - 2025-10 In: text
Out: text
Open Weights
Released: 2026-01-10
Updated: 2026-02-04
Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B 262.1K 8.2K Input: $0.6
Output: $3.6
Cache Read: $0.06
Cache Write: $0.75
Model: 0.300
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-15
Updated: 2026-05-07
Qwen3.5-397B-A17B-fast Qwen/Qwen3.5-397B-A17B-fast 8K 8.2K Input: $0.6
Output: $3.6
Cache Read: $0.06
Cache Write: $0.75
Model: 0.300
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-15
Updated: 2026-05-07
Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 128K 8.2K Input: $0.1
Output: $0.3
Cache Read: $0.01
Cache Write: $0.125
Model: 0.050
Completion: 3.000
Cache: 0.100
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-01-28
Updated: 2026-02-04
Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 128K 16.4K Input: $0.15
Output: $1.2
Cache Read: $0.015
Cache Write: $0.18
Reasoning: $1.2
Model: 0.075
Completion: 8.000
Cache: 0.100
🧠 🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-01-28
Updated: 2026-02-04
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 8.2K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-25
Updated: 2025-10-04
Qwen3-32B Qwen/Qwen3-32B 128K 8.2K Input: $0.1
Output: $0.3
Cache Read: $0.01
Cache Write: $0.125
Model: 0.050
Completion: 3.000
Cache: 0.100
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-01-28
Updated: 2026-02-04
Qwen3-235B-A22B-Thinking-2507-fast Qwen/Qwen3-235B-A22B-Thinking-2507-fast 8K 8.2K Input: $0.5
Output: $2
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-25
Updated: 2026-05-07
gpt-oss-120b-fast openai/gpt-oss-120b-fast 8K 8.2K Input: $0.1
Output: $0.5
Cache Read: $0.01
Cache Write: $0.125
Model: 0.050
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-06-10
Updated: 2026-05-07
gpt-oss-120b openai/gpt-oss-120b 128K 8.2K Input: $0.15
Output: $0.6
Cache Read: $0.015
Cache Write: $0.18
Reasoning: $0.6
Model: 0.075
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2026-01-10
Updated: 2026-02-04
Hermes-4-405B NousResearch/Hermes-4-405B 128K 8.2K Input: $1
Output: $3
Cache Read: $0.1
Cache Write: $1.25
Reasoning: $3
Model: 0.500
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2026-01-30
Updated: 2026-02-04
Hermes-4-70B NousResearch/Hermes-4-70B 128K 8.2K Input: $0.13
Output: $0.4
Cache Read: $0.013
Cache Write: $0.16
Reasoning: $0.4
Model: 0.065
Completion: 3.077
Cache: 0.100
🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2026-01-30
Updated: 2026-02-04
Nemotron-3-Nano-30B-A3B nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B 32K 4.1K Input: $0.06
Output: $0.24
Cache Read: $0.006
Cache Write: $0.075
Model: 0.030
Completion: 4.000
Cache: 0.100
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-08-10
Updated: 2026-02-04
Nemotron-3-Nano-Omni nvidia/Nemotron-3-Nano-Omni 65.5K 8.2K Input: $0.06
Output: $0.24
Cache Read: $0.006
Cache Write: $0.075
Model: 0.030
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2026-05-07
Llama-3.1-Nemotron-Ultra-253B-v1 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 128K 4.1K Input: $0.6
Output: $1.8
Cache Read: $0.06
Cache Write: $0.75
Model: 0.300
Completion: 3.000
Cache: 0.100
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-15
Updated: 2026-02-04
Nemotron-3-Super-120B-A12B nvidia/nemotron-3-super-120b-a12b 256K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-03-12
GLM-5 zai-org/GLM-5 200K 16.4K Input: $1
Output: $3.2
Cache Read: $0.1
Cache Write: $1
Model: 0.500
Completion: 3.200
Cache: 0.100
🧠 🔧 🌡️ 2026-01 In: text
Out: text
Released: 2026-03-01
Updated: 2026-03-10
DeepSeek-V3.2-fast deepseek-ai/DeepSeek-V3.2-fast 8K 8.2K Input: $0.4
Output: $2
Cache Read: $0.04
Cache Write: $0.5
Model: 0.200
Completion: 5.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-27
Updated: 2026-05-07
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 1M 384K Input: $1.75
Output: $3.5
Cache Read: $0.15
Model: 0.875
Completion: 2.000
Cache: 0.086
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 163K 16.4K Input: $0.3
Output: $0.45
Cache Read: $0.03
Cache Write: $0.375
Reasoning: $0.45
Model: 0.150
Completion: 1.500
Cache: 0.100
🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2026-01-20
Updated: 2026-02-04
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 8.2K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2026-05-07
MiniMax-M2.5-fast MiniMaxAI/MiniMax-M2.5-fast 8K 8.2K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2026-05-07
INTELLECT-3 PrimeIntellect/INTELLECT-3 128K 8.2K Input: $0.2
Output: $1.1
Cache Read: $0.02
Cache Write: $0.25
Model: 0.100
Completion: 5.500
Cache: 0.100
🔧 🌡️ 2025-10 In: text
Out: text
Open Weights
Released: 2026-01-25
Updated: 2026-02-04

Neon

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 3 Flash Preview gemini-3-flash 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Claude Sonnet 4.5 claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.5 (latest) claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
GPT-5 gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.1 gpt-5-1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Gemini 3 Pro Preview gemini-3-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
GPT-5.2 gpt-5-2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
Gemini 3.1 Flash Lite Preview gemini-3-1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Gemini 2.5 Pro gemini-2-5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GPT-5.4 nano gpt-5-4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Claude Opus 4.1 (latest) claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
GPT-5.4 mini gpt-5-4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
Gemini 2.5 Flash gemini-2-5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GPT OSS 120B gpt-oss-120b 131.1K 32.8K Input: $0.072
Output: $0.28
Model: 0.036
Completion: 3.889
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.5 gpt-5-5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Gemini 3.1 Pro Preview Custom Tools gemini-3-1-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Claude Haiku 4.5 (latest) claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
GPT-5 Mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GPT-5.4 gpt-5-4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT OSS 20B gpt-oss-20b 131.1K 32.8K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05

Neuralwatt

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.5 Fast kimi-k2.5-fast 262.1K 262.1K Input: $0.52
Output: $2.59
Model: 0.260
Completion: 4.981
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-27
GLM 5 Fast glm-5-fast 202.7K 202.7K Input: $1.1
Output: $3.6
Model: 0.550
Completion: 3.273
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-07
Qwen3.5 397B Fast qwen3.5-397b-fast 262.1K 262.1K Input: $0.69
Output: $4.14
Model: 0.345
Completion: 6.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-01
GLM 5.1 Fast glm-5.1-fast 202.7K 202.7K Input: $1.1
Output: $3.6
Model: 0.550
Completion: 3.273
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-07
Kimi K2.6 Fast kimi-k2.6-fast 262.1K 262.1K Input: $0.69
Output: $3.22
Model: 0.345
Completion: 4.667
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-21
Qwen3.6 35B Fast qwen3.6-35b-fast 131.1K 131.1K Input: $0.29
Output: $1.15
Model: 0.145
Completion: 3.966
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-01
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 262.1K Input: $0.69
Output: $3.22
Model: 0.345
Completion: 4.667
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $0.52
Output: $2.59
Model: 0.260
Completion: 4.981
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-27
Qwen3.6 35B A3B Qwen/Qwen3.6-35B-A3B 131.1K 131.1K Input: $0.29
Output: $1.15
Model: 0.145
Completion: 3.966
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-01
Qwen3.5 397B A17B FP8 Qwen/Qwen3.5-397B-A17B-FP8 262.1K 262.1K Input: $0.69
Output: $4.14
Model: 0.345
Completion: 6.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-01
GPT OSS 20B openai/gpt-oss-20b 16.4K 16.4K Input: $0.03
Output: $0.16
Model: 0.015
Completion: 5.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Devstral Small 2 24B Instruct 2512 mistralai/Devstral-Small-2-24B-Instruct-2512 262.1K 262.1K Input: $0.12
Output: $0.35
Model: 0.060
Completion: 2.917
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-09
GLM 5.1 FP8 zai-org/GLM-5.1-FP8 202.7K 202.7K Input: $1.1
Output: $3.6
Model: 0.550
Completion: 3.273
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-07
MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 196.6K Input: $0.35
Output: $1.38
Model: 0.175
Completion: 3.943
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12

Nova

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Nova 2 Pro nova-2-pro-v1 1M 64K Input: $0
Output: $0
Reasoning: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2025-12-03
Updated: 2026-01-03
Nova 2 Lite nova-2-lite-v1 1M 64K Input: $0
Output: $0
Reasoning: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2025-12-01

NovitaAI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Ling-2.6-1T inclusionai/ling-2.6-1t 262.1K 32.8K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-23
Ring-2.6-1T inclusionai/ring-2.6-1t 262.1K 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.06
Model: 0.150
Completion: 8.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-08
Updated: 2026-05-27
Ling-2.6-flash inclusionai/ling-2.6-flash 262.1K 32.8K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-24
Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct 16.4K 16.4K Input: $0.02
Output: $0.05
Model: 0.010
Completion: 2.500
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-24
Llama3 70B Instruct meta-llama/llama-3-70b-instruct 8.2K 8K Input: $0.51
Output: $0.74
Model: 0.255
Completion: 1.451
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-25
Llama 4 Scout Instruct meta-llama/llama-4-scout-17b-16e-instruct 131.1K 131.1K Input: $0.18
Output: $0.59
Model: 0.090
Completion: 3.278
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-06
Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct 131.1K 120K Input: $0.135
Output: $0.4
Model: 0.068
Completion: 2.963
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-07
Llama 3 8B Instruct meta-llama/llama-3-8b-instruct 8.2K 8.2K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-25
Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct 32.8K 32K Input: $0.03
Output: $0.05
Model: 0.015
Completion: 1.667
🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Llama 4 Maverick Instruct meta-llama/llama-4-maverick-17b-128e-instruct-fp8 1M 8.2K Input: $0.27
Output: $0.85
Model: 0.135
Completion: 3.148
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-06
Kimi K2 Instruct moonshotai/kimi-k2-instruct 131.1K 32.8K Input: $0.57
Output: $2.3
Model: 0.285
Completion: 4.035
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2 Thinking moonshotai/kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-07
Kimi K2.5 moonshotai/kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Kimi K2 0905 moonshotai/kimi-k2-0905 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
MiniMax M1 minimaxai/minimax-m1-80k 1M 40K Input: $0.55
Output: $2.2
Model: 0.275
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-17
ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b 30K 8K Input: $1.4
Output: $5.6
Model: 0.700
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-06-30
ERNIE 4.5 21B A3B baidu/ernie-4.5-21B-a3b 120K 8K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-06-30
ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b 123K 16K Input: $0.42
Output: $1.25
Model: 0.210
Completion: 2.976
📎 🧠 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-06-30
ERNIE-4.5-21B-A3B-Thinking baidu/ernie-4.5-21B-a3b-thinking 131.1K 65.5K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🧠 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-09-19
ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b-paddle 123K 12K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-30
ERNIE-4.5-VL-28B-A3B-Thinking baidu/ernie-4.5-vl-28b-a3b-thinking 131.1K 65.5K Input: $0.39
Output: $0.39
Model: 0.195
Completion: 1.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-11-26
Gemma 4 31B google/gemma-4-31b-it 262.1K 131.1K Input: $0.14
Output: $0.4
Model: 0.070
Completion: 2.857
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemma 4 26B A4B google/gemma-4-26b-a4b-it 262.1K 131.1K Input: $0.13
Output: $0.4
Model: 0.065
Completion: 3.077
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemma 3 12B google/gemma-3-12b-it 131.1K 8.2K Input: $0.05
Output: $0.1
Model: 0.025
Completion: 2.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-13
Gemma 3 27B google/gemma-3-27b-it 98.3K 16.4K Input: $0.119
Output: $0.2
Model: 0.059
Completion: 1.681
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-25
Wizardlm 2 8x22B microsoft/wizardlm-2-8x22b 65.5K 8K Input: $0.62
Output: $0.62
Model: 0.310
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-24
OpenAI GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-06
OpenAI: GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K Input: $0.04
Output: $0.15
Model: 0.020
Completion: 3.750
📎 🧠 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-08-06
L31 70B Euryale V2.2 sao10K/l31-70b-euryale-v2.2 8.2K 8.2K Input: $1.48
Output: $1.48
Model: 0.740
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-19
L3 8B Stheno V3.2 sao10K/L3-8B-stheno-v3.2 8.2K 32K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-29
Sao10k L3 8B Lunaris sao10K/l3-8b-lunaris 8.2K 8.2K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-28
L3 70B Euryale V2.1 sao10K/l3-70b-euryale-v2.1 8.2K 8.2K Input: $1.48
Output: $1.48
Model: 0.740
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-18
baichuan-m2-32b baichuan/baichuan-m2-32b 131.1K 131.1K Input: $0.07
Output: $0.07
Model: 0.035
Completion: 1.000
🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-08-13
Mistral Nemo mistralai/mistral-nemo 60.3K 16K Input: $0.04
Output: $0.17
Model: 0.020
Completion: 4.250
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-30
XiaomiMiMo/MiMo-V2-Flash xiaomimimo/mimo-v2-flash 262.1K 32K Input: $0.1
Output: $0.3
Cache Read: $0.3
Model: 0.050
Completion: 3.000
Cache: 3.000
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-12-19
MiMo-V2-Pro xiaomimimo/mimo-v2-pro 1M 131.1K Input: $2
Output: $6
Cache Read: $0.4
Model: 1.000
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
Updated: 2026-05-27
MiMo-V2.5-Pro xiaomimimo/mimo-v2.5-pro 1M 131.1K Input: $2
Output: $6
Cache Read: $0.4
Model: 1.000
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Updated: 2026-05-27
Mythomax L2 13B gryphe/mythomax-l2-13b 4.1K 3.2K Input: $0.09
Output: $0.09
Model: 0.045
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-25
GLM-4.7 zai-org/glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-22
GLM 4.5V zai-org/glm-4.5v 65.5K 16.4K Input: $0.6
Output: $1.8
Cache Read: $0.11
Model: 0.300
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ 2025-04 In: text, video, image
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5 zai-org/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
AutoGLM-Phone-9B-Multilingual zai-org/autoglm-phone-9b-multilingual 65.5K 65.5K Input: $0.035
Output: $0.138
Model: 0.018
Completion: 3.943
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-10
GLM-5.1 zai-org/glm-5.1 204.8K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM 4.6 zai-org/glm-4.6 204.8K 131.1K Input: $0.55
Output: $2.2
Cache Read: $0.11
Model: 0.275
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-30
GLM 4.6V zai-org/glm-4.6v 131.1K 32.8K Input: $0.3
Output: $0.9
Cache Read: $0.055
Model: 0.150
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ 2025-04 In: text, video, image
Out: text
Open Weights
Released: 2025-12-08
GLM 4.5 Air zai-org/glm-4.5-air 131.1K 98.3K Input: $0.13
Output: $0.85
Model: 0.065
Completion: 6.538
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-10-13
GLM-4.7-Flash zai-org/glm-4.7-flash 200K 128K Input: $0.07
Output: $0.4
Cache Read: $0.01
Model: 0.035
Completion: 5.714
Cache: 0.143
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-5 zai-org/glm-5 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
Updated: 2026-02-12
PaddleOCR-VL paddlepaddle/paddleocr-vl 16.4K 16.4K Input: $0.02
Output: $0.02
Model: 0.010
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-22
Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking 131.1K 32.8K Input: $0.98
Output: $3.95
Model: 0.490
Completion: 4.031
📎 🧠 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-09-24
qwen/qwen3-vl-30b-a3b-thinking qwen/qwen3-vl-30b-a3b-thinking 131.1K 32.8K Input: $0.2
Output: $1
Model: 0.100
Completion: 5.000
📎 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-10-11
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-instruct-2507 131.1K 16.4K Input: $0.09
Output: $0.58
Model: 0.045
Completion: 6.444
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-22
Qwen3 Coder 30b A3B Instruct qwen/qwen3-coder-30b-a3b-instruct 160K 32.8K Input: $0.07
Output: $0.27
Model: 0.035
Completion: 3.857
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-09
Qwen3 8B qwen/qwen3-8b-fp8 128K 20K Input: $0.035
Output: $0.138
Model: 0.018
Completion: 3.943
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct 131.1K 32.8K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-10
qwen/qwen3-vl-8b-instruct qwen/qwen3-vl-8b-instruct 131.1K 32.8K Input: $0.08
Output: $0.5
Model: 0.040
Completion: 6.250
📎 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-10-17
Qwen3 Omni 30B A3B Thinking qwen/qwen3-omni-30b-a3b-thinking 65.5K 16.4K Input: $0.25
Output: $0.97
Input Audio: $2.2
Output Audio: $1.788
Model: 1.100
Completion: 0.813
📎 🧠 🔧 🌡️ - In: text, audio, video, image
Out: text
Open Weights
Released: 2025-09-24
Qwen3.7-Max qwen/qwen3.7-max 1M 65.5K Input: $1.25
Output: $3.75
Cache Read: $0.125
Cache Write: $1.5625
Model: 0.625
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Updated: 2026-05-27
Qwen3 Max qwen/qwen3-max 262.1K 65.5K Input: $2.11
Output: $8.45
Model: 1.055
Completion: 4.005
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-24
Qwen2.5 7B Instruct qwen/qwen2.5-7b-instruct 32K 32K Input: $0.07
Output: $0.07
Model: 0.035
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-16
Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-10
Qwen3 235B A22b Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 131.1K 32.8K Input: $0.3
Output: $3
Model: 0.150
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen MT Plus qwen/qwen-mt-plus 16.4K 8.2K Input: $0.25
Output: $0.75
Model: 0.125
Completion: 3.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-03
Qwen3 32B qwen/qwen3-32b-fp8 41K 20K Input: $0.1
Output: $0.45
Model: 0.050
Completion: 4.500
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Qwen3 4B qwen/qwen3-4b-fp8 128K 20K Input: $0.03
Output: $0.03
Model: 0.015
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct 32.8K 32.8K Input: $0.8
Output: $0.8
Model: 0.400
Completion: 1.000
📎 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-03-25
Qwen3 30B A3B qwen/qwen3-30b-a3b-fp8 41K 20K Input: $0.09
Output: $0.45
Model: 0.045
Completion: 5.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Qwen3.5-27B qwen/qwen3.5-27b 262.1K 65.5K Input: $0.3
Output: $2.4
Model: 0.150
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-26
Qwen 2.5 72B Instruct qwen/qwen-2.5-72b-instruct 32K 8.2K Input: $0.38
Output: $0.4
Model: 0.190
Completion: 1.053
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-10-15
Qwen3 Coder Next qwen/qwen3-coder-next 262.1K 65.5K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-03
Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b 262.1K 65.5K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-26
Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct 262.1K 65.5K Input: $0.3
Output: $1.3
Model: 0.150
Completion: 4.333
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b 262.1K 64K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-17
qwen/qwen3-vl-30b-a3b-instruct qwen/qwen3-vl-30b-a3b-instruct 131.1K 32.8K Input: $0.2
Output: $0.7
Model: 0.100
Completion: 3.500
📎 🔧 🌡️ - In: text, video, image
Out: text
Open Weights
Released: 2025-10-11
Qwen3 Omni 30B A3B Instruct qwen/qwen3-omni-30b-a3b-instruct 65.5K 16.4K Input: $0.25
Output: $0.97
Input Audio: $2.2
Output Audio: $1.788
Model: 1.100
Completion: 0.813
📎 🔧 🌡️ 2024-04 In: text, video, audio, image
Out: text, audio
Open Weights
Released: 2025-09-24
Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct 131.1K 32.8K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-09-24
Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b 262.1K 65.5K Input: $0.4
Output: $3.2
Model: 0.200
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-26
Qwen3 235B A22B qwen/qwen3-235b-a22b-fp8 41K 20K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
DeepSeek R1 0528 deepseek/deepseek-r1-0528 163.8K 32.8K Input: $0.7
Output: $2.5
Cache Read: $0.35
Model: 0.350
Completion: 3.571
Cache: 0.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 393.2K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 131.1K 32.8K Input: $0.27
Output: $1
Cache Read: $0.135
Model: 0.135
Completion: 3.704
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-22
DeepSeek V3 0324 deepseek/deepseek-v3-0324 163.8K 163.8K Input: $0.27
Output: $1.12
Cache Read: $0.135
Model: 0.135
Completion: 4.148
Cache: 0.500
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-03-25
DeepSeek-OCR deepseek/deepseek-ocr 8.2K 8.2K Input: $0.03
Output: $0.03
Model: 0.015
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-24
DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 8.2K 8.2K Input: $0.8
Output: $0.8
Model: 0.400
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-27
DeepSeek R1 (Turbo) deepseek/deepseek-r1-turbo 64K 16K Input: $0.7
Output: $2.5
Model: 0.350
Completion: 3.571
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-05
DeepSeek Prover V2 671B deepseek/deepseek-prover-v2-671b 160K 160K Input: $0.7
Output: $2.5
Model: 0.350
Completion: 3.571
🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-30
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 393.2K Input: $1.69
Output: $3.38
Cache Read: $0.13
Model: 0.845
Completion: 2.000
Cache: 0.077
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp 163.8K 65.5K Input: $0.27
Output: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-29
deepseek/deepseek-ocr-2 deepseek/deepseek-ocr-2 8.2K 8.2K Input: $0.03
Output: $0.03
Model: 0.015
Completion: 1.000
📎 - In: text, image
Out: text
Open Weights
Released: 2026-01-27
DeepSeek R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b 32.8K 16.4K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek V3.1 deepseek/deepseek-v3.1 131.1K 32.8K Input: $0.27
Output: $1
Cache Read: $0.135
Model: 0.135
Completion: 3.704
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-21
DeepSeek R1 0528 Qwen3 8B deepseek/deepseek-r1-0528-qwen3-8b 128K 32K Input: $0.06
Output: $0.09
Model: 0.030
Completion: 1.500
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-29
DeepSeek R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b 64K 32K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek V3 (Turbo) deepseek/deepseek-v3-turbo 64K 16K Input: $0.4
Output: $1.3
Model: 0.200
Completion: 3.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-05
DeepSeek V3.2 deepseek/deepseek-v3.2 163.8K 65.5K Input: $0.269
Output: $0.4
Cache Read: $0.1345
Model: 0.135
Completion: 1.487
Cache: 0.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01
MiniMax-M2.7-highspeed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Updated: 2026-05-27
MiniMax M2.5 minimax/minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
MiniMax M2.1 minimax/minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2 minimax/minimax-m2 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax M2.5 Highspeed minimax/minimax-m2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.03
Model: 0.300
Completion: 4.000
Cache: 0.050
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
Kat Coder Pro kwaipilot/kat-coder-pro 256K 128K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-05
Hermes 2 Pro Llama 3 8B nousresearch/hermes-2-pro-llama-3-8b 8.2K 8.2K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-27
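
Note on the NewAPI Ratios column: the page does not state how the ratios are calculated. In the text-priced rows above they appear to follow from the per-1M prices: Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price, each rounded to three decimals. The sketch below reproduces that pattern. The function name `newapi_ratios` and the $2 baseline are assumptions inferred from the listed values, not a documented formula. Rows that also list audio prices (for example Qwen3 Omni) do not follow it, and free models show "-" instead of ratios, which is modelled here by returning `None`.

```python
from typing import Optional

# Assumed baseline: a Model ratio of 1.000 corresponds to $2 per 1M input tokens.
# This is inferred from the rows on this page, not taken from documentation.
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(input_price: float, output_price: float,
                  cache_read: Optional[float] = None) -> Optional[dict]:
    """Derive ratio values from per-1M-token prices in USD (hypothetical helper)."""
    if input_price <= 0:
        # Free models are listed with "-" in the ratios column.
        return None
    ratios = {
        "model": round(input_price / BASE_PRICE_PER_1M, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read is not None:
        ratios["cache"] = round(cache_read / input_price, 3)
    return ratios


# GLM-5.1 row: Input $1.4, Output $4.4, Cache Read $0.26
print(newapi_ratios(1.4, 4.4, 0.26))
```

For the GLM-5.1 row this gives 0.7, 3.143 and 0.186, matching the listed Model, Completion and Cache values.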

Nvidia

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
BGE M3 baai/bge-m3 8.2K 1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2024-01-30
Updated: 2026-04-30
Kimi K2 0905 moonshotai/kimi-k2-instruct-0905 262.1K 262.1K Input: $0
Output: $0
- 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
MiniMax-M2.7 minimaxai/minimax-m2.7 204.8K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Updated: 2026-04-11
Step 3.7 Flash stepfun-ai/step-3.7-flash 256K 16.4K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-05-28
Step 3.5 Flash stepfun-ai/step-3.5-flash 256K 16.4K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-02
Gemma 3n E4b It google/gemma-3n-e4b-it 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-06 In: text, image
Out: text
Open Weights
Released: 2025-06-03
Gemma 3n E2b It google/gemma-3n-e2b-it 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-06 In: text, image
Out: text
Open Weights
Released: 2025-06-12
paligemma google/google-paligemma 128K 8.2K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-05-14
Updated: 2024-08-26
Gemma-4-31B-IT google/gemma-4-31b-it 256K 16.4K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-02
Gemma 2 2b It google/gemma-2-2b-it 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-16
Phi-4-Mini microsoft/phi-4-mini-instruct 131.1K 8.2K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2024-12-01
Updated: 2025-09-05
Phi 4 Multimodal microsoft/phi-4-multimodal-instruct 128K 16.4K Input: $0
Output: $0
- - - In: text
Out: text
Released: 2025-07-26
GLM-5.1 z-ai/glm-5.1 131.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GPT-OSS-120B openai/gpt-oss-120b 128K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
Whisper Large v3 openai/whisper-large-v3 - 4.1K Input: $0
Output: $0
- - 2023-09 In: audio
Out: text
Open Weights
Released: 2023-09-01
Updated: 2025-09-05
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
ByteDance-Seed/Seed-OSS-36B-Instruct bytedance/seed-oss-36b-instruct 262K 262K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2025-09-04
Updated: 2025-11-25
Mistral-7B-Instruct-v0.3 mistralai/mistral-7b-instruct-v03 65.5K 65.5K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-01
Magistral Small 2506 mistralai/magistral-small-2506 32.8K 32.8K Input: $0
Output: $0
- - - In: text
Out: text
Released: 2025-09-25
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct 32.8K 16.4K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-10
Updated: 2026-03-15
Mistral Medium 3 mistralai/mistral-medium-3-instruct 131.1K 32.8K Input: $0
Output: $0
- 📎 - In: text, image
Out: text
Released: 2025-09-25
mistral-small-4-119b-2603 mistralai/mistral-small-4-119b-2603 128K 8.2K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-16
mistral-nemotron mistralai/mistral-nemotron 128K 8.2K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-11
Updated: 2025-06-12
Mistral Large 3 675B Instruct 2512 mistralai/mistral-large-3-675b-instruct-2512 262.1K 262.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2025-12-02
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct 65.5K 13.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-17
cosmos-transfer1-7b nvidia/cosmos-transfer1-7b - 4.1K Input: $0
Output: $0
- 📎 - In: text, image, video
Out: video
Open Weights
Released: 2025-06-13
Updated: 2025-06-30
cosmos-transfer2.5-2b nvidia/cosmos-transfer2_5-2b - 4.1K Input: $0
Output: $0
- 📎 - In: text, image, video
Out: video
Open Weights
Released: 2026-02-26
llama-nemotron-embed-vl-1b-v2 nvidia/llama-nemotron-embed-vl-1b-v2 32.8K 2K Input: $0
Output: $0
- 📎 - In: text, image
Out: text
Open Weights
Released: 2026-02-10
Nemotron 3 Nano Omni nvidia/nemotron-3-nano-omni-30b-a3b-reasoning 256K 65.5K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-28
magpie-tts-zeroshot nvidia/magpie-tts-zeroshot - 4.1K Input: $0
Output: $0
- 📎 - In: text, audio
Out: audio
Open Weights
Released: 2025-05-22
Updated: 2025-06-12
nvidia-nemotron-nano-9b-v2 nvidia/nvidia-nemotron-nano-9b-v2 131.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-09 In: text
Out: text
Open Weights
Released: 2025-08-18
synthetic-video-detector nvidia/synthetic-video-detector - 4.1K Input: $0
Output: $0
- 📎 🌡️ - In: video
Out: text
Open Weights
Released: 2026-04-16
nemotron-content-safety-reasoning-4b nvidia/nemotron-content-safety-reasoning-4b 128K 4.1K Input: $0
Output: $0
- 🧠 - In: text
Out: text
Open Weights
Released: 2026-01-22
nv-embed-v1 nvidia/nv-embed-v1 32.8K 2K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2024-06-07
Updated: 2025-07-22
usdcode nvidia/usdcode 128K 4.1K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Released: 2026-01-01
riva-translate-4b-instruct-v1_1 nvidia/riva-translate-4b-instruct-v1_1 128K 4.1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2025-12-12
sparsedrive nvidia/sparsedrive 128K 8.2K Input: $0
Output: $0
- 📎 🌡️ - In: video
Out: text
Open Weights
Released: 2025-03-18
Updated: 2025-07-20
rerank-qa-mistral-4b nvidia/rerank-qa-mistral-4b 128K 4.1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2024-03-17
Updated: 2025-01-17
streampetr nvidia/streampetr 128K 8.2K Input: $0
Output: $0
- 📎 🌡️ - In: video
Out: text
Open Weights
Released: 2025-11-13
Active Speaker Detection nvidia/active-speaker-detection - 4.1K Input: $0
Output: $0
- 📎 - In: video
Out: text
Open Weights
Released: 2026-04-16
llama-3.1-nemotron-safety-guard-8b-v3 nvidia/llama-3_1-nemotron-safety-guard-8b-v3 128K 4.1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2025-10-28
llama-3_2-nemoretriever-300m-embed-v1 nvidia/llama-3_2-nemoretriever-300m-embed-v1 32.8K 2K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2025-07-24
nemotron-voicechat nvidia/nemotron-voicechat 128K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, audio
Out: text
Open Weights
Released: 2026-03-16
nv-embedcode-7b-v1 nvidia/nv-embedcode-7b-v1 32.8K 2K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-05-29
Nemotron 3 Ultra 550B A55B nvidia/nemotron-3-ultra-550b-a55b 1M 65.5K Input: $0.5
Output: $2.5
Cache Read: $0.15
Model: 0.250
Completion: 5.000
Cache: 0.300
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
nemotron-3-nano-30b-a3b nvidia/nemotron-3-nano-30b-a3b 131.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-09 In: text
Out: text
Open Weights
Released: 2024-12
cosmos-predict1-5b nvidia/cosmos-predict1-5b - 4.1K Input: $0
Output: $0
- 📎 - In: text, image, video
Out: video
Open Weights
Released: 2025-03-18
bevformer nvidia/bevformer 128K 8.2K Input: $0
Output: $0
- 📎 🌡️ - In: video
Out: text
Open Weights
Released: 2025-03-18
Updated: 2025-07-20
studiovoice nvidia/studiovoice 128K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2024-10-03
Updated: 2025-06-13
gliner-pii nvidia/gliner-pii 128K 4.1K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-03
nemotron-mini-4b-instruct nvidia/nemotron-mini-4b-instruct 128K 8.2K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-21
Updated: 2024-08-26
llama-nemotron-rerank-vl-1b-v2 nvidia/llama-nemotron-rerank-vl-1b-v2 128K 4.1K Input: $0
Output: $0
- 📎 - In: text, image
Out: text
Open Weights
Released: 2026-03-31
Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b 262.1K 262.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2026-03-11
usdvalidate nvidia/usdvalidate - 4.1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2024-07-24
Updated: 2025-01-08
nemotron-3-content-safety nvidia/nemotron-3-content-safety 128K 4.1K Input: $0
Output: $0
- - - In: text
Out: text
Open Weights
Released: 2026-04-16
dracarys-llama-3.1-70b-instruct abacusai/dracarys-llama-3_1-70b-instruct 128K 8.2K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-11
Updated: 2025-05-22
DeepSeek V4 Flash deepseek-ai/deepseek-v4-flash 1M 393.2K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek-ai/deepseek-v4-pro 1M 393.2K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Qwen3-Next-80B-A3B-Instruct qwen/qwen3-next-80b-a3b-instruct 262.1K 16.4K Input: $0
Output: $0
- 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2024-12-01
Updated: 2025-09-05
Qwen Image Edit qwen/qwen-image-edit - - Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: image
Released: 2025-08-19
Qwen Image qwen/qwen-image - - Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: image
Released: 2025-08-07
Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct 262.1K 66.5K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
Qwen2.5 Coder 32b Instruct qwen/qwen2.5-coder-32b-instruct 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-06
Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b 262.1K 8.2K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2026-01 In: text, image
Out: text
Open Weights
Released: 2026-02-16
Qwen3.5 122B-A10B qwen/qwen3.5-122b-a10b 262.1K 65.5K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
sarvam-m sarvamai/sarvam-m 128K 8.2K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-25
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct 16K 4.1K Input: $0
Output: $0
- 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.1 70b Instruct meta/llama-3.1-70b-instruct 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-16
Llama 3.2 1b Instruct meta/llama-3.2-1b-instruct 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-09-18
Llama 3.2 11b Vision Instruct meta/llama-3.2-11b-vision-instruct 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-18
Llama 3.3 70b Instruct meta/llama-3.3-70b-instruct 128K 4.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-26
Llama Guard 4 12B meta/llama-guard-4-12b 128K 16.4K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Updated: 2026-04-30
esmfold meta/esmfold 128K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2024-03-15
Updated: 2025-06-12
esm2-650m meta/esm2-650m 128K 8.2K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2024-08-29
Updated: 2025-03-10
Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct 128K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 4 Maverick 17b 128e Instruct meta/llama-4-maverick-17b-128e-instruct 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-02 In: text, image
Out: text
Open Weights
Released: 2025-04-01
Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct 32.8K 32K Input: $0
Output: $0
- 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
solar-10.7b-instruct upstage/solar-10_7b-instruct 128K 8.2K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-06-05
Updated: 2025-04-10
FLUX.1-schnell black-forest-labs/flux_1-schnell 77 - Input: $0
Output: $0
- - 2024-07 In: text
Out: image
Open Weights
Released: 2024-08-01
Updated: 2026-02-04
FLUX.1-Kontext-dev black-forest-labs/flux_1-kontext-dev 41K 41K Input: $0
Output: $0
- 📎 - In: text, image
Out: image
Open Weights
Released: 2025-08-12
FLUX.1-dev black-forest-labs/flux.1-dev 4.1K - Input: $0
Output: $0
- 🌡️ 2024-08 In: text
Out: image
Released: 2024-08-01
Updated: 2025-09-05
FLUX.2 Klein 4B black-forest-labs/flux_2-klein-4b 41K 41K Input: $0
Output: $0
- 🌡️ 2025-06 In: image, text
Out: image
Open Weights
Released: 2026-01-14
Updated: 2026-01-31

Ollama Cloud

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
deepseek-v4-flash deepseek-v4-flash 1M 1M - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-04-24
minimax-m2.5 minimax-m2.5 204.8K 131.1K - - 🧠 🔧 2025-01 In: text
Out: text
Open Weights
Released: 2026-02-12
devstral-small-2:24b devstral-small-2:24b 262.1K 262.1K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2025-12-09
Updated: 2026-01-19
glm-4.7 glm-4.7 202.8K 131.1K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-12-22
Updated: 2026-01-19
cogito-2.1:671b cogito-2.1:671b 163.8K 32K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-11-19
Updated: 2026-01-19
minimax-m2.1 minimax-m2.1 204.8K 131.1K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-12-23
Updated: 2026-01-19
gpt-oss:120b gpt-oss:120b 131.1K 32.8K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-01-19
nemotron-3-nano:30b nemotron-3-nano:30b 1M 131.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-15
Updated: 2026-01-19
ministral-3:8b ministral-3:8b 262.1K 128K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-01-19
rnj-1:8b rnj-1:8b 32.8K 4.1K - - 🔧 - In: text
Out: text
Open Weights
Released: 2025-12-06
Updated: 2026-01-19
kimi-k2.7-code kimi-k2.7-code 262.1K 262.1K - - 📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-06-12
glm-5.1 glm-5.1 202.8K 131.1K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-03-27
Updated: 2026-04-07
deepseek-v4-pro deepseek-v4-pro 1M 1M - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-04-24
glm-4.6 glm-4.6 202.8K 131.1K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-09-29
Updated: 2026-01-19
kimi-k2-thinking kimi-k2-thinking 262.1K 262.1K - - 🧠 🔧 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Updated: 2026-01-19
nemotron-3-super nemotron-3-super 262.1K 65.5K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-03-12
ministral-3:14b ministral-3:14b 262.1K 128K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-01-19
minimax-m3 minimax-m3 512K 131.1K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-31
minimax-m2 minimax-m2 204.8K 128K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-10-23
Updated: 2026-01-19
qwen3-next:80b qwen3-next:80b 262.1K 32.8K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-09-15
Updated: 2026-01-19
qwen3-coder:480b qwen3-coder:480b 262.1K 65.5K - - 🔧 - In: text
Out: text
Open Weights
Released: 2025-07-22
Updated: 2026-01-19
kimi-k2:1t kimi-k2:1t 262.1K 262.1K - - 🔧 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Updated: 2026-01-19
minimax-m2.7 minimax-m2.7 196.6K 196.6K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-03-18
deepseek-v3.1:671b deepseek-v3.1:671b 163.8K 163.8K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-21
Updated: 2026-01-19
kimi-k2.5 kimi-k2.5 262.1K 262.1K - - 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-01-27
gemma4:31b gemma4:31b 262.1K 262.1K - - 📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-02
Updated: 2026-04-08
ministral-3:3b ministral-3:3b 262.1K 128K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2024-10-22
Updated: 2026-01-19
gemma3:12b gemma3:12b 131.1K 131.1K - - 📎 - In: text, image
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-01-19
gemma3:4b gemma3:4b 131.1K 131.1K - - 📎 - In: text, image
Out: text
Open Weights
Released: 2024-12-01
Updated: 2026-01-19
qwen3.5:397b qwen3.5:397b 262.1K 65.5K - - 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-02-15
Updated: 2026-02-17
kimi-k2.6 kimi-k2.6 262.1K 262.1K - - 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-20
qwen3-coder-next qwen3-coder-next 262.1K 65.5K - - 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-02
Updated: 2026-02-08
qwen3-vl:235b-instruct qwen3-vl:235b-instruct 262.1K 131.1K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2025-09-22
Updated: 2026-01-19
mistral-large-3:675b mistral-large-3:675b 262.1K 262.1K - - 📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Updated: 2026-01-19
gemma3:27b gemma3:27b 131.1K 131.1K - - 📎 - In: text, image
Out: text
Open Weights
Released: 2025-07-27
Updated: 2026-01-19
qwen3-vl:235b qwen3-vl:235b 262.1K 32.8K - - 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2025-09-22
Updated: 2026-01-19
nemotron-3-ultra nemotron-3-ultra 262.1K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
gemini-3-flash-preview gemini-3-flash-preview 1M 65.5K - - 📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2025-12-17
Updated: 2026-04-08
gpt-oss:20b gpt-oss:20b 131.1K 32.8K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-05
Updated: 2026-01-19
glm-5 glm-5 202.8K 131.1K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-11
deepseek-v3.2 deepseek-v3.2 163.8K 65.5K - - 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-06-15
Updated: 2026-01-19
devstral-2:123b devstral-2:123b 262.1K 262.1K - - 🔧 - In: text
Out: text
Open Weights
Released: 2025-12-09
Updated: 2026-01-19

OpenAI

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
o3 o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
TEXT-EMBEDDING-3-LARGE text-embedding-3-large 64K 2K Input: $7
Output: $10
Cache Read: $0.05
Cache Write: $0.4
Model: 3.500
Completion: 1.429
Cache: 0.007
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-12-15
Updated: 2023-10-01
GPT-5.2 Pro gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5 gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-3.5-turbo gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Cache Read: $0
Model: 0.250
Completion: 3.000
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
GPT-5 Pro gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
GPT-4o gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4 gpt-4 8.2K 8.2K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o4-mini o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o3-pro o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-06-10
chatgpt-image-latest chatgpt-image-latest - - - - 📎 - In: text, image
Out: text, image
Released: 2025-12-16
GPT-4o (2024-05-13) gpt-4o-2024-05-13 128K 4.1K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-5.4 nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-5 Chat (latest) gpt-5-chat-latest 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.3 Codex Spark gpt-5.3-codex-spark 128K 32K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT-5.1 Codex Max gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.3 Chat (latest) gpt-5.3-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-03
GPT-4o (2024-08-06) gpt-4o-2024-08-06 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-08-06
TEXT-EMBEDDING-ADA-002 text-embedding-ada-002 60K 1.5K Input: $6
Output: $12
Cache Read: $0.06
Cache Write: $0.45
Model: 3.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-11-20
Updated: 2023-10-01
o3-mini o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
TEXT-EMBEDDING-3-SMALL text-embedding-3-small 32K 1K Input: $4
Output: $8
Cache Read: $0.04
Cache Write: $0.3
Model: 2.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-11-10
Updated: 2023-10-01
GPT-5.1 Codex mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Chat gpt-5.1-chat-latest 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 Chat gpt-5.2-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
o4-mini-deep-research o4-mini-deep-research 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
gpt-image-1.5 gpt-image-1.5 - - - - 📎 - In: text, image
Out: text, image
Released: 2025-11-25
GPT-4.1 nano gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o (2024-11-20) gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-11-20
o1 o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
o1-pro o1-pro 200K 100K Input: $150
Output: $600
Model: 75.000
Completion: 4.000
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2025-03-19
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
o3-deep-research o3-deep-research 200K 100K Input: $10
Output: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
GPT-5 Mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-IMAGE-1 gpt-image-1 1K 512 Input: $10
Output: $20
Cache Read: $0.1
Cache Write: $0.6
Model: 5.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: image
Open Weights
Released: 2024-01-15
Updated: 2024-10-01
GPT-4.1 mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-4 Turbo gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
gpt-image-1-mini gpt-image-1-mini - - - - 📎 - In: text, image
Out: text, image
Released: 2025-09-26
GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.4 Pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GPT-5.5 Pro gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT-4o mini gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
GPT-5-Codex gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5.1 gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
DALL-E 2 dall-e-2 1K 1 Input: $0.02
Output: $0.1
Cache Read: $0.01
Cache Write: $0.05
Model: 0.010
Completion: 5.000
Cache: 0.500
📎 🔧 2021-04 In: text
Out: image
Released: 2022-04-06
Updated: 2022-06-15
DALL-E 3 dall-e-3 2K 1 Input: $0.03
Output: $0.15
Cache Read: $0.01
Cache Write: $0.05
Model: 0.015
Completion: 5.000
Cache: 0.333
📎 🔧 2024-04 In: text
Out: image
Released: 2024-03-01
Updated: 2024-08-15
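
Note on NewAPI Ratios: the figures in the rows above are consistent with a simple derivation from the listed prices. Model appears to be the input price divided by 2, Completion the output price divided by the input price, and Cache the cache-read price divided by the input price. Free models (input price $0) show "-" instead. The sketch below assumes that relationship, which is inferred from this table rather than taken from NewAPI documentation. `newapi_ratios` is a hypothetical helper name, and displayed values may differ in the last digit because of rounding.

```python
from typing import Optional


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Optional[dict]:
    """Derive ratio values from USD prices per 1M tokens.

    Assumption (inferred from the table, not from official docs):
    model = input / 2, completion = output / input, cache = cache_read / input.
    """
    if input_price <= 0:
        # Free models have no meaningful ratios; the table shows "-".
        return None
    ratios = {
        "model": round(input_price / 2, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# Prices from the GPT-5.4 row: Input $2.5, Output $15, Cache Read $0.25.
print(newapi_ratios(2.5, 15, 0.25))
# {'model': 1.25, 'completion': 6.0, 'cache': 0.1}
```

The result matches the Model: 1.250, Completion: 6.000, Cache: 0.100 values listed for that row.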

OpenCode Zen

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Ring 2.6 1T Free ring-2.6-1t-free 262K 66K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-05-08
MiMo V2 Pro Free mimo-v2-pro-free 1M 64K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-03-18
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax M2.5 minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-02-12
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.1
Model: 0.300
Completion: 3.667
Cache: 0.167
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
MiMo V2.5 Free mimo-v2.5-free 200K 32K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-24
Kimi K2 kimi-k2 262.1K 262.1K Input: $0.4
Output: $2.5
Cache Read: $0.4
Model: 0.200
Completion: 6.250
Cache: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
MiniMax M2.1 minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.1
Model: 0.150
Completion: 4.000
Cache: 0.333
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-12-23
Nemotron 3 Ultra Free nemotron-3-ultra-free 1M 128K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-06-04
GLM-4.7 Free glm-4.7-free 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
Gemini 3 Flash gemini-3-flash 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
DeepSeek V4 Flash Free deepseek-v4-flash-free 200K 128K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Claude Sonnet 4 claude-sonnet-4 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.5 claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
GPT-5 gpt-5 400K 128K Input: $1.07
Output: $8.5
Cache Read: $0.107
Model: 0.535
Completion: 7.944
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Gemini 3.5 Flash gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Input Audio: $1.5
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
MiniMax M3 Free minimax-m3-free 200K 32K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-31
Gemini 3 Pro gemini-3-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Kimi K2.5 Free kimi-k2.5-free 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
GLM-5.1 glm-5.1 204.8K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-04-07
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.84
Cache Read: $0.145
Model: 0.870
Completion: 2.207
Cache: 0.083
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
GLM-4.6 glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.1
Model: 0.300
Completion: 3.667
Cache: 0.167
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
Kimi K2 Thinking kimi-k2-thinking 262.1K 262.1K Input: $0.4
Output: $2.5
Cache Read: $0.4
Model: 0.200
Completion: 6.250
Cache: 1.000
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Claude Sonnet 4.5 claude-sonnet-4-5 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
GPT-5.4 Nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Qwen3 Coder qwen3-coder 262.1K 65.5K Input: $0.45
Output: $1.8
Model: 0.225
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Hy3 preview Free hy3-preview-free 256K 64K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-04-20
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.07
Output: $8.5
Cache Read: $0.107
Model: 0.535
Completion: 7.944
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.3 Codex Spark gpt-5.3-codex-spark 128K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text
Out: text
Released: 2026-02-12
GPT-5.1 Codex Max gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Claude Opus 4.8 claude-opus-4-8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Ling 2.6 Flash Free ling-2.6-flash-free 262.1K 32.8K Input: $0
Output: $0
- 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-04-21
Qwen3.5 Plus qwen3.5-plus 262.1K 65.5K Input: $0.2
Output: $1.2
Cache Read: $0.02
Cache Write: $0.25
Model: 0.100
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Claude Haiku 3.5 claude-3-5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
GPT-5.2 gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-24
MiniMax M2.7 minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-03-18
North Mini Code Free north-mini-code-free 256K 64K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-09-23 In: text
Out: text
Open Weights
Released: 2026-06-09
Grok Code Fast 1 grok-code 256K 256K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-20
Claude Opus 4.1 claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
GPT-5.1 Codex Mini gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Kimi K2.5 kimi-k2.5 262.1K 65.5K Input: $0.6
Output: $3
Cache Read: $0.08
Model: 0.300
Completion: 5.000
Cache: 0.133
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Claude Fable 5 claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Haiku 4.5 claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
GPT-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 Mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
MiniMax M2.5 Free minimax-m2.5-free 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-02-12
Kimi K2.6 kimi-k2.6 262.1K 65.5K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
MiniMax M2.1 Free minimax-m2.1-free 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-12-23
GPT-5 Nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.4 Pro gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Cache Read: $30
Model: 15.000
Completion: 6.000
Cache: 1.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
MiMo V2 Flash Free mimo-v2-flash-free 262.1K 65.5K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-12-16
Gemini 3.1 Pro Preview gemini-3.1-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Trinity Large Preview trinity-large-preview-free 131.1K 131.1K Input: $0
Output: $0
- 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-01-28
Big Pickle big-pickle 200K 32K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-10-17
GPT-5.5 Pro gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Cache Read: $30
Model: 15.000
Completion: 6.000
Cache: 1.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-24
Qwen3.6 Plus Free qwen3.6-plus-free 262.1K 65.5K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
GLM-5 glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-02-11
MiMo V2 Omni Free mimo-v2-omni-free 262.1K 64K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, pdf
Out: text
Open Weights
Released: 2026-03-18
GPT-5 Codex gpt-5-codex 400K 128K Input: $1.07
Output: $8.5
Cache Read: $0.107
Model: 0.535
Completion: 7.944
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-01-14
GLM-5 Free glm-5-free 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-02-11
Qwen3.6 Plus qwen3.6-plus 262.1K 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
Nemotron 3 Super Free nemotron-3-super-free 204.8K 128K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-03-11
Grok Build 0.1 grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-20
GPT-5.1 gpt-5.1 400K 128K Input: $1.07
Output: $8.5
Cache Read: $0.107
Model: 0.535
Completion: 7.944
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23

OpenCode Go

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax M2.5 minimax-m2.5 204.8K 65.5K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-02-12
Qwen3.7 Plus qwen3.7-plus 1M 65.5K Input: $0.4
Output: $1.6
Cache Read: $0.04
Cache Write: $0.5
Model: 0.200
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Kimi K2.7 Code kimi-k2.7-code 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
GLM-5.1 glm-5.1 202.8K 32.8K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-04-07
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Cache Read: $0.0145
Model: 0.870
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax M3 (3x usage) minimax-m3 512K 131.1K Input: $0.1
Output: $0.4
Cache Read: $0.02
Model: 0.050
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-31
Qwen3.5 Plus qwen3.5-plus 262.1K 65.5K Input: $0.2
Output: $1.2
Cache Read: $0.02
Cache Write: $0.25
Model: 0.100
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
MiniMax M2.7 minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-03-18
Kimi K2.5 kimi-k2.5 262.1K 65.5K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
MiMo V2.5 mimo-v2.5 1M 128K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo V2 Omni mimo-v2-omni 262.1K 128K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, pdf
Out: text
Open Weights
Released: 2026-03-18
Kimi K2.6 kimi-k2.6 262.1K 65.5K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
MiMo V2 Pro mimo-v2-pro 1M 128K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-03-18
MiMo V2.5 Pro mimo-v2.5-pro 1M 128K Input: $1.74
Output: $3.48
Cache Read: $0.0145
Model: 0.870
Completion: 2.000
Cache: 0.008
📎 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
GLM-5 glm-5 202.8K 32.8K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-02-11
Qwen3.6 Plus qwen3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
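
Note on Pricing (1M): prices in these tables are read here as USD per one million tokens. The sketch below estimates the cost of a single request under that reading. `estimate_cost` is a hypothetical helper, and billing cached input tokens at the Cache Read price instead of the Input price is an assumption, not a rule confirmed by any provider listed here.

```python
from typing import Optional

TOKENS_PER_UNIT = 1_000_000  # prices are assumed to be quoted per 1M tokens


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: float,
    output_price: float,
    cached_tokens: int = 0,
    cache_read_price: Optional[float] = None,
) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    # Assumption: cached input tokens are billed at the cache-read price;
    # if no cache-read price is listed, they are billed as normal input.
    cache_price = input_price if cache_read_price is None else cache_read_price
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * input_price
        + cached_tokens * cache_price
        + output_tokens * output_price
    )
    return total / TOKENS_PER_UNIT


# Prices from the Qwen3.6 Plus row above: Input $0.5, Output $3.
cost = estimate_cost(10_000, 2_000, input_price=0.5, output_price=3.0)
print(f"${cost:.4f}")  # $0.0110
```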

OpenRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Ling-2.6-1T inclusionai/ling-2.6-1t 262.1K 32.8K Input: $0.075
Output: $0.625
Cache Read: $0.015
Model: 0.037
Completion: 8.333
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-04-23
Ring-2.6-1T inclusionai/ring-2.6-1t 262.1K 65.5K Input: $0.075
Output: $0.625
Cache Read: $0.015
Model: 0.037
Completion: 8.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-08
Ling-2.6-flash inclusionai/ling-2.6-flash 262.1K 32.8K Input: $0.01
Output: $0.03
Cache Read: $0.002
Model: 0.005
Completion: 3.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-04-21
Granite 4.0 Micro ibm-granite/granite-4.0-h-micro 131K 131K Input: $0.017
Output: $0.112
Model: 0.009
Completion: 6.588
🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-20
Granite 4.1 8B ibm-granite/granite-4.1-8b 131.1K 131.1K Input: $0.05
Output: $0.1
Cache Read: $0.05
Model: 0.025
Completion: 2.000
Cache: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-30
Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct 131.1K 16.4K Input: $0.02
Output: $0.03
Model: 0.010
Completion: 1.500
🔧 🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 3 70B Instruct meta-llama/llama-3-70b-instruct 8.2K 8K Input: $0.51
Output: $0.74
Model: 0.255
Completion: 1.451
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-04-18
Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct 131.1K 16.4K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🔧 🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct 60K 60K Input: $0.027
Output: $0.201
Model: 0.013
Completion: 7.444
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-09-25
Llama 4 Maverick meta-llama/llama-4-maverick 1M 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2024-08-31 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct 131.1K 16.4K Input: $0.345
Output: $0.345
Model: 0.172
Completion: 1.000
📎 🌡️ 2023-12-31 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free 65.5K 131.1K Input: $0
Output: $0
- 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-3.3-70B-Instruct meta-llama/llama-3.3-70b-instruct 131.1K 16.4K Input: $0.1
Output: $0.32
Model: 0.050
Completion: 3.200
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 3.2 3B Instruct (free) meta-llama/llama-3.2-3b-instruct:free 131.1K 131.1K Input: $0
Output: $0
- 🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-09-25
Llama Guard 4 12B meta-llama/llama-guard-4-12b 163.8K 16.4K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
📎 🌡️ 2024-08-31 In: image, text
Out: text
Open Weights
Released: 2025-04-30
Llama 3 8B Instruct meta-llama/llama-3-8b-instruct 8.2K 8.2K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-04-18
Llama 4 Scout meta-llama/llama-4-scout 327.7K 16.4K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 🔧 🌡️ 2024-08-31 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct 80K 80K Input: $0.0509
Output: $0.335
Model: 0.025
Completion: 6.582
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-09-25
Anthropic Claude Haiku Latest ~anthropic/claude-haiku-latest 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-27
Claude Fable Latest ~anthropic/claude-fable-latest 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-06-09
Anthropic Claude Sonnet Latest ~anthropic/claude-sonnet-latest 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-27
Claude Opus Latest ~anthropic/claude-opus-latest 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-21
Kimi K2 0711 moonshotai/kimi-k2 131.1K 32.8K Input: $0.57
Output: $2.3
Model: 0.285
Completion: 4.035
🔧 🌡️ 2024-12-31 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2.7 Code moonshotai/kimi-k2.7-code 262.1K 262.1K Input: $0.75
Output: $3.5
Cache Read: $0.16
Model: 0.375
Completion: 4.667
Cache: 0.213
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-06-12
Kimi K2 Thinking moonshotai/kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2025-11-06
Kimi K2.5 moonshotai/kimi-k2.5 256K 262.1K Input: $0.375
Output: $2.025
Model: 0.188
Completion: 5.400
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0.68
Output: $3.41
Cache Read: $0.34
Model: 0.340
Completion: 5.015
Cache: 0.500
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Kimi K2 0905 moonshotai/kimi-k2-0905 262.1K 262.1K Input: $0.6
Output: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-12-31 In: text
Out: text
Open Weights
Released: 2025-09-04
ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b 123K 16K Input: $0.42
Output: $1.25
Model: 0.210
Completion: 2.976
📎 🧠 🌡️ 2025-03-31 In: image, text
Out: text
Open Weights
Released: 2025-06-30
Perceptron Mk1 perceptron/perceptron-mk1 32.8K 8.2K Input: $0.15
Output: $1.5
Model: 0.075
Completion: 10.000
📎 🧠 🌡️ - In: text, image, video
Out: text
Released: 2026-05-12
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $0.083333
Reasoning: $1.5
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemma 3n 4B google/gemma-3n-e4b-it 32.8K 32.8K Input: $0.06
Output: $0.12
Model: 0.030
Completion: 2.000
🌡️ 2024-08-31 In: text
Out: text
Open Weights
Released: 2025-05-20
Gemma 4 26B A4B (free) google/gemma-4-26b-a4b-it:free 262.1K 32.8K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-02
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Cache Write: $0.083333
Reasoning: $2.5
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Cache Write: $0.083333
Reasoning: $9
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Cache Write: $0.083333
Reasoning: $0.4
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-31 In: text, image, pdf, audio, video
Out: text
Released: 2025-09-25
Gemma 4 31B IT google/gemma-4-31b-it 262.1K 262.1K Input: $0.12
Output: $0.35
Cache Read: $0.09
Model: 0.060
Completion: 2.917
Cache: 0.750
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-02
Lyria 3 Clip Preview google/lyria-3-clip-preview 1M 65.5K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text, audio
Released: 2026-03-30
Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $0.375
Reasoning: $12
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Nano Banana Pro google/gemini-3-pro-image-preview 65.5K 32.8K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $0.375
Reasoning: $12
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🌡️ 2025-01 In: text, image
Out: text, image
Released: 2025-11-20
Nano Banana google/gemini-2.5-flash-image 32.8K 32.8K Input: $0.3
Output: $2.5
Cache Read: $0.03
Cache Write: $0.083333
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🌡️ 2025-06 In: text, image
Out: text, image
Released: 2025-08-26
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Cache Write: $0.083333
Reasoning: $0.4
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Nano Banana 2 google/gemini-3.1-flash-image-preview 65.5K 65.5K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
📎 🧠 🌡️ 2025-01 In: image, text
Out: text, image
Released: 2026-02-26
Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-31 In: text, image, pdf, audio, video
Out: text
Released: 2025-05-07
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $0.375
Reasoning: $12
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it 262.1K 262.1K Input: $0.06
Output: $0.33
Model: 0.030
Completion: 5.500
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-02
Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Cache Write: $0.375
Reasoning: $10
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-31 In: pdf, image, text, audio
Out: text
Released: 2025-06-05
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.083333
Reasoning: $3
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Gemma 3 12B google/gemma-3-12b-it 131.1K 16.4K Input: $0.05
Output: $0.15
Model: 0.025
Completion: 3.000
📎 🔧 🌡️ 2024-08-31 In: text, image
Out: text
Open Weights
Released: 2025-03-13
Gemma 3 4B google/gemma-3-4b-it 131.1K 16.4K Input: $0.05
Output: $0.1
Model: 0.025
Completion: 2.000
📎 🌡️ 2024-08-31 In: text, image
Out: text
Open Weights
Released: 2025-03-13
Gemma 3 27B google/gemma-3-27b-it 131.1K 16.4K Input: $0.08
Output: $0.16
Model: 0.040
Completion: 2.000
📎 🔧 🌡️ 2024-08-31 In: text, image
Out: text
Open Weights
Released: 2025-03-12
Lyria 3 Pro Preview google/lyria-3-pro-preview 1M 65.5K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text, audio
Released: 2026-03-30
Gemma 4 31B (free) google/gemma-4-31b-it:free 262.1K 32.8K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-04-02
Gemma 2 27B google/gemma-2-27b-it 8.2K 2K Input: $0.65
Output: $0.65
Model: 0.325
Completion: 1.000
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2024-07-13
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $0.083333
Reasoning: $1.5
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
LFM2.5-1.2B-Thinking (free) liquid/lfm-2.5-1.2b-thinking:free 32.8K 32.8K Input: $0
Output: $0
- 🧠 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-01-20
LFM2-24B-A2B liquid/lfm-2-24b-a2b 32.8K 32.8K Input: $0.03
Output: $0.12
Model: 0.015
Completion: 4.000
🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-25
LFM2.5-1.2B-Instruct (free) liquid/lfm-2.5-1.2b-instruct:free 32.8K 32.8K Input: $0
Output: $0
- 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2026-01-20
Grok 4.20 x-ai/grok-4.20 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ 2025-09-01 In: text, image, pdf
Out: text
Released: 2026-03-31
Grok 4.3 x-ai/grok-4.3 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-17
Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent 2M 2M Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🧠 🌡️ 2025-09-01 In: text, image, pdf
Out: text
Released: 2026-03-31
Grok Build 0.1 x-ai/grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-16
Google Gemini Pro Latest ~google/gemini-pro-latest 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $0.375
Reasoning: $12
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: audio, pdf, image, text, video
Out: text
Released: 2026-04-27
Google Gemini Flash Latest ~google/gemini-flash-latest 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Cache Write: $0.083333
Reasoning: $9
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video, pdf, audio
Out: text
Released: 2026-04-27
Phi 4 Mini Instruct microsoft/phi-4-mini-instruct 128K 128K Input: $0.08
Output: $0.35
Cache Read: $0.08
Model: 0.040
Completion: 4.375
Cache: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-17
Phi 4 microsoft/phi-4 16.4K 16.4K Input: $0.065
Output: $0.14
Model: 0.033
Completion: 2.154
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-01-10
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b 65.5K 8K Input: $0.62
Output: $0.62
Model: 0.310
Completion: 1.000
🌡️ 2024-04-30 In: text
Out: text
Open Weights
Released: 2024-04-16
Laguna XS.2 (free) poolside/laguna-xs.2:free 262.1K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-28
Laguna M.1 (free) poolside/laguna-m.1:free 262.1K 32.8K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28
Palmyra X5 writer/palmyra-x5 1M 8.2K Input: $0.6
Output: $6
Model: 0.300
Completion: 10.000
🌡️ - In: text
Out: text
Released: 2026-01-21
GLM-4.7 z-ai/glm-4.7 202.8K 131.1K Input: $0.4
Output: $1.75
Cache Read: $0.08
Model: 0.200
Completion: 4.375
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-4.5V z-ai/glm-4.5v 65.5K 16.4K Input: $0.6
Output: $1.8
Cache Read: $0.11
Model: 0.300
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5 z-ai/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5.1 z-ai/glm-5.1 202.8K 131.1K Input: $0.98
Output: $3.08
Cache Read: $0.182
Model: 0.490
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM-4.6 z-ai/glm-4.6 202.8K 131.1K Input: $0.43
Output: $1.74
Cache Read: $0.08
Model: 0.215
Completion: 4.047
Cache: 0.186
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.6V z-ai/glm-4.6v 131.1K 32.8K Input: $0.3
Output: $0.9
Cache Read: $0.055
Model: 0.150
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
GLM-4.5-Air z-ai/glm-4.5-air 131.1K 131.1K Input: $0.125
Output: $0.85
Cache Read: $0.06
Model: 0.063
Completion: 6.800
Cache: 0.480
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.7-Flash z-ai/glm-4.7-flash 202.8K 16.4K Input: $0.06
Output: $0.4
Cache Read: $0.01
Model: 0.030
Completion: 6.667
Cache: 0.167
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-5 z-ai/glm-5 202.8K 16.4K Input: $0.6
Output: $1.92
Cache Read: $0.12
Model: 0.300
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-5-Turbo z-ai/glm-5-turbo 262.1K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16
GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-10-31 In: text, image, pdf
Out: text
Released: 2024-07-18
gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b 131.1K 65.5K Input: $0.075
Output: $0.3
Cache Read: $0.037
Model: 0.037
Completion: 4.000
Cache: 0.493
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-29
GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct 4.1K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-09-30 In: text
Out: text
Released: 2023-09-28
GPT-5.2 Chat openai/gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 2025-08-31 In: pdf, image, text
Out: text
Released: 2025-12-10
o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
o4 Mini High openai/o4-mini-high 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-06-30 In: image, text, pdf
Out: text
Released: 2025-04-16
GPT Audio openai/gpt-audio 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🔧 🌡️ - In: text, audio
Out: text, audio
Released: 2026-01-19
GPT-5.2 Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: image, text, pdf
Out: text
Released: 2025-12-11
GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
- 2023-10-31 In: text
Out: text
Released: 2025-03-12
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image, pdf
Out: text
Released: 2025-08-07
GPT-5 Chat openai/gpt-5-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 2024-09-30 In: pdf, image, text
Out: text
Released: 2025-08-07
GPT-3.5-turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
GPT-5 Pro openai/gpt-5-pro 400K 128K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: image, text, pdf
Out: text
Released: 2025-10-06
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4 openai/gpt-4 8.2K 4.1K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: image, text, pdf
Out: text
Released: 2025-04-16
GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k 16.4K 4.1K Input: $3
Output: $4
Model: 1.500
Completion: 1.333
🔧 🌡️ 2021-09-30 In: text
Out: text
Released: 2023-08-28
o3-pro openai/o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, pdf, image
Out: text
Released: 2025-06-10
GPT-5.1 Chat openai/gpt-5.1-chat 128K 32K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🔧 2024-09-30 In: pdf, image, text
Out: text
Released: 2025-11-13
GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 128K 4.1K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
GPT-5.4 nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: pdf, image, text
Out: text
Released: 2026-03-17
GPT-5.3 Chat openai/gpt-5.3-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2026-03-03
GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 4.1K 4.1K Input: $1
Output: $2
Model: 0.500
Completion: 2.000
🔧 🌡️ 2021-09-30 In: text
Out: text
Released: 2024-01-25
GPT-5 Image Mini openai/gpt-5-image-mini 400K 128K Input: $2.5
Output: $2
Cache Read: $0.25
Model: 1.250
Completion: 0.800
Cache: 0.100
📎 🧠 🌡️ - In: pdf, image, text
Out: image, text
Released: 2025-10-16
GPT-5.1 Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Codex Max openai/gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
gpt-oss-120b (free) openai/gpt-oss-120b:free 131.1K 131.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-08-06
o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2024-05 In: text, pdf
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: pdf, image, text
Out: text
Released: 2025-12-11
GPT-5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT Audio Mini openai/gpt-audio-mini 128K 16.4K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
📎 🔧 🌡️ - In: text, audio
Out: text, audio
Released: 2026-01-19
GPT-5.1 Codex mini openai/gpt-5.1-codex-mini 400K 100K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
o4-mini-deep-research openai/o4-mini-deep-research 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2024-05 In: pdf, image, text
Out: text
Released: 2024-06-26
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: image, text, pdf
Out: text
Released: 2025-04-14
gpt-oss-120b openai/gpt-oss-120b 131.1K 32.8K Input: $0.039
Output: $0.18
Model: 0.019
Completion: 4.615
🧠 🔧 🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-11-20
o1 openai/o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
o1-pro openai/o1-pro 200K 100K Input: $150
Output: $600
Model: 75.000
Completion: 4.000
📎 🧠 2023-09 In: text, image, pdf
Out: text
Released: 2025-03-19
GPT Chat Latest openai/gpt-chat-latest 400K 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-05
GPT-5 Image openai/gpt-5-image 400K 128K Input: $10
Output: $10
Cache Read: $1.25
Model: 5.000
Completion: 1.000
Cache: 0.125
📎 🧠 🌡️ 2024-10-01 In: image, text, pdf
Out: image, text
Released: 2025-10-14
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: pdf, image, text
Out: text
Released: 2026-03-17
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
o3-deep-research openai/o3-deep-research 200K 100K Input: $10
Output: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2024-05 In: image, text, pdf
Out: text
Released: 2024-06-26
GPT-4 Turbo Preview openai/gpt-4-turbo-preview 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
🔧 🌡️ 2023-12-31 In: text
Out: text
Released: 2024-01-25
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image, pdf
Out: text
Released: 2025-08-07
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image, pdf
Out: text
Released: 2025-08-07
GPT-5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
o3 Mini High openai/o3-mini-high 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-10-31 In: text, pdf
Out: text
Released: 2025-02-12
GPT-5.4 Image 2 openai/gpt-5.4-image-2 272K 128K Input: $8
Output: $15
Cache Read: $2
Model: 4.000
Completion: 1.875
Cache: 0.250
📎 🧠 - In: image, text, pdf
Out: image, text
Released: 2026-04-21
GPT-4o Search Preview openai/gpt-4o-search-preview 128K 16.4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
- 2023-10-31 In: text
Out: text
Released: 2025-03-12
GPT-5.5 Pro openai/gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
gpt-oss-20b openai/gpt-oss-20b 131.1K 131.1K Input: $0.029
Output: $0.14
Model: 0.015
Completion: 4.828
🧠 🔧 🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5-Codex openai/gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5.2 Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
gpt-oss-20b (free) openai/gpt-oss-20b:free 131.1K 8.2K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: image, text, pdf
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 131.1K 131.1K Input: $0.3
Output: $0.5
Cache Read: $0.15
Model: 0.150
Completion: 1.667
Cache: 0.500
🌡️ 2024-04-30 In: text
Out: text
Open Weights
Released: 2025-09-27
Skyfall 36B V2 thedrummer/skyfall-36b-v2 32.8K 32.8K Input: $0.55
Output: $0.8
Cache Read: $0.25
Model: 0.275
Completion: 1.455
Cache: 0.455
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2025-03-10
UnslopNemo 12B thedrummer/unslopnemo-12b 32.8K 32.8K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🔧 🌡️ 2024-04-30 In: text
Out: text
Open Weights
Released: 2024-11-08
Rocinante 12B thedrummer/rocinante-12b 32.8K 32.8K Input: $0.17
Output: $0.43
Model: 0.085
Completion: 2.529
🔧 🌡️ 2024-04-30 In: text
Out: text
Open Weights
Released: 2024-09-30
UI-TARS 7B bytedance/ui-tars-1.5-7b 128K 2K Input: $0.1
Output: $0.2
Cache Read: $0.1
Model: 0.050
Completion: 2.000
Cache: 1.000
📎 🌡️ 2025-01-31 In: image, text
Out: text
Open Weights
Released: 2025-07-22
Reka Flash 3 rekaai/reka-flash-3 65.5K 65.5K Input: $0.1
Output: $0.2
Model: 0.050
Completion: 2.000
🧠 🌡️ 2025-01-31 In: text
Out: text
Open Weights
Released: 2025-03-12
Reka Edge rekaai/reka-edge 16.4K 16.4K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
📎 🔧 🌡️ - In: image, text, video
Out: text
Open Weights
Released: 2026-03-20
Mistral Large 2407 mistralai/mistral-large-2407 131.1K 131.1K Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-03-31 In: text, pdf
Out: text
Released: 2024-11-19
Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct 128K 16.4K Input: $0.075
Output: $0.2
Model: 0.037
Completion: 2.667
📎 🔧 🌡️ 2023-10-31 In: image, text
Out: text
Open Weights
Released: 2025-06-20
Mistral Nemo mistralai/mistral-nemo 131.1K 131.1K Input: $0.02
Output: $0.03
Model: 0.010
Completion: 1.500
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-01
Mistral Medium 3.5 mistralai/mistral-medium-3-5 262.1K 262.1K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-30
Ministral 3 8B 2512 mistralai/ministral-8b-2512 262.1K 262.1K Input: $0.15
Output: $0.15
Cache Read: $0.015
Model: 0.075
Completion: 1.000
Cache: 0.100
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct 128K 128K Input: $0.351
Output: $0.555
Model: 0.175
Completion: 1.581
📎 🌡️ 2023-10-31 In: text, image
Out: text
Open Weights
Released: 2025-03-17
Saba mistralai/mistral-saba 32.8K 32.8K Input: $0.2
Output: $0.6
Cache Read: $0.02
Model: 0.100
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-09-30 In: text, pdf
Out: text
Released: 2025-02-17
Mistral Large mistralai/mistral-large 128K 128K Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-11-30 In: text, pdf
Out: text
Released: 2024-02-26
Mistral Medium 3.1 mistralai/mistral-medium-3.1 131.1K 262.1K Input: $0.4
Output: $2
Cache Read: $0.04
Model: 0.200
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2025-06-30 In: text, image, pdf
Out: text
Released: 2025-08-13
Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 32.8K 16.4K Input: $0.05
Output: $0.08
Model: 0.025
Completion: 1.600
🌡️ 2023-10-31 In: text
Out: text
Open Weights
Released: 2025-01-30
Ministral 3 3B 2512 mistralai/ministral-3b-2512 131.1K 131.1K Input: $0.1
Output: $0.1
Cache Read: $0.01
Model: 0.050
Completion: 1.000
Cache: 0.100
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Mistral Small 4 mistralai/mistral-small-2603 262.1K 262.1K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Ministral 3 14B 2512 mistralai/ministral-14b-2512 262.1K 262.1K Input: $0.2
Output: $0.2
Cache Read: $0.02
Model: 0.100
Completion: 1.000
Cache: 0.100
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-12-02
Devstral 2 mistralai/devstral-2512 262.1K 262.1K Input: $0.4
Output: $2
Cache Read: $0.04
Model: 0.200
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2025-12 In: text, pdf
Out: text
Open Weights
Released: 2025-12-09
Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct 65.5K 65.5K Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-01-31 In: text, pdf
Out: text
Open Weights
Released: 2024-04-17
Mistral Medium 3 mistralai/mistral-medium-3 131.1K 131.1K Input: $0.4
Output: $2
Cache Read: $0.04
Model: 0.200
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-07
Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 32K 32K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ - In: text, audio, pdf
Out: text
Open Weights
Released: 2025-10-30
Mistral Large 3 mistralai/mistral-large-2512 262.1K 262.1K Input: $0.5
Output: $1.5
Cache Read: $0.05
Model: 0.250
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2024-11 In: text, image, pdf
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
Codestral 2508 mistralai/codestral-2508 256K 256K Input: $0.3
Output: $0.9
Cache Read: $0.03
Model: 0.150
Completion: 3.000
Cache: 0.100
📎 🔧 🌡️ 2025-03-31 In: text, pdf
Out: text
Released: 2025-08-01
Morph V3 Fast morph/morph-v3-fast 81.9K 38K Input: $0.8
Output: $1.2
Model: 0.400
Completion: 1.500
🌡️ - In: text
Out: text
Released: 2025-07-07
Morph V3 Large morph/morph-v3-large 262.1K 131.1K Input: $0.9
Output: $1.9
Model: 0.450
Completion: 2.111
🌡️ - In: text
Out: text
Released: 2025-07-07
Seed 1.6 Flash bytedance-seed/seed-1.6-flash 262.1K 32.8K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2025-12-23
Seed 1.6 bytedance-seed/seed-1.6 262.1K 32.8K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2025-12-23
Seed-2.0-Mini bytedance-seed/seed-2.0-mini 262.1K 131.1K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-26
Seed-2.0-Lite bytedance-seed/seed-2.0-lite 262.1K 131.1K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-03-10
Magnum v4 72B anthracite-org/magnum-v4-72b 16.4K 2K Input: $3
Output: $5
Model: 1.500
Completion: 1.667
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2024-10-22
Nemotron 3 Nano 30B A3B (free) nvidia/nemotron-3-nano-30b-a3b:free 256K 256K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-15
Nemotron Nano 9B V2 (free) nvidia/nemotron-nano-9b-v2:free 128K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-18
Nemotron Nano 12B 2 VL (free) nvidia/nemotron-nano-12b-v2-vl:free 128K 128K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2025-10-28
Nemotron 3 Nano Omni (free) nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free 256K 65.5K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-28
Nemotron 3 Ultra (free) nvidia/nemotron-3-ultra-550b-a55b:free 1M 65.5K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
Nemotron 3 Ultra 550B A55B nvidia/nemotron-3-ultra-550b-a55b 262.1K 16.4K Input: $0.5
Output: $2.5
Cache Read: $0.15
Model: 0.250
Completion: 5.000
Cache: 0.300
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
Nemotron 3.5 Content Safety (free) nvidia/nemotron-3.5-content-safety:free 128K 8.2K Input: $0
Output: $0
- 📎 🧠 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-06-04
Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b 262.1K 228K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-15
Llama 3.3 Nemotron Super 49B v1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 131.1K 16.4K Input: $0.4
Output: $0.4
Model: 0.200
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-25
Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free 262.1K 262.1K Input: $0
Output: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Nemotron 3 Super 120B A12B nvidia/nemotron-3-super-120b-a12b 262.1K 262.1K Input: $0.09
Output: $0.45
Model: 0.045
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Uncensored (free) cognitivecomputations/dolphin-mistral-24b-venice-edition:free 32.8K 32.8K Input: $0
Output: $0
- 🌡️ 2024-04-30 In: text
Out: text
Open Weights
Released: 2025-07-09
MiMo-V2.5 xiaomi/mimo-v2.5 1M 131.1K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Flash xiaomi/mimo-v2-flash 262.1K 65.5K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro 1M 131.1K Input: $0.435
Output: $0.87
Cache Read: $0.0036
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Mercury 2 inception/mercury-2 128K 50K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-04
Claude 3.5 Haiku anthropic/claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-11-04
Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4.5 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 4 anthropic/claude-sonnet-4 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-31 In: image, text, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast 1M 128K Input: $30
Output: $150
Cache Read: $3
Cache Write: $37.5
Model: 15.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-07
Claude Haiku 4.5 (latest) anthropic/claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.7 (Fast) anthropic/claude-opus-4.7-fast 1M 128K Input: $30
Output: $150
Cache Read: $3
Cache Write: $37.5
Model: 15.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-12
Claude Opus 4.7 anthropic/claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.8 anthropic/claude-opus-4.8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Fable 5 anthropic/claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Opus 4.1 (latest) anthropic/claude-opus-4.1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Opus 4.5 (latest) anthropic/claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude 3 Haiku anthropic/claude-3-haiku 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-13
Claude Opus 4 anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-31 In: image, text, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.8 (Fast) anthropic/claude-opus-4.8-fast 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-27
Claude Opus 4.6 anthropic/claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct 131.1K 131.1K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-07-08
Hy3 preview tencent/hy3-preview 262.1K 262.1K Input: $0.063
Output: $0.21
Cache Read: $0.021
Model: 0.032
Completion: 3.333
Cache: 0.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-20
Cogito v2.1 671B deepcogito/cogito-v2.1-671b 128K 128K Input: $1.25
Output: $1.25
Model: 0.625
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Released: 2025-11-13
Command A cohere/command-a 256K 8.2K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ 2024-08-31 In: text
Out: text
Open Weights
Released: 2025-03-13
Command R cohere/command-r-08-2024 128K 4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
Command R7B cohere/command-r7b-12-2024 128K 4K Input: $0.0375
Output: $0.15
Model: 0.019
Completion: 4.000
🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-12-02
Command R+ cohere/command-r-plus-08-2024 128K 4K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-06-01 In: text
Out: text
Open Weights
Released: 2024-08-30
MythoMax 13B gryphe/mythomax-l2-13b 4.1K 4.1K Input: $0.06
Output: $0.06
Model: 0.030
Completion: 1.000
🌡️ 2023-06-30 In: text
Out: text
Open Weights
Released: 2023-07-02
Step 3.7 Flash stepfun/step-3.7-flash 256K 256K Input: $0.2
Output: $1.15
Cache Read: $0.04
Model: 0.100
Completion: 5.750
Cache: 0.200
📎 🧠 🔧 🌡️ 2026-01-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-29
Step 3.5 Flash stepfun/step-3.5-flash 262.1K 16.4K Input: $0.09
Output: $0.3
Cache Read: $0.02
Model: 0.045
Completion: 3.333
Cache: 0.222
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-02-13
INTELLECT-3 prime-intellect/intellect-3 131.1K 131.1K Input: $0.2
Output: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-11-27
Nex-N2-Pro (free) nex-agi/nex-n2-pro:free 262.1K 262.1K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-06-08
ReMM SLERP 13B undi95/remm-slerp-l2-13b 6.1K 4.1K Input: $0.45
Output: $0.65
Model: 0.225
Completion: 1.444
🌡️ 2023-06-30 In: text
Out: text
Open Weights
Released: 2023-07-22
OpenAI GPT Mini Latest ~openai/gpt-mini-latest 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: pdf, image, text
Out: text
Released: 2026-04-27
OpenAI GPT Latest ~openai/gpt-latest 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: pdf, image, text
Out: text
Released: 2026-04-27
MoonshotAI Kimi Latest ~moonshotai/kimi-latest 262.1K 262.1K Input: $0.68
Output: $3.41
Cache Read: $0.34
Model: 0.340
Completion: 5.015
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-27
Relace Search relace/relace-search 256K 128K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2025-12-08
Relace Apply 3 relace/relace-apply-3 256K 128K Input: $0.85
Output: $1.25
Model: 0.425
Completion: 1.471
- - In: text
Out: text
Released: 2025-09-26
Jamba Large 1.7 ai21/jamba-large-1.7 256K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
🔧 🌡️ 2024-08-31 In: text
Out: text
Open Weights
Released: 2025-08-08
Coder Large arcee-ai/coder-large 32.8K 32.8K Input: $0.5
Output: $0.8
Model: 0.250
Completion: 1.600
🌡️ 2025-03-31 In: text
Out: text
Released: 2025-05-05
Virtuoso Large arcee-ai/virtuoso-large 131.1K 64K Input: $0.75
Output: $1.2
Model: 0.375
Completion: 1.600
🔧 🌡️ 2025-03-31 In: text
Out: text
Released: 2025-05-05
Trinity Large Thinking arcee-ai/trinity-large-thinking 262.1K 262.1K Input: $0.22
Output: $0.85
Cache Read: $0.06
Model: 0.110
Completion: 3.864
Cache: 0.273
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-01
Trinity Mini arcee-ai/trinity-mini 131.1K 131.1K Input: $0.045
Output: $0.15
Model: 0.022
Completion: 3.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01
Weaver (alpha) mancer/weaver 8K 2K Input: $0.75
Output: $1
Model: 0.375
Completion: 1.333
🌡️ 2023-06-30 In: text
Out: text
Released: 2023-08-02
Sonar Reasoning Pro perplexity/sonar-reasoning-pro 128K 128K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🧠 🌡️ - In: text, image
Out: text
Released: 2025-03-07
Sonar perplexity/sonar 127.1K 127.1K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Released: 2025-01-27
Sonar Pro perplexity/sonar-pro 200K 8K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🌡️ - In: text, image
Out: text
Released: 2025-03-07
Sonar Pro Search perplexity/sonar-pro-search 200K 8K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🌡️ - In: text, image
Out: text
Released: 2025-10-30
Sonar Deep Research perplexity/sonar-deep-research 128K 128K Input: $2
Output: $8
Reasoning: $3
Model: 1.000
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Released: 2025-03-07
Switchpoint Router switchpoint/router 131.1K 131.1K Input: $0.85
Output: $3.4
Model: 0.425
Completion: 4.000
🧠 🌡️ - In: text
Out: text
Released: 2025-07-11
Body Builder (beta) openrouter/bodybuilder 128K 128K - - - - In: text
Out: text
Released: 2025-12-05
Free Models Router openrouter/free 200K 8K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-02-01
Fusion openrouter/fusion 128K 128K - - - - In: text
Out: text
Released: 2026-06-13
Owl Alpha openrouter/owl-alpha 1M 262.1K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28
Pareto Code Router openrouter/pareto-code 2M 200K - - - - In: text
Out: text
Released: 2026-04-21
Auto Router openrouter/auto 2M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, pdf, video
Out: text, image
Released: 2023-11-08
Qwen3.5 Plus 2026-04-20 qwen/qwen3.5-plus-20260420 1M 65.5K Input: $0.3
Output: $1.8
Cache Write: $0.375
Model: 0.150
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen3 Next 80B A3B Instruct (free) qwen/qwen3-next-80b-a3b-instruct:free 262.1K 262.1K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking 131.1K 32.8K Input: $0.26
Output: $2.6
Model: 0.130
Completion: 10.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Open Weights
Released: 2025-09-23
Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking 131.1K 32.8K Input: $0.13
Output: $1.56
Model: 0.065
Completion: 12.000
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Open Weights
Released: 2025-10-06
Qwen3 Coder Plus qwen/qwen3-coder-plus 1M 65.5K Input: $0.65
Output: $3.25
Cache Read: $0.13
Cache Write: $0.8125
Model: 0.325
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
Qwen Plus qwen/qwen-plus 1M 32.8K Input: $0.26
Output: $0.78
Cache Read: $0.052
Cache Write: $0.325
Model: 0.130
Completion: 3.000
Cache: 0.200
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen3-Coder 30B-A3B Instruct qwen/qwen3-coder-30b-a3b-instruct 160K 32.8K Input: $0.07
Output: $0.27
Model: 0.035
Completion: 3.857
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3 32B qwen/qwen3-32b 41K 16.4K Input: $0.08
Output: $0.28
Model: 0.040
Completion: 3.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-Next 80B-A3B Instruct qwen/qwen3-next-80b-a3b-instruct 262.1K 16.4K Input: $0.09
Output: $1.1
Model: 0.045
Completion: 12.222
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct 131.1K 32.8K Input: $0.08
Output: $0.5
Model: 0.040
Completion: 6.250
📎 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-10-14
Qwen3.7 Plus qwen/qwen3.7-plus 1M 65.5K Input: $0.32
Output: $1.28
Cache Read: $0.064
Cache Write: $0.4
Model: 0.160
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.6 35B-A3B qwen/qwen3.6-35b-a3b 262.1K 262.1K Input: $0.15
Output: $1
Cache Read: $0.05
Model: 0.075
Completion: 6.667
Cache: 0.333
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-17
Qwen3.7 Max qwen/qwen3.7-max 1M 65.5K Input: $1.25
Output: $3.75
Cache Read: $0.25
Cache Write: $1.5625
Model: 0.625
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3 Max qwen/qwen3-max 262.1K 32.8K Input: $0.78
Output: $3.9
Cache Read: $0.156
Cache Write: $0.975
Model: 0.390
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3 8B qwen/qwen3-8b 41K 8.2K Input: $0.05
Output: $0.4
Cache Read: $0.05
Model: 0.025
Completion: 8.000
Cache: 1.000
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen Plus 0728 qwen/qwen-plus-2025-07-28 1M 32.8K Input: $0.26
Output: $0.78
Model: 0.130
Completion: 3.000
🔧 🌡️ 2025-03-31 In: text
Out: text
Released: 2025-09-08
Qwen3.5-Flash qwen/qwen3.5-flash-02-23 1M 65.5K Input: $0.065
Output: $0.26
Model: 0.033
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-25
Qwen3 Coder 480B A35B (free) qwen/qwen3-coder:free 262K 262K Input: $0
Output: $0
- 🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 128K 32K Input: $0.04815
Output: $0.19305
Model: 0.024
Completion: 4.009
🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct 32.8K 32.8K Input: $0.66
Output: $1
Model: 0.330
Completion: 1.515
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen3-Next 80B-A3B (Thinking) qwen/qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.0975
Output: $0.78
Model: 0.049
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 262.1K 262.1K Input: $0.1
Output: $0.1
Cache Read: $0.1
Model: 0.050
Completion: 1.000
Cache: 1.000
🧠 🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct 131.1K 32.8K Input: $0.104
Output: $0.416
Model: 0.052
Completion: 4.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-23
Qwen3 Coder 480B A35B qwen/qwen3-coder 262.1K 65.5K Input: $0.22
Output: $1.8
Model: 0.110
Completion: 8.182
🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3.6 Flash qwen/qwen3.6-flash 1M 65.5K Input: $0.1875
Output: $1.125
Cache Write: $0.234375
Model: 0.094
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-27
Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 1M 65.5K Input: $0.26
Output: $1.56
Model: 0.130
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct 32.8K 32.8K Input: $0.04
Output: $0.1
Model: 0.020
Completion: 2.500
🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2024-10-16
Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking 131.1K 32.8K Input: $0.117
Output: $1.365
Model: 0.059
Completion: 11.667
📎 🧠 🔧 🌡️ - In: image, text
Out: text
Open Weights
Released: 2025-10-14
Qwen3 Max Thinking qwen/qwen3-max-thinking 262.1K 32.8K Input: $0.78
Output: $3.9
Model: 0.390
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-09
Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 131.1K 131.1K Input: $0.08
Output: $0.4
Cache Read: $0.08
Model: 0.040
Completion: 5.000
Cache: 1.000
🧠 🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-08-28
Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct 128K 128K Input: $0.8
Output: $1
Cache Read: $0.4
Model: 0.400
Completion: 1.250
Cache: 0.500
📎 🌡️ 2024-06-30 In: text, image
Out: text
Open Weights
Released: 2025-02-01
Qwen3.5 27B qwen/qwen3.5-27b 262.1K 65.5K Input: $0.195
Output: $1.56
Model: 0.098
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-23
Qwen3 235B-A22B qwen/qwen3-235b-a22b 131.1K 8.2K Input: $0.455
Output: $1.82
Model: 0.228
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct 32.8K 16.4K Input: $0.36
Output: $0.4
Model: 0.180
Completion: 1.111
🔧 🌡️ 2024-06-30 In: text
Out: text
Open Weights
Released: 2024-09-19
Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking 1M 32.8K Input: $0.26
Output: $0.78
Cache Write: $0.325
Model: 0.130
Completion: 3.000
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Released: 2025-09-08
Qwen3 Coder Next qwen/qwen3-coder-next 262.1K 262.1K Input: $0.11
Output: $0.8
Cache Read: $0.07
Model: 0.055
Completion: 7.273
Cache: 0.636
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-04
Qwen3.6 27B qwen/qwen3.6-27b 262.1K 262.1K Input: $0.2885
Output: $3.17
Model: 0.144
Completion: 10.988
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-22
Qwen3.5 35B-A3B qwen/qwen3.5-35b-a3b 262.1K 262.1K Input: $0.14
Output: $1
Cache Read: $0.05
Model: 0.070
Completion: 7.143
Cache: 0.357
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-23
Qwen3.5-9B qwen/qwen3.5-9b 262.1K 262.1K Input: $0.1
Output: $0.15
Model: 0.050
Completion: 1.500
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-03-10
Qwen3.5 397B-A17B qwen/qwen3.5-397b-a17b 262.1K 65.5K Input: $0.39
Output: $2.34
Model: 0.195
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-15
Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct 131.1K 32.8K Input: $0.13
Output: $0.52
Model: 0.065
Completion: 4.000
📎 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Open Weights
Released: 2025-10-06
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 262.1K 16.4K Input: $0.09
Output: $0.1
Model: 0.045
Completion: 1.111
🔧 🌡️ 2025-06-30 In: text
Out: text
Open Weights
Released: 2025-07-21
Qwen3 Coder Flash qwen/qwen3-coder-flash 1M 65.5K Input: $0.195
Output: $0.975
Cache Read: $0.039
Cache Write: $0.24375
Model: 0.098
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
Qwen3 14B qwen/qwen3-14b 41K 41K Input: $0.1
Output: $0.24
Model: 0.050
Completion: 2.400
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct 262.1K 16.4K Input: $0.2
Output: $0.88
Cache Read: $0.11
Model: 0.100
Completion: 4.400
Cache: 0.550
📎 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Open Weights
Released: 2025-09-23
Qwen3 30B A3B qwen/qwen3-30b-a3b 41K 16.4K Input: $0.12
Output: $0.5
Model: 0.060
Completion: 4.167
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3.6 Max Preview qwen/qwen3.6-max-preview 262.1K 65.5K Input: $1.04
Output: $6.24
Cache Write: $1.3
Model: 0.520
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-04-20
Qwen3.5 122B-A10B qwen/qwen3.5-122b-a10b 262.1K 262.1K Input: $0.26
Output: $2.08
Model: 0.130
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-23
Qwen3.6 Plus qwen/qwen3.6-plus 1M 65.5K Input: $0.325
Output: $1.95
Cache Write: $0.40625
Model: 0.163
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
Nova Lite 1.0 amazon/nova-lite-v1 300K 5.1K Input: $0.06
Output: $0.24
Model: 0.030
Completion: 4.000
📎 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2024-12-05
Nova Premier 1.0 amazon/nova-premier-v1 1M 32K Input: $2.5
Output: $12.5
Cache Read: $0.625
Model: 1.250
Completion: 5.000
Cache: 0.250
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-31
Nova Pro 1.0 amazon/nova-pro-v1 300K 5.1K Input: $0.8
Output: $3.2
Model: 0.400
Completion: 4.000
📎 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2024-12-05
Nova Micro 1.0 amazon/nova-micro-v1 128K 5.1K Input: $0.035
Output: $0.14
Model: 0.018
Completion: 4.000
🔧 🌡️ 2024-10-31 In: text
Out: text
Released: 2024-12-05
Nova 2 Lite amazon/nova-2-lite-v1 1M 65.5K Input: $0.3
Output: $2.5
Model: 0.150
Completion: 8.333
📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2025-12-02
Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b 32.8K 32.8K Input: $0.8
Output: $1.6
Model: 0.400
Completion: 2.000
🌡️ 2023-12-31 In: text
Out: text
Released: 2025-02-04
Aion-1.0-Mini aion-labs/aion-1.0-mini 131.1K 32.8K Input: $0.7
Output: $1.4
Model: 0.350
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-02-04
Aion-2.0 aion-labs/aion-2.0 131.1K 32.8K Input: $0.8
Output: $1.6
Cache Read: $0.2
Model: 0.400
Completion: 2.000
Cache: 0.250
🧠 🌡️ - In: text
Out: text
Released: 2026-02-23
Aion-1.0 aion-labs/aion-1.0 131.1K 32.8K Input: $4
Output: $8
Model: 2.000
Completion: 2.000
🧠 🌡️ - In: text
Out: text
Released: 2025-02-04
Inflection 3 Pi inflection/inflection-3-pi 8K 1K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ 2024-10-31 In: text
Out: text
Released: 2024-10-11
Inflection 3 Productivity inflection/inflection-3-productivity 8K 1K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🌡️ 2024-10-31 In: text
Out: text
Released: 2024-10-11
Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b 131.1K 16.4K Input: $0.85
Output: $0.85
Model: 0.425
Completion: 1.000
🔧 🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-08-28
Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b 131.1K 16.4K Input: $0.65
Output: $0.75
Model: 0.325
Completion: 1.154
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-12-18
Llama 3 8B Lunaris sao10k/l3-lunaris-8b 8.2K 16.4K Input: $0.04
Output: $0.05
Model: 0.020
Completion: 1.250
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-08-13
Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 16K 16K Input: $3
Output: $3
Model: 1.500
Completion: 1.000
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2025-01-08
Solar Pro 3 upstage/solar-pro-3 128K 128K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-01-27
Olmo 3 32B Think allenai/olmo-3-32b-think 65.5K 65.5K Input: $0.15
Output: $0.5
Model: 0.075
Completion: 3.333
🧠 🌡️ - In: text
Out: text
Open Weights
Released: 2025-11-21
Rnj 1 Instruct essentialai/rnj-1-instruct 32.8K 32.8K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-07
R1 0528 deepseek/deepseek-r1-0528 163.8K 32.8K Input: $0.5
Output: $2.15
Cache Read: $0.35
Model: 0.250
Completion: 4.300
Cache: 0.700
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 65.5K Input: $0.09
Output: $0.18
Cache Read: $0.02
Model: 0.045
Completion: 2.000
Cache: 0.222
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 163.8K 32.8K Input: $0.27
Output: $0.95
Cache Read: $0.13
Model: 0.135
Completion: 3.519
Cache: 0.481
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-09-22
R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 8.2K 8.2K Input: $0.8
Output: $0.8
Model: 0.400
Completion: 1.000
🧠 🌡️ 2024-07-31 In: text
Out: text
Open Weights
Released: 2025-01-23
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-R1 deepseek/deepseek-r1 64K 16K Input: $0.7
Output: $2.5
Model: 0.350
Completion: 3.571
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp 163.8K 65.5K Input: $0.27
Output: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ 2025-07-31 In: text
Out: text
Open Weights
Released: 2025-09-29
DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 163.8K 16.4K Input: $0.2
Output: $0.77
Cache Read: $0.135
Model: 0.100
Completion: 3.850
Cache: 0.675
🔧 🌡️ 2024-07-31 In: text
Out: text
Open Weights
Released: 2025-03-24
R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b 32.8K 32.8K Input: $0.29
Output: $0.29
Model: 0.145
Completion: 1.000
🧠 🌡️ 2024-07-31 In: text
Out: text
Open Weights
Released: 2025-01-29
DeepSeek Chat deepseek/deepseek-chat 128K 16K Input: $0.2002
Output: $0.8001
Model: 0.100
Completion: 3.997
🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28
DeepSeek V3.2 deepseek/deepseek-v3.2 128K 64K Input: $0.2288
Output: $0.3432
Model: 0.114
Completion: 1.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
DeepSeek V3.1 deepseek/deepseek-chat-v3.1 163.8K 32.8K Input: $0.21
Output: $0.79
Cache Read: $0.13
Model: 0.105
Completion: 3.762
Cache: 0.619
🧠 🔧 🌡️ 2025-03-31 In: text
Out: text
Open Weights
Released: 2025-08-21
MiniMax-M2.5 minimax/minimax-m2.5 196.6K 196.6K Input: $0.15
Output: $0.9
Cache Read: $0.05
Model: 0.075
Completion: 6.000
Cache: 0.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M2.1 minimax/minimax-m2.1 196.6K 196.6K Input: $0.29
Output: $0.95
Cache Read: $0.03
Model: 0.145
Completion: 3.276
Cache: 0.103
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-01 minimax/minimax-01 1M 1M Input: $0.2
Output: $1.1
Model: 0.100
Completion: 5.500
📎 🌡️ 2024-03-31 In: text, image
Out: text
Open Weights
Released: 2025-01-15
MiniMax-M3 minimax/minimax-m3 524.3K 512K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax-M2 minimax/minimax-m2 196.6K 196.6K Input: $0.255
Output: $1
Cache Read: $0.03
Model: 0.128
Completion: 3.922
Cache: 0.118
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax M2-her minimax/minimax-m2-her 65.5K 2K Input: $0.3
Output: $1.2
Cache Read: $0.03
Model: 0.150
Completion: 4.000
Cache: 0.100
🌡️ - In: text
Out: text
Released: 2026-01-23
MiniMax-M2.7 minimax/minimax-m2.7 196.6K 131.1K Input: $0.25
Output: $1
Cache Read: $0.05
Model: 0.125
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax M1 minimax/minimax-m1 1M 40K Input: $0.4
Output: $2.2
Model: 0.200
Completion: 5.500
🧠 🔧 🌡️ 2024-06-30 In: text
Out: text
Released: 2025-06-17
KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 256K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
Hermes 3 405B Instruct (free) nousresearch/hermes-3-llama-3.1-405b:free 131.1K 131.1K Input: $0
Output: $0
- 🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-08-16
Hermes 4 405B nousresearch/hermes-4-405b 131.1K 131.1K Input: $1
Output: $3
Model: 0.500
Completion: 3.000
🧠 🌡️ 2024-08-31 In: text
Out: text
Open Weights
Released: 2025-08-26
Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b 131.1K 16.4K Input: $0.7
Output: $0.7
Model: 0.350
Completion: 1.000
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-08-18
Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b 131.1K 16.4K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ 2023-12-31 In: text
Out: text
Open Weights
Released: 2024-08-16
Hermes 4 70B nousresearch/hermes-4-70b 131.1K 131.1K Input: $0.13
Output: $0.4
Model: 0.065
Completion: 3.077
🧠 🌡️ 2024-08-31 In: text
Out: text
Open Weights
Released: 2025-08-26
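
Note on NewAPI Ratios: in the rows above, the three ratio values appear to be derived from the per-1M prices. Model looks like the input price divided by $2, Completion like output price divided by input price, and Cache like cache-read price divided by input price. The sketch below reproduces that arithmetic. The function name `newapi_ratios` and the constant `USD_PER_MILLION_AT_RATIO_ONE` are hypothetical, and the $2 base is an assumption inferred from the listed numbers, not something this page states. Rows whose input price is $0 show "-" for ratios, which the sketch mirrors by returning `None`. Values that fall exactly on a rounding boundary may differ from the page by 0.001, and some rows may use a different base price.

```python
from typing import Dict, Optional

# Assumption: a model ratio of 1.0 corresponds to $2 per 1M input tokens.
USD_PER_MILLION_AT_RATIO_ONE = 2.0


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> Optional[Dict[str, float]]:
    """Derive ratio values from per-1M-token prices (illustrative only)."""
    if input_usd <= 0:
        # Free or unpriced rows list "-" instead of ratios.
        return None
    ratios = {
        "model": round(input_usd / USD_PER_MILLION_AT_RATIO_ONE, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


if __name__ == "__main__":
    # anthropic/claude-opus-4.5: Input $5, Output $25, Cache Read $0.5
    print(newapi_ratios(5, 25, 0.5))
    # nousresearch/hermes-4-405b: Input $1, Output $3, no cache price
    print(newapi_ratios(1, 3))
```

With the Claude Opus 4.5 prices this yields Model 2.5, Completion 5.0, and Cache 0.1, matching the listed row.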

OrcaRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $2.5
Output: $15
Cache Read: $0.125
Model: 1.250
Completion: 6.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.500
Completion: 2.500
Cache: 0.030
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemma 4 31B IT google/gemma-4-31b-it 262.1K 32.8K Input: $0.13
Output: $0.38
Model: 0.065
Completion: 2.923
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools 1M 65.5K Input: $4
Output: $18
Cache Read: $0.2
Model: 2.000
Completion: 4.500
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini Flash-Lite Latest google/gemini-flash-lite-latest 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.150
Completion: 1.333
Cache: 0.033
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $4
Output: $18
Cache Read: $0.2
Model: 2.000
Completion: 4.500
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it 262.1K 32.8K Input: $0.06
Output: $0.33
Model: 0.030
Completion: 5.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-02
Gemini 3 Pro Preview google/gemini-3-pro-preview 1M 65.5K Input: $4
Output: $18
Cache Read: $0.2
Model: 2.000
Completion: 4.500
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-11-18
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Gemini Flash Latest google/gemini-flash-latest 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.075
Input Audio: $1
Model: 0.500
Completion: 3.000
Cache: 0.075
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Input Audio: $0.5
Model: 0.250
Completion: 3.000
Cache: 0.050
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
GLM-4.7 z-ai/glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-4.5 z-ai/glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5.1 z-ai/glm-5.1 200K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Cache Write: $0
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
GLM-4.6 z-ai/glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.5-Air z-ai/glm-4.5-air 131.1K 98.3K Input: $0.2
Output: $1.1
Cache Read: $0.03
Cache Write: $0
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5 z-ai/glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GPT-5.2 Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-3.5-turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Cache Read: $0
Model: 0.250
Completion: 3.000
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
GPT-5 Pro openai/gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4 openai/gpt-4 8.2K 8.2K Input: $30
Output: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 128K 4.1K Input: $5
Output: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-5.4 nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-5 Chat (latest) openai/gpt-5-chat-latest 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.1 Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Codex Max openai/gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.3 Chat (latest) openai/gpt-5.3-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-03
GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-08-06
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-05
GPT-5.1 Codex mini openai/gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Chat openai/gpt-5.1-chat-latest 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 Chat openai/gpt-5.2-chat-latest 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-11-20
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $5
Output: $22.5
Cache Read: $0.25
Model: 2.500
Completion: 4.500
Cache: 0.050
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-17
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $60
Output: $270
Model: 30.000
Completion: 4.500
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GPT-5.5 Pro openai/gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
GPT-5-Codex openai/gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5.2 Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Kimi K2.5 kimi/kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
Kimi K2.6 kimi/kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
OrcaRouter Auto orcarouter/auto 128K 16.4K Input: $0
Output: $0
- 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-01
Updated: 2026-05-14
Claude Sonnet 4.5 (latest) anthropic/claude-sonnet-4.5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 4 (latest) anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Haiku 4.5 (latest) anthropic/claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.7 anthropic/claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.1 (latest) anthropic/claude-opus-4.1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Opus 4.5 (latest) anthropic/claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Opus 4 (latest) anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.6 anthropic/claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Grok 4.3 grok/grok-4.3 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Qwen3.6 35B-A3B qwen/qwen3.6-35b-a3b 262.1K 65.5K Input: $0.248
Output: $1.485
Model: 0.124
Completion: 5.988
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-04-17
Qwen3 Max qwen/qwen3-max 262.1K 65.5K Input: $0.359
Output: $1.434
Model: 0.179
Completion: 3.994
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3.5 Plus qwen/qwen3.5-plus 1M 65.5K Input: $0.115
Output: $0.688
Reasoning: $2.4
Model: 0.058
Completion: 5.983
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-02-16
Qwen3.5 27B qwen/qwen3.5-27b 262.1K 65.5K Input: $0.086
Output: $0.688
Model: 0.043
Completion: 8.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen3.5 35B-A3B qwen/qwen3.5-35b-a3b 262.1K 65.5K Input: $0.057
Output: $0.459
Model: 0.029
Completion: 8.053
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen3.5 397B-A17B qwen/qwen3.5-397b-a17b 262.1K 65.5K Input: $0.172
Output: $1.032
Model: 0.086
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-15
Qwen3.5 122B-A10B qwen/qwen3.5-122b-a10b 262.1K 65.5K Input: $0.115
Output: $0.917
Model: 0.058
Completion: 7.974
📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Open Weights
Released: 2026-02-23
Qwen3.6 Plus qwen/qwen3.6-plus 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-02
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.19
Output: $0.37
Cache Read: $0.0028
Model: 0.095
Completion: 1.947
Cache: 0.015
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.56
Output: $1.12
Cache Read: $0.003625
Model: 0.280
Completion: 2.000
Cache: 0.006
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek Reasoner deepseek/deepseek-reasoner 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.028
Model: 0.217
Completion: 2.000
Cache: 0.064
📎 🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28
DeepSeek Chat deepseek/deepseek-chat 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
📎 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-12-01
Updated: 2026-02-28
MiniMax-M2.7-highspeed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.5 minimax/minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax-M2.5-highspeed minimax/minimax-m2.5-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
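
Note on the NewAPI Ratios column: judging only from the numbers listed in the rows above, the Model ratio appears to equal the input price divided by 2, the Completion ratio the output price divided by the input price, and the Cache ratio the cache-read price divided by the input price. The sketch below encodes that inferred relationship in Python. It is not taken from NewAPI documentation, and the function name `newapi_ratios` and the divisor of 2 are assumptions drawn from the table itself.

```python
from typing import Optional


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive ratios from USD-per-1M-token prices (relationship inferred from the table)."""
    if input_price <= 0:
        # Free or unpriced rows show "-" instead of ratios, so return nothing.
        return {}
    ratios = {
        # Assumption: a Model ratio of 1.0 corresponds to $2 per 1M input tokens.
        "model": round(input_price / 2, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# MiniMax-M2.5-highspeed row: Input $0.6, Output $2.4, Cache Read $0.06
print(newapi_ratios(0.6, 2.4, 0.06))
```

For that row the sketch reproduces the listed ratios (Model 0.300, Completion 4.000, Cache 0.100). Rows whose prices fall exactly on a rounding boundary may differ in the third decimal place.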

OVHcloud AI Endpoints

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama-3.1-8B-Instruct llama-3.1-8b-instruct 131.1K 131.1K Input: $0.11
Output: $0.11
Model: 0.055
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-11
Qwen3-Coder-30B-A3B-Instruct qwen3-coder-30b-a3b-instruct 262.1K 262.1K Input: $0.07
Output: $0.26
Model: 0.035
Completion: 3.714
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-28
Qwen3-32B qwen3-32b 32.8K 32.8K Input: $0.09
Output: $0.25
Model: 0.045
Completion: 2.778
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-16
Qwen3Guard-Gen-8B qwen3guard-gen-8b 32.8K 16.4K - - 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-22
Qwen3Guard-Gen-0.6B qwen3guard-gen-0.6b 32.8K 16.4K - - 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-22
Meta-Llama-3_3-70B-Instruct meta-llama-3_3-70b-instruct 131.1K 131.1K Input: $0.74
Output: $0.74
Model: 0.370
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-01
Mistral-Small-3.2-24B-Instruct-2506 mistral-small-3.2-24b-instruct-2506 131.1K 131.1K Input: $0.1
Output: $0.31
Model: 0.050
Completion: 3.100
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-07-16
Qwen2.5-VL-72B-Instruct qwen2.5-vl-72b-instruct 32.8K 32.8K Input: $1.01
Output: $1.01
Model: 0.505
Completion: 1.000
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-31
gpt-oss-120b gpt-oss-120b 131.1K 131.1K Input: $0.09
Output: $0.47
Model: 0.045
Completion: 5.222
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-28
Mistral-7B-Instruct-v0.3 mistral-7b-instruct-v0.3 65.5K 65.5K Input: $0.11
Output: $0.11
Model: 0.055
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-01
Mistral-Nemo-Instruct-2407 mistral-nemo-instruct-2407 65.5K 65.5K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-20
Qwen3.6-27B qwen3.6-27b 262.1K 262.1K Input: $0.47
Output: $3.19
Model: 0.235
Completion: 6.787
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-06-01
Qwen3.5-9B qwen3.5-9b 262.1K 262.1K Input: $0.12
Output: $0.18
Model: 0.060
Completion: 1.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-22
Qwen3.5-397B-A17B qwen3.5-397b-a17b 262.1K 262.1K Input: $0.71
Output: $4.25
Model: 0.355
Completion: 5.986
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-05-18
gpt-oss-20b gpt-oss-20b 131.1K 131.1K Input: $0.05
Output: $0.18
Model: 0.025
Completion: 3.600
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-08-28

Perplexity

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Sonar Reasoning Pro sonar-reasoning-pro 128K 4.1K Input: $2
Output: $8
Model: 1.000
Completion: 4.000
📎 🧠 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar sonar 128K 4.1K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar Pro sonar-pro 200K 8.2K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Perplexity Sonar Deep Research sonar-deep-research 128K 32.8K Input: $2
Output: $8
Reasoning: $3
Model: 1.000
Completion: 4.000
🧠 2025-01 In: text
Out: text
Released: 2025-02-01
Updated: 2025-09-01
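
Note on the Pricing (1M) column: prices are quoted in USD per one million tokens, so a per-request estimate is a straightforward scaling. The Python sketch below uses the Sonar Reasoning Pro prices listed above. The function name `estimate_cost_usd` and the token counts are illustrative assumptions, and any charges not shown in this table are ignored.

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      input_price_per_m: float, output_price_per_m: float) -> float:
    """Scale per-1M-token prices down to the tokens actually used."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Sonar Reasoning Pro: Input $2, Output $8 per 1M tokens (from the table above).
# The token counts are made up for illustration.
cost = estimate_cost_usd(10_000, 2_000, 2.0, 8.0)
print(f"${cost:.4f}")
```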

Perplexity Agent

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4.1 Fast (Non-Reasoning) xai/grok-4-1-fast-non-reasoning 2M 30K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-11-19
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Nemotron 3 Super 120B nvidia/nemotron-3-super-120b-a12b 1M 32K Input: $0.25
Output: $2.5
Model: 0.125
Completion: 10.000
🧠 🔧 🌡️ 2026-02 In: text
Out: text
Open Weights
Released: 2026-03-11
Claude Opus 4.5 anthropic/claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Sonnet 4.5 anthropic/claude-sonnet-4-5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 anthropic/claude-opus-4-7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Haiku 4.5 anthropic/claude-haiku-4-5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 anthropic/claude-opus-4-6 200K 128K Input: $5
Output: $25
Cache Read: $0.5
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Sonar perplexity/sonar 128K 8.2K Input: $0.25
Output: $2.5
Cache Read: $0.0625
Model: 0.125
Completion: 10.000
Cache: 0.250
🔧 🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01

Poe

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Tako trytako/tako 2K - - - 📎 🔧 - In: text
Out: text
Released: 2024-08-15
Grok Code Fast 1 xai/grok-code-fast-1 256K 128K Input: $0.2
Output: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
📎 🧠 🔧 - In: text
Out: text
Released: 2025-08-22
Grok-4.1-Fast-Reasoning xai/grok-4.1-fast-reasoning 2M 30K - - 📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-19
Grok 3 Mini xai/grok-3-mini 131.1K 8.2K Input: $0.3
Output: $0.5
Cache Read: $0.075
Model: 0.150
Completion: 1.667
Cache: 0.250
📎 🧠 🔧 - In: text
Out: text
Released: 2025-04-11
Grok-4.1-Fast-Non-Reasoning xai/grok-4.1-fast-non-reasoning 2M 30K - - 📎 🔧 - In: text, image
Out: text
Released: 2025-11-19
Grok 3 xai/grok-3 131.1K 8.2K Input: $3
Output: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
📎 🔧 - In: text
Out: text
Released: 2025-04-11
Grok-4-Fast-Reasoning xai/grok-4-fast-reasoning 2M 128K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-09-16
Grok-4 xai/grok-4 256K 128K Input: $3
Output: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-07-10
Grok-4-Fast-Non-Reasoning xai/grok-4-fast-non-reasoning 2M 128K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 - In: text, image
Out: text
Released: 2025-09-16
Grok-4.20-Multi-Agent xai/grok-4.20-multi-agent 128K - Input: $2
Output: $6
Cache Read: $0.2
Model: 1.000
Completion: 3.000
Cache: 0.100
📎 🔧 - In: text, image
Out: text
Released: 2026-03-13
TopazLabs topazlabs-co/topazlabs 204 - - - 📎 🔧 - In: text
Out: image
Released: 2024-12-03
Kimi-K2.5-FW fireworks-ai/kimi-k2.5-fw 262.1K 16.4K Input: $0
Output: $0
- 📎 🔧 - In: text, image
Out: text
Released: 2026-01-27
Veo-3.1-Fast google/veo-3.1-fast 480 - - - 📎 🔧 - In: text, image
Out: video
Released: 2025-10-15
Imagen-3 google/imagen-3 480 - - - 📎 🔧 - In: text
Out: image
Released: 2024-10-15
Nano-Banana-Pro google/nano-banana-pro 65.5K - Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🔧 - In: text, image
Out: image
Released: 2025-11-19
Lyria google/lyria - - - - 📎 🔧 - In: text
Out: audio
Released: 2025-06-04
Gemini-3.1-Flash-Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Model: 0.125
Completion: 6.000
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2026-02-18
Nano-Banana google/nano-banana 65.5K - Input: $0.21
Output: $1.8
Cache Read: $0.021
Model: 0.105
Completion: 8.571
Cache: 0.100
📎 🔧 - In: text, image
Out: text, image
Released: 2025-08-21
gemini-deep-research google/gemini-deep-research 1M - Input: $1.6
Output: $9.6
Model: 0.800
Completion: 6.000
📎 🧠 🔧 - In: text, image, video
Out: text
Released: 2025-12-11
Veo-3 google/veo-3 480 - - - 📎 🔧 - In: text
Out: video
Released: 2025-05-21
Gemini-3-Flash google/gemini-3-flash 1M 65.5K Input: $0.4
Output: $2.4
Cache Read: $0.04
Model: 0.200
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2025-10-07
Imagen-3-Fast google/imagen-3-fast 480 - - - 📎 🔧 - In: text
Out: image
Released: 2024-10-17
Gemini-2.5-Pro google/gemini-2.5-pro 1.1M 65.5K Input: $0.87
Output: $7
Cache Read: $0.087
Model: 0.435
Completion: 8.046
Cache: 0.100
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2025-02-05
Veo-3.1 google/veo-3.1 480 - - - 📎 🔧 - In: text
Out: video
Released: 2025-10-15
Gemini-2.5-Flash google/gemini-2.5-flash 1.1M 65.5K Input: $0.21
Output: $1.8
Cache Read: $0.021
Model: 0.105
Completion: 8.571
Cache: 0.100
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2025-04-26
Imagen-4-Fast google/imagen-4-fast 480 - - - 📎 🔧 - In: text
Out: image
Released: 2025-06-25
Gemini-3.5-Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5152
Output: $9.0909
Cache Read: $0.1515
Model: 0.758
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-01 In: text, image, audio
Out: text
Released: 2026-05-19
Gemma-4-31B google/gemma-4-31b 262.1K 8.2K Input: $0
Output: $0
- 📎 🔧 - In: text, image
Out: text
Released: 2026-04-02
Veo-3-Fast google/veo-3-fast 480 - - - 📎 🔧 - In: text
Out: video
Released: 2025-10-13
Gemini-3-Pro google/gemini-3-pro 1M 65.5K Input: $1.6
Output: $9.6
Cache Read: $0.16
Model: 0.800
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2025-10-22
Gemini-2.0-Flash google/gemini-2.0-flash 990K 8.2K Input: $0.1
Output: $0.42
Model: 0.050
Completion: 4.200
📎 🔧 - In: text, image, video, audio
Out: text
Released: 2024-12-11
Gemini-2.5-Flash-Lite google/gemini-2.5-flash-lite 1M 64K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2025-06-19
Imagen-4 google/imagen-4 480 - - - 📎 🔧 - In: text
Out: image
Released: 2025-05-22
Imagen-4-Ultra google/imagen-4-ultra 480 - - - 📎 🔧 - In: text
Out: image
Released: 2025-05-24
Gemini-3.1-Pro google/gemini-3.1-pro 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, video, audio
Out: text
Released: 2026-02-19
Veo-2 google/veo-2 480 - - - 📎 🔧 - In: text
Out: video
Released: 2024-12-02
Gemini-2.0-Flash-Lite google/gemini-2.0-flash-lite 990K 8.2K Input: $0.052
Output: $0.21
Model: 0.026
Completion: 4.038
📎 🔧 - In: text, image, video, audio
Out: text
Released: 2025-02-05
ElevenLabs-Music elevenlabs/elevenlabs-music 2K - - - 📎 🔧 - In: text
Out: audio
Released: 2025-08-29
ElevenLabs-v2.5-Turbo elevenlabs/elevenlabs-v2.5-turbo 128K - - - 📎 🔧 - In: text
Out: audio
Released: 2024-10-28
ElevenLabs-v3 elevenlabs/elevenlabs-v3 128K - - - 📎 🔧 - In: text
Out: audio
Released: 2025-06-05
ChatGPT-4o-Latest openai/chatgpt-4o-latest 128K 8.2K Input: $4.5
Output: $14
Model: 2.250
Completion: 3.111
📎 🔧 - In: text, image
Out: text
Released: 2024-08-14
GPT-3.5-Turbo-Instruct openai/gpt-3.5-turbo-instruct 3.5K 1K Input: $1.4
Output: $1.8
Model: 0.700
Completion: 1.286
📎 🔧 - In: text, image
Out: text
Released: 2023-09-20
o3 openai/o3 200K 100K Input: $1.8
Output: $7.2
Cache Read: $0.45
Model: 0.900
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-04-16
GPT-5.2-Instant openai/gpt-5.2-instant 128K 16.4K Input: $1.6
Output: $13
Cache Read: $0.16
Model: 0.800
Completion: 8.125
Cache: 0.100
📎 🔧 - In: text, image
Out: text
Released: 2025-12-11
GPT-4o-Search openai/gpt-4o-search 128K 8.2K Input: $2.2
Output: $9
Model: 1.100
Completion: 4.091
📎 🔧 - In: text
Out: text
Released: 2025-03-11
GPT-5.2-Pro openai/gpt-5.2-pro 400K 128K Input: $19
Output: $150
Model: 9.500
Completion: 7.895
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-12-11
GPT-4-Classic-0314 openai/gpt-4-classic-0314 8.2K 4.1K Input: $27
Output: $54
Model: 13.500
Completion: 2.000
📎 🔧 - In: text, image
Out: text
Released: 2024-08-26
GPT-5 openai/gpt-5 400K 128K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-08-05
GPT-5-Chat openai/gpt-5-chat 128K 16.4K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🔧 - In: text, image
Out: text
Released: 2025-08-07
GPT-3.5-Turbo openai/gpt-3.5-turbo 16.4K 2K Input: $0.45
Output: $1.4
Model: 0.225
Completion: 3.111
📎 🔧 - In: text, image
Out: text
Released: 2023-09-13
GPT-5-Pro openai/gpt-5-pro 400K 128K Input: $14
Output: $110
Model: 7.000
Completion: 7.857
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-10-06
GPT-4o openai/gpt-4o 128K 8.2K - - 📎 🔧 - In: text, image
Out: text
Released: 2024-05-13
o4-mini openai/o4-mini 200K 100K Input: $0.99
Output: $4
Cache Read: $0.25
Model: 0.495
Completion: 4.040
Cache: 0.253
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-04-16
Sora-2 openai/sora-2 - - - - 📎 🔧 - In: text, image
Out: video
Released: 2025-10-06
o3-pro openai/o3-pro 200K 100K Input: $18
Output: $72
Model: 9.000
Completion: 4.000
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-06-10
GPT-4-Classic openai/gpt-4-classic 8.2K 4.1K Input: $27
Output: $54
Model: 13.500
Completion: 2.000
📎 🔧 - In: text, image
Out: text
Released: 2024-03-25
GPT-4o-Aug openai/gpt-4o-aug 128K 8.2K Input: $2.2
Output: $9
Cache Read: $1.1
Model: 1.100
Completion: 4.091
Cache: 0.500
📎 🔧 - In: text, image
Out: text
Released: 2024-11-21
GPT-5.4-Nano openai/gpt-5.4-nano 400K 128K Input: $0.18
Output: $1.1
Cache Read: $0.018
Model: 0.090
Completion: 6.111
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-11
Sora-2-Pro openai/sora-2-pro - - - - 📎 🔧 - In: text, image
Out: video
Released: 2025-10-06
GPT-5.1-Codex openai/gpt-5.1-codex 400K 128K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-12
GPT-5.3-Instant openai/gpt-5.3-instant 128K 16.4K Input: $1.6
Output: $13
Cache Read: $0.16
Model: 0.800
Completion: 8.125
Cache: 0.100
📎 🔧 - In: text, image
Out: text
Released: 2026-03-03
GPT-5.3-Codex-Spark openai/gpt-5.3-codex-spark 128K 16.4K Input: $0
Output: $0
- 📎 🧠 🔧 - In: text
Out: text
Released: 2026-03-04
GPT-5.1-Codex-Max openai/gpt-5.1-codex-max 400K 128K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-12-08
DALL-E-3 openai/dall-e-3 800 - - - 📎 🔧 - In: text
Out: image
Released: 2023-11-06
o3-mini openai/o3-mini 200K 100K Input: $0.99
Output: $4
Model: 0.495
Completion: 4.040
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-01-31
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.6
Output: $13
Cache Read: $0.16
Model: 0.800
Completion: 8.125
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-12-08
GPT-5.3-Codex openai/gpt-5.3-codex 400K 128K Input: $1.6
Output: $13
Cache Read: $0.16
Model: 0.800
Completion: 8.125
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-02-10
GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini 400K 128K Input: $0.22
Output: $1.8
Cache Read: $0.022
Model: 0.110
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text
Out: text
Released: 2025-11-12
o4-mini-deep-research openai/o4-mini-deep-research 200K 100K Input: $1.8
Output: $7.2
Cache Read: $0.45
Model: 0.900
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 - In: text
Out: text
Released: 2025-06-27
gpt-image-1.5 openai/gpt-image-1.5 128K - - - 📎 - In: text, image
Out: image
Released: 2025-12-16
GPT-4.1-nano openai/gpt-4.1-nano 1M 32.8K Input: $0.09
Output: $0.36
Cache Read: $0.022
Model: 0.045
Completion: 4.000
Cache: 0.244
📎 🔧 - In: text, image
Out: text
Released: 2025-04-15
GPT-3.5-Turbo-Raw openai/gpt-3.5-turbo-raw 4.5K 2K Input: $0.45
Output: $1.4
Model: 0.225
Completion: 3.111
📎 🔧 - In: text, image
Out: text
Released: 2023-09-27
o1 openai/o1 200K 100K Input: $14
Output: $54
Model: 7.000
Completion: 3.857
📎 🧠 🔧 - In: text, image
Out: text
Released: 2024-12-18
o1-pro openai/o1-pro 200K 100K Input: $140
Output: $540
Model: 70.000
Completion: 3.857
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-03-19
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.2
Output: $14
Cache Read: $0.22
Model: 1.100
Completion: 6.364
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: image
Released: 2026-02-26
GPT-5.4-Mini openai/gpt-5.4-mini 400K 128K Input: $0.68
Output: $4
Cache Read: $0.068
Model: 0.340
Completion: 5.882
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-12
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $1.8
Output: $7.2
Cache Read: $0.45
Model: 0.900
Completion: 4.000
Cache: 0.250
📎 🔧 - In: text, image
Out: text
Released: 2025-04-14
o3-deep-research openai/o3-deep-research 200K 100K Input: $9
Output: $36
Cache Read: $2.2
Model: 4.500
Completion: 4.000
Cache: 0.244
📎 🧠 🔧 - In: text
Out: text
Released: 2025-06-27
GPT-5-mini openai/gpt-5-mini 400K 128K Input: $0.22
Output: $1.8
Cache Read: $0.022
Model: 0.110
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-06-25
GPT-Image-1 openai/gpt-image-1 128K - - - 📎 🔧 - In: text, image
Out: image
Released: 2025-03-31
GPT-4.1-mini openai/gpt-4.1-mini 1M 32.8K Input: $0.36
Output: $1.4
Cache Read: $0.09
Model: 0.180
Completion: 3.889
Cache: 0.250
📎 🔧 - In: text, image
Out: text
Released: 2025-04-15
GPT-4-Turbo openai/gpt-4-turbo 128K 4.1K Input: $9
Output: $27
Model: 4.500
Completion: 3.000
📎 🔧 - In: text, image
Out: text
Released: 2023-09-13
GPT-Image-1-Mini openai/gpt-image-1-mini - - - - 📎 🔧 - In: text, image
Out: image
Released: 2025-08-26
GPT-5-nano openai/gpt-5-nano 400K 128K Input: $0.045
Output: $0.36
Cache Read: $0.0045
Model: 0.022
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-08-05
GPT-5.4-Pro openai/gpt-5.4-pro 1.1M 128K Input: $27
Output: $160
Model: 13.500
Completion: 5.926
📎 🧠 🔧 - In: text, image
Out: image
Released: 2026-03-05
o3-mini-high openai/o3-mini-high 200K 100K Input: $0.99
Output: $4
Model: 0.495
Completion: 4.040
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-01-31
GPT-5.5-Pro openai/gpt-5.5-pro 400K 128K Input: $27.2727
Output: $163.6364
Model: 13.636
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image
Out: text, image
Released: 2026-04-08
GPT-4o-mini openai/gpt-4o-mini 124.1K 4.1K Input: $0.14
Output: $0.54
Cache Read: $0.068
Model: 0.070
Completion: 3.857
Cache: 0.486
📎 🔧 - In: text, image
Out: text
Released: 2024-07-18
GPT-4o-mini-Search openai/gpt-4o-mini-search 128K 8.2K Input: $0.14
Output: $0.54
Model: 0.070
Completion: 3.857
📎 🔧 - In: text
Out: text
Released: 2025-03-11
GPT-5-Codex openai/gpt-5-codex 400K 128K Input: $1.1
Output: $9
Model: 0.550
Completion: 8.182
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-09-23
GPT-5.2-Codex openai/gpt-5.2-codex 400K 128K Input: $1.6
Output: $13
Cache Read: $0.16
Model: 0.800
Completion: 8.125
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-01-14
GPT-5.1-Instant openai/gpt-5.1-instant 128K 16.4K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-12
GPT-Image-2 openai/gpt-image-2 - - Input: $5.0505
Output: $32.3232
Cache Read: $1.2626
Model: 2.525
Completion: 6.400
Cache: 0.250
📎 - In: text, image
Out: image
Released: 2026-04-21
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-11-12
GPT-5.5 openai/gpt-5.5 400K 128K Input: $4.5455
Output: $27.2727
Cache Read: $0.4545
Model: 2.273
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image
Out: text, image
Released: 2026-04-08
Runway runwayml/runway 256 - - - 📎 🔧 - In: text, image
Out: video
Released: 2024-10-11
Runway-Gen-4-Turbo runwayml/runway-gen-4-turbo 256 - - - 📎 🔧 - In: text, image
Out: video
Released: 2025-05-09
Llama-3.1-8B-CS cerebras/llama-3.1-8b-cs 128K - Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
📎 🔧 - In: text
Out: text
Released: 2025-05-13
qwen3-235b-2507-cs cerebras/qwen3-235b-2507-cs - - - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-08-06
GPT-OSS-120B-CS cerebras/gpt-oss-120b-cs 128K - Input: $0.35
Output: $0.75
Model: 0.175
Completion: 2.143
📎 🧠 🔧 - In: text
Out: text
Released: 2025-08-06
llama-3.3-70b-cs cerebras/llama-3.3-70b-cs - - - - 📎 - In: text
Out: text
Released: 2025-05-13
qwen3-32b-cs cerebras/qwen3-32b-cs - - - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-05-15
Claude-Sonnet-3.5 anthropic/claude-sonnet-3.5 189.1K 8.2K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2024-06-05
Claude-Sonnet-3.7 anthropic/claude-sonnet-3.7 196.6K 128K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-02-19
Claude-Sonnet-3.5-June anthropic/claude-sonnet-3.5-june 189.1K 8.2K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2024-11-18
Claude-Sonnet-4.5 anthropic/claude-sonnet-4.5 983K 32.8K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-09-26
Claude-Sonnet-4 anthropic/claude-sonnet-4 983K 64K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-21
Claude-Haiku-4.5 anthropic/claude-haiku-4.5 192K 64K Input: $0.85
Output: $4.3
Cache Read: $0.085
Cache Write: $1.1
Model: 0.425
Completion: 5.059
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-10-15
Claude-Haiku-3 anthropic/claude-haiku-3 189.1K 8.2K Input: $0.21
Output: $1.1
Cache Read: $0.021
Cache Write: $0.26
Model: 0.105
Completion: 5.238
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2024-03-09
Claude-Opus-4.7 anthropic/claude-opus-4.7 1M 128K Input: $4.3
Output: $21
Cache Read: $0.43
Cache Write: $5.4
Model: 2.150
Completion: 4.884
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-04-15
Claude-Haiku-3.5 anthropic/claude-haiku-3.5 189.1K 8.2K Input: $0.68
Output: $3.4
Cache Read: $0.068
Cache Write: $0.85
Model: 0.340
Completion: 5.000
Cache: 0.100
📎 🔧 - In: text, image, pdf
Out: text
Released: 2024-10-01
Claude-Opus-4.8 anthropic/claude-opus-4.8 1M 128K Input: $4.2929
Output: $21.4646
Model: 2.146
Completion: 5.000
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude-Opus-4.1 anthropic/claude-opus-4.1 196.6K 32K Input: $13
Output: $64
Cache Read: $1.3
Cache Write: $16
Model: 6.500
Completion: 4.923
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-08-05
Claude-Opus-4.5 anthropic/claude-opus-4.5 196.6K 64K Input: $4.3
Output: $21
Cache Read: $0.43
Cache Write: $5.3
Model: 2.150
Completion: 4.884
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-11-21
Claude-Sonnet-4.6 anthropic/claude-sonnet-4.6 983K 128K Input: $2.6
Output: $13
Cache Read: $0.26
Cache Write: $3.2
Model: 1.300
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-05
Claude-Opus-4 anthropic/claude-opus-4 192.5K 28.7K Input: $13
Output: $64
Cache Read: $1.3
Cache Write: $16
Model: 6.500
Completion: 4.923
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2025-05-21
Claude-Opus-4.6 anthropic/claude-opus-4.6 983K 128K Input: $4.3
Output: $21
Cache Read: $0.43
Cache Write: $5.3
Model: 2.150
Completion: 4.884
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-02-04
Ray2 lumalabs/ray2 5K - - - 📎 🔧 - In: text, image
Out: video
Released: 2025-02-20
claude-code poetools/claude-code - - - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-11-27
DeepSeek-V4-Pro-EL empiriolabs/deepseek-v4-pro-el 1M 384K Input: $1.67
Output: $3.33
Model: 0.835
Completion: 1.994
📎 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-05-02
DeepSeek-V4-Flash-EL empiriolabs/deepseek-v4-flash-el 1M 384K Input: $0.14
Output: $0.28
Model: 0.070
Completion: 2.000
📎 🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-05-02
StableDiffusionXL stabilityai/stablediffusionxl 200 - - - 📎 🔧 - In: text, image
Out: image
Released: 2023-07-09
Ideogram ideogramai/ideogram 150 - - - 📎 🔧 - In: text, image
Out: image
Released: 2024-04-03
Ideogram-v2a ideogramai/ideogram-v2a 150 - - - 📎 🔧 - In: text
Out: image
Released: 2025-02-27
Ideogram-v2 ideogramai/ideogram-v2 150 - - - 📎 🔧 - In: text, image
Out: image
Released: 2024-08-21
Ideogram-v2a-Turbo ideogramai/ideogram-v2a-turbo 150 - - - 📎 🔧 - In: text
Out: image
Released: 2025-02-27
glm-4.7 novita/glm-4.7 205K 131.1K - - 📎 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-22
minimax-m2.1 novita/minimax-m2.1 205K 131.1K - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-12-26
GLM-4.6 novita/glm-4.6 - - - - 📎 🔧 - In: text
Out: text
Released: 2025-09-30
kimi-k2-thinking novita/kimi-k2-thinking 256K - - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-11-07
Kimi-K2.5 novita/kimi-k2.5 128K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-01-27
glm-4.6v novita/glm-4.6v 131K 32.8K - - 📎 🧠 🔧 - In: text, image
Out: text
Released: 2025-12-09
Kimi-K2.6 novita/kimi-k2.6 262.1K 262.1K Input: $0.96
Output: $4.04
Cache Read: $0.16
Model: 0.480
Completion: 4.208
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-04-20
Updated: 2026-05-02
glm-4.7-flash novita/glm-4.7-flash 200K 65.5K - - 📎 🧠 🔧 - In: text
Out: text
Released: 2026-01-19
glm-4.7-n novita/glm-4.7-n 205K 131.1K - - 📎 🧠 🔧 - In: text
Out: text
Released: 2025-12-22
GLM-5 novita/glm-5 205K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-15
DeepSeek-V3.2 novita/deepseek-v3.2 128K - Input: $0.27
Output: $0.4
Cache Read: $0.13
Model: 0.135
Completion: 1.481
Cache: 0.481
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01

Poolside

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Laguna XS.2 poolside/laguna-xs.2 131K 8.2K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-28
Laguna M.1 poolside/laguna-m.1 131K 8.2K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-28

Privatemode AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3-Embedding 4B qwen3-embedding-4b 32K 2.6K Input: $0
Output: $0
- 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-06-06
Gemma 3 27B gemma-3-27b 128K 8.2K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-03-12
gpt-oss-120b gpt-oss-120b 128K 128K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
Whisper large-v3 whisper-large-v3 - 4.1K Input: $0
Output: $0
- 📎 🌡️ 2023-09 In: audio
Out: text
Open Weights
Released: 2023-09-01
Qwen3-Coder 30B-A3B qwen3-coder-30b-a3b 128K 32.8K Input: $0
Output: $0
- 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04

QiHang

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K Input: $0.14
Output: $0.71
Model: 0.070
Completion: 5.071
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-10-01
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K Input: $0.09
Output: $0.71
Model: 0.045
Completion: 7.889
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
Claude Opus 4.5 claude-opus-4-5-20251101 200K 32K Input: $0.71
Output: $3.57
Model: 0.355
Completion: 5.028
📎 🧠 🔧 🌡️ 2025-03 In: text, image
Out: text
Released: 2025-11-01
GPT-5.2 gpt-5.2 400K 128K Input: $0.25
Output: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K 64K Input: $0.43
Output: $2.14
Model: 0.215
Completion: 4.977
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Gemini 3 Pro Preview gemini-3-pro-preview 1M 65K Input: $0.57
Output: $3.43
Model: 0.285
Completion: 6.018
📎 🧠 🔧 🌡️ 2025-11 In: text, image, audio, video
Out: text
Released: 2025-11-19
GPT-5-Mini gpt-5-mini 200K 64K Input: $0.04
Output: $0.29
Model: 0.020
Completion: 7.250
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
Gemini 3 Flash Preview gemini-3-flash-preview 1M 65.5K Input: $0.07
Output: $0.43
Model: 0.035
Completion: 6.143
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-17
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $0.14
Output: $1.14
Model: 0.070
Completion: 8.143
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11

Qiniu

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek-R1-0528 deepseek-r1-0528 128K 32K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Doubao 1.5 Thinking Pro doubao-1.5-thinking-pro 128K 16K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3-Vl 30b A3b Thinking qwen3-vl-30b-a3b-thinking 128K 32K - - 📎 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-09
Claude 3.5 Haiku claude-3.5-haiku 200K 8.2K - - 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-26
DeepSeek-V3-0324 deepseek-v3-0324 128K 16K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 235b A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 262.1K 64K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-12
DeepSeek-V3 deepseek-v3 128K 16K - - 🌡️ - In: text
Out: text
Released: 2025-08-13
Kimi K2 kimi-k2 128K 128K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 32B qwen3-32b 40K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 Max Preview qwen3-max-preview 256K 64K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-06
Claude 3.5 Sonnet claude-3.5-sonnet 200K 8.2K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-09-09
Qwen3 Next 80B A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-12
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K - - 📎 🧠 🔧 🌡️ - In: text, image, video, audio
Out: text
Released: 2025-08-05
Claude 4.5 Haiku claude-4.5-haiku 200K 64K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-16
Kling-V2 6 kling-v2-6 100M 100M - - 📎 🌡️ - In: text, image, video
Out: video
Released: 2026-01-13
GLM 4.5 glm-4.5 131.1K 98.3K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude 4.1 Opus claude-4.1-opus 200K 32K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-06
Gemini 2.5 Flash gemini-2.5-flash 1M 64K - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
Qwen3 Max qwen3-max 262.1K 65.5K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-24
Doubao Seed 2.0 Pro doubao-seed-2.0-pro 256K 128K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
Doubao-Seed 1.6 doubao-seed-1.6 256K 32K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2025-08-15
Doubao-Seed 1.6 Thinking doubao-seed-1.6-thinking 256K 32K - - 📎 🧠 🔧 🌡️ - In: image, text, video
Out: text
Released: 2025-08-15
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K - - 📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
Qwen2.5-Max-2025-01-25 qwen-max-2025-01-25 128K 4.1K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude 4.0 Sonnet claude-4.0-sonnet 200K 64K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-05
Doubao 1.5 Pro 32k doubao-1.5-pro-32k 128K 12K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 30b A3b Instruct 2507 qwen3-30b-a3b-instruct-2507 128K 32K - - 🔧 🌡️ - In: text
Out: text
Released: 2026-02-04
Qwen3 Next 80B A3B Thinking qwen3-next-80b-a3b-thinking 131.1K 32.8K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-12
Qwen3 235B A22B Thinking 2507 qwen3-235b-a22b-thinking-2507 262.1K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-12
DeepSeek-R1 deepseek-r1 128K 32K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Doubao 1.5 Vision Pro doubao-1.5-vision-pro 128K 16K - - 📎 🌡️ - In: text, image, video
Out: text
Released: 2025-08-05
Gemini 3.0 Pro Image Preview gemini-3.0-pro-image-preview 32.8K 8.2K - - 📎 🌡️ - In: text, image
Out: text, image
Released: 2025-11-20
Gemini 2.5 Flash Image gemini-2.5-flash-image 32.8K 8.2K - - 📎 🌡️ - In: text, image
Out: image
Released: 2025-10-22
Gemini 2.5 Flash Lite gemini-2.5-flash-lite 1M 64K - - 📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
Claude 3.7 Sonnet claude-3.7-sonnet 200K 128K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-05
Qwen3 30b A3b Thinking 2507 qwen3-30b-a3b-thinking-2507 126K 32K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-04
Qwen 2.5 VL 72B Instruct qwen2.5-vl-72b-instruct 128K 8.2K - - 📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
gpt-oss-120b gpt-oss-120b 128K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-06
Doubao-Seed 1.6 Flash doubao-seed-1.6-flash 256K 32K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2025-08-15
DeepSeek-V3.1 deepseek-v3.1 128K 32K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-19
Qwen 3 235B A22B qwen3-235b-a22b 128K 32K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct 262K 4.1K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-08-14
Qwen3.5 397B A17B qwen3.5-397b-a17b 256K 64K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-02-22
Mimo-V2-Flash mimo-v2-flash 256K 256K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
Qwen VL-MAX-2025-01-25 qwen-vl-max-2025-01-25 128K 4.1K - - 📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
Qwen 2.5 VL 7B Instruct qwen2.5-vl-7b-instruct 128K 8.2K - - 📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
GLM 4.5 Air glm-4.5-air 131K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude 4.5 Opus claude-4.5-opus 200K 200K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-11-25
Gemini 3.0 Pro Preview gemini-3.0-pro-preview 1M 64K - - 📎 🧠 🔧 🌡️ - In: text, image, video, pdf, audio
Out: text
Released: 2025-11-19
Doubao Seed 2.0 Mini doubao-seed-2.0-mini 256K 32K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
Claude 4.0 Opus claude-4.0-opus 200K 32K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-05
gpt-oss-20b gpt-oss-20b 128K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-06
Gemini 3.0 Flash Preview gemini-3.0-flash-preview 1M 64K - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video, pdf
Out: text
Released: 2025-12-18
MiniMax M1 MiniMax-M1 1M 80K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen-Turbo qwen-turbo 1M 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3 30B A3B qwen3-30b-a3b 40K 4.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Claude 4.5 Sonnet claude-4.5-sonnet 200K 64K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-09-30
Doubao Seed 2.0 Lite doubao-seed-2.0-lite 256K 32K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
Gemini 2.0 Flash Lite gemini-2.0-flash-lite 1M 8.2K - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-08-05
Doubao Seed 2.0 Code doubao-seed-2.0-code 256K 128K - - 📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-02-14
Kimi K2 Thinking moonshotai/kimi-k2-thinking 256K 100K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-11-07
Moonshotai/Kimi-K2.5 moonshotai/kimi-k2.5 256K 256K - - 📎 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-01-28
Kimi K2 0905 moonshotai/kimi-k2-0905 256K 100K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-08
Stepfun-Ai/Gelab Zero 4b Preview stepfun-ai/gelab-zero-4b-preview 8.2K 4.1K - - 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-23
x-AI/Grok-Code-Fast 1 x-ai/grok-code-fast-1 256K 10K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-02
X-Ai/Grok 4.1 Fast Reasoning x-ai/grok-4.1-fast-reasoning 20M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-12-19
X-Ai/Grok 4.1 Fast Non Reasoning x-ai/grok-4.1-fast-non-reasoning 2M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-12-19
X-Ai/Grok-4-Fast-Reasoning x-ai/grok-4-fast-reasoning 2M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-12-18
x-AI/Grok-4-Fast x-ai/grok-4-fast 2M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-09-20
X-Ai/Grok-4-Fast-Non-Reasoning x-ai/grok-4-fast-non-reasoning 2M 2M - - 📎 🧠 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-12-18
x-AI/Grok-4.1-Fast x-ai/grok-4.1-fast 2M 2M - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-11-20
Z-Ai/Autoglm Phone 9b z-ai/autoglm-phone-9b 12.8K 4.1K - - 📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-23
Z-Ai/GLM 4.7 z-ai/glm-4.7 200K 200K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-23
Z-AI/GLM 4.6 z-ai/glm-4.6 200K 200K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-10-11
Z-Ai/GLM 5 z-ai/glm-5 200K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
OpenAI/GPT-5 openai/gpt-5 400K 128K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-19
OpenAI/GPT-5.2 openai/gpt-5.2 400K 128K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-11
Xiaomi/Mimo-V2-Flash xiaomi/mimo-v2-flash 256K 256K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
Stepfun/Step-3.5 Flash stepfun/step-3.5-flash 64K 4.1K - - 📎 🌡️ - In: text, image
Out: text
Released: 2026-02-02
Meituan/Longcat-Flash-Lite meituan/longcat-flash-lite 256K 320K - - 🔧 🌡️ - In: text
Out: text
Released: 2026-02-06
Meituan/Longcat-Flash-Chat meituan/longcat-flash-chat 131.1K 131.1K - - 🌡️ - In: text
Out: text
Released: 2025-11-05
DeepSeek/DeepSeek-V3.1-Terminus deepseek/deepseek-v3.1-terminus 128K 32K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-22
DeepSeek/DeepSeek-V3.1-Terminus-Thinking deepseek/deepseek-v3.1-terminus-thinking 128K 32K - - 🧠 🌡️ - In: text
Out: text
Released: 2025-09-22
DeepSeek/DeepSeek-V3.2-Exp-Thinking deepseek/deepseek-v3.2-exp-thinking 128K 32K - - 🧠 🌡️ - In: text
Out: text
Released: 2025-09-29
DeepSeek/DeepSeek-V3.2-Exp deepseek/deepseek-v3.2-exp 128K 32K - - 🔧 🌡️ - In: text
Out: text
Released: 2025-09-29
Deepseek/DeepSeek-V3.2 deepseek/deepseek-v3.2-251201 128K 32K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-01
Deepseek/Deepseek-Math-V2 deepseek/deepseek-math-v2 160K 160K - - 🧠 🌡️ - In: text
Out: text
Released: 2025-12-04
Minimax/Minimax-M2.5 minimax/minimax-m2.5 204.8K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
Minimax/Minimax-M2.1 minimax/minimax-m2.1 204.8K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-23
Minimax/Minimax-M2 minimax/minimax-m2 200K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-10-28
Minimax/Minimax-M2.5 Highspeed minimax/minimax-m2.5-highspeed 204.8K 128K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-14

Regolo AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.1 8B Instruct llama-3.1-8b-instruct 120K 120K Input: $0.05
Output: $0.25
Model: 0.025
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-07
MiniMax 2.5 minimax-m2.5 190K 64K Input: $0.8
Output: $3.5
Model: 0.400
Completion: 4.375
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-10
Mistral Small 3.2 mistral-small3.2 120K 120K Input: $0.5
Output: $2.2
Model: 0.250
Completion: 4.400
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-31
Qwen3-Reranker-4B qwen3-reranker-4b 32.8K 8.2K Input: $0.12
Output: $0.12
Model: 0.060
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2026-02-01
Qwen3-Embedding-8B qwen3-embedding-8b 32.8K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2026-02-01
Llama 3.3 70B Instruct llama-3.3-70b-instruct 128K 16.4K Input: $0.6
Output: $2.7
Model: 0.300
Completion: 4.500
🔧 🌡️ - In: text
Out: text
Released: 2025-04-28
Qwen-Image qwen-image 8.2K 4.1K Input: $0.5
Output: $2
Model: 0.250
Completion: 4.000
🌡️ - In: text
Out: image
Released: 2026-03-01
Qwen3.5-122B qwen3.5-122b 262.1K 16.4K Input: $0.9
Output: $3.6
Model: 0.450
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-01
GPT-OSS-120B gpt-oss-120b 128K 16.4K Input: $1
Output: $4.2
Model: 0.500
Completion: 4.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Qwen3-Coder-Next qwen3-coder-next 262.1K 16.4K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-01
Qwen3.5-9B qwen3.5-9b 262.1K 8.2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-01
Mistral Small 4 119B mistral-small-4-119b 256K 16.4K Input: $0.75
Output: $3
Model: 0.375
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-15
GPT-OSS-20B gpt-oss-20b 128K 16.4K Input: $0.4
Output: $1.8
Model: 0.200
Completion: 4.500
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-01
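
Note on the Context and Output columns: token limits are shown in abbreviated form (for example 8.2K, 262.1K, 1M), with "-" where no value is listed. The sketch below converts such a label back to an approximate token count for sorting or filtering. It assumes K means 1,000 and M means 1,000,000, which the page does not state; `parse_token_count` is a hypothetical helper, not part of any provider API, and the result is only as precise as the rounded label.

```python
import re
from typing import Optional

# Assumption: the page abbreviates with decimal units (K = 1,000, M = 1,000,000).
_UNITS = {"": 1, "K": 1_000, "M": 1_000_000}


def parse_token_count(label: str) -> Optional[int]:
    """Convert a label such as '65.5K' or '1M' to an approximate token count.

    Returns None for '-', which the tables use when no limit is listed.
    """
    label = label.strip()
    if label == "-":
        return None
    match = re.fullmatch(r"(\d+(?:\.\d+)?)([KM]?)", label)
    if match is None:
        raise ValueError(f"unrecognized token count label: {label!r}")
    number, unit = match.groups()
    # round() absorbs floating-point noise from the multiplication.
    return round(float(number) * _UNITS[unit])


if __name__ == "__main__":
    for label in ("262.1K", "16.4K", "1M", "-"):
        print(label, "->", parse_token_count(label))
```

Because the labels are rounded to one decimal place, the parsed value is an approximation of the underlying limit and is best used for comparisons rather than as an exact request parameter.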

Requesty

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4 xai/grok-4 256K 64K Input: $3
Output: $15
Cache Read: $0.75
Cache Write: $3
Model: 1.500
Completion: 5.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-09-09
Grok 4 Fast xai/grok-4-fast 2M 64K Input: $0.2
Output: $0.5
Cache Read: $0.05
Cache Write: $0.2
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-09-19
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.31
Cache Write: $2.375
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.075
Cache Write: $0.55
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 3 Pro google/gemini-3-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $4.5
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-11-18
Gemini 3 Flash google/gemini-3-flash-preview 1M 65.5K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $1
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-12-17
GPT-5.2 Chat openai/gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.2 Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, audio, image, video
Out: text, audio, image
Released: 2025-08-07
GPT-5 Chat (latest) openai/gpt-5-chat 400K 128K Input: $1.25
Output: $10
Model: 0.625
Completion: 8.000
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Pro openai/gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-10-06
o4 Mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-16
GPT-5.1 Chat openai/gpt-5.1-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1-Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1-Codex-Max openai/gpt-5.1-codex-max 400K 128K Input: $1.1
Output: $9
Cache Read: $0.11
Model: 0.550
Completion: 8.182
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
GPT-5.3-Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-24
GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini 400K 100K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5 Image openai/gpt-5-image 400K 128K Input: $5
Output: $10
Cache Read: $1.25
Model: 2.500
Completion: 2.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image, pdf
Out: text, image
Released: 2025-10-14
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 128K 32K Input: $0.25
Output: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Nano openai/gpt-5-nano 16K 4K Input: $0.05
Output: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text
Out: text
Released: 2025-08-07
GPT-5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Cache Read: $30
Model: 15.000
Completion: 6.000
Cache: 1.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-4o Mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-07-18
GPT-5 Codex openai/gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-09-15
GPT-5.2-Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-01-14
GPT-5.1 openai/gpt-5.1 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Claude Sonnet 3.7 anthropic/claude-3-7-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-01 In: text, image, pdf
Out: text
Released: 2025-02-19
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.5 anthropic/claude-opus-4-5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-11-24
Claude Sonnet 4.5 anthropic/claude-sonnet-4-5 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.1 anthropic/claude-opus-4-1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Haiku 4.5 anthropic/claude-haiku-4-5 200K 62K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-01 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.6 anthropic/claude-opus-4-6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Claude Sonnet 4.6 anthropic/claude-sonnet-4-6 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-02-17
Claude Opus 4 anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
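
Note on the NewAPI Ratios column: the listed values appear to follow directly from the Pricing (1M) column. In the rows above, Model equals the input price divided by 2, Completion equals the output price divided by the input price, and Cache equals the cache-read price divided by the input price (for Grok 4: 3 / 2 = 1.500, 15 / 3 = 5.000, 0.75 / 3 = 0.250). The sketch below reproduces that arithmetic. The $2 per 1M tokens base is inferred from the tables rather than stated on this page, and `newapi_ratios` is a hypothetical helper name.

```python
from typing import Dict, Optional

# Assumption inferred from the tables: a Model ratio of 1.000
# corresponds to an input price of $2 per 1M tokens.
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Dict[str, Optional[float]]:
    """Derive Model / Completion / Cache ratios from per-1M-token prices."""
    if input_price <= 0:
        # Free or unpriced models show "-" in the ratio column.
        return {"model": None, "completion": None, "cache": None}
    ratios: Dict[str, Optional[float]] = {
        "model": round(input_price / BASE_PRICE_PER_1M, 3),
        "completion": round(output_price / input_price, 3),
        "cache": None,
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


if __name__ == "__main__":
    # Grok 4 row: Input $3, Output $15, Cache Read $0.75
    print(newapi_ratios(3, 15, 0.75))
```

Values that fall exactly on a rounding boundary may differ from the page in the last digit, since the page's rounding rule is not documented here.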

routing.run

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash route/deepseek-v4-flash 1M 131.1K Input: $0.4928
Output: $0.7392
Cache Read: $0.0028
Model: 0.246
Completion: 1.500
Cache: 0.006
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Flash 6bit route/deepseek-v4-flash-6bit 1M 131.1K Input: $0.4928
Output: $0.7392
Cache Read: $0.0028
Model: 0.246
Completion: 1.500
Cache: 0.006
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
MiniMax M2.7 Highspeed route/minimax-m2.7-highspeed 100K 131.1K Input: $0.33
Output: $1.32
Cache Read: $0.06
Cache Write: $0.375
Model: 0.165
Completion: 4.000
Cache: 0.182
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
GLM 5.1 6bit route/glm-5.1-6bit 202.8K 65.5K Input: $1
Output: $3
Cache Read: $0.26
Cache Write: $0
Model: 0.500
Completion: 3.000
Cache: 0.260
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
MiniMax M2.5 route/minimax-m2.5 100K 131.1K Input: $0.193
Output: $1.238
Cache Read: $0.03
Cache Write: $0.375
Model: 0.097
Completion: 6.415
Cache: 0.155
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Mistral Small 2503 route/mistral-small-2503 128K 32.8K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2025-06 In: text, image, video
Out: text
Open Weights
Released: 2026-03-16
Gemma 4 31B IT route/gemma-4-31b-it 131.1K 65.5K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
📎 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-02
Step 3.5 Flash 2603 route/step-3.5-flash-2603 262.1K 65.5K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-04-02
GLM 5.1 route/glm-5.1 202.8K 65.5K Input: $1
Output: $3
Cache Read: $0.26
Cache Write: $0
Model: 0.500
Completion: 3.000
Cache: 0.260
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V4 Pro route/deepseek-v4-pro 1M 131.1K Input: $0.4928
Output: $0.7392
Cache Read: $0.003625
Model: 0.246
Completion: 1.500
Cache: 0.007
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Mistral Large 3 route/mistral-large-3 128K 32.8K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
DeepSeek V4 Pro 6bit route/deepseek-v4-pro-6bit 1M 131.1K Input: $0.4928
Output: $0.7392
Cache Read: $0.003625
Model: 0.246
Completion: 1.500
Cache: 0.007
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Mistral Medium 2505 route/mistral-medium-2505 128K 32.8K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image, video
Out: text
Released: 2025-05-07
MiniMax M2.7 route/minimax-m2.7 100K 131.1K Input: $0.33
Output: $1.32
Cache Read: $0.06
Cache Write: $0.375
Model: 0.165
Completion: 4.000
Cache: 0.182
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Qwen3.6 27B 202K route/qwen3.6-27b-202k 202K 32.8K Input: $1.1
Output: $3.3
Model: 0.550
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-22
Kimi K2.5 route/kimi-k2.5 131.1K 32.8K Input: $0.462
Output: $2.42
Cache Read: $0.1
Model: 0.231
Completion: 5.238
Cache: 0.216
📎 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01
MiniMax M2.5 Highspeed route/minimax-m2.5-highspeed 100K 131.1K Input: $0.193
Output: $1.238
Cache Read: $0.06
Cache Write: $0.375
Model: 0.097
Completion: 6.415
Cache: 0.311
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-13
Kimi K2.6 6bit route/kimi-k2.6-6bit 262.1K 262.1K Input: $0.462
Output: $2.42
Cache Read: $0.16
Model: 0.231
Completion: 5.238
Cache: 0.346
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
MiMo V2.5 route/mimo-v2.5 1M 262.1K Input: $0.45
Output: $1.35
Cache Read: $0.2
Model: 0.225
Completion: 3.000
Cache: 0.444
📎 🧠 🔧 🌡️ 2024-12 In: text, image, video
Out: text
Open Weights
Released: 2026-04-22
Kimi K2.6 route/kimi-k2.6 262.1K 262.1K Input: $0.462
Output: $2.42
Cache Read: $0.16
Model: 0.231
Completion: 5.238
Cache: 0.346
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Qwen3.6 27B route/qwen3.6-27b 202K 32.8K Input: $1.1
Output: $3.3
Model: 0.550
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-22
MiMo V2.5 Pro 6bit route/mimo-v2.5-pro-6bit 1M 262.1K Input: $0.45
Output: $1.35
Cache Read: $0.2
Model: 0.225
Completion: 3.000
Cache: 0.444
📎 🧠 🔧 🌡️ 2024-12 In: text, image, video
Out: text
Open Weights
Released: 2026-04-22
StepFun 3.5 Flash route/stepfun-3.5-flash 262.1K 65.5K Input: $0.096
Output: $0.288
Cache Read: $0.019
Model: 0.048
Completion: 3.000
Cache: 0.198
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-02-13
MiMo V2.5 Pro route/mimo-v2.5-pro 1M 262.1K Input: $0.45
Output: $1.35
Cache Read: $0.2
Model: 0.225
Completion: 3.000
Cache: 0.444
📎 🧠 🔧 🌡️ 2024-12 In: text, image, video
Out: text
Open Weights
Released: 2026-04-22
Step 3.5 Flash route/step-3.5-flash 262.1K 65.5K Input: $0.096
Output: $0.288
Cache Read: $0.019
Model: 0.048
Completion: 3.000
Cache: 0.198
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-02-13
DeepSeek V3.2 route/deepseek-v3.2 163.8K 163.8K Input: $0.4928
Output: $0.7392
Model: 0.246
Completion: 1.500
📎 🧠 🔧 🌡️ 2024-07 In: text, image, video
Out: text
Open Weights
Released: 2025-12-01
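
Note on the NewAPI Ratios column: the page does not state how these values are derived, but the rows above are consistent with Model = input price / 2, Completion = output price / input price, and Cache = cache-read price / input price, each rounded to three decimals. The sketch below reproduces that inferred relationship. The function name `newapi_ratios` and the treatment of free or output-less models are assumptions made for illustration, not part of any documented API.

```python
from typing import Dict, Optional


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read: Optional[float] = None,
) -> Optional[Dict[str, float]]:
    """Derive ratio values from per-1M-token USD prices.

    Assumed relationship (inferred from the table rows, not documented):
      model      = input_price / 2
      completion = output_price / input_price
      cache      = cache_read / input_price
    """
    # Free models appear with "-" in the ratio column, so return None for them.
    if input_price <= 0:
        return None

    ratios = {"model": round(input_price / 2, 3)}

    # Rows with Output: $0 (e.g. embedding models) list only a Model ratio.
    if output_price > 0:
        ratios["completion"] = round(output_price / input_price, 3)

    # The Cache ratio is listed only when a cache-read price is given.
    if cache_read is not None:
        ratios["cache"] = round(cache_read / input_price, 3)

    return ratios


if __name__ == "__main__":
    # Mistral Large 3 row: Input $0.5, Output $1.5
    print(newapi_ratios(0.5, 1.5))
    # DeepSeek V4 Pro 6bit row: Input $0.4928, Output $0.7392, Cache Read $0.003625
    print(newapi_ratios(0.4928, 0.7392, cache_read=0.003625))
```

Checking a row against this function is a quick way to spot entries whose ratios were computed from a different price field than the text input price.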

SAP AI Core

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
anthropic--claude-4.6-sonnet anthropic--claude-4.6-sonnet 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
anthropic--claude-3-sonnet anthropic--claude-3-sonnet 200K 4.1K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-04
anthropic--claude-4-sonnet anthropic--claude-4-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
gemini-2.5-pro gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-25
Updated: 2025-06-05
gpt-5 gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
gemini-2.5-flash gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-04-17
Updated: 2025-06-05
anthropic--claude-4.5-haiku anthropic--claude-4.5-haiku 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
anthropic--claude-3-haiku anthropic--claude-3-haiku 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-13
anthropic--claude-4-opus anthropic--claude-4-opus 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
anthropic--claude-4.5-sonnet anthropic--claude-4.5-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
anthropic--claude-3.5-sonnet anthropic--claude-3.5-sonnet 200K 8.2K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image, pdf
Out: text
Released: 2024-10-22
anthropic--claude-4.6-opus anthropic--claude-4.6-opus 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
gemini-2.5-flash-lite gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Input Audio: $0.3
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
anthropic--claude-3.7-sonnet anthropic--claude-3.7-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image, pdf
Out: text
Released: 2025-02-24
sonar sonar 128K 4.1K Input: $1
Output: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01
sonar-pro sonar-pro 200K 8.2K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
gpt-5.4 gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
gpt-4.1 gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
anthropic--claude-3-opus anthropic--claude-3-opus 200K 4.1K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-02-29
gpt-5-mini gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
gpt-4.1-mini gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
sonar-deep-research sonar-deep-research 128K 32.8K Input: $2
Output: $8
Reasoning: $3
Model: 1.000
Completion: 4.000
🧠 2025-01 In: text
Out: text
Released: 2025-02-01
Updated: 2025-09-01
gpt-5-nano gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
anthropic--claude-4.5-opus anthropic--claude-4.5-opus 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05 In: text, image, pdf
Out: text
Released: 2025-11-24
anthropic--claude-4.7-opus anthropic--claude-4.7-opus 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
gpt-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23

Sarvam AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Sarvam-105B sarvam-105b 131.1K 131.1K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-18
Updated: 2026-03-06
Sarvam-30B sarvam-30b 65.5K 65.5K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-18
Updated: 2026-03-06

Scaleway

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 260K 16.4K Input: $0.75
Output: $2.25
Reasoning: $8.4
Model: 0.375
Completion: 3.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-01
Updated: 2026-03-17
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 128K 32.8K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Updated: 2026-03-17
Qwen3 Embedding 8B qwen3-embedding-8b 32.8K 4.1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Released: 2025-11-25
Updated: 2026-03-17
BGE Multilingual Gemma2 bge-multilingual-gemma2 8.2K 3.1K Input: $0.1
Output: $0
Model: 0.050 - - In: text
Out: text
Released: 2024-07-26
Updated: 2025-06-15
Qwen3.6 35B A3B qwen3.6-35b-a3b 128K 16.4K Input: $0.25
Output: $1.5
Model: 0.125
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2026-05-01
Updated: 2026-05-22
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 100K 16.4K Input: $0.9
Output: $0.9
Model: 0.450
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Updated: 2026-03-17
Pixtral 12B 2409 pixtral-12b-2409 128K 4.1K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Updated: 2026-03-17
Mistral Small 3.2 24B Instruct (2506) mistral-small-3.2-24b-instruct-2506 128K 32.8K Input: $0.15
Output: $0.35
Model: 0.075
Completion: 2.333
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Updated: 2026-03-17
GPT-OSS 120B gpt-oss-120b 128K 32.8K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-01
Updated: 2026-03-17
Gemma 4 26B A4B IT gemma-4-26b-a4b-it 256K 16.4K Input: $0.25
Output: $0.5
Model: 0.125
Completion: 2.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2026-04-01
Updated: 2026-05-22
Mistral Medium 3.5 128B mistral-medium-3.5-128b 256K 16.4K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-29
Qwen3.5 397B A17B qwen3.5-397b-a17b 256K 16.4K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-03-17
Whisper Large v3 whisper-large-v3 - 8.2K Input: $0.003
Output: $0
Model: 0.002 - 2023-09 In: audio
Out: text
Open Weights
Released: 2023-09-01
Updated: 2026-03-17
Gemma-3-27B-IT gemma-3-27b-it 40K 8.2K Input: $0.25
Output: $0.5
Model: 0.125
Completion: 2.000
📎 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-01
Updated: 2026-03-17
Voxtral Small 24B 2507 voxtral-small-24b-2507 32K 16.4K Input: $0.15
Output: $0.35
Model: 0.075
Completion: 2.333
📎 🔧 🌡️ - In: text, audio
Out: text
Open Weights
Released: 2025-07-01
Updated: 2026-03-17
Devstral 2 123B Instruct (2512) devstral-2-123b-instruct-2512 256K 16.4K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-12 In: text
Out: text
Open Weights
Released: 2026-01-07
Updated: 2026-03-17

SiliconFlow

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
THUDM/GLM-Z1-32B-0414 THUDM/GLM-Z1-32B-0414 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-4-32B-0414 THUDM/GLM-4-32B-0414 33K 33K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-4-9B-0414 THUDM/GLM-4-9B-0414 33K 33K Input: $0.086
Output: $0.086
Model: 0.043
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-Z1-9B-0414 THUDM/GLM-Z1-9B-0414 131K 131K Input: $0.086
Output: $0.086
Model: 0.043
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
moonshotai/Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking 262K 262K Input: $0.55
Output: $2.5
Model: 0.275
Completion: 4.545
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-11-07
Updated: 2025-11-25
moonshotai/Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 262K 262K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-08
Updated: 2025-11-25
moonshotai/Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 131K 131K Input: $0.58
Output: $2.29
Model: 0.290
Completion: 3.948
🔧 🌡️ - In: text
Out: text
Released: 2025-07-13
Updated: 2025-11-25
moonshotai/Kimi-K2.6 moonshotai/Kimi-K2.6 262K 262K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-21
moonshotai/Kimi-K2.5 moonshotai/Kimi-K2.5 262K 262K Input: $0.45
Output: $2.25
Model: 0.225
Completion: 5.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-27
baidu/ERNIE-4.5-300B-A47B baidu/ERNIE-4.5-300B-A47B 131K 131K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🔧 🌡️ - In: text
Out: text
Released: 2025-07-02
Updated: 2025-11-25
ByteDance-Seed/Seed-OSS-36B-Instruct ByteDance-Seed/Seed-OSS-36B-Instruct 262K 262K Input: $0.21
Output: $0.57
Model: 0.105
Completion: 2.714
🔧 🌡️ - In: text
Out: text
Released: 2025-09-04
Updated: 2025-11-25
stepfun-ai/Step-3.5-Flash stepfun-ai/Step-3.5-Flash 262K 262K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
Gemma 4 31B IT google/gemma-4-31B-it 262.1K 262.1K Input: $0.13
Output: $0.4
Model: 0.065
Completion: 3.077
🔧 🌡️ - In: text
Out: text
Released: 2026-04-02
Gemma 4 26B A4B IT google/gemma-4-26B-A4B-it 262.1K 262.1K Input: $0.12
Output: $0.4
Model: 0.060
Completion: 3.333
🔧 🌡️ - In: text
Out: text
Released: 2026-04-02
inclusionAI/Ling-flash-2.0 inclusionAI/Ling-flash-2.0 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Thinking Qwen/Qwen3-Omni-30B-A3B-Thinking 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct 131K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-28
Updated: 2025-11-25
Qwen/Qwen2.5-VL-32B-Instruct Qwen/Qwen2.5-VL-32B-Instruct 131K 131K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-03-24
Updated: 2025-11-25
Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262K 131K Input: $0.09
Output: $0.3
Model: 0.045
Completion: 3.333
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-31
Updated: 2025-11-25
Qwen/Qwen2.5-7B-Instruct Qwen/Qwen2.5-7B-Instruct 33K 4K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct 262K 262K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen2.5-32B-Instruct Qwen/Qwen2.5-32B-Instruct 33K 4K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-19
Updated: 2025-11-25
Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262K 262K Input: $0.14
Output: $1.4
Model: 0.070
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262K 262K Input: $0.13
Output: $0.6
Model: 0.065
Completion: 4.615
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Updated: 2025-11-25
Qwen/Qwen2.5-72B-Instruct-128K Qwen/Qwen2.5-72B-Instruct-128K 131K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262K 262K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-07-31
Updated: 2025-11-25
Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262K 262K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
Updated: 2025-11-25
Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262K 262K Input: $0.09
Output: $0.3
Model: 0.045
Completion: 3.333
🔧 🌡️ - In: text
Out: text
Released: 2025-07-30
Updated: 2025-11-25
Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking 262K 262K Input: $0.29
Output: $1
Model: 0.145
Completion: 3.448
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-11
Updated: 2025-11-25
Qwen/Qwen3-VL-8B-Thinking Qwen/Qwen3-VL-8B-Thinking 262K 262K Input: $0.18
Output: $2
Model: 0.090
Completion: 11.111
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-15
Updated: 2025-11-25
Qwen/Qwen3-VL-8B-Instruct Qwen/Qwen3-VL-8B-Instruct 262K 262K Input: $0.18
Output: $0.68
Model: 0.090
Completion: 3.778
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-15
Updated: 2025-11-25
Qwen/QwQ-32B Qwen/QwQ-32B 131K 131K Input: $0.15
Output: $0.58
Model: 0.075
Completion: 3.867
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-03-06
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Captioner Qwen/Qwen3-Omni-30B-A3B-Captioner 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ - In: audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking 262K 262K Input: $0.45
Output: $3.5
Model: 0.225
Completion: 7.778
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 262K 262K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-25
Updated: 2025-11-25
Qwen/Qwen2.5-Coder-32B-Instruct Qwen/Qwen2.5-Coder-32B-Instruct 33K 4K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-11-11
Updated: 2025-11-25
Qwen/Qwen2.5-VL-7B-Instruct Qwen/Qwen2.5-VL-7B-Instruct 33K 4K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-28
Updated: 2025-11-25
Qwen/Qwen3-8B Qwen/Qwen3-8B 131K 131K Input: $0.06
Output: $0.06
Model: 0.030
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262K 262K Input: $0.09
Output: $0.6
Model: 0.045
Completion: 6.667
🔧 🌡️ - In: text
Out: text
Released: 2025-07-23
Updated: 2025-11-25
Qwen/Qwen3-32B Qwen/Qwen3-32B 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen3-VL-32B-Instruct Qwen/Qwen3-VL-32B-Instruct 262K 262K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-21
Updated: 2025-11-25
Qwen/Qwen2.5-14B-Instruct Qwen/Qwen2.5-14B-Instruct 33K 4K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-14B Qwen/Qwen3-14B 131K 131K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen3-235B-A22B Qwen/Qwen3-235B-A22B 131K 131K Input: $0.35
Output: $1.42
Model: 0.175
Completion: 4.057
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen2.5-72B-Instruct Qwen/Qwen2.5-72B-Instruct 33K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Instruct Qwen/Qwen3-Omni-30B-A3B-Instruct 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ - In: text, image, audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct 262K 262K Input: $0.29
Output: $1
Model: 0.145
Completion: 3.448
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-05
Updated: 2025-11-25
Qwen/Qwen3-VL-32B-Thinking Qwen/Qwen3-VL-32B-Thinking 262K 262K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-21
Updated: 2025-11-25
openai/gpt-oss-120b openai/gpt-oss-120b 131K 8K Input: $0.05
Output: $0.45
Model: 0.025
Completion: 9.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-13
Updated: 2025-11-25
openai/gpt-oss-20b openai/gpt-oss-20b 131K 8K Input: $0.04
Output: $0.18
Model: 0.020
Completion: 4.500
🔧 🌡️ - In: text
Out: text
Released: 2025-08-13
Updated: 2025-11-25
tencent/Hunyuan-MT-7B tencent/Hunyuan-MT-7B 33K 33K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
tencent/Hunyuan-A13B-Instruct tencent/Hunyuan-A13B-Instruct 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-06-30
Updated: 2025-11-25
zai-org/GLM-5V-Turbo zai-org/GLM-5V-Turbo 200K 131.1K Input: $1.2
Output: $4
Cache Write: $0
Model: 0.600
Completion: 3.333
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-01
zai-org/GLM-4.6 zai-org/GLM-4.6 205K 205K Input: $0.5
Output: $1.9
Model: 0.250
Completion: 3.800
🔧 🌡️ - In: text
Out: text
Released: 2025-10-04
Updated: 2025-11-25
zai-org/GLM-4.6V zai-org/GLM-4.6V 131K 131K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-07
zai-org/GLM-5 zai-org/GLM-5 205K 205K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
zai-org/GLM-4.7 zai-org/GLM-4.7 205K 205K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-22
zai-org/GLM-4.5V zai-org/GLM-4.5V 66K 66K Input: $0.14
Output: $0.86
Model: 0.070
Completion: 6.143
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-13
Updated: 2025-11-25
zai-org/GLM-4.5-Air zai-org/GLM-4.5-Air 131K 131K Input: $0.14
Output: $0.86
Model: 0.070
Completion: 6.143
🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Updated: 2025-11-25
zai-org/GLM-5.1 zai-org/GLM-5.1 205K 205K Input: $1.4
Output: $4.4
Cache Write: $0
Model: 0.700
Completion: 3.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-08
zai-org/GLM-4.5 zai-org/GLM-4.5 131K 131K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Updated: 2025-11-25
DeepSeek V4 Flash deepseek-ai/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.028
Model: 0.070
Completion: 2.000
Cache: 0.200
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
deepseek-ai/DeepSeek-R1 deepseek-ai/DeepSeek-R1 164K 164K Input: $0.5
Output: $2.18
Model: 0.250
Completion: 4.360
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
Updated: 2025-11-25
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 131K 131K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-20
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3.1-Terminus deepseek-ai/DeepSeek-V3.1-Terminus 164K 164K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-29
Updated: 2025-11-25
DeepSeek V4 Pro deepseek-ai/deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Cache Read: $0.145
Model: 0.870
Completion: 2.000
Cache: 0.083
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
deepseek-ai/DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 164K 164K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-25
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.2-Exp 164K 164K Input: $0.27
Output: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-10-10
Updated: 2025-11-25
deepseek-ai/deepseek-vl2 deepseek-ai/deepseek-vl2 4K 4K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-13
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 164K 164K Input: $0.27
Output: $0.42
Model: 0.135
Completion: 1.556
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-03
deepseek-ai/DeepSeek-V3 deepseek-ai/DeepSeek-V3 164K 164K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-26
Updated: 2025-11-25
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B 131K 131K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-20
Updated: 2025-11-25
MiniMaxAI/MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 197K 131K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2026-02-15

SiliconFlow (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
THUDM/GLM-Z1-9B-0414 THUDM/GLM-Z1-9B-0414 131K 131K Input: $0.086
Output: $0.086
Model: 0.043
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-4-9B-0414 THUDM/GLM-4-9B-0414 33K 33K Input: $0.086
Output: $0.086
Model: 0.043
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-4-32B-0414 THUDM/GLM-4-32B-0414 33K 33K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
THUDM/GLM-Z1-32B-0414 THUDM/GLM-Z1-32B-0414 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-04-18
Updated: 2025-11-25
moonshotai/Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 262K 262K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-08
Updated: 2025-11-25
moonshotai/Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking 262K 262K Input: $0.55
Output: $2.5
Model: 0.275
Completion: 4.545
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-11-07
Updated: 2025-11-25
baidu/ERNIE-4.5-300B-A47B baidu/ERNIE-4.5-300B-A47B 131K 131K Input: $0.28
Output: $1.1
Model: 0.140
Completion: 3.929
🔧 🌡️ - In: text
Out: text
Released: 2025-07-02
Updated: 2025-11-25
ByteDance-Seed/Seed-OSS-36B-Instruct ByteDance-Seed/Seed-OSS-36B-Instruct 262K 262K Input: $0.21
Output: $0.57
Model: 0.105
Completion: 2.714
🔧 🌡️ - In: text
Out: text
Released: 2025-09-04
Updated: 2025-11-25
stepfun-ai/Step-3.5-Flash stepfun-ai/Step-3.5-Flash 262K 262K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
inclusionAI/Ling-flash-2.0 inclusionAI/Ling-flash-2.0 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
Pro/moonshotai/Kimi-K2-Thinking Pro/moonshotai/Kimi-K2-Thinking 262K 262K Input: $0.55
Output: $2.5
Model: 0.275
Completion: 4.545
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-11-07
Updated: 2025-11-25
Pro/moonshotai/Kimi-K2-Instruct-0905 Pro/moonshotai/Kimi-K2-Instruct-0905 262K 262K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-08
Updated: 2025-11-25
Pro/moonshotai/Kimi-K2.6 Pro/moonshotai/Kimi-K2.6 262K 262K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-04-21
Pro/moonshotai/Kimi-K2.5 Pro/moonshotai/Kimi-K2.5 262K 262K Input: $0.45
Output: $2.25
Model: 0.225
Completion: 5.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-27
Pro/zai-org/GLM-5 Pro/zai-org/GLM-5 205K 205K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Pro/zai-org/GLM-4.7 Pro/zai-org/GLM-4.7 205K 205K Input: $0.6
Output: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-22
Pro/zai-org/GLM-5.1 Pro/zai-org/GLM-5.1 205K 205K Input: $1.4
Output: $4.4
Cache Write: $0
Model: 0.700
Completion: 3.143
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-08
Pro/deepseek-ai/DeepSeek-R1 Pro/deepseek-ai/DeepSeek-R1 164K 164K Input: $0.5
Output: $2.18
Model: 0.250
Completion: 4.360
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
Updated: 2025-11-25
Pro/deepseek-ai/DeepSeek-V3.1-Terminus Pro/deepseek-ai/DeepSeek-V3.1-Terminus 164K 164K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-29
Updated: 2025-11-25
Pro/deepseek-ai/DeepSeek-V3.2 Pro/deepseek-ai/DeepSeek-V3.2 164K 164K Input: $0.27
Output: $0.42
Model: 0.135
Completion: 1.556
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-03
Pro/deepseek-ai/DeepSeek-V3 Pro/deepseek-ai/DeepSeek-V3 164K 164K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-26
Updated: 2025-11-25
Pro/MiniMaxAI/MiniMax-M2.1 Pro/MiniMaxAI/MiniMax-M2.1 197K 131K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-12-23
Pro/MiniMaxAI/MiniMax-M2.5 Pro/MiniMaxAI/MiniMax-M2.5 192K 131K Input: $0.3
Output: $1.22
Model: 0.150
Completion: 4.067
🔧 🌡️ - In: text
Out: text
Released: 2026-02-13
Qwen/Qwen3.6-35B-A3B Qwen/Qwen3.6-35B-A3B 262.1K 65.5K Input: $0.23
Output: $1.86
Model: 0.115
Completion: 8.087
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-04-17
Qwen/Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B 262.1K 65.5K Input: $0.29
Output: $1.74
Model: 0.145
Completion: 6.000
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-16
Qwen/Qwen3.5-122B-A10B Qwen/Qwen3.5-122B-A10B 262.1K 65.5K Input: $0.29
Output: $2.32
Model: 0.145
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-26
Qwen/Qwen3.5-27B Qwen/Qwen3.5-27B 262.1K 65.5K Input: $0.26
Output: $2.09
Model: 0.130
Completion: 8.038
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-25
Qwen/Qwen3.5-4B Qwen/Qwen3.5-4B 262.1K 65.5K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-03-03
Qwen/Qwen3.5-9B Qwen/Qwen3.5-9B 262.1K 65.5K Input: $0.22
Output: $1.74
Model: 0.110
Completion: 7.909
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-03-03
Qwen/Qwen3.5-35B-A3B Qwen/Qwen3.5-35B-A3B 262.1K 65.5K Input: $0.23
Output: $1.86
Model: 0.115
Completion: 8.087
🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-25
Qwen/Qwen3-VL-32B-Thinking Qwen/Qwen3-VL-32B-Thinking 262K 262K Input: $0.2
Output: $1.5
Model: 0.100
Completion: 7.500
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-21
Updated: 2025-11-25
Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct 262K 262K Input: $0.29
Output: $1
Model: 0.145
Completion: 3.448
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-05
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Instruct Qwen/Qwen3-Omni-30B-A3B-Instruct 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ - In: text, image, audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen2.5-72B-Instruct Qwen/Qwen2.5-72B-Instruct 33K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-14B Qwen/Qwen3-14B 131K 131K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen2.5-14B-Instruct Qwen/Qwen2.5-14B-Instruct 33K 4K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-VL-32B-Instruct Qwen/Qwen3-VL-32B-Instruct 262K 262K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-21
Updated: 2025-11-25
Qwen/Qwen3-32B Qwen/Qwen3-32B 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen3-235B-A22B-Instruct-2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262K 262K Input: $0.09
Output: $0.6
Model: 0.045
Completion: 6.667
🔧 🌡️ - In: text
Out: text
Released: 2025-07-23
Updated: 2025-11-25
Qwen/Qwen3-8B Qwen/Qwen3-8B 131K 131K Input: $0.06
Output: $0.06
Model: 0.030
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-04-30
Updated: 2025-11-25
Qwen/Qwen2.5-Coder-32B-Instruct Qwen/Qwen2.5-Coder-32B-Instruct 33K 4K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-11-11
Updated: 2025-11-25
Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 262K 262K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-25
Updated: 2025-11-25
Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking 262K 262K Input: $0.45
Output: $3.5
Model: 0.225
Completion: 7.778
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Captioner Qwen/Qwen3-Omni-30B-A3B-Captioner 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ - In: audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/QwQ-32B Qwen/QwQ-32B 131K 131K Input: $0.15
Output: $0.58
Model: 0.075
Completion: 3.867
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-03-06
Updated: 2025-11-25
Qwen/Qwen3-VL-8B-Instruct Qwen/Qwen3-VL-8B-Instruct 262K 262K Input: $0.18
Output: $0.68
Model: 0.090
Completion: 3.778
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-15
Updated: 2025-11-25
Qwen/Qwen3-VL-8B-Thinking Qwen/Qwen3-VL-8B-Thinking 262K 262K Input: $0.18
Output: $2
Model: 0.090
Completion: 11.111
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-15
Updated: 2025-11-25
Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking 262K 262K Input: $0.29
Output: $1
Model: 0.145
Completion: 3.448
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-11
Updated: 2025-11-25
Qwen/Qwen3-30B-A3B-Instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262K 262K Input: $0.09
Output: $0.3
Model: 0.045
Completion: 3.333
🔧 🌡️ - In: text
Out: text
Released: 2025-07-30
Updated: 2025-11-25
Qwen/Qwen3-Coder-30B-A3B-Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262K 262K Input: $0.07
Output: $0.28
Model: 0.035
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
Updated: 2025-11-25
Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262K 262K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-07-31
Updated: 2025-11-25
Qwen/Qwen2.5-72B-Instruct-128K Qwen/Qwen2.5-72B-Instruct-128K 131K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262K 262K Input: $0.13
Output: $0.6
Model: 0.065
Completion: 4.615
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Updated: 2025-11-25
Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262K 262K Input: $0.14
Output: $1.4
Model: 0.070
Completion: 10.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
Qwen/Qwen2.5-32B-Instruct Qwen/Qwen2.5-32B-Instruct 33K 4K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-19
Updated: 2025-11-25
Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct 262K 262K Input: $0.3
Output: $1.5
Model: 0.150
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-10-04
Updated: 2025-11-25
Qwen/Qwen2.5-7B-Instruct Qwen/Qwen2.5-7B-Instruct 33K 4K Input: $0.05
Output: $0.05
Model: 0.025
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2024-09-18
Updated: 2025-11-25
Qwen/Qwen3-30B-A3B-Thinking-2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262K 131K Input: $0.09
Output: $0.3
Model: 0.045
Completion: 3.333
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-31
Updated: 2025-11-25
Qwen/Qwen2.5-VL-32B-Instruct Qwen/Qwen2.5-VL-32B-Instruct 131K 131K Input: $0.27
Output: $0.27
Model: 0.135
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-03-24
Updated: 2025-11-25
Qwen/Qwen2.5-VL-72B-Instruct Qwen/Qwen2.5-VL-72B-Instruct 131K 4K Input: $0.59
Output: $0.59
Model: 0.295
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-01-28
Updated: 2025-11-25
Qwen/Qwen3-Omni-30B-A3B-Thinking Qwen/Qwen3-Omni-30B-A3B-Thinking 66K 66K Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, audio
Out: text
Released: 2025-10-04
Updated: 2025-11-25
PaddlePaddle/PaddleOCR-VL-1.5 PaddlePaddle/PaddleOCR-VL-1.5 16.4K 16.4K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-29
tencent/Hunyuan-A13B-Instruct tencent/Hunyuan-A13B-Instruct 131K 131K Input: $0.14
Output: $0.57
Model: 0.070
Completion: 4.071
🔧 🌡️ - In: text
Out: text
Released: 2025-06-30
Updated: 2025-11-25
tencent/Hunyuan-MT-7B tencent/Hunyuan-MT-7B 33K 33K Input: $0
Output: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2025-09-18
Updated: 2025-11-25
zai-org/GLM-4.5-Air zai-org/GLM-4.5-Air 131K 131K Input: $0.14
Output: $0.86
Model: 0.070
Completion: 6.143
🔧 🌡️ - In: text
Out: text
Released: 2025-07-28
Updated: 2025-11-25
zai-org/GLM-4.5V zai-org/GLM-4.5V 66K 66K Input: $0.14
Output: $0.86
Model: 0.070
Completion: 6.143
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-13
Updated: 2025-11-25
zai-org/GLM-4.6V zai-org/GLM-4.6V 131K 131K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-12-07
zai-org/GLM-4.6 zai-org/GLM-4.6 205K 205K Input: $0.5
Output: $1.9
Model: 0.250
Completion: 3.800
🔧 🌡️ - In: text
Out: text
Released: 2025-10-04
Updated: 2025-11-25
deepseek-ai/DeepSeek-V4-Pro deepseek-ai/DeepSeek-V4-Pro 1M 393K Input: $1.74
Output: $3.48
Cache Read: $0.145
Model: 0.870
Completion: 2.000
Cache: 0.083
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-24
deepseek-ai/DeepSeek-OCR deepseek-ai/DeepSeek-OCR 8.2K 8.2K Input: $0
Output: $0
- 📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-10-20
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B 131K 131K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-20
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3 deepseek-ai/DeepSeek-V3 164K 164K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2024-12-26
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 164K 164K Input: $0.27
Output: $0.42
Model: 0.135
Completion: 1.556
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-12-03
deepseek-ai/deepseek-vl2 deepseek-ai/deepseek-vl2 4K 4K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-13
Updated: 2025-11-25
deepseek-ai/DeepSeek-V3.1-Terminus deepseek-ai/DeepSeek-V3.1-Terminus 164K 164K Input: $0.27
Output: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-29
Updated: 2025-11-25
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 131K 131K Input: $0.18
Output: $0.18
Model: 0.090
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-20
Updated: 2025-11-25
deepseek-ai/DeepSeek-R1 deepseek-ai/DeepSeek-R1 164K 164K Input: $0.5
Output: $2.18
Model: 0.250
Completion: 4.360
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
Updated: 2025-11-25
Kwaipilot/KAT-Dev Kwaipilot/KAT-Dev 128K 128K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2025-09-27
Updated: 2026-01-16
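
The NewAPI Ratios column in the rows above appears to follow directly from the Pricing (1M) column: Model looks like the input price divided by 2, Completion like output price divided by input price, and Cache like cache-read price divided by input price. This relationship is inferred from the listed values, not stated by the page. The sketch below is a minimal illustration under that assumption; the function name `newapi_ratios` and the divisor of 2 are hypothetical, and rows with a $0 input price (shown as `-` in the table) are treated as having no ratios.

```python
from typing import Dict, Optional


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> Dict[str, float]:
    """Derive NewAPI-style ratios from per-1M-token prices in USD.

    Assumed relationship, inferred from the table values:
      model      = input price / 2
      completion = output price / input price
      cache      = cache-read price / input price
    """
    # Free rows show "-" in the ratios column, so return nothing for them.
    if input_usd <= 0:
        return {}

    ratios = {
        "model": round(input_usd / 2, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    # The cache ratio is only listed when a cache-read price exists.
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


# Prices taken from the deepseek-ai/DeepSeek-V4-Pro row above.
print(newapi_ratios(1.74, 3.48, 0.145))
```

With the deepseek-ai/DeepSeek-V4-Pro prices (Input: $1.74, Output: $3.48, Cache Read: $0.145), this yields the same Model, Completion, and Cache values that the row lists.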

Snowflake Cortex

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT-5.1 openai-gpt-5.1 400K 128K - - 📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
Llama-3.3-70B-Instruct snowflake-llama3.3-70b 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
GPT-5.2 openai-gpt-5.2 400K 128K - - 📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2025-12-11
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 200K 16.4K - - 📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Opus 4.7 claude-opus-4-7 1M 128K - - 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
DeepSeek-R1 deepseek-r1 128K 32.8K - - 🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
Claude Opus 4.8 claude-opus-4-8 1M 128K - - 📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
GPT-5 openai-gpt-5 400K 128K - - 📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5.5 openai-gpt-5.5 1.1M 128K - - 📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
Claude Fable 5 claude-fable-5 1M 128K - - 📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
GPT-5 Nano openai-gpt-5-nano 400K 128K - - 📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Claude Haiku 4.5 (latest) claude-haiku-4-5 200K 16.4K - - 📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Mistral Large (latest) mistral-large2 262.1K 262.1K - - 📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2025-12-02
GPT-4.1 openai-gpt-4.1 1M 32.8K - - 📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
Claude Sonnet 4.6 claude-sonnet-4-6 1M 16.4K - - 📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
GPT-5.4 openai-gpt-5.4 1.1M 128K - - 📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
Gemini 3.1 Pro Preview gemini-3.1-pro 1M 65.5K - - 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
GPT-5 Mini openai-gpt-5-mini 272K 8.2K - - 📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07

STACKIT

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.1 8B neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8 128K 8.2K Input: $0.16
Output: $0.27
Model: 0.080
Completion: 1.688
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Mistral Nemo neuralmagic/Mistral-Nemo-Instruct-2407-FP8 128K 8.2K Input: $0.49
Output: $0.71
Model: 0.245
Completion: 1.449
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-01
Llama 3.3 70B cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic 128K 8.2K Input: $0.49
Output: $0.71
Model: 0.245
Completion: 1.449
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-05
Gemma 3 27B google/gemma-3-27b-it 37K 8.2K Input: $0.49
Output: $0.71
Model: 0.245
Completion: 1.449
📎 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-05-17
Qwen3-VL Embedding 8B Qwen/Qwen3-VL-Embedding-8B 32K 4.1K Input: $0.09
Output: $0.09
Model: 0.045
Completion: 1.000
📎 - In: text, image
Out: text
Open Weights
Released: 2026-02-05
Qwen3-VL 235B Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 218K 8.2K Input: $1.64
Output: $1.91
Model: 0.820
Completion: 1.165
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-11-01
GPT-OSS 120B openai/gpt-oss-120b 131K 8.2K Input: $0.49
Output: $0.71
Model: 0.245
Completion: 1.449
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
E5 Mistral 7B intfloat/e5-mistral-7b-instruct 4.1K 4.1K Input: $0.02
Output: $0.02
Model: 0.010
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2023-12-11

StepFun

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Step 1 (32K) step-1-32k 32.8K 32.8K Input: $2.05
Output: $9.59
Cache Read: $0.41
Model: 1.025
Completion: 4.678
Cache: 0.200
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2025-01-01
Updated: 2026-02-13
Step 3.5 Flash 2603 step-3.5-flash-2603 256K 256K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-04-02
Step 3.5 Flash step-3.5-flash 256K 256K Input: $0.096
Output: $0.288
Cache Read: $0.019
Model: 0.048
Completion: 3.000
Cache: 0.198
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-02-13
Step 2 (16K) step-2-16k 16.4K 8.2K Input: $5.21
Output: $16.44
Cache Read: $1.04
Model: 2.605
Completion: 3.155
Cache: 0.200
🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2025-01-01
Updated: 2026-02-13

StepFun AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Step 3.5 Flash step-3.5-flash 256K 256K Input: $0.096
Output: $0.288
Cache Read: $0.019
Model: 0.048
Completion: 3.000
Cache: 0.198
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-02-13
Step 3.5 Flash 2603 step-3.5-flash-2603 256K 256K Input: $0.1
Output: $0.3
Cache Read: $0.02
Model: 0.050
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-04-02

submodel

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K Input: $0.2
Output: $0.3
Model: 0.100
Completion: 1.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K Input: $0.1
Output: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
GLM 4.5 FP8 zai-org/GLM-4.5-FP8 131.1K 131.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai-org/GLM-4.5-Air 131.1K 131.1K Input: $0.1
Output: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 75K 163.8K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 75K 163.8K Input: $0.5
Output: $2.15
Model: 0.250
Completion: 4.300
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 75K 163.8K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23

Synthetic

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama-4-Maverick-17B-128E-Instruct-FP8 hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 524K 4.1K Input: $0.22
Output: $0.88
Model: 0.110
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-4-Scout-17B-16E-Instruct hf:meta-llama/Llama-4-Scout-17B-16E-Instruct 328K 4.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct hf:meta-llama/Llama-3.3-70B-Instruct 128K 32.8K Input: $0.9
Output: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-3.1-70B-Instruct hf:meta-llama/Llama-3.1-70B-Instruct 128K 32.8K Input: $0.9
Output: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.1-405B-Instruct hf:meta-llama/Llama-3.1-405B-Instruct 128K 32.8K Input: $3
Output: $3
Model: 1.500
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.1-8B-Instruct hf:meta-llama/Llama-3.1-8B-Instruct 128K 32.8K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
DeepSeek R1 hf:deepseek-ai/DeepSeek-R1 128K 128K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek V3 (0324) hf:deepseek-ai/DeepSeek-V3-0324 128K 128K Input: $1.2
Output: $1.2
Model: 0.600
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek V3.1 Terminus hf:deepseek-ai/DeepSeek-V3.1-Terminus 128K 128K Input: $1.2
Output: $1.2
Model: 0.600
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-22
Updated: 2025-09-25
DeepSeek R1 (0528) hf:deepseek-ai/DeepSeek-R1-0528 128K 128K Input: $3
Output: $8
Model: 1.500
Completion: 2.667
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek V3.1 hf:deepseek-ai/DeepSeek-V3.1 128K 128K Input: $0.56
Output: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3.2 hf:deepseek-ai/DeepSeek-V3.2 162.8K 8K Input: $0.27
Output: $0.4
Cache Read: $0.27
Cache Write: $0
Model: 0.135
Completion: 1.481
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-01
DeepSeek V3 hf:deepseek-ai/DeepSeek-V3 128K 128K Input: $1.25
Output: $1.25
Model: 0.625
Completion: 1.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
Kimi K2 Thinking hf:moonshotai/Kimi-K2-Thinking 262.1K 262.1K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2025-11-07
Kimi K2 0905 hf:moonshotai/Kimi-K2-Instruct-0905 262.1K 32.8K Input: $1.2
Output: $1.2
Model: 0.600
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2.6 hf:moonshotai/Kimi-K2.6 262.1K 65.5K Input: $0.95
Output: $4
Cache Read: $0.95
Model: 0.475
Completion: 4.211
Cache: 1.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 hf:moonshotai/Kimi-K2.5 262.1K 65.5K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01
GLM-4.7-Flash hf:zai-org/GLM-4.7-Flash 196.6K 65.5K Input: $0.06
Output: $0.4
Cache Read: $0.06
Model: 0.030
Completion: 6.667
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-18
GLM 4.6 hf:zai-org/GLM-4.6 200K 64K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-5 hf:zai-org/GLM-5 196.6K 65.5K Input: $1
Output: $3
Cache Read: $1
Model: 0.500
Completion: 3.000
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-04-08
GLM 4.7 hf:zai-org/GLM-4.7 200K 64K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM 5.1 hf:zai-org/GLM-5.1 196.6K 65.5K Input: $1
Output: $3
Cache Read: $1
Model: 0.500
Completion: 3.000
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
Updated: 2026-04-12
MiniMax-M2.1 hf:MiniMaxAI/MiniMax-M2.1 204.8K 131.1K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-12-23
MiniMax-M2 hf:MiniMaxAI/MiniMax-M2 196.6K 131K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax-M2.5 hf:MiniMaxAI/MiniMax-M2.5 191.5K 65.5K Input: $0.6
Output: $3
Cache Read: $0.6
Model: 0.300
Completion: 5.000
Cache: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-07
MiniMax-M3 hf:MiniMaxAI/MiniMax-M3 524.3K 65.5K Input: $0.6
Output: $1.2
Cache Read: $0.6
Model: 0.300
Completion: 2.000
Cache: 1.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-06-12
GPT OSS 120B hf:openai/gpt-oss-120b 128K 32.8K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3.5-397B-A17B hf:Qwen/Qwen3.5-397B-A17B 262.1K 65.5K Input: $0.6
Output: $3
Cache Read: $0.6
Model: 0.300
Completion: 5.000
Cache: 1.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-11
Qwen3 235B A22B Thinking 2507 hf:Qwen/Qwen3-235B-A22B-Thinking-2507 256K 32K Input: $0.65
Output: $3
Model: 0.325
Completion: 4.615
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen 3 Coder 480B hf:Qwen/Qwen3-Coder-480B-A35B-Instruct 256K 32K Input: $2
Output: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen2.5-Coder-32B-Instruct hf:Qwen/Qwen2.5-Coder-32B-Instruct 32.8K 32.8K Input: $0.8
Output: $0.8
Model: 0.400
Completion: 1.000
🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen 3 235B Instruct hf:Qwen/Qwen3-235B-A22B-Instruct-2507 256K 32K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Kimi K2.5 (NVFP4) hf:nvidia/Kimi-K2.5-NVFP4 262.1K 65.5K Input: $0.55
Output: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01
Nemotron 3 Super 120B hf:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 262.1K 65.5K Input: $0.3
Output: $1
Cache Read: $0.3
Model: 0.150
Completion: 3.333
Cache: 1.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-04-03

Tencent Coding Plan (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiniMax-M2.5 minimax-m2.5 204.8K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Kimi-K2.5 kimi-k2.5 262.1K 32.8K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-01-27
Hunyuan-TurboS hunyuan-turbos 131.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2026-03-08
Hunyuan-T1 hunyuan-t1 131.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-08
Auto tc-code-latest 131.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2026-03-08
GLM-5 glm-5 202.8K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-11
Tencent HY 2.0 Instruct hunyuan-2.0-instruct 131.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🔧 🌡️ - In: text
Out: text
Released: 2026-03-08
Tencent HY 2.0 Think hunyuan-2.0-thinking 131.1K 16.4K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-08

Tencent TokenHub

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Hy3 preview hy3-preview 256K 64K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-20

The Grid AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Agent Prime agent-prime 128K 64K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-04
Updated: 2026-05-19
Agent Max agent-max 1M 128K - - 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-04
Updated: 2026-05-19
Text Standard text-standard 128K 16K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-26
Updated: 2026-05-19
Code Prime code-prime 128K 64K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-04
Updated: 2026-05-19
Text Prime text-prime 128K 30K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-26
Updated: 2026-05-19
Code Max code-max 1M 128K - - 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-04
Updated: 2026-05-19
Agent Standard agent-standard 128K 16K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-04
Updated: 2026-05-19
Text Max text-max 1M 128K - - 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-24
Updated: 2026-05-19
Code Standard code-standard 128K 16K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-04
Updated: 2026-05-19

Together AI

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
LFM2-24B-A2B LiquidAI/LFM2-24B-A2B 32.8K 32.8K Input: $0.03
Output: $0.12
Model: 0.015
Completion: 4.000
🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-25
Meta Llama 3 8B Instruct Lite meta-llama/Meta-Llama-3-8B-Instruct-Lite 8.2K 8.2K Input: $0.14
Output: $0.14
Model: 0.070
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-18
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-Turbo 131.1K 131.1K Input: $0.88
Output: $0.88
Model: 0.440
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 131K Input: $1.2
Output: $4.5
Cache Read: $0.2
Model: 0.600
Completion: 3.750
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $0.5
Output: $2.8
Model: 0.250
Completion: 5.600
🧠 🔧 🌡️ 2026-01 In: text, image
Out: text
Open Weights
Released: 2026-01-27
Kimi K2.7 Code moonshotai/Kimi-K2.7-Code 262.1K 131.1K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-14
Gemma 4 31B Instruct google/gemma-4-31B-it 262.1K 131.1K Input: $0.39
Output: $0.97
Model: 0.195
Completion: 2.487
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-07
Gemma 3N E4B Instruct google/gemma-3n-E4B-it 32.8K 32.8K Input: $0.06
Output: $0.12
Model: 0.030
Completion: 2.000
🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-20
Qwen3.7 Max Qwen/Qwen3.7-Max 1M 500K Input: $2.5
Output: $7.5
Model: 1.250
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3.6 Plus Qwen/Qwen3.6-Plus 1M 500K Input: $0.5
Output: $3
Model: 0.250
Completion: 6.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-30
Qwen3.5 397B A17B Qwen/Qwen3.5-397B-A17B 262.1K 130K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-02-16
Qwen3 Coder Next FP8 Qwen/Qwen3-Coder-Next-FP8 262.1K 262.1K Input: $0.5
Output: $1.2
Model: 0.250
Completion: 2.400
🔧 🌡️ 2026-02-03 In: text
Out: text
Open Weights
Released: 2026-02-03
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K Input: $2
Output: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 235B A22B Instruct 2507 FP8 Qwen/Qwen3-235B-A22B-Instruct-2507-tput 262.1K 262.1K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen 2.5 7B Instruct Turbo Qwen/Qwen2.5-7B-Instruct-Turbo 32.8K 32.8K Input: $0.3
Output: $0.3
Model: 0.150
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-19
Qwen3.5 9B Qwen/Qwen3.5-9B 262.1K 65.5K Input: $0.17
Output: $0.25
Model: 0.085
Completion: 1.471
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-03
GPT OSS 120B openai/gpt-oss-120b 131.1K 131.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 20B openai/gpt-oss-20b 131.1K 131.1K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Pearl AI Gemma 4 31B Instruct pearl-ai/gemma-4-31b-it 32K 32K Input: $0.28
Output: $0.86
Model: 0.140
Completion: 3.071
🧠 🌡️ - In: text
Out: text
Released: 2026-04-07
Nemotron 3 Ultra 550B A55B nvidia/nemotron-3-ultra-550b-a55b 512.3K 512.3K Input: $0.6
Output: $3.6
Cache Read: $0.2
Model: 0.300
Completion: 6.000
Cache: 0.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
Cogito v2.1 671B deepcogito/cogito-v2-1-671b 163.8K 163.8K Input: $1.25
Output: $1.25
Model: 0.625
Completion: 1.000
🧠 🌡️ - In: text
Out: text
Released: 2025-11-13
GLM-5 zai-org/GLM-5 202.8K 131.1K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-5.1 zai-org/GLM-5.1 202.8K 131.1K Input: $1.4
Output: $4.4
Model: 0.700
Completion: 3.143
🧠 🔧 🌡️ 2025-11 In: text
Out: text
Open Weights
Released: 2026-04-07
DeepSeek-R1 deepseek-ai/DeepSeek-R1 163.8K 163.8K Input: $3
Output: $7
Model: 1.500
Completion: 2.333
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-03-24
DeepSeek V3.1 deepseek-ai/DeepSeek-V3-1 131.1K 131.1K Input: $0.6
Output: $1.7
Model: 0.300
Completion: 2.833
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-08-21
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro 512K 384K Input: $1.74
Output: $3.48
Cache Read: $0.2
Model: 0.870
Completion: 2.000
Cache: 0.115
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-V3 deepseek-ai/DeepSeek-V3 131.1K 131.1K Input: $1.25
Output: $1.25
Model: 0.625
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-12-26
Updated: 2025-05-29
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
MiniMax-M3 MiniMaxAI/MiniMax-M3 524.3K 250K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-06-12
MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 202.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Rnj-1 Instruct essentialai/Rnj-1-Instruct 32.8K 32.8K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-12-05

Umans AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.7 Code umans-kimi-k2.7 262.1K 32.8K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
Kimi K2.6 umans-kimi-k2.6 262.1K 32.8K Input: $0.95
Output: $4
Cache Read: $0.2
Model: 0.475
Completion: 4.211
Cache: 0.211
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
GLM 5.1 umans-glm-5.1 204.8K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.29
Model: 0.700
Completion: 3.143
Cache: 0.207
🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-27
Umans Coder umans-coder 262.1K 32.8K Input: $0.95
Output: $4
Cache Read: $0.2
Model: 0.475
Completion: 4.211
Cache: 0.211
📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Umans Flash umans-flash 262.1K 32.8K Input: $0.15
Output: $1
Cache Read: $0.05
Model: 0.075
Completion: 6.667
Cache: 0.333
📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-17

Umans AI Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.7 Code umans-kimi-k2.7 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
Kimi K2.6 umans-kimi-k2.6 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-21
GLM 5.1 umans-glm-5.1 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-03-27
Umans Coder umans-coder 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-21
Umans Flash umans-flash 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-17
Qwen3.6 35B A3B umans-qwen3.6-35b-a3b 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-17

Upstage

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
solar-pro2 solar-pro2 65.5K 8.2K Input: $0.25
Output: $0.25
Model: 0.125
Completion: 1.000
🧠 🔧 🌡️ 2025-03 In: text
Out: text
Released: 2025-05-20
solar-pro3 solar-pro3 131.1K 8.2K Input: $0.25
Output: $0.25
Model: 0.125
Completion: 1.000
🧠 🔧 🌡️ 2025-03 In: text
Out: text
Released: 2026-01
solar-mini solar-mini 32.8K 4.1K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-06-12
Updated: 2025-04-22

v0

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
v0-1.0-md v0-1.0-md 128K 32K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-22
v0-1.5-lg v0-1.5-lg 512K 32K Input: $15
Output: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
v0-1.5-md v0-1.5-md 128K 32K Input: $3
Output: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09

Venice AI

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM 5 Turbo z-ai-glm-5-turbo 200K 32.8K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-15
Updated: 2026-06-11
Grok 4.20 Multi-Agent grok-4-20-multi-agent 2M 128K Input: $1.42
Output: $2.83
Cache Read: $0.23
Model: 0.710
Completion: 1.993
Cache: 0.162
📎 🧠 - In: text, image
Out: text
Released: 2026-03-12
Updated: 2026-06-11
DeepSeek V4 Flash deepseek-v4-flash 1M 32.8K Input: $0.17
Output: $0.35
Cache Read: $0.028
Model: 0.085
Completion: 2.059
Cache: 0.165
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-06-11
Google Gemma 4 31B Instruct google-gemma-4-31b-it 256K 8.2K Input: $0.12
Output: $0.36
Cache Read: $0.09
Model: 0.060
Completion: 3.000
Cache: 0.750
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-03
Updated: 2026-06-11
Kimi K2.6 kimi-k2-6 256K 65.5K Input: $0.85
Output: $4.655
Cache Read: $0.22
Model: 0.425
Completion: 5.476
Cache: 0.259
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-04-20
Updated: 2026-06-11
Qwen 3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 128K 16.4K Input: $0.15
Output: $0.75
Model: 0.075
Completion: 5.000
🔧 - In: text
Out: text
Open Weights
Released: 2025-04-29
Updated: 2026-06-11
Nemotron Cascade 2 30B A3B nvidia-nemotron-cascade-2-30b-a3b 256K 32.8K Input: $0.14
Output: $0.8
Model: 0.070
Completion: 5.714
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-24
Updated: 2026-06-11
Claude Opus 4.7 Fast claude-opus-4-7-fast 1M 128K Input: $36
Output: $180
Cache Read: $3.6
Cache Write: $45
Model: 18.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-05-14
Updated: 2026-06-11
GPT-5.5 Pro openai-gpt-55-pro 1M 128K Input: $37.5
Output: $225
Model: 18.750
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image
Out: text
Released: 2026-04-24
Updated: 2026-06-11
Llama 3.3 70B llama-3.3-70b 128K 4.1K Input: $0.7
Output: $2.8
Model: 0.350
Completion: 4.000
🔧 - In: text
Out: text
Open Weights
Released: 2025-04-06
Updated: 2026-06-11
Qwen 3.5 397B qwen3-5-397b-a17b 128K 32.8K Input: $0.75
Output: $4.5
Model: 0.375
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-16
Updated: 2026-06-11
Claude Opus 4.5 claude-opus-4-5 198K 32.8K Input: $6
Output: $30
Cache Read: $0.6
Cache Write: $7.5
Model: 3.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-12-06
Updated: 2026-06-11
GLM 5 zai-org-glm-5 198K 32K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
Updated: 2026-06-11
Qwen 3.5 35B A3B qwen3-5-35b-a3b 256K 16.4K Input: $0.3125
Output: $1.25
Cache Read: $0.15625
Model: 0.156
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-02-25
Updated: 2026-06-11
Venice Role Play Uncensored venice-uncensored-role-play 128K 4.1K Input: $0.5
Output: $2
Model: 0.250
Completion: 4.000
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-02-20
Updated: 2026-06-11
Qwen 3.6 Plus Uncensored qwen-3-6-plus 1M 65.5K Input: $0.625
Output: $3.75
Cache Read: $0.0625
Cache Write: $0.78
Model: 0.313
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-04-06
Updated: 2026-06-11
GLM 4.6 zai-org-glm-4.6 198K 16.4K Input: $0.85
Output: $2.75
Cache Read: $0.3
Model: 0.425
Completion: 3.235
Cache: 0.353
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2024-04-01
Updated: 2026-06-11
GPT-4o openai-gpt-4o-2024-11-20 128K 16.4K Input: $3.125
Output: $12.5
Model: 1.563
Completion: 4.000
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2026-02-28
Updated: 2026-06-11
Grok 4.3 grok-4-3 1M 32K Input: $1.42
Output: $2.83
Cache Read: $0.23
Model: 0.710
Completion: 1.993
Cache: 0.162
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-18
Updated: 2026-06-11
Qwen 3.7 Plus qwen-3-7-plus 1M 65.5K Input: $0.5
Output: $2
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Released: 2026-06-02
Updated: 2026-06-11
MiniMax M2.7 minimax-m27 198K 32.8K Input: $0.375
Output: $1.5
Cache Read: $0.06875
Model: 0.188
Completion: 4.000
Cache: 0.183
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
Updated: 2026-06-11
DeepSeek V4 Pro deepseek-v4-pro 1M 32.8K Input: $1.73
Output: $3.796
Cache Read: $0.33
Model: 0.865
Completion: 2.194
Cache: 0.191
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-06-11
Qwen 3 235B A22B Thinking 2507 qwen3-235b-a22b-thinking-2507 128K 16.4K Input: $0.45
Output: $3.5
Model: 0.225
Completion: 7.778
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-04-29
Updated: 2026-06-11
Claude Sonnet 4.5 claude-sonnet-4-5 198K 64K Input: $3.75
Output: $18.75
Cache Read: $0.375
Cache Write: $4.69
Model: 1.875
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-01-15
Updated: 2026-06-11
GPT-4o Mini openai-gpt-4o-mini-2024-07-18 128K 16.4K Input: $0.1875
Output: $0.75
Cache Read: $0.09375
Model: 0.094
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2026-02-28
Updated: 2026-06-11
Claude Opus 4.7 claude-opus-4-7 1M 128K Input: $6
Output: $30
Cache Read: $0.6
Cache Write: $7.5
Model: 3.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-04-16
Updated: 2026-06-11
GPT-5.3 Codex openai-gpt-53-codex 400K 128K Input: $2.19
Output: $17.5
Cache Read: $0.219
Model: 1.095
Completion: 7.991
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-02-24
Updated: 2026-06-11
GPT-5.4 openai-gpt-54 1M 131.1K Input: $3.13
Output: $18.8
Cache Read: $0.313
Model: 1.565
Completion: 6.006
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
Updated: 2026-06-11
Google Gemma 3 27B Instruct google-gemma-3-27b-it 198K 16.4K Input: $0.12
Output: $0.2
Model: 0.060
Completion: 1.667
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2025-11-04
Updated: 2026-06-11
MiniMax M3 minimax-m3 500K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
Updated: 2026-06-11
Claude Opus 4.8 Fast claude-opus-4-8-fast 1M 128K Input: $12
Output: $60
Cache Read: $1.2
Cache Write: $15
Model: 6.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-28
Updated: 2026-06-11
Gemma 4 Uncensored gemma-4-uncensored 256K 8.2K Input: $0.1625
Output: $0.5
Model: 0.081
Completion: 3.077
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-13
Updated: 2026-06-11
Qwen 3 Next 80b qwen3-next-80b 256K 16.4K Input: $0.35
Output: $1.9
Model: 0.175
Completion: 5.429
🔧 - In: text
Out: text
Open Weights
Released: 2025-04-29
Updated: 2026-06-11
Mistral Small 4 mistral-small-2603 256K 65.5K Input: $0.1875
Output: $0.75
Model: 0.094
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Updated: 2026-06-11
Claude Opus 4.8 claude-opus-4-8 1M 128K Input: $6
Output: $30
Cache Read: $0.6
Cache Write: $7.5
Model: 3.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-05-28
Updated: 2026-06-11
GLM 4.7 Flash zai-org-glm-4.7-flash 128K 16.4K Input: $0.125
Output: $0.5
Model: 0.063
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-29
Updated: 2026-06-11
Google Gemma 4 26B A4B Instruct google-gemma-4-26b-a4b-it 256K 8.2K Input: $0.1625
Output: $0.5
Model: 0.081
Completion: 3.077
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-04-02
Updated: 2026-06-11
Grok 4.20 grok-4-20 2M 128K Input: $1.42
Output: $2.83
Cache Read: $0.23
Model: 0.710
Completion: 1.993
Cache: 0.162
📎 🧠 🔧 - In: text, image
Out: text
Released: 2026-03-12
Updated: 2026-06-11
Hermes 3 Llama 3.1 405b hermes-3-llama-3.1-405b 128K 16.4K Input: $1.1
Output: $3
Model: 0.550
Completion: 2.727
- - In: text
Out: text
Open Weights
Released: 2025-09-25
Updated: 2026-06-11
Aion 2.0 aion-labs-aion-2-0 128K 32.8K Input: $1
Output: $2
Cache Read: $0.25
Model: 0.500
Completion: 2.000
Cache: 0.250
🧠 - In: text
Out: text
Released: 2026-03-24
Updated: 2026-06-11
Qwen3 VL 235B qwen3-vl-235b-a22b 256K 16.4K Input: $0.25
Output: $1.5
Model: 0.125
Completion: 6.000
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-01-16
Updated: 2026-06-11
MiMo-V2.5 xiaomi-mimo-v2-5 1M 65.5K Input: $0.175
Output: $0.35
Cache Read: $0.0625
Model: 0.087
Completion: 2.000
Cache: 0.357
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-06-11
Venice Uncensored 1.2 venice-uncensored-1-2 128K 8.2K Input: $0.2
Output: $0.9
Model: 0.100
Completion: 4.500
📎 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-04-01
Updated: 2026-06-11
Gemini 3.5 Flash gemini-3-5-flash 1M 65.5K Input: $1.55
Output: $9.45
Cache Read: $0.155
Cache Write: $0.086
Model: 0.775
Completion: 6.097
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video
Out: text
Released: 2026-05-22
Updated: 2026-06-11
Claude Fable 5 claude-fable-5 1M 128K Input: $12
Output: $60
Cache Read: $1.2
Cache Write: $15
Model: 6.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image
Out: text
Released: 2026-06-10
Updated: 2026-06-11
OpenAI GPT OSS 120B openai-gpt-oss-120b 128K 16.4K Input: $0.07
Output: $0.3
Model: 0.035
Completion: 4.286
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-11-06
Updated: 2026-06-11
NVIDIA Nemotron 3 Nano 30B nvidia-nemotron-3-nano-30b-a3b 128K 16.4K Input: $0.075
Output: $0.3
Model: 0.037
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-06-11
Claude Opus 4.6 claude-opus-4-6 1M 128K Input: $6
Output: $30
Cache Read: $0.6
Cache Write: $7.5
Model: 3.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image
Out: text
Released: 2026-02-05
Updated: 2026-06-11
Mistral Small 3.2 24B Instruct mistral-small-3-2-24b-instruct 256K 16.4K Input: $0.09375
Output: $0.25
Model: 0.047
Completion: 2.667
🔧 - In: text
Out: text
Open Weights
Released: 2026-01-15
Updated: 2026-06-11
GPT-5.4 Pro openai-gpt-54-pro 1M 128K Input: $37.5
Output: $225
Model: 18.750
Completion: 6.000
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-05
Updated: 2026-06-11
MiniMax M3 Preview minimax-m3-preview 524.3K 65.5K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-06-12
Updated: 2026-06-13
Gemini 3.1 Pro Preview gemini-3-1-pro-preview 1M 32.8K Input: $2.5
Output: $15
Cache Read: $0.5
Cache Write: $0.5
Model: 1.250
Completion: 6.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video
Out: text
Released: 2026-02-19
Updated: 2026-06-11
Qwen 3 Coder 480B Turbo qwen3-coder-480b-a35b-instruct-turbo 256K 65.5K Input: $0.35
Output: $1.5
Cache Read: $0.04
Model: 0.175
Completion: 4.286
Cache: 0.114
🔧 - In: text
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-06-11
Qwen 3.6 27B qwen3-6-27b 256K 65.5K Input: $0.325
Output: $3.25
Model: 0.163
Completion: 10.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Released: 2026-04-24
Updated: 2026-06-11
GPT-5.2 Codex openai-gpt-52-codex 256K 65.5K Input: $2.19
Output: $17.5
Cache Read: $0.219
Model: 1.095
Completion: 7.991
Cache: 0.100
📎 🧠 🔧 2025-08 In: text, image
Out: text
Released: 2025-01-15
Updated: 2026-06-11
Claude Opus 4.6 Fast claude-opus-4-6-fast 1M 128K Input: $36
Output: $180
Cache Read: $3.6
Cache Write: $45
Model: 18.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image
Out: text
Released: 2026-04-08
Updated: 2026-06-11
Qwen 3.7 Max qwen-3-7-max 1M 65.5K Input: $2.7
Output: $8.05
Cache Read: $0.27
Cache Write: $3.35
Model: 1.350
Completion: 2.981
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-22
Updated: 2026-06-11
GLM 4.7 Flash Heretic olafangensan-glm-4.7-flash-heretic 200K 24K Input: $0.14
Output: $0.8
Model: 0.070
Completion: 5.714
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-02-04
Updated: 2026-06-11
GPT-5.4 Mini openai-gpt-54-mini 400K 128K Input: $0.9375
Output: $5.625
Cache Read: $0.09375
Model: 0.469
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-03-27
Updated: 2026-06-11
Grok Build 0.1 grok-build-0-1 256K 65.5K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-21
Updated: 2026-06-11
GLM 4.7 zai-org-glm-4.7 198K 16.4K Input: $0.55
Output: $2.65
Cache Read: $0.11
Model: 0.275
Completion: 4.818
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-24
Updated: 2026-06-11
Claude Sonnet 4.6 claude-sonnet-4-6 1M 64K Input: $3.6
Output: $18
Cache Read: $0.36
Cache Write: $4.5
Model: 1.800
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-02-17
Updated: 2026-06-11
Gemini 3 Flash Preview gemini-3-flash-preview 256K 65.5K Input: $0.7
Output: $3.75
Cache Read: $0.07
Model: 0.350
Completion: 5.357
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2025-12-19
Updated: 2026-06-11
Trinity Large Thinking arcee-trinity-large-thinking 256K 65.5K Input: $0.3125
Output: $1.125
Cache Read: $0.075
Model: 0.156
Completion: 3.600
Cache: 0.240
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2026-04-02
Updated: 2026-06-11
GPT-5.5 openai-gpt-55 1M 131.1K Input: $6.25
Output: $37.5
Cache Read: $0.625
Model: 3.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image
Out: text
Released: 2026-04-23
Updated: 2026-06-11
Qwen 3.5 9B qwen3-5-9b 256K 32.8K Input: $0.1
Output: $0.15
Model: 0.050
Completion: 1.500
📎 🧠 🔧 - In: text, image
Out: text
Open Weights
Released: 2026-03-05
Updated: 2026-06-11
Mercury 2 mercury-2 128K 50K Input: $0.3125
Output: $0.9375
Cache Read: $0.03125
Model: 0.156
Completion: 3.000
Cache: 0.100
🧠 🔧 - In: text
Out: text
Released: 2026-02-20
Updated: 2026-06-11
GLM 5.1 zai-org-glm-5-1 200K 24K Input: $1.75
Output: $5.5
Cache Read: $0.325
Model: 0.875
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-07
Updated: 2026-06-11
Kimi K2.5 kimi-k2-5 256K 65.5K Input: $0.56
Output: $3.5
Cache Read: $0.22
Model: 0.280
Completion: 6.250
Cache: 0.393
📎 🧠 🔧 2024-04 In: text, image
Out: text
Released: 2026-01-27
Updated: 2026-06-11
GLM 5V Turbo z-ai-glm-5v-turbo 200K 32.8K Input: $1.5
Output: $5
Cache Read: $0.3
Model: 0.750
Completion: 3.333
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-04-01
Updated: 2026-06-11
NVIDIA Nemotron 3 Ultra nvidia-nemotron-3-ultra-550b-a55b 256K 32.8K Input: $0.625
Output: $3.125
Cache Read: $0.1875
Model: 0.313
Completion: 5.000
Cache: 0.300
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-04
Updated: 2026-06-11
GPT-5.2 openai-gpt-52 256K 65.5K Input: $2.19
Output: $17.5
Cache Read: $0.219
Model: 1.095
Completion: 7.991
Cache: 0.100
🧠 🔧 2025-08-31 In: text
Out: text
Released: 2025-12-13
Updated: 2026-06-11
DeepSeek V3.2 deepseek-v3.2 160K 32.8K Input: $0.33
Output: $0.48
Cache Read: $0.16
Model: 0.165
Completion: 1.455
Cache: 0.485
🧠 🔧 - In: text
Out: text
Open Weights
Released: 2025-12-04
Updated: 2026-06-11
MiniMax M2.5 minimax-m25 198K 32.8K Input: $0.34
Output: $1.19
Cache Read: $0.04
Model: 0.170
Completion: 3.500
Cache: 0.118
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-06-11
Llama 3.2 3B llama-3.2-3b 128K 4.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 - In: text
Out: text
Open Weights
Released: 2024-10-03
Updated: 2026-06-11
Hy3 Preview tencent-hy3-preview 256K 32.8K Input: $0.063
Output: $0.21
Cache Read: $0.021
Model: 0.032
Completion: 3.333
Cache: 0.333
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-06-08
Updated: 2026-06-11

Vercel AI Gateway

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4.1 Fast Reasoning xai/grok-4.1-fast-reasoning 1M 1M Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-07-09
Grok 4.20 Beta Non-Reasoning xai/grok-4.20-non-reasoning-beta 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.4
Model: 0.625
Completion: 2.000
Cache: 0.320
📎 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-11
Updated: 2026-03-13
Grok 4.3 xai/grok-4.3 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-30
Updated: 2026-04-17
Grok 4.1 Fast Non-Reasoning xai/grok-4.1-fast-non-reasoning 1M 1M Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-07-09
Grok Imagine xai/grok-imagine-video - - - - 🌡️ - In: text
Out: video
Released: 2026-01-28
Grok 4.20 Multi Agent Beta xai/grok-4.20-multi-agent-beta 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-11
Updated: 2026-03-13
Grok 4.20 Reasoning xai/grok-4.20-reasoning 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Updated: 2026-03-23
Grok 4.20 Beta Reasoning xai/grok-4.20-reasoning-beta 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-11
Updated: 2026-03-13
Grok Imagine Video 1.5 Preview xai/grok-imagine-video-1.5-preview - - - - 🌡️ - In: text
Out: video
Released: 2026-05-30
Grok 4.20 Non-Reasoning xai/grok-4.20-non-reasoning 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Updated: 2026-03-23
Grok 4.20 Multi-Agent xai/grok-4.20-multi-agent 2M 2M Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Updated: 2026-03-23
Grok Imagine Image xai/grok-imagine-image - - - - 🌡️ - In: text
Out: text, image
Released: 2026-01-28
Updated: 2026-02-19
Grok Build 0.1 xai/grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2026-05-20
Updated: 2026-04-16
Kimi K2 Instruct moonshotai/kimi-k2 131.1K 131.1K Input: $0.57
Output: $2.3
Model: 0.285
Completion: 4.035
🔧 🌡️ - In: text
Out: text
Released: 2025-09-05
Kimi K2.7 Code moonshotai/kimi-k2.7-code 256K 32.8K Input: $0.95
Output: $4
Cache Read: $0.19
Model: 0.475
Completion: 4.211
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2026-06-12
Kimi K2 Thinking moonshotai/kimi-k2-thinking 262.1K 262.1K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Released: 2025-11-06
Kimi K2.5 moonshotai/kimi-k2.5 262.1K 262.1K Input: $0.6
Output: $3
Cache Read: $0.1
Model: 0.300
Completion: 5.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2025-01 In: text, image
Out: text
Open Weights
Released: 2026-01-26
Updated: 2026-01
Kimi K2.6 moonshotai/kimi-k2.6 262K 262K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Open Weights
Released: 2026-04-20
Updated: 2026-04-21
Kling v3.0 Motion Control klingai/kling-v3.0-motion-control - - - - 🌡️ - In: text
Out: video
Released: 2026-03-04
Kling v2.6 Image-to-Video klingai/kling-v2.6-i2v - - - - 🌡️ - In: text
Out: video
Released: 2025-12-21
Kling v2.5 Turbo Text-to-Video klingai/kling-v2.5-turbo-t2v - - - - 🌡️ - In: text
Out: video
Released: 2025-09-23
Kling v3.0 Image-to-Video klingai/kling-v3.0-i2v - - - - 🌡️ - In: text
Out: video
Released: 2026-02-05
Kling v2.5 Turbo Image-to-Video klingai/kling-v2.5-turbo-i2v - - - - 🌡️ - In: text
Out: video
Released: 2025-09-23
Kling v3.0 Text-to-Video klingai/kling-v3.0-t2v - - - - 🌡️ - In: text
Out: video
Released: 2026-02-05
Kling v2.6 Motion Control klingai/kling-v2.6-motion-control - - - - 🌡️ - In: text
Out: video
Released: 2025-12-21
Kling v2.6 Text-to-Video klingai/kling-v2.6-t2v - - - - 🌡️ - In: text
Out: video
Released: 2025-12-21
voyage-4-lite voyage/voyage-4-lite 32K - - - 🌡️ - In: text
Out: text
Released: 2026-03-06
voyage-law-2 voyage/voyage-law-2 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-03-01
Updated: 2024-03
voyage-4 voyage/voyage-4 32K - - - 🌡️ - In: text
Out: text
Released: 2026-03-06
voyage-code-3 voyage/voyage-code-3 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-09-01
Updated: 2024-09
voyage-4-large voyage/voyage-4-large 32K - - - 🌡️ - In: text
Out: text
Released: 2026-03-06
Voyage Rerank 2.5 voyage/rerank-2.5 32K 32K - - 🌡️ - In: text
Out: text
Released: 2025-08-11
Voyage Rerank 2.5 Lite voyage/rerank-2.5-lite 32K 32K - - 🌡️ - In: text
Out: text
Released: 2025-08-11
voyage-code-2 voyage/voyage-code-2 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-01-01
Updated: 2024-01
voyage-3.5-lite voyage/voyage-3.5-lite 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2025-05-20
voyage-3.5 voyage/voyage-3.5 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2025-05-20
voyage-3-large voyage/voyage-3-large 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-09-01
Updated: 2024-09
voyage-finance-2 voyage/voyage-finance-2 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-03-01
Updated: 2024-03
Mistral Nemo mistral/mistral-nemo 131.1K 131.1K Input: $0.02
Output: $0.04
Model: 0.010
Completion: 2.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-07-01
Codestral Embed mistral/codestral-embed 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2025-05-28
Mistral Embed mistral/mistral-embed 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2023-12-11
Devstral Small 1.1 mistral/devstral-small 128K 64K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-05-07
Mistral Large 3 mistral/mistral-large-3 256K 256K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
📎 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-12-02
Mistral Medium Latest mistral/mistral-medium-3.5 256K 256K Input: $1.5
Output: $7.5
Model: 0.750
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Mistral Medium 3.1 mistral/mistral-medium 128K 64K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-05-07
Devstral Small 2 mistral/devstral-small-2 256K 256K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-05-07
Ministral 14B mistral/ministral-14b 256K 256K Input: $0.2
Output: $0.2
Model: 0.100
Completion: 1.000
📎 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-12-01
Devstral 2 mistral/devstral-2 256K 256K Input: $0.4
Output: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-12-09
Ministral 8B (latest) mistral/ministral-8b 128K 128K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Mistral Small (latest) mistral/mistral-small 256K 256K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2026-03-16
Codestral (latest) mistral/codestral 256K 4.1K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Pixtral 12B mistral/pixtral-12b 128K 128K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Pixtral Large (latest) mistral/pixtral-large 128K 128K Input: $2
Output: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Ministral 3B (latest) mistral/ministral-3b 128K 128K Input: $0.04
Output: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Magistral Small mistral/magistral-small 128K 128K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Magistral Medium (latest) mistral/magistral-medium 128K 16.4K Input: $2
Output: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20
Gemini Embedding 2 google/gemini-embedding-2 - - - - 🌡️ - In: text
Out: text
Released: 2026-03-10
Updated: 2026-03-23
Text Multilingual Embedding 002 google/text-multilingual-embedding-002 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-03-01
Updated: 2024-03
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65K Input: $0.25
Output: $1.5
Cache Read: $0.03
Model: 0.125
Completion: 6.000
Cache: 0.120
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2026-05-07
Gemini 3.1 Flash Image (Nano Banana 2) google/gemini-3.1-flash-image 131.1K 32.8K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🌡️ - In: text, image
Out: text, image
Released: 2026-05-28
Gemini 3 Flash google/gemini-3-flash 1M 65K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03 In: text, image, pdf
Out: text
Released: 2025-12-17
Veo 3.1 google/veo-3.1-generate-001 - - - - 🌡️ - In: text
Out: video
Released: 2026-06-08
Gemini 3.5 Flash google/gemini-3.5-flash 1M 64K Input: $1.5
Output: $9
Cache Read: $0.15
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2026-05-19
Gemma 4 31B IT google/gemma-4-31b-it 262.1K 131.1K Input: $0.14
Output: $0.4
Model: 0.070
Completion: 2.857
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-02
Veo 3.0 google/veo-3.0-generate-001 - - - - 🌡️ - In: text
Out: video
Released: 2026-06-08
Veo 3.0 Fast Generate google/veo-3.0-fast-generate-001 - - - - 🌡️ - In: text
Out: video
Released: 2026-06-08
Text Embedding 005 google/text-embedding-005 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-08-01
Updated: 2024-08
Gemini Embedding 001 google/gemini-embedding-001 8.2K 1.5K - - 🌡️ 2025-05 In: text
Out: text
Released: 2025-05-20
Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image 32.8K 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Model: 0.150
Completion: 8.333
Cache: 0.100
🌡️ 2025-01 In: text
Out: text, image
Released: 2025-03-20
Updated: 2025-08-26
Imagen 4 Fast google/imagen-4.0-fast-generate-001 480 - - - 🌡️ - In: text
Out: image
Released: 2025-06-01
Updated: 2025-06
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite 1M 65.5K Input: $0.1
Output: $0.4
Cache Read: $0.01
Model: 0.050
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Gemini 3.1 Flash Image Preview (Nano Banana 2) google/gemini-3.1-flash-image-preview 131.1K 32.8K Input: $0.5
Output: $3
Cache Read: $0.05
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🌡️ 2025-01 In: text, image
Out: text, image
Released: 2026-02-26
Imagen 4 Ultra google/imagen-4.0-ultra-generate-001 480 - - - - - In: text
Out: image
Released: 2025-05-24
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 64K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-11-18
Updated: 2026-02-19
Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it 262.1K 131.1K Input: $0.15
Output: $0.6
Cache Read: $0.015
Model: 0.075
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-02
Gemini 3 Pro Preview google/gemini-3-pro-preview 1M 64K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-11-18
Veo 3.1 Fast Generate google/veo-3.1-fast-generate-001 - - - - 🌡️ - In: text
Out: video
Released: 2026-06-08
Nano Banana Pro (Gemini 3 Pro Image) google/gemini-3-pro-image 65.5K 32.8K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
🌡️ 2025-03 In: text
Out: text, image
Released: 2025-09-01
Updated: 2025-09
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1M 65K Input: $0.25
Output: $1.5
Cache Read: $0.03
Model: 0.125
Completion: 6.000
Cache: 0.120
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2026-03-03
Imagen 4 google/imagen-4.0-generate-001 480 - - - - - In: text
Out: image
Released: 2025-05-22
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K Input: $0.3
Output: $2.5
Cache Read: $0.03
Input Audio: $1
Model: 0.150
Completion: 8.333
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Flux Schnell prodia/flux-fast-schnell 512 - - - 🌡️ - In: text
Out: image
Released: 2026-06-08
gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b 131.1K 65.5K Input: $0.075
Output: $0.3
Cache Read: $0.037
Model: 0.037
Completion: 4.000
Cache: 0.493
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-01
GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct 8.2K 4.1K Input: $1.5
Output: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-09 In: text
Out: text
Released: 2023-09-28
Updated: 2023-03-01
GPT-5.2 Chat openai/gpt-5.2-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-12-11
Updated: 2025-08-07
text-embedding-3-large openai/text-embedding-3-large 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-01-25
GPT 5.2 openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT 4o Mini Search Preview openai/gpt-4o-mini-search-preview 128K 16.4K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🌡️ 2023-09 In: text
Out: text
Released: 2025-03-12
Updated: 2025-01
GPT-5 Chat openai/gpt-5-chat 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text, image
Released: 2025-08-07
GPT-3.5 Turbo openai/gpt-3.5-turbo 16.4K 4.1K Input: $0.5
Output: $1.5
Model: 0.250
Completion: 3.000
🌡️ 2021-09 In: text
Out: text
Released: 2023-05-28
Updated: 2023-11-06
GPT-5 pro openai/gpt-5-pro 400K 272K Input: $15
Output: $120
Model: 7.500
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text, image
Released: 2025-08-07
Updated: 2025-10-06
o3 Pro openai/o3-pro 200K 100K Input: $20
Output: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-04-16
Updated: 2025-06-10
GPT 5.4 Nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
GPT-5.3 Chat openai/gpt-5.3-chat 128K 16.4K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-03
Updated: 2026-03-06
GPT 5.1 Thinking openai/gpt-5.1-thinking 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text, image
Released: 2025-11-12
Updated: 2025-08-07
GPT-5.1-Codex openai/gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-11-12
Updated: 2025-11-13
GPT 5.1 Codex Max openai/gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-11-19
Updated: 2025-11-13
text-embedding-ada-002 openai/text-embedding-ada-002 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2022-12-15
GPT-5.2 openai/gpt-5.2 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT 5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-24
Updated: 2026-02-05
text-embedding-3-small openai/text-embedding-3-small 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-01-25
GPT-5.1 Codex mini openai/gpt-5.1-codex-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-11-12
Updated: 2025-11-13
GPT Image 1.5 openai/gpt-image-1.5 - - Input: $5
Output: $32
Cache Read: $1.25
Model: 2.500
Completion: 6.400
Cache: 0.250
🌡️ - In: text
Out: image
Released: 2025-12-16
GPT OSS 120B openai/gpt-oss-120b 131.1K 131K Input: $0.35
Output: $0.75
Cache Read: $0.25
Model: 0.175
Completion: 2.143
Cache: 0.714
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT 5.4 openai/gpt-5.4 1.1M 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT 5.4 Mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
o3-deep-research openai/o3-deep-research 200K 100K Input: $10
Output: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2024-06-26
GPT Image 1 openai/gpt-image-1 - - Input: $5
Output: $40
Cache Read: $1.25
Model: 2.500
Completion: 8.000
Cache: 0.250
🌡️ - In: text
Out: image
Released: 2025-03-25
GPT Image 1 Mini openai/gpt-image-1-mini - - Input: $2
Output: $8
Cache Read: $0.2
Model: 1.000
Completion: 4.000
Cache: 0.100
🌡️ - In: text
Out: image
Released: 2025-10-06
GPT 5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT 5.5 Pro openai/gpt-5.5-pro 1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-24
Updated: 2026-04-23
GPT OSS 20B openai/gpt-oss-20b 131.1K 8.2K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5.2-Codex openai/gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-12-18
Updated: 2025-12-11
GPT-5.1 Instant openai/gpt-5.1-instant 128K 16.4K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-11-12
Updated: 2025-08-07
GPT Image 2 openai/gpt-image-2 - - Input: $5
Output: $30
Cache Read: $1.25
Model: 2.500
Completion: 6.000
Cache: 0.250
🌡️ - In: text
Out: image
Released: 2026-04-21
GPT 5.5 openai/gpt-5.5 1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-24
Updated: 2026-04-23
GPT-5-Codex openai/gpt-5-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-4o mini openai/gpt-4o-mini 128K 16.4K Input: $0.15
Output: $0.6
Cache Read: $0.075
Model: 0.075
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-07-18
GPT-5 Nano openai/gpt-5-nano 400K 128K Input: $0.05
Output: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K Input: $10
Output: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K Input: $0.4
Output: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K Input: $0.25
Output: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, pdf
Out: text
Released: 2025-04-14
o1 openai/o1 200K 100K Input: $15
Output: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image, pdf
Out: text
Released: 2024-12-05
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K Input: $0.1
Output: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o3-mini openai/o3-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
o4-mini openai/o4-mini 200K 100K Input: $1.1
Output: $4.4
Cache Read: $0.275
Model: 0.550
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-4o openai/gpt-4o 128K 16.4K Input: $2.5
Output: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image, pdf
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-5 openai/gpt-5 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o3 openai/o3 200K 100K Input: $2
Output: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image, pdf
Out: text
Released: 2025-04-16
GLM 4.7 zai/glm-4.7 131K 40K Input: $2.25
Output: $2.75
Cache Read: $2.25
Model: 1.125
Completion: 1.222
Cache: 1.000
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-12-22
GLM 4.5V zai/glm-4.5v 66K 16K Input: $0.6
Output: $1.8
Cache Read: $0.11
Model: 0.300
Completion: 3.000
Cache: 0.183
📎 🧠 🔧 🌡️ 2025-08 In: text, image
Out: text
Open Weights
Released: 2025-08-11
GLM 4.5 zai/glm-4.5 128K 96K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.7 FlashX zai/glm-4.7-flashx 200K 128K Input: $0.06
Output: $0.4
Cache Read: $0.01
Model: 0.030
Completion: 6.667
Cache: 0.167
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-01
Updated: 2026-01-19
GLM 5.1 zai/glm-5.1 202.8K 64K Input: $1.4
Output: $4.4
Cache Read: $0.26
Model: 0.700
Completion: 3.143
Cache: 0.186
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-07
Updated: 2026-03-27
GLM 4.6 zai/glm-4.6 200K 96K Input: $0.6
Output: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.6V zai/glm-4.6v 128K 24K Input: $0.3
Output: $0.9
Cache Read: $0.05
Model: 0.150
Completion: 3.000
Cache: 0.167
📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-09-30
Updated: 2025-12-08
GLM 5V Turbo zai/glm-5v-turbo 200K 128K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-01
GLM-4.6V-Flash zai/glm-4.6v-flash 128K 24K - - 📎 🧠 🔧 🌡️ 2024-10 In: text, image, pdf
Out: text
Released: 2025-09-30
GLM 4.5 Air zai/glm-4.5-air 128K 96K Input: $0.2
Output: $1.1
Cache Read: $0.03
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.7 Flash zai/glm-4.7-flash 200K 131K Input: $0.07
Output: $0.4
Model: 0.035
Completion: 5.714
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2026-03-13
Updated: 2026-01-19
GLM-5 zai/glm-5 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-02-11
GLM 5 Turbo zai/glm-5-turbo 202.8K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-15
Updated: 2026-03-16
Seedream 5.0 Lite bytedance/seedream-5.0-lite - - - - 🌡️ - In: text
Out: image
Released: 2026-01-28
Seedance 2.0 Fast bytedance/seedance-2.0-fast - - - - 📎 🌡️ - In: text, image
Out: video
Released: 2026-04-14
Seedance v1.0 Pro bytedance/seedance-v1.0-pro - - - - 🌡️ - In: text
Out: video
Released: 2025-06-11
Seed 1.6 bytedance/seed-1.6 256K 32K Input: $0.25
Output: $2
Cache Read: $0.05
Model: 0.125
Completion: 8.000
Cache: 0.200
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-09-01
Updated: 2025-09
Seedance 2.0 bytedance/seedance-2.0 - - - - 📎 🌡️ - In: text, image
Out: video
Released: 2026-04-14
Seedance v1.5 Pro bytedance/seedance-v1.5-pro - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Seedance v1.0 Lite Text-to-Video bytedance/seedance-v1.0-lite-t2v - - - - 🌡️ - In: text
Out: video
Released: 2025-06-01
Seedance v1.0 Pro Fast bytedance/seedance-v1.0-pro-fast - - - - 🌡️ - In: text
Out: video
Released: 2025-10-31
Seedream 4.0 bytedance/seedream-4.0 - - - - 🌡️ - In: text
Out: image
Released: 2025-08-28
Seedream 4.5 bytedance/seedream-4.5 - - - - 🌡️ - In: text
Out: image
Released: 2025-11-28
Seed 1.8 bytedance/seed-1.8 256K 64K Input: $0.25
Output: $2
Cache Read: $0.05
Model: 0.125
Completion: 8.000
Cache: 0.200
🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-09-01
Updated: 2025-10
Seedance v1.0 Lite Image-to-Video bytedance/seedance-v1.0-lite-i2v - - - - 🌡️ - In: text
Out: video
Released: 2025-06-01
Morph v3 Large morph/morph-v3-large 32K 32K Input: $0.9
Output: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15
Morph v3 Fast morph/morph-v3-fast 16K 16K Input: $0.8
Output: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15
Nemotron 3 Ultra nvidia/nemotron-3-ultra-550b-a55b 1M 65K Input: $0.6
Output: $2.4
Cache Read: $0.12
Model: 0.300
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-06-04
Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b 262.1K 262.1K Input: $0.05
Output: $0.24
Model: 0.025
Completion: 4.800
🧠 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-01
Updated: 2025-12-15
Nvidia Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 131.1K 131.1K Input: $0.06
Output: $0.23
Model: 0.030
Completion: 3.833
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-08-18
NVIDIA Nemotron 3 Super 120B A12B nvidia/nemotron-3-super-120b-a12b 256K 32K Input: $0.15
Output: $0.65
Model: 0.075
Completion: 4.333
🌡️ - In: text
Out: text
Released: 2026-03-18
Updated: 2026-03-11
Nvidia Nemotron Nano 12B V2 VL nvidia/nemotron-nano-12b-v2-vl 131.1K 131.1K Input: $0.2
Output: $0.6
Model: 0.100
Completion: 3.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-12-01
Updated: 2025-10-28
MiMo V2.5 xiaomi/mimo-v2.5 1.1M 131.1K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
📎 🧠 🔧 🌡️ 2024-12 In: text, image, pdf
Out: text
Released: 2026-04-22
MiMo V2 Flash xiaomi/mimo-v2-flash 262.1K 32K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-12-17
Updated: 2026-02-04
MiMo V2 Pro xiaomi/mimo-v2-pro 1M 128K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo V2.5 Pro xiaomi/mimo-v2.5-pro 1.1M 131K Input: $0.435
Output: $0.87
Cache Read: $0.0036
Model: 0.217
Completion: 2.000
Cache: 0.008
📎 🧠 🔧 🌡️ 2024-12 In: text, image, pdf
Out: text
Released: 2026-04-22
Mercury Coder Small Beta inception/mercury-coder-small 32K 16.4K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-02-26
Mercury 2 inception/mercury-2 128K 128K Input: $0.25
Output: $0.75
Cache Read: $0.025
Model: 0.125
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-24
Updated: 2026-03-06
Claude Haiku 4.5 anthropic/claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image, pdf
Out: text
Released: 2025-10-15
Claude Opus 4.7 anthropic/claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude Opus 4.8 anthropic/claude-opus-4.8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Opus 4.5 anthropic/claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2024-11-24
Updated: 2025-11-24
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 128K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-02-17
Updated: 2026-03-13
Claude Opus 4.6 anthropic/claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: text, image, pdf
Out: text
Released: 2026-02-05
Updated: 2026-03-13
Claude Opus 4 anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Haiku 3 anthropic/claude-3-haiku 200K 4.1K Input: $0.25
Output: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image, pdf
Out: text
Released: 2024-03-13
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-08-05
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image, pdf
Out: text
Released: 2025-05-22
Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Haiku 3.5 anthropic/claude-3.5-haiku 200K 8.2K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image, pdf
Out: text
Released: 2024-10-22
Cohere Rerank 3.5 cohere/rerank-v3.5 4.1K 4.1K - - 🌡️ - In: text
Out: text
Released: 2024-12-02
Command A cohere/command-a 256K 8K Input: $2.5
Output: $10
Model: 1.250
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-03-13
Cohere Rerank 4 Fast cohere/rerank-v4-fast 32K 32K - - 🌡️ - In: text
Out: text
Released: 2025-12-11
Embed v4.0 cohere/embed-v4.0 128K 1.5K - - 🌡️ - In: text
Out: text
Released: 2025-04-15
Cohere Rerank 4 Pro cohere/rerank-v4-pro 32K 32K - - 🌡️ - In: text
Out: text
Released: 2025-12-11
Step 3.7 Flash stepfun/step-3.7-flash 256K 256K Input: $0.2
Output: $1.15
Cache Read: $0.04
Model: 0.100
Completion: 5.750
Cache: 0.200
📎 🧠 🔧 🌡️ 2026-01-01 In: text, image
Out: text
Released: 2026-05-28
Updated: 2026-05-29
StepFun 3.5 Flash stepfun/step-3.5-flash 262.1K 262.1K Input: $0.09
Output: $0.3
Cache Write: $0.02
Model: 0.045
Completion: 3.333
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2026-01-29
Updated: 2026-02-13
Interfaze Beta interfaze/interfaze-beta 1M 32K Input: $1.5
Output: $3.5
Model: 0.750
Completion: 2.333
🧠 🌡️ - In: text
Out: text
Released: 2025-10-07
Updated: 2026-04-29
FLUX.1 Kontext Max bfl/flux-kontext-max 512 - - - 🌡️ - In: text
Out: image
Released: 2025-06-01
Updated: 2025-06
FLUX.2 [flex] bfl/flux-2-flex - - - - 🌡️ - In: text
Out: image
Released: 2026-06-08
FLUX1.1 [pro] Ultra bfl/flux-pro-1.1-ultra 512 - - - 🌡️ - In: text
Out: image
Released: 2024-11-01
Updated: 2024-11
FLUX.2 [max] bfl/flux-2-max 67.3K 67.3K - - 🌡️ - In: text
Out: image
Released: 2026-06-08
FLUX1.1 [pro] bfl/flux-pro-1.1 512 - - - 🌡️ - In: text
Out: image
Released: 2024-10-01
Updated: 2024-10
FLUX.1 Fill [pro] bfl/flux-pro-1.0-fill 512 - - - 🌡️ - In: text
Out: image
Released: 2024-10-01
Updated: 2024-10
FLUX.2 [klein] 4B bfl/flux-2-klein-4b - - - - 🌡️ - In: text
Out: image
Released: 2026-06-08
FLUX.2 [klein] 9B bfl/flux-2-klein-9b - - - - 🌡️ - In: text
Out: image
Released: 2026-06-08
FLUX.1 Kontext Pro bfl/flux-kontext-pro 512 - - - 🌡️ - In: text
Out: image
Released: 2025-06-01
Updated: 2025-06
FLUX.2 [pro] bfl/flux-2-pro 67.3K 67.3K - - 🌡️ - In: text
Out: image
Released: 2026-06-08
Recraft V4.1 Pro recraft/recraft-v4.1-pro - - - - 🌡️ - In: text
Out: image
Released: 2026-05-14
Recraft V4.1 recraft/recraft-v4.1 - - - - 🌡️ - In: text
Out: image
Released: 2026-05-14
Recraft V4 recraft/recraft-v4 - - - - 🌡️ - In: text
Out: image
Released: 2026-02-17
Recraft V4 Pro recraft/recraft-v4-pro - - - - 🌡️ - In: text
Out: image
Released: 2026-02-17
Recraft V2 recraft/recraft-v2 512 - - - 🌡️ - In: text
Out: image
Released: 2024-03-01
Updated: 2024-03
Recraft V3 recraft/recraft-v3 512 - - - 🌡️ - In: text
Out: image
Released: 2024-10-01
Updated: 2024-10
Recraft V4.1 Utility Pro recraft/recraft-v4.1-utility-pro - - - - 🌡️ - In: text
Out: image
Released: 2026-05-14
Recraft V4.1 Utility recraft/recraft-v4.1-utility - - - - 🌡️ - In: text
Out: image
Released: 2026-05-14
Trinity Large Preview arcee-ai/trinity-large-preview 131K 131K Input: $0.25
Output: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-01-01
Updated: 2025-01
Trinity Large Thinking arcee-ai/trinity-large-thinking 262.1K 80K Input: $0.25
Output: $0.9
Model: 0.125
Completion: 3.600
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-01
Updated: 2026-04-03
Trinity Mini arcee-ai/trinity-mini 131.1K 131.1K Input: $0.045
Output: $0.15
Model: 0.022
Completion: 3.333
🌡️ 2024-10 In: text
Out: text
Released: 2025-12-01
Updated: 2025-12
Sonar Reasoning Pro perplexity/sonar-reasoning-pro 127K 8K - - 🧠 🌡️ 2025-09 In: text
Out: text
Released: 2025-02-19
Sonar perplexity/sonar 127K 8K - - 📎 🔧 🌡️ 2025-02 In: text, image
Out: text
Released: 2025-02-19
Sonar Pro perplexity/sonar-pro 200K 8K - - 📎 🔧 🌡️ 2025-09 In: text, image
Out: text
Released: 2025-02-19
Titan Text Embeddings V2 amazon/titan-embed-text-v2 8.2K 1.5K - - 🌡️ - In: text
Out: text
Released: 2024-04-01
Updated: 2024-04
Nova 2 Lite amazon/nova-2-lite 1M 1M Input: $0.3
Output: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-12-01
Nova Lite amazon/nova-lite 300K 8.2K Input: $0.06
Output: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Nova Micro amazon/nova-micro 128K 8.2K Input: $0.035
Output: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Nova Pro amazon/nova-pro 300K 8.2K Input: $0.8
Output: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Qwen3 VL Thinking alibaba/qwen3-vl-thinking 131.1K 32.8K Input: $0.4
Output: $4
Model: 0.200
Completion: 10.000
📎 🧠 🔧 🌡️ 2025-09 In: text, image, pdf
Out: text
Open Weights
Released: 2025-09-24
Qwen3 Coder Plus alibaba/qwen3-coder-plus 1M 65.5K Input: $1
Output: $5
Cache Read: $0.2
Model: 0.500
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Wan v2.6 Reference-to-Video alibaba/wan-v2.6-r2v - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Qwen3 Embedding 0.6B alibaba/qwen3-embedding-0.6b 32.8K 32.8K - - 🌡️ - In: text
Out: text
Released: 2025-11-14
Qwen3 Max Preview alibaba/qwen3-max-preview 262.1K 32.8K Input: $1.2
Output: $6
Cache Read: $0.24
Model: 0.600
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3 Embedding 8B alibaba/qwen3-embedding-8b 32.8K 32.8K - - 🌡️ - In: text
Out: text
Released: 2025-06-05
Qwen3 Next 80B A3B Instruct alibaba/qwen3-next-80b-a3b-instruct 131.1K 32.8K Input: $0.15
Output: $1.2
Model: 0.075
Completion: 8.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-12
Updated: 2025-09
Qwen 3.7 Plus alibaba/qwen3.7-plus 1M 64K Input: $0.4
Output: $1.6
Cache Read: $0.08
Cache Write: $0.5
Model: 0.200
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-04 In: text, image, pdf
Out: text
Released: 2026-06-01
Updated: 2026-06-02
Qwen3 VL Instruct alibaba/qwen3-vl-instruct 131.1K 129K Input: $0.4
Output: $1.6
Model: 0.200
Completion: 4.000
📎 🔧 🌡️ 2025-04 In: text, image, pdf
Out: text
Open Weights
Released: 2025-09-24
Qwen 3.7 Max alibaba/qwen3.7-max 991K 64K Input: $1.25
Output: $3.75
Cache Read: $0.25
Cache Write: $1.5625
Model: 0.625
Completion: 3.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, pdf
Out: text
Released: 2026-05-21
Wan v2.5 Text-to-Video Preview alibaba/wan-v2.5-t2v-preview - - - - 🌡️ - In: text
Out: video
Released: 2025-09-24
Qwen3 Max alibaba/qwen3-max 262.1K 32.8K Input: $1.2
Output: $6
Cache Read: $0.24
Model: 0.600
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3 Next 80B A3B Thinking alibaba/qwen3-next-80b-a3b-thinking 131.1K 32.8K Input: $0.15
Output: $1.2
Model: 0.075
Completion: 8.000
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-09-12
Updated: 2025-09
Qwen3 Embedding 4B alibaba/qwen3-embedding-4b 32.8K 32.8K - - 🌡️ - In: text
Out: text
Released: 2025-06-05
Qwen 3.5 Flash alibaba/qwen3.5-flash 1M 64K Input: $0.1
Output: $0.4
Cache Read: $0.001
Cache Write: $0.125
Model: 0.050
Completion: 4.000
Cache: 0.010
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-02-24
Qwen3 Coder 480B A35B Instruct alibaba/qwen3-coder 262.1K 65.5K Input: $1.5
Output: $7.5
Cache Read: $0.3
Model: 0.750
Completion: 5.000
Cache: 0.200
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen3 235B A22B Instruct 2507 alibaba/qwen-3-235b 262.1K 16.4K Input: $0.22
Output: $0.88
Model: 0.110
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen 3.5 Plus alibaba/qwen3.5-plus 1M 64K Input: $0.4
Output: $2.4
Cache Read: $0.04
Cache Write: $0.5
Model: 0.200
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-04 In: text, image, pdf
Out: text
Released: 2026-02-16
Wan v2.6 Text-to-Video alibaba/wan-v2.6-t2v - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Qwen 3 Max Thinking alibaba/qwen3-max-thinking 256K 65.5K Input: $1.2
Output: $6
Cache Read: $0.24
Model: 0.600
Completion: 5.000
Cache: 0.200
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01
Wan v2.6 Image-to-Video Flash alibaba/wan-v2.6-i2v-flash - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Wan v2.6 Reference-to-Video Flash alibaba/wan-v2.6-r2v-flash - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Qwen3 Coder Next alibaba/qwen3-coder-next 256K 256K Input: $0.5
Output: $1.2
Model: 0.250
Completion: 2.400
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-07-22
Updated: 2026-02-19
Qwen 3.6 27B alibaba/qwen3.6-27b 256K 256K Input: $0.6
Output: $3.6
Model: 0.300
Completion: 6.000
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-22
Wan v2.6 Image-to-Video alibaba/wan-v2.6-i2v - - - - 🌡️ - In: text
Out: video
Released: 2025-12-16
Qwen3-30B-A3B alibaba/qwen-3-30b 41K 16.4K Input: $0.12
Output: $0.5
Model: 0.060
Completion: 4.167
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen3 235B A22B Thinking 2507 alibaba/qwen3-235b-a22b-thinking 131.1K 32.8K Input: $0.4
Output: $4
Model: 0.200
Completion: 10.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, pdf
Out: text
Released: 2025-09-24
Updated: 2025-04
Qwen3 VL 235B A22B Instruct alibaba/qwen3-vl-235b-a22b-instruct 131.1K 129K Input: $0.4
Output: $1.6
Model: 0.200
Completion: 4.000
📎 🌡️ - In: text, image, pdf
Out: text
Released: 2025-09-24
Updated: 2026-05-01
Qwen3-14B alibaba/qwen-3-14b 41K 16.4K Input: $0.12
Output: $0.24
Model: 0.060
Completion: 2.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen 3 32B alibaba/qwen-3-32b 128K 8.2K Input: $0.16
Output: $0.64
Model: 0.080
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen 3.6 Max Preview alibaba/qwen-3.6-max-preview 240K 64K Input: $1.3
Output: $7.8
Cache Read: $0.26
Cache Write: $1.625
Model: 0.650
Completion: 6.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, pdf
Out: text
Open Weights
Released: 2026-04-20
Updated: 2026-04-24
Qwen 3 Coder 30B A3B Instruct alibaba/qwen3-coder-30b-a3b 262.1K 8.2K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-04-01
Updated: 2025-04
Qwen 3.6 Plus alibaba/qwen3.6-plus 1M 64K Input: $0.5
Output: $3
Cache Read: $0.1
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-04 In: text, image, pdf
Out: text
Released: 2026-04-02
LongCat Flash Thinking 2601 meituan/longcat-flash-thinking-2601 32.8K 32.8K - - 🧠 🌡️ - In: text
Out: text
Released: 2026-03-13
LongCat Flash Chat meituan/longcat-flash-chat 128K 100K - - 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-08-30
Llama 3.2 1B Instruct meta/llama-3.2-1b 128K 8.2K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🌡️ 2023-12 In: text
Out: text
Released: 2024-09-18
Llama 3.2 11B Vision Instruct meta/llama-3.2-11b 128K 8.2K Input: $0.16
Output: $0.16
Model: 0.080
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2024-09-25
Llama 3.1 8B Instruct meta/llama-3.1-8b 128K 8.2K Input: $0.22
Output: $0.22
Model: 0.110
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Released: 2024-07-23
Llama 3.2 90B Vision Instruct meta/llama-3.2-90b 128K 8.2K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2024-09-25
Llama 3.1 70B Instruct meta/llama-3.1-70b 128K 8.2K Input: $0.72
Output: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Released: 2024-07-23
Llama 3.2 3B Instruct meta/llama-3.2-3b 128K 8.2K Input: $0.15
Output: $0.15
Model: 0.075
Completion: 1.000
🌡️ 2023-12 In: text
Out: text
Released: 2024-09-18
Llama-4-Scout-17B-16E-Instruct-FP8 meta/llama-4-scout 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct meta/llama-3.3-70b 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Maverick-17B-128E-Instruct-FP8 meta/llama-4-maverick 128K 4.1K Input: $0
Output: $0
- 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text, image, pdf
Out: text
Open Weights
Released: 2026-04-23
Updated: 2026-04-24
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 131.1K 65.5K Input: $0.27
Output: $1
Cache Read: $0.135
Model: 0.135
Completion: 3.704
Cache: 0.500
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-22
DeepSeek V3 0324 deepseek/deepseek-v3 163.8K 163.8K Input: $0.27
Output: $1.12
Cache Read: $0.135
Model: 0.135
Completion: 4.148
Cache: 0.500
🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-12-26
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.0036
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text, pdf
Out: text
Open Weights
Released: 2026-04-23
Updated: 2026-04-24
DeepSeek V3.2 Thinking deepseek/deepseek-v3.2-thinking 128K 8K Input: $0.62
Output: $1.85
Model: 0.310
Completion: 2.984
🧠 🔧 🌡️ 2024-07 In: text, image, pdf
Out: text
Released: 2025-12-01
DeepSeek-V3.1 deepseek/deepseek-v3.1 163.8K 8.2K Input: $0.56
Output: $1.68
Cache Read: $0.28
Model: 0.280
Completion: 3.000
Cache: 0.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-08-21
DeepSeek V3.2 deepseek/deepseek-v3.2 128K 8K Input: $0.28
Output: $0.42
Cache Read: $0.028
Model: 0.140
Completion: 1.500
Cache: 0.100
🌡️ 2024-07 In: text, image, pdf
Out: text
Released: 2025-12-01
DeepSeek-R1 deepseek/deepseek-r1 128K 32.8K Input: $1.35
Output: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29
MiniMax M2.7 High Speed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.6
Output: $2.4
Cache Read: $0.06
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.100
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax M2.5 minimax/minimax-m2.5 204.8K 131K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
MiniMax M2.1 minimax/minimax-m2.1 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-10-27
Updated: 2025-12-23
MiniMax M3 minimax/minimax-m3 1M 1M Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Open Weights
Released: 2026-05-31
Updated: 2026-06-01
MiniMax M2 minimax/minimax-m2 205K 205K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-10-27
MiniMax M2.7 minimax/minimax-m2.7 204.8K 131K Input: $0.3
Output: $1.2
Cache Read: $0.06
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18
MiniMax M2.5 High Speed minimax/minimax-m2.5-highspeed 204.8K 131K Input: $0.6
Output: $2.4
Cache Read: $0.03
Cache Write: $0.375
Model: 0.300
Completion: 4.000
Cache: 0.050
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-02-12
Updated: 2026-02-13
MiniMax M2.1 Lightning minimax/minimax-m2.1-lightning 204.8K 131.1K Input: $0.3
Output: $2.4
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 8.000
Cache: 0.100
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2025-10-27
KAT-Coder-Pro V1 kwaipilot/kat-coder-pro-v1 256K 32K Input: $0.03
Output: $1.2
Cache Read: $0.06
Model: 0.015
Completion: 40.000
Cache: 2.000
🧠 🌡️ 2024-10 In: text
Out: text
Released: 2025-10-24
Kat Coder Pro V2 kwaipilot/kat-coder-pro-v2 256K 256K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
Updated: 2026-03-30
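
Note on the NewAPI Ratios column: judging from the rows listed on this page, the three ratios appear to be derived from the per-1M prices. Model looks like Input divided by 2, Completion like Output divided by Input, and Cache like Cache Read divided by Input. Models with an Input price of $0 show "-" instead. The sketch below reproduces that arithmetic. The function name `newapi_ratios` and the rounding to three decimals are assumptions made for illustration, not the generator's actual code, and the last displayed digit may differ for values that fall exactly on a rounding boundary.

```python
from typing import Dict, Optional


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> Dict[str, Optional[float]]:
    """Estimate the ratio columns from per-1M-token USD prices.

    Assumed relationships (inferred from the table rows, not documented here):
      model      = input / 2
      completion = output / input
      cache      = cache_read / input
    """
    # Free models have no meaningful ratios; the page shows "-" for them.
    if input_usd <= 0:
        return {"model": None, "completion": None, "cache": None}

    cache = None
    if cache_read_usd is not None:
        cache = round(cache_read_usd / input_usd, 3)

    return {
        "model": round(input_usd / 2, 3),
        "completion": round(output_usd / input_usd, 3),
        "cache": cache,
    }


# Row "DeepSeek-V3.1": Input $0.56, Output $1.68, Cache Read $0.28
print(newapi_ratios(0.56, 1.68, 0.28))
# Row "DeepSeek V3.2 Thinking": Input $0.62, Output $1.85, no cache price
print(newapi_ratios(0.62, 1.85))
```

For the two rows used above, the results match the listed values (Model 0.280, Completion 3.000, Cache 0.500, and Model 0.310, Completion 2.984 respectively).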

Vivgrid

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
GPT-5.4 Nano gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Cache Read: $0.02
Model: 0.100
Completion: 6.250
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
GPT-5.1 Codex gpt-5.1-codex 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.1 Codex Max gpt-5.1-codex-max 400K 128K Input: $1.25
Output: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-11-13
GPT-5.3 Codex gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-02-24
GPT-5.4 gpt-5.4 400K 128K Input: $2.5
Output: $15
Cache Read: $0.25
Model: 1.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-05
GPT-5.4 Mini gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Cache Read: $0.075
Model: 0.375
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2026-03-17
Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M 65.5K Input: $2
Output: $12
Cache Read: $0.2
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-02-19
GPT-5 Mini gpt-5-mini 272K 128K Input: $0.25
Output: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
DeepSeek-V3.2 deepseek-v3.2 128K 128K Input: $0.28
Output: $0.42
Model: 0.140
Completion: 1.500
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Cache Write: $1
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-03-03
GPT-5.2 Codex gpt-5.2-codex 400K 128K Input: $1.75
Output: $14
Cache Read: $0.175
Model: 0.875
Completion: 8.000
Cache: 0.100
🧠 🔧 2025-08-31 In: text, image
Out: text
Released: 2026-01-14
GPT-5.5 gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23

Vultr

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2.6 moonshotai/Kimi-K2.6 262.1K 131.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2026-04-21
Llama 3.1 Nemotron Safety Guard nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3 8.2K 4.1K Input: $0.01
Output: $0.01
Model: 0.005
Completion: 1.000
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-10-28
NVIDIA Nemotron 3 Nano Omni nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 262.1K 131.1K Input: $0.13
Output: $0.38
Model: 0.065
Completion: 2.923
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-28
DeepSeek V3.2 nvidia/DeepSeek-V3.2-NVFP4 131.1K 131.1K Input: $0.55
Output: $1.65
Model: 0.275
Completion: 3.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
NVIDIA Nemotron Cascade 2 nvidia/Nemotron-Cascade-2-30B-A3B 262.1K 131.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-12-01
GLM-5.1 zai-org/GLM-5.1-FP8 200K 131.1K Input: $0.85
Output: $3.1
Model: 0.425
Completion: 3.647
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 204.8K 131.1K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-18

Wafer

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V4 Flash deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.01
Cache Write: $0
Model: 0.070
Completion: 2.000
Cache: 0.071
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-05-30
Qwen3.6-35B-A3B Qwen3.6-35B-A3B 256K 65.5K Input: $0.15
Output: $1
Cache Read: $0.02
Cache Write: $0
Model: 0.075
Completion: 6.667
Cache: 0.133
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-05-11
Updated: 2026-05-30
Qwen3.7-Max qwen3.7-max 256K 65.5K Input: $5
Output: $15
Cache Read: $0.5
Cache Write: $0
Model: 2.500
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Updated: 2026-05-30
DeepSeek V4 Pro deepseek-v4-pro 1M 384K Input: $1.74
Output: $3.48
Cache Read: $0.02
Cache Write: $0
Model: 0.870
Completion: 2.000
Cache: 0.011
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
Updated: 2026-05-30
Qwen3.5-397B-A17B Qwen3.5-397B-A17B 262.1K 65.5K Input: $0.43
Output: $2.6
Cache Read: $0.04
Cache Write: $0
Model: 0.215
Completion: 6.047
Cache: 0.093
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2026-02-16
Updated: 2026-06-01
Kimi-K2.6 Kimi-K2.6 262.1K 65.5K Input: $0.68
Output: $3.15
Cache Read: $0.07
Cache Write: $0
Model: 0.340
Completion: 4.632
Cache: 0.103
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-05-13
Updated: 2026-06-01
GLM-5.1 GLM-5.1 202.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.1
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.100
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-04-07
Updated: 2026-06-01

Weights & Biases

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 4 Scout 17B 16E Instruct meta-llama/Llama-4-Scout-17B-16E-Instruct 64K 64K Input: $0.17
Output: $0.66
Model: 0.085
Completion: 3.882
🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Updated: 2026-03-12
Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct 128K 128K Input: $0.71
Output: $0.71
Model: 0.355
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Updated: 2026-03-12
Llama 3.1 70B meta-llama/Llama-3.1-70B-Instruct 128K 128K Input: $0.8
Output: $0.8
Model: 0.400
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Updated: 2026-03-12
Meta-Llama-3.1-8B-Instruct meta-llama/Llama-3.1-8B-Instruct 128K 128K Input: $0.22
Output: $0.22
Model: 0.110
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Updated: 2026-03-12
Kimi K2.5 moonshotai/Kimi-K2.5 262.1K 262.1K Input: $0.5
Output: $2.85
Model: 0.250
Completion: 5.700
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2026-01-27
Updated: 2026-03-12
Phi-4-mini-instruct microsoft/Phi-4-mini-instruct 128K 128K Input: $0.08
Output: $0.35
Model: 0.040
Completion: 4.375
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Updated: 2026-03-12
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 262.1K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Updated: 2026-03-12
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 262.1K Input: $1
Output: $1.5
Model: 0.500
Completion: 1.500
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Updated: 2026-03-12
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 262.1K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-29
Updated: 2026-03-12
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 262.1K Input: $0.1
Output: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2026-03-12
gpt-oss-120b openai/gpt-oss-120b 131.1K 131.1K Input: $0.15
Output: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Updated: 2026-03-12
gpt-oss-20b openai/gpt-oss-20b 131.1K 131.1K Input: $0.05
Output: $0.2
Model: 0.025
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-05
Updated: 2026-03-12
NVIDIA Nemotron 3 Super 120B nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 262.1K 262.1K Input: $0.2
Output: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-11
Updated: 2026-03-12
OpenPipe Qwen3 14B Instruct OpenPipe/Qwen3-14B-Instruct 32.8K 32.8K Input: $0.05
Output: $0.22
Model: 0.025
Completion: 4.400
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-29
Updated: 2026-03-12
GLM 5 zai-org/GLM-5-FP8 200K 200K Input: $1
Output: $3.2
Model: 0.500
Completion: 3.200
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
Updated: 2026-03-12
GLM-5.1 zai-org/GLM-5.1 200K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Cache Write: $0
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-03-27
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 161K 161K Input: $0.55
Output: $1.65
Model: 0.275
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-21
Updated: 2026-03-12
MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 196.6K 196.6K Input: $0.3
Output: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-12
Updated: 2026-03-12

xAI

📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4.20 Multi-Agent grok-4.20-multi-agent-0309 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Grok 4.20 (Non-Reasoning) grok-4.20-0309-non-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Grok 4.3 grok-4.3 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Grok Imagine Image Quality grok-imagine-image-quality 8K - - - 📎 - In: text, image, pdf
Out: image, pdf
Released: 2026-04-03
Grok Imagine Video grok-imagine-video 1K - - - 📎 - In: text, image, video, pdf
Out: video
Released: 2026-01-28
Grok 4.20 (Reasoning) grok-4.20-0309-reasoning 1M 30K Input: $1.25
Output: $2.5
Cache Read: $0.2
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-03-09
Grok Imagine Image grok-imagine-image 8K - - - 📎 - In: text, image, pdf
Out: image, pdf
Released: 2026-01-28
Grok Build 0.1 grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-16

Xiaomi

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiMo-V2.5-Pro-UltraSpeed mimo-v2.5-pro-ultraspeed 1M 131.1K Input: $1.305
Output: $2.61
Cache Read: $0.0108
Model: 0.652
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-06-08
Updated: 2026-06-09
MiMo-V2.5 mimo-v2.5 1M 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Omni mimo-v2-omni 262.1K 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
MiMo-V2-Flash mimo-v2-flash 262.1K 65.5K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
MiMo-V2-Pro mimo-v2-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22

Xiaomi Token Plan (Europe)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiMo-V2.5-TTS mimo-v2.5-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Pro mimo-v2-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo-V2-TTS mimo-v2-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-03-18
MiMo-V2-Omni mimo-v2-omni 262.1K 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
MiMo-V2.5 mimo-v2.5 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS-VoiceDesign mimo-v2.5-tts-voicedesign 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS-VoiceClone mimo-v2.5-tts-voiceclone 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22

Xiaomi Token Plan (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiMo-V2.5-TTS-VoiceClone mimo-v2.5-tts-voiceclone 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS-VoiceDesign mimo-v2.5-tts-voicedesign 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5 mimo-v2.5 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Omni mimo-v2-omni 262.1K 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
MiMo-V2-TTS mimo-v2-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-03-18
MiMo-V2-Pro mimo-v2-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS mimo-v2.5-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22

Xiaomi Token Plan (Singapore)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
MiMo-V2.5-TTS mimo-v2.5-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5-Pro mimo-v2.5-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2-Pro mimo-v2-pro 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo-V2-TTS mimo-v2-tts 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-03-18
MiMo-V2-Omni mimo-v2-omni 262.1K 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
MiMo-V2.5 mimo-v2.5 1M 131.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS-VoiceDesign mimo-v2.5-tts-voicedesign 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22
MiMo-V2.5-TTS-VoiceClone mimo-v2.5-tts-voiceclone 8.2K 8.2K Input: $0
Output: $0
- - - In: text
Out: audio
Open Weights
Released: 2026-04-22

Xpersona

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT-5.5 xpersona-gpt-5.5 1M 128K Input: $3
Output: $18
Cache Read: $0.3
Reasoning: $18
Model: 1.500
Completion: 6.000
Cache: 0.100
🧠 🔧 2025-12-30 In: text, image
Out: text
Released: 2026-05-29
Xpersona Frieren 1 xpersona-frieren-coder 1M 384K Input: $1.5
Output: $6
Cache Read: $0.15
Reasoning: $6
Model: 0.750
Completion: 4.000
Cache: 0.100
🧠 🔧 2025-12-30 In: text, image
Out: text
Released: 2026-05-01
Updated: 2026-05-25

Z.AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-4.5V glm-4.5v 64K 16.4K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5 glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.7-FlashX glm-4.7-flashx 200K 131.1K Input: $0.07
Output: $0.4
Cache Read: $0.01
Cache Write: $0
Model: 0.035
Completion: 5.714
Cache: 0.143
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-5.1 glm-5.1 200K 131.1K Input: $1.4
Output: $4.4
Cache Read: $0.26
Cache Write: $0
Model: 0.700
Completion: 3.143
Cache: 0.186
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
GLM-4.6 glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.6V glm-4.6v 128K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
GLM-5V-Turbo glm-5v-turbo 200K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Cache Write: $0
Model: 0.600
Completion: 3.333
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2026-04-01
GLM-4.5-Air glm-4.5-air 131.1K 98.3K Input: $0.2
Output: $1.1
Cache Read: $0.03
Cache Write: $0
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.7-Flash glm-4.7-flash 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5 glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-5-Turbo glm-5-turbo 200K 131.1K Input: $1.2
Output: $4
Cache Read: $0.24
Cache Write: $0
Model: 0.600
Completion: 3.333
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16

Z.AI Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
GLM-5.1 glm-5.1 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
GLM-5.2 glm-5.2 1M 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-06-13
GLM-5V-Turbo glm-5v-turbo 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2026-04-01
GLM-4.5-Air glm-4.5-air 131.1K 98.3K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-5-Turbo glm-5-turbo 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16

Zeldoc

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Z-Code z-code 262.1K 262.1K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2025-01 In: text, image, video
Out: text
Released: 2026-04-15

ZenMux

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Ling-1T inclusionai/ling-1t 128K 64K Input: $0.56
Output: $2.24
Cache Read: $0.11
Model: 0.280
Completion: 4.000
Cache: 0.196
🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-10-09
inclusionAI: Ring-2.6-1T inclusionai/ring-2.6-1t 262K 65K Input: $0.3
Output: $2.5
Cache Read: $0.06
Model: 0.150
Completion: 8.333
Cache: 0.200
📎 🧠 🔧 🌡️ 2025-12-31 In: text
Out: text
Open Weights
Released: 2026-05-07
Updated: 2026-05-14
Ring-1T inclusionai/ring-1t 128K 64K Input: $0.56
Output: $2.24
Cache Read: $0.11
Model: 0.280
Completion: 4.000
Cache: 0.196
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-10-12
Kimi K2.7 Code (Free) moonshotai/kimi-k2.7-code-free 262.1K 262.1K Input: $0
Output: $0
Cache Read: $0
- 📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
Kimi K2 Thinking Turbo moonshotai/kimi-k2-thinking-turbo 262K 64K Input: $1.15
Output: $8
Cache Read: $0.15
Model: 0.575
Completion: 6.957
Cache: 0.130
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-11-06
Kimi K2.7 Code moonshotai/kimi-k2.7-code 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 2025-01 In: text, image, video
Out: text
Open Weights
Released: 2026-06-12
Kimi K2 Thinking moonshotai/kimi-k2-thinking 262K 64K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-11-06
Kimi K2.5 moonshotai/kimi-k2.5 262K 64K Input: $0.58
Output: $3.02
Cache Read: $0.1
Model: 0.290
Completion: 5.207
Cache: 0.172
📎 🧠 🔧 2025-01-01 In: text, image, video
Out: text
Released: 2026-01-27
Kimi K2.6 moonshotai/kimi-k2.6 262.1K 262.1K Input: $0.95
Output: $4
Cache Read: $0.16
Model: 0.475
Completion: 4.211
Cache: 0.168
📎 🧠 🔧 2025-01-01 In: text, image, video
Out: text
Open Weights
Released: 2026-04-20
Kimi K2 0905 moonshotai/kimi-k2-0905 262K 64K Input: $0.6
Output: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-09-04
ERNIE 5.0 baidu/ernie-5.0-thinking-preview 128K 64K Input: $0.84
Output: $3.37
Model: 0.420
Completion: 4.012
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video
Out: text
Released: 2026-01-22
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite 1M 65.5K Input: $0.25
Output: $1.5
Cache Read: $0.025
Model: 0.125
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-07
Gemini 2.5 Pro google/gemini-2.5-pro 1M 64K Input: $1.25
Output: $10
Cache Read: $0.31
Cache Write: $4.5
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01-01 In: pdf, image, text, audio, video
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 64K Input: $0.3
Output: $2.5
Cache Read: $0.07
Cache Write: $1
Model: 0.150
Completion: 8.333
Cache: 0.233
📎 🧠 🔧 🌡️ 2025-01-01 In: pdf, image, text, audio
Out: text
Released: 2025-06-17
Gemini 3.5 Flash google/gemini-3.5-flash 1M 65.5K Input: $1.5
Output: $9
Cache Read: $0.15
Model: 0.750
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01 In: text, image, video, audio, pdf
Out: text
Released: 2026-05-19
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite 1M 64K Input: $0.1
Output: $0.4
Cache Read: $0.03
Cache Write: $1
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2025-01-01 In: pdf, image, text, audio
Out: text
Released: 2025-07-22
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview 1M 64K Input: $2
Output: $12
Cache Read: $0.2
Cache Write: $4.5
Model: 1.000
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2026-02-19 In: text, image, pdf, audio, video
Out: text
Released: 2026-02-19
Gemini 3 Flash Preview google/gemini-3-flash-preview 1M 64K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $1
Model: 0.250
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, pdf, audio
Out: text
Released: 2025-12-17
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview 1.1M 65.5K Input: $0.25
Output: $1.5
Model: 0.125
Completion: 6.000
📎 🔧 🌡️ - In: text, image, audio, video
Out: text
Released: 2025-03-20
Grok Code Fast 1 x-ai/grok-code-fast-1 256K 64K Input: $0.2
Output: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-08-26
Grok 4.3 x-ai/grok-4.3 1M 1M Input: $1.25
Output: $2.5
Cache Read: $0.2
Cache Write: $0
Model: 0.625
Completion: 2.000
Cache: 0.160
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-17
Grok 4.1 Fast Non Reasoning x-ai/grok-4.1-fast-non-reasoning 2M 64K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-11-20
Grok 4 x-ai/grok-4 256K 64K Input: $3
Output: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text
Out: text
Released: 2025-07-09
Grok 4 Fast x-ai/grok-4-fast 2M 64K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-09-19
Grok 4.2 Fast Non Reasoning x-ai/grok-4.2-fast-non-reasoning 2M 30K Input: $3
Output: $9
Model: 1.500
Completion: 3.000
📎 🔧 🌡️ 2025-08-31 In: text, image, video
Out: text
Released: 2026-03-20
Grok 4.2 Fast x-ai/grok-4.2-fast 2M 30K Input: $3
Output: $9
Model: 1.500
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image, video
Out: text
Released: 2026-03-20
Grok 4.1 Fast x-ai/grok-4.1-fast 2M 64K Input: $0.2
Output: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-11-20
Grok Build 0.1 x-ai/grok-build-0.1 256K 256K Input: $1
Output: $2
Cache Read: $0.2
Model: 0.500
Completion: 2.000
Cache: 0.200
📎 🧠 🔧 🌡️ - In: text, image, pdf
Out: text
Released: 2026-04-16
GLM 4.7 z-ai/glm-4.7 200K 64K Input: $0.28
Output: $1.14
Cache Read: $0.06
Model: 0.140
Completion: 4.071
Cache: 0.214
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-12-23
GLM 4.5 z-ai/glm-4.5 128K 64K Input: $0.35
Output: $1.54
Cache Read: $0.07
Model: 0.175
Completion: 4.400
Cache: 0.200
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-07-25
GLM 4.7 FlashX z-ai/glm-4.7-flashx 200K 64K Input: $0.07
Output: $0.42
Cache Read: $0.01
Model: 0.035
Completion: 6.000
Cache: 0.143
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-01-19
GLM-5.1 z-ai/glm-5.1 200K 131.1K Input: $0.8781
Output: $3.5126
Cache Read: $0.1903
Model: 0.439
Completion: 4.000
Cache: 0.217
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-04-03
GLM 4.6 z-ai/glm-4.6 200K 64K Input: $0.35
Output: $1.54
Cache Read: $0.07
Model: 0.175
Completion: 4.400
Cache: 0.200
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-09-30
GLM 4.6V Flash (Free) z-ai/glm-4.6v-flash-free 200K 64K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video
Out: text
Released: 2025-12-08
GLM 4.7 Flash (Free) z-ai/glm-4.7-flash-free 200K 64K Input: $0
Output: $0
- 🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-01-19
GLM 4.6V z-ai/glm-4.6v 200K 64K Input: $0.14
Output: $0.42
Cache Read: $0.03
Model: 0.070
Completion: 3.000
Cache: 0.214
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video
Out: text
Released: 2025-12-08
GLM 5V Turbo z-ai/glm-5v-turbo 200K 128K Input: $0.726
Output: $3.1946
Cache Read: $0.1743
Model: 0.363
Completion: 4.400
Cache: 0.240
📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2026-04-01
GLM 4.6V FlashX z-ai/glm-4.6v-flash 200K 64K Input: $0.02
Output: $0.21
Cache Read: $0.0043
Model: 0.010
Completion: 10.500
Cache: 0.215
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video
Out: text
Released: 2025-12-08
GLM 4.5 Air z-ai/glm-4.5-air 128K 64K Input: $0.11
Output: $0.56
Cache Read: $0.02
Model: 0.055
Completion: 5.091
Cache: 0.182
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-07-25
GLM 5 z-ai/glm-5 200K 128K Input: $0.58
Output: $2.6
Cache Read: $0.14
Model: 0.290
Completion: 4.483
Cache: 0.241
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Open Weights
Released: 2026-02-12
GLM 5 Turbo z-ai/glm-5-turbo 200K 128K Input: $0.88
Output: $3.48
Model: 0.440
Completion: 3.955
📎 🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-03-20
GPT-5.5 Instant openai/gpt-5.5-instant 400K 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-05-05
Updated: 2026-05-28
GPT-5.2-Pro openai/gpt-5.2-pro 400K 128K Input: $21
Output: $168
Model: 10.500
Completion: 8.000
📎 🧠 🔧 2025-08-31 In: text, image, pdf
Out: text
Released: 2025-12-11
GPT-5 openai/gpt-5 400K 64K Input: $1.25
Output: $10
Cache Read: $0.12
Model: 0.625
Completion: 8.000
Cache: 0.096
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, pdf
Out: text
Released: 2025-08-07
GPT-5.1 Chat openai/gpt-5.1-chat 128K 64K Input: $1.25
Output: $10
Cache Read: $0.12
Model: 0.625
Completion: 8.000
Cache: 0.096
📎 🔧 🌡️ 2025-01-01 In: pdf, image, text
Out: text
Released: 2025-11-13
GPT-5.4 Nano openai/gpt-5.4-nano 400K 128K Input: $0.2
Output: $1.25
Model: 0.100
Completion: 6.250
🔧 🌡️ 2025-08-31 In: text
Out: text
Released: 2026-03-20
GPT-5.3 Chat openai/gpt-5.3-chat 128K 16.4K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🔧 🌡️ 2025-08-31 In: text
Out: text
Released: 2026-03-20
GPT-5.1-Codex openai/gpt-5.1-codex 400K 64K Input: $1.25
Output: $10
Cache Read: $0.12
Model: 0.625
Completion: 8.000
Cache: 0.096
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-11-13
GPT-5.2 openai/gpt-5.2 400K 64K Input: $1.75
Output: $14
Cache Read: $0.17
Model: 0.875
Completion: 8.000
Cache: 0.097
📎 🧠 🔧 2025-01-01 In: image, text, pdf
Out: text
Released: 2025-12-11
GPT-5.3 Codex openai/gpt-5.3-codex 400K 128K Input: $1.75
Output: $14
Model: 0.875
Completion: 8.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text
Out: text
Released: 2026-03-20
GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini 400K 64K Input: $0.25
Output: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text
Out: text
Released: 2025-11-13
GPT-5.4 openai/gpt-5.4 1.1M 128K Input: $3.75
Output: $18.75
Model: 1.875
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-20
GPT-5.4 Mini openai/gpt-5.4-mini 400K 128K Input: $0.75
Output: $4.5
Model: 0.375
Completion: 6.000
📎 🔧 🌡️ 2025-08-31 In: text
Out: text
Released: 2026-03-20
GPT-5.4 Pro openai/gpt-5.4-pro 1.1M 128K Input: $45
Output: $225
Model: 22.500
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-03-20
GPT-5.5 Pro openai/gpt-5.5-pro 1.1M 128K Input: $30
Output: $180
Model: 15.000
Completion: 6.000
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
GPT-5 Codex openai/gpt-5-codex 400K 64K Input: $1.25
Output: $10
Cache Read: $0.12
Model: 0.625
Completion: 8.000
Cache: 0.096
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-09-23
GPT-5.2-Codex openai/gpt-5.2-codex 400K 64K Input: $1.75
Output: $14
Cache Read: $0.17
Model: 0.875
Completion: 8.000
Cache: 0.097
📎 🧠 🔧 2025-01-01 In: text, image, pdf
Out: text
Released: 2026-01-15
GPT-5.1 openai/gpt-5.1 400K 64K Input: $1.25
Output: $10
Cache Read: $0.12
Model: 0.625
Completion: 8.000
Cache: 0.096
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text, pdf
Out: text
Released: 2025-11-13
GPT-5.5 openai/gpt-5.5 1.1M 128K Input: $5
Output: $30
Cache Read: $0.5
Model: 2.500
Completion: 6.000
Cache: 0.100
📎 🧠 🔧 2025-12-01 In: text, image, pdf
Out: text
Released: 2026-04-23
MiMo-V2.5 xiaomi/mimo-v2.5 1M 131.1K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video
Out: text
Open Weights
Released: 2026-04-22
MiMo V2 Omni xiaomi/mimo-v2-omni 265K 265K Input: $0.4
Output: $2
Cache Read: $0.08
Model: 0.200
Completion: 5.000
Cache: 0.200
📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio, video, pdf
Out: text
Released: 2026-03-18
MiMo-V2-Flash xiaomi/mimo-v2-flash 262.1K 65.5K Input: $0.1
Output: $0.3
Cache Read: $0.01
Model: 0.050
Completion: 3.000
Cache: 0.100
🧠 🔧 🌡️ 2024-12-01 In: text
Out: text
Open Weights
Released: 2025-12-16
Updated: 2026-02-04
MiMo V2 Pro xiaomi/mimo-v2-pro 1M 256K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2026-03-18
MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro 1M 131.1K Input: $1
Output: $3
Cache Read: $0.2
Model: 0.500
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2026-04-22
Agnes 1.5 Pro sapiens-ai/agnes-1.5-pro 256K 256K Input: $0.16
Output: $0.8
Model: 0.080
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-21
Agnes 1.5 Lite sapiens-ai/agnes-1.5-lite 256K 256K Input: $0.12
Output: $0.6
Model: 0.060
Completion: 5.000
📎 🔧 🌡️ - In: text, image
Out: text
Released: 2026-03-26
Doubao-Seed-2.0-pro volcengine/doubao-seed-2.0-pro 256K 64K Input: $0.45
Output: $2.24
Cache Read: $0.09
Cache Write: $0.0024
Model: 0.225
Completion: 4.978
Cache: 0.200
📎 🧠 🔧 🌡️ 2026-02-14 In: text, image, video
Out: text
Released: 2026-02-14
Doubao-Seed-Code volcengine/doubao-seed-code 256K 64K Input: $0.17
Output: $1.12
Cache Read: $0.03
Model: 0.085
Completion: 6.588
Cache: 0.176
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2025-11-11
Doubao-Seed-1.8 volcengine/doubao-seed-1.8 256K 64K Input: $0.11
Output: $0.28
Cache Read: $0.02
Cache Write: $0.0024
Model: 0.055
Completion: 2.545
Cache: 0.182
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, video
Out: text
Released: 2025-12-18
Doubao-Seed-2.0-mini volcengine/doubao-seed-2.0-mini 256K 64K Input: $0.03
Output: $0.28
Cache Read: $0.01
Cache Write: $0.0024
Model: 0.015
Completion: 9.333
Cache: 0.333
📎 🧠 🔧 🌡️ 2026-02-14 In: text, image, video
Out: text
Released: 2026-02-14
Doubao-Seed-2.0-lite volcengine/doubao-seed-2.0-lite 256K 64K Input: $0.09
Output: $0.51
Cache Read: $0.02
Cache Write: $0.0024
Model: 0.045
Completion: 5.667
Cache: 0.222
📎 🧠 🔧 🌡️ 2026-02-14 In: text, image, video
Out: text
Released: 2026-02-14
Doubao Seed 2.0 Code volcengine/doubao-seed-2.0-code 256K 32K Input: $0.9
Output: $4.48
Model: 0.450
Completion: 4.978
📎 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-03-20
Claude 3.5 Haiku anthropic/claude-3.5-haiku 200K 64K Input: $0.8
Output: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2024-11-04
Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, pdf
Out: text
Released: 2025-09-29
Claude Sonnet 4 anthropic/claude-sonnet-4 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text, pdf
Out: text
Released: 2025-05-22
Claude Haiku 4.5 anthropic/claude-haiku-4.5 200K 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2025-01-01 In: image, text
Out: text
Released: 2025-10-15
Claude Opus 4.7 anthropic/claude-opus-4.7 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-04-16
Claude 3.7 Sonnet anthropic/claude-3.7-sonnet 200K 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image, pdf
Out: text
Released: 2025-02-24
Claude Opus 4.8 anthropic/claude-opus-4.8 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 - In: text, image, pdf
Out: text
Released: 2026-05-28
Claude Fable 5 anthropic/claude-fable-5 1M 128K Input: $10
Output: $50
Cache Read: $1
Cache Write: $12.5
Model: 5.000
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 2026-01-31 In: text, image, pdf
Out: text
Released: 2026-06-09
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 64K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text, pdf
Out: text
Released: 2025-08-05
Claude Opus 4.5 anthropic/claude-opus-4.5 200K 64K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: pdf, image, text
Out: text
Released: 2025-11-24
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 1M 64K Input: $3
Output: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-08-31 In: text, image
Out: text
Released: 2026-02-18
Claude Opus 4 anthropic/claude-opus-4 200K 32K Input: $15
Output: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text, pdf
Out: text
Released: 2025-05-22
Claude Opus 4.6 anthropic/claude-opus-4.6 1M 128K Input: $5
Output: $25
Cache Read: $0.5
Cache Write: $6.25
Model: 2.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-05-31 In: image, text
Out: text
Released: 2026-02-06
Hy3 preview tencent/hy3-preview 256K 64K Input: $0.172
Output: $0.572
Cache Read: $0.058
Cache Write: $0
Model: 0.086
Completion: 3.326
Cache: 0.337
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-04-20
Step-3 stepfun/step-3 65.5K 64K Input: $0.21
Output: $0.57
Model: 0.105
Completion: 2.714
📎 🧠 🔧 🌡️ 2025-01-01 In: image, text
Out: text
Released: 2025-07-31
Step 3.7 Flash stepfun/step-3.7-flash 256K 256K Input: $0.2
Output: $1.15
Model: 0.100
Completion: 5.750
📎 🧠 🔧 🌡️ 2026-01-01 In: text, image
Out: text
Open Weights
Released: 2026-05-29
Step 3.7 Flash (Free) stepfun/step-3.7-flash-free 256K 256K Input: $0
Output: $0
- 📎 🧠 🔧 🌡️ 2026-01-01 In: text, image
Out: text
Open Weights
Released: 2026-05-29
Step 3.5 Flash stepfun/step-3.5-flash 256K 64K Input: $0.1
Output: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-02-02
KAT-Coder-Pro-V2 kuaishou/kat-coder-pro-v2 256K 80K Input: $0.3
Output: $1.2
Cache Read: $0.06
Model: 0.150
Completion: 4.000
Cache: 0.200
🔧 🌡️ - In: text
Out: text
Released: 2026-03-30
Qwen3-Coder-Plus qwen/qwen3-coder-plus 1M 64K Input: $1
Output: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-07-23
Qwen3.7 Plus qwen/qwen3.7-plus 1M 64K Input: $0.4
Output: $1.6
Cache Read: $0.08
Cache Write: $0.5
Model: 0.200
Completion: 4.000
Cache: 0.200
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2026-06-02
Qwen3.7 Max qwen/qwen3.7-max 1M 65.5K Input: $2.5
Output: $7.5
Cache Read: $0.5
Cache Write: $3.125
Model: 1.250
Completion: 3.000
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-05-21
Qwen3-Max-Thinking qwen/qwen3-max 256K 64K Input: $1.2
Output: $6
Model: 0.600
Completion: 5.000
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-01-23
Qwen3.5 Flash qwen/qwen3.5-flash 1M 1M Input: $0.1
Output: $0.4
Model: 0.050
Completion: 4.000
📎 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2026-03-20
Qwen3.5 Plus qwen/qwen3.5-plus 1M 64K Input: $0.8
Output: $4.8
Model: 0.400
Completion: 6.000
📎 🧠 🔧 🌡️ 2025-01-01 In: text, image
Out: text
Released: 2026-03-20
Qwen3.6-Plus qwen/qwen3.6-plus 1M 64K Input: $0.5
Output: $3
Cache Read: $0.05
Cache Write: $0.625
Model: 0.250
Completion: 6.000
Cache: 0.100
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-30
DeepSeek V4 Flash deepseek/deepseek-v4-flash 1M 384K Input: $0.14
Output: $0.28
Cache Read: $0.0028
Model: 0.070
Completion: 2.000
Cache: 0.020
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek V4 Pro deepseek/deepseek-v4-pro 1M 384K Input: $0.435
Output: $0.87
Cache Read: $0.003625
Model: 0.217
Completion: 2.000
Cache: 0.008
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2026-04-24
DeepSeek-V3.2-Exp deepseek/deepseek-v3.2-exp 163K 64K Input: $0.22
Output: $0.33
Model: 0.110
Completion: 1.500
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-09-29
DeepSeek-V3.2 (Non-thinking Mode) deepseek/deepseek-chat 128K 64K Input: $0.28
Output: $0.42
Cache Read: $0.03
Model: 0.140
Completion: 1.500
Cache: 0.107
🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-12-01
DeepSeek V3.2 deepseek/deepseek-v3.2 128K 64K Input: $0.28
Output: $0.43
Model: 0.140
Completion: 1.536
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-12-05
MiniMax M2.7 highspeed minimax/minimax-m2.7-highspeed 204.8K 131.1K Input: $0.611
Output: $2.4439
Model: 0.305
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-03-20
MiniMax M2.5 minimax/minimax-m2.5 204.8K 131.1K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.375
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-02-13
MiniMax M2.5 highspeed minimax/minimax-m2.5-lightning 204.8K 131.1K Input: $0.6
Output: $4.8
Cache Read: $0.06
Cache Write: $0.75
Model: 0.300
Completion: 8.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-02-13
MiniMax M2.1 minimax/minimax-m2.1 204K 64K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.38
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-12-22
MiniMax-M3 minimax/minimax-m3 512K 128K Input: $0.6
Output: $2.4
Model: 0.300
Completion: 4.000
📎 🧠 🔧 🌡️ - In: text, image, video
Out: text
Open Weights
Released: 2026-06-01
MiniMax M2 minimax/minimax-m2 204K 64K Input: $0.3
Output: $1.2
Cache Read: $0.03
Cache Write: $0.38
Model: 0.150
Completion: 4.000
Cache: 0.100
🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2025-10-27
MiniMax M2.7 minimax/minimax-m2.7 204.8K 131.1K Input: $0.3055
Output: $1.2219
Model: 0.153
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-01-01 In: text
Out: text
Released: 2026-03-20

Zhipu AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-5.1 glm-5.1 200K 131.1K Input: $6
Output: $24
Cache Read: $1.3
Cache Write: $0
Model: 3.000
Completion: 4.000
Cache: 0.217
🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
GLM-5V-Turbo glm-5v-turbo 200K 131.1K Input: $5
Output: $22
Cache Read: $1.2
Cache Write: $0
Model: 2.500
Completion: 4.400
Cache: 0.240
📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2026-04-01
GLM-5 glm-5 204.8K 131.1K Input: $1
Output: $3.2
Cache Read: $0.2
Cache Write: $0
Model: 0.500
Completion: 3.200
Cache: 0.200
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2026-02-11
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.7-Flash glm-4.7-flash 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-4.5-Air glm-4.5-air 131.1K 98.3K Input: $0.2
Output: $1.1
Cache Read: $0.03
Cache Write: $0
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.6V glm-4.6v 128K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
GLM-4.6 glm-4.6 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.7-FlashX glm-4.7-flashx 200K 131.1K Input: $0.07
Output: $0.4
Cache Read: $0.01
Cache Write: $0
Model: 0.035
Completion: 5.714
Cache: 0.143
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2026-01-19
GLM-4.5 glm-4.5 131.1K 98.3K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5V glm-4.5v 64K 16.4K Input: $0.6
Output: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0.6
Output: $2.2
Cache Read: $0.11
Cache Write: $0
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22
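
The NewAPI Ratios values in the table above appear to follow directly from the Pricing (1M) values: Model is the input price divided by 2, Completion is the output price divided by the input price, and Cache is the cache read price divided by the input price. For GLM-4.7, $0.6 / 2 = 0.300, $2.2 / $0.6 = 3.667, and $0.11 / $0.6 = 0.183. The sketch below reproduces that arithmetic. The function name `newapi_ratios` and the constant `USD_PER_RATIO_UNIT` are hypothetical, and the relationship is inferred from the listed rows rather than taken from any documented generator.

```python
from typing import Optional

# Assumption inferred from the rows above: a Model ratio of 1.0
# corresponds to an input price of $2 per 1M tokens.
USD_PER_RATIO_UNIT = 2.0


def newapi_ratios(
    input_usd: float,
    output_usd: float,
    cache_read_usd: Optional[float] = None,
) -> Optional[dict]:
    """Derive Model / Completion / Cache ratios from per-1M-token prices.

    Returns None when the input price is zero, matching the "-" shown
    for free models in the tables.
    """
    if input_usd <= 0:
        return None
    ratios = {
        "model": round(input_usd / USD_PER_RATIO_UNIT, 3),
        "completion": round(output_usd / input_usd, 3),
    }
    # Rows without a Cache Read price list no Cache ratio.
    if cache_read_usd is not None:
        ratios["cache"] = round(cache_read_usd / input_usd, 3)
    return ratios


if __name__ == "__main__":
    # GLM-4.7 row: Input $0.6, Output $2.2, Cache Read $0.11
    print(newapi_ratios(0.6, 2.2, 0.11))
    # GLM-4.7-Flash row: all prices $0, so no ratios are listed
    print(newapi_ratios(0.0, 0.0, 0.0))
```

A few listed ratios can differ in the last digit from what this sketch computes, because rounding at an exact half (for example 0.2175) depends on the floating-point representation and rounding mode the generator used.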

Zhipu AI Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing (1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-5.1 glm-5.1 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-27
GLM-5V-Turbo glm-5v-turbo 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 📎 🧠 🔧 🌡️ - In: text, image, video, pdf
Out: text
Released: 2026-04-01
GLM-5-Turbo glm-5-turbo 200K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-03-16
GLM-4.5-Air glm-4.5-air 131.1K 98.3K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.6V glm-4.6v 128K 32.8K Input: $0.3
Output: $0.9
Model: 0.150
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-12-08
GLM-5.2 glm-5.2 1M 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ - In: text
Out: text
Released: 2026-06-13
GLM-4.7 glm-4.7 204.8K 131.1K Input: $0
Output: $0
Cache Read: $0
Cache Write: $0
- 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-12-22