Data Browser

This page lists LLM providers and their models, with context limits, pricing, capabilities, and release details. It is generated automatically from API data.

Statistics

Provider Count: 57    Model Count: 967    Last Updated: 10/21/2025, 11:36:21 AM

Capabilities Legend: 🧠 Reasoning   🔧 Tools   📎 Attachment   🌡️ Temperature
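
The legend above can be decoded mechanically. A minimal Python sketch, assuming the Capabilities cell of each row is a space-separated run of the four legend emoji (or `-` when none apply). The function name `parse_capabilities` and the label strings are illustrative and not part of the page's data format:

```python
# Legend emoji written as code points so the mapping is unambiguous.
LEGEND = {
    "\U0001F9E0": "reasoning",    # brain
    "\U0001F527": "tools",        # wrench
    "\U0001F4CE": "attachment",   # paperclip
    "\U0001F321": "temperature",  # thermometer
}

def parse_capabilities(cell: str) -> set[str]:
    """Map a Capabilities cell to a set of capability names."""
    names = set()
    for token in cell.split():
        # The thermometer is often followed by a variation selector (U+FE0F).
        key = token.replace("\ufe0f", "")
        if key in LEGEND:
            names.add(LEGEND[key])
    return names

row = "\U0001F4CE \U0001F9E0 \U0001F527 \U0001F321\ufe0f"
print(sorted(parse_capabilities(row)))
```

A cell containing only `-` yields an empty set, which matches rows that list no capabilities.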

AIHubMix

📚 Official Documentation

Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
GPT-4.1 nano gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Qwen3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 262.1K 262.1K In: $0.28
Out: $1.12
Model: 0.140
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Claude Opus 4.1 claude-opus-4-1 200K 32K In: $16.5
Out: $82.5
Cache Read: $1.5
Cache Write: $18.75
Model: 8.250
Completion: 5.000
Cache: 0.091
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Haiku 4.5 claude-haiku-4-5 200K 64K In: $1.1
Out: $5.5
Cache Read: $0.11
Cache Write: $1.25
Model: 0.550
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Gemini 2.5 Flash gemini-2.5-flash 1M 65K In: $0.075
Out: $0.3
Cache Read: $0.02
Model: 0.037
Completion: 4.000
Cache: 0.267
📎 🔧 🌡️ 2025-04 In: text, image, audio, video
Out: text
Released: 2025-09-15
GPT-4.1 mini gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Claude Sonnet 4.5 claude-sonnet-4-5 200K 64K In: $3.3
Out: $16.5
Cache Read: $0.3
Cache Write: $3.75
Model: 1.650
Completion: 5.000
Cache: 0.091
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
DeepSeek-V3.2-Exp DeepSeek-V3.2-Exp 163K 163K In: $0.27
Out: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-29
Qwen3 235B A22B Thinking 2507 qwen3-235b-a22b-thinking-2507 262.1K 262.1K In: $0.28
Out: $2.8
Model: 0.140
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
GPT-5-Nano gpt-5-nano 128K 16.4K In: $0.5
Out: $2
Cache Read: $0.25
Model: 0.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5-Codex gpt-5-codex 400K 128K - - 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-4o gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4.1 gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GLM-4.6 glm-4.6 204.8K 204.8K In: $0.27
Out: $1.1
Cache Read: $0.11
Model: 0.135
Completion: 4.074
Cache: 0.407
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
o4-mini o4-mini 200K 65.5K In: $1.5
Out: $6
Cache Read: $0.75
Model: 0.750
Completion: 4.000
Cache: 0.500
🧠 2024-09 In: text
Out: text
Released: 2025-09-15
GPT-5-Mini gpt-5-mini 200K 64K In: $1.5
Out: $6
Cache Read: $0.75
Model: 0.750
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
Gemini 2.5 Pro gemini-2.5-pro 2M 65K In: $1.25
Out: $5
Cache Read: $0.31
Model: 0.625
Completion: 4.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-04 In: text, image, audio, video
Out: text
Released: 2025-09-15
DeepSeek-V3.2-Exp-Think DeepSeek-V3.2-Exp-Think 131K 64K In: $0.27
Out: $0.41
Model: 0.135
Completion: 1.519
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-09-29
GPT-4o (2024-11-20) gpt-4o-2024-11-20 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-11-20
Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 131K In: $0.82
Out: $3.29
Model: 0.410
Completion: 4.012
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
GPT-5 gpt-5 400K 128K In: $5
Out: $20
Cache Read: $2.5
Model: 2.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
Kimi K2 0905 Kimi-K2-0905 262.1K 262.1K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
GPT-5-Pro gpt-5-pro 400K 128K In: $7
Out: $28
Cache Read: $3.5
Model: 3.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
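
The NewAPI Ratios in the rows above appear to follow directly from the listed prices: Model is the input price divided by 2 (so $2 per 1M tokens corresponds to a ratio of 1), Completion is the output price divided by the input price, and Cache is the cache-read price divided by the input price. This relationship is inferred from the table values, not from a formula stated on the page, and the function name is hypothetical. A Python sketch:

```python
from typing import Optional

# Assumption inferred from the table: a Model ratio of 1.0 equals $2 per 1M input tokens.
BASE_PRICE_PER_1M = 2.0

def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive ratios from $/1M prices, rounded to three decimals like the table."""
    ratios = {
        "model": round(input_price / BASE_PRICE_PER_1M, 3),
        "completion": round(output_price / input_price, 3),
    }
    # Rows without a Cache Read price list no Cache ratio.
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios

# GPT-4.1 nano row: In $0.1, Out $0.4, Cache Read $0.03
print(newapi_ratios(0.1, 0.4, 0.03))
# Claude Opus 4.1 row: In $16.5, Out $82.5, Cache Read $1.5
print(newapi_ratios(16.5, 82.5, 1.5))
```

For these two rows the sketch reproduces the listed ratios (0.050 / 4.000 / 0.300 and 8.250 / 5.000 / 0.091). Prices whose derived value falls exactly on a rounding boundary, such as an input price of $0.075, may round differently from the table.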

Alibaba

📖 API Address | 📚 Official Documentation

Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
Qwen3-LiveTranslate Flash Realtime qwen3-livetranslate-flash-realtime 53.2K 4.1K In: $10
Out: $10
Model: 5.000
Completion: 1.000
🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-22
Qwen3-ASR Flash qwen3-asr-flash 53.2K 4.1K In: $0.035
Out: $0.035
Model: 0.018
Completion: 1.000
- 2024-04 In: audio
Out: text
Released: 2025-09-08
Qwen-Omni Turbo qwen-omni-turbo 32.8K 2K In: $0.07
Out: $0.27
Model: 0.035
Completion: 3.857
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-01-19
Updated: 2025-03-26
Qwen-VL Max qwen-vl-max 131.1K 8.2K In: $0.8
Out: $3.2
Model: 0.400
Completion: 4.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-04-08
Updated: 2025-08-13
Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen Turbo qwen-turbo 1M 16.4K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-11-01
Updated: 2025-04-28
Qwen3-VL 235B-A22B qwen3-vl-235b-a22b 131.1K 32.8K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen3 Coder Flash qwen3-coder-flash 1M 65.5K In: $0.3
Out: $1.5
Model: 0.150
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
Qwen3-VL 30B-A3B qwen3-vl-30b-a3b 131.1K 32.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen3 14B qwen3-14b 131.1K 8.2K In: $0.35
Out: $1.4
Model: 0.175
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
QVQ Max qvq-max 131.1K 8.2K In: $1.2
Out: $4.8
Model: 0.600
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-03-25
Qwen Plus Character (Japanese) qwen-plus-character-ja 8.2K 512 In: $0.5
Out: $1.4
Model: 0.250
Completion: 2.800
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen2.5 14B Instruct qwen2-5-14b-instruct 131.1K 8.2K In: $0.35
Out: $1.4
Model: 0.175
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
QwQ Plus qwq-plus 131.1K 8.2K In: $0.8
Out: $2.4
Model: 0.400
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-03-05
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 65.5K In: $0.45
Out: $2.25
Model: 0.225
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-VL OCR qwen-vl-ocr 34.1K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-28
Updated: 2025-04-13
Qwen2.5 72B Instruct qwen2-5-72b-instruct 131.1K 8.2K In: $1.4
Out: $5.6
Model: 0.700
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3-Omni Flash qwen3-omni-flash 65.5K 16.4K In: $0.43
Out: $1.66
Model: 0.215
Completion: 3.860
🧠 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
Qwen Flash qwen-flash 1M 32.8K In: $0.05
Out: $0.4
Model: 0.025
Completion: 8.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-07-28
Qwen3 8B qwen3-8b 131.1K 8.2K In: $0.18
Out: $0.7
Model: 0.090
Completion: 3.889
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime 65.5K 16.4K In: $0.52
Out: $1.99
Model: 0.260
Completion: 3.827
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct 131.1K 8.2K In: $2.8
Out: $8.4
Model: 1.400
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3-VL Plus qwen3-vl-plus 262.1K 32.8K In: $0.2
Out: $1.6
Model: 0.100
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-09-23
Qwen Plus qwen-plus 1M 32.8K In: $0.4
Out: $1.2
Model: 0.200
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen2.5 32B Instruct qwen2-5-32b-instruct 131.1K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-Omni 7B qwen2-5-omni-7b 32.8K 2K In: $0.1
Out: $0.4
Model: 0.050
Completion: 4.000
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Open Weights
Released: 2024-12
Qwen Max qwen-max 32.8K 8.2K In: $1.6
Out: $6.4
Model: 0.800
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
Qwen2.5 7B Instruct qwen2-5-7b-instruct 131.1K 8.2K In: $0.175
Out: $0.7
Model: 0.087
Completion: 4.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct 131.1K 8.2K In: $0.35
Out: $1.05
Model: 0.175
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3 235B-A22B qwen3-235b-a22b 131.1K 16.4K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime 32.8K 2K In: $0.27
Out: $1.07
Model: 0.135
Completion: 3.963
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-05-08
Qwen-MT Turbo qwen-mt-turbo 16.4K 8.2K In: $0.16
Out: $0.49
Model: 0.080
Completion: 3.063
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K In: $1.5
Out: $7.5
Model: 0.750
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-MT Plus qwen-mt-plus 16.4K 8.2K In: $2.46
Out: $7.37
Model: 1.230
Completion: 2.996
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen3 Max qwen3-max 262.1K 65.5K In: $1.2
Out: $6
Model: 0.600
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K In: $1
Out: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking 131.1K 32.8K In: $0.5
Out: $6
Model: 0.250
Completion: 12.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
Qwen3 32B qwen3-32b 131.1K 16.4K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-VL Plus qwen-vl-plus 131.1K 8.2K In: $0.21
Out: $0.63
Model: 0.105
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-01-25
Updated: 2025-08-15
DeepSeek R1 deepseek-r1 128K - In: $4
Out: $16
Model: 2.000
Completion: 4.000
- - In: text
Out: text
-

Alibaba (China)

📖 API Address | 📚 Official Documentation

Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
DeepSeek R1 Distill Qwen 7B deepseek-r1-distill-qwen-7b 32.8K 16.4K In: $0.072
Out: $0.144
Model: 0.036
Completion: 2.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3-ASR Flash qwen3-asr-flash 53.2K 4.1K In: $0.032
Out: $0.032
Model: 0.016
Completion: 1.000
- 2024-04 In: audio
Out: text
Released: 2025-09-08
DeepSeek R1 0528 deepseek-r1-0528 131.1K 16.4K In: $0.574
Out: $2.294
Model: 0.287
Completion: 3.997
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-05-28
DeepSeek V3 deepseek-v3 65.5K 8.2K In: $0.287
Out: $1.147
Model: 0.143
Completion: 3.997
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
Qwen-Omni Turbo qwen-omni-turbo 32.8K 2K In: $0.058
Out: $0.23
Model: 0.029
Completion: 3.966
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-01-19
Updated: 2025-03-26
Qwen-VL Max qwen-vl-max 131.1K 8.2K In: $0.23
Out: $0.574
Model: 0.115
Completion: 2.496
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-04-08
Updated: 2025-08-13
DeepSeek V3.2 Exp deepseek-v3-2-exp 131.1K 65.5K In: $0.287
Out: $0.431
Model: 0.143
Completion: 1.502
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3-Next 80B-A3B Instruct qwen3-next-80b-a3b-instruct 131.1K 32.8K In: $0.144
Out: $0.574
Model: 0.072
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
DeepSeek R1 deepseek-r1 131.1K 16.4K In: $0.574
Out: $2.294
Model: 0.287
Completion: 3.997
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen Turbo qwen-turbo 1M 16.4K In: $0.044
Out: $0.087
Model: 0.022
Completion: 1.977
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-11-01
Updated: 2025-07-15
Qwen3-VL 235B-A22B qwen3-vl-235b-a22b 131.1K 32.8K In: $0.286705
Out: $1.14682
Model: 0.143
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen3 Coder Flash qwen3-coder-flash 1M 65.5K In: $0.144
Out: $0.574
Model: 0.072
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-28
Qwen3-VL 30B-A3B qwen3-vl-30b-a3b 131.1K 32.8K In: $0.108
Out: $0.431
Model: 0.054
Completion: 3.991
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-04
Qwen3 14B qwen3-14b 131.1K 8.2K In: $0.144
Out: $0.574
Model: 0.072
Completion: 3.986
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
QVQ Max qvq-max 131.1K 8.2K In: $1.147
Out: $4.588
Model: 0.574
Completion: 4.000
🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-03-25
DeepSeek R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b 32.8K 16.4K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen Plus Character qwen-plus-character 32.8K 4.1K In: $0.115
Out: $0.287
Model: 0.058
Completion: 2.496
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen2.5 14B Instruct qwen2-5-14b-instruct 131.1K 8.2K In: $0.144
Out: $0.431
Model: 0.072
Completion: 2.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
QwQ Plus qwq-plus 131.1K 8.2K In: $0.23
Out: $0.574
Model: 0.115
Completion: 2.496
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-03-05
Qwen2.5-Coder 32B Instruct qwen2-5-coder-32b-instruct 131.1K 8.2K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-11
Qwen3-Coder 30B-A3B Instruct qwen3-coder-30b-a3b-instruct 262.1K 65.5K In: $0.216
Out: $0.861
Model: 0.108
Completion: 3.986
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen Math Plus qwen-math-plus 4.1K 3.1K In: $0.574
Out: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-08-16
Updated: 2024-09-19
Qwen-VL OCR qwen-vl-ocr 34.1K 4.1K In: $0.717
Out: $0.717
Model: 0.358
Completion: 1.000
🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-28
Updated: 2025-04-13
Qwen Doc Turbo qwen-doc-turbo 131.1K 8.2K In: $0.087
Out: $0.144
Model: 0.043
Completion: 1.655
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen Deep Research qwen-deep-research 1M 32.8K In: $7.742
Out: $23.367
Model: 3.871
Completion: 3.018
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen2.5 72B Instruct qwen2-5-72b-instruct 131.1K 8.2K In: $0.574
Out: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3-Omni Flash qwen3-omni-flash 65.5K 16.4K In: $0.058
Out: $0.23
Model: 0.029
Completion: 3.966
🧠 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Released: 2025-09-15
Qwen Flash qwen-flash 1M 32.8K In: $0.022
Out: $0.216
Model: 0.011
Completion: 9.818
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-07-28
Qwen3 8B qwen3-8b 131.1K 8.2K In: $0.072
Out: $0.287
Model: 0.036
Completion: 3.986
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen3-Omni Flash Realtime qwen3-omni-flash-realtime 65.5K 16.4K In: $0.23
Out: $0.918
Model: 0.115
Completion: 3.991
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-09-15
Qwen2.5-VL 72B Instruct qwen2-5-vl-72b-instruct 131.1K 8.2K In: $2.294
Out: $6.881
Model: 1.147
Completion: 3.000
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
Qwen3-VL Plus qwen3-vl-plus 262.1K 32.8K In: $0.143353
Out: $1.433525
Model: 0.072
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text, image
Out: text
Released: 2025-09-23
Qwen Plus qwen-plus 1M 32.8K In: $0.115
Out: $0.287
Model: 0.058
Completion: 2.496
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-01-25
Updated: 2025-09-11
Qwen2.5 32B Instruct qwen2-5-32b-instruct 131.1K 8.2K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-Omni 7B qwen2-5-omni-7b 32.8K 2K In: $0.087
Out: $0.345
Model: 0.043
Completion: 3.966
🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text, audio
Open Weights
Released: 2024-12
Qwen Max qwen-max 131.1K 8.2K In: $0.345
Out: $1.377
Model: 0.172
Completion: 3.991
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-04-03
Updated: 2025-01-25
Qwen Long qwen-long 10M 8.2K In: $0.072
Out: $0.287
Model: 0.036
Completion: 3.986
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2025-01-25
Qwen2.5-Math 72B Instruct qwen2-5-math-72b-instruct 4.1K 3.1K In: $0.574
Out: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Moonshot Kimi K2 Instruct moonshot-kimi-k2-instruct 131.1K 131.1K In: $0.574
Out: $2.294
Model: 0.287
Completion: 3.997
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Tongyi Intent Detect V3 tongyi-intent-detect-v3 8.2K 1K In: $0.058
Out: $0.144
Model: 0.029
Completion: 2.483
🌡️ 2024-04 In: text
Out: text
Released: 2024-01
Qwen2.5 7B Instruct qwen2-5-7b-instruct 131.1K 8.2K In: $0.072
Out: $0.144
Model: 0.036
Completion: 2.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen2.5-VL 7B Instruct qwen2-5-vl-7b-instruct 131.1K 8.2K In: $0.287
Out: $0.717
Model: 0.143
Completion: 2.498
🔧 🌡️ 2024-04 In: text, image
Out: text
Open Weights
Released: 2024-09
DeepSeek V3.1 deepseek-v3-1 131.1K 65.5K In: $0.574
Out: $1.721
Model: 0.287
Completion: 2.998
🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 32.8K 16.4K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3 235B-A22B qwen3-235b-a22b 131.1K 16.4K In: $0.287
Out: $1.147
Model: 0.143
Completion: 3.997
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen2.5-Coder 7B Instruct qwen2-5-coder-7b-instruct 131.1K 8.2K In: $0.144
Out: $0.287
Model: 0.072
Completion: 1.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-11
DeepSeek R1 Distill Qwen 14B deepseek-r1-distill-qwen-14b 32.8K 16.4K In: $0.144
Out: $0.431
Model: 0.072
Completion: 2.993
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen-Omni Turbo Realtime qwen-omni-turbo-realtime 32.8K 2K In: $0.23
Out: $0.918
Model: 0.115
Completion: 3.991
🔧 🌡️ 2024-04 In: text, image, audio
Out: text, audio
Released: 2025-05-08
Qwen Math Turbo qwen-math-turbo 4.1K 3.1K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-09-19
Qwen-MT Turbo qwen-mt-turbo 16.4K 8.2K In: $0.101
Out: $0.28
Model: 0.051
Completion: 2.772
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
DeepSeek R1 Distill Llama 8B deepseek-r1-distill-llama-8b 32.8K 16.4K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3-Coder 480B-A35B Instruct qwen3-coder-480b-a35b-instruct 262.1K 65.5K In: $0.861
Out: $3.441
Model: 0.430
Completion: 3.997
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-MT Plus qwen-mt-plus 16.4K 8.2K In: $0.259
Out: $0.775
Model: 0.130
Completion: 2.992
🌡️ 2024-04 In: text
Out: text
Released: 2025-01
Qwen3 Max qwen3-max 262.1K 65.5K In: $0.861
Out: $3.441
Model: 0.430
Completion: 3.997
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
QwQ 32B qwq-32b 131.1K 8.2K In: $0.287
Out: $0.861
Model: 0.143
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-12
Qwen2.5-Math 7B Instruct qwen2-5-math-7b-instruct 4.1K 3.1K In: $0.144
Out: $0.287
Model: 0.072
Completion: 1.993
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-09
Qwen3-Next 80B-A3B (Thinking) qwen3-next-80b-a3b-thinking 131.1K 32.8K In: $0.144
Out: $1.434
Model: 0.072
Completion: 9.958
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09
DeepSeek R1 Distill Qwen 1.5B deepseek-r1-distill-qwen-1-5b 32.8K 16.4K - - 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-01-01
Qwen3 32B qwen3-32b 131.1K 16.4K In: $0.287
Out: $1.147
Model: 0.143
Completion: 3.997
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04
Qwen-VL Plus qwen-vl-plus 131.1K 8.2K In: $0.115
Out: $0.287
Model: 0.058
Completion: 2.496
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-01-25
Updated: 2025-08-15
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K In: $1
Out: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

Amazon Bedrock

📚 Official Documentation

Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
Command R+ cohere.command-r-plus-v1:0 128K 4.1K In: $3
Out: $15
Model: 1.500
Completion: 5.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-04
Claude 2 anthropic.claude-v2 100K 4.1K In: $8
Out: $24
Model: 4.000
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-07-11
Claude Sonnet 3.7 anthropic.claude-3-7-sonnet-20250219-v1:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4 anthropic.claude-sonnet-4-20250514-v1:0 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-05-22
Llama 3.2 11B Instruct meta.llama3-2-11b-instruct-v1:0 128K 4.1K In: $0.16
Out: $0.16
Model: 0.080
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Claude Haiku 3 anthropic.claude-3-haiku-20240307-v1:0 200K 4.1K In: $0.25
Out: $1.25
Model: 0.125
Completion: 5.000
📎 🔧 🌡️ 2024-02 In: text, image
Out: text
Released: 2024-03-13
Llama 3.2 90B Instruct meta.llama3-2-90b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 3.2 1B Instruct meta.llama3-2-1b-instruct-v1:0 131K 4.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-09-25
Claude 2.1 anthropic.claude-v2:1 200K 4.1K In: $8
Out: $24
Model: 4.000
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-11-21
Command Light cohere.command-light-text-v14 4.1K 4.1K In: $0.3
Out: $0.6
Model: 0.150
Completion: 2.000
🌡️ 2023-08 In: text
Out: text
Open Weights
Released: 2023-11-01
Jamba 1.5 Large ai21.jamba-1-5-large-v1:0 256K 4.1K In: $2
Out: $8
Model: 1.000
Completion: 4.000
🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2024-08-15
Llama 3.3 70B Instruct meta.llama3-3-70b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Claude Opus 3 anthropic.claude-3-opus-20240229-v1:0 200K 4.1K In: $15
Out: $75
Model: 7.500
Completion: 5.000
📎 🔧 🌡️ 2023-08 In: text, image
Out: text
Released: 2024-02-29
Nova Pro amazon.nova-pro-v1:0 300K 8.2K In: $0.8
Out: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Llama 3.1 8B Instruct meta.llama3-1-8b-instruct-v1:0 128K 4.1K In: $0.22
Out: $0.22
Model: 0.110
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Sonnet 3.5 anthropic.claude-3-5-sonnet-20240620-v1:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-06-20
Claude Haiku 4.5 anthropic.claude-haiku-4-5-20251001-v1:0 200K 64K In: $1
Out: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image
Out: text
Released: 2025-10-15
Command R cohere.command-r-v1:0 128K 4.1K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-03-11
Nova Micro amazon.nova-micro-v1:0 128K 8.2K In: $0.035
Out: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Llama 3.1 70B Instruct meta.llama3-1-70b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 3 70B Instruct meta.llama3-70b-instruct-v1:0 8.2K 2K In: $2.65
Out: $3.5
Model: 1.325
Completion: 1.321
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
DeepSeek-R1 deepseek.r1-v1:0 128K 32.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29
Claude Sonnet 3.5 v2 anthropic.claude-3-5-sonnet-20241022-v2:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-22
Command cohere.command-text-v14 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2023-08 In: text
Out: text
Open Weights
Released: 2023-11-01
Claude Opus 4 anthropic.claude-opus-4-20250514-v1:0 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 4.5 anthropic.claude-sonnet-4-5-20250929-v1:0 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Llama 3.2 3B Instruct meta.llama3-2-3b-instruct-v1:0 131K 4.1K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-09-25
Claude Instant anthropic.claude-instant-v1 100K 4.1K In: $0.8
Out: $2.4
Model: 0.400
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-03-01
Nova Premier amazon.nova-premier-v1:0 1M 16.4K In: $2.5
Out: $12.5
Model: 1.250
Completion: 5.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Claude Opus 4.1 anthropic.claude-opus-4-1-20250805-v1:0 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Llama 4 Scout 17B Instruct meta.llama4-scout-17b-instruct-v1:0 3.5M 16.4K In: $0.17
Out: $0.66
Model: 0.085
Completion: 3.882
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Jamba 1.5 Mini ai21.jamba-1-5-mini-v1:0 256K 4.1K In: $0.2
Out: $0.4
Model: 0.100
Completion: 2.000
🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2024-08-15
Llama 3 8B Instruct meta.llama3-8b-instruct-v1:0 8.2K 2K In: $0.3
Out: $0.6
Model: 0.150
Completion: 2.000
🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Sonnet 3 anthropic.claude-3-sonnet-20240229-v1:0 200K 4.1K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🔧 🌡️ 2023-08 In: text, image
Out: text
Released: 2024-03-04
Llama 4 Maverick 17B Instruct meta.llama4-maverick-17b-instruct-v1:0 1M 16.4K In: $0.24
Out: $0.97
Model: 0.120
Completion: 4.042
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Nova Lite amazon.nova-lite-v1:0 300K 8.2K In: $0.06
Out: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Claude Haiku 3.5 anthropic.claude-3-5-haiku-20241022-v1:0 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-10-22
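
The Pricing ($/1M) column can be turned into a per-request estimate by scaling each token count by its price per million tokens. The sketch below assumes that tokens served from cache are billed at the Cache Read price, newly cached tokens at the Cache Write price, and all remaining input tokens at the In price. Actual billing rules vary by provider, and the `Pricing` class and `estimate_cost` function are illustrative names:

```python
from dataclasses import dataclass

@dataclass
class Pricing:
    """Prices in USD per 1M tokens, as listed in the Pricing column."""
    input: float
    output: float
    cache_read: float = 0.0
    cache_write: float = 0.0

def estimate_cost(p: Pricing, input_tokens: int, output_tokens: int,
                  cache_read_tokens: int = 0, cache_write_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumes input_tokens is the total prompt size and already includes
    the cache-read and cache-write tokens.
    """
    uncached = input_tokens - cache_read_tokens - cache_write_tokens
    if uncached < 0:
        raise ValueError("cached tokens exceed total input tokens")
    total = (uncached * p.input
             + cache_read_tokens * p.cache_read
             + cache_write_tokens * p.cache_write
             + output_tokens * p.output)
    return total / 1_000_000

# Claude Sonnet 4.5 row above: In $3, Out $15, Cache Read $0.3, Cache Write $3.75
sonnet_45 = Pricing(input=3.0, output=15.0, cache_read=0.3, cache_write=3.75)
cost = estimate_cost(sonnet_45, input_tokens=10_000, output_tokens=1_000,
                     cache_read_tokens=8_000)
print(f"${cost:.4f}")
```

Keeping the four prices in one record makes it easy to compare the same request across providers by swapping in another row's values.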

Anthropic

📚 Official Documentation

Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details
Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 3.5 claude-3-5-sonnet-20240620 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-06-20
Claude Opus 3 claude-3-opus-20240229 200K 4.1K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-02-29
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Claude Sonnet 4 claude-sonnet-4-20250514 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4 claude-opus-4-20250514 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Haiku 3.5 claude-3-5-haiku-20241022 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Haiku 3 claude-3-haiku-20240307 200K 4.1K In: $0.25
Out: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-13
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Opus 4.1 claude-opus-4-1-20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3 claude-3-sonnet-20240229 200K 4.1K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-04
Claude Haiku 4.5 claude-haiku-4-5-20251001 200K 64K In: $1
Out: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image
Out: text
Released: 2025-10-15
Claude Sonnet 4 claude-sonnet-4-0 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 3.7 claude-3-7-sonnet-latest 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4.5 claude-sonnet-4-5 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Claude Haiku 3.5 claude-3-5-haiku-latest 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Haiku 4.5 claude-haiku-4-5 200K 64K In: $1
Out: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02 In: text, image
Out: text
Released: 2025-10-15
Claude Opus 4.1 claude-opus-4-1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Opus 4 claude-opus-4-0 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22

Azure

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT-4.1 nano gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
GPT-4 gpt-4 8.2K 8.2K In: $60
Out: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
GPT-4 32K gpt-4-32k 32.8K 32.8K In: $60
Out: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
GPT-4.1 mini gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Chat gpt-5-chat 128K 16.4K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 2024-10-24 In: text, image
Out: text
Released: 2025-08-07
GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 16.4K 16.4K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🌡️ 2021-08 In: text
Out: text
Released: 2024-01-25
GPT-4 Turbo gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 16.4K 16.4K In: $3
Out: $4
Model: 1.500
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-06-13
o1-preview o1-preview 128K 32.8K In: $16.5
Out: $66
Cache Read: $8.25
Model: 8.250
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
o3-mini o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5 Nano gpt-5-nano 272K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5-Codex gpt-5-codex 400K 128K - - 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-4o gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-03-01
GPT-4.1 gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
o4-mini o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o1 o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-5 Mini gpt-5-mini 272K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o1-mini o1-mini 128K 65.5K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-09-21
o3 o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Codex Mini codex-mini 200K 100K In: $1.5
Out: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16
GPT-4 Turbo Vision gpt-4-turbo-vision 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4o mini gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-5 gpt-5 272K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 16.4K 16.4K In: $1
Out: $2
Model: 0.500
Completion: 2.000
🌡️ 2021-08 In: text
Out: text
Released: 2023-11-06
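
The NewAPI Ratios column appears to follow directly from the Pricing column. In the rows above, Model matches the input price divided by 2, Completion matches the output price divided by the input price, and Cache matches the cache-read price divided by the input price. The Python sketch below assumes that inferred relationship; the function name and the `base_price` of $2 per 1M tokens are assumptions drawn from the table values, not something this page documents.

```python
def newapi_ratios(input_price, output_price, cache_read_price=None, base_price=2.0):
    """Derive NewAPI-style ratios from $/1M-token prices.

    base_price=2.0 is an assumption inferred from the table rows
    (Model ratio == input price / 2); it is not stated on this page.
    """
    ratios = {
        "model": round(input_price / base_price, 3),
        "completion": round(output_price / input_price, 3),
    }
    # The Cache ratio only exists when a cache-read price is listed.
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# GPT-4o row from the Azure table: In $2.5, Out $10, Cache Read $1.25
print(newapi_ratios(2.5, 10, 1.25))
# {'model': 1.25, 'completion': 4.0, 'cache': 0.5}
```

Rows priced as `-` have no ratios to derive, so a caller would skip them rather than pass zeros.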

Baseten

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 Instruct 0905 moonshotai/Kimi-K2-Instruct-0905 262.1K 262.1K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-09-05
Qwen3 Coder 480B A35B Instruct Qwen3/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $0.38
Out: $1.53
Model: 0.190
Completion: 4.026
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM 4.6 zai-org/GLM-4.6 200K 200K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-09-16
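
The Knowledge, Released, and Updated fields in these tables use `YYYY-MM` or `YYYY-MM-DD` values, with `-` for unknown. Because the page is generated automatically from API data, a small sanity check can flag values that are not real calendar dates. The sketch below is one way to do that in Python; the function name is hypothetical and is not part of any tooling behind this page.

```python
from datetime import datetime


def is_valid_date(value):
    """Return True for '-' (unknown) or a real 'YYYY-MM' / 'YYYY-MM-DD' date."""
    if value == "-":
        return True
    for fmt in ("%Y-%m-%d", "%Y-%m"):
        try:
            # strptime rejects impossible months and days with ValueError.
            datetime.strptime(value, fmt)
            return True
        except ValueError:
            continue
    return False


print(is_valid_date("2025-09-05"))  # True
print(is_valid_date("2025-04"))     # True
print(is_valid_date("2025-13-01"))  # False: there is no month 13
```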

Cerebras

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen 3 235B Instruct qwen-3-235b-a22b-instruct-2507 131K 32K In: $0.6
Out: $1.2
Model: 0.300
Completion: 2.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-22
Qwen 3 Coder 480B qwen-3-coder-480b 131K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GPT OSS 120B gpt-oss-120b 131.1K 32.8K In: $0.25
Out: $0.69
Model: 0.125
Completion: 2.760
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05

Chutes

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi Dev 72B moonshotai/Kimi-Dev-72B 131.1K 131.1K In: $0.06664
Out: $0.266688
Model: 0.033
Completion: 4.002
🔧 🌡️ - In: text
Out: text
Released: 2024-12-01
Kimi K2 Instruct moonshotai/Kimi-K2-Instruct-75k 75K 75K In: $0.15
Out: $0.59
Model: 0.075
Completion: 3.933
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
Kimi K2 Instruct 0905 moonshotai/Kimi-K2-Instruct-0905 262.1K 262.1K In: $0.296176
Out: $1.18528
Model: 0.148
Completion: 4.002
🔧 🌡️ - In: text
Out: text
Released: 2025-09-05
Kimi VL A3B Thinking moonshotai/Kimi-VL-A3B-Thinking 131.1K 131.1K In: $0.02499
Out: $0.100008
Model: 0.012
Completion: 4.002
🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2024-12-01
LongCat Flash Chat FP8 meituan-longcat/LongCat-Flash-Chat-FP8 131.1K 131.1K In: $0.25
Out: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-10
DeepSeek R1T Chimera tngtech/DeepSeek-R1T-Chimera 163.8K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🧠 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-26
DeepSeek TNG R1T2 Chimera tngtech/DeepSeek-TNG-R1T2-Chimera 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-08
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.41
Model: 0.050
Completion: 4.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Devstral Small (2505) chutesai/Devstral-Small-2505 32.8K 32.8K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-21
Mistral Small 3.2 24B Instruct (2506) chutesai/Mistral-Small-3.2-24B-Instruct-2506 131.1K 131.1K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-20
Qwen3 30B A3B Qwen/Qwen3-30B-A3B 41K 41K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 30B A3B Thinking 2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262.1K 262.1K In: $0.08
Out: $0.29
Model: 0.040
Completion: 3.625
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262.1K 262.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 Coder 480B A35B Instruct (FP8) Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 262.1K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 262.1K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 Next 80B A3B Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262.1K 262.1K In: $0.1
Out: $0.8
Model: 0.050
Completion: 8.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Qwen3 Next 80B A3B Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 262.1K 262.1K In: $0.1
Out: $0.8
Model: 0.050
Completion: 8.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
GLM 4.5 Turbo zai-org/GLM-4.5-turbo 131.1K 131.1K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.6 FP8 zai-org/GLM-4.6-FP8 204.8K 131.1K In: $0.39
Out: $1.55
Model: 0.195
Completion: 3.974
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM 4.6 Turbo zai-org/GLM-4.6-turbo 204.8K 131.1K In: $1.15
Out: $3.25
Model: 0.575
Completion: 2.826
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-10-03
GLM 4.5 FP8 zai-org/GLM-4.5-FP8 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai-org/GLM-4.5-Air 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
DeepSeek R1 0528 Qwen3 8B deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 131.1K 131.1K In: $0.02
Out: $0.07
Model: 0.010
Completion: 3.500
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-29
DeepSeek R1 (0528) deepseek-ai/DeepSeek-R1-0528 75K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek V3.2 Exp deepseek-ai/DeepSeek-V3.2-Exp 128K 64K In: $0.25
Out: $0.35
Model: 0.125
Completion: 1.400
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-09-29
DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus 131.1K 65.5K In: $0.25
Out: $1
Model: 0.125
Completion: 4.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-22
DeepSeek V3.1 Turbo deepseek-ai/DeepSeek-V3.1-turbo 128K 128K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3.1 Reasoning deepseek-ai/DeepSeek-V3.1:THINKING 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B 131.1K 131.1K In: $0.03
Out: $0.14
Model: 0.015
Completion: 4.667
🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3 (0324) deepseek-ai/DeepSeek-V3-0324 75K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01

Cloudflare Workers AI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
hf/thebloke/mistral-7b-instruct-v0.1-awq mistral-7b-instruct-v0.1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-27
Updated: 2023-11-09
cf/deepgram/aura-1 aura-1 - - In: $0.015
Out: $0.015
Model: 0.007
Completion: 1.000
- - In: text
Out: audio
Open Weights
Released: 2025-08-27
Updated: 2025-07-07
hf/mistral/mistral-7b-instruct-v0.2 mistral-7b-instruct-v0.2 3.1K 3.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-11
Updated: 2025-07-24
cf/tinyllama/tinyllama-1.1b-chat-v1.0 tinyllama-1.1b-chat-v1.0 2K 2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-30
Updated: 2024-03-17
cf/qwen/qwen1.5-0.5b-chat qwen1.5-0.5b-chat 32K 32K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-31
Updated: 2024-04-30
cf/meta/llama-3.2-11b-vision-instruct llama-3.2-11b-vision-instruct 128K 128K In: $0.049
Out: $0.68
Model: 0.025
Completion: 13.878
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-12-04
hf/thebloke/llama-2-13b-chat-awq llama-2-13b-chat-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-19
Updated: 2023-11-09
cf/meta/llama-3.1-8b-instruct-fp8 llama-3.1-8b-instruct-fp8 32K 32K In: $0.15
Out: $0.29
Model: 0.075
Completion: 1.933
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
cf/openai/whisper whisper - - In: $0.00045
Out: $0.00045
Model: 0.000
Completion: 1.000
- - In: audio
Out: text
Open Weights
Released: 2023-11-07
Updated: 2024-08-12
cf/stabilityai/stable-diffusion-xl-base-1.0 stable-diffusion-xl-base-1.0 - - - - - - In: text
Out: image
Open Weights
Released: 2023-07-25
Updated: 2023-10-30
cf/meta/llama-2-7b-chat-fp16 llama-2-7b-chat-fp16 4.1K 4.1K In: $0.56
Out: $6.67
Model: 0.280
Completion: 11.911
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-07-26
cf/microsoft/resnet-50 resnet-50 - - In: $0.0000025
Out: $-
Model: 0.000 - - In: image
Out: text
Open Weights
Released: 2022-03-16
Updated: 2024-02-13
cf/runwayml/stable-diffusion-v1-5-inpainting stable-diffusion-v1-5-inpainting - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-27
cf/defog/sqlcoder-7b-2 sqlcoder-7b-2 10K 10K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-05
Updated: 2024-02-12
cf/meta/llama-3-8b-instruct llama-3-8b-instruct 8K 8K In: $0.28
Out: $0.83
Model: 0.140
Completion: 2.964
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-17
Updated: 2025-06-19
cf/meta-llama/llama-2-7b-chat-hf-lora llama-2-7b-chat-hf-lora 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-07-13
Updated: 2024-04-17
cf/meta/llama-3.1-8b-instruct llama-3.1-8b-instruct 8K 8K In: $0.28
Out: $0.83
Model: 0.140
Completion: 2.964
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
Updated: 2024-09-25
cf/openchat/openchat-3.5-0106 openchat-3.5-0106 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-07
Updated: 2024-05-18
hf/thebloke/openhermes-2.5-mistral-7b-awq openhermes-2.5-mistral-7b-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-02
Updated: 2023-11-09
cf/leonardo/lucid-origin lucid-origin - - In: $0.007
Out: $0.007
Model: 0.004
Completion: 1.000
- - In: text
Out: image
Released: 2025-08-25
Updated: 2025-08-05
cf/facebook/bart-large-cnn bart-large-cnn - - - - - - In: text
Out: text
Open Weights
Released: 2022-03-02
Updated: 2024-02-13
cf/black-forest-labs/flux-1-schnell flux-1-schnell 2K - In: $0.000053
Out: $0.00011
Model: 0.000
Completion: 2.075
- - In: text
Out: image
Open Weights
Released: 2024-07-31
Updated: 2024-08-16
cf/deepseek-ai/deepseek-r1-distill-qwen-32b deepseek-r1-distill-qwen-32b 80K 80K In: $0.5
Out: $4.88
Model: 0.250
Completion: 9.760
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-02-24
cf/google/gemma-2b-it-lora gemma-2b-it-lora 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-02
cf/fblgit/una-cybertron-7b-v2-bf16 una-cybertron-7b-v2-bf16 15K 15K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-02
Updated: 2024-03-08
cf/meta/m2m100-1.2b m2m100-1.2b - - In: $0.34
Out: $0.34
Model: 0.170
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2022-03-02
Updated: 2023-11-16
cf/meta/llama-3.2-3b-instruct llama-3.2-3b-instruct 128K 128K In: $0.051
Out: $0.34
Model: 0.025
Completion: 6.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
cf/qwen/qwen2.5-coder-32b-instruct qwen2.5-coder-32b-instruct 32.8K 32.8K In: $0.66
Out: $1
Model: 0.330
Completion: 1.515
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-06
Updated: 2025-01-12
cf/runwayml/stable-diffusion-v1-5-img2img stable-diffusion-v1-5-img2img - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-27
cf/google/gemma-7b-it-lora gemma-7b-it-lora 3.5K 3.5K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-02
cf/qwen/qwen1.5-14b-chat-awq qwen1.5-14b-chat-awq 7.5K 7.5K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-03
Updated: 2024-04-30
cf/qwen/qwen1.5-1.8b-chat qwen1.5-1.8b-chat 32K 32K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-30
Updated: 2024-04-30
cf/mistralai/mistral-small-3.1-24b-instruct mistral-small-3.1-24b-instruct 128K 128K In: $0.35
Out: $0.56
Model: 0.175
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-11
Updated: 2025-07-28
hf/google/gemma-7b-it gemma-7b-it 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-13
Updated: 2024-08-14
hf/thebloke/llamaguard-7b-awq llamaguard-7b-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-11
hf/nousresearch/hermes-2-pro-mistral-7b hermes-2-pro-mistral-7b 24K 24K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-03-11
Updated: 2024-09-08
cf/tiiuae/falcon-7b-instruct falcon-7b-instruct 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-04-25
Updated: 2024-10-12
cf/meta/llama-3.3-70b-instruct-fp8-fast llama-3.3-70b-instruct-fp8-fast 24K 24K In: $0.29
Out: $2.25
Model: 0.145
Completion: 7.759
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-06
cf/meta/llama-3-8b-instruct-awq llama-3-8b-instruct-awq 8.2K 8.2K In: $0.12
Out: $0.27
Model: 0.060
Completion: 2.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-05-09
cf/leonardo/phoenix-1.0 phoenix-1.0 - - In: $0.0058
Out: $0.0058
Model: 0.003
Completion: 1.000
- - In: text
Out: image
Released: 2025-08-25
cf/microsoft/phi-2 phi-2 2K 2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-13
Updated: 2024-04-29
cf/lykon/dreamshaper-8-lcm dreamshaper-8-lcm - - - - 📎 - In: text
Out: image
Open Weights
Released: 2023-12-06
Updated: 2023-12-07
cf/thebloke/discolm-german-7b-v1-awq discolm-german-7b-v1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-18
Updated: 2024-01-24
cf/meta/llama-2-7b-chat-int8 llama-2-7b-chat-int8 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-25
cf/meta/llama-3.2-1b-instruct llama-3.2-1b-instruct 60K 60K In: $0.027
Out: $0.2
Model: 0.013
Completion: 7.407
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
cf/openai/whisper-large-v3-turbo whisper-large-v3-turbo - - In: $0.00051
Out: $0.00051
Model: 0.000
Completion: 1.000
- - In: audio
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
cf/meta/llama-4-scout-17b-16e-instruct llama-4-scout-17b-16e-instruct 131K 131K In: $0.27
Out: $0.85
Model: 0.135
Completion: 3.148
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-02
Updated: 2025-05-23
hf/nexusflow/starling-lm-7b-beta starling-lm-7b-beta 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-03-19
Updated: 2024-04-03
hf/thebloke/deepseek-coder-6.7b-base-awq deepseek-coder-6.7b-base-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-05
Updated: 2023-11-09
cf/google/gemma-3-12b-it gemma-3-12b-it 80K 80K In: $0.35
Out: $0.56
Model: 0.175
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-01
Updated: 2025-03-21
cf/meta/llama-guard-3-8b llama-guard-3-8b - - In: $0.48
Out: $0.03
Model: 0.240
Completion: 0.063
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-22
Updated: 2024-10-11
hf/thebloke/neural-chat-7b-v3-1-awq neural-chat-7b-v3-1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-15
Updated: 2023-11-17
cf/openai/whisper-tiny-en whisper-tiny-en - - - - - - In: audio
Out: text
Open Weights
Released: 2022-09-26
Updated: 2024-01-22
cf/bytedance/stable-diffusion-xl-lightning stable-diffusion-xl-lightning - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-20
Updated: 2024-04-03
cf/mistral/mistral-7b-instruct-v0.1 mistral-7b-instruct-v0.1 2.8K 2.8K In: $0.11
Out: $0.19
Model: 0.055
Completion: 1.727
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-27
Updated: 2025-07-24
cf/llava-hf/llava-1.5-7b-hf llava-1.5-7b-hf - - - - 📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2023-12-05
Updated: 2025-06-06
cf/openai/gpt-oss-20b gpt-oss-20b 128K 128K In: $0.2
Out: $0.3
Model: 0.100
Completion: 1.500
- - In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
cf/deepseek-ai/deepseek-math-7b-instruct deepseek-math-7b-instruct 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-05
Updated: 2024-02-06
cf/openai/gpt-oss-120b gpt-oss-120b 128K 128K In: $0.35
Out: $0.75
Model: 0.175
Completion: 2.143
- - In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
cf/myshell-ai/melotts melotts - - In: $0.0002
Out: $-
Model: 0.000 📎 - In: text
Out: audio
Open Weights
Released: 2024-07-19
cf/qwen/qwen1.5-7b-chat-awq qwen1.5-7b-chat-awq 20K 20K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-03
Updated: 2024-04-30
cf/meta/llama-3.1-8b-instruct-fast llama-3.1-8b-instruct-fast 128K 128K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
Updated: 2024-09-25
cf/deepgram/nova-3 nova-3 - - In: $0.0052
Out: $0.0052
Model: 0.003
Completion: 1.000
- - In: audio
Out: text
Open Weights
Released: 2025-06-05
Updated: 2025-07-08
cf/meta/llama-3.1-70b-instruct llama-3.1-70b-instruct 24K 24K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-16
Updated: 2024-12-15
cf/qwen/qwq-32b qwq-32b 24K 24K In: $0.66
Out: $1
Model: 0.330
Completion: 1.515
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-05
Updated: 2025-03-11
hf/thebloke/zephyr-7b-beta-awq zephyr-7b-beta-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-10-27
Updated: 2023-11-09
hf/thebloke/deepseek-coder-6.7b-instruct-awq deepseek-coder-6.7b-instruct-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-05
Updated: 2023-11-13
cf/meta/llama-3.1-8b-instruct-awq llama-3.1-8b-instruct-awq 8.2K 8.2K In: $0.12
Out: $0.27
Model: 0.060
Completion: 2.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
cf/mistral/mistral-7b-instruct-v0.2-lora mistral-7b-instruct-v0.2-lora 15K 15K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-01
cf/unum/uform-gen2-qwen-500m uform-gen2-qwen-500m - - - - - - In: image, text
Out: text
Open Weights
Released: 2024-02-15
Updated: 2024-04-24

Cortecs

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Nova Pro 1.0 nova-pro-v1 300K 5K In: $1.016
Out: $4.061
Model: 0.508
Completion: 3.997
🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-12-03
Claude 4.5 Sonnet claude-4-5-sonnet 200K 200K In: $3.259
Out: $16.296
Model: 1.629
Completion: 5.000
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
DeepSeek V3 0324 deepseek-v3-0324 128K 128K In: $0.551
Out: $1.654
Model: 0.276
Completion: 3.002
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-03-24
Kimi K2 Instruct kimi-k2-instruct 131K 131K In: $0.551
Out: $2.646
Model: 0.276
Completion: 4.802
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-07-11
Updated: 2025-09-05
GPT 4.1 gpt-4.1 1M 32.8K In: $2.354
Out: $9.417
Model: 1.177
Completion: 4.000
🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-14
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K In: $1.654
Out: $11.024
Model: 0.827
Completion: 6.665
🔧 🌡️ 2025-01 In: text, image
Out: text
Released: 2025-03-20
Updated: 2025-06-17
GPT Oss 120b gpt-oss-120b 128K 128K - - 🔧 🌡️ 2024-01 In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 Coder 480B A35B Instruct qwen3-coder-480b-a35b-instruct 262K 262K In: $0.441
Out: $1.984
Model: 0.221
Completion: 4.499
🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-07-25
Claude Sonnet 4 claude-sonnet-4 200K 64K In: $3.307
Out: $16.536
Model: 1.653
Completion: 5.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Released: 2025-05-22
Llama 3.1 405B Instruct llama-3.1-405b-instruct 128K 128K - - 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Qwen3 32B qwen3-32b 16.4K 16.4K In: $0.099
Out: $0.33
Model: 0.050
Completion: 3.333
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-04-29

Deep Infra

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 moonshotai/Kimi-K2-Instruct 131.1K 32.8K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $0.4
Out: $1.6
Model: 0.200
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 Coder 480B A35B Instruct Turbo Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM-4.5 zai-org/GLM-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28

DeepSeek

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek Chat deepseek-chat 128K 8.2K In: $0.57
Out: $1.68
Cache Read: $0.07
Model: 0.285
Completion: 2.947
Cache: 0.123
📎 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-12-26
Updated: 2025-08-21
DeepSeek Reasoner deepseek-reasoner 128K 128K In: $0.57
Out: $1.68
Cache Read: $0.07
Model: 0.285
Completion: 2.947
Cache: 0.123
📎 🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-08-21
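
Prices in the Pricing column are in US dollars per 1M tokens, so the cost of a single request scales linearly with its token counts. The Python sketch below estimates that cost using the DeepSeek Chat row above. The function and parameter names are hypothetical, and it assumes cached input tokens are billed at the Cache Read price instead of the input price, which should be confirmed against the provider's own documentation.

```python
def estimate_cost(input_tokens, output_tokens, in_price, out_price,
                  cached_tokens=0, cache_read_price=None):
    """Estimate request cost in USD from $/1M-token prices.

    Assumption: cached input tokens are billed at cache_read_price
    instead of in_price; this page does not document billing rules.
    """
    if cache_read_price is None:
        cache_read_price = in_price  # no cache discount listed
    uncached = input_tokens - cached_tokens
    total = (uncached * in_price
             + cached_tokens * cache_read_price
             + output_tokens * out_price)
    return total / 1_000_000


# DeepSeek Chat: In $0.57, Out $1.68, Cache Read $0.07
cost = estimate_cost(10_000, 2_000, 0.57, 1.68,
                     cached_tokens=4_000, cache_read_price=0.07)
print(f"${cost:.5f}")  # $0.00706
```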

doubao

📖 API Address

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
doubao-seed-1-6-flash doubao-seed-1-6-flash 256K 32K - - 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-07-15
doubao-seed-1-6-thinking doubao-seed-1-6-thinking 256K 32K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-07-15
doubao-seed-1-6 doubao-seed-1-6 256K 32K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Released: 2025-06-11
Updated: 2025-06-15

ExampleCorp AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Novus 1 novus-1 128K 4.1K In: $5
Out: $15
Cache Read: $0.075
Cache Write: $0.5
Model: 2.500
Completion: 3.000
Cache: 0.015
📎 🧠 🔧 🌡️ 2024-07 In: text, image, audio, video, pdf
Out: text, image, audio, video, pdf
Released: 2025-01-20
Updated: 2025-08-21

FastRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 moonshotai/kimi-k2 131.1K 32.8K In: $0.55
Out: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Grok 4 x-ai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.0375
Model: 0.150
Completion: 8.333
Cache: 0.125
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT OSS 20B openai/gpt-oss-20b 131.1K 65.5K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
Qwen3 Coder qwen/qwen3-coder 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
DeepSeek R1 Distill Llama 70B deepseek-ai/deepseek-r1-distill-llama-70b 131.1K 131.1K In: $0.03
Out: $0.14
Model: 0.015
Completion: 4.667
🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23

Fireworks AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Deepseek R1 05/28 accounts/fireworks/models/deepseek-r1-0528 160K 16.4K In: $3
Out: $8
Model: 1.500
Completion: 2.667
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek V3.1 accounts/fireworks/models/deepseek-v3p1 163.8K 163.8K In: $0.56
Out: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-08-21
Deepseek V3 03-24 accounts/fireworks/models/deepseek-v3-0324 160K 16.4K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
Kimi K2 Instruct accounts/fireworks/models/kimi-k2-instruct 128K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Qwen3 235B-A22B accounts/fireworks/models/qwen3-235b-a22b 128K 16.4K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-29
GPT OSS 20B accounts/fireworks/models/gpt-oss-20b 131.1K 32.8K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B accounts/fireworks/models/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GLM 4.5 Air accounts/fireworks/models/glm-4p5-air 131.1K 131.1K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-08-01
Qwen3 Coder 480B A35B Instruct accounts/fireworks/models/qwen3-coder-480b-a35b-instruct 256K 32.8K In: $0.45
Out: $1.8
Model: 0.225
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-22
GLM 4.5 accounts/fireworks/models/glm-4p5 131.1K 131.1K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
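
The NewAPI Ratios column appears to follow from the Pricing column. In the rows above, Model matches half the input price, Completion matches the output price divided by the input price, and Cache (where listed) matches the cache-read price divided by the input price. The Python sketch below reproduces that arithmetic. The relationship is inferred from the listed values rather than from any official formula, and the function name `newapi_ratios` and the rounding to three decimals are assumptions made for illustration.

```python
from typing import Dict, Optional


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> Dict[str, float]:
    """Derive ratio values from $/1M-token prices.

    Assumption: the relationship is inferred from the table rows,
    not taken from an official NewAPI specification.
    """
    if input_price <= 0:
        # Rows without an input price ("-") have no ratios to derive.
        raise ValueError("input_price must be positive")
    ratios = {
        "model": round(input_price / 2, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# accounts/fireworks/models/deepseek-r1-0528: In $3, Out $8
print(newapi_ratios(3, 8))
# accounts/fireworks/models/glm-4p5: In $0.55, Out $2.19
print(newapi_ratios(0.55, 2.19))
```

For these two rows the computed values agree with the listed ratios (1.500 and 2.667, 0.275 and 3.982). Values that fall exactly on a rounding boundary may differ from the page in the last digit.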

GitHub Copilot

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.0 Flash gemini-2.0-flash-001 1M 8.2K - - 📎 🔧 🌡️ 2024-06 In: text, image, audio, video
Out: text
Released: 2024-12-11
Claude Opus 4 claude-opus-4 80K 16K - - 📎 🧠 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Grok Code Fast 1 grok-code-fast-1 256K 10K - - 🧠 🔧 🌡️ 2025-08 In: text
Out: text
Released: 2025-08-27
Claude Haiku 4.5 claude-haiku-4.5 144K 16K - - 📎 🧠 🔧 🌡️ 2025-02-31 In: text, image
Out: text
Released: 2025-10-15
Claude Sonnet 3.5 claude-3.5-sonnet 90K 8.2K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-22
o3-mini o3-mini 128K 65.5K - - 🧠 2024-10 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-5-Codex gpt-5-codex 128K 64K - - 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-4o gpt-4o 128K 16.4K - - 📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-4.1 gpt-4.1 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o4-mini (Preview) o4-mini 128K 65.5K - - 🧠 2024-10 In: text
Out: text
Released: 2025-04-16
Claude Opus 4.1 claude-opus-41 80K 16K - - 📎 🧠 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
GPT-5-mini gpt-5-mini 128K 64K - - 📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-08-13
Claude Sonnet 3.7 claude-3.7-sonnet 200K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
Gemini 2.5 Pro gemini-2.5-pro 128K 64K - - 📎 🔧 🌡️ 2025-01 In: text, image, audio, video
Out: text
Released: 2025-03-20
Updated: 2025-06-05
o3 (Preview) o3 128K 16.4K - - 📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
Claude Sonnet 4 claude-sonnet-4 128K 16K - - 📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
GPT-5 gpt-5 128K 64K - - 📎 🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-08-07
Claude Sonnet 3.7 Thinking claude-3.7-sonnet-thought 200K 16.4K - - 📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4.5 claude-sonnet-4.5 128K 16K - - 📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-09-29
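
The Capabilities column uses the emoji from the legend at the top of the page (🧠 Reasoning, 🔧 Tools, 📎 Attachment, 🌡️ Temperature). As a minimal sketch, a capabilities cell can be decoded into names as follows. The helper name `decode_capabilities` is hypothetical, and the code assumes the cell is a whitespace-separated run of legend emoji, with `-` meaning none.

```python
from typing import List

# Legend taken from the page header; keys are written as code points
# so the mapping does not depend on how the emoji were copied.
LEGEND = {
    "\U0001F9E0": "Reasoning",    # brain
    "\U0001F527": "Tools",        # wrench
    "\U0001F4CE": "Attachment",   # paperclip
    "\U0001F321": "Temperature",  # thermometer
}

VARIATION_SELECTOR = "\uFE0F"  # often trails the thermometer emoji


def decode_capabilities(cell: str) -> List[str]:
    """Return capability names for one Capabilities cell, in cell order."""
    names = []
    for token in cell.split():
        key = token.replace(VARIATION_SELECTOR, "")
        if key in LEGEND:
            names.append(LEGEND[key])
    return names


# Cell for claude-sonnet-4.5 in the table above
print(decode_capabilities("\U0001F4CE \U0001F9E0 \U0001F527 \U0001F321\uFE0F"))
# Cell for o3-mini in the table above
print(decode_capabilities("\U0001F9E0"))
```

Unknown tokens are skipped rather than rejected, so a placeholder such as `-` decodes to an empty list.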

GitHub Models

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
JAIS 30b Chat core42/jais-30b-chat 8.2K 2K - - 🧠 🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2023-08-30
Grok 3 xai/grok-3 128K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
Grok 3 Mini xai/grok-3-mini 128K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
Cohere Command R 08-2024 cohere/cohere-command-r-08-2024 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command A cohere/cohere-command-a 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-11-01
Cohere Command R+ 08-2024 cohere/cohere-command-r-plus-08-2024 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command R cohere/cohere-command-r 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-03-11
Updated: 2024-08-01
Cohere Command R+ cohere/cohere-command-r-plus 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-04-04
Updated: 2024-08-01
DeepSeek-R1-0528 deepseek/deepseek-r1-0528 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-R1 deepseek/deepseek-r1 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek-V3-0324 deepseek/deepseek-v3-0324 128K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-03-24
Mistral Medium 3 (25.05) mistral-ai/mistral-medium-2505 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-05-01
Ministral 3B mistral-ai/ministral-3b 128K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-10-22
Mistral Nemo mistral-ai/mistral-nemo 128K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-07-18
Mistral Large 24.11 mistral-ai/mistral-large-2411 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-11-01
Codestral 25.01 mistral-ai/codestral-2501 32K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2025-01-01
Mistral Small 3.1 mistral-ai/mistral-small-2503 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-03-01
Phi-3-medium instruct (128k) microsoft/phi-3-medium-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-mini instruct (4k) microsoft/phi-3-mini-4k-instruct 4.1K 1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-small instruct (128k) microsoft/phi-3-small-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3.5-vision instruct (128k) microsoft/phi-3.5-vision-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2024-08-20
Phi-4 microsoft/phi-4 16K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-4-mini-reasoning microsoft/phi-4-mini-reasoning 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-small instruct (8k) microsoft/phi-3-small-8k-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3.5-mini instruct (128k) microsoft/phi-3.5-mini-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-4-multimodal-instruct microsoft/phi-4-multimodal-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Open Weights
Released: 2024-12-11
Phi-3-mini instruct (128k) microsoft/phi-3-mini-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3.5-MoE instruct (128k) microsoft/phi-3.5-moe-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-4-mini-instruct microsoft/phi-4-mini-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-medium instruct (4k) microsoft/phi-3-medium-4k-instruct 4.1K 1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-4-Reasoning microsoft/phi-4-reasoning 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
MAI-DS-R1 microsoft/mai-ds-r1 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2025-01-20
GPT-4.1-nano openai/gpt-4.1-nano 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1-mini openai/gpt-4.1-mini 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI o1-preview openai/o1-preview 128K 32.8K - - 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
OpenAI o3-mini openai/o3-mini 200K 100K - - 🧠 2024-04 In: text
Out: text
Released: 2025-01-31
GPT-4o openai/gpt-4o 128K 16.4K - - 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-05-13
GPT-4.1 openai/gpt-4.1 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI o4-mini openai/o4-mini 200K 100K - - 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
OpenAI o1 openai/o1 200K 100K - - 🧠 2023-10 In: text, image
Out: text
Released: 2024-09-12
Updated: 2024-12-17
OpenAI o1-mini openai/o1-mini 128K 65.5K - - 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
Updated: 2024-12-17
OpenAI o3 openai/o3 200K 100K - - 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
GPT-4o mini openai/gpt-4o-mini 128K 16.4K - - 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-07-18
Llama-3.2-11B-Vision-Instruct meta/llama-3.2-11b-vision-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Meta-Llama-3.1-405B-Instruct meta/meta-llama-3.1-405b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 4 Maverick 17B 128E Instruct FP8 meta/llama-4-maverick-17b-128e-instruct-fp8 128K 8.2K - - 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Meta-Llama-3-70B-Instruct meta/meta-llama-3-70b-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Meta-Llama-3.1-70B-Instruct meta/meta-llama-3.1-70b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.3-70B-Instruct meta/llama-3.3-70b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Meta-Llama-3-8B-Instruct meta/meta-llama-3-8b-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Llama 4 Scout 17B 16E Instruct meta/llama-4-scout-17b-16e-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Meta-Llama-3.1-8B-Instruct meta/meta-llama-3.1-8b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
AI21 Jamba 1.5 Large ai21-labs/ai21-jamba-1.5-large 256K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29
AI21 Jamba 1.5 Mini ai21-labs/ai21-jamba-1.5-mini 256K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29

Google

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Flash Image gemini-2.5-flash-image 32.8K 32.8K In: $0.3
Out: $30
Cache Read: $0.075
Model: 0.150
Completion: 100.000
Cache: 0.250
📎 🧠 🌡️ 2025-06 In: text, image
Out: text, image
Released: 2025-08-26
Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-20
Gemini Flash-Lite Latest gemini-flash-lite-latest 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini Flash Latest gemini-flash-latest 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemini 2.5 Flash Preview TTS gemini-2.5-flash-preview-tts 8K 16K In: $0.5
Out: $10
Model: 0.250
Completion: 20.000
- 2025-01 In: text
Out: audio
Released: 2025-05-01
Gemini 2.0 Flash Lite gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini Live 2.5 Flash Preview Native Audio gemini-live-2.5-flash-preview-native-audio 131.1K 65.5K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🧠 🔧 2025-01 In: text, audio, video
Out: text, audio
Released: 2025-06-17
Updated: 2025-09-18
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini Live 2.5 Flash gemini-live-2.5-flash 128K 8K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video
Out: text, audio
Released: 2025-09-01
Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash Image (Preview) gemini-2.5-flash-image-preview 32.8K 32.8K In: $0.3
Out: $30
Cache Read: $0.075
Model: 0.150
Completion: 100.000
Cache: 0.250
📎 🧠 🌡️ 2025-06 In: text, image
Out: text, image
Released: 2025-08-26
Gemini 2.5 Flash Preview 09-25 gemini-2.5-flash-preview-09-2025 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-04-17
Gemini 2.5 Pro Preview TTS gemini-2.5-pro-preview-tts 8K 16K In: $1
Out: $20
Model: 0.500
Completion: 20.000
- 2025-01 In: text
Out: audio
Released: 2025-05-01
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 1.5 Flash gemini-1.5-flash 1M 8.2K In: $0.075
Out: $0.3
Cache Read: $0.01875
Model: 0.037
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-05-14
Gemini 1.5 Flash-8B gemini-1.5-flash-8b 1M 8.2K In: $0.0375
Out: $0.15
Cache Read: $0.01
Model: 0.019
Completion: 4.000
Cache: 0.267
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-10-03
Gemini 2.5 Flash Lite Preview 09-25 gemini-2.5-flash-lite-preview-09-2025 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 1.5 Pro gemini-1.5-pro 1M 8.2K In: $1.25
Out: $5
Cache Read: $0.3125
Model: 0.625
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-02-15
gemini-embedding-001 gemini-embedding-001 2K 3.1K In: $0.15
Out: $-
Model: 0.075 🔧 2025-06 In: text
Out: text
Released: 2025-06-01

Vertex

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-20
Gemini Flash-Lite Latest gemini-flash-lite-latest 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini Flash Latest gemini-flash-latest 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemini 2.0 Flash Lite gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash Lite gemini-2.5-flash-lite 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 65.5K 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash Preview 09-25 gemini-2.5-flash-preview-09-2025 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-04-17
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Flash Lite Preview 09-25 gemini-2.5-flash-lite-preview-09-2025 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25

Vertex (Anthropic)

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Sonnet 3.5 v2 claude-3-5-sonnet@20241022 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Haiku 3.5 claude-3-5-haiku@20241022 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 4 claude-sonnet-4@20250514 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4.1 claude-opus-4-1@20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 claude-3-7-sonnet@20250219 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Opus 4 claude-opus-4@20250514 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
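
Prices in the Pricing column are quoted per 1M tokens, so the cost of a single request scales linearly with its token counts. The sketch below estimates a request cost from the In, Out, and Cache Read prices. It assumes that cached input tokens are billed at the Cache Read price instead of the In price and it ignores Cache Write charges; both are simplifying assumptions, and actual billing rules should be checked against the provider's documentation. The function name `estimate_cost_usd` is hypothetical.

```python
from typing import Optional

TOKENS_PER_PRICE_UNIT = 1_000_000  # prices are listed in $ per 1M tokens


def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      in_price: float, out_price: float,
                      cached_tokens: int = 0,
                      cache_read_price: Optional[float] = None) -> float:
    """Estimate the cost in USD of one request (assumed linear billing)."""
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if cached_tokens and cache_read_price is None:
        raise ValueError("cache_read_price is required when cached_tokens > 0")

    uncached_tokens = input_tokens - cached_tokens
    cost = uncached_tokens * in_price + output_tokens * out_price
    if cached_tokens:
        cost += cached_tokens * cache_read_price
    return cost / TOKENS_PER_PRICE_UNIT


# claude-sonnet-4@20250514: In $3, Out $15, Cache Read $0.3
no_cache = estimate_cost_usd(10_000, 2_000, in_price=3, out_price=15)
with_cache = estimate_cost_usd(10_000, 2_000, in_price=3, out_price=15,
                               cached_tokens=8_000, cache_read_price=0.3)
print(f"no cache:   ${no_cache:.4f}")
print(f"with cache: ${with_cache:.4f}")
```

With 10,000 input tokens and 2,000 output tokens, the uncached estimate is $0.06. If 8,000 of the input tokens are served from cache, the estimate drops to about $0.0384.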

Groq

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.1 8B Instant llama-3.1-8b-instant 131.1K 8.2K In: $0.05
Out: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Mistral Saba 24B mistral-saba-24b 32.8K 32.8K In: $0.79
Out: $0.79
Model: 0.395
Completion: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2025-02-06
Llama 3 8B llama3-8b-8192 8.2K 8.2K In: $0.05
Out: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-04-18
Qwen QwQ 32B qwen-qwq-32b 131.1K 16.4K In: $0.29
Out: $0.39
Model: 0.145
Completion: 1.345
🧠 🔧 🌡️ 2024-09 In: text
Out: text
Open Weights
Released: 2024-11-27
Llama 3 70B llama3-70b-8192 8.2K 8.2K In: $0.59
Out: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-04-18
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 131.1K 8.2K In: $0.75
Out: $0.99
Model: 0.375
Completion: 1.320
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Llama Guard 3 8B llama-guard-3-8b 8.2K 8.2K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Gemma 2 9B gemma2-9b-it 8.2K 8.2K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2024-06-27
Llama 3.3 70B Versatile llama-3.3-70b-versatile 131.1K 32.8K In: $0.59
Out: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Kimi K2 Instruct 0905 moonshotai/kimi-k2-instruct-0905 262.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 Instruct moonshotai/kimi-k2-instruct 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.75
Model: 0.075
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 32B qwen/qwen3-32b 131.1K 16.4K In: $0.29
Out: $0.59
Model: 0.145
Completion: 2.034
🧠 🔧 🌡️ 2024-11-08 In: text
Out: text
Open Weights
Released: 2024-12-23
Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct 131.1K 8.2K In: $0.11
Out: $0.34
Model: 0.055
Completion: 3.091
🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct 131.1K 8.2K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama Guard 4 12B meta-llama/llama-guard-4-12b 131.1K 128 In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05

Hugging Face

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 262.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-04
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.3
Out: $3
Model: 0.150
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct 262.1K 66.5K In: $0.25
Out: $1
Model: 0.125
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking 262.1K 131.1K In: $0.3
Out: $2
Model: 0.150
Completion: 6.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
GLM-4.5 zai-org/GLM-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.6 zai-org/GLM-4.6 200K 128K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM-4.5-Air zai-org/GLM-4.5-Air 128K 96K In: $0.2
Out: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
DeepSeek-V3-0324 deepseek-ai/Deepseek-V3-0324 16.4K 8.2K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 163.8K 163.8K In: $3
Out: $5
Model: 1.500
Completion: 1.667
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28

Inception

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Mercury Coder mercury-coder 128K 16.4K In: $0.25
Out: $1
Cache Read: $0.25
Cache Write: $1
Model: 0.125
Completion: 4.000
Cache: 1.000
🔧 🌡️ 2023-10 In: text
Out: text
Released: 2025-02-26
Updated: 2025-07-31
Mercury mercury 128K 16.4K In: $0.25
Out: $1
Cache Read: $0.25
Cache Write: $1
Model: 0.125
Completion: 4.000
Cache: 1.000
🔧 🌡️ 2023-10 In: text
Out: text
Released: 2025-06-26
Updated: 2025-07-31

Inference

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Mistral Nemo 12B Instruct mistral/mistral-nemo-12b-instruct 16K 4.1K In: $0.038
Out: $0.1
Model: 0.019
Completion: 2.632
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Google Gemma 3 google/gemma-3 125K 4.1K In: $0.15
Out: $0.3
Model: 0.075
Completion: 2.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Osmosis Structure 0.6B osmosis/osmosis-structure-0.6b 4K 2K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen 3 Embedding 4B qwen/qwen3-embedding-4b 32K 2K In: $0.01
Out: $-
Model: 0.005 - 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen 2.5 7B Vision Instruct qwen/qwen-2.5-7b-vision-instruct 125K 4.1K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 11B Vision Instruct meta/llama-3.2-11b-vision-instruct 16K 4.1K In: $0.055
Out: $0.055
Model: 0.028
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct 16K 4.1K In: $0.025
Out: $0.025
Model: 0.013
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct 16K 4.1K In: $0.02
Out: $0.02
Model: 0.010
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 1B Instruct meta/llama-3.2-1b-instruct 16K 4.1K In: $0.01
Out: $0.01
Model: 0.005
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01

Llama

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama-3.3-8B-Instruct llama-3.3-8b-instruct 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Maverick-17B-128E-Instruct-FP8 llama-4-maverick-17b-128e-instruct-fp8 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Scout-17B-16E-Instruct-FP8 llama-4-scout-17b-16e-instruct-fp8 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Groq-Llama-4-Maverick-17B-128E-Instruct groq-llama-4-maverick-17b-128e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05
Cerebras-Llama-4-Scout-17B-16E-Instruct cerebras-llama-4-scout-17b-16e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05
Cerebras-Llama-4-Maverick-17B-128E-Instruct cerebras-llama-4-maverick-17b-128e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05

LMStudio

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K - - 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 30B A3B 2507 qwen/qwen3-30b-a3b-2507 262.1K 16.4K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 Coder 30B qwen/qwen3-coder-30b 262.1K 65.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

LucidQuery AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
LucidQuery Nexus Coder lucidquery-nexus-coder 250K 60K In: $2
Out: $5
Model: 1.000
Completion: 2.500
📎 🧠 🔧 2025-08-01 In: text
Out: text
Released: 2025-09-01
LucidNova RF1 100B lucidnova-rf1-100b 120K 8K In: $2
Out: $5
Model: 1.000
Completion: 2.500
📎 🧠 🔧 2025-09-16 In: text
Out: text
Released: 2024-12-28
Updated: 2025-09-10

Mistral

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Devstral Medium devstral-medium-2507 128K 128K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Mixtral 8x22B open-mixtral-8x22b 64K 64K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-17
Ministral 8B ministral-8b-latest 128K 128K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Pixtral Large pixtral-large-latest 128K 128K In: $2
Out: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Ministral 3B ministral-3b-latest 128K 128K In: $0.04
Out: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Pixtral 12B pixtral-12b 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Mistral Medium 3 mistral-medium-2505 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Devstral Small 2505 devstral-small-2505 128K 128K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-07
Mistral Medium 3.1 mistral-medium-2508 262.1K 262.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Mistral Small mistral-small-latest 128K 16.4K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Updated: 2024-09-04
Magistral Small magistral-small 128K 128K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Devstral Small devstral-small-2507 128K 128K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Codestral codestral-latest 256K 4.1K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Mixtral 8x7B open-mixtral-8x7b 32K 32K In: $0.7
Out: $0.7
Model: 0.350
Completion: 1.000
🔧 🌡️ 2024-01 In: text
Out: text
Open Weights
Released: 2023-12-11
Mistral Nemo mistral-nemo 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-01
Mistral 7B open-mistral-7b 8K 8K In: $0.25
Out: $0.25
Model: 0.125
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2023-09-27
Mistral Large mistral-large-latest 131.1K 16.4K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mistral Medium mistral-medium-latest 128K 16.4K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text, image
Out: text
Open Weights
Released: 2025-05-07
Updated: 2025-05-10
Magistral Medium magistral-medium-latest 128K 16.4K In: $2
Out: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20

ModelScope

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.5 ZhipuAI/GLM-4.5 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.6 ZhipuAI/GLM-4.6 202.8K 98.3K - - 🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-30
Qwen3 30B A3B Thinking 2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262.1K 32.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262.1K 65.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-31
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 16.4K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25

Moonshot AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 Turbo kimi-k2-turbo-preview 262.1K 262.1K In: $2.4
Out: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
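
The NewAPI Ratios column in these tables appears to follow directly from the Pricing column. The sketch below reproduces the Moonshot AI rows above under three assumptions that are inferred from the listed numbers and not stated on this page: a model ratio of 1.0 corresponds to $2 per 1M input tokens, the completion ratio is output price divided by input price, and the cache ratio is cache-read price divided by input price. The function name `newapi_ratios` and the constant are hypothetical, chosen for illustration only.

```python
from typing import Dict, Optional

# Assumption (inferred from the tables, not stated on the page):
# a model ratio of 1.0 corresponds to $2 per 1M input tokens.
BASE_PRICE_PER_MILLION = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Dict[str, float]:
    """Derive ratio values from $/1M prices, rounded to 3 decimals as in the tables."""
    ratios = {
        # Input price expressed relative to the assumed $2 / 1M baseline.
        "model": round(input_price / BASE_PRICE_PER_MILLION, 3),
        # Output tokens priced as a multiple of input tokens.
        "completion": round(output_price / input_price, 3),
    }
    # The cache ratio is only listed for models that have a cache-read price.
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# Kimi K2 Turbo row: In $2.4, Out $10, Cache Read $0.6
kimi_k2_turbo = newapi_ratios(2.4, 10, 0.6)
# Kimi K2 0711 row: In $0.6, Out $2.5, Cache Read $0.15
kimi_k2_0711 = newapi_ratios(0.6, 2.5, 0.15)

print(kimi_k2_turbo)
print(kimi_k2_0711)
```

Rows that show "-" for pricing have no ratios to derive, so a caller would skip them before calling this helper; it does not guard against a zero input price.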

Moonshot AI (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 Turbo kimi-k2-turbo-preview 262.1K 262.1K In: $2.4
Out: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05

Morph

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Morph v3 Large morph-v3-large 32K 32K In: $0.9
Out: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15
Auto auto 32K 32K In: $0.85
Out: $1.55
Model: 0.425
Completion: 1.824
- - In: text
Out: text
Released: 2024-06-01
Morph v3 Fast morph-v3-fast 16K 16K In: $0.8
Out: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15

Nebius AI Studio

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Hermes 4 70B NousResearch/hermes-4-70b 131.1K 8.2K In: $0.13
Out: $0.4
Model: 0.065
Completion: 3.077
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-08-01
Updated: 2025-10-04
Hermes-4 405B NousResearch/hermes-4-405b 131.1K 8.2K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-08-01
Updated: 2025-10-04
Kimi K2 Instruct moonshotai/kimi-k2-instruct 131.1K 8.2K In: $0.5
Out: $2.4
Model: 0.250
Completion: 4.800
🧠 🔧 🌡️ 2024-01 In: text
Out: text
Released: 2025-01-01
Updated: 2025-10-04
Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3_1-nemotron-ultra-253b-v1 131.1K 8.2K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-01
Updated: 2025-10-04
GPT OSS 20B openai/gpt-oss-20b 131.1K 8.2K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
📎 🧠 🔧 🌡️ 2024-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-10-04
GPT OSS 120B openai/gpt-oss-120b 131.1K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
📎 🧠 🔧 🌡️ 2024-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-10-04
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-instruct-2507 262.1K 8.2K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-25
Updated: 2025-10-04
Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 262.1K 8.2K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-25
Updated: 2025-10-04
Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct 262.1K 66.5K In: $0.4
Out: $1.8
Model: 0.200
Completion: 4.500
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
Updated: 2025-10-04
Llama 3.1 405B Instruct meta-llama/llama-3_1-405b-instruct 131.1K 8.2K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-07-23
Updated: 2025-10-04
Llama-3.3-70B-Instruct (Fast) meta-llama/llama-3.3-70b-instruct-fast 131.1K 8.2K In: $0.25
Out: $0.75
Model: 0.125
Completion: 3.000
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-22
Updated: 2025-10-04
Llama-3.3-70B-Instruct (Base) meta-llama/llama-3.3-70b-instruct-base 131.1K 8.2K In: $0.13
Out: $0.4
Model: 0.065
Completion: 3.077
🧠 🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-22
Updated: 2025-10-04
GLM 4.5 zai-org/glm-4.5 131.1K 8.2K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2024-05 In: text
Out: text
Released: 2024-06-01
Updated: 2025-10-04
GLM 4.5 Air zai-org/glm-4.5-air 131.1K 8.2K In: $0.2
Out: $1.2
Model: 0.100
Completion: 6.000
🧠 🔧 🌡️ 2024-05 In: text
Out: text
Released: 2024-06-01
Updated: 2025-10-04
DeepSeek V3 deepseek-ai/deepseek-v3 131.1K 8.2K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2024-04 In: text
Out: text
Released: 2024-05-07
Updated: 2025-10-04

Nvidia

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 0905 moonshotai/kimi-k2-instruct-0905 262.1K 262.1K - - 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 Instruct moonshotai/kimi-k2-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2024-01 In: text
Out: text
Released: 2025-01-01
Updated: 2025-09-05
Cosmos Nemotron 34B nvidia/cosmos-nemotron-34b 131.1K 8.2K - - 🧠 🌡️ 2024-01 In: text, image, video
Out: text
Released: 2024-01-01
Updated: 2025-09-05
Parakeet TDT 0.6B v2 nvidia/parakeet-tdt-0.6b-v2 - 4.1K - - - 2024-01 In: audio
Out: text
Released: 2024-01-01
Updated: 2025-09-05
NeMo Retriever OCR v1 nvidia/nemoretriever-ocr-v1 - 4.1K - - - 2024-01 In: image
Out: text
Released: 2024-01-01
Updated: 2025-09-05
Llama-3.1-Nemotron-Ultra-253B-v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 131.1K 8.2K - - 🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-07-01
Updated: 2025-09-05
Gemma-3-27B-IT google/gemma-3-27b-it 131.1K 8.2K - - 📎 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-01
Updated: 2025-09-05
Phi-4-Mini microsoft/phi-4-mini-instruct 131.1K 8.2K - - 📎 🧠 🔧 🌡️ 2024-12 In: text, image, audio
Out: text
Released: 2024-12-01
Updated: 2025-09-05
Whisper Large v3 openai/whisper-large-v3 - 4.1K - - - 2023-09 In: audio
Out: text
Open Weights
Released: 2023-09-01
Updated: 2025-09-05
GPT-OSS-120B openai/gpt-oss-120b 128K 8.2K - - 📎 🧠 🌡️ 2024-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-05
Qwen3-235B-A22B qwen/qwen3-235b-a22b 131.1K 8.2K - - 🧠 🔧 🌡️ 2024-12 In: text
Out: text
Released: 2024-12-01
Updated: 2025-09-05
Qwen3 Coder 480B A35B Instruct qwen/qwen3-coder-480b-a35b-instruct 262.1K 66.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-07-23
DeepSeek V3.1 Terminus deepseek-ai/deepseek-v3.1-terminus 128K 8.2K - - 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-09-22
DeepSeek V3.1 deepseek-ai/deepseek-v3.1 128K 8.2K - - 🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-08-20
Updated: 2025-08-26
FLUX.1-dev black-forest-labs/flux.1-dev 4.1K - - - 🌡️ 2024-08 In: text
Out: image
Released: 2024-08-01
Updated: 2025-09-05

OpenAI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT-4.1 nano gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4 gpt-4 8.2K 8.2K In: $30
Out: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o1-pro o1-pro 200K 100K In: $150
Out: $600
Model: 75.000
Completion: 4.000
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2025-03-19
GPT-4o (2024-05-13) gpt-4o-2024-05-13 128K 4.1K In: $5
Out: $15
Model: 2.500
Completion: 3.000
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-4o (2024-08-06) gpt-4o-2024-08-06 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-08-06
GPT-4.1 mini gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o3-deep-research o3-deep-research 200K 100K In: $10
Out: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
GPT-3.5-turbo gpt-3.5-turbo 16.4K 4.1K In: $0.5
Out: $1.5
Cache Read: $1.25
Model: 0.250
Completion: 3.000
Cache: 2.500
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
GPT-4 Turbo gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o1-preview o1-preview 128K 32.8K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🌡️ 2023-09 In: text
Out: text
Released: 2024-09-12
o3-mini o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
Codex Mini codex-mini-latest 200K 100K In: $1.5
Out: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16
GPT-5 Nano gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5-Codex gpt-5-codex 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-4o gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-4.1 gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o4-mini o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o1 o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-5 Mini gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o1-mini o1-mini 128K 65.5K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 2023-09 In: text
Out: text
Released: 2024-09-12
o3-pro o3-pro 200K 100K In: $20
Out: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-06-10
GPT-4o (2024-11-20) gpt-4o-2024-11-20 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-11-20
o3 o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o4-mini-deep-research o4-mini-deep-research 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
GPT-5 Chat (latest) gpt-5-chat-latest 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o mini gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-5 gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
DALL-E 2 dall-e-2 1K 1 In: $0.02
Out: $0.1
Cache Read: $0.01
Cache Write: $0.05
Model: 0.010
Completion: 5.000
Cache: 0.500
📎 🔧 2021-04 In: text
Out: image
Released: 2022-04-06
Updated: 2022-06-15
DALL-E 3 dall-e-3 2K 1 In: $0.03
Out: $0.15
Cache Read: $0.01
Cache Write: $0.05
Model: 0.015
Completion: 5.000
Cache: 0.333
📎 🔧 2024-04 In: text
Out: image
Released: 2024-03-01
Updated: 2024-08-15
GPT-IMAGE-1 gpt-image-1 1K 512 In: $10
Out: $20
Cache Read: $0.1
Cache Write: $0.6
Model: 5.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: image
Open Weights
Released: 2024-01-15
Updated: 2024-10-01
TEXT-EMBEDDING-3-LARGE text-embedding-3-large 64K 2K In: $7
Out: $10
Cache Read: $0.05
Cache Write: $0.4
Model: 3.500
Completion: 1.429
Cache: 0.007
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-12-15
Updated: 2023-10-01
TEXT-EMBEDDING-3-SMALL text-embedding-3-small 32K 1K In: $4
Out: $8
Cache Read: $0.04
Cache Write: $0.3
Model: 2.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-11-10
Updated: 2023-10-01
TEXT-EMBEDDING-ADA-002 text-embedding-ada-002 60K 1.5K In: $6
Out: $12
Cache Read: $0.06
Cache Write: $0.45
Model: 3.000
Completion: 2.000
Cache: 0.010
📎 🧠 🔧 🌡️ 2023-10 In: text
Out: vector
Released: 2023-11-20
Updated: 2023-10-01

OpenCode Zen

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 Coder qwen3-coder 262.1K 65.5K In: $0.45
Out: $1.8
Model: 0.225
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Claude Opus 4.1 claude-opus-4-1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Kimi K2 kimi-k2 262.1K 262.1K In: $0.6
Out: $2.5
Cache Read: $0.36
Model: 0.300
Completion: 4.167
Cache: 0.600
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Claude Haiku 4.5 claude-haiku-4-5 200K 64K In: $1
Out: $1.25
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 1.250
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02 In: text, image
Out: text
Released: 2025-10-15
Claude Sonnet 4.5 claude-sonnet-4-5 1M 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
GPT-5-Codex gpt-5-codex 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
Code GBT (alpha) an-gbt 200K 128K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-01
Big Pickle big-pickle 200K 128K - - 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Released: 2025-10-17
Claude Haiku 3.5 claude-3-5-haiku 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
GLM-4.6 (beta) glm-4.6 204.8K 131.1K In: $0.6
Out: $1.9
Model: 0.300
Completion: 3.167
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
Grok Code Fast 1 grok-code 256K 256K - - 📎 🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-20
Code Supernova 1M code-supernova 1M 1M - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-09-19
Claude Sonnet 4 claude-sonnet-4 1M 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
GPT-5 gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07

OpenRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 moonshotai/kimi-k2 131.1K 32.8K In: $0.55
Out: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2 Instruct 0905 moonshotai/kimi-k2-0905 262.1K 16.4K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi Dev 72b (free) moonshotai/kimi-dev-72b:free 131.1K 131.1K - - 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-06-16
Kimi K2 (free) moonshotai/kimi-k2:free 32.8K 32.8K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-11
GLM Z1 32B (free) thudm/glm-z1-32b:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-17
Hermes 4 70B nousresearch/hermes-4-70b 131.1K 131.1K In: $0.13
Out: $0.4
Model: 0.065
Completion: 3.077
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-08-25
Hermes 4 405B nousresearch/hermes-4-405b 131.1K 131.1K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-08-25
DeepHermes 3 Llama 3 8B Preview nousresearch/deephermes-3-llama-3-8b-preview 131.1K 8.2K - - 🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2025-02-28
Grok 4 x-ai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok Code Fast 1 x-ai/grok-code-fast-1 256K 10K In: $0.2
Out: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Released: 2025-08-26
Grok 4 Fast (free) x-ai/grok-4-fast:free 2M 2M - - 🧠 🔧 🌡️ 2024-11 In: text, image
Out: text
Released: 2025-08-19
Grok 3 x-ai/grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 Fast x-ai/grok-4-fast 2M 30K In: $0.2
Out: $0.5
Cache Read: $0.05
Cache Write: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text, image
Out: text
Released: 2025-08-19
Grok 3 Beta x-ai/grok-3-beta 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini Beta x-ai/grok-3-mini-beta 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini x-ai/grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Dolphin3.0 Mistral 24B cognitivecomputations/dolphin3.0-mistral-24b 32.8K 8.2K - - 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-02-13
Dolphin3.0 R1 Mistral 24B cognitivecomputations/dolphin3.0-r1-mistral-24b 32.8K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-02-13
DeepSeek-V3.1 deepseek/deepseek-chat-v3.1 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-08-21
R1 (free) deepseek/deepseek-r1:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek V3 Base (free) deepseek/deepseek-v3-base:free 163.8K 163.8K - - 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-29
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 131.1K 65.5K In: $0.27
Out: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-22
Deepseek R1 0528 Qwen3 8B (free) deepseek/deepseek-r1-0528-qwen3-8b:free 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-29
DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 16.4K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
R1 0528 (free) deepseek/deepseek-r1-0528:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 8.2K 8.2K - - 🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
DeepSeek R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b 64K 8.2K - - 🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-29
Qwerky 72B featherless/qwerky-72b 32.8K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-20
DeepSeek R1T2 Chimera (free) tngtech/deepseek-r1t2-chimera:free 163.8K 163.8K - - 🧠 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-08
Gemini 2.0 Flash google/gemini-2.0-flash-001 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemma 2 9B (free) google/gemma-2-9b-it:free 8.2K 8.2K - - 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2024-06-28
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.0375
Model: 0.150
Completion: 8.333
Cache: 0.125
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-07-17
Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemma 3n E4B IT google/gemma-3n-e4b-it 8.2K 8.2K - - 📎 🌡️ 2024-10 In: text, image, audio
Out: text
Open Weights
Released: 2025-05-20
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini 2.5 Flash Preview 09-25 google/gemini-2.5-flash-preview-09-2025 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.031
Model: 0.150
Completion: 8.333
Cache: 0.103
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemma 3 12B IT google/gemma-3-12b-it 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-13
Gemma 3n 4B (free) google/gemma-3n-e4b-it:free 8.2K 8.2K - - 📎 🔧 🌡️ 2025-05 In: text, image, audio
Out: text
Open Weights
Released: 2025-05-20
Gemini 2.5 Flash Lite Preview 09-25 google/gemini-2.5-flash-lite-preview-09-2025 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.0 Flash Experimental (free) google/gemini-2.0-flash-exp:free 1M 1M - - 📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-11
Gemma 3 27B IT google/gemma-3-27b-it 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-12
MAI DS R1 (free) microsoft/mai-ds-r1:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-21
GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Chat (latest) openai/gpt-5-chat 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Model: 0.025
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Codex openai/gpt-5-codex 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-09-15
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o4 Mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-16
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Image openai/gpt-5-image 400K 128K In: $5
Out: $10
Cache Read: $1.25
Model: 2.500
Completion: 2.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image, pdf
Out: text, image
Released: 2025-10-14
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.072
Out: $0.28
Model: 0.036
Completion: 3.889
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-4o-mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-07-18
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
Horizon Alpha openrouter/horizon-alpha 256K 128K - - 📎 🔧 2025-07 In: text, image
Out: text
Released: 2025-07-30
Sonoma Sky Alpha openrouter/sonoma-sky-alpha 2M 2M - - 📎 🔧 - In: text, image
Out: text
Released: 2024-09-05
Cypher Alpha (free) openrouter/cypher-alpha:free 1M 1M - - 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-01
Sonoma Dusk Alpha openrouter/sonoma-dusk-alpha 2M 2M - - 📎 🔧 - In: text, image
Out: text
Released: 2024-09-05
Horizon Beta openrouter/horizon-beta 256K 128K - - 📎 🔧 2025-07 In: text, image
Out: text
Released: 2025-08-01
GLM 4.5 z-ai/glm-4.5 128K 96K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air z-ai/glm-4.5-air 128K 96K In: $0.2
Out: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V z-ai/glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM 4.6 z-ai/glm-4.6 200K 128K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM 4.5 Air (free) z-ai/glm-4.5-air:free 128K 96K - - 🧠 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Qwen3 Coder qwen/qwen3-coder 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 32B (free) qwen/qwen3-32b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct 262.1K 262.1K In: $0.14
Out: $1.4
Model: 0.070
Completion: 10.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct 32.8K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen3 235B A22B (free) qwen/qwen3-235b-a22b:free 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
QwQ 32B (free) qwen/qwq-32b:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-05
Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 262K 262K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen3 30B A3B (free) qwen/qwen3-30b-a3b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct 32.8K 8.2K - - 📎 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-02-01
Qwen3 14B (free) qwen/qwen3-14b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 262K 262K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 262.1K 81.9K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen2.5 VL 32B Instruct (free) qwen/qwen2.5-vl-32b-instruct:free 8.2K 8.2K - - 📎 🔧 🌡️ 2025-03 In: text, image, video
Out: text
Open Weights
Released: 2025-03-24
Qwen2.5 VL 72B Instruct (free) qwen/qwen2.5-vl-72b-instruct:free 32.8K 32.8K - - 📎 🔧 🌡️ 2025-02 In: text, image
Out: text
Open Weights
Released: 2025-02-01
Qwen3 235B A22B Instruct 2507 (free) qwen/qwen3-235b-a22b-07-25:free 262.1K 131.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 Coder 480B A35B Instruct (free) qwen/qwen3-coder:free 262.1K 66.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-07-25 262.1K 131.1K In: $0.15
Out: $0.85
Model: 0.075
Completion: 5.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 8B (free) qwen/qwen3-8b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 Max qwen/qwen3-max 262.1K 32.8K In: $1.2
Out: $6
Model: 0.600
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-05
Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking 262.1K 262.1K In: $0.14
Out: $1.4
Model: 0.070
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-11
Devstral Medium mistralai/devstral-medium-2507 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Codestral 2508 mistralai/codestral-2508 256K 256K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-08-01
Mistral 7B Instruct (free) mistralai/mistral-7b-instruct:free 32.8K 32.8K - - 🔧 🌡️ 2024-05 In: text
Out: text
Open Weights
Released: 2024-05-27
Devstral Small mistralai/devstral-small-2505 128K 128K In: $0.06
Out: $0.12
Model: 0.030
Completion: 2.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-07
Mistral Small 3.2 24B Instruct mistralai/mistral-small-3.2-24b-instruct 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Devstral Small 2505 (free) mistralai/devstral-small-2505:free 32.8K 32.8K - - 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-21
Mistral Small 3.2 24B (free) mistralai/mistral-small-3.2-24b-instruct:free 96K 96K - - 📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Mistral Medium 3 mistralai/mistral-medium-3 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Mistral Small 3.1 24B Instruct mistralai/mistral-small-3.1-24b-instruct 128K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-17
Devstral Small 1.1 mistralai/devstral-small-2507 131.1K 131.1K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Mistral Medium 3.1 mistralai/mistral-medium-3.1 262.1K 262.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Mistral Nemo (free) mistralai/mistral-nemo:free 131.1K 131.1K - - 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-19
Reka Flash 3 rekaai/reka-flash-3 32.8K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-12
Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct 131.1K 8.2K - - 📎 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free 65.5K 65.5K - - 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 4 Scout (free) meta-llama/llama-4-scout:free 64K 64K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Claude Opus 4 anthropic/claude-opus-4 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Haiku 4.5 anthropic/claude-haiku-4.5 200K 64K In: $1
Out: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02 In: text, image
Out: text
Released: 2025-10-15
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 anthropic/claude-3.7-sonnet 200K 128K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-01 In: text, image
Out: text
Released: 2025-02-19
Claude Haiku 3.5 anthropic/claude-3.5-haiku 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 1M 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Sarvam-M (free) sarvamai/sarvam-m:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-25

Perplexity

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Sonar Reasoning sonar-reasoning 128K 4.1K In: $1
Out: $5
Model: 0.500
Completion: 5.000
🧠 🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar sonar 128K 4.1K In: $1
Out: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-09-01 In: text
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar Pro sonar-pro 200K 8.2K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
Sonar Reasoning Pro sonar-reasoning-pro 128K 4.1K In: $2
Out: $8
Model: 1.000
Completion: 4.000
📎 🧠 🌡️ 2025-09-01 In: text, image
Out: text
Released: 2024-01-01
Updated: 2025-09-01
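
The NewAPI Ratios column appears to be derived from the prices rather than reported independently. In the rows above, Model equals the input price divided by 2 (so a ratio of 1.000 corresponds to $2 per 1M tokens), Completion equals the output price divided by the input price, and Cache equals the cache-read price divided by the input price. The Python sketch below reproduces that relationship. The function name newapi_ratios and the base price of 2.0 are assumptions inferred from the listed values, not taken from the site's generator.

```python
from typing import Optional, Tuple

# Assumed base: a model ratio of 1.000 corresponds to $2 per 1M input tokens.
# Inferred from the rows above (e.g. In: $1 -> Model: 0.500).
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(
    input_price: float,
    output_price: float,
    cache_read_price: Optional[float] = None,
) -> Tuple[float, float, Optional[float]]:
    """Return (model, completion, cache) ratios rounded to 3 decimals."""
    if input_price <= 0:
        raise ValueError("input_price must be positive")
    model = round(input_price / BASE_PRICE_PER_1M, 3)
    completion = round(output_price / input_price, 3)
    cache = None
    if cache_read_price is not None:
        cache = round(cache_read_price / input_price, 3)
    return model, completion, cache


# Sonar Reasoning Pro from the table above: In: $2, Out: $8.
print(newapi_ratios(2.0, 8.0))  # (1.0, 4.0, None)
```

Values that fall exactly on a rounding boundary may differ in the last digit from what the page shows, since the page's rounding rule is not stated.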

Requesty

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.55
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Cache Write: $2.375
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Nano openai/gpt-5-nano 16K 4K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o4 Mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-16
GPT-5 Mini openai/gpt-5-mini 128K 32K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o Mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-07-18
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, audio, image, video
Out: text, audio, image
Released: 2025-08-07
Claude Opus 4 anthropic/claude-opus-4 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 3.7 anthropic/claude-3-7-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-01 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4 anthropic/claude-4-sonnet-20250522 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4.1 anthropic/claude-opus-4-1-20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05

Scaleway

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 235B A22B Instruct 2507 qwen3-235b-a22b-instruct-2507 40K 4.1K In: $0.75
Out: $2.25
Model: 0.375
Completion: 3.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-01
Pixtral 12B 2409 pixtral-12b-2409 128K 4.1K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 3.1 8B Instruct llama-3.1-8b-instruct 128K 16.4K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
Mistral Nemo Instruct 2407 mistral-nemo-instruct-2407 128K 8.2K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
Mistral Small 3.2 24B Instruct 2506 mistral-small-3.2-24b-instruct-2506 128K 8.2K In: $0.15
Out: $0.35
Model: 0.075
Completion: 2.333
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-06-01
Qwen3 Coder 30B A3B Instruct qwen3-coder-30b-a3b-instruct 128K 8.2K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-01
Llama 3.3 70B Instruct llama-3.3-70b-instruct 100K 4.1K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-15
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 32K 4.1K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
📎 🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
Voxtral Small 24B 2507 voxtral-small-24b-2507 32K 8.2K In: $0.15
Out: $0.35
Model: 0.075
Completion: 2.333
📎 🔧 🌡️ - In: text, audio
Out: text
Open Weights
Released: 2025-07-01
GPT-OSS 120B gpt-oss-120b 128K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-01
Gemma 3 27B IT gemma-3-27b-it 40K 8.2K In: $0.25
Out: $0.5
Model: 0.125
Completion: 2.000
📎 🔧 🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-03-01

submodel

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.2
Out: $0.3
Model: 0.100
Completion: 1.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
GLM 4.5 FP8 zai-org/GLM-4.5-FP8 131.1K 131.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai-org/GLM-4.5-Air 131.1K 131.1K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 75K 163.8K In: $0.5
Out: $2.15
Model: 0.250
Completion: 4.300
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 75K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 75K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23

Synthetic

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen 3 235B Instruct hf:Qwen/Qwen3-235B-A22B-Instruct-2507 256K 32K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen2.5-Coder-32B-Instruct hf:Qwen/Qwen2.5-Coder-32B-Instruct 32.8K 32.8K In: $0.8
Out: $0.8
Model: 0.400
Completion: 1.000
🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen 3 Coder 480B hf:Qwen/Qwen3-Coder-480B-A35B-Instruct 256K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 235B A22B Thinking 2507 hf:Qwen/Qwen3-235B-A22B-Thinking-2507 256K 32K In: $0.65
Out: $3
Model: 0.325
Completion: 4.615
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Llama-3.1-70B-Instruct hf:meta-llama/Llama-3.1-70B-Instruct 128K 32.8K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.1-8B-Instruct hf:meta-llama/Llama-3.1-8B-Instruct 128K 32.8K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.3-70B-Instruct hf:meta-llama/Llama-3.3-70B-Instruct 128K 32.8K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Scout-17B-16E-Instruct hf:meta-llama/Llama-4-Scout-17B-16E-Instruct 328K 4.1K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-4-Maverick-17B-128E-Instruct-FP8 hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 524K 4.1K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.1-405B-Instruct hf:meta-llama/Llama-3.1-405B-Instruct 128K 32.8K In: $3
Out: $3
Model: 1.500
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Kimi K2 hf:moonshotai/Kimi-K2-Instruct 128K 32.8K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2 0905 hf:moonshotai/Kimi-K2-Instruct-0905 262.1K 32.8K In: $1.2
Out: $1.2
Model: 0.600
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
GLM 4.5 hf:zai-org/GLM-4.5 128K 96K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.6 hf:zai-org/GLM-4.6 200K 96K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
DeepSeek R1 hf:deepseek-ai/DeepSeek-R1 128K 128K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek R1 (0528) hf:deepseek-ai/DeepSeek-R1-0528 128K 128K In: $3
Out: $8
Model: 1.500
Completion: 2.667
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek V3.1 Terminus hf:deepseek-ai/DeepSeek-V3.1-Terminus 128K 128K In: $1.2
Out: $1.2
Model: 0.600
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-22
Updated: 2025-09-25
DeepSeek V3 hf:deepseek-ai/DeepSeek-V3 128K 128K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
DeepSeek V3.1 hf:deepseek-ai/DeepSeek-V3.1 128K 128K In: $0.56
Out: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3 (0324) hf:deepseek-ai/DeepSeek-V3-0324 128K 128K In: $1.2
Out: $1.2
Model: 0.600
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
GPT OSS 120B hf:openai/gpt-oss-120b 128K 32.8K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05

Together AI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 Instruct moonshotai/Kimi-K2-Instruct 131.1K 32.8K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
GPT OSS 120B openai/gpt-oss-120b 131.1K 131.1K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-Turbo 131.1K 66.5K In: $0.88
Out: $0.88
Model: 0.440
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 66.5K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
DeepSeek R1 deepseek-ai/DeepSeek-R1 163.8K 12.3K In: $3
Out: $7
Model: 1.500
Completion: 2.333
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-12-26
Updated: 2025-03-24
DeepSeek V3 deepseek-ai/DeepSeek-V3 131.1K 12.3K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29

Upstage

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
solar-mini solar-mini 32.8K 4.1K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-06-12
Updated: 2025-04-22
solar-pro2 solar-pro2 65.5K 8.2K In: $0.25
Out: $0.25
Model: 0.125
Completion: 1.000
🧠 🔧 🌡️ 2025-03 In: text
Out: text
Released: 2025-05-20

v0

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
v0-1.5-lg v0-1.5-lg 512K 32K In: $15
Out: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
v0-1.5-md v0-1.5-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
v0-1.0-md v0-1.0-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-22

Venice AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Dolphin 72B dolphin-2.9.2-qwen2-72b 32.8K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🌡️ 2021-09 In: text
Out: text
Open Weights
Released: 2025-05-21
Venice Medium mistral-31-24b 131.1K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2025-07-15
Venice Uncensored 1.1 venice-uncensored 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-07-15
Qwen 2.5 VL 72B qwen-2.5-vl 32.8K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2025-06-09
Venice Large qwen3-235b 131.1K 8.2K In: $1.5
Out: $6
Model: 0.750
Completion: 4.000
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-27
Venice Reasoning qwen-2.5-qwq-32b 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-07-08
DeepSeek Coder V2 Lite deepseek-coder-v2-lite 131.1K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2021-09 In: text
Out: text
Open Weights
Released: 2025-06-22
Venice Small qwen3-4b 32.8K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-07-27
Llama 3.3 70B llama-3.3-70b 65.5K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-06-09
Qwen 2.5 Coder 32B qwen-2.5-coder-32b 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-06-14
DeepSeek R1 671B deepseek-r1-671b 131.1K 8.2K In: $3.5
Out: $14
Model: 1.750
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-06-05
Llama 3.2 3B llama-3.2-3b 131.1K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-05-23
Llama 3.1 405B llama-3.1-405b 65.5K 8.2K In: $1.5
Out: $6
Model: 0.750
Completion: 4.000
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-06-30

Vercel AI Gateway

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 Instruct moonshotai/kimi-k2 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Qwen3 Next 80B A3B Instruct alibaba/qwen3-next-80b-a3b-instruct 131.1K 32.8K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-12
Qwen3 VL Instruct alibaba/qwen3-vl-instruct 131.1K 129K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
📎 🔧 🌡️ 2025-04 In: text, image
Out: text
Open Weights
Released: 2025-09-24
Qwen3 VL Thinking alibaba/qwen3-vl-thinking 131.1K 129K In: $0.7
Out: $8.4
Model: 0.350
Completion: 12.000
📎 🧠 🔧 🌡️ 2025-09 In: text, image
Out: text
Open Weights
Released: 2025-09-24
Qwen3 Max alibaba/qwen3-max 262.1K 32.8K In: $1.2
Out: $6
Model: 0.600
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Released: 2025-09-23
Qwen3 Coder Plus alibaba/qwen3-coder-plus 1M 1M In: $1
Out: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 Next 80B A3B Thinking alibaba/qwen3-next-80b-a3b-thinking 131.1K 32.8K In: $0.5
Out: $6
Model: 0.250
Completion: 12.000
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Open Weights
Released: 2025-09-12
Grok 3 Mini Fast xai/grok-3-mini-fast 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini xai/grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 Fast xai/grok-4-fast 2M 30K In: $0.2
Out: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
Grok 3 xai/grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 xai/grok-2 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Grok Code Fast 1 xai/grok-code-fast-1 256K 10K In: $0.2
Out: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Released: 2025-08-28
Grok 2 Vision xai/grok-2-vision 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Grok 4 xai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok 3 Fast xai/grok-3-fast 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 Fast (Non-Reasoning) xai/grok-4-fast-non-reasoning 2M 30K In: $0.2
Out: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
Codestral mistral/codestral 256K 4.1K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Magistral Medium mistral/magistral-medium 128K 16.4K In: $2
Out: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20
Mistral Large mistral/mistral-large 131.1K 16.4K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Pixtral Large mistral/pixtral-large 128K 128K In: $2
Out: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Ministral 8B mistral/ministral-8b 128K 128K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Ministral 3B mistral/ministral-3b 128K 128K In: $0.04
Out: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Magistral Small mistral/magistral-small 128K 128K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Mistral Small mistral/mistral-small 128K 16.4K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Updated: 2024-09-04
Pixtral 12B mistral/pixtral-12b 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Mixtral 8x22B mistral/mixtral-8x22b-instruct 64K 64K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-17
v0-1.0-md vercel/v0-1.0-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-22
v0-1.5-md vercel/v0-1.5-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
DeepSeek V3.2 Exp Thinking deepseek/deepseek-v3.2-exp-thinking 163.8K 8.2K In: $0.28
Out: $0.42
Model: 0.140
Completion: 1.500
🧠 🔧 🌡️ 2025-09 In: text
Out: text
Released: 2025-09-29
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus 128K 8.2K In: $0.27
Out: $1
Model: 0.135
Completion: 3.704
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-09-22
DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp 163.8K 8.2K In: $0.28
Out: $0.42
Model: 0.140
Completion: 1.500
🔧 🌡️ 2025-09 In: text
Out: text
Released: 2025-09-29
DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 131.1K 8.2K In: $0.75
Out: $0.99
Model: 0.375
Completion: 1.320
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek-R1 deepseek/deepseek-r1 128K 32.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash Preview 09-25 google/gemini-2.5-flash-preview-09-2025 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Flash Lite Preview 09-25 google/gemini-2.5-flash-lite-preview-09-2025 1M 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-09-25
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.0 Flash google/gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.07
Out: $0.3
Model: 0.035
Completion: 4.286
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
o3 openai/o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o1 openai/o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
o4-mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o openai/gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
Updated: 2024-08-06
GPT-5-Codex openai/gpt-5-codex 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-09-15
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o3-mini openai/o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Sonar Reasoning perplexity/sonar-reasoning 127K 8K In: $1
Out: $5
Model: 0.500
Completion: 5.000
🧠 🌡️ 2025-09 In: text
Out: text
Released: 2025-02-19
Sonar perplexity/sonar 127K 8K In: $1
Out: $1
Model: 0.500
Completion: 1.000
🌡️ 2025-02 In: text, image
Out: text
Released: 2025-02-19
Sonar Pro perplexity/sonar-pro 200K 8K In: $3
Out: $15
Model: 1.500
Completion: 5.000
🌡️ 2025-09 In: text, image
Out: text
Released: 2025-02-19
Sonar Reasoning Pro perplexity/sonar-reasoning-pro 127K 8K In: $2
Out: $8
Model: 1.000
Completion: 4.000
🧠 🌡️ 2025-09 In: text
Out: text
Released: 2025-02-19
GLM 4.5 zai/glm-4.5 128K 96K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai/glm-4.5-air 128K 96K In: $0.2
Out: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V zai/glm-4.5v 66K 16K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-08 In: text, image
Out: text
Open Weights
Released: 2025-08-11
GLM 4.6 zai/glm-4.6 200K 96K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
Nova Micro amazon/nova-micro 128K 8.2K In: $0.035
Out: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Nova Pro amazon/nova-pro 300K 8.2K In: $0.8
Out: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Nova Lite amazon/nova-lite 300K 8.2K In: $0.06
Out: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Morph v3 Fast morph/morph-v3-fast 16K 16K In: $0.8
Out: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15
Morph v3 Large morph/morph-v3-large 32K 32K In: $0.9
Out: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15
Llama-4-Scout-17B-16E-Instruct-FP8 meta/llama-4-scout 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct meta/llama-3.3-70b 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Maverick-17B-128E-Instruct-FP8 meta/llama-4-maverick 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Claude Haiku 4.5 anthropic/claude-haiku-4.5 200K 64K In: $1
Out: $5
Cache Read: $0.1
Cache Write: $1.25
Model: 0.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-02-28 In: text, image
Out: text
Released: 2025-10-15
Claude Sonnet 3.7 anthropic/claude-3.7-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Haiku 3.5 anthropic/claude-3-5-haiku 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 4.5 anthropic/claude-4.5-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-07-31 In: text, image
Out: text
Released: 2025-09-29
Claude Sonnet 3.5 v2 anthropic/claude-3.5-sonnet 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Opus 4.1 anthropic/claude-4-1-opus 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 4 anthropic/claude-4-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 3 anthropic/claude-3-opus 200K 4.1K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-02-29
Claude Haiku 3 anthropic/claude-3-haiku 200K 4.1K In: $0.25
Out: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-13
Claude Opus 4 anthropic/claude-4-opus 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Qwen 3 Coder 480B cerebras/qwen3-coder 131K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

Weights & Biases

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 128K 16.4K In: $1.35
Out: $4
Model: 0.675
Completion: 2.963
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Phi-4-mini-instruct microsoft/Phi-4-mini-instruct 128K 4.1K In: $0.08
Out: $0.35
Model: 0.040
Completion: 4.375
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Meta-Llama-3.1-8B-Instruct meta-llama/Llama-3.1-8B-Instruct 128K 32.8K In: $0.22
Out: $0.22
Model: 0.110
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct 128K 32.8K In: $0.71
Out: $0.71
Model: 0.355
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 4 Scout 17B 16E Instruct meta-llama/Llama-4-Scout-17B-16E-Instruct 64K 8.2K In: $0.17
Out: $0.66
Model: 0.085
Completion: 3.882
🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $1
Out: $1.5
Model: 0.500
Completion: 1.500
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 161K 163.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-V3-0324 deepseek-ai/DeepSeek-V3-0324 161K 8.2K In: $1.14
Out: $2.75
Model: 0.570
Completion: 2.412
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24

xAI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 4 Fast (Non-Reasoning) grok-4-fast-non-reasoning 2M 30K In: $0.2
Out: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
Grok 3 Fast grok-3-fast 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok 2 Vision grok-2-vision 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Grok Code Fast 1 grok-code-fast-1 256K 10K In: $0.2
Out: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Released: 2025-08-28
Grok 2 grok-2 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Grok 3 Mini Fast Latest grok-3-mini-fast-latest 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 Vision (1212) grok-2-vision-1212 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok 3 grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 Fast grok-4-fast 2M 30K In: $0.2
Out: $0.5
Cache Read: $0.05
Model: 0.100
Completion: 2.500
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-07 In: text, image
Out: text
Released: 2025-09-19
Grok 2 Latest grok-2-latest 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok 2 (1212) grok-2-1212 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-12-12
Grok 3 Fast Latest grok-3-fast-latest 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Latest grok-3-latest 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 Vision Latest grok-2-vision-latest 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok Vision Beta grok-vision-beta 8.2K 4.1K In: $5
Out: $15
Cache Read: $5
Model: 2.500
Completion: 3.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-11-01
Grok 3 Mini grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok Beta grok-beta 131.1K 4.1K In: $5
Out: $15
Cache Read: $5
Model: 2.500
Completion: 3.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-11-01
Grok 3 Mini Latest grok-3-mini-latest 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini Fast grok-3-mini-fast 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
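
Because the Pricing column is quoted in USD per 1M tokens, the cost of a single request follows from simple arithmetic. The Python sketch below illustrates this with the Grok 4 row above (In: $3, Out: $15, Cache Read: $0.75). The names `Pricing` and `estimate_cost_usd` are hypothetical, and the sketch assumes that cached input tokens are billed at the Cache Read rate instead of the In rate, which this page does not itself confirm.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """Prices in USD per 1M tokens, as listed in the Pricing column."""
    input_per_m: float
    output_per_m: float
    cache_read_per_m: Optional[float] = None  # None when no Cache Read price is listed


def estimate_cost_usd(pricing: Pricing, input_tokens: int, output_tokens: int,
                      cached_tokens: int = 0) -> float:
    """Estimate the cost of one request.

    Assumption: cached input tokens replace the In rate with the Cache Read rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    # Fall back to the regular input rate when no cache price is listed.
    cache_rate = (pricing.cache_read_per_m
                  if pricing.cache_read_per_m is not None
                  else pricing.input_per_m)
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * pricing.input_per_m
             + cached_tokens * cache_rate
             + output_tokens * pricing.output_per_m)
    return total / 1_000_000


# Grok 4 prices copied from the table above.
grok_4 = Pricing(input_per_m=3.0, output_per_m=15.0, cache_read_per_m=0.75)
print(f"${estimate_cost_usd(grok_4, 10_000, 2_000, cached_tokens=4_000):.4f}")
```

For 10,000 input tokens (4,000 of them cached) and 2,000 output tokens, this works out to 6,000 × $3 + 4,000 × $0.75 + 2,000 × $15 per million, or about $0.051.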

Z.AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 glm-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5-Air glm-4.5-air 131.1K 98.3K In: $0.2
Out: $1.1
Cache Read: $0.03
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.6 glm-4.6 204.8K 131.1K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
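
The NewAPI Ratios column appears to be derived from the listed prices. In the rows above, Model matches the input price divided by 2, Completion matches the output price divided by the input price, and Cache matches the Cache Read price divided by the input price. The Python sketch below reproduces the GLM-4.5 row under that assumption. The function name `newapi_ratios` and the $2 per 1M base are inferred from the table values, not taken from NewAPI documentation.

```python
from decimal import Decimal, ROUND_HALF_UP
from typing import Optional

# Assumption inferred from the tables: a Model ratio of 1.000
# corresponds to an input price of $2 per 1M tokens.
BASE_PRICE = Decimal("2")


def _round3(value: Decimal) -> Decimal:
    """Round to three decimal places, matching the precision shown on this page."""
    return value.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)


def newapi_ratios(input_price: str, output_price: str,
                  cache_read_price: Optional[str] = None) -> dict:
    """Derive Model / Completion / Cache ratios from $/1M prices given as strings."""
    inp = Decimal(input_price)
    ratios = {
        "model": _round3(inp / BASE_PRICE),
        "completion": _round3(Decimal(output_price) / inp),
    }
    # The Cache ratio is only listed when a Cache Read price exists.
    if cache_read_price is not None:
        ratios["cache"] = _round3(Decimal(cache_read_price) / inp)
    return ratios


# GLM-4.5 prices copied from the table above: In $0.6, Out $2.2, Cache Read $0.11.
print({name: str(value) for name, value in newapi_ratios("0.6", "2.2", "0.11").items()})
# {'model': '0.300', 'completion': '3.667', 'cache': '0.183'}
```

Prices are passed as strings and handled with `Decimal` so that borderline values such as 0.0175 round to 0.018, as the page shows, and are not affected by binary floating-point error.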

Z.AI Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 glm-4.5 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5-Air glm-4.5-air 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V glm-4.5v 64K 16.4K - - 📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.6 glm-4.6 204.8K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30

Zhipu AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.6 glm-4.6 204.8K 131.1K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM 4.5V glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5-Air glm-4.5-air 131.1K 98.3K In: $0.2
Out: $1.1
Cache Read: $0.03
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 glm-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28

Zhipu AI Coding Plan

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.6 glm-4.6 204.8K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-09-30
GLM 4.5V glm-4.5v 64K 16.4K - - 📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5-Air glm-4.5-air 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 glm-4.5 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28