
Data Browser

This page lists LLM providers and their models, with context limits, pricing, capabilities, and release details. It is generated automatically from API data.

Statistics

  • Provider Count: 46
  • Model Count: 652
  • Last Updated: 2025-09-06, 1:48:03 AM

Capabilities Legend: 🧠 Reasoning   🔧 Tools   📎 Attachment   🌡️ Temperature
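
The legend maps each icon to a capability flag. A minimal Python sketch of decoding a row's Capabilities cell follows. It assumes the cell is a whitespace-separated run of these icons and that `-` means no flags are listed; the function name `decode_capabilities` is hypothetical and not part of the page or its API.

```python
VARIATION_SELECTOR = "\ufe0f"  # optional emoji-presentation suffix, e.g. on the thermometer

# Icon-to-capability mapping copied from the Capabilities Legend.
# Keys are normalized so the lookup works with or without the suffix.
LEGEND = {
    icon.replace(VARIATION_SELECTOR, ""): name
    for icon, name in {
        "🧠": "reasoning",
        "🔧": "tools",
        "📎": "attachment",
        "🌡️": "temperature",
    }.items()
}


def decode_capabilities(cell: str) -> set[str]:
    """Return the capability names found in a Capabilities cell."""
    flags = set()
    for token in cell.split():
        icon = token.replace(VARIATION_SELECTOR, "")
        if icon in LEGEND:  # unknown tokens such as "-" are ignored
            flags.add(LEGEND[icon])
    return flags


print(sorted(decode_capabilities("📎 🧠 🔧 🌡️")))
print(decode_capabilities("-"))
```

Normalizing away the variation selector is a defensive choice, since copied emoji may or may not carry it.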

Alibaba

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Qwen3 Coder Plus qwen3-coder-plus 1M 65.5K In: $1
Out: $5
Model: 0.500
Completion: 5.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
DeepSeek R1 deepseek-r1 128K - In: $4
Out: $16
Model: 2.000
Completion: 4.000
- - In: text
Out: text
-
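
The NewAPI Ratios column appears to follow directly from the listed prices. In the rows on this page, Model is the input price divided by 2, Completion is the output price divided by the input price, and, where a cache-read price is listed, Cache is the cache-read price divided by the input price. The page does not state these formulas, so the Python sketch below is an inference from the listed rows. The $2 per 1M tokens baseline and the function name `newapi_ratios` are assumptions.

```python
from typing import Optional

# Assumed baseline: an input price of $2 per 1M tokens corresponds to Model ratio 1.000.
BASE_PRICE_PER_1M = 2.0


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive the ratios from $/1M-token prices, rounded to 3 decimals as displayed."""
    ratios = {
        "model": round(input_price / BASE_PRICE_PER_1M, 3),
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# Qwen3 Coder Plus: In $1, Out $5
print(newapi_ratios(1, 5))
# Claude Sonnet 3.5 v2 on Amazon Bedrock: In $3, Out $15, Cache Read $0.3
print(newapi_ratios(3, 15, 0.3))
```

Rows without pricing (shown as `-`) have no ratios to derive, and an input price of zero would need separate handling.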

Amazon Bedrock

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Claude Sonnet 3.5 v2 anthropic.claude-3-5-sonnet-20241022-v2:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-22
Command Light cohere.command-light-text-v14 4.1K 4.1K In: $0.3
Out: $0.6
Model: 0.150
Completion: 2.000
🌡️ 2023-08 In: text
Out: text
Open Weights
Released: 2023-11-01
Claude Opus 4.1 anthropic.claude-opus-4-1-20250805-v1:0 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Llama 3.1 70B Instruct meta.llama3-1-70b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Haiku 3 anthropic.claude-3-haiku-20240307-v1:0 200K 4.1K In: $0.25
Out: $1.25
Model: 0.125
Completion: 5.000
📎 🔧 🌡️ 2024-02 In: text, image
Out: text
Released: 2024-03-13
Llama 3.2 3B Instruct meta.llama3-2-3b-instruct-v1:0 131K 4.1K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-09-25
Claude Opus 3 anthropic.claude-3-opus-20240229-v1:0 200K 4.1K In: $15
Out: $75
Model: 7.500
Completion: 5.000
📎 🔧 🌡️ 2023-08 In: text, image
Out: text
Released: 2024-02-29
Command cohere.command-text-v14 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2023-08 In: text
Out: text
Open Weights
Released: 2023-11-01
Llama 4 Scout 17B Instruct meta.llama4-scout-17b-instruct-v1:0 3.5M 16.4K In: $0.17
Out: $0.66
Model: 0.085
Completion: 3.882
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Nova Micro amazon.nova-micro-v1:0 128K 8.2K In: $0.035
Out: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Nova Premier amazon.nova-premier-v1:0 1M 16.4K In: $2.5
Out: $12.5
Model: 1.250
Completion: 5.000
📎 🧠 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Claude 2 anthropic.claude-v2 100K 4.1K In: $8
Out: $24
Model: 4.000
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-07-11
Claude Sonnet 3.7 anthropic.claude-3-7-sonnet-20250219-v1:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
Jamba 1.5 Mini ai21.jamba-1-5-mini-v1:0 256K 4.1K In: $0.2
Out: $0.4
Model: 0.100
Completion: 2.000
🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2024-08-15
Llama 3 8B Instruct meta.llama3-8b-instruct-v1:0 8.2K 2K In: $0.3
Out: $0.6
Model: 0.150
Completion: 2.000
🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-07-23
Jamba 1.5 Large ai21.jamba-1-5-large-v1:0 256K 4.1K In: $2
Out: $8
Model: 1.000
Completion: 4.000
🔧 🌡️ 2024-08 In: text
Out: text
Open Weights
Released: 2024-08-15
Claude Opus 4 anthropic.claude-opus-4-20250514-v1:0 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-05-22
Llama 3 70B Instruct meta.llama3-70b-instruct-v1:0 8.2K 2K In: $2.65
Out: $3.5
Model: 1.325
Completion: 1.321
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 4 Maverick 17B Instruct meta.llama4-maverick-17b-instruct-v1:0 1M 16.4K In: $0.24
Out: $0.97
Model: 0.120
Completion: 4.042
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Nova Pro amazon.nova-pro-v1:0 300K 8.2K In: $0.8
Out: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Claude Sonnet 4 anthropic.claude-sonnet-4-20250514-v1:0 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-05-22
Claude Haiku 3.5 anthropic.claude-3-5-haiku-20241022-v1:0 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-10-22
Nova Lite amazon.nova-lite-v1:0 300K 8.2K In: $0.06
Out: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Llama 3.2 1B Instruct meta.llama3-2-1b-instruct-v1:0 131K 4.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-09-25
Command R+ cohere.command-r-plus-v1:0 128K 4.1K In: $3
Out: $15
Model: 1.500
Completion: 5.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-04
Claude Instant anthropic.claude-instant-v1 100K 4.1K In: $0.8
Out: $2.4
Model: 0.400
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-03-01
Llama 3.1 8B Instruct meta.llama3-1-8b-instruct-v1:0 128K 4.1K In: $0.22
Out: $0.22
Model: 0.110
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Claude Sonnet 3.5 anthropic.claude-3-5-sonnet-20240620-v1:0 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-06-20
Command R cohere.command-r-v1:0 128K 4.1K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-03-11
Llama 3.3 70B Instruct meta.llama3-3-70b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama 3.2 11B Instruct meta.llama3-2-11b-instruct-v1:0 128K 4.1K In: $0.16
Out: $0.16
Model: 0.080
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Claude 2.1 anthropic.claude-v2:1 200K 4.1K In: $8
Out: $24
Model: 4.000
Completion: 3.000
🌡️ 2023-08 In: text
Out: text
Released: 2023-11-21
Llama 3.2 90B Instruct meta.llama3-2-90b-instruct-v1:0 128K 4.1K In: $0.72
Out: $0.72
Model: 0.360
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Claude Sonnet 3 anthropic.claude-3-sonnet-20240229-v1:0 200K 4.1K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🔧 🌡️ 2023-08 In: text, image
Out: text
Released: 2024-03-04
DeepSeek-R1 deepseek.r1-v1:0 128K 32.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29

Anthropic

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Opus 4.1 claude-opus-4-1-20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Haiku 3 claude-3-haiku-20240307 200K 4.1K In: $0.25
Out: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-13
Claude Haiku 3.5 claude-3-5-haiku-20241022 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Opus 4 claude-opus-4-20250514 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 3.5 claude-3-5-sonnet-20240620 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-06-20
Claude Sonnet 3 claude-3-sonnet-20240229 200K 4.1K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $0.3
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-04
Claude Sonnet 4 claude-sonnet-4-20250514 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 3 claude-3-opus-20240229 200K 4.1K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-02-29
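
Prices in these tables are quoted in US dollars per 1M tokens, so a request's cost scales linearly with its token counts. A rough Python sketch follows, using the Claude Sonnet 4 prices listed above. The token counts are made up for illustration, `estimate_cost` is a hypothetical helper, and the assumption that cached tokens are billed at the Cache Read and Cache Write rates instead of the regular input rate is not stated on this page.

```python
def estimate_cost(prices: dict, input_tokens: int, output_tokens: int,
                  cache_read_tokens: int = 0, cache_write_tokens: int = 0) -> float:
    """Estimate a request's cost in USD from per-1M-token prices.

    input_tokens counts only uncached input tokens (an assumption of this sketch).
    """
    per_token = {name: price / 1_000_000 for name, price in prices.items()}
    return (
        input_tokens * per_token["input"]
        + output_tokens * per_token["output"]
        + cache_read_tokens * per_token.get("cache_read", 0.0)
        + cache_write_tokens * per_token.get("cache_write", 0.0)
    )


# Claude Sonnet 4 as listed: In $3, Out $15, Cache Read $0.3, Cache Write $3.75
SONNET_4 = {"input": 3.0, "output": 15.0, "cache_read": 0.3, "cache_write": 3.75}

cost = estimate_cost(SONNET_4, input_tokens=10_000, output_tokens=2_000,
                     cache_read_tokens=50_000)
print(f"${cost:.4f}")
```

Models without cache pricing simply omit the `cache_read` and `cache_write` keys, in which case those terms contribute nothing.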

Azure

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
GPT-5 Nano gpt-5-nano 272K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-3.5 Turbo 0613 gpt-3.5-turbo-0613 16.4K 16.4K In: $3
Out: $4
Model: 1.500
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-06-13
GPT-3.5 Turbo 0301 gpt-3.5-turbo-0301 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-03-01
GPT-4.1 gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
GPT-4 Turbo gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o1 o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-5 gpt-5 272K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o3 o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5 Chat gpt-5-chat 128K 16.4K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 2024-10-24 In: text, image
Out: text
Released: 2025-08-07
Codex Mini codex-mini 200K 100K In: $1.5
Out: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16
GPT-4 Turbo Vision gpt-4-turbo-vision 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-11 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-5 Mini gpt-5-mini 272K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o1-preview o1-preview 128K 32.8K In: $16.5
Out: $66
Cache Read: $8.25
Model: 8.250
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
GPT-3.5 Turbo 1106 gpt-3.5-turbo-1106 16.4K 16.4K In: $1
Out: $2
Model: 0.500
Completion: 2.000
🌡️ 2021-08 In: text
Out: text
Released: 2023-11-06
GPT-4o mini gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-4.1 nano gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1 mini gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-05 In: text, image
Out: text
Released: 2025-04-14
o1-mini o1-mini 128K 65.5K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2023-09 In: text
Out: text
Released: 2024-09-12
GPT-3.5 Turbo 0125 gpt-3.5-turbo-0125 16.4K 16.4K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🌡️ 2021-08 In: text
Out: text
Released: 2024-01-25
GPT-4 32K gpt-4-32k 32.8K 32.8K In: $60
Out: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct 4.1K 4.1K In: $1.5
Out: $2
Model: 0.750
Completion: 1.333
🌡️ 2021-08 In: text
Out: text
Released: 2023-09-21
GPT-4o gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-4 gpt-4 8.2K 8.2K In: $60
Out: $120
Model: 30.000
Completion: 2.000
🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-03-14
o4-mini o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o3-mini o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29

Baseten

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
moonshotai/Kimi-K2-Instruct-0905 Moonshotai-Kimi-K2-Instruct-0905 131K 131K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2025-08 In: text
Out: text
Open Weights
Released: 2025-09-05
Qwen/Qwen3-Coder-480B-A35B-Instruct Qwen-Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $0.38
Out: $1.53
Model: 0.190
Completion: 4.026
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

Cerebras

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Qwen 3 235B Instruct qwen-3-235b-a22b-instruct-2507 131K 32K In: $0.6
Out: $1.2
Model: 0.300
Completion: 2.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-22
GPT OSS 120B gpt-oss-120b 131.1K 32.8K In: $0.25
Out: $0.69
Model: 0.125
Completion: 2.760
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen 3 Coder 480B qwen-3-coder-480b 131K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

Chutes

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B 131.1K 131.1K In: $0.03
Out: $0.14
Model: 0.015
Completion: 4.667
🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek R1 0528 Qwen3 8B deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 131.1K 131.1K In: $0.02
Out: $0.07
Model: 0.010
Completion: 3.500
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-29
DeepSeek V3.1 Reasoning deepseek-ai/DeepSeek-V3.1:THINKING 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3 (0324) deepseek-ai/DeepSeek-V3-0324 75K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek R1 (0528) deepseek-ai/DeepSeek-R1-0528 75K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.41
Model: 0.050
Completion: 4.100
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Devstral Small (2505) chutesai/Devstral-Small-2505 32.8K 32.8K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-05-21
Mistral Small 3.2 24B Instruct (2506) chutesai/Mistral-Small-3.2-24B-Instruct-2506 131.1K 131.1K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-06-20
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 262.1K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 262.1K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 30B A3B Qwen/Qwen3-30B-A3B 41K 41K In: $0.02
Out: $0.08
Model: 0.010
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262.1K 262.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 Coder 480B A35B Instruct (FP8) Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
GLM 4.5 FP8 zai-org/GLM-4.5-FP8 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai-org/GLM-4.5-Air 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
DeepSeek TNG R1T2 Chimera tngtech/DeepSeek-TNG-R1T2-Chimera 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-08
DeepSeek R1T Chimera tngtech/DeepSeek-R1T-Chimera 163.8K 163.8K In: $0.18
Out: $0.72
Model: 0.090
Completion: 4.000
🧠 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-26
Kimi K2 Instruct moonshotai/Kimi-K2-Instruct-75k 75K 75K In: $0.15
Out: $0.59
Model: 0.075
Completion: 3.933
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01

Cloudflare Workers AI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
hf/nexusflow/starling-lm-7b-beta starling-lm-7b-beta 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-03-19
Updated: 2024-04-03
hf/thebloke/deepseek-coder-6.7b-base-awq deepseek-coder-6.7b-base-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-05
Updated: 2023-11-09
cf/openchat/openchat-3.5-0106 openchat-3.5-0106 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-07
Updated: 2024-05-18
cf/mistral/mistral-7b-instruct-v0.2-lora mistral-7b-instruct-v0.2-lora 15K 15K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-01
cf/meta/llama-3.1-8b-instruct-awq llama-3.1-8b-instruct-awq 8.2K 8.2K In: $0.12
Out: $0.27
Model: 0.060
Completion: 2.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
hf/mistral/mistral-7b-instruct-v0.2 mistral-7b-instruct-v0.2 3.1K 3.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-11
Updated: 2025-07-24
cf/llava-hf/llava-1.5-7b-hf llava-1.5-7b-hf - - - - 📎 🌡️ - In: image, text
Out: text
Open Weights
Released: 2023-12-05
Updated: 2025-06-06
cf/microsoft/resnet-50 resnet-50 - - In: $0.0000025
Out: -
Model: 0.000
- - In: image
Out: text
Open Weights
Released: 2022-03-16
Updated: 2024-02-13
hf/thebloke/deepseek-coder-6.7b-instruct-awq deepseek-coder-6.7b-instruct-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-05
Updated: 2023-11-13
cf/meta/llama-3.1-70b-instruct llama-3.1-70b-instruct 24K 24K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-16
Updated: 2024-12-15
cf/qwen/qwen1.5-0.5b-chat qwen1.5-0.5b-chat 32K 32K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-31
Updated: 2024-04-30
cf/qwen/qwen1.5-14b-chat-awq qwen1.5-14b-chat-awq 7.5K 7.5K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-03
Updated: 2024-04-30
cf/meta/llama-3.2-3b-instruct llama-3.2-3b-instruct 128K 128K In: $0.051
Out: $0.34
Model: 0.025
Completion: 6.667
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
cf/google/gemma-7b-it-lora gemma-7b-it-lora 3.5K 3.5K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-02
cf/google/gemma-3-12b-it gemma-3-12b-it 80K 80K In: $0.35
Out: $0.56
Model: 0.175
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-01
Updated: 2025-03-21
cf/meta/llama-3-8b-instruct llama-3-8b-instruct 8K 8K In: $0.28
Out: $0.83
Model: 0.140
Completion: 2.964
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-17
Updated: 2025-06-19
hf/google/gemma-7b-it gemma-7b-it 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-13
Updated: 2024-08-14
cf/qwen/qwq-32b qwq-32b 24K 24K In: $0.66
Out: $1
Model: 0.330
Completion: 1.515
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-05
Updated: 2025-03-11
cf/meta/llama-3.2-1b-instruct llama-3.2-1b-instruct 60K 60K In: $0.027
Out: $0.2
Model: 0.013
Completion: 7.407
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
cf/tiiuae/falcon-7b-instruct falcon-7b-instruct 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-04-25
Updated: 2024-10-12
cf/meta/llama-3.1-8b-instruct-fast llama-3.1-8b-instruct-fast 128K 128K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
Updated: 2024-09-25
cf/tinyllama/tinyllama-1.1b-chat-v1.0 tinyllama-1.1b-chat-v1.0 2K 2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-30
Updated: 2024-03-17
cf/runwayml/stable-diffusion-v1-5-inpainting stable-diffusion-v1-5-inpainting - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-27
cf/meta/llama-3-8b-instruct-awq llama-3-8b-instruct-awq 8.2K 8.2K In: $0.12
Out: $0.27
Model: 0.060
Completion: 2.250
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-05-09
hf/thebloke/openhermes-2.5-mistral-7b-awq openhermes-2.5-mistral-7b-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-02
Updated: 2023-11-09
cf/meta-llama/llama-2-7b-chat-hf-lora llama-2-7b-chat-hf-lora 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-07-13
Updated: 2024-04-17
cf/openai/gpt-oss-20b gpt-oss-20b 128K 128K In: $0.2
Out: $0.3
Model: 0.100
Completion: 1.500
- - In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
cf/meta/llama-3.1-8b-instruct-fp8 llama-3.1-8b-instruct-fp8 32K 32K In: $0.15
Out: $0.29
Model: 0.075
Completion: 1.933
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-25
cf/openai/whisper whisper - - In: $0.00045
Out: $0.00045
Model: 0.000
Completion: 1.000
- - In: audio
Out: text
Open Weights
Released: 2023-11-07
Updated: 2024-08-12
cf/unum/uform-gen2-qwen-500m uform-gen2-qwen-500m - - - - - - In: image, text
Out: text
Open Weights
Released: 2024-02-15
Updated: 2024-04-24
cf/facebook/bart-large-cnn bart-large-cnn - - - - - - In: text
Out: text
Open Weights
Released: 2022-03-02
Updated: 2024-02-13
cf/mistral/mistral-7b-instruct-v0.1 mistral-7b-instruct-v0.1 2.8K 2.8K In: $0.11
Out: $0.19
Model: 0.055
Completion: 1.727
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-27
Updated: 2025-07-24
cf/fblgit/una-cybertron-7b-v2-bf16 una-cybertron-7b-v2-bf16 15K 15K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-02
Updated: 2024-03-08
hf/nousresearch/hermes-2-pro-mistral-7b hermes-2-pro-mistral-7b 24K 24K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-03-11
Updated: 2024-09-08
hf/thebloke/llama-2-13b-chat-awq llama-2-13b-chat-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-19
Updated: 2023-11-09
cf/qwen/qwen2.5-coder-32b-instruct qwen2.5-coder-32b-instruct 32.8K 32.8K In: $0.66
Out: $1
Model: 0.330
Completion: 1.515
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-11-06
Updated: 2025-01-12
cf/meta/m2m100-1.2b m2m100-1.2b - - In: $0.34
Out: $0.34
Model: 0.170
Completion: 1.000
- - In: text
Out: text
Open Weights
Released: 2022-03-02
Updated: 2023-11-16
cf/myshell-ai/melotts melotts - - In: $0.0002
Out: -
Model: 0.000
📎 - In: text
Out: audio
Open Weights
Released: 2024-07-19
cf/stabilityai/stable-diffusion-xl-base-1.0 stable-diffusion-xl-base-1.0 - - - - - - In: text
Out: image
Open Weights
Released: 2023-07-25
Updated: 2023-10-30
cf/meta/llama-3.1-8b-instruct llama-3.1-8b-instruct 8K 8K In: $0.28
Out: $0.83
Model: 0.140
Completion: 2.964
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-18
Updated: 2024-09-25
cf/qwen/qwen1.5-7b-chat-awq qwen1.5-7b-chat-awq 20K 20K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-03
Updated: 2024-04-30
cf/mistralai/mistral-small-3.1-24b-instruct mistral-small-3.1-24b-instruct 128K 128K In: $0.35
Out: $0.56
Model: 0.175
Completion: 1.600
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-03-11
Updated: 2025-07-28
cf/meta/llama-3.2-11b-vision-instruct llama-3.2-11b-vision-instruct 128K 128K In: $0.049
Out: $0.68
Model: 0.025
Completion: 13.878
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-12-04
cf/meta/llama-3.3-70b-instruct-fp8-fast llama-3.3-70b-instruct-fp8-fast 24K 24K In: $0.29
Out: $2.25
Model: 0.145
Completion: 7.759
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-12-06
cf/microsoft/phi-2 phi-2 2K 2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-13
Updated: 2024-04-29
cf/openai/whisper-large-v3-turbo whisper-large-v3-turbo - - In: $0.00051
Out: $0.00051
Model: 0.000
Completion: 1.000
- - In: audio
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
hf/thebloke/mistral-7b-instruct-v0.1-awq mistral-7b-instruct-v0.1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-27
Updated: 2023-11-09
hf/thebloke/neural-chat-7b-v3-1-awq neural-chat-7b-v3-1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-11-15
Updated: 2023-11-17
cf/openai/gpt-oss-120b gpt-oss-120b 128K 128K In: $0.35
Out: $0.75
Model: 0.175
Completion: 2.143
- - In: text
Out: text
Open Weights
Released: 2025-08-04
Updated: 2025-08-14
cf/qwen/qwen1.5-1.8b-chat qwen1.5-1.8b-chat 32K 32K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-30
Updated: 2024-04-30
cf/meta/llama-4-scout-17b-16e-instruct llama-4-scout-17b-16e-instruct 131K 131K In: $0.27
Out: $0.85
Model: 0.135
Completion: 3.148
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-04-02
Updated: 2025-05-23
cf/google/gemma-2b-it-lora gemma-2b-it-lora 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-04-02
cf/defog/sqlcoder-7b-2 sqlcoder-7b-2 10K 10K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-05
Updated: 2024-02-12
cf/deepseek-ai/deepseek-math-7b-instruct deepseek-math-7b-instruct 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-02-05
Updated: 2024-02-06
cf/meta/llama-2-7b-chat-fp16 llama-2-7b-chat-fp16 4.1K 4.1K In: $0.56
Out: $6.67
Model: 0.280
Completion: 11.911
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-07-26
cf/deepseek-ai/deepseek-r1-distill-qwen-32b deepseek-r1-distill-qwen-32b 80K 80K In: $0.5
Out: $4.88
Model: 0.250
Completion: 9.760
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-02-24
cf/thebloke/discolm-german-7b-v1-awq discolm-german-7b-v1-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-01-18
Updated: 2024-01-24
cf/bytedance/stable-diffusion-xl-lightning stable-diffusion-xl-lightning - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-20
Updated: 2024-04-03
cf/openai/whisper-tiny-en whisper-tiny-en - - - - - - In: audio
Out: text
Open Weights
Released: 2022-09-26
Updated: 2024-01-22
hf/thebloke/zephyr-7b-beta-awq zephyr-7b-beta-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-10-27
Updated: 2023-11-09
cf/black-forest-labs/flux-1-schnell flux-1-schnell 2K - In: $0.000053
Out: $0.00011
Model: 0.000
Completion: 2.075
- - In: text
Out: image
Open Weights
Released: 2024-07-31
Updated: 2024-08-16
cf/runwayml/stable-diffusion-v1-5-img2img stable-diffusion-v1-5-img2img - - - - - - In: text
Out: image
Open Weights
Released: 2024-02-27
cf/meta/llama-2-7b-chat-int8 llama-2-7b-chat-int8 8.2K 8.2K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-09-25
hf/thebloke/llamaguard-7b-awq llamaguard-7b-awq 4.1K 4.1K - - 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2023-12-11
cf/meta/llama-guard-3-8b llama-guard-3-8b - - In: $0.48
Out: $0.03
Model: 0.240
Completion: 0.063
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-22
Updated: 2024-10-11
cf/lykon/dreamshaper-8-lcm dreamshaper-8-lcm - - - - 📎 - In: text
Out: image
Open Weights
Released: 2023-12-06
Updated: 2023-12-07

Deep Infra

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Qwen3 Coder 480B A35B Instruct Turbo Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $0.4
Out: $1.6
Model: 0.200
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM-4.5 zai-org/GLM-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Kimi K2 moonshotai/Kimi-K2-Instruct 131.1K 32.8K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11

DeepSeek

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
DeepSeek Reasoner deepseek-reasoner 128K 128K In: $0.57
Out: $1.68
Cache Read: $0.07
Model: 0.285
Completion: 2.947
Cache: 0.123
📎 🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-08-21
DeepSeek Chat deepseek-chat 128K 8.2K In: $0.57
Out: $1.68
Cache Read: $0.07
Model: 0.285
Completion: 2.947
Cache: 0.123
📎 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2024-12-26
Updated: 2025-08-21

ExampleCorp AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
Novus 1 novus-1 128K 4.1K In: $5
Out: $15
Cache Read: $0.075
Cache Write: $0.5
Model: 2.500
Completion: 3.000
Cache: 0.015
📎 🧠 🔧 🌡️ 2024-07 In: text, image, audio, video, pdf
Out: text, image, audio, video, pdf
Released: 2025-01-20
Updated: 2025-08-21

FastRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M tokens) NewAPI Ratios Capabilities Knowledge Cutoff Modalities Details
DeepSeek R1 Distill Llama 70B deepseek-ai/deepseek-r1-distill-llama-70b 131.1K 131.1K In: $0.03
Out: $0.14
Model: 0.015
Completion: 4.667
🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.005
Model: 0.025
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT OSS 20B openai/gpt-oss-20b 131.1K 65.5K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.125
Model: 0.625
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.025
Model: 0.125
Completion: 8.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Grok 4 x-ai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.0375
Model: 0.150
Completion: 8.333
Cache: 0.125
📎 🧠 🔧 🌡️ 2025-01 In: text, image, pdf
Out: text
Released: 2025-06-17
Qwen3 Coder qwen/qwen3-coder 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Kimi K2 moonshotai/kimi-k2 131.1K 32.8K In: $0.55
Out: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11

Fireworks AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT OSS 20B accounts/fireworks/gpt-oss-20b 131.1K 32.8K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B accounts/fireworks/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 235B-A22B accounts/fireworks/models/qwen3-235b-a22b 128K 16.4K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-29
DeepSeek V3 03-24 accounts/fireworks/models/deepseek-v3-0324 160K 16.4K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
Qwen3 Coder 480B A35B Instruct accounts/fireworks/models/qwen3-coder-480b-a35b-instruct 256K 32.8K In: $0.45
Out: $1.8
Model: 0.225
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-22
DeepSeek R1 05-28 accounts/fireworks/models/deepseek-r1-0528 160K 16.4K In: $3
Out: $8
Model: 1.500
Completion: 2.667
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
GLM 4.5 Air accounts/fireworks/models/glm-4p5-air 131.1K 131.1K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-08-01
Kimi K2 Instruct accounts/fireworks/models/kimi-k2-instruct 128K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
GLM 4.5 accounts/fireworks/models/glm-4p5 131.1K 131.1K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
DeepSeek V3.1 accounts/fireworks/models/deepseek-v3p1 163.8K 163.8K In: $0.56
Out: $1.68
Model: 0.280
Completion: 3.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-08-21

GitHub Copilot

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K - - 📎 🔧 🌡️ 2025-01 In: text, image, audio, video
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Claude Sonnet 4 claude-sonnet-4 200K 8.2K - - 📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
GPT-4.1 gpt-4.1 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
Gemini 2.0 Flash gemini-2.0-flash-001 1M 8.2K - - 📎 🔧 🌡️ 2024-06 In: text, image, audio, video
Out: text
Released: 2024-12-11
Claude Opus 4 claude-opus-4 80K 16K - - 📎 🧠 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Grok Code Fast 1 grok-code-fast-1 256K 10K - - 🧠 🔧 🌡️ 2025-08 In: text
Out: text
Released: 2025-08-27
Claude Opus 4.1 claude-opus-41 200K 32K - - 📎 🧠 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 Thinking claude-3.7-sonnet-thought 200K 8.2K - - 📎 🧠 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
GPT-5 gpt-5 128K 128K - - 📎 🧠 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2025-08-07
o3 (Preview) o3 128K 16.4K - - 📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5-mini gpt-5-mini 128K 128K - - 📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-08-13
Claude Sonnet 3.7 claude-3.7-sonnet 200K 8.2K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 3.5 claude-3.5-sonnet 200K 8.2K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2024-10-22
GPT-4o gpt-4o 128K 16.4K - - 📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
o4-mini (Preview) o4-mini 128K 65.5K - - 🧠 2024-10 In: text
Out: text
Released: 2025-04-16
o3-mini o3-mini 128K 65.5K - - 🧠 2024-10 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29

GitHub Models

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek-V3-0324 deepseek/deepseek-v3-0324 128K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek-R1-0528 deepseek/deepseek-r1-0528 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-R1 deepseek/deepseek-r1 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2025-01-20
Meta-Llama-3-8B-Instruct meta/meta-llama-3-8b-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Llama-3.2-90B-Vision-Instruct meta/llama-3.2-90b-vision-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Llama-3.3-70B-Instruct meta/llama-3.3-70b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Meta-Llama-3.1-70B-Instruct meta/meta-llama-3.1-70b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama 4 Maverick 17B 128E Instruct FP8 meta/llama-4-maverick-17b-128e-instruct-fp8 128K 8.2K - - 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Meta-Llama-3.1-8B-Instruct meta/meta-llama-3.1-8b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Meta-Llama-3.1-405B-Instruct meta/meta-llama-3.1-405b-instruct 128K 32.8K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.2-11B-Vision-Instruct meta/llama-3.2-11b-vision-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2023-12 In: text, image, audio
Out: text
Open Weights
Released: 2024-09-25
Llama 4 Scout 17B 16E Instruct meta/llama-4-scout-17b-16e-instruct 128K 8.2K - - 🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-31
Meta-Llama-3-70B-Instruct meta/meta-llama-3-70b-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-04-18
Grok 3 xai/grok-3 128K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
Grok 3 Mini xai/grok-3-mini 128K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-09
GPT-4.1 openai/gpt-4.1 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI o1 openai/o1 200K 100K - - 🧠 2023-10 In: text, image
Out: text
Released: 2024-09-12
Updated: 2024-12-17
OpenAI o3 openai/o3 200K 100K - - 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
OpenAI o1-preview openai/o1-preview 128K 32.8K - - 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
GPT-4o mini openai/gpt-4o-mini 128K 16.4K - - 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-07-18
GPT-4.1-nano openai/gpt-4.1-nano 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1-mini openai/gpt-4.1-mini 128K 16.4K - - 📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
OpenAI o1-mini openai/o1-mini 128K 65.5K - - 🧠 2023-10 In: text
Out: text
Released: 2024-09-12
Updated: 2024-12-17
GPT-4o openai/gpt-4o 128K 16.4K - - 📎 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Released: 2024-05-13
OpenAI o4-mini openai/o4-mini 200K 100K - - 🧠 2024-04 In: text, image
Out: text
Released: 2025-01-31
OpenAI o3-mini openai/o3-mini 200K 100K - - 🧠 2024-04 In: text
Out: text
Released: 2025-01-31
AI21 Jamba 1.5 Mini ai21-labs/ai21-jamba-1.5-mini 256K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29
AI21 Jamba 1.5 Large ai21-labs/ai21-jamba-1.5-large 256K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-29
Phi-3-medium instruct (4k) microsoft/phi-3-medium-4k-instruct 4.1K 1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3.5-vision instruct (128k) microsoft/phi-3.5-vision-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2024-08-20
Phi-4-Reasoning microsoft/phi-4-reasoning 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-4 microsoft/phi-4 16K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-small instruct (8k) microsoft/phi-3-small-8k-instruct 8.2K 2K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-4-mini-instruct microsoft/phi-4-mini-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3-mini instruct (128k) microsoft/phi-3-mini-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-small instruct (128k) microsoft/phi-3-small-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
MAI-DS-R1 microsoft/mai-ds-r1 65.5K 8.2K - - 🧠 🔧 🌡️ 2024-06 In: text
Out: text
Released: 2025-01-20
Phi-3.5-MoE instruct (128k) microsoft/phi-3.5-moe-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-4-multimodal-instruct microsoft/phi-4-multimodal-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text, image, audio
Out: text
Open Weights
Released: 2024-12-11
Phi-4-mini-reasoning microsoft/phi-4-mini-reasoning 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Phi-3.5-mini instruct (128k) microsoft/phi-3.5-mini-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-08-20
Phi-3-mini instruct (4k) microsoft/phi-3-mini-4k-instruct 4.1K 1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Phi-3-medium instruct (128k) microsoft/phi-3-medium-128k-instruct 128K 4.1K - - 🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-04-23
Mistral Small 3.1 mistral-ai/mistral-small-2503 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-03-01
Mistral Large 24.11 mistral-ai/mistral-large-2411 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-11-01
Mistral Medium 3 (25.05) mistral-ai/mistral-medium-2505 128K 32.8K - - 🧠 🔧 🌡️ 2024-09 In: text, image
Out: text
Released: 2025-05-01
Ministral 3B mistral-ai/ministral-3b 128K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-10-22
Mistral Nemo mistral-ai/mistral-nemo 128K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Open Weights
Released: 2024-07-18
Codestral 25.01 mistral-ai/codestral-2501 32K 8.2K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2025-01-01
Cohere Command R 08-2024 cohere/cohere-command-r-08-2024 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command R+ 08-2024 cohere/cohere-command-r-plus-08-2024 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-08-01
Cohere Command A cohere/cohere-command-a 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-11-01
Cohere Command R cohere/cohere-command-r 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-03-11
Updated: 2024-08-01
Cohere Command R+ cohere/cohere-command-r-plus 128K 4.1K - - 🧠 🔧 🌡️ 2024-03 In: text
Out: text
Released: 2024-04-04
Updated: 2024-08-01
JAIS 30b Chat core42/jais-30b-chat 8.2K 2K - - 🧠 🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2023-08-30

Google

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-20
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 1.5 Flash gemini-1.5-flash 1M 8.2K In: $0.075
Out: $0.3
Cache Read: $0.01875
Model: 0.037
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-05-14
Gemini 2.0 Flash Lite gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 1.5 Pro gemini-1.5-pro 1M 8.2K In: $1.25
Out: $5
Cache Read: $0.3125
Model: 0.625
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-02-15
Gemini 1.5 Flash-8B gemini-1.5-flash-8b 1M 8.2K In: $0.0375
Out: $0.15
Cache Read: $0.01
Model: 0.019
Completion: 4.000
Cache: 0.267
📎 🔧 🌡️ 2024-04 In: text, image, audio, video
Out: text
Released: 2024-10-03
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 65.5K 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-04-17
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite 1M 65.5K In: $0.1
Out: $0.4
Model: 0.050
Completion: 4.000
🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
-

Vertex

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Gemini 2.5 Flash Preview 05-20 gemini-2.5-flash-preview-05-20 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-20
Gemini 2.5 Pro gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.0 Flash Lite gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.383
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemini 2.0 Flash gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash Lite Preview 06-17 gemini-2.5-flash-lite-preview-06-17 65.5K 65.5K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash Preview 04-17 gemini-2.5-flash-preview-04-17 1M 65.5K In: $0.15
Out: $0.6
Cache Read: $0.0375
Model: 0.075
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-04-17

Vertex (Anthropic)

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Haiku 3.5 claude-3-5-haiku@20241022 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Opus 4 claude-opus-4@20250514 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 3.5 v2 claude-3-5-sonnet@20241022 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Opus 4.1 claude-opus-4-1@20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 claude-3-7-sonnet@20250219 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4 claude-sonnet-4@20250514 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
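
Because the Pricing column is quoted in US dollars per one million tokens, a per-request estimate is a simple proportion. The sketch below uses the Claude Sonnet 4 prices from the row above (In: $3, Out: $15); the function name `request_cost_usd` and the token counts are illustrative assumptions, and cache pricing is deliberately left out.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate the cost of one request from $/1M-token prices.

    Ignores cache read/write pricing and any provider-specific surcharges.
    """
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens
# at In: $3 and Out: $15 per 1M tokens.
cost = request_cost_usd(10_000, 2_000, 3.0, 15.0)
print(f"${cost:.4f}")
```

With these inputs the prompt and the completion each contribute $0.03, for an estimated total of $0.06.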

Groq

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.1 8B Instant llama-3.1-8b-instant 131.1K 8.2K In: $0.05
Out: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Qwen QwQ 32B qwen-qwq-32b 131.1K 16.4K In: $0.29
Out: $0.39
Model: 0.145
Completion: 1.345
🧠 🔧 🌡️ 2024-09 In: text
Out: text
Open Weights
Released: 2024-11-27
Llama 3 70B llama3-70b-8192 8.2K 8.2K In: $0.59
Out: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-04-18
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b 131.1K 8.2K In: $0.75
Out: $0.99
Model: 0.375
Completion: 1.320
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Llama 3 8B llama3-8b-8192 8.2K 8.2K In: $0.05
Out: $0.08
Model: 0.025
Completion: 1.600
🔧 🌡️ 2023-03 In: text
Out: text
Open Weights
Released: 2024-04-18
Gemma 2 9B gemma2-9b-it 8.2K 8.2K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2024-06-27
Llama 3.3 70B Versatile llama-3.3-70b-versatile 131.1K 32.8K In: $0.59
Out: $0.79
Model: 0.295
Completion: 1.339
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Mistral Saba 24B mistral-saba-24b 32.8K 32.8K In: $0.79
Out: $0.79
Model: 0.395
Completion: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2025-02-06
Llama Guard 3 8B llama-guard-3-8b 8.2K 8.2K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🌡️ - In: text
Out: text
Open Weights
Released: 2024-07-23
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.75
Model: 0.075
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Llama Guard 4 12B meta-llama/llama-guard-4-12b 131.1K 128 In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🌡️ - In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct 131.1K 8.2K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct 131.1K 8.2K In: $0.11
Out: $0.34
Model: 0.055
Completion: 3.091
🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Qwen3 32B qwen/qwen3-32b 131.1K 16.4K In: $0.29
Out: $0.59
Model: 0.145
Completion: 2.034
🧠 🔧 🌡️ 2024-11-08 In: text
Out: text
Open Weights
Released: 2024-12-23
Kimi K2 Instruct 0905 moonshotai/kimi-k2-instruct-0905 262.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 Instruct moonshotai/kimi-k2-instruct 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

Hugging Face

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek-V3-0324 deepseek-ai/Deepseek-V3-0324 16.4K 8.2K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 163.8K 163.8K In: $3
Out: $5
Model: 1.500
Completion: 1.667
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.3
Out: $3
Model: 0.150
Completion: 10.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
GLM-4.5-Air zai-org/GLM-4.5-Air 128K 96K In: $0.2
Out: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 zai-org/GLM-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

Inception

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Mercury Coder | mercury-coder | 128K | 16.4K | In: $0.25; Out: $1; Cache Read: $0.25; Cache Write: $1 | Model: 0.125; Completion: 4.000; Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text; Out: text | Released: 2025-02-26; Updated: 2025-07-31 |
| Mercury | mercury | 128K | 16.4K | In: $0.25; Out: $1; Cache Read: $0.25; Cache Write: $1 | Model: 0.125; Completion: 4.000; Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text; Out: text | Released: 2025-06-26; Updated: 2025-07-31 |

Inference

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Llama 3.2 3B Instruct meta/llama-3.2-3b-instruct 16K 4.1K In: $0.02
Out: $0.02
Model: 0.010
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 1B Instruct meta/llama-3.2-1b-instruct 16K 4.1K In: $0.01
Out: $0.01
Model: 0.005
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct 16K 4.1K In: $0.025
Out: $0.025
Model: 0.013
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Llama 3.2 11B Vision Instruct meta/llama-3.2-11b-vision-instruct 16K 4.1K In: $0.055
Out: $0.055
Model: 0.028
Completion: 1.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Mistral Nemo 12B Instruct mistral/mistral-nemo-12b-instruct 16K 4.1K In: $0.038
Out: $0.1
Model: 0.019
Completion: 2.632
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Google Gemma 3 google/gemma-3 125K 4.1K In: $0.15
Out: $0.3
Model: 0.075
Completion: 2.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Osmosis Structure 0.6B osmosis/osmosis-structure-0.6b 4K 2K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01
Qwen 2.5 7B Vision Instruct qwen/qwen-2.5-7b-vision-instruct 125K 4.1K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-01-01
Qwen 3 Embedding 4B qwen/qwen3-embedding-4b 32K 2K In: $0.01
Out: -
Model: 0.005 - 2024-12 In: text
Out: text
Open Weights
Released: 2025-01-01

Llama

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Groq-Llama-4-Maverick-17B-128E-Instruct groq-llama-4-maverick-17b-128e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct llama-3.3-70b-instruct 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Maverick-17B-128E-Instruct-FP8 llama-4-maverick-17b-128e-instruct-fp8 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-4-Scout-17B-16E-Instruct-FP8 llama-4-scout-17b-16e-instruct-fp8 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Cerebras-Llama-4-Scout-17B-16E-Instruct cerebras-llama-4-scout-17b-16e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-8B-Instruct llama-3.3-8b-instruct 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Cerebras-Llama-4-Maverick-17B-128E-Instruct cerebras-llama-4-maverick-17b-128e-instruct 128K 4.1K - - 📎 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-04-05

LMStudio

📖 API Address | 📚 Official Documentation

| Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | - | - | 🧠 🔧 🌡️ | - | In: text; Out: text | Open Weights; Released: 2025-08-05 |
| Qwen3 Coder 30B | qwen/qwen3-coder-30b | 262.1K | 65.5K | - | - | 🔧 🌡️ | 2025-04 | In: text; Out: text | Open Weights; Released: 2025-07-23 |
| Qwen3 30B A3B 2507 | qwen/qwen3-30b-a3b-2507 | 262.1K | 16.4K | - | - | 🔧 🌡️ | 2025-04 | In: text; Out: text | Open Weights; Released: 2025-07-30 |

Mistral

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Pixtral Large pixtral-large-latest 128K 128K In: $2
Out: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mixtral 8x7B open-mixtral-8x7b 32K 32K In: $0.7
Out: $0.7
Model: 0.350
Completion: 1.000
🔧 🌡️ 2024-01 In: text
Out: text
Open Weights
Released: 2023-12-11
Codestral codestral-latest 256K 4.1K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Devstral Small 2505 devstral-small-2505 128K 128K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-07
Devstral Medium devstral-medium-2507 128K 128K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Mistral Medium 3 mistral-medium-2505 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Devstral Small devstral-small-2507 128K 128K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Ministral 8B ministral-8b-latest 128K 128K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Magistral Medium magistral-medium-latest 128K 16.4K In: $2
Out: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20
Pixtral 12B pixtral-12b 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Mistral 7B open-mistral-7b 8K 8K In: $0.25
Out: $0.25
Model: 0.125
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2023-09-27
Magistral Small magistral-small 128K 128K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Mistral Medium 3.1 mistral-medium-2508 262.1K 262.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Mixtral 8x22B open-mixtral-8x22b 64K 64K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-17
Mistral Medium mistral-medium-latest 128K 16.4K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text, image
Out: text
Open Weights
Released: 2025-05-07
Updated: 2025-05-10
Mistral Small mistral-small-latest 128K 16.4K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Updated: 2024-09-04
Mistral Large mistral-large-latest 131.1K 16.4K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Mistral Nemo mistral-nemo 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-01
Ministral 3B ministral-3b-latest 128K 128K In: $0.04
Out: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04

ModelScope

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen3 30B A3B Thinking 2507 Qwen/Qwen3-30B-A3B-Thinking-2507 262.1K 32.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 30B A3B Instruct 2507 Qwen/Qwen3-30B-A3B-Instruct-2507 262.1K 16.4K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-30
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 Coder 30B A3B Instruct Qwen/Qwen3-Coder-30B-A3B-Instruct 262.1K 65.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-31
GLM-4.5 ZhipuAI/GLM-4.5 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 128K 16.4K - - 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
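
The Context and Output values in these tables (262.1K, 131.1K, 16.4K) look like raw token limits such as 262144, 131072 and 16384, divided by 1000 and rounded to one decimal place. That is an inference from the numbers shown, not something this page states. The Python sketch below reproduces that display format under this assumption; `format_tokens` is a hypothetical helper name.

```python
def format_tokens(n: int) -> str:
    """Render a raw token count in the short form used in the tables.

    Assumption: decimal units (1K = 1000, 1M = 1,000,000), one decimal
    place, and a trailing ".0" dropped.
    """
    if n >= 1_000_000:
        value, suffix = n / 1_000_000, "M"
    elif n >= 1_000:
        value, suffix = n / 1_000, "K"
    else:
        return str(n)
    text = f"{value:.1f}"
    if text.endswith(".0"):
        text = text[:-2]
    return text + suffix


for limit in (262144, 131072, 128000, 16384):
    print(limit, "->", format_tokens(limit))
```

Under this reading, a limit of 128000 and a limit of 131072 show up as 128K and 131.1K respectively, which may explain why the same model appears with slightly different context sizes across providers.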

Moonshot AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 Turbo kimi-k2-turbo-preview 131.1K 16.4K In: $2.4
Out: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
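
The NewAPI Ratios column appears to follow directly from the listed prices. The rows on this page are consistent with a model ratio equal to the input price divided by 2 (so $2 per 1M tokens corresponds to a ratio of 1), a completion ratio equal to the output price divided by the input price, and a cache ratio equal to the cache-read price divided by the input price. The Python sketch below encodes that reading; the function name `newapi_ratios` and the base-price constant are assumptions for illustration, not NewAPI's own code.

```python
from typing import Optional

# Assumption: a model ratio of 1.0 corresponds to $2 per 1M input tokens,
# which matches the rows listed on this page.
BASE_INPUT_PRICE_USD = 2.0


def newapi_ratios(input_price: float, output_price: float,
                  cache_read_price: Optional[float] = None) -> dict:
    """Derive NewAPI-style ratios from $/1M-token prices (hypothetical helper)."""
    if input_price <= 0:
        raise ValueError("input_price must be positive")
    ratios = {
        "model": round(input_price / BASE_INPUT_PRICE_USD, 3),
        "completion": round(output_price / input_price, 3),
    }
    # The cache ratio is only reported when a cache-read price is listed.
    if cache_read_price is not None:
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# Kimi K2 Turbo row: In $2.4, Out $10, Cache Read $0.6
print(newapi_ratios(2.4, 10, 0.6))
```

Free models with no listed prices show "-" in both columns, so the helper rejects a non-positive input price rather than guessing a ratio.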

Moonshot AI (China)

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Kimi K2 0905 kimi-k2-0905-preview 262.1K 262.1K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 0711 kimi-k2-0711-preview 131.1K 16.4K In: $0.6
Out: $2.5
Cache Read: $0.15
Model: 0.300
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14
Kimi K2 Turbo kimi-k2-turbo-preview 131.1K 16.4K In: $2.4
Out: $10
Cache Read: $0.6
Model: 1.200
Completion: 4.167
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

Morph

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Auto auto 32K 32K In: $0.85
Out: $1.55
Model: 0.425
Completion: 1.824
- - In: text
Out: text
Released: 2024-06-01
Morph v3 Fast morph-v3-fast 16K 16K In: $0.8
Out: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15
Morph v3 Large morph-v3-large 32K 32K In: $0.9
Out: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15

OpenAI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GPT-5 Nano gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o3-pro o3-pro 200K 100K In: $20
Out: $80
Model: 10.000
Completion: 4.000
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-06-10
Codex Mini codex-mini-latest 200K 100K In: $1.5
Out: $6
Cache Read: $0.375
Model: 0.750
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-04 In: text
Out: text
Released: 2025-05-16
GPT-4.1 gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4 Turbo gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
o1 o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
o3-deep-research o3-deep-research 200K 100K In: $10
Out: $40
Cache Read: $2.5
Model: 5.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
GPT-5 gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o1-pro o1-pro 200K 100K In: $150
Out: $600
Model: 75.000
Completion: 4.000
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2025-03-19
o3 o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5 Chat (latest) gpt-5-chat-latest 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Mini gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o1-preview o1-preview 128K 32.8K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
🧠 🌡️ 2023-09 In: text
Out: text
Released: 2024-09-12
o4-mini-deep-research o4-mini-deep-research 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2024-06-26
GPT-4o mini gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-4.1 nano gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1 mini gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o1-mini o1-mini 128K 65.5K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 2023-09 In: text
Out: text
Released: 2024-09-12
GPT-4o gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-4 gpt-4 8.2K 8.2K In: $30
Out: $60
Model: 15.000
Completion: 2.000
📎 🔧 🌡️ 2023-11 In: text
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-3.5-turbo gpt-3.5-turbo 16.4K 4.1K In: $0.5
Out: $1.5
Cache Read: $1.25
Model: 0.250
Completion: 3.000
Cache: 2.500
🌡️ 2021-09-01 In: text
Out: text
Released: 2023-03-01
Updated: 2023-11-06
o4-mini o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
o3-mini o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
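
Prices in these tables are quoted in US dollars per 1M tokens, so the cost of one request is each token count multiplied by its price and divided by 1,000,000. The Python sketch below assumes that cached input tokens are billed at the Cache Read price instead of the input price. Actual billing rules vary by provider, and `estimate_cost` is a hypothetical helper, not part of any provider SDK.

```python
TOKENS_PER_PRICE_UNIT = 1_000_000  # prices on this page are per 1M tokens


def estimate_cost(input_tokens, output_tokens, input_price, output_price,
                  cached_tokens=0, cache_read_price=None):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from cache
    and is billed at cache_read_price instead of input_price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    if cache_read_price is None:
        # No cache price listed: bill cached tokens at the normal input rate.
        cache_read_price = input_price
    uncached = input_tokens - cached_tokens
    total = (uncached * input_price
             + cached_tokens * cache_read_price
             + output_tokens * output_price)
    return total / TOKENS_PER_PRICE_UNIT


# GPT-4.1 row: In $2, Out $8, Cache Read $0.5
cost = estimate_cost(100_000, 5_000, 2.0, 8.0,
                     cached_tokens=40_000, cache_read_price=0.5)
print(f"${cost:.4f}")  # prints $0.1800
```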

opencode

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok Code Fast 1 grok-code 256K 32K - - 📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-08-20
Qwen3 Coder qwen/qwen3-coder 262.1K 65.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23

OpenRouter

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
R1 0528 (free) deepseek/deepseek-r1-0528:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
DeepSeek-V3.1 deepseek/deepseek-chat-v3.1 163.8K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-08-21
DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 8.2K 8.2K - - 🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-23
DeepSeek V3 Base (free) deepseek/deepseek-v3-base:free 163.8K 163.8K - - 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-29
DeepSeek R1 0528 Qwen3 8B (free) deepseek/deepseek-r1-0528-qwen3-8b:free 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-29
DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 16.4K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
R1 (free) deepseek/deepseek-r1:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b 64K 8.2K - - 🧠 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-01-29
Reka Flash 3 rekaai/reka-flash-3 32.8K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-12
Qwerky 72B featherless/qwerky-72b 32.8K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-20
Horizon Beta openrouter/horizon-beta 256K 128K - - 📎 🔧 2025-07 In: text, image
Out: text
Released: 2025-08-01
Cypher Alpha (free) openrouter/cypher-alpha:free 1M 1M - - 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-01
Horizon Alpha openrouter/horizon-alpha 256K 128K - - 📎 🔧 2025-07 In: text, image
Out: text
Released: 2025-07-30
Claude Sonnet 4 anthropic/claude-sonnet-4 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4 anthropic/claude-opus-4 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Haiku 3.5 anthropic/claude-3.5-haiku 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
Claude Opus 4.1 anthropic/claude-opus-4.1 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 anthropic/claude-3.7-sonnet 200K 128K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-01 In: text, image
Out: text
Released: 2025-02-19
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Model: 0.025
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.05
Out: $0.2
Model: 0.025
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Chat (latest) openai/gpt-5-chat 400K 128K In: $1.25
Out: $10
Model: 0.625
Completion: 8.000
📎 🧠 🌡️ 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Model: 0.125
Completion: 8.000
📎 🧠 🔧 🌡️ 2024-10-01 In: text, image
Out: text
Released: 2025-08-07
GPT-4o-mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-07-18
GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
o4 Mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-16
GLM Z1 32B (free) thudm/glm-z1-32b:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-17
Sarvam-M (free) sarvamai/sarvam-m:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-25
Grok Code Fast 1 x-ai/grok-code-fast-1 256K 10K In: $0.2
Out: $1.5
Cache Read: $0.02
Model: 0.100
Completion: 7.500
Cache: 0.100
🧠 🔧 🌡️ 2025-08 In: text
Out: text
Released: 2025-08-26
Grok 3 Mini Beta x-ai/grok-3-mini-beta 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 x-ai/grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini x-ai/grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 x-ai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok 3 Beta x-ai/grok-3-beta 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Codestral 2508 mistralai/codestral-2508 256K 256K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-08-01
Mistral Medium 3 mistralai/mistral-medium-3 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-05-07
Devstral Small mistralai/devstral-small-2505 128K 128K In: $0.06
Out: $0.12
Model: 0.030
Completion: 2.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-07
Mistral Small 3.2 24B (free) mistralai/mistral-small-3.2-24b-instruct:free 96K 96K - - 📎 🔧 🌡️ 2025-06 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Devstral Medium mistralai/devstral-medium-2507 131.1K 131.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Mistral Small 3.2 24B Instruct mistralai/mistral-small-3.2-24b-instruct 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-06-20
Devstral Small 1.1 mistralai/devstral-small-2507 131.1K 131.1K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-07-10
Mistral Nemo (free) mistralai/mistral-nemo:free 131.1K 131.1K - - 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-07-19
Mistral Small 3.1 24B Instruct mistralai/mistral-small-3.1-24b-instruct 128K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-17
Mistral 7B Instruct (free) mistralai/mistral-7b-instruct:free 32.8K 32.8K - - 🔧 🌡️ 2024-05 In: text
Out: text
Open Weights
Released: 2024-05-27
Mistral Medium 3.1 mistralai/mistral-medium-3.1 262.1K 262.1K In: $0.4
Out: $2
Model: 0.200
Completion: 5.000
📎 🔧 🌡️ 2025-05 In: text, image
Out: text
Released: 2025-08-12
Devstral Small 2505 (free) mistralai/devstral-small-2505:free 32.8K 32.8K - - 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-21
Llama 4 Scout (free) meta-llama/llama-4-scout:free 64K 64K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct 131.1K 8.2K - - 📎 🌡️ 2023-12 In: text, image
Out: text
Open Weights
Released: 2024-09-25
Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free 65.5K 65.5K - - 🔧 🌡️ 2024-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemma 3 12B IT google/gemma-3-12b-it 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-13
Gemini 2.0 Flash google/gemini-2.0-flash-001 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.0 Flash Experimental (free) google/gemini-2.0-flash-exp:free 1M 1M - - 📎 🔧 🌡️ 2024-12 In: text, image
Out: text
Released: 2024-12-11
Gemma 3n 4B (free) google/gemma-3n-e4b-it:free 8.2K 8.2K - - 📎 🔧 🌡️ 2025-05 In: text, image, audio
Out: text
Open Weights
Released: 2025-05-20
Gemma 2 9B (free) google/gemma-2-9b-it:free 8.2K 8.2K - - 🔧 🌡️ 2024-06 In: text
Out: text
Open Weights
Released: 2024-06-28
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.0375
Model: 0.150
Completion: 8.333
Cache: 0.125
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-07-17
Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview-06-05 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-05
Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-05-06
Gemma 3n E4B IT google/gemma-3n-e4b-it 8.2K 8.2K - - 📎 🌡️ 2024-10 In: text, image, audio
Out: text
Open Weights
Released: 2025-05-20
Gemma 3 27B IT google/gemma-3-27b-it 96K 8.2K - - 📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-03-12
MAI DS R1 (free) microsoft/mai-ds-r1:free 163.8K 163.8K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-21
GLM 4.5 Air (free) z-ai/glm-4.5-air:free 128K 96K - - 🧠 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air z-ai/glm-4.5-air 128K 96K In: $0.2
Out: $1.1
Model: 0.100
Completion: 5.500
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V z-ai/glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM 4.5 z-ai/glm-4.5 128K 96K In: $0.6
Out: $2.2
Model: 0.300
Completion: 3.667
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Dolphin3.0 Mistral 24B cognitivecomputations/dolphin3.0-mistral-24b 32.8K 8.2K - - 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-02-13
Dolphin3.0 R1 Mistral 24B cognitivecomputations/dolphin3.0-r1-mistral-24b 32.8K 8.2K - - 🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-02-13
Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct 32.8K 8.2K - - 📎 🌡️ 2024-10 In: text, image
Out: text
Open Weights
Released: 2025-02-01
Qwen3 Coder 480B A35B Instruct (free) qwen/qwen3-coder:free 262.1K 66.5K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 262.1K 81.9K In: $0.078
Out: $0.312
Model: 0.039
Completion: 4.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 30B A3B (free) qwen/qwen3-30b-a3b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 32B (free) qwen/qwen3-32b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 Max qwen/qwen3-max 262.1K 32.8K In: $1.2
Out: $6
Model: 0.600
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-09-05
Qwen3 235B A22B (free) qwen/qwen3-235b-a22b:free 131.1K 131.1K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 131.1K 33K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-29
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct 32.8K 8.2K - - 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen2.5 VL 72B Instruct (free) qwen/qwen2.5-vl-72b-instruct:free 32.8K 32.8K - - 📎 🔧 🌡️ 2025-02 In: text, image
Out: text
Open Weights
Released: 2025-02-01
Qwen3 235B A22B Instruct 2507 (free) qwen/qwen3-235b-a22b-07-25:free 262.1K 131.1K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
QwQ 32B (free) qwen/qwq-32b:free 32.8K 32.8K - - 🧠 🔧 🌡️ 2025-03 In: text
Out: text
Open Weights
Released: 2025-03-05
Qwen2.5 VL 32B Instruct (free) qwen/qwen2.5-vl-32b-instruct:free 8.2K 8.2K - - 📎 🔧 🌡️ 2025-03 In: text, image, video
Out: text
Open Weights
Released: 2025-03-24
Qwen3 Coder qwen/qwen3-coder 262.1K 66.5K In: $0.3
Out: $1.2
Model: 0.150
Completion: 4.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3 8B (free) qwen/qwen3-8b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-07-25 262.1K 131.1K In: $0.15
Out: $0.85
Model: 0.075
Completion: 5.667
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen3 14B (free) qwen/qwen3-14b:free 41K 41K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
DeepSeek R1T2 Chimera (free) tngtech/deepseek-r1t2-chimera:free 163.8K 163.8K - - 🧠 🌡️ 2025-07 In: text
Out: text
Open Weights
Released: 2025-07-08
Hermes 4 405B nousresearch/hermes-4-405b 131.1K 131.1K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-08-25
Hermes 4 70B nousresearch/hermes-4-70b 131.1K 131.1K In: $0.13
Out: $0.4
Model: 0.065
Completion: 3.077
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-08-25
DeepHermes 3 Llama 3 8B Preview nousresearch/deephermes-3-llama-3-8b-preview 131.1K 8.2K - - 🧠 🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2025-02-28
Kimi K2 Instruct 0905 moonshotai/kimi-k2-0905 262.1K 16.4K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-09-05
Kimi K2 (free) moonshotai/kimi-k2:free 32.8K 32.8K - - 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi K2 moonshotai/kimi-k2 131.1K 32.8K In: $0.55
Out: $2.2
Model: 0.275
Completion: 4.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11
Kimi Dev 72B (free) moonshotai/kimi-dev-72b:free 131.1K 131.1K - - 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-06-16

Requesty

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Claude Opus 4 anthropic/claude-opus-4 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Opus 4.1 anthropic/claude-opus-4-1-20250805 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Sonnet 3.7 anthropic/claude-3-7-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-01 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4 anthropic/claude-4-sonnet-20250522 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
GPT-5 Nano openai/gpt-5-nano 16K 4K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text
Out: text
Released: 2025-08-07
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, audio, image, video
Out: text, audio, image
Released: 2025-08-07
GPT-5 Mini openai/gpt-5-mini 128K 32K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
GPT-4o Mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2024-10 In: text, image
Out: text
Released: 2024-07-18
GPT-4.1 Mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
o4 Mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 🌡️ 2024-06 In: text, image
Out: text
Released: 2025-04-16
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Cache Write: $2.375
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Cache Write: $0.55
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-06-17

submodel

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 75K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek V3 0324 deepseek-ai/DeepSeek-V3-0324 75K 163.8K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
DeepSeek R1 0528 deepseek-ai/DeepSeek-R1-0528 75K 163.8K In: $0.5
Out: $2.15
Model: 0.250
Completion: 4.300
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 235B A22B Thinking 2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.2
Out: $0.3
Model: 0.100
Completion: 1.500
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-23
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 262.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-23
GLM 4.5 FP8 zai-org/GLM-4.5-FP8 131.1K 131.1K In: $0.2
Out: $0.8
Model: 0.100
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5 Air zai-org/GLM-4.5-Air 131.1K 131.1K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-07-28

Synthetic

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V3.1 deepseek-ai/DeepSeek-V3.1 128K 128K In: $0.56
Out: $1.68
Model: 0.280
Completion: 3.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-21
DeepSeek V3 deepseek-ai/DeepSeek-V3 128K 128K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
DeepSeek V3 (0324) deepseek-ai/DeepSeek-V3-0324 128K 128K In: $1.2
Out: $1.2
Model: 0.600
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
DeepSeek R1 deepseek-ai/DeepSeek-R1 128K 128K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-01 In: text
Out: text
Open Weights
Released: 2025-01-20
DeepSeek R1 (0528) deepseek-ai/DeepSeek-R1-0528 128K 128K In: $3
Out: $8
Model: 1.500
Completion: 2.667
🧠 🔧 🌡️ - In: text
Out: text
Released: 2025-08-01
GPT OSS 120B openai/gpt-oss-120b 128K 32.8K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 235B A22B Thinking 2507 Qwen/qwen3-235b-a22b-thinking-2507 256K 32K In: $0.65
Out: $3
Model: 0.325
Completion: 4.615
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen2.5-Coder-32B-Instruct Qwen/qwen2.5-coder-32b-instruct 32.8K 32.8K In: $0.8
Out: $0.8
Model: 0.400
Completion: 1.000
🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-11-11
Qwen 3 235B Instruct Qwen/qwen-3-235b-a22b-instruct-2507 256K 32K In: $0.2
Out: $0.6
Model: 0.100
Completion: 3.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Qwen 3 Coder 480B Qwen/qwen-3-coder-480b 256K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
GLM 4.5 zai-org/glm-4.5 128K 96K In: $0.55
Out: $2.19
Model: 0.275
Completion: 3.982
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
Llama-3.1-405B-Instruct meta-llama/Llama-3.1-405B-Instruct 128K 32.8K In: $3
Out: $3
Model: 1.500
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-4-Maverick-17B-128E-Instruct-FP8 meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 524K 4.1K In: $0.22
Out: $0.88
Model: 0.110
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.2-3B-Instruct meta-llama/Llama-3.2-3B-Instruct 128K 128K In: $0.06
Out: $0.06
Model: 0.030
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
Llama-3.1-70B-Instruct meta-llama/Llama-3.1-70B-Instruct 128K 32.8K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-4-Scout-17B-16E-Instruct meta-llama/Llama-4-Scout-17B-16E-Instruct 328K 4.1K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct 128K 32.8K In: $0.9
Out: $0.9
Model: 0.450
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-3.1-8B-Instruct meta-llama/Llama-3.1-8B-Instruct 128K 32.8K In: $0.2
Out: $0.2
Model: 0.100
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Llama-3.2-1B-Instruct meta-llama/Llama-3.2-1B-Instruct 128K 60K In: $0.06
Out: $0.06
Model: 0.030
Completion: 1.000
🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2024-09-18
Updated: 2024-10-24
Kimi K2 moonshotai/Kimi-K2-Instruct 128K 32.8K In: $0.6
Out: $2.5
Model: 0.300
Completion: 4.167
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-11

Together AI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek V3 deepseek-ai/DeepSeek-V3 131.1K 12.3K In: $1.25
Out: $1.25
Model: 0.625
Completion: 1.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Updated: 2025-05-29
DeepSeek R1 deepseek-ai/DeepSeek-R1 163.8K 12.3K In: $3
Out: $7
Model: 1.500
Completion: 2.333
🧠 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2024-12-26
Updated: 2025-03-24
GPT OSS 120B openai/gpt-oss-120b 131.1K 131.1K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 262.1K 66.5K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-Turbo 131.1K 66.5K In: $0.88
Out: $0.88
Model: 0.440
Completion: 1.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Kimi K2 Instruct moonshotai/Kimi-K2-Instruct 131.1K 32.8K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

Upstage

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
solar-pro2 solar-pro2 65.5K 8.2K In: $0.25
Out: $0.25
Model: 0.125
Completion: 1.000
🧠 🔧 🌡️ 2025-03 In: text
Out: text
Released: 2025-05-20
solar-mini solar-mini 32.8K 4.1K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
🔧 🌡️ 2024-09 In: text
Out: text
Released: 2024-06-12
Updated: 2025-04-22

v0

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
v0-1.5-md v0-1.5-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
v0-1.0-md v0-1.0-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-22
v0-1.5-lg v0-1.5-lg 512K 32K In: $15
Out: $75
Model: 7.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09

Venice AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Qwen 2.5 Coder 32B qwen-2.5-coder-32b 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-06-14
Venice Uncensored 1.1 venice-uncensored 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-07-15
DeepSeek Coder V2 Lite deepseek-coder-v2-lite 131.1K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🌡️ 2021-09 In: text
Out: text
Open Weights
Released: 2025-06-22
Dolphin 72B dolphin-2.9.2-qwen2-72b 32.8K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🌡️ 2021-09 In: text
Out: text
Open Weights
Released: 2025-05-21
Venice Small qwen3-4b 32.8K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-07-27
Qwen 2.5 VL 72B qwen-2.5-vl 32.8K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2025-06-09
Llama 3.3 70B llama-3.3-70b 65.5K 8.2K In: $0.7
Out: $2.8
Model: 0.350
Completion: 4.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-06-09
Llama 3.1 405B llama-3.1-405b 65.5K 8.2K In: $1.5
Out: $6
Model: 0.750
Completion: 4.000
🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-06-30
Llama 3.2 3B llama-3.2-3b 131.1K 8.2K In: $0.15
Out: $0.6
Model: 0.075
Completion: 4.000
🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2025-05-23
DeepSeek R1 671B deepseek-r1-671b 131.1K 8.2K In: $3.5
Out: $14
Model: 1.750
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-06-05
Venice Reasoning qwen-2.5-qwq-32b 32.8K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🧠 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2025-07-08
Venice Large qwen3-235b 131.1K 8.2K In: $1.5
Out: $6
Model: 0.750
Completion: 4.000
🧠 🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-27
Venice Medium mistral-31-24b 131.1K 8.2K In: $0.5
Out: $2
Model: 0.250
Completion: 4.000
🔧 🌡️ 2023-10 In: text, image
Out: text
Open Weights
Released: 2025-07-15

Vercel AI Gateway

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek-R1 deepseek/deepseek-r1 128K 32.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Released: 2025-01-20
Updated: 2025-05-29
DeepSeek R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b 131.1K 8.2K In: $0.75
Out: $0.99
Model: 0.375
Completion: 1.320
🧠 🔧 🌡️ 2024-07 In: text
Out: text
Open Weights
Released: 2025-01-20
Llama-4-Scout-17B-16E-Instruct-FP8 meta/llama-4-scout 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct meta/llama-3.3-70b 128K 4.1K - - 📎 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Llama-4-Maverick-17B-128E-Instruct-FP8 meta/llama-4-maverick 128K 4.1K - - 📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Grok 3 Mini Fast xai/grok-3-mini-fast 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Cache Write: $4
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 4 xai/grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok 3 Fast xai/grok-3-fast 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Cache Write: $25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 Vision xai/grok-2-vision 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Grok 2 xai/grok-2 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Grok 3 Mini xai/grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 xai/grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Qwen 3 Coder 480B cerebras/qwen3-coder 131K 32K In: $2
Out: $2
Model: 1.000
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Claude Opus 4 anthropic/claude-4-opus 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Sonnet 3.5 v2 anthropic/claude-3.5-sonnet 200K 8.2K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-04-30 In: text, image
Out: text
Released: 2024-10-22
Claude Sonnet 3.7 anthropic/claude-3.7-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2024-10-31 In: text, image
Out: text
Released: 2025-02-19
Claude Sonnet 4 anthropic/claude-4-sonnet 200K 64K In: $3
Out: $15
Cache Read: $0.3
Cache Write: $3.75
Model: 1.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-05-22
Claude Haiku 3 anthropic/claude-3-haiku 200K 4.1K In: $0.25
Out: $1.25
Cache Read: $0.03
Cache Write: $0.3
Model: 0.125
Completion: 5.000
Cache: 0.120
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-03-13
Claude Opus 3 anthropic/claude-3-opus 200K 4.1K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2023-08-31 In: text, image
Out: text
Released: 2024-02-29
Claude Opus 4.1 anthropic/claude-4-1-opus 200K 32K In: $15
Out: $75
Cache Read: $1.5
Cache Write: $18.75
Model: 7.500
Completion: 5.000
Cache: 0.100
📎 🧠 🔧 🌡️ 2025-03-31 In: text, image
Out: text
Released: 2025-08-05
Claude Haiku 3.5 anthropic/claude-3-5-haiku 200K 8.2K In: $0.8
Out: $4
Cache Read: $0.08
Cache Write: $1
Model: 0.400
Completion: 5.000
Cache: 0.100
📎 🔧 🌡️ 2024-07-31 In: text, image
Out: text
Released: 2024-10-22
GPT OSS 20B openai/gpt-oss-20b 131.1K 32.8K In: $0.07
Out: $0.3
Model: 0.035
Completion: 4.286
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
GPT OSS 120B openai/gpt-oss-120b 131.1K 32.8K In: $0.1
Out: $0.5
Model: 0.050
Completion: 5.000
🧠 🔧 🌡️ - In: text
Out: text
Open Weights
Released: 2025-08-05
o3-mini openai/o3-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.55
Model: 0.550
Completion: 4.000
Cache: 0.500
🧠 🔧 2024-05 In: text
Out: text
Released: 2024-12-20
Updated: 2025-01-29
o4-mini openai/o4-mini 200K 100K In: $1.1
Out: $4.4
Cache Read: $0.28
Model: 0.550
Completion: 4.000
Cache: 0.255
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-4o openai/gpt-4o 128K 16.4K In: $2.5
Out: $10
Cache Read: $1.25
Model: 1.250
Completion: 4.000
Cache: 0.500
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-05-13
GPT-4.1 mini openai/gpt-4.1-mini 1M 32.8K In: $0.4
Out: $1.6
Cache Read: $0.1
Model: 0.200
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4.1 nano openai/gpt-4.1-nano 1M 32.8K In: $0.1
Out: $0.4
Cache Read: $0.03
Model: 0.050
Completion: 4.000
Cache: 0.300
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-4o mini openai/gpt-4o-mini 128K 16.4K In: $0.15
Out: $0.6
Cache Read: $0.08
Model: 0.075
Completion: 4.000
Cache: 0.533
📎 🔧 🌡️ 2023-09 In: text, image
Out: text
Released: 2024-07-18
GPT-5 Mini openai/gpt-5-mini 400K 128K In: $0.25
Out: $2
Cache Read: $0.03
Model: 0.125
Completion: 8.000
Cache: 0.120
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
o3 openai/o3 200K 100K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🧠 🔧 2024-05 In: text, image
Out: text
Released: 2025-04-16
GPT-5 openai/gpt-5 400K 128K In: $1.25
Out: $10
Cache Read: $0.13
Model: 0.625
Completion: 8.000
Cache: 0.104
📎 🧠 🔧 2024-09-30 In: text, image
Out: text
Released: 2025-08-07
o1 openai/o1 200K 100K In: $15
Out: $60
Cache Read: $7.5
Model: 7.500
Completion: 4.000
Cache: 0.500
📎 🧠 🔧 2023-09 In: text, image
Out: text
Released: 2024-12-05
GPT-4 Turbo openai/gpt-4-turbo 128K 4.1K In: $10
Out: $30
Model: 5.000
Completion: 3.000
📎 🔧 🌡️ 2023-12 In: text, image
Out: text
Released: 2023-11-06
Updated: 2024-04-09
GPT-4.1 openai/gpt-4.1 1M 32.8K In: $2
Out: $8
Cache Read: $0.5
Model: 1.000
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-04 In: text, image
Out: text
Released: 2025-04-14
GPT-5 Nano openai/gpt-5-nano 400K 128K In: $0.05
Out: $0.4
Cache Read: $0.01
Model: 0.025
Completion: 8.000
Cache: 0.200
📎 🧠 🔧 2024-05-30 In: text, image
Out: text
Released: 2025-08-07
Mixtral 8x22B mistral/mixtral-8x22b-instruct 64K 64K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-04 In: text
Out: text
Open Weights
Released: 2024-04-17
Mistral Small mistral/mistral-small 128K 16.4K In: $0.1
Out: $0.3
Model: 0.050
Completion: 3.000
🔧 🌡️ 2025-03 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Updated: 2024-09-04
Pixtral Large mistral/pixtral-large 128K 128K In: $2
Out: $6
Model: 1.000
Completion: 3.000
📎 🔧 🌡️ 2024-11 In: text, image
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
Ministral 3B mistral/ministral-3b 128K 128K In: $0.04
Out: $0.04
Model: 0.020
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Magistral Medium mistral/magistral-medium 128K 16.4K In: $2
Out: $5
Model: 1.000
Completion: 2.500
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Updated: 2025-03-20
Magistral Small mistral/magistral-small 128K 128K In: $0.5
Out: $1.5
Model: 0.250
Completion: 3.000
🧠 🔧 🌡️ 2025-06 In: text
Out: text
Open Weights
Released: 2025-03-17
Pixtral 12B mistral/pixtral-12b 128K 128K In: $0.15
Out: $0.15
Model: 0.075
Completion: 1.000
📎 🔧 🌡️ 2024-09 In: text, image
Out: text
Open Weights
Released: 2024-09-01
Codestral mistral/codestral 256K 4.1K In: $0.3
Out: $0.9
Model: 0.150
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-05-29
Updated: 2025-01-04
Ministral 8B mistral/ministral-8b 128K 128K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2024-10-01
Updated: 2024-10-04
Mistral Large mistral/mistral-large 131.1K 16.4K In: $2
Out: $6
Model: 1.000
Completion: 3.000
🔧 🌡️ 2024-11 In: text
Out: text
Open Weights
Released: 2024-11-01
Updated: 2024-11-04
v0-1.0-md vercel/v0-1.0-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-05-22
v0-1.5-md vercel/v0-1.5-md 128K 32K In: $3
Out: $15
Model: 1.500
Completion: 5.000
📎 🧠 🔧 🌡️ - In: text, image
Out: text
Released: 2025-06-09
Gemini 2.0 Flash google/gemini-2.0-flash 1M 8.2K In: $0.1
Out: $0.4
Cache Read: $0.025
Model: 0.050
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Flash google/gemini-2.5-flash 1M 65.5K In: $0.3
Out: $2.5
Cache Read: $0.075
Model: 0.150
Completion: 8.333
Cache: 0.250
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite 1M 8.2K In: $0.075
Out: $0.3
Model: 0.037
Completion: 4.000
📎 🔧 🌡️ 2024-06 In: text, image, audio, video, pdf
Out: text
Released: 2024-12-11
Gemini 2.5 Pro google/gemini-2.5-pro 1M 65.5K In: $1.25
Out: $10
Cache Read: $0.31
Model: 0.625
Completion: 8.000
Cache: 0.248
📎 🧠 🔧 🌡️ 2025-01 In: text, image, audio, video, pdf
Out: text
Released: 2025-03-20
Updated: 2025-06-05
Nova Micro amazon/nova-micro 128K 8.2K In: $0.035
Out: $0.14
Cache Read: $0.00875
Model: 0.018
Completion: 4.000
Cache: 0.250
🔧 🌡️ 2024-10 In: text
Out: text
Released: 2024-12-03
Nova Lite amazon/nova-lite 300K 8.2K In: $0.06
Out: $0.24
Cache Read: $0.015
Model: 0.030
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Nova Pro amazon/nova-pro 300K 8.2K In: $0.8
Out: $3.2
Cache Read: $0.2
Model: 0.400
Completion: 4.000
Cache: 0.250
📎 🔧 🌡️ 2024-10 In: text, image, video
Out: text
Released: 2024-12-03
Morph v3 Large morph/morph-v3-large 32K 32K In: $0.9
Out: $1.9
Model: 0.450
Completion: 2.111
- - In: text
Out: text
Released: 2024-08-15
Morph v3 Fast morph/morph-v3-fast 16K 16K In: $0.8
Out: $1.2
Model: 0.400
Completion: 1.500
- - In: text
Out: text
Released: 2024-08-15
Kimi K2 Instruct moonshotai/kimi-k2 131.1K 16.4K In: $1
Out: $3
Model: 0.500
Completion: 3.000
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

Weights & Biases

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
DeepSeek-V3-0324 deepseek-ai/DeepSeek-V3-0324 161K 8.2K In: $1.14
Out: $2.75
Model: 0.570
Completion: 2.412
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-03-24
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 161K 163.8K In: $1.35
Out: $5.4
Model: 0.675
Completion: 4.000
🧠 🔧 🌡️ 2025-05 In: text
Out: text
Open Weights
Released: 2025-05-28
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct 262.1K 66.5K In: $1
Out: $1.5
Model: 0.500
Completion: 1.500
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-23
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 262.1K 131.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-25
Qwen3 235B A22B Instruct 2507 Qwen/Qwen3-235B-A22B-Instruct-2507 262.1K 131.1K In: $0.1
Out: $0.1
Model: 0.050
Completion: 1.000
🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-04-28
Updated: 2025-07-21
Llama 4 Scout 17B 16E Instruct meta-llama/Llama-4-Scout-17B-16E-Instruct 64K 8.2K In: $0.17
Out: $0.66
Model: 0.085
Completion: 3.882
🧠 🔧 🌡️ 2024-12 In: text, image
Out: text
Open Weights
Released: 2025-04-05
Llama-3.3-70B-Instruct meta-llama/Llama-3.3-70B-Instruct 128K 32.8K In: $0.71
Out: $0.71
Model: 0.355
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-12-06
Meta-Llama-3.1-8B-Instruct meta-llama/Llama-3.1-8B-Instruct 128K 32.8K In: $0.22
Out: $0.22
Model: 0.110
Completion: 1.000
🧠 🔧 🌡️ 2023-12 In: text
Out: text
Open Weights
Released: 2024-07-23
Phi-4-mini-instruct microsoft/Phi-4-mini-instruct 128K 4.1K In: $0.08
Out: $0.35
Model: 0.040
Completion: 4.375
🧠 🔧 🌡️ 2023-10 In: text
Out: text
Open Weights
Released: 2024-12-11
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct 128K 16.4K In: $1.35
Out: $4
Model: 0.675
Completion: 2.963
🔧 🌡️ 2024-10 In: text
Out: text
Open Weights
Released: 2025-07-14

xAI

📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
Grok 3 Mini Fast Latest grok-3-mini-fast-latest 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Cache Write: $4
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini Latest grok-3-mini-latest 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok Beta grok-beta 131.1K 4.1K In: $5
Out: $15
Cache Read: $5
Cache Write: $15
Model: 2.500
Completion: 3.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-11-01
Grok 3 Fast Latest grok-3-fast-latest 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Cache Write: $25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 grok-3 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 3 Mini grok-3-mini 131.1K 8.2K In: $0.3
Out: $0.5
Cache Read: $0.075
Cache Write: $0.5
Model: 0.150
Completion: 1.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 Vision (1212) grok-2-vision-1212 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok 2 grok-2 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Grok 2 Vision Latest grok-2-vision-latest 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok 3 Latest grok-3-latest 131.1K 8.2K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 Vision grok-2-vision 8.2K 4.1K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-08-20
Grok 2 Latest grok-2-latest 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-08-20
Updated: 2024-12-12
Grok 3 Fast grok-3-fast 131.1K 8.2K In: $5
Out: $25
Cache Read: $1.25
Cache Write: $25
Model: 2.500
Completion: 5.000
Cache: 0.250
🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17
Grok 2 (1212) grok-2-1212 131.1K 8.2K In: $2
Out: $10
Cache Read: $2
Cache Write: $10
Model: 1.000
Completion: 5.000
Cache: 1.000
🔧 🌡️ 2024-08 In: text
Out: text
Released: 2024-12-12
Grok 4 grok-4 256K 64K In: $3
Out: $15
Cache Read: $0.75
Cache Write: $15
Model: 1.500
Completion: 5.000
Cache: 0.250
🧠 🔧 🌡️ 2025-07 In: text
Out: text
Released: 2025-07-09
Grok Vision Beta grok-vision-beta 8.2K 4.1K In: $5
Out: $15
Cache Read: $5
Cache Write: $15
Model: 2.500
Completion: 3.000
Cache: 1.000
📎 🔧 🌡️ 2024-08 In: text, image
Out: text
Released: 2024-11-01
Grok 3 Mini Fast grok-3-mini-fast 131.1K 8.2K In: $0.6
Out: $4
Cache Read: $0.15
Cache Write: $4
Model: 0.300
Completion: 6.667
Cache: 0.250
🧠 🔧 🌡️ 2024-11 In: text
Out: text
Released: 2025-02-17

Z.AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.5-Air glm-4.5-air 131.1K 98.3K In: $0.2
Out: $1.1
Cache Read: $0.03
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5 glm-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28

Zhipu AI

📖 API Address | 📚 Official Documentation

Model Model ID Context Output Pricing ($/1M) NewAPI Ratios Capabilities Knowledge Modalities Details
GLM-4.5 glm-4.5 131.1K 98.3K In: $0.6
Out: $2.2
Cache Read: $0.11
Model: 0.300
Completion: 3.667
Cache: 0.183
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM-4.5-Flash glm-4.5-flash 131.1K 98.3K - - 🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
GLM 4.5V glm-4.5v 64K 16.4K In: $0.6
Out: $1.8
Model: 0.300
Completion: 3.000
📎 🧠 🔧 🌡️ 2025-04 In: text, image, video
Out: text
Open Weights
Released: 2025-08-11
GLM-4.5-Air glm-4.5-air 131.1K 98.3K In: $0.2
Out: $1.1
Cache Read: $0.03
Model: 0.100
Completion: 5.500
Cache: 0.150
🧠 🔧 🌡️ 2025-04 In: text
Out: text
Open Weights
Released: 2025-07-28
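
Note on NewAPI Ratios: the values in that column appear to follow directly from the listed prices. This page does not document the formula, so the sketch below is an inference from the rows above, not a published rule. It assumes a model ratio of 1.0 equals $2 per 1M input tokens, that the completion ratio is output price divided by input price, and that the cache ratio is cache-read price divided by input price. The function name `newapi_ratios` and the constant `BASE_INPUT_PRICE_PER_1M` are hypothetical, chosen for illustration.

```python
from typing import Dict, Optional

# Assumption inferred from the tables: ratio 1.0 == $2 per 1M input tokens.
BASE_INPUT_PRICE_PER_1M = 2.0


def newapi_ratios(
    input_price: Optional[float],
    output_price: Optional[float],
    cache_read_price: Optional[float] = None,
) -> Dict[str, float]:
    """Derive ratios from $/1M-token prices.

    Returns an empty dict when pricing is missing (rows shown as "-").
    """
    if not input_price or output_price is None:
        return {}
    ratios = {
        # Input price relative to the assumed $2 baseline.
        "model": round(input_price / BASE_INPUT_PRICE_PER_1M, 3),
        # Output price expressed as a multiple of the input price.
        "completion": round(output_price / input_price, 3),
    }
    if cache_read_price is not None:
        # Cache-read price expressed as a fraction of the input price.
        ratios["cache"] = round(cache_read_price / input_price, 3)
    return ratios


# Grok 3 Mini Fast row: In $0.6, Out $4, Cache Read $0.15
print(newapi_ratios(0.6, 4.0, 0.15))
# Kimi K2 Instruct row: In $1, Out $3, no cache pricing
print(newapi_ratios(1.0, 3.0))
```

For the Grok 3 Mini Fast row this yields 0.3, 6.667 and 0.25, matching the listed Model: 0.300, Completion: 6.667 and Cache: 0.250. Rows without pricing (for example GLM-4.5-Flash) yield no ratios.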