Data Browser¶
This page displays comprehensive information about all LLM providers and models, automatically generated from API data.
Statistics
Provider Count: 57 Model Count: 967 Last Updated: 10/21/2025, 11:36:21 AM
Capabilities Legend: 🧠 Reasoning 🔧 Tools 📎 Attachment 🌡️ Temperature
AIHubMix¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | In: $0.1 Out: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
Qwen3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | 262.1K | 262.1K | In: $0.28 Out: $1.12 | Model: 0.140 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | In: $16.5 Out: $82.5 Cache Read: $1.5 Cache Write: $18.75 | Model: 8.250 Completion: 5.000 Cache: 0.091 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | In: $1.1 Out: $5.5 Cache Read: $0.11 Cache Write: $1.25 | Model: 0.550 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65K | In: $0.075 Out: $0.3 Cache Read: $0.02 | Model: 0.037 Completion: 4.000 Cache: 0.267 | 📎 🔧 🌡️ | 2025-04 | In: text, image, audio, video Out: text | Released: 2025-09-15 |
GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
Claude Sonnet 4.5 | claude-sonnet-4-5 | 200K | 64K | In: $3.3 Out: $16.5 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.650 Completion: 5.000 Cache: 0.091 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
DeepSeek-V3.2-Exp | DeepSeek-V3.2-Exp | 163K | 163K | In: $0.27 Out: $0.41 | Model: 0.135 Completion: 1.519 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-29 |
Qwen3 235B A22B Thinking 2507 | qwen3-235b-a22b-thinking-2507 | 262.1K | 262.1K | In: $0.28 Out: $2.8 | Model: 0.140 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
GPT-5-Nano | gpt-5-nano | 128K | 16.4K | In: $0.5 Out: $2 Cache Read: $0.25 | Model: 0.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-5-Codex | gpt-5-codex | 400K | 128K | - | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-4o | gpt-4o | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
GPT-4.1 | gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GLM-4.6 | glm-4.6 | 204.8K | 204.8K | In: $0.27 Out: $1.1 Cache Read: $0.11 | Model: 0.135 Completion: 4.074 Cache: 0.407 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
o4-mini | o4-mini | 200K | 65.5K | In: $1.5 Out: $6 Cache Read: $0.75 | Model: 0.750 Completion: 4.000 Cache: 0.500 | 🧠 | 2024-09 | In: text Out: text | Released: 2025-09-15 |
GPT-5-Mini | gpt-5-mini | 200K | 64K | In: $1.5 Out: $6 Cache Read: $0.75 | Model: 0.750 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
Gemini 2.5 Pro | gemini-2.5-pro | 2M | 65K | In: $1.25 Out: $5 Cache Read: $0.31 | Model: 0.625 Completion: 4.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, audio, video Out: text | Released: 2025-09-15 |
DeepSeek-V3.2-Exp-Think | DeepSeek-V3.2-Exp-Think | 131K | 64K | In: $0.27 Out: $0.41 | Model: 0.135 Completion: 1.519 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-29 |
GPT-4o (2024-11-20) | gpt-4o-2024-11-20 | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-11-20 |
Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 131K | In: $0.82 Out: $3.29 | Model: 0.410 Completion: 4.012 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
GPT-5 | gpt-5 | 400K | 128K | In: $5 Out: $20 Cache Read: $2.5 | Model: 2.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
Kimi K2 0905 | Kimi-K2-0905 | 262.1K | 262.1K | In: $0.55 Out: $2.19 | Model: 0.275 Completion: 3.982 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
GPT-5-Pro | gpt-5-pro | 400K | 128K | In: $7 Out: $28 Cache Read: $3.5 | Model: 3.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
Alibaba¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Qwen3-LiveTranslate Flash Realtime | qwen3-livetranslate-flash-realtime | 53.2K | 4.1K | In: $10 Out: $10 | Model: 5.000 Completion: 1.000 | 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-22 |
Qwen3-ASR Flash | qwen3-asr-flash | 53.2K | 4.1K | In: $0.035 Out: $0.035 | Model: 0.018 Completion: 1.000 | - | 2024-04 | In: audio Out: text | Released: 2025-09-08 |
Qwen-Omni Turbo | qwen-omni-turbo | 32.8K | 2K | In: $0.07 Out: $0.27 | Model: 0.035 Completion: 3.857 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-01-19 Updated: 2025-03-26 |
Qwen-VL Max | qwen-vl-max | 131.1K | 8.2K | In: $0.8 Out: $3.2 | Model: 0.400 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-04-08 Updated: 2025-08-13 |
Qwen3-Next 80B-A3B Instruct | qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
Qwen Turbo | qwen-turbo | 1M | 16.4K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-11-01 Updated: 2025-04-28 |
Qwen3-VL 235B-A22B | qwen3-vl-235b-a22b | 131.1K | 32.8K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
Qwen3 Coder Flash | qwen3-coder-flash | 1M | 65.5K | In: $0.3 Out: $1.5 | Model: 0.150 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-28 |
Qwen3-VL 30B-A3B | qwen3-vl-30b-a3b | 131.1K | 32.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
Qwen3 14B | qwen3-14b | 131.1K | 8.2K | In: $0.35 Out: $1.4 | Model: 0.175 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
QVQ Max | qvq-max | 131.1K | 8.2K | In: $1.2 Out: $4.8 | Model: 0.600 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-03-25 |
Qwen Plus Character (Japanese) | qwen-plus-character-ja | 8.2K | 512 | In: $0.5 Out: $1.4 | Model: 0.250 Completion: 2.800 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
Qwen2.5 14B Instruct | qwen2-5-14b-instruct | 131.1K | 8.2K | In: $0.35 Out: $1.4 | Model: 0.175 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
QwQ Plus | qwq-plus | 131.1K | 8.2K | In: $0.8 Out: $2.4 | Model: 0.400 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-03-05 |
Qwen3-Coder 30B-A3B Instruct | qwen3-coder-30b-a3b-instruct | 262.1K | 65.5K | In: $0.45 Out: $2.25 | Model: 0.225 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-VL OCR | qwen-vl-ocr | 34.1K | 4.1K | In: $0.72 Out: $0.72 | Model: 0.360 Completion: 1.000 | 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-28 Updated: 2025-04-13 |
Qwen2.5 72B Instruct | qwen2-5-72b-instruct | 131.1K | 8.2K | In: $1.4 Out: $5.6 | Model: 0.700 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen3-Omni Flash | qwen3-omni-flash | 65.5K | 16.4K | In: $0.43 Out: $1.66 | Model: 0.215 Completion: 3.860 | 🧠 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
Qwen Flash | qwen-flash | 1M | 32.8K | In: $0.05 Out: $0.4 | Model: 0.025 Completion: 8.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-07-28 |
Qwen3 8B | qwen3-8b | 131.1K | 8.2K | In: $0.18 Out: $0.7 | Model: 0.090 Completion: 3.889 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen3-Omni Flash Realtime | qwen3-omni-flash-realtime | 65.5K | 16.4K | In: $0.52 Out: $1.99 | Model: 0.260 Completion: 3.827 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
Qwen2.5-VL 72B Instruct | qwen2-5-vl-72b-instruct | 131.1K | 8.2K | In: $2.8 Out: $8.4 | Model: 1.400 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
Qwen3-VL Plus | qwen3-vl-plus | 262.1K | 32.8K | In: $0.2 Out: $1.6 | Model: 0.100 Completion: 8.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Released: 2025-09-23 |
Qwen Plus | qwen-plus | 1M | 32.8K | In: $0.4 Out: $1.2 | Model: 0.200 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01-25 Updated: 2025-09-11 |
Qwen2.5 32B Instruct | qwen2-5-32b-instruct | 131.1K | 8.2K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen2.5-Omni 7B | qwen2-5-omni-7b | 32.8K | 2K | In: $0.1 Out: $0.4 | Model: 0.050 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Open Weights Released: 2024-12 |
Qwen Max | qwen-max | 32.8K | 8.2K | In: $1.6 Out: $6.4 | Model: 0.800 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-04-03 Updated: 2025-01-25 |
Qwen2.5 7B Instruct | qwen2-5-7b-instruct | 131.1K | 8.2K | In: $0.175 Out: $0.7 | Model: 0.087 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen2.5-VL 7B Instruct | qwen2-5-vl-7b-instruct | 131.1K | 8.2K | In: $0.35 Out: $1.05 | Model: 0.175 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
Qwen3 235B-A22B | qwen3-235b-a22b | 131.1K | 16.4K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-Omni Turbo Realtime | qwen-omni-turbo-realtime | 32.8K | 2K | In: $0.27 Out: $1.07 | Model: 0.135 Completion: 3.963 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-05-08 |
Qwen-MT Turbo | qwen-mt-turbo | 16.4K | 8.2K | In: $0.16 Out: $0.49 | Model: 0.080 Completion: 3.063 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
Qwen3-Coder 480B-A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 65.5K | In: $1.5 Out: $7.5 | Model: 0.750 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-MT Plus | qwen-mt-plus | 16.4K | 8.2K | In: $2.46 Out: $7.37 | Model: 1.230 Completion: 2.996 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
Qwen3 Max | qwen3-max | 262.1K | 65.5K | In: $1.2 Out: $6 | Model: 0.600 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
Qwen3 Coder Plus | qwen3-coder-plus | 1M | 65.5K | In: $1 Out: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3-Next 80B-A3B (Thinking) | qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | In: $0.5 Out: $6 | Model: 0.250 Completion: 12.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
Qwen3 32B | qwen3-32b | 131.1K | 16.4K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-VL Plus | qwen-vl-plus | 131.1K | 8.2K | In: $0.21 Out: $0.63 | Model: 0.105 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-01-25 Updated: 2025-08-15 |
DeepSeek R1 | deepseek-r1 | 128K | - | In: $4 Out: $16 | Model: 2.000 Completion: 4.000 | - | - | In: text Out: text | - |
Alibaba (China)¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
DeepSeek R1 Distill Qwen 7B | deepseek-r1-distill-qwen-7b | 32.8K | 16.4K | In: $0.072 Out: $0.144 | Model: 0.036 Completion: 2.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen3-ASR Flash | qwen3-asr-flash | 53.2K | 4.1K | In: $0.032 Out: $0.032 | Model: 0.016 Completion: 1.000 | - | 2024-04 | In: audio Out: text | Released: 2025-09-08 |
DeepSeek R1 0528 | deepseek-r1-0528 | 131.1K | 16.4K | In: $0.574 Out: $2.294 | Model: 0.287 Completion: 3.997 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-05-28 |
DeepSeek V3 | deepseek-v3 | 65.5K | 8.2K | In: $0.287 Out: $1.147 | Model: 0.143 Completion: 3.997 | 🔧 🌡️ | - | In: text Out: text | Released: 2024-12-01 |
Qwen-Omni Turbo | qwen-omni-turbo | 32.8K | 2K | In: $0.058 Out: $0.23 | Model: 0.029 Completion: 3.966 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-01-19 Updated: 2025-03-26 |
Qwen-VL Max | qwen-vl-max | 131.1K | 8.2K | In: $0.23 Out: $0.574 | Model: 0.115 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-04-08 Updated: 2025-08-13 |
DeepSeek V3.2 Exp | deepseek-v3-2-exp | 131.1K | 65.5K | In: $0.287 Out: $0.431 | Model: 0.143 Completion: 1.502 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen3-Next 80B-A3B Instruct | qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | In: $0.144 Out: $0.574 | Model: 0.072 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
DeepSeek R1 | deepseek-r1 | 131.1K | 16.4K | In: $0.574 Out: $2.294 | Model: 0.287 Completion: 3.997 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen Turbo | qwen-turbo | 1M | 16.4K | In: $0.044 Out: $0.087 | Model: 0.022 Completion: 1.977 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-11-01 Updated: 2025-07-15 |
Qwen3-VL 235B-A22B | qwen3-vl-235b-a22b | 131.1K | 32.8K | In: $0.286705 Out: $1.14682 | Model: 0.143 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
Qwen3 Coder Flash | qwen3-coder-flash | 1M | 65.5K | In: $0.144 Out: $0.574 | Model: 0.072 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-28 |
Qwen3-VL 30B-A3B | qwen3-vl-30b-a3b | 131.1K | 32.8K | In: $0.108 Out: $0.431 | Model: 0.054 Completion: 3.991 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
Qwen3 14B | qwen3-14b | 131.1K | 8.2K | In: $0.144 Out: $0.574 | Model: 0.072 Completion: 3.986 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
QVQ Max | qvq-max | 131.1K | 8.2K | In: $1.147 Out: $4.588 | Model: 0.574 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-03-25 |
DeepSeek R1 Distill Qwen 32B | deepseek-r1-distill-qwen-32b | 32.8K | 16.4K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen Plus Character | qwen-plus-character | 32.8K | 4.1K | In: $0.115 Out: $0.287 | Model: 0.058 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
Qwen2.5 14B Instruct | qwen2-5-14b-instruct | 131.1K | 8.2K | In: $0.144 Out: $0.431 | Model: 0.072 Completion: 2.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
QwQ Plus | qwq-plus | 131.1K | 8.2K | In: $0.23 Out: $0.574 | Model: 0.115 Completion: 2.496 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-03-05 |
Qwen2.5-Coder 32B Instruct | qwen2-5-coder-32b-instruct | 131.1K | 8.2K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-11 |
Qwen3-Coder 30B-A3B Instruct | qwen3-coder-30b-a3b-instruct | 262.1K | 65.5K | In: $0.216 Out: $0.861 | Model: 0.108 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen Math Plus | qwen-math-plus | 4.1K | 3.1K | In: $0.574 Out: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-08-16 Updated: 2024-09-19 |
Qwen-VL OCR | qwen-vl-ocr | 34.1K | 4.1K | In: $0.717 Out: $0.717 | Model: 0.358 Completion: 1.000 | 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-28 Updated: 2025-04-13 |
Qwen Doc Turbo | qwen-doc-turbo | 131.1K | 8.2K | In: $0.087 Out: $0.144 | Model: 0.043 Completion: 1.655 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
Qwen Deep Research | qwen-deep-research | 1M | 32.8K | In: $7.742 Out: $23.367 | Model: 3.871 Completion: 3.018 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
Qwen2.5 72B Instruct | qwen2-5-72b-instruct | 131.1K | 8.2K | In: $0.574 Out: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen3-Omni Flash | qwen3-omni-flash | 65.5K | 16.4K | In: $0.058 Out: $0.23 | Model: 0.029 Completion: 3.966 | 🧠 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
Qwen Flash | qwen-flash | 1M | 32.8K | In: $0.022 Out: $0.216 | Model: 0.011 Completion: 9.818 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-07-28 |
Qwen3 8B | qwen3-8b | 131.1K | 8.2K | In: $0.072 Out: $0.287 | Model: 0.036 Completion: 3.986 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen3-Omni Flash Realtime | qwen3-omni-flash-realtime | 65.5K | 16.4K | In: $0.23 Out: $0.918 | Model: 0.115 Completion: 3.991 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-09-15 |
Qwen2.5-VL 72B Instruct | qwen2-5-vl-72b-instruct | 131.1K | 8.2K | In: $2.294 Out: $6.881 | Model: 1.147 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
Qwen3-VL Plus | qwen3-vl-plus | 262.1K | 32.8K | In: $0.143353 Out: $1.433525 | Model: 0.072 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Released: 2025-09-23 |
Qwen Plus | qwen-plus | 1M | 32.8K | In: $0.115 Out: $0.287 | Model: 0.058 Completion: 2.496 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01-25 Updated: 2025-09-11 |
Qwen2.5 32B Instruct | qwen2-5-32b-instruct | 131.1K | 8.2K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen2.5-Omni 7B | qwen2-5-omni-7b | 32.8K | 2K | In: $0.087 Out: $0.345 | Model: 0.043 Completion: 3.966 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Open Weights Released: 2024-12 |
Qwen Max | qwen-max | 131.1K | 8.2K | In: $0.345 Out: $1.377 | Model: 0.172 Completion: 3.991 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-04-03 Updated: 2025-01-25 |
Qwen Long | qwen-long | 10M | 8.2K | In: $0.072 Out: $0.287 | Model: 0.036 Completion: 3.986 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01-25 |
Qwen2.5-Math 72B Instruct | qwen2-5-math-72b-instruct | 4.1K | 3.1K | In: $0.574 Out: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Moonshot Kimi K2 Instruct | moonshot-kimi-k2-instruct | 131.1K | 131.1K | In: $0.574 Out: $2.294 | Model: 0.287 Completion: 3.997 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Tongyi Intent Detect V3 | tongyi-intent-detect-v3 | 8.2K | 1K | In: $0.058 Out: $0.144 | Model: 0.029 Completion: 2.483 | 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
Qwen2.5 7B Instruct | qwen2-5-7b-instruct | 131.1K | 8.2K | In: $0.072 Out: $0.144 | Model: 0.036 Completion: 2.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen2.5-VL 7B Instruct | qwen2-5-vl-7b-instruct | 131.1K | 8.2K | In: $0.287 Out: $0.717 | Model: 0.143 Completion: 2.498 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
DeepSeek V3.1 | deepseek-v3-1 | 131.1K | 65.5K | In: $0.574 Out: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 32.8K | 16.4K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen3 235B-A22B | qwen3-235b-a22b | 131.1K | 16.4K | In: $0.287 Out: $1.147 | Model: 0.143 Completion: 3.997 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen2.5-Coder 7B Instruct | qwen2-5-coder-7b-instruct | 131.1K | 8.2K | In: $0.144 Out: $0.287 | Model: 0.072 Completion: 1.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-11 |
DeepSeek R1 Distill Qwen 14B | deepseek-r1-distill-qwen-14b | 32.8K | 16.4K | In: $0.144 Out: $0.431 | Model: 0.072 Completion: 2.993 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen-Omni Turbo Realtime | qwen-omni-turbo-realtime | 32.8K | 2K | In: $0.23 Out: $0.918 | Model: 0.115 Completion: 3.991 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-05-08 |
Qwen Math Turbo | qwen-math-turbo | 4.1K | 3.1K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-09-19 |
Qwen-MT Turbo | qwen-mt-turbo | 16.4K | 8.2K | In: $0.101 Out: $0.28 | Model: 0.051 Completion: 2.772 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
DeepSeek R1 Distill Llama 8B | deepseek-r1-distill-llama-8b | 32.8K | 16.4K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen3-Coder 480B-A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 65.5K | In: $0.861 Out: $3.441 | Model: 0.430 Completion: 3.997 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-MT Plus | qwen-mt-plus | 16.4K | 8.2K | In: $0.259 Out: $0.775 | Model: 0.130 Completion: 2.992 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
Qwen3 Max | qwen3-max | 262.1K | 65.5K | In: $0.861 Out: $3.441 | Model: 0.430 Completion: 3.997 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
QwQ 32B | qwq-32b | 131.1K | 8.2K | In: $0.287 Out: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-12 |
Qwen2.5-Math 7B Instruct | qwen2-5-math-7b-instruct | 4.1K | 3.1K | In: $0.144 Out: $0.287 | Model: 0.072 Completion: 1.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
Qwen3-Next 80B-A3B (Thinking) | qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | In: $0.144 Out: $1.434 | Model: 0.072 Completion: 9.958 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
DeepSeek R1 Distill Qwen 1.5B | deepseek-r1-distill-qwen-1-5b | 32.8K | 16.4K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
Qwen3 32B | qwen3-32b | 131.1K | 16.4K | In: $0.287 Out: $1.147 | Model: 0.143 Completion: 3.997 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
Qwen-VL Plus | qwen-vl-plus | 131.1K | 8.2K | In: $0.115 Out: $0.287 | Model: 0.058 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-01-25 Updated: 2025-08-15 |
Qwen3 Coder Plus | qwen3-coder-plus | 1M | 65.5K | In: $1 Out: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Amazon Bedrock¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Command R+ | cohere.command-r-plus-v1:0 | 128K | 4.1K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-04 |
Claude 2 | anthropic.claude-v2 | 100K | 4.1K | In: $8 Out: $24 | Model: 4.000 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-07-11 |
Claude Sonnet 3.7 | anthropic.claude-3-7-sonnet-20250219-v1:0 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
Claude Sonnet 4 | anthropic.claude-sonnet-4-20250514-v1:0 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-05-22 |
Llama 3.2 11B Instruct | meta.llama3-2-11b-instruct-v1:0 | 128K | 4.1K | In: $0.16 Out: $0.16 | Model: 0.080 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
Claude Haiku 3 | anthropic.claude-3-haiku-20240307-v1:0 | 200K | 4.1K | In: $0.25 Out: $1.25 | Model: 0.125 Completion: 5.000 | 📎 🔧 🌡️ | 2024-02 | In: text, image Out: text | Released: 2024-03-13 |
Llama 3.2 90B Instruct | meta.llama3-2-90b-instruct-v1:0 | 128K | 4.1K | In: $0.72 Out: $0.72 | Model: 0.360 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
Llama 3.2 1B Instruct | meta.llama3-2-1b-instruct-v1:0 | 131K | 4.1K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-09-25 |
Claude 2.1 | anthropic.claude-v2:1 | 200K | 4.1K | In: $8 Out: $24 | Model: 4.000 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-11-21 |
Command Light | cohere.command-light-text-v14 | 4.1K | 4.1K | In: $0.3 Out: $0.6 | Model: 0.150 Completion: 2.000 | 🌡️ | 2023-08 | In: text Out: text | Open Weights Released: 2023-11-01 |
Jamba 1.5 Large | ai21.jamba-1-5-large-v1:0 | 256K | 4.1K | In: $2 Out: $8 | Model: 1.000 Completion: 4.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2024-08-15 |
Llama 3.3 70B Instruct | meta.llama3-3-70b-instruct-v1:0 | 128K | 4.1K | In: $0.72 Out: $0.72 | Model: 0.360 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Claude Opus 3 | anthropic.claude-3-opus-20240229-v1:0 | 200K | 4.1K | In: $15 Out: $75 | Model: 7.500 Completion: 5.000 | 📎 🔧 🌡️ | 2023-08 | In: text, image Out: text | Released: 2024-02-29 |
Nova Pro | amazon.nova-pro-v1:0 | 300K | 8.2K | In: $0.8 Out: $3.2 Cache Read: $0.2 | Model: 0.400 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
Llama 3.1 8B Instruct | meta.llama3-1-8b-instruct-v1:0 | 128K | 4.1K | In: $0.22 Out: $0.22 | Model: 0.110 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Claude Sonnet 3.5 | anthropic.claude-3-5-sonnet-20240620-v1:0 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-06-20 |
Claude Haiku 4.5 | anthropic.claude-haiku-4-5-20251001-v1:0 | 200K | 64K | In: $1 Out: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-28 | In: text, image Out: text | Released: 2025-10-15 |
Command R | cohere.command-r-v1:0 | 128K | 4.1K | In: $0.5 Out: $1.5 | Model: 0.250 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-03-11 |
Nova Micro | amazon.nova-micro-v1:0 | 128K | 8.2K | In: $0.035 Out: $0.14 Cache Read: $0.00875 | Model: 0.018 Completion: 4.000 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-03 |
Llama 3.1 70B Instruct | meta.llama3-1-70b-instruct-v1:0 | 128K | 4.1K | In: $0.72 Out: $0.72 | Model: 0.360 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama 3 70B Instruct | meta.llama3-70b-instruct-v1:0 | 8.2K | 2K | In: $2.65 Out: $3.5 | Model: 1.325 Completion: 1.321 | 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
DeepSeek-R1 | deepseek.r1-v1:0 | 128K | 32.8K | In: $1.35 Out: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-05-29 |
Claude Sonnet 3.5 v2 | anthropic.claude-3-5-sonnet-20241022-v2:0 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-22 |
Command | cohere.command-text-v14 | 4.1K | 4.1K | In: $1.5 Out: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2023-08 | In: text Out: text | Open Weights Released: 2023-11-01 |
Claude Opus 4 | anthropic.claude-opus-4-20250514-v1:0 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-05-22 |
Claude Sonnet 4.5 | anthropic.claude-sonnet-4-5-20250929-v1:0 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Llama 3.2 3B Instruct | meta.llama3-2-3b-instruct-v1:0 | 131K | 4.1K | In: $0.15 Out: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-09-25 |
Claude Instant | anthropic.claude-instant-v1 | 100K | 4.1K | In: $0.8 Out: $2.4 | Model: 0.400 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-03-01 |
Nova Premier | amazon.nova-premier-v1:0 | 1M | 16.4K | In: $2.5 Out: $12.5 | Model: 1.250 Completion: 5.000 | 📎 🧠 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
Claude Opus 4.1 | anthropic.claude-opus-4-1-20250805-v1:0 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Llama 4 Scout 17B Instruct | meta.llama4-scout-17b-instruct-v1:0 | 3.5M | 16.4K | In: $0.17 Out: $0.66 | Model: 0.085 Completion: 3.882 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Jamba 1.5 Mini | ai21.jamba-1-5-mini-v1:0 | 256K | 4.1K | In: $0.2 Out: $0.4 | Model: 0.100 Completion: 2.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2024-08-15 |
Llama 3 8B Instruct | meta.llama3-8b-instruct-v1:0 | 8.2K | 2K | In: $0.3 Out: $0.6 | Model: 0.150 Completion: 2.000 | 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-07-23 |
Claude Sonnet 3 | anthropic.claude-3-sonnet-20240229-v1:0 | 200K | 4.1K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🔧 🌡️ | 2023-08 | In: text, image Out: text | Released: 2024-03-04 |
Llama 4 Maverick 17B Instruct | meta.llama4-maverick-17b-instruct-v1:0 | 1M | 16.4K | In: $0.24 Out: $0.97 | Model: 0.120 Completion: 4.042 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Nova Lite | amazon.nova-lite-v1:0 | 300K | 8.2K | In: $0.06 Out: $0.24 Cache Read: $0.015 | Model: 0.030 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
Claude Haiku 3.5 | anthropic.claude-3-5-haiku-20241022-v1:0 | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-10-22 |
Anthropic¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Claude Sonnet 3.5 v2 | claude-3-5-sonnet-20241022 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
Claude Sonnet 3.5 | claude-3-5-sonnet-20240620 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-06-20 |
Claude Opus 3 | claude-3-opus-20240229 | 200K | 4.1K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-02-29 |
Claude Sonnet 4.5 | claude-sonnet-4-5-20250929 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Claude Sonnet 4 | claude-sonnet-4-20250514 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Opus 4 | claude-opus-4-20250514 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Haiku 3.5 | claude-3-5-haiku-20241022 | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
Claude Haiku 3 | claude-3-haiku-20240307 | 200K | 4.1K | In: $0.25 Out: $1.25 Cache Read: $0.03 Cache Write: $0.3 | Model: 0.125 Completion: 5.000 Cache: 0.120 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-13 |
Claude Sonnet 3.7 | claude-3-7-sonnet-20250219 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
Claude Opus 4.1 | claude-opus-4-1-20250805 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Sonnet 3 | claude-3-sonnet-20240229 | 200K | 4.1K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $0.3 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-04 |
Claude Haiku 4.5 | claude-haiku-4-5-20251001 | 200K | 64K | In: $1 Out: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Sonnet 4 | claude-sonnet-4-0 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Sonnet 3.7 | claude-3-7-sonnet-latest | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
Claude Sonnet 4.5 | claude-sonnet-4-5 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Claude Haiku 3.5 | claude-3-5-haiku-latest | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | In: $1 Out: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Opus 4 | claude-opus-4-0 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Azure¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | In: $0.1 Out: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
GPT-4 | gpt-4 | 8.2K | 8.2K | In: $60 Out: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
GPT-4 32K | gpt-4-32k | 32.8K | 32.8K | In: $60 Out: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
GPT-5 Chat | gpt-5-chat | 128K | 16.4K | In: $1.25 Out: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 | 2024-10-24 | In: text, image Out: text | Released: 2025-08-07 |
GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 16.4K | 16.4K | In: $0.5 Out: $1.5 | Model: 0.250 Completion: 3.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2024-01-25 |
GPT-4 Turbo | gpt-4-turbo | 128K | 4.1K | In: $10 Out: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 16.4K | 16.4K | In: $3 Out: $4 | Model: 1.500 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-06-13 |
o1-preview | o1-preview | 128K | 32.8K | In: $16.5 Out: $66 Cache Read: $8.25 | Model: 8.250 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
o3-mini | o3-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
GPT-5 Nano | gpt-5-nano | 272K | 128K | In: $0.05 Out: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-5-Codex | gpt-5-codex | 400K | 128K | - | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-4o | gpt-4o | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 4.1K | 4.1K | In: $1.5 Out: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-03-01 |
GPT-4.1 | gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
o4-mini | o4-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
o1 | o1 | 200K | 100K | In: $15 Out: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
GPT-5 Mini | gpt-5-mini | 272K | 128K | In: $0.25 Out: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
o1-mini | o1-mini | 128K | 65.5K | In: $1.1 Out: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 4.1K | 4.1K | In: $1.5 Out: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-09-21 |
o3 | o3 | 200K | 100K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
Codex Mini | codex-mini | 200K | 100K | In: $1.5 Out: $6 Cache Read: $0.375 | Model: 0.750 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-04 | In: text Out: text | Released: 2025-05-16 |
GPT-4 Turbo Vision | gpt-4-turbo-vision | 128K | 4.1K | In: $10 Out: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
GPT-4o mini | gpt-4o-mini | 128K | 16.4K | In: $0.15 Out: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
GPT-5 | gpt-5 | 272K | 128K | In: $1.25 Out: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 16.4K | 16.4K | In: $1 Out: $2 | Model: 0.500 Completion: 2.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-11-06 |
Baseten¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 Instruct 0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 262.1K | In: $0.6 Out: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2025-08 | In: text Out: text | Open Weights Released: 2025-09-05 |
Qwen3 Coder 480B A35B Instruct | Qwen3/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | In: $0.38 Out: $1.53 | Model: 0.190 Completion: 4.026 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
GLM 4.6 | zai-org/GLM-4.6 | 200K | 200K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🔧 🌡️ | 2025-08 | In: text Out: text | Open Weights Released: 2025-16-09 |
Cerebras¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Qwen 3 235B Instruct | qwen-3-235b-a22b-instruct-2507 | 131K | 32K | In: $0.6 Out: $1.2 | Model: 0.300 Completion: 2.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-22 |
Qwen 3 Coder 480B | qwen-3-coder-480b | 131K | 32K | In: $2 Out: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
GPT OSS 120B | gpt-oss-120b | 131.1K | 32.8K | In: $0.25 Out: $0.69 | Model: 0.125 Completion: 2.760 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Chutes¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi Dev 72B | moonshotai/Kimi-Dev-72B | 131.1K | 131.1K | In: $0.06664 Out: $0.266688 | Model: 0.033 Completion: 4.002 | 🔧 🌡️ | - | In: text Out: text | Released: 2024-12-01 |
Kimi K2 Instruct | moonshotai/Kimi-K2-Instruct-75k | 75K | 75K | In: $0.15 Out: $0.59 | Model: 0.075 Completion: 3.933 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
Kimi K2 Instruct 0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 262.1K | In: $0.296176 Out: $1.18528 | Model: 0.148 Completion: 4.002 | 🔧 🌡️ | - | In: text Out: text | Released: 2024-09-05 |
Kimi VL A3B Thinking | moonshotai/Kimi-VL-A3B-Thinking | 131.1K | 131.1K | In: $0.02499 Out: $0.100008 | Model: 0.012 Completion: 4.002 | 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2024-12-01 |
LongCat Flash Chat FP8 | meituan-longcat/LongCat-Flash-Chat-FP8 | 131.1K | 131.1K | In: $0.25 Out: $1 | Model: 0.125 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-10 |
DeepSeek R1T Chimera | tngtech/DeepSeek-R1T-Chimera | 163.8K | 163.8K | In: $0.18 Out: $0.72 | Model: 0.090 Completion: 4.000 | 🧠 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-26 |
DeepSeek TNG R1T2 Chimera | tngtech/DeepSeek-TNG-R1T2-Chimera | 163.8K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-08 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.1 Out: $0.41 | Model: 0.050 Completion: 4.100 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Devstral Small (2505) | chutesai/Devstral-Small-2505 | 32.8K | 32.8K | In: $0.02 Out: $0.08 | Model: 0.010 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-05-21 |
Mistral Small 3.2 24B Instruct (2506) | chutesai/Mistral-Small-3.2-24B-Instruct-2506 | 131.1K | 131.1K | In: $0.02 Out: $0.08 | Model: 0.010 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-06-20 |
Qwen3 30B A3B | Qwen/Qwen3-30B-A3B | 41K | 41K | In: $0.02 Out: $0.08 | Model: 0.010 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
Qwen3 30B A3B Thinking 2507 | Qwen/Qwen3-30B-A3B-Thinking-2507 | 262.1K | 262.1K | In: $0.08 Out: $0.29 | Model: 0.040 Completion: 3.625 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | In: $0.078 Out: $0.312 | Model: 0.039 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen3 Coder 30B A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 262.1K | 262.1K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Qwen3 Coder 480B A35B Instruct (FP8) | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 262.1K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
Qwen3 30B A3B Instruct 2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | 262.1K | 262.1K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 262.1K | In: $0.078 Out: $0.312 | Model: 0.039 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Qwen3 Next 80B A3B Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | 262.1K | 262.1K | In: $0.1 Out: $0.8 | Model: 0.050 Completion: 8.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
Qwen3 Next 80B A3B Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | 262.1K | 262.1K | In: $0.1 Out: $0.8 | Model: 0.050 Completion: 8.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
GLM 4.5 Turbo | zai-org/GLM-4.5-turbo | 131.1K | 131.1K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.6 FP8 | zai-org/GLM-4.6-FP8 | 204.8K | 131.1K | In: $0.39 Out: $1.55 | Model: 0.195 Completion: 3.974 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
GLM 4.6 Turbo | zai-org/GLM-4.6-turbo | 204.8K | 131.1K | In: $1.15 Out: $3.25 | Model: 0.575 Completion: 2.826 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-10-03 |
GLM 4.5 FP8 | zai-org/GLM-4.5-FP8 | 131.1K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5 Air | zai-org/GLM-4.5-Air | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
DeepSeek R1 0528 Qwen3 8B | deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | 131.1K | 131.1K | In: $0.02 Out: $0.07 | Model: 0.010 Completion: 3.500 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-29 |
DeepSeek R1 (0528) | deepseek-ai/DeepSeek-R1-0528 | 75K | 163.8K | In: $0.18 Out: $0.72 | Model: 0.090 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
DeepSeek V3.2 Exp | deepseek-ai/DeepSeek-V3.2-Exp | 128K | 64K | In: $0.25 Out: $0.35 | Model: 0.125 Completion: 1.400 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-29 |
DeepSeek V3.1 Terminus | deepseek-ai/DeepSeek-V3.1-Terminus | 131.1K | 65.5K | In: $0.25 Out: $1 | Model: 0.125 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
DeepSeek V3.1 Turbo | deepseek-ai/DeepSeek-V3.1-turbo | 128K | 128K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 |
DeepSeek V3.1 Reasoning | deepseek-ai/DeepSeek-V3.1:THINKING | 163.8K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 |
DeepSeek R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131.1K | 131.1K | In: $0.03 Out: $0.14 | Model: 0.015 Completion: 4.667 | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 |
DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1 | 163.8K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 |
DeepSeek V3 (0324) | deepseek-ai/DeepSeek-V3-0324 | 75K | 163.8K | In: $0.18 Out: $0.72 | Model: 0.090 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
Cloudflare Workers AI¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
hf/thebloke/mistral-7b-instruct-v0.1-awq | mistral-7b-instruct-v0.1-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-27 Updated: 2023-11-09 |
cf/deepgram/aura-1 | aura-1 | - | - | In: $0.015 Out: $0.015 | Model: 0.007 Completion: 1.000 | - | - | In: text Out: audio | Open Weights Released: 2025-08-27 Updated: 2025-07-07 |
hf/mistral/mistral-7b-instruct-v0.2 | mistral-7b-instruct-v0.2 | 3.1K | 3.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-11 Updated: 2025-07-24 |
cf/tinyllama/tinyllama-1.1b-chat-v1.0 | tinyllama-1.1b-chat-v1.0 | 2K | 2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-30 Updated: 2024-03-17 |
cf/qwen/qwen1.5-0.5b-chat | qwen1.5-0.5b-chat | 32K | 32K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-31 Updated: 2024-04-30 |
cf/meta/llama-3.2-11b-vision-instruct | llama-3.2-11b-vision-instruct | 128K | 128K | In: $0.049 Out: $0.68 | Model: 0.025 Completion: 13.878 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-12-04 |
hf/thebloke/llama-2-13b-chat-awq | llama-2-13b-chat-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-19 Updated: 2023-11-09 |
cf/meta/llama-3.1-8b-instruct-fp8 | llama-3.1-8b-instruct-fp8 | 32K | 32K | In: $0.15 Out: $0.29 | Model: 0.075 Completion: 1.933 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
cf/openai/whisper | whisper | - | - | In: $0.00045 Out: $0.00045 | Model: 0.000 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2023-11-07 Updated: 2024-08-12 |
cf/stabilityai/stable-diffusion-xl-base-1.0 | stable-diffusion-xl-base-1.0 | - | - | - | - | - | - | In: text Out: image | Open Weights Released: 2023-07-25 Updated: 2023-10-30 |
cf/meta/llama-2-7b-chat-fp16 | llama-2-7b-chat-fp16 | 4.1K | 4.1K | In: $0.56 Out: $6.67 | Model: 0.280 Completion: 11.911 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-07-26 |
cf/microsoft/resnet-50 | resnet-50 | - | - | In: $0.0000025 Out: $- | Model: 0.000 | - | - | In: image Out: text | Open Weights Released: 2022-03-16 Updated: 2024-02-13 |
cf/runwayml/stable-diffusion-v1-5-inpainting | stable-diffusion-v1-5-inpainting | - | - | - | - | - | - | In: text Out: image | Open Weights Released: 2024-02-27 |
cf/defog/sqlcoder-7b-2 | sqlcoder-7b-2 | 10K | 10K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-05 Updated: 2024-02-12 |
cf/meta/llama-3-8b-instruct | llama-3-8b-instruct | 8K | 8K | In: $0.28 Out: $0.83 | Model: 0.140 Completion: 2.964 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-17 Updated: 2025-06-19 |
cf/meta-llama/llama-2-7b-chat-hf-lora | llama-2-7b-chat-hf-lora | 8.2K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-07-13 Updated: 2024-04-17 |
cf/meta/llama-3.1-8b-instruct | llama-3.1-8b-instruct | 8K | 8K | In: $0.28 Out: $0.83 | Model: 0.140 Completion: 2.964 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-18 Updated: 2024-09-25 |
cf/openchat/openchat-3.5-0106 | openchat-3.5-0106 | 8.2K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-07 Updated: 2024-05-18 |
hf/thebloke/openhermes-2.5-mistral-7b-awq | openhermes-2.5-mistral-7b-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-02 Updated: 2023-11-09 |
cf/leonardo/lucid-origin | lucid-origin | - | - | In: $0.007 Out: $0.007 | Model: 0.004 Completion: 1.000 | - | - | In: text Out: image | Released: 2025-08-25 Updated: 2025-08-05 |
cf/facebook/bart-large-cnn | bart-large-cnn | - | - | - | - | - | - | In: text Out: text | Open Weights Released: 2022-03-02 Updated: 2024-02-13 |
cf/black-forest-labs/flux-1-schnell | flux-1-schnell | 2K | - | In: $0.000053 Out: $0.00011 | Model: 0.000 Completion: 2.075 | - | - | In: text Out: image | Open Weights Released: 2024-07-31 Updated: 2024-08-16 |
cf/deepseek-ai/deepseek-r1-distill-qwen-32b | deepseek-r1-distill-qwen-32b | 80K | 80K | In: $0.5 Out: $4.88 | Model: 0.250 Completion: 9.760 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-02-24 |
cf/google/gemma-2b-it-lora | gemma-2b-it-lora | 8.2K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-02 |
cf/fblgit/una-cybertron-7b-v2-bf16 | una-cybertron-7b-v2-bf16 | 15K | 15K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-02 Updated: 2024-03-08 |
cf/meta/m2m100-1.2b | m2m100-1.2b | - | - | In: $0.34 Out: $0.34 | Model: 0.170 Completion: 1.000 | - | - | In: text Out: text | Open Weights Released: 2022-03-02 Updated: 2023-11-16 |
cf/meta/llama-3.2-3b-instruct | llama-3.2-3b-instruct | 128K | 128K | In: $0.051 Out: $0.34 | Model: 0.025 Completion: 6.667 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-10-24 |
cf/qwen/qwen2.5-coder-32b-instruct | qwen2.5-coder-32b-instruct | 32.8K | 32.8K | In: $0.66 Out: $1 | Model: 0.330 Completion: 1.515 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-11-06 Updated: 2025-01-12 |
cf/runwayml/stable-diffusion-v1-5-img2img | stable-diffusion-v1-5-img2img | - | - | - | - | - | - | In: text Out: image | Open Weights Released: 2024-02-27 |
cf/google/gemma-7b-it-lora | gemma-7b-it-lora | 3.5K | 3.5K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-02 |
cf/qwen/qwen1.5-14b-chat-awq | qwen1.5-14b-chat-awq | 7.5K | 7.5K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-03 Updated: 2024-04-30 |
cf/qwen/qwen1.5-1.8b-chat | qwen1.5-1.8b-chat | 32K | 32K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-30 Updated: 2024-04-30 |
cf/mistralai/mistral-small-3.1-24b-instruct | mistral-small-3.1-24b-instruct | 128K | 128K | In: $0.35 Out: $0.56 | Model: 0.175 Completion: 1.600 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-11 Updated: 2025-07-28 |
hf/google/gemma-7b-it | gemma-7b-it | 8.2K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-13 Updated: 2024-08-14 |
hf/thebloke/llamaguard-7b-awq | llamaguard-7b-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-11 |
hf/nousresearch/hermes-2-pro-mistral-7b | hermes-2-pro-mistral-7b | 24K | 24K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-03-11 Updated: 2024-09-08 |
cf/tiiuae/falcon-7b-instruct | falcon-7b-instruct | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-04-25 Updated: 2024-10-12 |
cf/meta/llama-3.3-70b-instruct-fp8-fast | llama-3.3-70b-instruct-fp8-fast | 24K | 24K | In: $0.29 Out: $2.25 | Model: 0.145 Completion: 7.759 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-12-06 |
cf/meta/llama-3-8b-instruct-awq | llama-3-8b-instruct-awq | 8.2K | 8.2K | In: $0.12 Out: $0.27 | Model: 0.060 Completion: 2.250 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-05-09 |
cf/leonardo/phoenix-1.0 | phoenix-1.0 | - | - | In: $0.0058 Out: $0.0058 | Model: 0.003 Completion: 1.000 | - | - | In: text Out: image | Released: 2025-08-25 |
cf/microsoft/phi-2 | phi-2 | 2K | 2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-13 Updated: 2024-04-29 |
cf/lykon/dreamshaper-8-lcm | dreamshaper-8-lcm | - | - | - | - | 📎 | - | In: text Out: image | Open Weights Released: 2023-12-06 Updated: 2023-12-07 |
cf/thebloke/discolm-german-7b-v1-awq | discolm-german-7b-v1-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-18 Updated: 2024-01-24 |
cf/meta/llama-2-7b-chat-int8 | llama-2-7b-chat-int8 | 8.2K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-25 |
cf/meta/llama-3.2-1b-instruct | llama-3.2-1b-instruct | 60K | 60K | In: $0.027 Out: $0.2 | Model: 0.013 Completion: 7.407 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-10-24 |
cf/openai/whisper-large-v3-turbo | whisper-large-v3-turbo | - | - | In: $0.00051 Out: $0.00051 | Model: 0.000 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
cf/meta/llama-4-scout-17b-16e-instruct | llama-4-scout-17b-16e-instruct | 131K | 131K | In: $0.27 Out: $0.85 | Model: 0.135 Completion: 3.148 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-02 Updated: 2025-05-23 |
hf/nexusflow/starling-lm-7b-beta | starling-lm-7b-beta | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-03-19 Updated: 2024-04-03 |
hf/thebloke/deepseek-coder-6.7b-base-awq | deepseek-coder-6.7b-base-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-05 Updated: 2023-11-09 |
cf/google/gemma-3-12b-it | gemma-3-12b-it | 80K | 80K | In: $0.35 Out: $0.56 | Model: 0.175 Completion: 1.600 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-01 Updated: 2025-03-21 |
cf/meta/llama-guard-3-8b | llama-guard-3-8b | - | - | In: $0.48 Out: $0.03 | Model: 0.240 Completion: 0.063 | 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-22 Updated: 2024-10-11 |
hf/thebloke/neural-chat-7b-v3-1-awq | neural-chat-7b-v3-1-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-15 Updated: 2023-11-17 |
cf/openai/whisper-tiny-en | whisper-tiny-en | - | - | - | - | - | - | In: audio Out: text | Open Weights Released: 2022-09-26 Updated: 2024-01-22 |
cf/bytedance/stable-diffusion-xl-lightning | stable-diffusion-xl-lightning | - | - | - | - | - | - | In: text Out: image | Open Weights Released: 2024-02-20 Updated: 2024-04-03 |
cf/mistral/mistral-7b-instruct-v0.1 | mistral-7b-instruct-v0.1 | 2.8K | 2.8K | In: $0.11 Out: $0.19 | Model: 0.055 Completion: 1.727 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-27 Updated: 2025-07-24 |
cf/llava-hf/llava-1.5-7b-hf | llava-1.5-7b-hf | - | - | - | - | 📎 🌡️ | - | In: image, text Out: text | Open Weights Released: 2023-12-05 Updated: 2025-06-06 |
cf/openai/gpt-oss-20b | gpt-oss-20b | 128K | 128K | In: $0.2 Out: $0.3 | Model: 0.100 Completion: 1.500 | - | - | In: text Out: text | Open Weights Released: 2025-08-04 Updated: 2025-08-14 |
cf/deepseek-ai/deepseek-math-7b-instruct | deepseek-math-7b-instruct | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-05 Updated: 2024-02-06 |
cf/openai/gpt-oss-120b | gpt-oss-120b | 128K | 128K | In: $0.35 Out: $0.75 | Model: 0.175 Completion: 2.143 | - | - | In: text Out: text | Open Weights Released: 2025-08-04 Updated: 2025-08-14 |
cf/myshell-ai/melotts | melotts | - | - | In: $0.0002 Out: $- | Model: 0.000 | 📎 | - | In: text Out: audio | Open Weights Released: 2024-07-19 |
cf/qwen/qwen1.5-7b-chat-awq | qwen1.5-7b-chat-awq | 20K | 20K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-03 Updated: 2024-04-30 |
cf/meta/llama-3.1-8b-instruct-fast | llama-3.1-8b-instruct-fast | 128K | 128K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-18 Updated: 2024-09-25 |
cf/deepgram/nova-3 | nova-3 | - | - | In: $0.0052 Out: $0.0052 | Model: 0.003 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2025-06-05 Updated: 2025-07-08 |
cf/meta/llama-3.1-70b-instruct | llama-3.1-70b-instruct | 24K | 24K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-16 Updated: 2024-12-15 |
cf/qwen/qwq-32b | qwq-32b | 24K | 24K | In: $0.66 Out: $1 | Model: 0.330 Completion: 1.515 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-05 Updated: 2025-03-11 |
hf/thebloke/zephyr-7b-beta-awq | zephyr-7b-beta-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-10-27 Updated: 2023-11-09 |
hf/thebloke/deepseek-coder-6.7b-instruct-awq | deepseek-coder-6.7b-instruct-awq | 4.1K | 4.1K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-05 Updated: 2023-11-13 |
cf/meta/llama-3.1-8b-instruct-awq | llama-3.1-8b-instruct-awq | 8.2K | 8.2K | In: $0.12 Out: $0.27 | Model: 0.060 Completion: 2.250 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
cf/mistral/mistral-7b-instruct-v0.2-lora | mistral-7b-instruct-v0.2-lora | 15K | 15K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-01 |
cf/unum/uform-gen2-qwen-500m | uform-gen2-qwen-500m | - | - | - | - | - | - | In: image, text Out: text | Open Weights Released: 2024-02-15 Updated: 2024-04-24 |
Cortecs¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Nova Pro 1.0 | nova-pro-v1 | 300K | 5K | In: $1.016 Out: $4.061 | Model: 0.508 Completion: 3.997 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-12-03 |
Claude 4.5 Sonnet | claude-4-5-sonnet | 200K | 200K | In: $3.259 Out: $16.296 | Model: 1.629 Completion: 5.000 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
DeepSeek V3 0324 | deepseek-v3-0324 | 128K | 128K | In: $0.551 Out: $1.654 | Model: 0.276 Completion: 3.002 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-03-24 |
Kimi K2 Instruct | kimi-k2-instruct | 131K | 131K | In: $0.551 Out: $2.646 | Model: 0.276 Completion: 4.802 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-07-11 Updated: 2025-09-05 |
GPT 4.1 | gpt-4.1 | 1M | 32.8K | In: $2.354 Out: $9.417 | Model: 1.177 Completion: 4.000 | 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-14 |
Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | In: $1.654 Out: $11.024 | Model: 0.827 Completion: 6.665 | 🔧 🌡️ | 2025-01 | In: text, image Out: text | Released: 2025-03-20 Updated: 2025-06-17 |
GPT Oss 120b | gpt-oss-120b | 128K | 128K | - | - | 🔧 🌡️ | 2024-01 | In: text Out: text | Open Weights Released: 2025-08-05 |
Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 262K | 262K | In: $0.441 Out: $1.984 | Model: 0.221 Completion: 4.499 | 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-07-25 |
Claude Sonnet 4 | claude-sonnet-4 | 200K | 64K | In: $3.307 Out: $16.536 | Model: 1.653 Completion: 5.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Released: 2025-05-22 |
Llama 3.1 405B Instruct | llama-3.1-405b-instruct | 128K | 128K | - | - | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Qwen3 32B | qwen3-32b | 16.4K | 16.4K | In: $0.099 Out: $0.33 | Model: 0.050 Completion: 3.333 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-04-29 |
Deep Infra¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 | moonshotai/Kimi-K2-Instruct | 131.1K | 32.8K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | In: $0.4 Out: $1.6 | Model: 0.200 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3 Coder 480B A35B Instruct Turbo | Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | 262.1K | 66.5K | In: $0.3 Out: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
GLM-4.5 | zai-org/GLM-4.5 | 131.1K | 98.3K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
DeepSeek¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
DeepSeek Chat | deepseek-chat | 128K | 8.2K | In: $0.57 Out: $1.68 Cache Read: $0.07 | Model: 0.285 Completion: 2.947 Cache: 0.123 | 📎 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-12-26 Updated: 2025-08-21 |
DeepSeek Reasoner | deepseek-reasoner | 128K | 128K | In: $0.57 Out: $1.68 Cache Read: $0.07 | Model: 0.285 Completion: 2.947 Cache: 0.123 | 📎 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-08-21 |
doubao¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
doubao-seed-1-6-flash | doubao-seed-1-6-flash | 256K | 32K | - | - | 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-07-15 |
doubao-seed-1-6-thinking | doubao-seed-1-6-thinking | 256K | 32K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-07-15 |
doubao-seed-1-6 | doubao-seed-1-6 | 256K | 32K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-06-15 |
ExampleCorp AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Novus 1 | novus-1 | 128K | 4.1K | In: $5 Out: $15 Cache Read: $0.075 Cache Write: $0.5 | Model: 2.500 Completion: 3.000 Cache: 0.015 | 📎 🧠 🔧 🌡️ | 2024-07 | In: text, image, audio, video, pdf Out: text, image, audio, video, pdf | Released: 2025-01-20 Updated: 2025-08-21 |
FastRouter¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 | moonshotai/kimi-k2 | 131.1K | 32.8K | In: $0.55 Out: $2.2 | Model: 0.275 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
Grok 4 | x-ai/grok-4 | 256K | 64K | In: $3 Out: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.0375 | Model: 0.150 Completion: 8.333 Cache: 0.125 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, pdf Out: text | Released: 2025-06-17 |
GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | In: $0.05 Out: $0.4 Cache Read: $0.005 | Model: 0.025 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | In: $0.25 Out: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 65.5K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT-5 | openai/gpt-5 | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
Qwen3 Coder | qwen/qwen3-coder | 262.1K | 66.5K | In: $0.3 Out: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Claude Opus 4.1 | anthropic/claude-opus-4.1 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Sonnet 4 | anthropic/claude-sonnet-4 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
DeepSeek R1 Distill Llama 70B | deepseek-ai/deepseek-r1-distill-llama-70b | 131.1K | 131.1K | In: $0.03 Out: $0.14 | Model: 0.015 Completion: 4.667 | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 |
Fireworks AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Deepseek R1 05/28 | accounts/fireworks/models/deepseek-r1-0528 | 160K | 16.4K | In: $3 Out: $8 | Model: 1.500 Completion: 2.667 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
DeepSeek V3.1 | accounts/fireworks/models/deepseek-v3p1 | 163.8K | 163.8K | In: $0.56 Out: $1.68 | Model: 0.280 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-08-21 |
Deepseek V3 03-24 | accounts/fireworks/models/deepseek-v3-0324 | 160K | 16.4K | In: $0.9 Out: $0.9 | Model: 0.450 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
Kimi K2 Instruct | accounts/fireworks/models/kimi-k2-instruct | 128K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
Qwen3 235B-A22B | accounts/fireworks/models/qwen3-235b-a22b | 128K | 16.4K | In: $0.22 Out: $0.88 | Model: 0.110 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-29 |
GPT OSS 20B | accounts/fireworks/models/gpt-oss-20b | 131.1K | 32.8K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT OSS 120B | accounts/fireworks/models/gpt-oss-120b | 131.1K | 32.8K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GLM 4.5 Air | accounts/fireworks/models/glm-4p5-air | 131.1K | 131.1K | In: $0.22 Out: $0.88 | Model: 0.110 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-08-01 |
Qwen3 Coder 480B A35B Instruct | accounts/fireworks/models/qwen3-coder-480b-a35b-instruct | 256K | 32.8K | In: $0.45 Out: $1.8 | Model: 0.225 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-22 |
GLM 4.5 | accounts/fireworks/models/glm-4p5 | 131.1K | 131.1K | In: $0.55 Out: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
GitHub Copilot¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Gemini 2.0 Flash | gemini-2.0-flash-001 | 1M | 8.2K | - | - | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video Out: text | Released: 2024-12-11 |
Claude Opus 4 | claude-opus-4 | 80K | 16K | - | - | 📎 🧠 | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Grok Code Fast 1 | grok-code-fast-1 | 256K | 10K | - | - | 🧠 🔧 🌡️ | 2025-08 | In: text Out: text | Released: 2025-08-27 |
Claude Haiku 4.5 | claude-haiku-4.5 | 144K | 16K | - | - | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Sonnet 3.5 | claude-3.5-sonnet | 90K | 8.2K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-22 |
o3-mini | o3-mini | 128K | 65.5K | - | - | 🧠 | 2024-10 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
GPT-5-Codex | gpt-5-codex | 128K | 64K | - | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-4o | gpt-4o | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
GPT-4.1 | gpt-4.1 | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
o4-mini (Preview) | o4-mini | 128K | 65.5K | - | - | 🧠 | 2024-10 | In: text Out: text | Released: 2025-04-16 |
Claude Opus 4.1 | claude-opus-41 | 80K | 16K | - | - | 📎 🧠 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
GPT-5-mini | gpt-5-mini | 128K | 64K | - | - | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-08-13 |
Claude Sonnet 3.7 | claude-3.7-sonnet | 200K | 16.4K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
Gemini 2.5 Pro | gemini-2.5-pro | 128K | 64K | - | - | 📎 🔧 🌡️ | 2025-01 | In: text, image, audio, video Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
o3 (Preview) | o3 | 128K | 16.4K | - | - | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
Claude Sonnet 4 | claude-sonnet-4 | 128K | 16K | - | - | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
GPT-5 | gpt-5 | 128K | 64K | - | - | 📎 🧠 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2025-08-07 |
Claude Sonnet 3.7 Thinking | claude-3.7-sonnet-thought | 200K | 16.4K | - | - | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
Claude Sonnet 4.5 | claude-sonnet-4.5 | 128K | 16K | - | - | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-09-29 |
GitHub Models¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
JAIS 30b Chat | core42/jais-30b-chat | 8.2K | 2K | - | - | 🧠 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2023-08-30 |
Grok 3 | xai/grok-3 | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-09 |
Grok 3 Mini | xai/grok-3-mini | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-09 |
Cohere Command R 08-2024 | cohere/cohere-command-r-08-2024 | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-01 |
Cohere Command A | cohere/cohere-command-a | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-11-01 |
Cohere Command R+ 08-2024 | cohere/cohere-command-r-plus-08-2024 | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-01 |
Cohere Command R | cohere/cohere-command-r | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-03-11 Updated: 2024-08-01 |
Cohere Command R+ | cohere/cohere-command-r-plus | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-04-04 Updated: 2024-08-01 |
DeepSeek-R1-0528 | deepseek/deepseek-r1-0528 | 65.5K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-05-28 |
DeepSeek-R1 | deepseek/deepseek-r1 | 65.5K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-01-20 |
DeepSeek-V3-0324 | deepseek/deepseek-v3-0324 | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-03-24 |
Mistral Medium 3 (25.05) | mistral-ai/mistral-medium-2505 | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2024-09 | In: text, image Out: text | Released: 2025-05-01 |
Ministral 3B | mistral-ai/ministral-3b | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Open Weights Released: 2024-10-22 |
Mistral Nemo | mistral-ai/mistral-nemo | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Open Weights Released: 2024-07-18 |
Mistral Large 24.11 | mistral-ai/mistral-large-2411 | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Released: 2024-11-01 |
Codestral 25.01 | mistral-ai/codestral-2501 | 32K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2025-01-01 |
Mistral Small 3.1 | mistral-ai/mistral-small-2503 | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2024-09 | In: text, image Out: text | Released: 2025-03-01 |
Phi-3-medium instruct (128k) | microsoft/phi-3-medium-128k-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-3-mini instruct (4k) | microsoft/phi-3-mini-4k-instruct | 4.1K | 1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-3-small instruct (128k) | microsoft/phi-3-small-128k-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-3.5-vision instruct (128k) | microsoft/phi-3.5-vision-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2024-08-20 |
Phi-4 | microsoft/phi-4 | 16K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
Phi-4-mini-reasoning | microsoft/phi-4-mini-reasoning | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
Phi-3-small instruct (8k) | microsoft/phi-3-small-8k-instruct | 8.2K | 2K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-3.5-mini instruct (128k) | microsoft/phi-3.5-mini-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-08-20 |
Phi-4-multimodal-instruct | microsoft/phi-4-multimodal-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Open Weights Released: 2024-12-11 |
Phi-3-mini instruct (128k) | microsoft/phi-3-mini-128k-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-3.5-MoE instruct (128k) | microsoft/phi-3.5-moe-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-08-20 |
Phi-4-mini-instruct | microsoft/phi-4-mini-instruct | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
Phi-3-medium instruct (4k) | microsoft/phi-3-medium-4k-instruct | 4.1K | 1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
Phi-4-Reasoning | microsoft/phi-4-reasoning | 128K | 4.1K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
MAI-DS-R1 | microsoft/mai-ds-r1 | 65.5K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Released: 2025-01-20 |
GPT-4.1-nano | openai/gpt-4.1-nano | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-4.1-mini | openai/gpt-4.1-mini | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
OpenAI o1-preview | openai/o1-preview | 128K | 32.8K | - | - | 🧠 | 2023-10 | In: text Out: text | Released: 2024-09-12 |
OpenAI o3-mini | openai/o3-mini | 200K | 100K | - | - | 🧠 | 2024-04 | In: text Out: text | Released: 2025-01-31 |
GPT-4o | openai/gpt-4o | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Released: 2024-05-13 |
GPT-4.1 | openai/gpt-4.1 | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
OpenAI o4-mini | openai/o4-mini | 200K | 100K | - | - | 🧠 | 2024-04 | In: text, image Out: text | Released: 2025-01-31 |
OpenAI o1 | openai/o1 | 200K | 100K | - | - | 🧠 | 2023-10 | In: text, image Out: text | Released: 2024-09-12 Updated: 2024-12-17 |
OpenAI o1-mini | openai/o1-mini | 128K | 65.5K | - | - | 🧠 | 2023-10 | In: text Out: text | Released: 2024-09-12 Updated: 2024-12-17 |
OpenAI o3 | openai/o3 | 200K | 100K | - | - | 🧠 | 2024-04 | In: text, image Out: text | Released: 2025-01-31 |
GPT-4o mini | openai/gpt-4o-mini | 128K | 16.4K | - | - | 📎 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Released: 2024-07-18 |
Llama-3.2-11B-Vision-Instruct | meta/llama-3.2-11b-vision-instruct | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text, image, audio Out: text | Open Weights Released: 2024-09-25 |
Meta-Llama-3.1-405B-Instruct | meta/meta-llama-3.1-405b-instruct | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama 4 Maverick 17B 128E Instruct FP8 | meta/llama-4-maverick-17b-128e-instruct-fp8 | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
Meta-Llama-3-70B-Instruct | meta/meta-llama-3-70b-instruct | 8.2K | 2K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-04-18 |
Meta-Llama-3.1-70B-Instruct | meta/meta-llama-3.1-70b-instruct | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama-3.3-70B-Instruct | meta/llama-3.3-70b-instruct | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama-3.2-90B-Vision-Instruct | meta/llama-3.2-90b-vision-instruct | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text, image, audio Out: text | Open Weights Released: 2024-09-25 |
Meta-Llama-3-8B-Instruct | meta/meta-llama-3-8b-instruct | 8.2K | 2K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-04-18 |
Llama 4 Scout 17B 16E Instruct | meta/llama-4-scout-17b-16e-instruct | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
Meta-Llama-3.1-8B-Instruct | meta/meta-llama-3.1-8b-instruct | 128K | 32.8K | - | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
AI21 Jamba 1.5 Large | ai21-labs/ai21-jamba-1.5-large | 256K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-29 |
AI21 Jamba 1.5 Mini | ai21-labs/ai21-jamba-1.5-mini | 256K | 4.1K | - | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-29 |
Google¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Gemini 2.5 Flash Image | gemini-2.5-flash-image | 32.8K | 32.8K | In: $0.3 Out: $30 Cache Read: $0.075 | Model: 0.150 Completion: 100.000 Cache: 0.250 | 📎 🧠 🌡️ | 2025-06 | In: text, image Out: text, image | Released: 2025-08-26 |
Gemini 2.5 Flash Preview 05-20 | gemini-2.5-flash-preview-05-20 | 1M | 65.5K | In: $0.15 Out: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-20 |
Gemini Flash-Lite Latest | gemini-flash-lite-latest | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
Gemini Flash Latest | gemini-flash-latest | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Pro Preview 05-06 | gemini-2.5-pro-preview-05-06 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
Gemini 2.5 Flash Preview TTS | gemini-2.5-flash-preview-tts | 8K | 16K | In: $0.5 Out: $10 | Model: 0.250 Completion: 20.000 | - | 2025-01 | In: text Out: audio | Released: 2025-05-01 |
Gemini 2.0 Flash Lite | gemini-2.0-flash-lite | 1M | 8.2K | In: $0.075 Out: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini Live 2.5 Flash Preview Native Audio | gemini-live-2.5-flash-preview-native-audio | 131.1K | 65.5K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🧠 🔧 | 2025-01 | In: text, audio, video Out: text, audio | Released: 2025-06-17 Updated: 2025-09-18 |
Gemini 2.0 Flash | gemini-2.0-flash | 1M | 8.2K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Pro Preview 06-05 | gemini-2.5-pro-preview-06-05 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
Gemini Live 2.5 Flash | gemini-live-2.5-flash | 128K | 8K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video Out: text, audio | Released: 2025-09-01 |
Gemini 2.5 Flash Lite Preview 06-17 | gemini-2.5-flash-lite-preview-06-17 | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Flash Image (Preview) | gemini-2.5-flash-image-preview | 32.8K | 32.8K | In: $0.3 Out: $30 Cache Read: $0.075 | Model: 0.150 Completion: 100.000 Cache: 0.250 | 📎 🧠 🌡️ | 2025-06 | In: text, image Out: text, image | Released: 2025-08-26 |
Gemini 2.5 Flash Preview 09-25 | gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Flash Preview 04-17 | gemini-2.5-flash-preview-04-17 | 1M | 65.5K | In: $0.15 Out: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-04-17 |
Gemini 2.5 Pro Preview TTS | gemini-2.5-pro-preview-tts | 8K | 16K | In: $1 Out: $20 | Model: 0.500 Completion: 20.000 | - | 2025-01 | In: text Out: audio | Released: 2025-05-01 |
Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
Gemini 1.5 Flash | gemini-1.5-flash | 1M | 8.2K | In: $0.075 Out: $0.3 Cache Read: $0.01875 | Model: 0.037 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-05-14 |
Gemini 1.5 Flash-8B | gemini-1.5-flash-8b | 1M | 8.2K | In: $0.0375 Out: $0.15 Cache Read: $0.01 | Model: 0.019 Completion: 4.000 Cache: 0.267 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-10-03 |
Gemini 2.5 Flash Lite Preview 09-25 | gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 1.5 Pro | gemini-1.5-pro | 1M | 8.2K | In: $1.25 Out: $5 Cache Read: $0.3125 | Model: 0.625 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-02-15 |
gemini-embedding-001 | gemini-embedding-001 | 2K | 3.1K | In: $0.15 Out: $- | Model: 0.075 | 🔧 | 2025-06 | In: text Out: text | Released: 2025-06-01 |
Vertex¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Gemini 2.5 Flash Preview 05-20 | gemini-2.5-flash-preview-05-20 | 1M | 65.5K | In: $0.15 Out: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-20 |
Gemini Flash-Lite Latest | gemini-flash-lite-latest | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini Flash Latest | gemini-flash-latest | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Pro Preview 05-06 | gemini-2.5-pro-preview-05-06 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
Gemini 2.0 Flash Lite | gemini-2.0-flash-lite | 1M | 8.2K | In: $0.075 Out: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini 2.0 Flash | gemini-2.0-flash | 1M | 8.2K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini 2.5 Flash Lite | gemini-2.5-flash-lite | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Pro Preview 06-05 | gemini-2.5-pro-preview-06-05 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
Gemini 2.5 Flash Lite Preview 06-17 | gemini-2.5-flash-lite-preview-06-17 | 65.5K | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Flash Preview 09-25 | gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Flash Preview 04-17 | gemini-2.5-flash-preview-04-17 | 1M | 65.5K | In: $0.15 Out: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-04-17 |
Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
Gemini 2.5 Flash Lite Preview 09-25 | gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Vertex¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Claude Sonnet 3.5 v2 | claude-3-5-sonnet@20241022 | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
Claude Haiku 3.5 | claude-3-5-haiku@20241022 | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
Claude Sonnet 4 | claude-sonnet-4@20250514 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Opus 4.1 | claude-opus-4-1@20250805 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Sonnet 3.7 | claude-3-7-sonnet@20250219 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
Claude Opus 4 | claude-opus-4@20250514 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Groq¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Llama 3.1 8B Instant | llama-3.1-8b-instant | 131.1K | 8.2K | In: $0.05 Out: $0.08 | Model: 0.025 Completion: 1.600 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Mistral Saba 24B | mistral-saba-24b | 32.8K | 32.8K | In: $0.79 Out: $0.79 | Model: 0.395 Completion: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2025-02-06 |
Llama 3 8B | llama3-8b-8192 | 8.2K | 8.2K | In: $0.05 Out: $0.08 | Model: 0.025 Completion: 1.600 | 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-04-18 |
Qwen QwQ 32B | qwen-qwq-32b | 131.1K | 16.4K | In: $0.29 Out: $0.39 | Model: 0.145 Completion: 1.345 | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Open Weights Released: 2024-11-27 |
Llama 3 70B | llama3-70b-8192 | 8.2K | 8.2K | In: $0.59 Out: $0.79 | Model: 0.295 Completion: 1.339 | 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-04-18 |
DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 131.1K | 8.2K | In: $0.75 Out: $0.99 | Model: 0.375 Completion: 1.320 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 |
Llama Guard 3 8B | llama-guard-3-8b | 8.2K | 8.2K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-23 |
Gemma 2 9B | gemma2-9b-it | 8.2K | 8.2K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2024-06-27 |
Llama 3.3 70B Versatile | llama-3.3-70b-versatile | 131.1K | 32.8K | In: $0.59 Out: $0.79 | Model: 0.295 Completion: 1.339 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Kimi K2 Instruct 0905 | moonshotai/kimi-k2-instruct-0905 | 262.1K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 131.1K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | In: $0.1 Out: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.15 Out: $0.75 | Model: 0.075 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Qwen3 32B | qwen/qwen3-32b | 131.1K | 16.4K | In: $0.29 Out: $0.59 | Model: 0.145 Completion: 2.034 | 🧠 🔧 🌡️ | 2024-11-08 | In: text Out: text | Open Weights Released: 2024-12-23 |
Llama 4 Scout 17B | meta-llama/llama-4-scout-17b-16e-instruct | 131.1K | 8.2K | In: $0.11 Out: $0.34 | Model: 0.055 Completion: 3.091 | 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama 4 Maverick 17B | meta-llama/llama-4-maverick-17b-128e-instruct | 131.1K | 8.2K | In: $0.2 Out: $0.6 | Model: 0.100 Completion: 3.000 | 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama Guard 4 12B | meta-llama/llama-guard-4-12b | 131.1K | 128 | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Hugging Face¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi-K2-Instruct | moonshotai/Kimi-K2-Instruct | 131.1K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
Kimi-K2-Instruct-0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-04 |
Qwen3-Coder-480B-A35B-Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | In: $2 Out: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | In: $0.3 Out: $3 | Model: 0.150 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Qwen3-Next-80B-A3B-Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | 262.1K | 66.5K | In: $0.25 Out: $1 | Model: 0.125 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
Qwen3-Next-80B-A3B-Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | 262.1K | 131.1K | In: $0.3 Out: $2 | Model: 0.150 Completion: 6.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
GLM-4.5 | zai-org/GLM-4.5 | 131.1K | 98.3K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.6 | zai-org/GLM-4.6 | 200K | 128K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
GLM-4.5-Air | zai-org/GLM-4.5-Air | 128K | 96K | In: $0.2 Out: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
DeepSeek-V3-0324 | deepseek-ai/Deepseek-V3-0324 | 16.4K | 8.2K | In: $1.25 Out: $1.25 | Model: 0.625 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | 163.8K | 163.8K | In: $3 Out: $5 | Model: 1.500 Completion: 1.667 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
Inception¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Mercury Coder | mercury-coder | 128K | 16.4K | In: $0.25 Out: $1 Cache Read: $0.25 Cache Write: $1 | Model: 0.125 Completion: 4.000 Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-02-26 Updated: 2025-07-31 |
Mercury | mercury | 128K | 16.4K | In: $0.25 Out: $1 Cache Read: $0.25 Cache Write: $1 | Model: 0.125 Completion: 4.000 Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-06-26 Updated: 2025-07-31 |
Inference¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Mistral Nemo 12B Instruct | mistral/mistral-nemo-12b-instruct | 16K | 4.1K | In: $0.038 Out: $0.1 | Model: 0.019 Completion: 2.632 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Google Gemma 3 | google/gemma-3 | 125K | 4.1K | In: $0.15 Out: $0.3 | Model: 0.075 Completion: 2.000 | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
Osmosis Structure 0.6B | osmosis/osmosis-structure-0.6b | 4K | 2K | In: $0.1 Out: $0.5 | Model: 0.050 Completion: 5.000 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Qwen 3 Embedding 4B | qwen/qwen3-embedding-4b | 32K | 2K | In: $0.01 Out: $- | Model: 0.005 | - | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Qwen 2.5 7B Vision Instruct | qwen/qwen-2.5-7b-vision-instruct | 125K | 4.1K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
Llama 3.2 11B Vision Instruct | meta/llama-3.2-11b-vision-instruct | 16K | 4.1K | In: $0.055 Out: $0.055 | Model: 0.028 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
Llama 3.1 8B Instruct | meta/llama-3.1-8b-instruct | 16K | 4.1K | In: $0.025 Out: $0.025 | Model: 0.013 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Llama 3.2 3B Instruct | meta/llama-3.2-3b-instruct | 16K | 4.1K | In: $0.02 Out: $0.02 | Model: 0.010 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Llama 3.2 1B Instruct | meta/llama-3.2-1b-instruct | 16K | 4.1K | In: $0.01 Out: $0.01 | Model: 0.005 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Llama¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Llama-3.3-8B-Instruct | llama-3.3-8b-instruct | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama-4-Maverick-17B-128E-Instruct-FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama-4-Scout-17B-16E-Instruct-FP8 | llama-4-scout-17b-16e-instruct-fp8 | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Groq-Llama-4-Maverick-17B-128E-Instruct | groq-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
Cerebras-Llama-4-Scout-17B-16E-Instruct | cerebras-llama-4-scout-17b-16e-instruct | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
Cerebras-Llama-4-Maverick-17B-128E-Instruct | cerebras-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
LMStudio¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Qwen3 30B A3B 2507 | qwen/qwen3-30b-a3b-2507 | 262.1K | 16.4K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
Qwen3 Coder 30B | qwen/qwen3-coder-30b | 262.1K | 65.5K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
LucidQuery AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
LucidQuery Nexus Coder | lucidquery-nexus-coder | 250K | 60K | In: $2 Out: $5 | Model: 1.000 Completion: 2.500 | 📎 🧠 🔧 | 2025-08-01 | In: text Out: text | Released: 2025-09-01 |
LucidNova RF1 100B | lucidnova-rf1-100b | 120K | 8K | In: $2 Out: $5 | Model: 1.000 Completion: 2.500 | 📎 🧠 🔧 | 2025-09-16 | In: text Out: text | Released: 2024-12-28 Updated: 2025-09-10 |
Mistral¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Devstral Medium | devstral-medium-2507 | 128K | 128K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
Mixtral 8x22B | open-mixtral-8x22b | 64K | 64K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-17 |
Ministral 8B | ministral-8b-latest | 128K | 128K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
Pixtral Large | pixtral-large-latest | 128K | 128K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 📎 🔧 🌡️ | 2024-11 | In: text, image Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
Ministral 3B | ministral-3b-latest | 128K | 128K | In: $0.04 Out: $0.04 | Model: 0.020 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
Pixtral 12B | pixtral-12b | 128K | 128K | In: $0.15 Out: $0.15 | Model: 0.075 Completion: 1.000 | 📎 🔧 🌡️ | 2024-09 | In: text, image Out: text | Open Weights Released: 2024-09-01 |
Mistral Medium 3 | mistral-medium-2505 | 131.1K | 131.1K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-05-07 |
Devstral Small 2505 | devstral-small-2505 | 128K | 128K | In: $0.1 Out: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-07 |
Mistral Medium 3.1 | mistral-medium-2508 | 262.1K | 262.1K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-08-12 |
Mistral Small | mistral-small-latest | 128K | 16.4K | In: $0.1 Out: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Open Weights Released: 2024-09-01 Updated: 2024-09-04 |
Magistral Small | magistral-small | 128K | 128K | In: $0.5 Out: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 |
Devstral Small | devstral-small-2507 | 128K | 128K | In: $0.1 Out: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
Codestral | codestral-latest | 256K | 4.1K | In: $0.3 Out: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-05-29 Updated: 2025-01-04 |
Mixtral 8x7B | open-mixtral-8x7b | 32K | 32K | In: $0.7 Out: $0.7 | Model: 0.350 Completion: 1.000 | 🔧 🌡️ | 2024-01 | In: text Out: text | Open Weights Released: 2023-12-11 |
Mistral Nemo | mistral-nemo | 128K | 128K | In: $0.15 Out: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-07-01 |
Mistral 7B | open-mistral-7b | 8K | 8K | In: $0.25 Out: $0.25 | Model: 0.125 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2023-09-27 |
Mistral Large | mistral-large-latest | 131.1K | 16.4K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-11 | In: text Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
Mistral Medium | mistral-medium-latest | 128K | 16.4K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text, image Out: text | Open Weights Released: 2025-05-07 Updated: 2025-05-10 |
Magistral Medium | magistral-medium-latest | 128K | 16.4K | In: $2 Out: $5 | Model: 1.000 Completion: 2.500 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 Updated: 2025-03-20 |
ModelScope¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GLM-4.5 | ZhipuAI/GLM-4.5 | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.6 | ZhipuAI/GLM-4.6 | 202.8K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-30 |
Qwen3 30B A3B Thinking 2507 | Qwen/Qwen3-30B-A3B-Thinking-2507 | 262.1K | 32.8K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen3 Coder 30B A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 262.1K | 65.5K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-31 |
Qwen3 30B A3B Instruct 2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | 262.1K | 16.4K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Moonshot AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 Turbo | kimi-k2-turbo-preview | 262.1K | 262.1K | In: $2.4 Out: $10 Cache Read: $0.6 | Model: 1.200 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Kimi K2 0711 | kimi-k2-0711-preview | 131.1K | 16.4K | In: $0.6 Out: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
Kimi K2 0905 | kimi-k2-0905-preview | 262.1K | 262.1K | In: $0.6 Out: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Moonshot AI (China)¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 0905 | kimi-k2-0905-preview | 262.1K | 262.1K | In: $0.6 Out: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Kimi K2 0711 | kimi-k2-0711-preview | 131.1K | 16.4K | In: $0.6 Out: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
Kimi K2 Turbo | kimi-k2-turbo-preview | 262.1K | 262.1K | In: $2.4 Out: $10 Cache Read: $0.6 | Model: 1.200 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Morph¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Morph v3 Large | morph-v3-large | 32K | 32K | In: $0.9 Out: $1.9 | Model: 0.450 Completion: 2.111 | - | - | In: text Out: text | Released: 2024-08-15 |
Auto | auto | 32K | 32K | In: $0.85 Out: $1.55 | Model: 0.425 Completion: 1.824 | - | - | In: text Out: text | Released: 2024-06-01 |
Morph v3 Fast | morph-v3-fast | 16K | 16K | In: $0.8 Out: $1.2 | Model: 0.400 Completion: 1.500 | - | - | In: text Out: text | Released: 2024-08-15 |
Nebius AI Studio¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Hermes 4 70B | NousResearch/hermes-4-70b | 131.1K | 8.2K | In: $0.13 Out: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-08-01 Updated: 2025-10-04 |
Hermes-4 405B | NousResearch/hermes-4-405b | 131.1K | 8.2K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-08-01 Updated: 2025-10-04 |
Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 131.1K | 8.2K | In: $0.5 Out: $2.4 | Model: 0.250 Completion: 4.800 | 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2025-01-01 Updated: 2025-10-04 |
Llama 3.1 Nemotron Ultra 253B v1 | nvidia/llama-3_1-nemotron-ultra-253b-v1 | 131.1K | 8.2K | In: $0.6 Out: $1.8 | Model: 0.300 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-07-01 Updated: 2025-10-04 |
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 8.2K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-10-04 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 8.2K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-10-04 |
Qwen3 235B A22B Instruct 2507 | qwen/qwen3-235b-a22b-instruct-2507 | 262.1K | 8.2K | In: $0.2 Out: $0.6 | Model: 0.100 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-25 Updated: 2025-10-04 |
Qwen3 235B A22B Thinking 2507 | qwen/qwen3-235b-a22b-thinking-2507 | 262.1K | 8.2K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-25 Updated: 2025-10-04 |
Qwen3 Coder 480B A35B Instruct | qwen/qwen3-coder-480b-a35b-instruct | 262.1K | 66.5K | In: $0.4 Out: $1.8 | Model: 0.200 Completion: 4.500 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-23 Updated: 2025-10-04 |
Llama 3.1 405B Instruct | meta-llama/llama-3_1-405b-instruct | 131.1K | 8.2K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-07-23 Updated: 2025-10-04 |
Llama-3.3-70B-Instruct (Fast) | meta-llama/llama-3.3-70b-instruct-fast | 131.1K | 8.2K | In: $0.25 Out: $0.75 | Model: 0.125 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-22 Updated: 2025-10-04 |
Llama-3.3-70B-Instruct (Base) | meta-llama/llama-3.3-70b-instruct-base | 131.1K | 8.2K | In: $0.13 Out: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-22 Updated: 2025-10-04 |
GLM 4.5 | zai-org/glm-4.5 | 131.1K | 8.2K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2024-05 | In: text Out: text | Released: 2024-06-01 Updated: 2025-10-04 |
GLM 4.5 Air | zai-org/glm-4.5-air | 131.1K | 8.2K | In: $0.2 Out: $1.2 | Model: 0.100 Completion: 6.000 | 🧠 🔧 🌡️ | 2024-05 | In: text Out: text | Released: 2024-06-01 Updated: 2025-10-04 |
DeepSeek V3 | deepseek-ai/deepseek-v3 | 131.1K | 8.2K | In: $0.5 Out: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-05-07 Updated: 2025-10-04 |
Nvidia¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 0905 | moonshotai/kimi-k2-instruct-0905 | 262.1K | 262.1K | - | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2025-01-01 Updated: 2025-09-05 |
Cosmos Nemotron 34B | nvidia/cosmos-nemotron-34b | 131.1K | 8.2K | - | - | 🧠 🌡️ | 2024-01 | In: text, image, video Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
Parakeet TDT 0.6B v2 | nvidia/parakeet-tdt-0.6b-v2 | - | 4.1K | - | - | - | 2024-01 | In: audio Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
NeMo Retriever OCR v1 | nvidia/nemoretriever-ocr-v1 | - | 4.1K | - | - | - | 2024-01 | In: image Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
Llama-3.1-Nemotron-Ultra-253B-v1 | nvidia/llama-3.1-nemotron-ultra-253b-v1 | 131.1K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-07-01 Updated: 2025-09-05 |
Gemma-3-27B-IT | google/gemma-3-27b-it | 131.1K | 8.2K | - | - | 📎 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
Phi-4-Mini | microsoft/phi-4-mini-instruct | 131.1K | 8.2K | - | - | 📎 🧠 🔧 🌡️ | 2024-12 | In: text, image, audio Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
Whisper Large v3 | openai/whisper-large-v3 | - | 4.1K | - | - | - | 2023-09 | In: audio Out: text | Open Weights Released: 2023-09-01 Updated: 2025-09-05 |
GPT-OSS-120B | openai/gpt-oss-120b | 128K | 8.2K | - | - | 📎 🧠 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
Qwen3-235B-A22B | qwen/qwen3-235b-a22b | 131.1K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
Qwen3 Coder 480B A35B Instruct | qwen/qwen3-coder-480b-a35b-instruct | 262.1K | 66.5K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-23 |
DeepSeek V3.1 Terminus | deepseek-ai/deepseek-v3.1-terminus | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Released: 2025-09-22 |
DeepSeek V3.1 | deepseek-ai/deepseek-v3.1 | 128K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-08-20 Updated: 2025-08-26 |
FLUX.1-dev | black-forest-labs/flux.1-dev | 4.1K | - | - | - | 🌡️ | 2024-08 | In: text Out: image | Released: 2024-08-01 Updated: 2025-09-05 |
OpenAI¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | In: $0.1 Out: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-4 | gpt-4 | 8.2K | 8.2K | In: $30 Out: $60 | Model: 15.000 Completion: 2.000 | 📎 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
o1-pro | o1-pro | 200K | 100K | In: $150 Out: $600 | Model: 75.000 Completion: 4.000 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2025-03-19 |
GPT-4o (2024-05-13) | gpt-4o-2024-05-13 | 128K | 4.1K | In: $5 Out: $15 | Model: 2.500 Completion: 3.000 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
GPT-4o (2024-08-06) | gpt-4o-2024-08-06 | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-08-06 |
GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
o3-deep-research | o3-deep-research | 200K | 100K | In: $10 Out: $40 Cache Read: $2.5 | Model: 5.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2024-06-26 |
GPT-3.5-turbo | gpt-3.5-turbo | 16.4K | 4.1K | In: $0.5 Out: $1.5 Cache Read: $1.25 | Model: 0.250 Completion: 3.000 Cache: 2.500 | 🌡️ | 2021-09-01 | In: text Out: text | Released: 2023-03-01 Updated: 2023-11-06 |
GPT-4 Turbo | gpt-4-turbo | 128K | 4.1K | In: $10 Out: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
o1-preview | o1-preview | 128K | 32.8K | In: $15 Out: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 🧠 🌡️ | 2023-09 | In: text Out: text | Released: 2024-09-12 |
o3-mini | o3-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
Codex Mini | codex-mini-latest | 200K | 100K | In: $1.5 Out: $6 Cache Read: $0.375 | Model: 0.750 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-04 | In: text Out: text | Released: 2025-05-16 |
GPT-5 Nano | gpt-5-nano | 400K | 128K | In: $0.05 Out: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-5-Codex | gpt-5-codex | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-4o | gpt-4o | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
GPT-4.1 | gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
o4-mini | o4-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
o1 | o1 | 200K | 100K | In: $15 Out: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
GPT-5 Mini | gpt-5-mini | 400K | 128K | In: $0.25 Out: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
o1-mini | o1-mini | 128K | 65.5K | In: $1.1 Out: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
o3-pro | o3-pro | 200K | 100K | In: $20 Out: $80 | Model: 10.000 Completion: 4.000 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-06-10 |
GPT-4o (2024-11-20) | gpt-4o-2024-11-20 | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-11-20 |
o3 | o3 | 200K | 100K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
o4-mini-deep-research | o4-mini-deep-research | 200K | 100K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2024-06-26 |
GPT-5 Chat (latest) | gpt-5-chat-latest | 400K | 128K | In: $1.25 Out: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-4o mini | gpt-4o-mini | 128K | 16.4K | In: $0.15 Out: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
GPT-5 | gpt-5 | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
DALL-E 2 | dall-e-2 | 1K | 1 | In: $0.02 Out: $0.1 Cache Read: $0.01 Cache Write: $0.05 | Model: 0.010 Completion: 5.000 Cache: 0.500 | 📎 🔧 | 2021-04 | In: text Out: image | Released: 2022-04-06 Updated: 2022-06-15 |
DALL-E 3 | dall-e-3 | 2K | 1 | In: $0.03 Out: $0.15 Cache Read: $0.01 Cache Write: $0.05 | Model: 0.015 Completion: 5.000 Cache: 0.333 | 📎 🔧 | 2024-04 | In: text Out: image | Released: 2024-03-01 Updated: 2024-08-15 |
GPT-IMAGE-1 | gpt-image-1 | 1K | 512 | In: $10 Out: $20 Cache Read: $0.1 Cache Write: $0.6 | Model: 5.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: image | Open Weights Released: 2024-01-15 Updated: 2024-10-01 |
TEXT-EMBEDDING-3-LARGE | text-embedding-3-large | 64K | 2K | In: $7 Out: $10 Cache Read: $0.05 Cache Write: $0.4 | Model: 3.500 Completion: 1.429 Cache: 0.007 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-12-15 Updated: 2023-10-01 |
TEXT-EMBEDDING-3-SMALL | text-embedding-3-small | 32K | 1K | In: $4 Out: $8 Cache Read: $0.04 Cache Write: $0.3 | Model: 2.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-11-10 Updated: 2023-10-01 |
TEXT-EMBEDDING-ADA-002 | text-embedding-ada-002 | 60K | 1.5K | In: $6 Out: $12 Cache Read: $0.06 Cache Write: $0.45 | Model: 3.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-11-20 Updated: 2023-10-01 |
OpenCode Zen¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Qwen3 Coder | qwen3-coder | 262.1K | 65.5K | In: $0.45 Out: $1.8 | Model: 0.225 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Kimi K2 | kimi-k2 | 262.1K | 262.1K | In: $0.6 Out: $2.5 Cache Read: $0.36 | Model: 0.300 Completion: 4.167 Cache: 0.600 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | In: $1 Out: $1.25 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 1.250 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Sonnet 4.5 | claude-sonnet-4-5 | 1M | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
GPT-5-Codex | gpt-5-codex | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
Code GBT (alpha) | an-gbt | 200K | 128K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-01 |
Big Pickle | big-pickle | 200K | 128K | - | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Released: 2025-10-17 |
Claude Haiku 3.5 | claude-3-5-haiku | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
GLM-4.6 (beta) | glm-4.6 | 204.8K | 131.1K | In: $0.6 Out: $1.9 | Model: 0.300 Completion: 3.167 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Grok Code Fast 1 | grok-code | 256K | 256K | - | - | 📎 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-20 |
Code Supernova 1M | code-supernova | 1M | 1M | - | - | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-09-19 |
Claude Sonnet 4 | claude-sonnet-4 | 1M | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
GPT-5 | gpt-5 | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
OpenRouter¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 | moonshotai/kimi-k2 | 131.1K | 32.8K | In: $0.55 Out: $2.2 | Model: 0.275 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
Kimi K2 Instruct 0905 | moonshotai/kimi-k2-0905 | 262.1K | 16.4K | In: $0.6 Out: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Kimi Dev 72b (free) | moonshotai/kimi-dev-72b:free | 131.1K | 131.1K | - | - | 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-06-16 |
Kimi K2 (free) | moonshotai/kimi-k2:free | 32.8K | 32.8K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-11 |
GLM Z1 32B (free) | thudm/glm-z1-32b:free | 32.8K | 32.8K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-17 |
Hermes 4 70B | nousresearch/hermes-4-70b | 131.1K | 131.1K | In: $0.13 Out: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-08-25 |
Hermes 4 405B | nousresearch/hermes-4-405b | 131.1K | 131.1K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-08-25 |
DeepHermes 3 Llama 3 8B Preview | nousresearch/deephermes-3-llama-3-8b-preview | 131.1K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-02-28 |
Grok 4 | x-ai/grok-4 | 256K | 64K | In: $3 Out: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
Grok Code Fast 1 | x-ai/grok-code-fast-1 | 256K | 10K | In: $0.2 Out: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2025-08 | In: text Out: text | Released: 2025-08-26 |
Grok 4 Fast (free) | x-ai/grok-4-fast:free | 2M | 2M | - | - | 🧠 🔧 🌡️ | 2024-11 | In: text, image Out: text | Released: 2025-08-19 |
Grok 3 | x-ai/grok-3 | 131.1K | 8.2K | In: $3 Out: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 4 Fast | x-ai/grok-4-fast | 2M | 30K | In: $0.2 Out: $0.5 Cache Read: $0.05 Cache Write: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text, image Out: text | Released: 2025-08-19 |
Grok 3 Beta | x-ai/grok-3-beta | 131.1K | 8.2K | In: $3 Out: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 3 Mini Beta | x-ai/grok-3-mini-beta | 131.1K | 8.2K | In: $0.3 Out: $0.5 Cache Read: $0.075 Cache Write: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 3 Mini | x-ai/grok-3-mini | 131.1K | 8.2K | In: $0.3 Out: $0.5 Cache Read: $0.075 Cache Write: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Dolphin3.0 Mistral 24B | cognitivecomputations/dolphin3.0-mistral-24b | 32.8K | 8.2K | - | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-02-13 |
Dolphin3.0 R1 Mistral 24B | cognitivecomputations/dolphin3.0-r1-mistral-24b | 32.8K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-02-13 |
DeepSeek-V3.1 | deepseek/deepseek-chat-v3.1 | 163.8K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-08-21 |
R1 (free) | deepseek/deepseek-r1:free | 163.8K | 163.8K | - | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-20 |
DeepSeek V3 Base (free) | deepseek/deepseek-v3-base:free | 163.8K | 163.8K | - | - | 🌡️ | 2025-03 | In: text Out: text | Open Weights Released: 2025-03-29 |
DeepSeek V3.1 Terminus | deepseek/deepseek-v3.1-terminus | 131.1K | 65.5K | In: $0.27 Out: $1 | Model: 0.135 Completion: 3.704 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
Deepseek R1 0528 Qwen3 8B (free) | deepseek/deepseek-r1-0528-qwen3-8b:free | 131.1K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-29 |
DeepSeek V3 0324 | deepseek/deepseek-chat-v3-0324 | 16.4K | 8.2K | - | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
R1 0528 (free) | deepseek/deepseek-r1-0528:free | 163.8K | 163.8K | - | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
DeepSeek R1 Distill Llama 70B | deepseek/deepseek-r1-distill-llama-70b | 8.2K | 8.2K | - | - | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 |
DeepSeek R1 Distill Qwen 14B | deepseek/deepseek-r1-distill-qwen-14b | 64K | 8.2K | - | - | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-29 |
Qwerky 72B | featherless/qwerky-72b | 32.8K | 8.2K | - | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-20 |
DeepSeek R1T2 Chimera (free) | tngtech/deepseek-r1t2-chimera:free | 163.8K | 163.8K | - | - | 🧠 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-08 |
Gemini 2.0 Flash | google/gemini-2.0-flash-001 | 1M | 8.2K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemma 2 9B (free) | google/gemma-2-9b-it:free | 8.2K | 8.2K | - | - | 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2024-06-28 |
Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.0375 | Model: 0.150 Completion: 8.333 Cache: 0.125 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-07-17 |
Gemini 2.5 Pro Preview 05-06 | google/gemini-2.5-pro-preview-05-06 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
Gemma 3n E4B IT | google/gemma-3n-e4b-it | 8.2K | 8.2K | - | - | 📎 🌡️ | 2024-10 | In: text, image, audio Out: text | Open Weights Released: 2025-05-20 |
Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Pro Preview 06-05 | google/gemini-2.5-pro-preview-06-05 | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
Gemini 2.5 Flash Preview 09-25 | google/gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.031 | Model: 0.150 Completion: 8.333 Cache: 0.103 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
Gemma 3 12B IT | google/gemma-3-12b-it | 96K | 8.2K | - | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-13 |
Gemma 3n 4B (free) | google/gemma-3n-e4b-it:free | 8.2K | 8.2K | - | - | 📎 🔧 🌡️ | 2025-05 | In: text, image, audio Out: text | Open Weights Released: 2025-05-20 |
Gemini 2.5 Flash Lite Preview 09-25 | google/gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.0 Flash Experimental (free) | google/gemini-2.0-flash-exp:free | 1M | 1M | - | - | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2024-12-11 |
Gemma 3 27B IT | google/gemma-3-27b-it | 96K | 8.2K | - | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-12 |
MAI DS R1 (free) | microsoft/mai-ds-r1:free | 163.8K | 163.8K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-21 |
GPT-4.1 Mini | openai/gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-5 Chat (latest) | openai/gpt-5-chat | 400K | 128K | In: $1.25 Out: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | In: $0.05 Out: $0.4 | Model: 0.025 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
GPT-5 Codex | openai/gpt-5-codex | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-09-15 |
GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
o4 Mini | openai/o4-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-16 |
GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | In: $0.25 Out: $2 | Model: 0.125 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
GPT-5 Image | openai/gpt-5-image | 400K | 128K | In: $5 Out: $10 Cache Read: $1.25 | Model: 2.500 Completion: 2.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image, pdf Out: text, image | Released: 2025-10-14 |
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | In: $0.05 Out: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.072 Out: $0.28 | Model: 0.036 Completion: 3.889 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT-4o-mini | openai/gpt-4o-mini | 128K | 16.4K | In: $0.15 Out: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2024-07-18 |
GPT-5 | openai/gpt-5 | 400K | 128K | In: $1.25 Out: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
Horizon Alpha | openrouter/horizon-alpha | 256K | 128K | - | - | 📎 🔧 | 2025-07 | In: text, image Out: text | Released: 2025-07-30 |
Sonoma Sky Alpha | openrouter/sonoma-sky-alpha | 2M | 2M | - | - | 📎 🔧 | - | In: text, image Out: text | Released: 2024-09-05 |
Cypher Alpha (free) | openrouter/cypher-alpha:free | 1M | 1M | - | - | 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-01 |
Sonoma Dusk Alpha | openrouter/sonoma-dusk-alpha | 2M | 2M | - | - | 📎 🔧 | - | In: text, image Out: text | Released: 2024-09-05 |
Horizon Beta | openrouter/horizon-beta | 256K | 128K | - | - | 📎 🔧 | 2025-07 | In: text, image Out: text | Released: 2025-08-01 |
GLM 4.5 | z-ai/glm-4.5 | 128K | 96K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5 Air | z-ai/glm-4.5-air | 128K | 96K | In: $0.2 Out: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5V | z-ai/glm-4.5v | 64K | 16.4K | In: $0.6 Out: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
GLM 4.6 | z-ai/glm-4.6 | 200K | 128K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-30 |
GLM 4.5 Air (free) | z-ai/glm-4.5-air:free | 128K | 96K | - | - | 🧠 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
Qwen3 Coder | qwen/qwen3-coder | 262.1K | 66.5K | In: $0.3 Out: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3 32B (free) | qwen/qwen3-32b:free | 41K | 41K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
Qwen3 Next 80B A3B Instruct | qwen/qwen3-next-80b-a3b-instruct | 262.1K | 262.1K | In: $0.14 Out: $1.4 | Model: 0.070 Completion: 10.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
Qwen2.5 Coder 32B Instruct | qwen/qwen-2.5-coder-32b-instruct | 32.8K | 8.2K | - | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-11-11 |
Qwen3 235B A22B (free) | qwen/qwen3-235b-a22b:free | 131.1K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
QwQ 32B (free) | qwen/qwq-32b:free | 32.8K | 32.8K | - | - | 🧠 🔧 🌡️ | 2025-03 | In: text Out: text | Open Weights Released: 2025-03-05 |
Qwen3 30B A3B Thinking 2507 | qwen/qwen3-30b-a3b-thinking-2507 | 262K | 262K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
Qwen3 30B A3B (free) | qwen/qwen3-30b-a3b:free | 41K | 41K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
Qwen2.5 VL 72B Instruct | qwen/qwen2.5-vl-72b-instruct | 32.8K | 8.2K | - | - | 📎 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-02-01 |
Qwen3 14B (free) | qwen/qwen3-14b:free | 41K | 41K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
Qwen3 30B A3B Instruct 2507 | qwen/qwen3-30b-a3b-instruct-2507 | 262K | 262K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
Qwen3 235B A22B Thinking 2507 | qwen/qwen3-235b-a22b-thinking-2507 | 262.1K | 81.9K | In: $0.078 Out: $0.312 | Model: 0.039 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Qwen2.5 VL 32B Instruct (free) | qwen/qwen2.5-vl-32b-instruct:free | 8.2K | 8.2K | - | - | 📎 🔧 🌡️ | 2025-03 | In: text, image, video Out: text | Open Weights Released: 2025-03-24 |
Qwen2.5 VL 72B Instruct (free) | qwen/qwen2.5-vl-72b-instruct:free | 32.8K | 32.8K | - | - | 📎 🔧 🌡️ | 2025-02 | In: text, image Out: text | Open Weights Released: 2025-02-01 |
Qwen3 235B A22B Instruct 2507 (free) | qwen/qwen3-235b-a22b-07-25:free | 262.1K | 131.1K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen3 Coder 480B A35B Instruct (free) | qwen/qwen3-coder:free | 262.1K | 66.5K | - | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3 235B A22B Instruct 2507 | qwen/qwen3-235b-a22b-07-25 | 262.1K | 131.1K | In: $0.15 Out: $0.85 | Model: 0.075 Completion: 5.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen3 8B (free) | qwen/qwen3-8b:free | 41K | 41K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
Qwen3 Max | qwen/qwen3-max | 262.1K | 32.8K | In: $1.2 Out: $6 | Model: 0.600 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-09-05 |
Qwen3 Next 80B A3B Thinking | qwen/qwen3-next-80b-a3b-thinking | 262.1K | 262.1K | In: $0.14 Out: $1.4 | Model: 0.070 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
Devstral Medium | mistralai/devstral-medium-2507 | 131.1K | 131.1K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
Codestral 2508 | mistralai/codestral-2508 | 256K | 256K | In: $0.3 Out: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-08-01 |
Mistral 7B Instruct (free) | mistralai/mistral-7b-instruct:free | 32.8K | 32.8K | - | - | 🔧 🌡️ | 2024-05 | In: text Out: text | Open Weights Released: 2024-05-27 |
Devstral Small | mistralai/devstral-small-2505 | 128K | 128K | In: $0.06 Out: $0.12 | Model: 0.030 Completion: 2.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-07 |
Mistral Small 3.2 24B Instruct | mistralai/mistral-small-3.2-24b-instruct | 96K | 8.2K | - | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-06-20 |
Devstral Small 2505 (free) | mistralai/devstral-small-2505:free | 32.8K | 32.8K | - | - | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-21 |
Mistral Small 3.2 24B (free) | mistralai/mistral-small-3.2-24b-instruct:free | 96K | 96K | - | - | 📎 🔧 🌡️ | 2025-06 | In: text, image Out: text | Open Weights Released: 2025-06-20 |
Mistral Medium 3 | mistralai/mistral-medium-3 | 131.1K | 131.1K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-05-07 |
Mistral Small 3.1 24B Instruct | mistralai/mistral-small-3.1-24b-instruct | 128K | 8.2K | - | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-17 |
Devstral Small 1.1 | mistralai/devstral-small-2507 | 131.1K | 131.1K | In: $0.1 Out: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
Mistral Medium 3.1 | mistralai/mistral-medium-3.1 | 262.1K | 262.1K | In: $0.4 Out: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-08-12 |
Mistral Nemo (free) | mistralai/mistral-nemo:free | 131.1K | 131.1K | - | - | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-07-19 |
Reka Flash 3 | rekaai/reka-flash-3 | 32.8K | 8.2K | - | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-12 |
Llama 3.2 11B Vision Instruct | meta-llama/llama-3.2-11b-vision-instruct | 131.1K | 8.2K | - | - | 📎 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
Llama 3.3 70B Instruct (free) | meta-llama/llama-3.3-70b-instruct:free | 65.5K | 65.5K | - | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama 4 Scout (free) | meta-llama/llama-4-scout:free | 64K | 64K | - | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Claude Opus 4 | anthropic/claude-opus-4 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 200K | 64K | In: $1 Out: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Opus 4.1 | anthropic/claude-opus-4.1 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Claude Sonnet 3.7 | anthropic/claude-3.7-sonnet | 200K | 128K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text, image Out: text | Released: 2025-02-19 |
Claude Haiku 3.5 | anthropic/claude-3.5-haiku | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
Claude Sonnet 4 | anthropic/claude-sonnet-4 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Sonnet 4.5 | anthropic/claude-sonnet-4.5 | 1M | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Sarvam-M (free) | sarvamai/sarvam-m:free | 32.8K | 32.8K | - | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-25 |
Perplexity¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Sonar Reasoning | sonar-reasoning | 128K | 4.1K | In: $1 Out: $5 | Model: 0.500 Completion: 5.000 | 🧠 🌡️ | 2025-09-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
Sonar | sonar | 128K | 4.1K | In: $1 Out: $1 | Model: 0.500 Completion: 1.000 | 🌡️ | 2025-09-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
Sonar Pro | sonar-pro | 200K | 8.2K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🌡️ | 2025-09-01 | In: text, image Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
Sonar Reasoning Pro | sonar-reasoning-pro | 128K | 4.1K | In: $2 Out: $8 | Model: 1.000 Completion: 4.000 | 📎 🧠 🌡️ | 2025-09-01 | In: text, image Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
Requesty¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 Cache Write: $0.55 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 Cache Write: $2.375 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
GPT-4.1 Mini | openai/gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-5 Nano | openai/gpt-5-nano | 16K | 4K | In: $0.05 Out: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text Out: text | Released: 2025-08-07 |
GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
o4 Mini | openai/o4-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-16 |
GPT-5 Mini | openai/gpt-5-mini | 128K | 32K | In: $0.25 Out: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-4o Mini | openai/gpt-4o-mini | 128K | 16.4K | In: $0.15 Out: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2024-07-18 |
GPT-5 | openai/gpt-5 | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, audio, image, video Out: text, audio, image | Released: 2025-08-07 |
Claude Opus 4 | anthropic/claude-opus-4 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Sonnet 3.7 | anthropic/claude-3-7-sonnet | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text, image Out: text | Released: 2025-02-19 |
Claude Sonnet 4 | anthropic/claude-4-sonnet-20250522 | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Opus 4.1 | anthropic/claude-opus-4-1-20250805 | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
Scaleway¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Qwen3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | 40K | 4.1K | In: $0.75 Out: $2.25 | Model: 0.375 Completion: 3.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-01 |
Pixtral 12B 2409 | pixtral-12b-2409 | 128K | 4.1K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2024-09-25 |
Llama 3.1 8B Instruct | llama-3.1-8b-instruct | 128K | 16.4K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-23 |
Mistral Nemo Instruct 2407 | mistral-nemo-instruct-2407 | 128K | 8.2K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
Mistral Small 3.2 24B Instruct 2506 | mistral-small-3.2-24b-instruct-2506 | 128K | 8.2K | In: $0.15 Out: $0.35 | Model: 0.075 Completion: 2.333 | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-06-01 |
Qwen3 Coder 30B A3B Instruct | qwen3-coder-30b-a3b-instruct | 128K | 8.2K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-01 |
Llama 3.3 70B Instruct | llama-3.3-70b-instruct | 100K | 4.1K | In: $0.9 Out: $0.9 | Model: 0.450 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-01-15 |
DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 32K | 4.1K | In: $0.9 Out: $0.9 | Model: 0.450 Completion: 1.000 | 📎 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-01-20 |
Voxtral Small 24B 2507 | voxtral-small-24b-2507 | 32K | 8.2K | In: $0.15 Out: $0.35 | Model: 0.075 Completion: 2.333 | 📎 🔧 🌡️ | - | In: text, audio Out: text | Open Weights Released: 2025-07-01 |
GPT-OSS 120B | gpt-oss-120b | 128K | 8.2K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-01 |
Gemma 3 27B IT | gemma-3-27b-it | 40K | 8.2K | In: $0.25 Out: $0.5 | Model: 0.125 Completion: 2.000 | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-03-01 |
submodel¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.1 Out: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | In: $0.2 Out: $0.3 | Model: 0.100 Completion: 1.500 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 262.1K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
Qwen3 235B A22B Thinking 2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | In: $0.2 Out: $0.6 | Model: 0.100 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
GLM 4.5 FP8 | zai-org/GLM-4.5-FP8 | 131.1K | 131.1K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5 Air | zai-org/GLM-4.5-Air | 131.1K | 131.1K | In: $0.1 Out: $0.5 | Model: 0.050 Completion: 5.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-28 |
DeepSeek R1 0528 | deepseek-ai/DeepSeek-R1-0528 | 75K | 163.8K | In: $0.5 Out: $2.15 | Model: 0.250 Completion: 4.300 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1 | 75K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
DeepSeek V3 0324 | deepseek-ai/DeepSeek-V3-0324 | 75K | 163.8K | In: $0.2 Out: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
Synthetic¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Qwen 3 235B Instruct | hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | 256K | 32K | In: $0.2 Out: $0.6 | Model: 0.100 Completion: 3.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen2.5-Coder-32B-Instruct | hf:Qwen/Qwen2.5-Coder-32B-Instruct | 32.8K | 32.8K | In: $0.8 Out: $0.8 | Model: 0.400 Completion: 1.000 | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-11-11 |
Qwen 3 Coder 480B | hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | 256K | 32K | In: $2 Out: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3 235B A22B Thinking 2507 | hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | 256K | 32K | In: $0.65 Out: $3 | Model: 0.325 Completion: 4.615 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Llama-3.1-70B-Instruct | hf:meta-llama/Llama-3.1-70B-Instruct | 128K | 32.8K | In: $0.9 Out: $0.9 | Model: 0.450 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama-3.1-8B-Instruct | hf:meta-llama/Llama-3.1-8B-Instruct | 128K | 32.8K | In: $0.2 Out: $0.2 | Model: 0.100 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama-3.3-70B-Instruct | hf:meta-llama/Llama-3.3-70B-Instruct | 128K | 32.8K | In: $0.9 Out: $0.9 | Model: 0.450 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama-4-Scout-17B-16E-Instruct | hf:meta-llama/Llama-4-Scout-17B-16E-Instruct | 328K | 4.1K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama-4-Maverick-17B-128E-Instruct-FP8 | hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 524K | 4.1K | In: $0.22 Out: $0.88 | Model: 0.110 Completion: 4.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama-3.1-405B-Instruct | hf:meta-llama/Llama-3.1-405B-Instruct | 128K | 32.8K | In: $3 Out: $3 | Model: 1.500 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Kimi K2 | hf:moonshotai/Kimi-K2-Instruct | 128K | 32.8K | In: $0.6 Out: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
Kimi K2 0905 | hf:moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 32.8K | In: $1.2 Out: $1.2 | Model: 0.600 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
GLM 4.5 | hf:zai-org/GLM-4.5 | 128K | 96K | In: $0.55 Out: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.6 | hf:zai-org/GLM-4.6 | 200K | 96K | In: $0.55 Out: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
DeepSeek R1 | hf:deepseek-ai/DeepSeek-R1 | 128K | 128K | In: $0.55 Out: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-20 |
DeepSeek R1 (0528) | hf:deepseek-ai/DeepSeek-R1-0528 | 128K | 128K | In: $3 Out: $8 | Model: 1.500 Completion: 2.667 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
DeepSeek V3.1 Terminus | hf:deepseek-ai/DeepSeek-V3.1-Terminus | 128K | 128K | In: $1.2 Out: $1.2 | Model: 0.600 Completion: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-09-22 Updated: 2025-09-25 |
DeepSeek V3 | hf:deepseek-ai/DeepSeek-V3 | 128K | 128K | In: $1.25 Out: $1.25 | Model: 0.625 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-05-29 |
DeepSeek V3.1 | hf:deepseek-ai/DeepSeek-V3.1 | 128K | 128K | In: $0.56 Out: $1.68 | Model: 0.280 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 |
DeepSeek V3 (0324) | hf:deepseek-ai/DeepSeek-V3-0324 | 128K | 128K | In: $1.2 Out: $1.2 | Model: 0.600 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
GPT OSS 120B | hf:openai/gpt-oss-120b | 128K | 32.8K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Together AI¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 Instruct | moonshotai/Kimi-K2-Instruct | 131.1K | 32.8K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 131.1K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Llama 3.3 70B | meta-llama/Llama-3.3-70B-Instruct-Turbo | 131.1K | 66.5K | In: $0.88 Out: $0.88 | Model: 0.440 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 66.5K | In: $2 Out: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
DeepSeek R1 | deepseek-ai/DeepSeek-R1 | 163.8K | 12.3K | In: $3 Out: $7 | Model: 1.500 Completion: 2.333 | 🧠 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-12-26 Updated: 2025-03-24 |
DeepSeek V3 | deepseek-ai/DeepSeek-V3 | 131.1K | 12.3K | In: $1.25 Out: $1.25 | Model: 0.625 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-05-29 |
Upstage¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
solar-mini | solar-mini | 32.8K | 4.1K | In: $0.15 Out: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2024-09 | In: text Out: text | Released: 2024-06-12 Updated: 2025-04-22 |
solar-pro2 | solar-pro2 | 65.5K | 8.2K | In: $0.25 Out: $0.25 | Model: 0.125 Completion: 1.000 | 🧠 🔧 🌡️ | 2025-03 | In: text Out: text | Released: 2025-05-20 |
v0¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
v0-1.5-lg | v0-1.5-lg | 512K | 32K | In: $15 Out: $75 | Model: 7.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
v0-1.5-md | v0-1.5-md | 128K | 32K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
v0-1.0-md | v0-1.0-md | 128K | 32K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-05-22 |
Venice AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Dolphin 72B | dolphin-2.9.2-qwen2-72b | 32.8K | 8.2K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🌡️ | 2021-09 | In: text Out: text | Open Weights Released: 2025-05-21 |
Venice Medium | mistral-31-24b | 131.1K | 8.2K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2025-07-15 |
Venice Uncensored 1.1 | venice-uncensored | 32.8K | 8.2K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-07-15 |
Qwen 2.5 VL 72B | qwen-2.5-vl | 32.8K | 8.2K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2025-06-09 |
Venice Large | qwen3-235b | 131.1K | 8.2K | In: $1.5 Out: $6 | Model: 0.750 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-27 |
Venice Reasoning | qwen-2.5-qwq-32b | 32.8K | 8.2K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🧠 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-07-08 |
DeepSeek Coder V2 Lite | deepseek-coder-v2-lite | 131.1K | 8.2K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2021-09 | In: text Out: text | Open Weights Released: 2025-06-22 |
Venice Small | qwen3-4b | 32.8K | 8.2K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-07-27 |
Llama 3.3 70B | llama-3.3-70b | 65.5K | 8.2K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-06-09 |
Qwen 2.5 Coder 32B | qwen-2.5-coder-32b | 32.8K | 8.2K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-06-14 |
DeepSeek R1 671B | deepseek-r1-671b | 131.1K | 8.2K | In: $3.5 Out: $14 | Model: 1.750 Completion: 4.000 | 🧠 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-06-05 |
Llama 3.2 3B | llama-3.2-3b | 131.1K | 8.2K | In: $0.15 Out: $0.6 | Model: 0.075 Completion: 4.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-05-23 |
Llama 3.1 405B | llama-3.1-405b | 65.5K | 8.2K | In: $1.5 Out: $6 | Model: 0.750 Completion: 4.000 | 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-06-30 |
Vercel AI Gateway¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi K2 Instruct | moonshotai/kimi-k2 | 131.1K | 16.4K | In: $1 Out: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
Qwen3 Next 80B A3B Instruct | alibaba/qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | In: $0.5 Out: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-12 |
Qwen3 VL Instruct | alibaba/qwen3-vl-instruct | 131.1K | 129K | In: $0.7 Out: $2.8 | Model: 0.350 Completion: 4.000 | 📎 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-09-24 |
Qwen3 VL Thinking | alibaba/qwen3-vl-thinking | 131.1K | 129K | In: $0.7 Out: $8.4 | Model: 0.350 Completion: 12.000 | 📎 🧠 🔧 🌡️ | 2025-09 | In: text, image Out: text | Open Weights Released: 2025-09-24 |
Qwen3 Max | alibaba/qwen3-max | 262.1K | 32.8K | In: $1.2 Out: $6 | Model: 0.600 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
Qwen3 Coder Plus | alibaba/qwen3-coder-plus | 1M | 1M | In: $1 Out: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3 Next 80B A3B Thinking | alibaba/qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | In: $0.5 Out: $6 | Model: 0.250 Completion: 12.000 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-12 |
Grok 3 Mini Fast | xai/grok-3-mini-fast | 131.1K | 8.2K | In: $0.6 Out: $4 Cache Read: $0.15 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 3 Mini | xai/grok-3-mini | 131.1K | 8.2K | In: $0.3 Out: $0.5 Cache Read: $0.075 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 4 Fast | xai/grok-4-fast | 2M | 30K | In: $0.2 Out: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
Grok 3 | xai/grok-3 | 131.1K | 8.2K | In: $3 Out: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 2 | xai/grok-2 | 131.1K | 8.2K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 |
Grok Code Fast 1 | xai/grok-code-fast-1 | 256K | 10K | In: $0.2 Out: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-08-28 |
Grok 2 Vision | xai/grok-2-vision | 8.2K | 4.1K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 |
Grok 4 | xai/grok-4 | 256K | 64K | In: $3 Out: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
Grok 3 Fast | xai/grok-3-fast | 131.1K | 8.2K | In: $5 Out: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 4 Fast (Non-Reasoning) | xai/grok-4-fast-non-reasoning | 2M | 30K | In: $0.2 Out: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
Codestral | mistral/codestral | 256K | 4.1K | In: $0.3 Out: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-05-29 Updated: 2025-01-04 |
Magistral Medium | mistral/magistral-medium | 128K | 16.4K | In: $2 Out: $5 | Model: 1.000 Completion: 2.500 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 Updated: 2025-03-20 |
Mistral Large | mistral/mistral-large | 131.1K | 16.4K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-11 | In: text Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
Pixtral Large | mistral/pixtral-large | 128K | 128K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 📎 🔧 🌡️ | 2024-11 | In: text, image Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
Ministral 8B | mistral/ministral-8b | 128K | 128K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
Ministral 3B | mistral/ministral-3b | 128K | 128K | In: $0.04 Out: $0.04 | Model: 0.020 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
Magistral Small | mistral/magistral-small | 128K | 128K | In: $0.5 Out: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 |
Mistral Small | mistral/mistral-small | 128K | 16.4K | In: $0.1 Out: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Open Weights Released: 2024-09-01 Updated: 2024-09-04 |
Pixtral 12B | mistral/pixtral-12b | 128K | 128K | In: $0.15 Out: $0.15 | Model: 0.075 Completion: 1.000 | 📎 🔧 🌡️ | 2024-09 | In: text, image Out: text | Open Weights Released: 2024-09-01 |
Mixtral 8x22B | mistral/mixtral-8x22b-instruct | 64K | 64K | In: $2 Out: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-17 |
v0-1.0-md | vercel/v0-1.0-md | 128K | 32K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-05-22 |
v0-1.5-md | vercel/v0-1.5-md | 128K | 32K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
DeepSeek V3.2 Exp Thinking | deepseek/deepseek-v3.2-exp-thinking | 163.8K | 8.2K | In: $0.28 Out: $0.42 | Model: 0.140 Completion: 1.500 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Released: 2025-09-29 |
DeepSeek V3.1 Terminus | deepseek/deepseek-v3.1-terminus | 128K | 8.2K | In: $0.27 Out: $1 | Model: 0.135 Completion: 3.704 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
DeepSeek V3.2 Exp | deepseek/deepseek-v3.2-exp | 163.8K | 8.2K | In: $0.28 Out: $0.42 | Model: 0.140 Completion: 1.500 | 🔧 🌡️ | 2025-09 | In: text Out: text | Released: 2025-09-29 |
DeepSeek R1 Distill Llama 70B | deepseek/deepseek-r1-distill-llama-70b | 131.1K | 8.2K | In: $0.75 Out: $0.99 | Model: 0.375 Completion: 1.320 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 |
DeepSeek-R1 | deepseek/deepseek-r1 | 128K | 32.8K | In: $1.35 Out: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-05-29 |
Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
Gemini 2.5 Flash Preview 09-25 | google/gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Flash Lite Preview 09-25 | google/gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | In: $1.25 Out: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
Gemini 2.0 Flash | google/gemini-2.0-flash | 1M | 8.2K | In: $0.1 Out: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini 2.0 Flash Lite | google/gemini-2.0-flash-lite | 1M | 8.2K | In: $0.075 Out: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | In: $0.3 Out: $2.5 Cache Read: $0.075 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | In: $0.07 Out: $0.3 | Model: 0.035 Completion: 4.286 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | In: $0.1 Out: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
GPT-5 | openai/gpt-5 | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
GPT-4o mini | openai/gpt-4o-mini | 128K | 16.4K | In: $0.15 Out: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
o3 | openai/o3 | 200K | 100K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | In: $0.25 Out: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
o1 | openai/o1 | 200K | 100K | In: $15 Out: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
o4-mini | openai/o4-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | In: $2 Out: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-4o | openai/gpt-4o | 128K | 16.4K | In: $2.5 Out: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
GPT-5-Codex | openai/gpt-5-codex | 400K | 128K | In: $1.25 Out: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | In: $0.05 Out: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
o3-mini | openai/o3-mini | 200K | 100K | In: $1.1 Out: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
GPT-4 Turbo | openai/gpt-4-turbo | 128K | 4.1K | In: $10 Out: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
GPT-4.1 mini | openai/gpt-4.1-mini | 1M | 32.8K | In: $0.4 Out: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
GPT-4.1 nano | openai/gpt-4.1-nano | 1M | 32.8K | In: $0.1 Out: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
Sonar Reasoning | perplexity/sonar-reasoning | 127K | 8K | In: $1 Out: $5 | Model: 0.500 Completion: 5.000 | 🧠 🌡️ | 2025-09 | In: text Out: text | Released: 2025-02-19 |
Sonar | perplexity/sonar | 127K | 8K | In: $1 Out: $1 | Model: 0.500 Completion: 1.000 | 🌡️ | 2025-02 | In: text, image Out: text | Released: 2025-02-19 |
Sonar Pro | perplexity/sonar-pro | 200K | 8K | In: $3 Out: $15 | Model: 1.500 Completion: 5.000 | 🌡️ | 2025-09 | In: text, image Out: text | Released: 2025-02-19 |
Sonar Reasoning Pro | perplexity/sonar-reasoning-pro | 127K | 8K | In: $2 Out: $8 | Model: 1.000 Completion: 4.000 | 🧠 🌡️ | 2025-09 | In: text Out: text | Released: 2025-02-19 |
GLM 4.5 | zai/glm-4.5 | 128K | 96K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5 Air | zai/glm-4.5-air | 128K | 96K | In: $0.2 Out: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5V | zai/glm-4.5v | 66K | 16K | In: $0.6 Out: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-08 | In: text, image Out: text | Open Weights Released: 2025-08-11 |
GLM 4.6 | zai/glm-4.6 | 200K | 96K | In: $0.6 Out: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Nova Micro | amazon/nova-micro | 128K | 8.2K | In: $0.035 Out: $0.14 Cache Read: $0.00875 | Model: 0.018 Completion: 4.000 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-03 |
Nova Pro | amazon/nova-pro | 300K | 8.2K | In: $0.8 Out: $3.2 Cache Read: $0.2 | Model: 0.400 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
Nova Lite | amazon/nova-lite | 300K | 8.2K | In: $0.06 Out: $0.24 Cache Read: $0.015 | Model: 0.030 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
Morph v3 Fast | morph/morph-v3-fast | 16K | 16K | In: $0.8 Out: $1.2 | Model: 0.400 Completion: 1.500 | - | - | In: text Out: text | Released: 2024-08-15 |
Morph v3 Large | morph/morph-v3-large | 32K | 32K | In: $0.9 Out: $1.9 | Model: 0.450 Completion: 2.111 | - | - | In: text Out: text | Released: 2024-08-15 |
Llama-4-Scout-17B-16E-Instruct-FP8 | meta/llama-4-scout | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Llama-3.3-70B-Instruct | meta/llama-3.3-70b | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama-4-Maverick-17B-128E-Instruct-FP8 | meta/llama-4-maverick | 128K | 4.1K | - | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 200K | 64K | In: $1 Out: $1.25 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 1.250 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Claude Sonnet 3.7 | anthropic/claude-3.7-sonnet | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
Claude Haiku 3.5 | anthropic/claude-3-5-haiku | 200K | 8.2K | In: $0.8 Out: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
Claude Sonnet 4.5 | anthropic/claude-4.5-sonnet | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Claude Sonnet 3.5 v2 | anthropic/claude-3.5-sonnet | 200K | 8.2K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
Claude Opus 4 | anthropic/claude-4-1-opus | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Sonnet 4 | anthropic/claude-4-sonnet | 200K | 64K | In: $3 Out: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Claude Opus 3 | anthropic/claude-3-opus | 200K | 4.1K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-02-29 |
Claude Haiku 3 | anthropic/claude-3-haiku | 200K | 4.1K | In: $0.25 Out: $1.25 Cache Read: $0.03 Cache Write: $0.3 | Model: 0.125 Completion: 5.000 Cache: 0.120 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-13 |
Claude Opus 4 | anthropic/claude-4-opus | 200K | 32K | In: $15 Out: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Qwen 3 Coder 480B | cerebras/qwen3-coder | 131K | 32K | In: $2 Out: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Weights & Biases¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Kimi-K2-Instruct | moonshotai/Kimi-K2-Instruct | 128K | 16.4K | In: $1.35 Out: $4 | Model: 0.675 Completion: 2.963 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
Phi-4-mini-instruct | microsoft/Phi-4-mini-instruct | 128K | 4.1K | In: $0.08 Out: $0.35 | Model: 0.040 Completion: 4.375 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
Meta-Llama-3.1-8B-Instruct | meta-llama/Llama-3.1-8B-Instruct | 128K | 32.8K | In: $0.22 Out: $0.22 | Model: 0.110 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
Llama-3.3-70B-Instruct | meta-llama/Llama-3.3-70B-Instruct | 128K | 32.8K | In: $0.71 Out: $0.71 | Model: 0.355 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
Llama 4 Scout 17B 16E Instruct | meta-llama/Llama-4-Scout-17B-16E-Instruct | 64K | 8.2K | In: $0.17 Out: $0.66 | Model: 0.085 Completion: 3.882 | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
Qwen3-Coder-480B-A35B-Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | In: $1 Out: $1.5 | Model: 0.500 Completion: 1.500 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | In: $0.1 Out: $0.1 | Model: 0.050 Completion: 1.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | 161K | 163.8K | In: $1.35 Out: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
DeepSeek-V3-0324 | deepseek-ai/DeepSeek-V3-0324 | 161K | 8.2K | In: $1.14 Out: $2.75 | Model: 0.570 Completion: 2.412 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
xAI¶
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 2M | 30K | In: $0.2 Out: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
Grok 3 Fast | grok-3-fast | 131.1K | 8.2K | In: $5 Out: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 4 | grok-4 | 256K | 64K | In: $3 Out: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
Grok 2 Vision | grok-2-vision | 8.2K | 4.1K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 |
Grok Code Fast 1 | grok-code-fast-1 | 256K | 10K | In: $0.2 Out: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-08-28 |
Grok 2 | grok-2 | 131.1K | 8.2K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 |
Grok 3 Mini Fast Latest | grok-3-mini-fast-latest | 131.1K | 8.2K | In: $0.6 Out: $4 Cache Read: $0.15 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 2 Vision (1212) | grok-2-vision-1212 | 8.2K | 4.1K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
Grok 3 | grok-3 | 131.1K | 8.2K | In: $3 Out: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 4 Fast | grok-4-fast | 2M | 30K | In: $0.2 Out: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
Grok 2 Latest | grok-2-latest | 131.1K | 8.2K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
Grok 2 (1212) | grok-2-1212 | 131.1K | 8.2K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-12-12 |
Grok 3 Fast Latest | grok-3-fast-latest | 131.1K | 8.2K | In: $5 Out: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 3 Latest | grok-3-latest | 131.1K | 8.2K | In: $3 Out: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 2 Vision Latest | grok-2-vision-latest | 8.2K | 4.1K | In: $2 Out: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
Grok Vision Beta | grok-vision-beta | 8.2K | 4.1K | In: $5 Out: $15 Cache Read: $5 | Model: 2.500 Completion: 3.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-11-01 |
Grok 3 Mini | grok-3-mini | 131.1K | 8.2K | In: $0.3 Out: $0.5 Cache Read: $0.075 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok Beta | grok-beta | 131.1K | 4.1K | In: $5 Out: $15 Cache Read: $5 | Model: 2.500 Completion: 3.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-11-01 |
Grok 3 Mini Latest | grok-3-mini-latest | 131.1K | 8.2K | In: $0.3 Out: $0.5 Cache Read: $0.075 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Grok 3 Mini Fast | grok-3-mini-fast | 131.1K | 8.2K | In: $0.6 Out: $4 Cache Read: $0.15 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Z.AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5 | glm-4.5 | 131.1K | 98.3K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | In: $0.2 Out: $1.1 Cache Read: $0.03 | Model: 0.100 Completion: 5.500 Cache: 0.150 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5V | glm-4.5v | 64K | 16.4K | In: $0.6 Out: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
GLM-4.6 | glm-4.6 | 204.8K | 131.1K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Z.AI Coding Plan¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5 | glm-4.5 | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM 4.5V | glm-4.5v | 64K | 16.4K | - | - | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
GLM-4.6 | glm-4.6 | 204.8K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Zhipu AI¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GLM-4.6 | glm-4.6 | 204.8K | 131.1K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
GLM 4.5V | glm-4.5v | 64K | 16.4K | In: $0.6 Out: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | In: $0.2 Out: $1.1 Cache Read: $0.03 | Model: 0.100 Completion: 5.500 Cache: 0.150 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5 | glm-4.5 | 131.1K | 98.3K | In: $0.6 Out: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
Zhipu AI Coding Plan¶
📖 API Address | 📚 Official Documentation
Model | Model ID | Context | Output | Pricing ($/1M) | NewAPI Ratios | Capabilities | Knowledge | Modalities | Details |
---|---|---|---|---|---|---|---|---|---|
GLM-4.6 | glm-4.6 | 204.8K | 131.1K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
GLM 4.5V | glm-4.5v | 64K | 16.4K | - | - | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5 | glm-4.5 | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | - | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |