LLM Pricing
Models
Calculator
Compare
API vs Subscription
Menu
Models
Calculator
Compare
API vs Subscription
Cost calculator
Input
Model
ai21.j2-mid-v1 (Bedrock)
ai21.j2-ultra-v1 (Bedrock)
ai21.jamba-1-5-large-v1:0 (Bedrock)
ai21.jamba-1-5-mini-v1:0 (Bedrock)
ai21.jamba-instruct-v1:0 (Bedrock)
us.writer.palmyra-x4-v1:0 (bedrock_converse)
us.writer.palmyra-x5-v1:0 (bedrock_converse)
writer.palmyra-x4-v1:0 (bedrock_converse)
writer.palmyra-x5-v1:0 (bedrock_converse)
amazon.nova-lite-v1:0 (bedrock_converse)
amazon.nova-2-lite-v1:0 (bedrock_converse)
amazon.nova-2-pro-preview-20251202-v1:0 (bedrock_converse)
apac.amazon.nova-2-lite-v1:0 (bedrock_converse)
apac.amazon.nova-2-pro-preview-20251202-v1:0 (bedrock_converse)
eu.amazon.nova-2-lite-v1:0 (bedrock_converse)
eu.amazon.nova-2-pro-preview-20251202-v1:0 (bedrock_converse)
us.amazon.nova-2-lite-v1:0 (bedrock_converse)
us.amazon.nova-2-pro-preview-20251202-v1:0 (bedrock_converse)
amazon.nova-2-multimodal-embeddings-v1:0 (Bedrock)
amazon.nova-micro-v1:0 (bedrock_converse)
amazon.nova-pro-v1:0 (bedrock_converse)
amazon.titan-embed-image-v1 (Bedrock)
amazon.titan-embed-text-v1 (Bedrock)
amazon.titan-embed-text-v2:0 (Bedrock)
twelvelabs.marengo-embed-2-7-v1:0 (Bedrock)
us.twelvelabs.marengo-embed-2-7-v1:0 (Bedrock)
eu.twelvelabs.marengo-embed-2-7-v1:0 (Bedrock)
twelvelabs.pegasus-1-2-v1:0 (Bedrock)
us.twelvelabs.pegasus-1-2-v1:0 (Bedrock)
eu.twelvelabs.pegasus-1-2-v1:0 (Bedrock)
amazon.titan-text-express-v1 (Bedrock)
amazon.titan-text-lite-v1 (Bedrock)
amazon.titan-text-premier-v1:0 (Bedrock)
anthropic.claude-3-5-haiku-20241022-v1:0 (Bedrock)
anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
anthropic.claude-haiku-4-5@20251001 (bedrock_converse)
anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
anthropic.claude-3-5-sonnet-20241022-v2:0 (Bedrock)
anthropic.claude-3-7-sonnet-20240620-v1:0 (Bedrock)
anthropic.claude-3-7-sonnet-20250219-v1:0 (bedrock_converse)
anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
anthropic.claude-3-opus-20240229-v1:0 (Bedrock)
anthropic.claude-3-sonnet-20240229-v1:0 (Bedrock)
anthropic.claude-instant-v1 (Bedrock)
anthropic.claude-opus-4-1-20250805-v1:0 (bedrock_converse)
anthropic.claude-opus-4-20250514-v1:0 (bedrock_converse)
anthropic.claude-opus-4-5-20251101-v1:0 (bedrock_converse)
anthropic.claude-opus-4-6-v1 (bedrock_converse)
global.anthropic.claude-opus-4-6-v1 (bedrock_converse)
us.anthropic.claude-opus-4-6-v1 (bedrock_converse)
eu.anthropic.claude-opus-4-6-v1 (bedrock_converse)
au.anthropic.claude-opus-4-6-v1 (bedrock_converse)
anthropic.claude-opus-4-7 (bedrock_converse)
global.anthropic.claude-opus-4-7 (bedrock_converse)
us.anthropic.claude-opus-4-7 (bedrock_converse)
eu.anthropic.claude-opus-4-7 (bedrock_converse)
au.anthropic.claude-opus-4-7 (bedrock_converse)
anthropic.claude-sonnet-4-6 (bedrock_converse)
global.anthropic.claude-sonnet-4-6 (bedrock_converse)
us.anthropic.claude-sonnet-4-6 (bedrock_converse)
eu.anthropic.claude-sonnet-4-6 (bedrock_converse)
au.anthropic.claude-sonnet-4-6 (bedrock_converse)
jp.anthropic.claude-sonnet-4-6 (bedrock_converse)
anthropic.claude-sonnet-4-20250514-v1:0 (bedrock_converse)
anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
anthropic.claude-v1 (Bedrock)
anthropic.claude-v2:1 (Bedrock)
zephyr-7b-beta (anyscale)
CodeLlama-34b-Instruct-hf (anyscale)
CodeLlama-70b-Instruct-hf (anyscale)
gemma-7b-it (anyscale)
Llama-2-13b-chat-hf (anyscale)
Llama-2-70b-chat-hf (anyscale)
Llama-2-7b-chat-hf (anyscale)
Meta-Llama-3-70B-Instruct (anyscale)
Meta-Llama-3-8B-Instruct (anyscale)
Mistral-7B-Instruct-v0.1 (anyscale)
Mixtral-8x22B-Instruct-v0.1 (anyscale)
Mixtral-8x7B-Instruct-v0.1 (anyscale)
apac.amazon.nova-lite-v1:0 (bedrock_converse)
apac.amazon.nova-micro-v1:0 (bedrock_converse)
apac.amazon.nova-pro-v1:0 (bedrock_converse)
apac.anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
apac.anthropic.claude-3-5-sonnet-20241022-v2:0 (Bedrock)
apac.anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
apac.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
apac.anthropic.claude-3-sonnet-20240229-v1:0 (Bedrock)
apac.anthropic.claude-sonnet-4-20250514-v1:0 (bedrock_converse)
au.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
ada (Azure)
codex-mini (Azure)
command-r-plus (Azure)
claude-haiku-4-5 (azure_ai)
claude-opus-4-5 (azure_ai)
claude-opus-4-6 (azure_ai)
claude-opus-4-7 (azure_ai)
claude-opus-4-1 (azure_ai)
claude-sonnet-4-5 (azure_ai)
claude-sonnet-4-6 (azure_ai)
computer-use-preview (Azure)
gpt-oss-120b (azure_ai)
gpt-5.4 (azure_ai)
gpt-5.4-2026-03-05 (azure_ai)
gpt-5.4-pro (azure_ai)
gpt-5.4-pro-2026-03-05 (azure_ai)
gpt-5.4-mini (azure_ai)
gpt-5.4-mini-2026-03-17 (azure_ai)
gpt-5.4-nano (azure_ai)
gpt-5.4-nano-2026-03-17 (azure_ai)
model_router (azure_ai)
gpt-4o-2024-08-06 (Azure)
gpt-4o-2024-11-20 (Azure)
gpt-4o-mini-2024-07-18 (Azure)
gpt-4o-mini-realtime-preview-2024-12-17 (Azure)
gpt-4o-realtime-preview-2024-10-01 (Azure)
gpt-4o-realtime-preview-2024-12-17 (Azure)
gpt-5-2025-08-07 (Azure)
gpt-5-mini-2025-08-07 (Azure)
gpt-5.1 (Azure)
gpt-5.1-chat (Azure)
gpt-5.1-codex (Azure)
gpt-5.1-codex-mini (Azure)
gpt-5-nano-2025-08-07 (Azure)
o1-2024-12-17 (Azure)
o1-mini-2024-09-12 (Azure)
o1-preview-2024-09-12 (Azure)
o3-mini-2025-01-31 (Azure)
gpt-4o-2024-08-06 (Azure)
gpt-4o-2024-11-20 (Azure)
gpt-4o-mini (Azure)
gpt-4o-2024-08-06 (Azure)
gpt-4o-2024-11-20 (Azure)
gpt-5.1 (Azure)
gpt-5.1-chat (Azure)
gpt-5.1-codex (Azure)
gpt-5.1-codex-mini (Azure)
gpt-3.5-turbo (Azure)
gpt-3.5-turbo-0125 (Azure)
gpt-3.5-turbo-instruct-0914 (azure_text)
gpt-35-turbo (Azure)
gpt-35-turbo-0125 (Azure)
gpt-35-turbo-1106 (Azure)
gpt-35-turbo-16k (Azure)
gpt-35-turbo-16k-0613 (Azure)
gpt-35-turbo-instruct (azure_text)
gpt-35-turbo-instruct-0914 (azure_text)
gpt-4 (Azure)
gpt-4-0125-preview (Azure)
gpt-4-0613 (Azure)
gpt-4-1106-preview (Azure)
gpt-4-32k (Azure)
gpt-4-32k-0613 (Azure)
gpt-4-turbo (Azure)
gpt-4-turbo-2024-04-09 (Azure)
gpt-4-turbo-vision-preview (Azure)
gpt-4.1 (Azure)
gpt-4.1-2025-04-14 (Azure)
gpt-4.1-mini (Azure)
gpt-4.1-mini-2025-04-14 (Azure)
gpt-4.1-nano (Azure)
gpt-4.1-nano-2025-04-14 (Azure)
gpt-4.5-preview (Azure)
gpt-4o (Azure)
gpt-4o-2024-05-13 (Azure)
gpt-4o-2024-08-06 (Azure)
gpt-4o-2024-11-20 (Azure)
gpt-audio-2025-08-28 (Azure)
gpt-audio-1.5-2026-02-23 (Azure)
gpt-audio-mini-2025-10-06 (Azure)
gpt-4o-audio-preview-2024-12-17 (Azure)
gpt-4o-mini (Azure)
gpt-4o-mini-2024-07-18 (Azure)
gpt-4o-mini-audio-preview-2024-12-17 (Azure)
gpt-4o-mini-realtime-preview-2024-12-17 (Azure)
gpt-realtime-2025-08-28 (Azure)
gpt-realtime-1.5-2026-02-23 (Azure)
gpt-realtime-mini-2025-10-06 (Azure)
gpt-4o-mini-transcribe (Azure)
gpt-4o-mini-tts (Azure)
gpt-4o-realtime-preview-2024-10-01 (Azure)
gpt-4o-realtime-preview-2024-12-17 (Azure)
gpt-4o-transcribe (Azure)
gpt-4o-transcribe-diarize (Azure)
gpt-5.1-2025-11-13 (Azure)
gpt-5.1-chat-2025-11-13 (Azure)
gpt-5.1-codex-2025-11-13 (Azure)
gpt-5.1-codex-mini-2025-11-13 (Azure)
gpt-5 (Azure)
gpt-5-2025-08-07 (Azure)
gpt-5-chat (Azure)
gpt-5-chat-latest (Azure)
gpt-5-codex (Azure)
gpt-5-mini (Azure)
gpt-5-mini-2025-08-07 (Azure)
gpt-5-nano (Azure)
gpt-5-nano-2025-08-07 (Azure)
gpt-5-pro (Azure)
gpt-5.1 (Azure)
gpt-5.1-chat (Azure)
gpt-5.1-codex (Azure)
gpt-5.1-codex-max (Azure)
gpt-5.1-codex-mini (Azure)
gpt-5.2 (Azure)
gpt-5.2-2025-12-11 (Azure)
gpt-5.2-chat (Azure)
gpt-5.2-chat-2025-12-11 (Azure)
gpt-5.2-codex (Azure)
gpt-5.3-chat (Azure)
gpt-5.3-codex (Azure)
gpt-5.2-pro (Azure)
gpt-5.2-pro-2025-12-11 (Azure)
gpt-5.4 (Azure)
gpt-5.4-2026-03-05 (Azure)
gpt-5.4-pro (Azure)
gpt-5.4-pro-2026-03-05 (Azure)
gpt-5.5 (Azure)
gpt-5.5-2026-04-23 (Azure)
gpt-5.5-pro (Azure)
gpt-5.5-pro-2026-04-23 (Azure)
gpt-5.4-mini (Azure)
gpt-5.4-mini-2026-03-17 (Azure)
gpt-5.4-nano (Azure)
gpt-5.4-nano-2026-03-17 (Azure)
gpt-image-1 (Azure)
gpt-image-1-mini (Azure)
gpt-image-1.5 (Azure)
gpt-image-1.5-2025-12-16 (Azure)
gpt-image-2 (Azure)
gpt-image-2-2026-04-21 (Azure)
mistral-large-2402 (Azure)
mistral-large-latest (Azure)
azure (Azure)
o1-2024-12-17 (Azure)
o1-mini (Azure)
o1-mini-2024-09-12 (Azure)
o1-preview (Azure)
o1-preview-2024-09-12 (Azure)
azure (Azure)
o3-2025-04-16 (Azure)
o3-deep-research (Azure)
o3-mini (Azure)
o3-mini-2025-01-31 (Azure)
o3-pro (Azure)
o3-pro-2025-06-10 (Azure)
o4-mini (Azure)
o4-mini-2025-04-16 (Azure)
text-embedding-3-large (Azure)
text-embedding-3-small (Azure)
text-embedding-ada-002 (Azure)
gpt-4.1-2025-04-14 (Azure)
gpt-4.1-mini-2025-04-14 (Azure)
gpt-4.1-nano-2025-04-14 (Azure)
gpt-4o-2024-08-06 (Azure)
gpt-4o-2024-11-20 (Azure)
gpt-4o-mini-2024-07-18 (Azure)
gpt-4o-mini-realtime-preview-2024-12-17 (Azure)
gpt-4o-realtime-preview-2024-10-01 (Azure)
gpt-4o-realtime-preview-2024-12-17 (Azure)
gpt-5-2025-08-07 (Azure)
gpt-5-mini-2025-08-07 (Azure)
gpt-5-nano-2025-08-07 (Azure)
gpt-5.1 (Azure)
gpt-5.1-chat (Azure)
gpt-5.1-codex (Azure)
gpt-5.1-codex-mini (Azure)
o1-2024-12-17 (Azure)
o1-mini-2024-09-12 (Azure)
o1-preview-2024-09-12 (Azure)
o3-2025-04-16 (Azure)
o3-mini-2025-01-31 (Azure)
o4-mini-2025-04-16 (Azure)
Cohere-embed-v3-english (azure_ai)
Cohere-embed-v3-multilingual (azure_ai)
Llama-3.2-11B-Vision-Instruct (azure_ai)
Llama-3.2-90B-Vision-Instruct (azure_ai)
Llama-3.3-70B-Instruct (azure_ai)
Llama-4-Maverick-17B-128E-Instruct-FP8 (azure_ai)
Llama-4-Scout-17B-16E-Instruct (azure_ai)
Meta-Llama-3-70B-Instruct (azure_ai)
Meta-Llama-3.1-405B-Instruct (azure_ai)
Meta-Llama-3.1-70B-Instruct (azure_ai)
Meta-Llama-3.1-8B-Instruct (azure_ai)
Phi-3-medium-128k-instruct (azure_ai)
Phi-3-medium-4k-instruct (azure_ai)
Phi-3-mini-128k-instruct (azure_ai)
Phi-3-mini-4k-instruct (azure_ai)
Phi-3-small-128k-instruct (azure_ai)
Phi-3-small-8k-instruct (azure_ai)
Phi-3.5-MoE-instruct (azure_ai)
Phi-3.5-mini-instruct (azure_ai)
Phi-3.5-vision-instruct (azure_ai)
Phi-4 (azure_ai)
Phi-4-mini-instruct (azure_ai)
Phi-4-multimodal-instruct (azure_ai)
Phi-4-mini-reasoning (azure_ai)
Phi-4-reasoning (azure_ai)
MAI-DS-R1 (azure_ai)
deepseek-v3.2 (azure_ai)
deepseek-v3.2-speciale (azure_ai)
deepseek-r1 (azure_ai)
deepseek-v3 (azure_ai)
deepseek-v3-0324 (azure_ai)
embed-v-4-0 (azure_ai)
grok-3 (azure_ai)
grok-3-mini (azure_ai)
grok-3 (azure_ai)
grok-3-mini (azure_ai)
grok-4 (azure_ai)
grok-4-fast-non-reasoning (azure_ai)
grok-4-fast-reasoning (azure_ai)
grok-4-1-fast-non-reasoning (azure_ai)
grok-4-1-fast-reasoning (azure_ai)
grok-code-fast-1 (azure_ai)
jais-30b-chat (azure_ai)
jamba-instruct (azure_ai)
kimi-k2.5 (azure_ai)
ministral-3b (azure_ai)
mistral-large (azure_ai)
mistral-large-2407 (azure_ai)
mistral-large-latest (azure_ai)
mistral-large-3 (azure_ai)
mistral-medium-2505 (azure_ai)
mistral-nemo (azure_ai)
mistral-small (azure_ai)
mistral-small-2503 (azure_ai)
babbage-002 (text-completion-openai)
anthropic.claude-instant-v1 (Bedrock)
anthropic.claude-v1 (Bedrock)
anthropic.claude-v2:1 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
minimax.minimax-m2.5 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
anthropic.claude-instant-v1 (Bedrock)
anthropic.claude-v1 (Bedrock)
anthropic.claude-v2:1 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
mistral.mistral-7b-instruct-v0:2 (Bedrock)
mistral.mistral-large-2402-v1:0 (Bedrock)
mistral.mixtral-8x7b-instruct-v0:1 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
anthropic.claude-instant-v1 (Bedrock)
anthropic.claude-v1 (Bedrock)
anthropic.claude-v2:1 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
mistral.mistral-7b-instruct-v0:2 (Bedrock)
mistral.mistral-large-2402-v1:0 (Bedrock)
mistral.mixtral-8x7b-instruct-v0:1 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
amazon.nova-pro-v1:0 (Bedrock)
amazon.titan-embed-text-v1 (Bedrock)
amazon.titan-embed-text-v2:0 (Bedrock)
amazon.titan-text-express-v1 (Bedrock)
amazon.titan-text-lite-v1 (Bedrock)
amazon.titan-text-premier-v1:0 (Bedrock)
anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
anthropic.claude-sonnet-4-5-20250929-v1:0 (Bedrock)
claude-sonnet-4-5-20250929-v1:0 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
amazon.nova-pro-v1:0 (Bedrock)
amazon.titan-embed-text-v1 (Bedrock)
amazon.titan-embed-text-v2:0 (Bedrock)
amazon.titan-text-express-v1 (Bedrock)
amazon.titan-text-lite-v1 (Bedrock)
amazon.titan-text-premier-v1:0 (Bedrock)
anthropic.claude-3-7-sonnet-20250219-v1:0 (Bedrock)
anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
anthropic.claude-sonnet-4-5-20250929-v1:0 (Bedrock)
claude-sonnet-4-5-20250929-v1:0 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
anthropic.claude-instant-v1 (Bedrock)
anthropic.claude-v1 (Bedrock)
anthropic.claude-v2:1 (Bedrock)
mistral.mistral-7b-instruct-v0:2 (Bedrock)
mistral.mistral-large-2402-v1:0 (Bedrock)
mistral.mixtral-8x7b-instruct-v0:1 (Bedrock)
deepseek.v3.2 (Bedrock)
minimax.minimax-m2.1 (Bedrock)
minimax.minimax-m2.5 (Bedrock)
moonshotai.kimi-k2-thinking (Bedrock)
moonshotai.kimi-k2.5 (Bedrock)
qwen.qwen3-coder-next (Bedrock)
us.anthropic.claude-3-5-haiku-20241022-v1:0 (Bedrock)
llama-3.3-70b (cerebras)
llama3.1-70b (cerebras)
llama3.1-8b (cerebras)
gpt-oss-120b (cerebras)
qwen-3-32b (cerebras)
zai-glm-4.6 (cerebras)
zai-glm-4.7 (cerebras)
chatdolphin (nlp_cloud)
chatgpt-4o-latest (OpenAI)
gpt-4o-transcribe-diarize (OpenAI)
claude-haiku-4-5-20251001 (Anthropic)
claude-haiku-4-5 (Anthropic)
claude-3-7-sonnet-20250219 (Anthropic)
claude-3-haiku-20240307 (Anthropic)
claude-3-opus-20240229 (Anthropic)
claude-4-opus-20250514 (Anthropic)
claude-4-sonnet-20250514 (Anthropic)
claude-sonnet-4-5 (Anthropic)
claude-sonnet-4-5-20250929 (Anthropic)
claude-sonnet-4-6 (Anthropic)
claude-sonnet-4-5-20250929-v1:0 (Bedrock)
claude-opus-4-1 (Anthropic)
claude-opus-4-1-20250805 (Anthropic)
claude-opus-4-20250514 (Anthropic)
claude-opus-4-5-20251101 (Anthropic)
claude-opus-4-5 (Anthropic)
claude-opus-4-6 (Anthropic)
claude-opus-4-6-20260205 (Anthropic)
claude-opus-4-7 (Anthropic)
claude-opus-4-7-20260416 (Anthropic)
Claude Sonnet 4 (Anthropic)
llama-2-7b-chat-fp16 (cloudflare)
llama-2-7b-chat-int8 (cloudflare)
mistral-7b-instruct-v0.1 (cloudflare)
codellama-7b-instruct-awq (cloudflare)
codex-mini-latest (OpenAI)
cohere.command-light-text-v14 (Bedrock)
cohere.command-r-plus-v1:0 (Bedrock)
cohere.command-r-v1:0 (Bedrock)
cohere.command-text-v14 (Bedrock)
cohere.embed-english-v3 (Bedrock)
cohere.embed-multilingual-v3 (Bedrock)
cohere.embed-v4:0 (Bedrock)
embed-v4.0 (Cohere)
command (Cohere)
command-a-03-2025 (cohere_chat)
command-light (cohere_chat)
command-nightly (Cohere)
command-r (cohere_chat)
command-r-08-2024 (cohere_chat)
command-r-plus (cohere_chat)
command-r-plus-08-2024 (cohere_chat)
command-r7b-12-2024 (cohere_chat)
computer-use-preview (Azure)
DeepSeek Chat (DeepSeek)
deepseek-reasoner (DeepSeek)
qwen-coder (dashscope)
qwen-max (dashscope)
qwen-plus (dashscope)
qwen-plus-2025-01-25 (dashscope)
qwen-plus-2025-04-28 (dashscope)
qwen-plus-2025-07-14 (dashscope)
qwen-turbo (dashscope)
qwen-turbo-2024-11-01 (dashscope)
qwen-turbo-2025-04-28 (dashscope)
qwen-turbo-latest (dashscope)
qwen3-next-80b-a3b-instruct (dashscope)
qwen3-next-80b-a3b-thinking (dashscope)
qwen3-vl-235b-a22b-instruct (dashscope)
qwen3-vl-235b-a22b-thinking (dashscope)
qwen3-vl-32b-instruct (dashscope)
qwen3-vl-32b-thinking (dashscope)
qwq-plus (dashscope)
databricks-bge-large-en (databricks)
databricks-claude-3-7-sonnet (databricks)
databricks-claude-haiku-4-5 (databricks)
databricks-claude-opus-4 (databricks)
databricks-claude-opus-4-1 (databricks)
databricks-claude-opus-4-5 (databricks)
databricks-claude-sonnet-4 (databricks)
databricks-claude-sonnet-4-1 (databricks)
databricks-claude-sonnet-4-5 (databricks)
databricks-gemini-2-5-flash (databricks)
databricks-gemini-2-5-pro (databricks)
databricks-gemma-3-12b (databricks)
databricks-gpt-5 (databricks)
databricks-gpt-5-1 (databricks)
databricks-gpt-5-mini (databricks)
databricks-gpt-5-nano (databricks)
databricks-gpt-oss-120b (databricks)
databricks-gpt-oss-20b (databricks)
databricks-gte-large-en (databricks)
databricks-llama-2-70b-chat (databricks)
databricks-llama-4-maverick (databricks)
databricks-meta-llama-3-1-405b-instruct (databricks)
databricks-meta-llama-3-1-8b-instruct (databricks)
databricks-meta-llama-3-3-70b-instruct (databricks)
databricks-meta-llama-3-70b-instruct (databricks)
databricks-mixtral-8x7b-instruct (databricks)
databricks-mpt-30b-instruct (databricks)
databricks-mpt-7b-instruct (databricks)
davinci-002 (text-completion-openai)
MythoMax-L2-13b (deepinfra)
Hermes-3-Llama-3.1-405B (deepinfra)
Hermes-3-Llama-3.1-70B (deepinfra)
QwQ-32B (deepinfra)
Qwen2.5-72B-Instruct (deepinfra)
Qwen2.5-7B-Instruct (deepinfra)
Qwen2.5-VL-32B-Instruct (deepinfra)
Qwen3-14B (deepinfra)
Qwen3-235B-A22B (deepinfra)
Qwen3-235B-A22B-Instruct-2507 (deepinfra)
Qwen3-235B-A22B-Thinking-2507 (deepinfra)
Qwen3-30B-A3B (deepinfra)
Qwen3-32B (deepinfra)
Qwen3-Coder-480B-A35B-Instruct (deepinfra)
Qwen3-Coder-480B-A35B-Instruct-Turbo (deepinfra)
Qwen3-Next-80B-A3B-Instruct (deepinfra)
Qwen3-Next-80B-A3B-Thinking (deepinfra)
L3-8B-Lunaris-v1-Turbo (deepinfra)
L3.1-70B-Euryale-v2.2 (deepinfra)
L3.3-70B-Euryale-v2.3 (deepinfra)
olmOCR-7B-0725-FP8 (deepinfra)
claude-3-7-sonnet-latest (deepinfra)
claude-4-opus (deepinfra)
claude-4-sonnet (deepinfra)
DeepSeek-R1 (deepinfra)
DeepSeek-R1-0528 (deepinfra)
DeepSeek-R1-0528-Turbo (deepinfra)
DeepSeek-R1-Distill-Llama-70B (deepinfra)
DeepSeek-R1-Distill-Qwen-32B (deepinfra)
DeepSeek-R1-Turbo (deepinfra)
DeepSeek-V3 (deepinfra)
DeepSeek-V3-0324 (deepinfra)
DeepSeek-V3.1 (deepinfra)
DeepSeek-V3.1-Terminus (deepinfra)
gemini-2.0-flash-001 (deepinfra)
gemini-2.5-flash (deepinfra)
gemini-2.5-pro (deepinfra)
gemma-3-12b-it (deepinfra)
gemma-3-27b-it (deepinfra)
gemma-3-4b-it (deepinfra)
Llama-3.2-11B-Vision-Instruct (deepinfra)
Llama-3.2-3B-Instruct (deepinfra)
Llama-3.3-70B-Instruct (deepinfra)
Llama-3.3-70B-Instruct-Turbo (deepinfra)
Llama-4-Maverick-17B-128E-Instruct-FP8 (deepinfra)
Llama-4-Scout-17B-16E-Instruct (deepinfra)
Llama-Guard-3-8B (deepinfra)
Llama-Guard-4-12B (deepinfra)
Meta-Llama-3-8B-Instruct (deepinfra)
Meta-Llama-3.1-70B-Instruct (deepinfra)
Meta-Llama-3.1-70B-Instruct-Turbo (deepinfra)
Meta-Llama-3.1-8B-Instruct (deepinfra)
Meta-Llama-3.1-8B-Instruct-Turbo (deepinfra)
WizardLM-2-8x22B (deepinfra)
phi-4 (deepinfra)
Mistral-Nemo-Instruct-2407 (deepinfra)
Mistral-Small-24B-Instruct-2501 (deepinfra)
Mistral-Small-3.2-24B-Instruct-2506 (deepinfra)
Mixtral-8x7B-Instruct-v0.1 (deepinfra)
Kimi-K2-Instruct (deepinfra)
Kimi-K2-Instruct-0905 (deepinfra)
Llama-3.1-Nemotron-70B-Instruct (deepinfra)
Llama-3.3-Nemotron-Super-49B-v1.5 (deepinfra)
NVIDIA-Nemotron-Nano-9B-v2 (deepinfra)
gpt-oss-120b (deepinfra)
gpt-oss-20b (deepinfra)
GLM-4.5 (deepinfra)
deepseek-chat (DeepSeek)
deepseek-coder (DeepSeek)
deepseek-r1 (DeepSeek)
deepseek-reasoner (DeepSeek)
deepseek-v3 (DeepSeek)
deepseek-v3.2 (DeepSeek)
deepseek.v3-v1:0 (bedrock_converse)
deepseek.v3.2 (bedrock_converse)
dolphin (nlp_cloud)
embed-english-light-v2.0 (Cohere)
embed-english-light-v3.0 (Cohere)
embed-english-v2.0 (Cohere)
embed-english-v3.0 (Cohere)
embed-multilingual-v2.0 (Cohere)
embed-multilingual-v3.0 (Cohere)
embed-multilingual-light-v3.0 (Cohere)
eu.amazon.nova-lite-v1:0 (bedrock_converse)
eu.amazon.nova-micro-v1:0 (bedrock_converse)
eu.amazon.nova-pro-v1:0 (bedrock_converse)
eu.anthropic.claude-3-5-haiku-20241022-v1:0 (Bedrock)
eu.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
eu.anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
eu.anthropic.claude-3-5-sonnet-20241022-v2:0 (Bedrock)
eu.anthropic.claude-3-7-sonnet-20250219-v1:0 (Bedrock)
eu.anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
eu.anthropic.claude-3-opus-20240229-v1:0 (Bedrock)
eu.anthropic.claude-3-sonnet-20240229-v1:0 (Bedrock)
eu.anthropic.claude-opus-4-1-20250805-v1:0 (bedrock_converse)
eu.anthropic.claude-opus-4-20250514-v1:0 (bedrock_converse)
eu.anthropic.claude-sonnet-4-20250514-v1:0 (bedrock_converse)
eu.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
eu.meta.llama3-2-1b-instruct-v1:0 (Bedrock)
eu.meta.llama3-2-3b-instruct-v1:0 (Bedrock)
eu.mistral.pixtral-large-2502-v1:0 (bedrock_converse)
fireworks-ai-4.1b-to-16b (fireworks_ai)
fireworks-ai-56b-to-176b (fireworks_ai)
fireworks-ai-above-16b (fireworks_ai)
fireworks-ai-embedding-150m-to-350m (fireworks_ai-embedding-models)
fireworks-ai-embedding-up-to-150m (fireworks_ai-embedding-models)
fireworks-ai-moe-up-to-56b (fireworks_ai)
fireworks-ai-up-to-4b (fireworks_ai)
UAE-Large-V1 (fireworks_ai-embedding-models)
deepseek-coder-v2-instruct (fireworks_ai)
deepseek-r1 (fireworks_ai)
deepseek-r1-0528 (fireworks_ai)
deepseek-r1-basic (fireworks_ai)
deepseek-v3 (fireworks_ai)
deepseek-v3-0324 (fireworks_ai)
deepseek-v3p1 (fireworks_ai)
deepseek-v3p1-terminus (fireworks_ai)
deepseek-v3p2 (fireworks_ai)
firefunction-v2 (fireworks_ai)
glm-4p5 (fireworks_ai)
glm-4p5-air (fireworks_ai)
glm-4p6 (fireworks_ai)
glm-4p7 (fireworks_ai)
glm-5p1 (fireworks_ai)
gpt-oss-120b (fireworks_ai)
gpt-oss-20b (fireworks_ai)
kimi-k2-instruct (fireworks_ai)
kimi-k2-instruct-0905 (fireworks_ai)
kimi-k2-thinking (fireworks_ai)
kimi-k2p5 (fireworks_ai)
llama-v3p1-405b-instruct (fireworks_ai)
llama-v3p1-8b-instruct (fireworks_ai)
llama-v3p2-11b-vision-instruct (fireworks_ai)
llama-v3p2-1b-instruct (fireworks_ai)
llama-v3p2-3b-instruct (fireworks_ai)
llama-v3p2-90b-vision-instruct (fireworks_ai)
llama4-maverick-instruct-basic (fireworks_ai)
llama4-scout-instruct-basic (fireworks_ai)
minimax-m2p1 (fireworks_ai)
mixtral-8x22b-instruct-hf (fireworks_ai)
qwen2-72b-instruct (fireworks_ai)
qwen2p5-coder-32b-instruct (fireworks_ai)
yi-large (fireworks_ai)
glm-4p7 (fireworks_ai)
glm-5p1 (fireworks_ai)
kimi-k2p5 (fireworks_ai)
minimax-m2p1 (fireworks_ai)
nomic-embed-text-v1 (fireworks_ai-embedding-models)
nomic-embed-text-v1.5 (fireworks_ai-embedding-models)
gte-base (fireworks_ai-embedding-models)
gte-large (fireworks_ai-embedding-models)
meta-llama-3.1-70b-instruct (friendliai)
meta-llama-3.1-8b-instruct (friendliai)
ft:babbage-002 (text-completion-openai)
ft:davinci-002 (text-completion-openai)
ft:gpt-3.5-turbo (OpenAI)
ft:gpt-3.5-turbo-0125 (OpenAI)
ft:gpt-3.5-turbo-0613 (OpenAI)
ft:gpt-3.5-turbo-1106 (OpenAI)
ft:gpt-4-0613 (OpenAI)
ft:gpt-4o-2024-08-06 (OpenAI)
ft:gpt-4o-2024-11-20 (OpenAI)
ft:gpt-4o-mini-2024-07-18 (OpenAI)
ft:gpt-4.1-2025-04-14 (OpenAI)
ft:gpt-4.1-mini-2025-04-14 (OpenAI)
ft:gpt-4.1-nano-2025-04-14 (OpenAI)
ft:o4-mini-2025-04-16 (OpenAI)
gemini-2.0-flash (vertex_ai-language-models)
gemini-2.0-flash-001 (vertex_ai-language-models)
gemini-2.0-flash-lite (vertex_ai-language-models)
gemini-2.0-flash-lite-001 (vertex_ai-language-models)
gemini-2.5-flash (vertex_ai-language-models)
gemini-2.5-flash-image (vertex_ai-language-models)
gemini-3-pro-image-preview (vertex_ai-language-models)
gemini-3.1-flash-image-preview (vertex_ai-language-models)
gemini-3.1-flash-lite-preview (vertex_ai-language-models)
gemini-3.1-flash-lite (vertex_ai-language-models)
deep-research-pro-preview-12-2025 (vertex_ai-language-models)
gemini-2.5-flash-lite (vertex_ai-language-models)
gemini-2.5-flash-lite-preview-09-2025 (vertex_ai-language-models)
gemini-2.5-flash-preview-09-2025 (vertex_ai-language-models)
gemini-live-2.5-flash-preview-native-audio-09-2025 (vertex_ai-language-models)
gemini-live-2.5-flash-preview-native-audio-09-2025 (Google)
gemini-2.5-flash-lite-preview-06-17 (vertex_ai-language-models)
gemini-2.5-pro (vertex_ai-language-models)
gemini-3-pro-preview (vertex_ai-language-models)
gemini-3.1-pro-preview (vertex_ai-language-models)
gemini-3.1-pro-preview-customtools (vertex_ai-language-models)
gemini-3-pro-preview (Vertex AI)
gemini-3-flash-preview (Vertex AI)
gemini-3.5-flash (Vertex AI)
gemini-3.1-pro-preview (Vertex AI)
gemini-3.1-pro-preview-customtools (Vertex AI)
gemini-2.5-pro-preview-tts (vertex_ai-language-models)
gemini-robotics-er-1.5-preview (vertex_ai-language-models)
gemini-robotics-er-1.5-preview (Google)
gemini-2.5-computer-use-preview-10-2025 (vertex_ai-language-models)
gemini-embedding-001 (vertex_ai-embedding-models)
gemini-embedding-2-preview (vertex_ai-embedding-models)
gemini-embedding-2 (vertex_ai-embedding-models)
gemini-embedding-2-preview (Vertex AI)
gemini-embedding-2 (Vertex AI)
gemini-embedding-001 (Google)
gemini-embedding-2-preview (Google)
gemini-embedding-2 (Google)
gemini-1.5-flash (Google)
gemini-2.0-flash (Google)
gemini-2.0-flash-001 (Google)
gemini-2.0-flash-lite (Google)
Gemini 2.5 Flash (Google)
gemini-2.5-flash-image (Google)
gemini-3-pro-image-preview (Google)
gemini-3.1-flash-image-preview (Google)
deep-research-pro-preview-12-2025 (Google)
gemini-2.5-flash-lite (Google)
gemini-2.5-flash-lite-preview-09-2025 (Google)
gemini-2.5-flash-preview-09-2025 (Google)
gemini-flash-latest (Google)
gemini-flash-lite-latest (Google)
gemini-2.5-flash-lite-preview-06-17 (Google)
gemini-2.5-flash-preview-tts (Google)
gemini-2.5-pro (Google)
gemini-2.5-computer-use-preview-10-2025 (Google)
gemini-3-pro-preview (Google)
gemini-3.1-flash-lite-preview (Google)
gemini-3.1-flash-lite (Google)
gemini-3-flash-preview (Google)
gemini-3.5-flash (Google)
gemini-3.1-pro-preview (Google)
gemini-3.1-pro-preview-customtools (Google)
gemini-3-flash-preview (vertex_ai-language-models)
gemini-3.5-flash (vertex_ai-language-models)
gemini-2.5-pro-preview-tts (Google)
gemini-gemma-2-27b-it (Google)
gemini-gemma-2-9b-it (Google)
claude-opus-4.5 (gmi)
claude-sonnet-4.5 (gmi)
claude-sonnet-4 (gmi)
claude-opus-4 (gmi)
gpt-5.2 (gmi)
gpt-5.1 (gmi)
gpt-5 (gmi)
gpt-4o (gmi)
gpt-4o-mini (gmi)
DeepSeek-V3.2 (gmi)
DeepSeek-V3-0324 (gmi)
gemini-3-pro-preview (gmi)
gemini-3-flash-preview (gmi)
Kimi-K2-Thinking (gmi)
MiniMax-M2.1 (gmi)
MiniMax-M2.5 (baseten)
Nemotron-120B-A12B (baseten)
GLM-5 (baseten)
GLM-4.7 (baseten)
GLM-4.6 (baseten)
Kimi-K2.5 (baseten)
Kimi-K2-Thinking (baseten)
Kimi-K2-Instruct-0905 (baseten)
gpt-oss-120b (baseten)
DeepSeek-V3.1 (baseten)
DeepSeek-V3-0324 (baseten)
Qwen3-VL-235B-A22B-Instruct-FP8 (gmi)
GLM-4.7-FP8 (gmi)
google.gemma-3-12b-it (bedrock_converse)
google.gemma-3-27b-it (bedrock_converse)
google.gemma-3-4b-it (bedrock_converse)
global.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
global.anthropic.claude-sonnet-4-20250514-v1:0 (bedrock_converse)
global.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
global.amazon.nova-2-lite-v1:0 (bedrock_converse)
gpt-3.5-turbo (OpenAI)
gpt-3.5-turbo-0125 (OpenAI)
gpt-3.5-turbo-1106 (OpenAI)
gpt-3.5-turbo-16k (OpenAI)
gpt-3.5-turbo-instruct (text-completion-openai)
gpt-3.5-turbo-instruct-0914 (text-completion-openai)
gpt-4 (OpenAI)
gpt-4-0125-preview (OpenAI)
gpt-4-0314 (OpenAI)
gpt-4-0613 (OpenAI)
gpt-4-1106-preview (OpenAI)
gpt-4-turbo (OpenAI)
gpt-4-turbo-2024-04-09 (OpenAI)
gpt-4-turbo-preview (OpenAI)
GPT-4.1 (OpenAI)
gpt-4.1-2025-04-14 (OpenAI)
GPT-4.1 mini (OpenAI)
gpt-4.1-mini-2025-04-14 (OpenAI)
gpt-4.1-nano (OpenAI)
gpt-4.1-nano-2025-04-14 (OpenAI)
gpt-4o (OpenAI)
gpt-4o-2024-05-13 (OpenAI)
gpt-4o-2024-08-06 (OpenAI)
gpt-4o-2024-11-20 (OpenAI)
gpt-4o-audio-preview (OpenAI)
gpt-4o-audio-preview-2024-12-17 (OpenAI)
gpt-4o-audio-preview-2025-06-03 (OpenAI)
gpt-audio (OpenAI)
gpt-audio-1.5 (OpenAI)
gpt-audio-2025-08-28 (OpenAI)
gpt-audio-mini (OpenAI)
gpt-audio-mini-2025-10-06 (OpenAI)
gpt-audio-mini-2025-12-15 (OpenAI)
gpt-4o-mini (OpenAI)
gpt-4o-mini-2024-07-18 (OpenAI)
gpt-4o-mini-audio-preview (OpenAI)
gpt-4o-mini-audio-preview-2024-12-17 (OpenAI)
gpt-4o-mini-realtime-preview (OpenAI)
gpt-4o-mini-realtime-preview-2024-12-17 (OpenAI)
gpt-4o-mini-search-preview (OpenAI)
gpt-4o-mini-search-preview-2025-03-11 (OpenAI)
gpt-4o-mini-transcribe (OpenAI)
gpt-4o-mini-tts (OpenAI)
gpt-4o-realtime-preview (OpenAI)
gpt-4o-realtime-preview-2024-12-17 (OpenAI)
gpt-4o-realtime-preview-2025-06-03 (OpenAI)
gpt-4o-search-preview (OpenAI)
gpt-4o-search-preview-2025-03-11 (OpenAI)
gpt-4o-transcribe (OpenAI)
gpt-image-1.5 (OpenAI)
gpt-image-1.5-2025-12-16 (OpenAI)
gpt-image-2 (OpenAI)
gpt-image-2-2026-04-21 (OpenAI)
gpt-5 (OpenAI)
gpt-5.1 (OpenAI)
gpt-5.1-2025-11-13 (OpenAI)
gpt-5.1-chat-latest (OpenAI)
gpt-5.2 (OpenAI)
gpt-5.2-2025-12-11 (OpenAI)
gpt-5.2-chat-latest (OpenAI)
gpt-5.3-chat-latest (OpenAI)
gpt-5.2-pro (OpenAI)
gpt-5.2-pro-2025-12-11 (OpenAI)
gpt-5.5 (OpenAI)
gpt-5.5-2026-04-23 (OpenAI)
gpt-5.5-pro (OpenAI)
gpt-5.5-pro-2026-04-23 (OpenAI)
gpt-5.4 (OpenAI)
gpt-5.4-2026-03-05 (OpenAI)
gpt-5.4-pro (OpenAI)
gpt-5.4-pro-2026-03-05 (OpenAI)
gpt-5.4-mini (OpenAI)
gpt-5.4-mini-2026-03-17 (OpenAI)
gpt-5.4-nano (OpenAI)
gpt-5.4-nano-2026-03-17 (OpenAI)
gpt-5-pro (OpenAI)
gpt-5-pro-2025-10-06 (OpenAI)
gpt-5-2025-08-07 (OpenAI)
gpt-5-chat (OpenAI)
gpt-5-chat-latest (OpenAI)
gpt-5-codex (OpenAI)
gpt-5.1-codex (OpenAI)
gpt-5.1-codex-max (OpenAI)
gpt-5.1-codex-mini (OpenAI)
gpt-5.2-codex (OpenAI)
gpt-5.3-codex (OpenAI)
gpt-5-mini (OpenAI)
gpt-5-mini-2025-08-07 (OpenAI)
gpt-5-nano (OpenAI)
gpt-5-nano-2025-08-07 (OpenAI)
gpt-image-1 (OpenAI)
gpt-image-1-mini (OpenAI)
gpt-realtime (OpenAI)
gpt-realtime-1.5 (OpenAI)
gpt-realtime-2 (OpenAI)
gpt-realtime-mini (OpenAI)
gpt-realtime-2025-08-28 (OpenAI)
anthropic-claude-3-opus (gradient_ai)
anthropic-claude-3.5-haiku (gradient_ai)
anthropic-claude-3.5-sonnet (gradient_ai)
anthropic-claude-3.7-sonnet (gradient_ai)
deepseek-r1-distill-llama-70b (gradient_ai)
llama3-8b-instruct (gradient_ai)
llama3.3-70b-instruct (gradient_ai)
mistral-nemo-instruct-2407 (gradient_ai)
openai-o3 (gradient_ai)
openai-o3-mini (gradient_ai)
nova-micro-v1 (amazon_nova)
nova-lite-v1 (amazon_nova)
nova-premier-v1 (amazon_nova)
nova-pro-v1 (amazon_nova)
llama-3.1-8b-instant (Groq)
llama-3.3-70b-versatile (Groq)
gemma-7b-it (Groq)
llama-guard-4-12b (Groq)
llama-4-maverick-17b-128e-instruct (Groq)
llama-4-scout-17b-16e-instruct (Groq)
kimi-k2-instruct-0905 (Groq)
gpt-oss-120b (Groq)
gpt-oss-20b (Groq)
gpt-oss-safeguard-20b (Groq)
qwen3-32b (Groq)
Hermes-3-Llama-3.1-70B (hyperbolic)
QwQ-32B (hyperbolic)
Qwen2.5-72B-Instruct (hyperbolic)
Qwen2.5-Coder-32B-Instruct (hyperbolic)
Qwen3-235B-A22B (hyperbolic)
DeepSeek-R1 (hyperbolic)
DeepSeek-R1-0528 (hyperbolic)
DeepSeek-V3 (hyperbolic)
DeepSeek-V3-0324 (hyperbolic)
Llama-3.2-3B-Instruct (hyperbolic)
Llama-3.3-70B-Instruct (hyperbolic)
Meta-Llama-3-70B-Instruct (hyperbolic)
Meta-Llama-3.1-405B-Instruct (hyperbolic)
Meta-Llama-3.1-70B-Instruct (hyperbolic)
Meta-Llama-3.1-8B-Instruct (hyperbolic)
Kimi-K2-Instruct (hyperbolic)
j2-light (ai21)
j2-mid (ai21)
j2-ultra (ai21)
jamba-1.5 (ai21)
jamba-1.5-large (ai21)
jamba-1.5-large@001 (ai21)
jamba-1.5-mini (ai21)
jamba-1.5-mini@001 (ai21)
jamba-large-1.6 (ai21)
jamba-large-1.7 (ai21)
jamba-mini-1.6 (ai21)
jamba-mini-1.7 (ai21)
jina-reranker-v2-base-multilingual (jina_ai)
jp.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
jp.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
DeepSeek-R1-0528 (crusoe)
DeepSeek-V3-0324 (crusoe)
gemma-3-12b-it (crusoe)
Llama-3.3-70B-Instruct (crusoe)
Kimi-K2-Thinking (crusoe)
gpt-oss-120b (crusoe)
Qwen3-235B-A22B-Instruct-2507 (crusoe)
deepseek-llama3.3-70b (lambda_ai)
deepseek-r1-0528 (lambda_ai)
deepseek-r1-671b (lambda_ai)
deepseek-v3-0324 (lambda_ai)
hermes3-405b (lambda_ai)
hermes3-70b (lambda_ai)
hermes3-8b (lambda_ai)
lfm-40b (lambda_ai)
lfm-7b (lambda_ai)
llama-4-maverick-17b-128e-instruct-fp8 (lambda_ai)
llama-4-scout-17b-16e-instruct (lambda_ai)
llama3.1-405b-instruct-fp8 (lambda_ai)
llama3.1-70b-instruct-fp8 (lambda_ai)
llama3.1-8b-instruct (lambda_ai)
llama3.1-nemotron-70b-instruct-fp8 (lambda_ai)
llama3.2-11b-vision-instruct (lambda_ai)
llama3.2-3b-instruct (lambda_ai)
llama3.3-70b-instruct-fp8 (lambda_ai)
qwen25-coder-32b-instruct (lambda_ai)
qwen3-32b-fp8 (lambda_ai)
meta.llama2-13b-chat-v1 (Bedrock)
meta.llama2-70b-chat-v1 (Bedrock)
meta.llama3-1-405b-instruct-v1:0 (Bedrock)
meta.llama3-1-70b-instruct-v1:0 (Bedrock)
meta.llama3-1-8b-instruct-v1:0 (Bedrock)
meta.llama3-2-11b-instruct-v1:0 (Bedrock)
meta.llama3-2-1b-instruct-v1:0 (Bedrock)
meta.llama3-2-3b-instruct-v1:0 (Bedrock)
meta.llama3-2-90b-instruct-v1:0 (Bedrock)
meta.llama3-3-70b-instruct-v1:0 (bedrock_converse)
meta.llama3-70b-instruct-v1:0 (Bedrock)
meta.llama3-8b-instruct-v1:0 (Bedrock)
meta.llama4-maverick-17b-instruct-v1:0 (bedrock_converse)
meta.llama4-scout-17b-instruct-v1:0 (bedrock_converse)
minimax.minimax-m2 (bedrock_converse)
minimax.minimax-m2.1 (bedrock_converse)
minimax.minimax-m2.5 (bedrock_converse)
MiniMax-M2.1 (minimax)
MiniMax-M2.1-lightning (minimax)
MiniMax-M2.5 (minimax)
MiniMax-M2.5-lightning (minimax)
MiniMax-M2 (minimax)
mistral.devstral-2-123b (bedrock_converse)
mistral.magistral-small-2509 (bedrock_converse)
mistral.ministral-3-14b-instruct (bedrock_converse)
mistral.ministral-3-3b-instruct (bedrock_converse)
mistral.ministral-3-8b-instruct (bedrock_converse)
mistral.mistral-7b-instruct-v0:2 (Bedrock)
mistral.mistral-large-2402-v1:0 (Bedrock)
mistral.mistral-large-2407-v1:0 (Bedrock)
mistral.mistral-large-3-675b-instruct (bedrock_converse)
mistral.mistral-small-2402-v1:0 (Bedrock)
mistral.mixtral-8x7b-instruct-v0:1 (Bedrock)
mistral.voxtral-mini-3b-2507 (bedrock_converse)
mistral.voxtral-small-24b-2507 (bedrock_converse)
codestral-2405 (Mistral)
codestral-2508 (Mistral)
codestral-latest (Mistral)
codestral-mamba-latest (Mistral)
devstral-medium-2507 (Mistral)
devstral-small-2505 (Mistral)
devstral-small-2507 (Mistral)
devstral-small-latest (Mistral)
labs-devstral-small-2512 (Mistral)
devstral-latest (Mistral)
devstral-medium-latest (Mistral)
devstral-2512 (Mistral)
magistral-medium-2506 (Mistral)
magistral-medium-2509 (Mistral)
magistral-medium-1-2-2509 (Mistral)
magistral-medium-latest (Mistral)
magistral-small-2506 (Mistral)
magistral-small-latest (Mistral)
magistral-small-1-2-2509 (Mistral)
mistral-embed (Mistral)
codestral-embed (Mistral)
codestral-embed-2505 (Mistral)
mistral-large-2402 (Mistral)
mistral-large-2407 (Mistral)
mistral-large-2411 (Mistral)
Mistral Large (Mistral)
mistral-large-3 (Mistral)
mistral-large-2512 (Mistral)
mistral-medium (Mistral)
mistral-medium-2312 (Mistral)
mistral-medium-2505 (Mistral)
mistral-medium-latest (Mistral)
mistral-medium-3-1-2508 (Mistral)
mistral-small (Mistral)
mistral-small-latest (Mistral)
mistral-small-3-2-2506 (Mistral)
ministral-3-3b-2512 (Mistral)
ministral-3-8b-2512 (Mistral)
ministral-3-14b-2512 (Mistral)
ministral-8b-2512 (Mistral)
mistral-tiny (Mistral)
open-codestral-mamba (Mistral)
open-mistral-7b (Mistral)
open-mistral-nemo (Mistral)
open-mistral-nemo-2407 (Mistral)
open-mixtral-8x22b (Mistral)
open-mixtral-8x7b (Mistral)
pixtral-12b-2409 (Mistral)
pixtral-large-2411 (Mistral)
pixtral-large-latest (Mistral)
moonshot.kimi-k2-thinking (bedrock_converse)
moonshotai.kimi-k2.5 (bedrock_converse)
kimi-k2-0711-preview (moonshot)
kimi-k2-0905-preview (moonshot)
kimi-k2-turbo-preview (moonshot)
kimi-k2.5 (moonshot)
kimi-k2.6 (moonshot)
kimi-latest (moonshot)
kimi-latest-128k (moonshot)
kimi-latest-32k (moonshot)
kimi-latest-8k (moonshot)
kimi-thinking-preview (moonshot)
kimi-k2-thinking (moonshot)
kimi-k2-thinking-turbo (moonshot)
moonshot-v1-128k (moonshot)
moonshot-v1-128k-0430 (moonshot)
moonshot-v1-128k-vision-preview (moonshot)
moonshot-v1-32k (moonshot)
moonshot-v1-32k-0430 (moonshot)
moonshot-v1-32k-vision-preview (moonshot)
moonshot-v1-8k (moonshot)
moonshot-v1-8k-0430 (moonshot)
moonshot-v1-8k-vision-preview (moonshot)
moonshot-v1-auto (moonshot)
morph-v3-fast (morph)
morph-v3-large (morph)
multimodalembedding (vertex_ai-embedding-models)
multimodalembedding@001 (vertex_ai-embedding-models)
QwQ-32B (nscale)
Qwen2.5-Coder-32B-Instruct (nscale)
Qwen2.5-Coder-3B-Instruct (nscale)
Qwen2.5-Coder-7B-Instruct (nscale)
DeepSeek-R1-Distill-Llama-70B (nscale)
DeepSeek-R1-Distill-Llama-8B (nscale)
DeepSeek-R1-Distill-Qwen-1.5B (nscale)
DeepSeek-R1-Distill-Qwen-14B (nscale)
DeepSeek-R1-Distill-Qwen-32B (nscale)
DeepSeek-R1-Distill-Qwen-7B (nscale)
Llama-3.1-8B-Instruct (nscale)
Llama-3.3-70B-Instruct (nscale)
Llama-4-Scout-17B-16E-Instruct (nscale)
mixtral-8x22b-instruct-v0.1 (nscale)
DeepSeek-R1 (nebius)
DeepSeek-R1-0528 (nebius)
DeepSeek-R1-Distill-Llama-70B (nebius)
DeepSeek-V3 (nebius)
DeepSeek-V3-0324 (nebius)
gemma-3-27b-it (nebius)
Llama-3.3-70B-Instruct (nebius)
Llama-Guard-3-8B (nebius)
Meta-Llama-3.1-8B-Instruct (nebius)
Meta-Llama-3.1-70B-Instruct (nebius)
Meta-Llama-3.1-405B-Instruct (nebius)
Mistral-Nemo-Instruct-2407 (nebius)
Hermes-3-Llama-3.1-405B (nebius)
Llama-3.1-Nemotron-Ultra-253B-v1 (nebius)
Llama-3.3-Nemotron-Super-49B-v1 (nebius)
Qwen3-235B-A22B (nebius)
Qwen3-32B (nebius)
Qwen3-30B-A3B (nebius)
Qwen3-14B (nebius)
Qwen3-4B (nebius)
QwQ-32B (nebius)
Qwen2.5-72B-Instruct (nebius)
Qwen2.5-32B-Instruct (nebius)
Qwen2.5-Coder-7B (nebius)
Qwen2.5-VL-72B-Instruct (nebius)
Qwen2-VL-72B-Instruct (nebius)
Qwen2-VL-7B-Instruct (nebius)
bge-en-icl (nebius)
bge-multilingual-gemma2 (nebius)
e5-mistral-7b-instruct (nebius)
nvidia.nemotron-nano-12b-v2 (bedrock_converse)
nvidia.nemotron-nano-9b-v2 (bedrock_converse)
nvidia.nemotron-nano-3-30b (bedrock_converse)
nvidia.nemotron-super-3-120b (bedrock_converse)
o1 (OpenAI)
o1-2024-12-17 (OpenAI)
o1-pro (OpenAI)
o1-pro-2025-03-19 (OpenAI)
o3 (OpenAI)
o3-2025-04-16 (OpenAI)
o3-deep-research (OpenAI)
o3-deep-research-2025-06-26 (OpenAI)
o3-mini (OpenAI)
o3-mini-2025-01-31 (OpenAI)
o3-pro (OpenAI)
o3-pro-2025-06-10 (OpenAI)
o4-mini (OpenAI)
o4-mini-2025-04-16 (OpenAI)
o4-mini-deep-research (OpenAI)
o4-mini-deep-research-2025-06-26 (OpenAI)
meta.llama-3.1-405b-instruct (oci)
meta.llama-3.2-90b-vision-instruct (oci)
meta.llama-3.3-70b-instruct (oci)
meta.llama-4-maverick-17b-128e-instruct-fp8 (oci)
meta.llama-4-scout-17b-16e-instruct (oci)
xai.grok-3 (oci)
xai.grok-3-fast (oci)
xai.grok-3-mini (oci)
xai.grok-3-mini-fast (oci)
xai.grok-4 (oci)
cohere.command-latest (oci)
cohere.command-a-03-2025 (oci)
cohere.command-plus-latest (oci)
cohere.command-a-reasoning-08-2025 (oci)
cohere.command-a-vision-07-2025 (oci)
cohere.command-a-translate-08-2025 (oci)
cohere.command-r-08-2024 (oci)
cohere.command-r-plus-08-2024 (oci)
meta.llama-3.2-11b-vision-instruct (oci)
meta.llama-3.1-70b-instruct (oci)
meta.llama-3.3-70b-instruct-fp8-dynamic (oci)
xai.grok-4-fast (oci)
xai.grok-4.1-fast (oci)
xai.grok-4.20 (oci)
xai.grok-4.20-multi-agent (oci)
xai.grok-code-fast-1 (oci)
google.gemini-2.5-pro (oci)
google.gemini-2.5-flash (oci)
google.gemini-2.5-flash-lite (oci)
cohere.embed-english-v3.0 (oci)
cohere.embed-english-light-v3.0 (oci)
cohere.embed-multilingual-v3.0 (oci)
cohere.embed-multilingual-light-v3.0 (oci)
cohere.embed-english-image-v3.0 (oci)
cohere.embed-english-light-image-v3.0 (oci)
cohere.embed-multilingual-light-image-v3.0 (oci)
cohere.embed-v4.0 (oci)
openai.gpt-oss-120b-1:0 (bedrock_converse)
openai.gpt-oss-20b-1:0 (bedrock_converse)
openai.gpt-oss-safeguard-120b (bedrock_converse)
openai.gpt-oss-safeguard-20b (bedrock_converse)
claude-3-haiku (OpenRouter)
claude-3.5-sonnet (OpenRouter)
claude-3.7-sonnet (OpenRouter)
claude-opus-4 (OpenRouter)
claude-opus-4.1 (OpenRouter)
claude-sonnet-4 (OpenRouter)
claude-sonnet-4.6 (OpenRouter)
claude-opus-4.5 (OpenRouter)
claude-opus-4.6 (OpenRouter)
claude-sonnet-4.5 (OpenRouter)
claude-haiku-4.5 (OpenRouter)
claude-opus-4.7 (OpenRouter)
ui-tars-1.5-7b (OpenRouter)
deepseek-chat (OpenRouter)
deepseek-chat-v3-0324 (OpenRouter)
deepseek-chat-v3.1 (OpenRouter)
deepseek-v3.2 (OpenRouter)
deepseek-v3.2-exp (OpenRouter)
deepseek-r1 (OpenRouter)
deepseek-r1-0528 (OpenRouter)
gemini-2.0-flash-001 (OpenRouter)
gemini-2.5-flash (OpenRouter)
gemini-2.5-pro (OpenRouter)
gemini-3-pro-preview (OpenRouter)
gemini-3-flash-preview (OpenRouter)
gemini-3.1-flash-lite-preview (OpenRouter)
gemini-3.1-flash-lite (OpenRouter)
gemini-3.1-pro-preview (OpenRouter)
mythomax-l2-13b (OpenRouter)
weaver (OpenRouter)
llama-3-70b-instruct (OpenRouter)
minimax-m2 (OpenRouter)
devstral-2512 (OpenRouter)
ministral-3b-2512 (OpenRouter)
ministral-8b-2512 (OpenRouter)
ministral-14b-2512 (OpenRouter)
mistral-large-2512 (OpenRouter)
mistral-7b-instruct (OpenRouter)
mistral-large (OpenRouter)
mistral-small-3.1-24b-instruct (OpenRouter)
mistral-small-3.2-24b-instruct (OpenRouter)
mixtral-8x22b-instruct (OpenRouter)
kimi-k2.5 (OpenRouter)
gpt-3.5-turbo (OpenRouter)
gpt-3.5-turbo-16k (OpenRouter)
gpt-4 (OpenRouter)
gpt-4.1 (OpenRouter)
gpt-4.1-mini (OpenRouter)
gpt-4.1-nano (OpenRouter)
gpt-4o (OpenRouter)
gpt-4o-2024-05-13 (OpenRouter)
gpt-5-chat (OpenRouter)
gpt-5-codex (OpenRouter)
gpt-5.2-codex (OpenRouter)
gpt-5 (OpenRouter)
gpt-5-mini (OpenRouter)
gpt-5-nano (OpenRouter)
gpt-5.1-codex-max (OpenRouter)
gpt-5.2 (OpenRouter)
gpt-5.2-chat (OpenRouter)
gpt-5.2-pro (OpenRouter)
gpt-oss-120b (OpenRouter)
gpt-oss-20b (OpenRouter)
openai (OpenRouter)
o3-mini (OpenRouter)
o3-mini-high (OpenRouter)
qwen-2.5-coder-32b-instruct (OpenRouter)
qwen-vl-plus (OpenRouter)
qwen3-coder (OpenRouter)
qwen3-coder-plus (OpenRouter)
qwen3-235b-a22b-2507 (OpenRouter)
qwen3-235b-a22b-thinking-2507 (OpenRouter)
qwen3.6-plus (OpenRouter)
qwen3.5-35b-a3b (OpenRouter)
qwen3.5-27b (OpenRouter)
qwen3.5-122b-a10b (OpenRouter)
qwen3.5-flash-02-23 (OpenRouter)
qwen3.5-plus-02-15 (OpenRouter)
qwen3.5-397b-a17b (OpenRouter)
router (OpenRouter)
remm-slerp-l2-13b (OpenRouter)
grok-4 (OpenRouter)
glm-4.6 (OpenRouter)
glm-4.6:exacto (OpenRouter)
mimo-v2-flash (OpenRouter)
mimo-v2.5-pro (OpenRouter)
mimo-v2.5 (OpenRouter)
glm-4.7 (OpenRouter)
glm-4.7-flash (OpenRouter)
glm-5 (OpenRouter)
minimax-m2.1 (OpenRouter)
minimax-m2.5 (OpenRouter)
DeepSeek-R1-Distill-Llama-70B (ovhcloud)
Llama-3.1-8B-Instruct (ovhcloud)
Meta-Llama-3_1-70B-Instruct (ovhcloud)
Meta-Llama-3_3-70B-Instruct (ovhcloud)
Mistral-7B-Instruct-v0.3 (ovhcloud)
Mistral-Nemo-Instruct-2407 (ovhcloud)
Mistral-Small-3.2-24B-Instruct-2506 (ovhcloud)
Mixtral-8x7B-Instruct-v0.1 (ovhcloud)
Qwen2.5-Coder-32B-Instruct (ovhcloud)
Qwen2.5-VL-72B-Instruct (ovhcloud)
Qwen3-32B (ovhcloud)
gpt-oss-120b (ovhcloud)
gpt-oss-20b (ovhcloud)
llava-v1.6-mistral-7b-hf (ovhcloud)
mamba-codestral-7B-v0.1 (ovhcloud)
chat-bison (palm)
chat-bison-001 (palm)
text-bison (palm)
text-bison-001 (palm)
text-bison-safety-off (palm)
text-bison-safety-recitation-off (palm)
codellama-34b-instruct (perplexity)
codellama-70b-instruct (perplexity)
llama-2-70b-chat (perplexity)
llama-3.1-70b-instruct (perplexity)
llama-3.1-8b-instruct (perplexity)
mistral-7b-instruct (perplexity)
mixtral-8x7b-instruct (perplexity)
pplx-70b-chat (perplexity)
pplx-70b-online (perplexity)
pplx-7b-chat (perplexity)
pplx-7b-online (perplexity)
sonar (perplexity)
sonar-deep-research (perplexity)
sonar-medium-chat (perplexity)
sonar-medium-online (perplexity)
sonar-pro (perplexity)
sonar-reasoning (perplexity)
sonar-reasoning-pro (perplexity)
sonar-small-chat (perplexity)
sonar-small-online (perplexity)
pplx-embed-v1-0.6b (perplexity)
pplx-embed-v1-4b (perplexity)
qwen.qwen3-coder-480b-a35b-v1:0 (bedrock_converse)
qwen.qwen3-235b-a22b-2507-v1:0 (bedrock_converse)
qwen.qwen3-coder-30b-a3b-v1:0 (bedrock_converse)
qwen.qwen3-32b-v1:0 (bedrock_converse)
qwen.qwen3-next-80b-a3b (bedrock_converse)
qwen.qwen3-vl-235b-a22b (bedrock_converse)
qwen.qwen3-coder-next (bedrock_converse)
llama-2-13b (replicate)
llama-2-13b-chat (replicate)
llama-2-70b (replicate)
llama-2-70b-chat (replicate)
llama-2-7b (replicate)
llama-2-7b-chat (replicate)
llama-3-70b (replicate)
llama-3-70b-instruct (replicate)
llama-3-8b (replicate)
llama-3-8b-instruct (replicate)
mistral-7b-instruct-v0.2 (replicate)
mistral-7b-v0.1 (replicate)
mixtral-8x7b-instruct-v0.1 (replicate)
gpt-5 (replicate)
gpt-oss-20b (replicate)
claude-4.5-haiku (replicate)
granite-3.3-8b-instruct (replicate)
gpt-4o (replicate)
o4-mini (replicate)
o1-mini (replicate)
openai (replicate)
gpt-4o-mini (replicate)
qwen3-235b-a22b-instruct-2507 (replicate)
claude-4-sonnet (replicate)
deepseek-v3 (replicate)
claude-3.7-sonnet (replicate)
claude-3.5-haiku (replicate)
claude-3.5-sonnet (replicate)
gemini-3-pro (replicate)
claude-4.5-sonnet (replicate)
gpt-4.1 (replicate)
gpt-4.1-nano (replicate)
gpt-4.1-mini (replicate)
gpt-5-nano (replicate)
gpt-5-mini (replicate)
gemini-2.5-flash (replicate)
gpt-oss-120b (replicate)
deepseek-v3.1 (replicate)
grok-4 (replicate)
deepseek-r1 (replicate)
MiniMax-M2.7 (sambanova)
DeepSeek-R1 (sambanova)
DeepSeek-R1-Distill-Llama-70B (sambanova)
DeepSeek-V3-0324 (sambanova)
Llama-4-Maverick-17B-128E-Instruct (sambanova)
Llama-4-Scout-17B-16E-Instruct (sambanova)
Meta-Llama-3.1-405B-Instruct (sambanova)
Meta-Llama-3.1-8B-Instruct (sambanova)
Meta-Llama-3.2-1B-Instruct (sambanova)
Meta-Llama-3.2-3B-Instruct (sambanova)
Meta-Llama-3.3-70B-Instruct (sambanova)
Meta-Llama-Guard-3-8B (sambanova)
QwQ-32B (sambanova)
Qwen2-Audio-7B-Instruct (sambanova)
Qwen3-32B (sambanova)
DeepSeek-V3.1 (sambanova)
gpt-oss-120b (sambanova)
text-embedding-004 (vertex_ai-embedding-models)
text-embedding-005 (vertex_ai-embedding-models)
text-embedding-3-large (OpenAI)
text-embedding-3-small (OpenAI)
text-embedding-ada-002 (OpenAI)
text-embedding-ada-002-v2 (OpenAI)
text-embedding-large-exp-03-07 (vertex_ai-embedding-models)
text-embedding-preview-0409 (vertex_ai-embedding-models)
text-multilingual-embedding-002 (vertex_ai-embedding-models)
text-unicorn (vertex_ai-text-models)
text-unicorn@001 (vertex_ai-text-models)
together-ai-21.1b-41b (Together AI)
together-ai-4.1b-8b (Together AI)
together-ai-41.1b-80b (Together AI)
together-ai-8.1b-21b (Together AI)
together-ai-81.1b-110b (Together AI)
together-ai-embedding-151m-to-350m (Together AI)
together-ai-embedding-up-to-150m (Together AI)
bge-base-en-v1.5 (Together AI)
bge-base-en-v1.5 (Together AI)
together-ai-up-to-4b (Together AI)
Qwen3-235B-A22B-Instruct-2507-tput (Together AI)
Qwen3-235B-A22B-Thinking-2507 (Together AI)
Qwen3-235B-A22B-fp8-tput (Together AI)
Qwen3-Coder-480B-A35B-Instruct-FP8 (Together AI)
DeepSeek-R1 (Together AI)
DeepSeek-R1-0528-tput (Together AI)
DeepSeek-V3 (Together AI)
DeepSeek-V3.1 (Together AI)
Llama-3.3-70B-Instruct-Turbo (Together AI)
Llama-4-Maverick-17B-128E-Instruct-FP8 (Together AI)
Llama-4-Scout-17B-16E-Instruct (Together AI)
Meta-Llama-3.1-405B-Instruct-Turbo (Together AI)
Meta-Llama-3.1-70B-Instruct-Turbo (Together AI)
Meta-Llama-3.1-8B-Instruct-Turbo (Together AI)
Mixtral-8x7B-Instruct-v0.1 (Together AI)
Kimi-K2-Instruct (Together AI)
gpt-oss-120b (Together AI)
gpt-oss-20b (Together AI)
GLM-4.5-Air-FP8 (Together AI)
GLM-4.6 (Together AI)
GLM-4.7 (Together AI)
Kimi-K2.5 (Together AI)
Kimi-K2-Instruct-0905 (Together AI)
Qwen3-Next-80B-A3B-Instruct (Together AI)
Qwen3-Next-80B-A3B-Thinking (Together AI)
Qwen3.5-397B-A17B (Together AI)
us.amazon.nova-lite-v1:0 (bedrock_converse)
us.amazon.nova-micro-v1:0 (bedrock_converse)
us.amazon.nova-premier-v1:0 (bedrock_converse)
us.amazon.nova-pro-v1:0 (bedrock_converse)
us.anthropic.claude-3-5-haiku-20241022-v1:0 (Bedrock)
us.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
us.anthropic.claude-3-5-sonnet-20240620-v1:0 (Bedrock)
us.anthropic.claude-3-5-sonnet-20241022-v2:0 (Bedrock)
us.anthropic.claude-3-7-sonnet-20250219-v1:0 (bedrock_converse)
us.anthropic.claude-3-haiku-20240307-v1:0 (Bedrock)
us.anthropic.claude-3-opus-20240229-v1:0 (Bedrock)
us.anthropic.claude-3-sonnet-20240229-v1:0 (Bedrock)
us.anthropic.claude-opus-4-1-20250805-v1:0 (bedrock_converse)
us.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
us-gov.anthropic.claude-sonnet-4-5-20250929-v1:0 (bedrock_converse)
au.anthropic.claude-haiku-4-5-20251001-v1:0 (bedrock_converse)
us.anthropic.claude-opus-4-20250514-v1:0 (bedrock_converse)
us.anthropic.claude-opus-4-5-20251101-v1:0 (bedrock_converse)
global.anthropic.claude-opus-4-5-20251101-v1:0 (bedrock_converse)
eu.anthropic.claude-opus-4-5-20251101-v1:0 (bedrock_converse)
us.anthropic.claude-sonnet-4-20250514-v1:0 (bedrock_converse)
us.deepseek.r1-v1:0 (bedrock_converse)
us.deepseek.v3.2 (bedrock_converse)
eu.deepseek.v3.2 (bedrock_converse)
us.meta.llama3-1-405b-instruct-v1:0 (Bedrock)
us.meta.llama3-1-70b-instruct-v1:0 (Bedrock)
us.meta.llama3-1-8b-instruct-v1:0 (Bedrock)
us.meta.llama3-2-11b-instruct-v1:0 (Bedrock)
us.meta.llama3-2-1b-instruct-v1:0 (Bedrock)
us.meta.llama3-2-3b-instruct-v1:0 (Bedrock)
us.meta.llama3-2-90b-instruct-v1:0 (Bedrock)
us.meta.llama3-3-70b-instruct-v1:0 (bedrock_converse)
us.meta.llama4-maverick-17b-instruct-v1:0 (bedrock_converse)
us.meta.llama4-scout-17b-instruct-v1:0 (bedrock_converse)
us.mistral.pixtral-large-2502-v1:0 (bedrock_converse)
v0-1.0-md (v0)
v0-1.5-lg (v0)
v0-1.5-md (v0)
qwen-3-14b (vercel_ai_gateway)
qwen-3-235b (vercel_ai_gateway)
qwen-3-30b (vercel_ai_gateway)
qwen-3-32b (vercel_ai_gateway)
qwen3-coder (vercel_ai_gateway)
nova-lite (vercel_ai_gateway)
nova-micro (vercel_ai_gateway)
nova-pro (vercel_ai_gateway)
titan-embed-text-v2 (vercel_ai_gateway)
claude-3-haiku (vercel_ai_gateway)
claude-3-opus (vercel_ai_gateway)
claude-3.5-haiku (vercel_ai_gateway)
claude-3.5-sonnet (vercel_ai_gateway)
claude-3.7-sonnet (vercel_ai_gateway)
claude-4-opus (vercel_ai_gateway)
claude-4-sonnet (vercel_ai_gateway)
claude-3-5-sonnet (vercel_ai_gateway)
claude-3-5-sonnet-20241022 (vercel_ai_gateway)
claude-3-7-sonnet (vercel_ai_gateway)
claude-haiku-4.5 (vercel_ai_gateway)
claude-opus-4 (vercel_ai_gateway)
claude-opus-4.1 (vercel_ai_gateway)
claude-opus-4.5 (vercel_ai_gateway)
claude-opus-4.6 (vercel_ai_gateway)
claude-sonnet-4 (vercel_ai_gateway)
claude-sonnet-4.5 (vercel_ai_gateway)
command-a (vercel_ai_gateway)
command-r (vercel_ai_gateway)
command-r-plus (vercel_ai_gateway)
embed-v4.0 (vercel_ai_gateway)
deepseek-r1 (vercel_ai_gateway)
deepseek-r1-distill-llama-70b (vercel_ai_gateway)
deepseek-v3 (vercel_ai_gateway)
gemini-2.0-flash (vercel_ai_gateway)
gemini-2.0-flash-lite (vercel_ai_gateway)
gemini-2.5-flash (vercel_ai_gateway)
gemini-2.5-pro (vercel_ai_gateway)
gemini-embedding-001 (vercel_ai_gateway)
gemma-2-9b (vercel_ai_gateway)
text-embedding-005 (vercel_ai_gateway)
text-multilingual-embedding-002 (vercel_ai_gateway)
mercury-coder-small (vercel_ai_gateway)
llama-3-70b (vercel_ai_gateway)
llama-3-8b (vercel_ai_gateway)
llama-3.1-70b (vercel_ai_gateway)
llama-3.1-8b (vercel_ai_gateway)
llama-3.2-11b (vercel_ai_gateway)
llama-3.2-1b (vercel_ai_gateway)
llama-3.2-3b (vercel_ai_gateway)
llama-3.2-90b (vercel_ai_gateway)
llama-3.3-70b (vercel_ai_gateway)
llama-4-maverick (vercel_ai_gateway)
llama-4-scout (vercel_ai_gateway)
codestral (vercel_ai_gateway)
codestral-embed (vercel_ai_gateway)
devstral-small (vercel_ai_gateway)
magistral-medium (vercel_ai_gateway)
magistral-small (vercel_ai_gateway)
ministral-3b (vercel_ai_gateway)
ministral-8b (vercel_ai_gateway)
mistral-embed (vercel_ai_gateway)
mistral-large (vercel_ai_gateway)
mistral-saba-24b (vercel_ai_gateway)
mistral-small (vercel_ai_gateway)
mixtral-8x22b-instruct (vercel_ai_gateway)
pixtral-12b (vercel_ai_gateway)
pixtral-large (vercel_ai_gateway)
kimi-k2 (vercel_ai_gateway)
morph-v3-fast (vercel_ai_gateway)
morph-v3-large (vercel_ai_gateway)
gpt-3.5-turbo (vercel_ai_gateway)
gpt-3.5-turbo-instruct (vercel_ai_gateway)
gpt-4-turbo (vercel_ai_gateway)
gpt-4.1 (vercel_ai_gateway)
gpt-4.1-mini (vercel_ai_gateway)
gpt-4.1-nano (vercel_ai_gateway)
gpt-4o (vercel_ai_gateway)
gpt-4o-mini (vercel_ai_gateway)
openai (vercel_ai_gateway)
openai (vercel_ai_gateway)
o3-mini (vercel_ai_gateway)
o4-mini (vercel_ai_gateway)
text-embedding-3-large (vercel_ai_gateway)
text-embedding-3-small (vercel_ai_gateway)
text-embedding-ada-002 (vercel_ai_gateway)
sonar (vercel_ai_gateway)
sonar-pro (vercel_ai_gateway)
sonar-reasoning (vercel_ai_gateway)
sonar-reasoning-pro (vercel_ai_gateway)
v0-1.0-md (vercel_ai_gateway)
v0-1.5-md (vercel_ai_gateway)
grok-2 (vercel_ai_gateway)
grok-2-vision (vercel_ai_gateway)
grok-3 (vercel_ai_gateway)
grok-3-fast (vercel_ai_gateway)
grok-3-mini (vercel_ai_gateway)
grok-3-mini-fast (vercel_ai_gateway)
grok-4 (vercel_ai_gateway)
glm-4.5 (vercel_ai_gateway)
glm-4.5-air (vercel_ai_gateway)
glm-4.6 (vercel_ai_gateway)
claude-3-5-haiku (vertex_ai-anthropic_models)
claude-3-5-haiku@20241022 (vertex_ai-anthropic_models)
claude-haiku-4-5 (vertex_ai-anthropic_models)
claude-haiku-4-5@20251001 (vertex_ai-anthropic_models)
claude-3-5-sonnet (vertex_ai-anthropic_models)
claude-3-5-sonnet@20240620 (vertex_ai-anthropic_models)
claude-3-7-sonnet@20250219 (vertex_ai-anthropic_models)
claude-3-haiku (vertex_ai-anthropic_models)
claude-3-haiku@20240307 (vertex_ai-anthropic_models)
claude-3-opus (vertex_ai-anthropic_models)
claude-3-opus@20240229 (vertex_ai-anthropic_models)
claude-3-sonnet (vertex_ai-anthropic_models)
claude-3-sonnet@20240229 (vertex_ai-anthropic_models)
claude-opus-4 (vertex_ai-anthropic_models)
claude-opus-4-1 (vertex_ai-anthropic_models)
claude-opus-4-1@20250805 (vertex_ai-anthropic_models)
claude-opus-4-5 (vertex_ai-anthropic_models)
claude-opus-4-5@20251101 (vertex_ai-anthropic_models)
claude-opus-4-6 (vertex_ai-anthropic_models)
claude-opus-4-6@default (vertex_ai-anthropic_models)
claude-opus-4-7 (vertex_ai-anthropic_models)
claude-opus-4-7@default (vertex_ai-anthropic_models)
claude-sonnet-4-5 (vertex_ai-anthropic_models)
claude-sonnet-4-6 (vertex_ai-anthropic_models)
claude-sonnet-4-5@20250929 (vertex_ai-anthropic_models)
claude-opus-4@20250514 (vertex_ai-anthropic_models)
claude-sonnet-4 (vertex_ai-anthropic_models)
claude-sonnet-4@20250514 (vertex_ai-anthropic_models)
codestral-2@001 (vertex_ai-mistral_models)
codestral-2 (vertex_ai-mistral_models)
codestral-2@001 (vertex_ai-mistral_models)
codestral-2 (vertex_ai-mistral_models)
codestral-2501 (vertex_ai-mistral_models)
codestral@2405 (vertex_ai-mistral_models)
codestral@latest (vertex_ai-mistral_models)
deepseek-v3.1-maas (vertex_ai-deepseek_models)
deepseek-v3.2-maas (vertex_ai-deepseek_models)
deepseek-r1-0528-maas (vertex_ai-deepseek_models)
gemini-2.5-flash-image (vertex_ai-language-models)
gemini-3-pro-image-preview (vertex_ai-language-models)
gemini-3.1-flash-image-preview (vertex_ai-language-models)
gemini-3.1-flash-lite-preview (vertex_ai-language-models)
gemini-3.1-flash-lite (vertex_ai-language-models)
deep-research-pro-preview-12-2025 (vertex_ai-language-models)
jamba-1.5 (vertex_ai-ai21_models)
jamba-1.5-large (vertex_ai-ai21_models)
jamba-1.5-large@001 (vertex_ai-ai21_models)
jamba-1.5-mini (vertex_ai-ai21_models)
jamba-1.5-mini@001 (vertex_ai-ai21_models)
llama-3.1-405b-instruct-maas (vertex_ai-llama_models)
llama-4-maverick-17b-128e-instruct-maas (vertex_ai-llama_models)
llama-4-maverick-17b-16e-instruct-maas (vertex_ai-llama_models)
llama-4-scout-17b-128e-instruct-maas (vertex_ai-llama_models)
llama-4-scout-17b-16e-instruct-maas (vertex_ai-llama_models)
minimax-m2-maas (vertex_ai-minimax_models)
kimi-k2-thinking-maas (vertex_ai-moonshot_models)
glm-4.7-maas (vertex_ai-zai_models)
glm-5-maas (vertex_ai-zai_models)
mistral-medium-3 (vertex_ai-mistral_models)
mistral-medium-3@001 (vertex_ai-mistral_models)
mistral-medium-3 (vertex_ai-mistral_models)
mistral-medium-3@001 (vertex_ai-mistral_models)
mistral-large-2411 (vertex_ai-mistral_models)
mistral-large@2407 (vertex_ai-mistral_models)
mistral-large@2411-001 (vertex_ai-mistral_models)
mistral-large@latest (vertex_ai-mistral_models)
mistral-nemo@2407 (vertex_ai-mistral_models)
mistral-nemo@latest (vertex_ai-mistral_models)
mistral-small-2503 (vertex_ai-mistral_models)
mistral-small-2503@001 (vertex_ai-mistral_models)
deepseek-ocr-maas (Vertex AI)
gpt-oss-120b-maas (vertex_ai-openai_models)
gpt-oss-20b-maas (vertex_ai-openai_models)
grok-4.1-fast-non-reasoning (Vertex AI)
grok-4.1-fast-reasoning (Vertex AI)
grok-4.20-non-reasoning (Vertex AI)
grok-4.20-reasoning (Vertex AI)
qwen3-235b-a22b-instruct-2507-maas (vertex_ai-qwen_models)
qwen3-coder-480b-a35b-instruct-maas (vertex_ai-qwen_models)
qwen3-next-80b-a3b-instruct-maas (vertex_ai-qwen_models)
qwen3-next-80b-a3b-thinking-maas (vertex_ai-qwen_models)
rerank-2 (voyage)
rerank-2-lite (voyage)
rerank-2.5 (voyage)
rerank-2.5-lite (voyage)
voyage-2 (voyage)
voyage-3 (voyage)
voyage-3-large (voyage)
voyage-3-lite (voyage)
voyage-3.5 (voyage)
voyage-3.5-lite (voyage)
voyage-code-2 (voyage)
voyage-code-3 (voyage)
voyage-context-3 (voyage)
voyage-finance-2 (voyage)
voyage-large-2 (voyage)
voyage-law-2 (voyage)
voyage-lite-01 (voyage)
voyage-lite-02-instruct (voyage)
voyage-multimodal-3 (voyage)
gpt-oss-120b (wandb)
gpt-oss-20b (wandb)
GLM-4.5 (wandb)
Qwen3-235B-A22B-Instruct-2507 (wandb)
Qwen3-Coder-480B-A35B-Instruct (wandb)
Qwen3-235B-A22B-Thinking-2507 (wandb)
Kimi-K2-Instruct (wandb)
Kimi-K2.5 (wandb)
MiniMax-M2.5 (wandb)
Llama-3.1-8B-Instruct (wandb)
DeepSeek-V3.1 (wandb)
DeepSeek-R1-0528 (wandb)
DeepSeek-V3-0324 (wandb)
Llama-3.3-70B-Instruct (wandb)
Llama-4-Scout-17B-16E-Instruct (wandb)
Phi-4-mini-instruct (wandb)
granite-3-8b-instruct (watsonx)
mistral-large (watsonx)
mt0-xxl-13b (watsonx)
jais-13b-chat (watsonx)
flan-t5-xl-3b (watsonx)
granite-13b-chat-v2 (watsonx)
granite-13b-instruct-v2 (watsonx)
granite-3-3-8b-instruct (watsonx)
granite-4-h-small (watsonx)
granite-guardian-3-2-2b (watsonx)
granite-guardian-3-3-8b (watsonx)
granite-ttm-1024-96-r2 (watsonx)
granite-ttm-1536-96-r2 (watsonx)
granite-ttm-512-96-r2 (watsonx)
granite-vision-3-2-2b (watsonx)
llama-3-2-11b-vision-instruct (watsonx)
llama-3-2-1b-instruct (watsonx)
llama-3-2-3b-instruct (watsonx)
llama-3-2-90b-vision-instruct (watsonx)
llama-3-3-70b-instruct (watsonx)
llama-4-maverick-17b (watsonx)
llama-guard-3-11b-vision (watsonx)
mistral-medium-2505 (watsonx)
mistral-small-2503 (watsonx)
mistral-small-3-1-24b-instruct-2503 (watsonx)
pixtral-12b-2409 (watsonx)
gpt-oss-120b (watsonx)
allam-1-13b-instruct (watsonx)
grok-2 (xai)
grok-2-1212 (xai)
grok-2-latest (xai)
grok-2-vision (xai)
grok-2-vision-1212 (xai)
grok-2-vision-latest (xai)
grok-3 (xai)
grok-3-beta (xai)
grok-3-fast-beta (xai)
grok-3-fast-latest (xai)
grok-3-latest (xai)
grok-3-mini (xai)
grok-3-mini-beta (xai)
grok-3-mini-fast (xai)
grok-3-mini-fast-beta (xai)
grok-3-mini-fast-latest (xai)
grok-3-mini-latest (xai)
grok-4 (xai)
grok-4-fast-reasoning (xai)
grok-4-fast-non-reasoning (xai)
grok-4-0709 (xai)
grok-4-latest (xai)
grok-4-1-fast (xai)
grok-4-1-fast-reasoning (xai)
grok-4-1-fast-reasoning-latest (xai)
grok-4-1-fast-non-reasoning (xai)
grok-4-1-fast-non-reasoning-latest (xai)
grok-4.20-multi-agent-beta-0309 (xai)
grok-4.20-beta-0309-reasoning (xai)
grok-4.20-0309-reasoning (xai)
grok-4.20-beta-0309-non-reasoning (xai)
grok-4.3 (xai)
grok-4.3-latest (xai)
grok-beta (xai)
grok-code-fast (xai)
grok-code-fast-1 (xai)
grok-code-fast-1-0825 (xai)
grok-vision-beta (xai)
zai.glm-4.7 (bedrock_converse)
zai.glm-5 (bedrock_converse)
zai.glm-4.7-flash (bedrock_converse)
glm-5 (zai)
glm-5-code (zai)
glm-4.7 (zai)
glm-4.6 (zai)
glm-4.5 (zai)
glm-4.5v (zai)
glm-4.5-x (zai)
glm-4.5-air (zai)
glm-4.5-airx (zai)
glm-4-32b-0414-128k (zai)
qwen3-coder-480b-a35b-instruct (fireworks_ai)
flux-kontext-pro (fireworks_ai)
SSD-1B (fireworks_ai)
chronos-hermes-13b-v2 (fireworks_ai)
code-llama-13b (fireworks_ai)
code-llama-13b-instruct (fireworks_ai)
code-llama-13b-python (fireworks_ai)
code-llama-34b (fireworks_ai)
code-llama-34b-instruct (fireworks_ai)
code-llama-34b-python (fireworks_ai)
code-llama-70b (fireworks_ai)
code-llama-70b-instruct (fireworks_ai)
code-llama-70b-python (fireworks_ai)
code-llama-7b (fireworks_ai)
code-llama-7b-instruct (fireworks_ai)
code-llama-7b-python (fireworks_ai)
code-qwen-1p5-7b (fireworks_ai)
codegemma-2b (fireworks_ai)
codegemma-7b (fireworks_ai)
cogito-671b-v2-p1 (fireworks_ai)
cogito-v1-preview-llama-3b (fireworks_ai)
cogito-v1-preview-llama-70b (fireworks_ai)
cogito-v1-preview-llama-8b (fireworks_ai)
cogito-v1-preview-qwen-14b (fireworks_ai)
cogito-v1-preview-qwen-32b (fireworks_ai)
flux-kontext-max (fireworks_ai)
dbrx-instruct (fireworks_ai)
deepseek-coder-1b-base (fireworks_ai)
deepseek-coder-33b-instruct (fireworks_ai)
deepseek-coder-7b-base (fireworks_ai)
deepseek-coder-7b-base-v1p5 (fireworks_ai)
deepseek-coder-7b-instruct-v1p5 (fireworks_ai)
deepseek-coder-v2-lite-base (fireworks_ai)
deepseek-coder-v2-lite-instruct (fireworks_ai)
deepseek-prover-v2 (fireworks_ai)
deepseek-r1-0528-distill-qwen3-8b (fireworks_ai)
deepseek-r1-distill-llama-70b (fireworks_ai)
deepseek-r1-distill-llama-8b (fireworks_ai)
deepseek-r1-distill-qwen-14b (fireworks_ai)
deepseek-r1-distill-qwen-1p5b (fireworks_ai)
deepseek-r1-distill-qwen-32b (fireworks_ai)
deepseek-r1-distill-qwen-7b (fireworks_ai)
deepseek-v2-lite-chat (fireworks_ai)
deepseek-v2p5 (fireworks_ai)
devstral-small-2505 (fireworks_ai)
dobby-mini-unhinged-plus-llama-3-1-8b (fireworks_ai)
dobby-unhinged-llama-3-3-70b-new (fireworks_ai)
dolphin-2-9-2-qwen2-72b (fireworks_ai)
dolphin-2p6-mixtral-8x7b (fireworks_ai)
ernie-4p5-21b-a3b-pt (fireworks_ai)
ernie-4p5-300b-a47b-pt (fireworks_ai)
fare-20b (fireworks_ai)
firefunction-v1 (fireworks_ai)
firellava-13b (fireworks_ai)
firesearch-ocr-v6 (fireworks_ai)
flux-1-dev (fireworks_ai)
flux-1-dev-controlnet-union (fireworks_ai)
flux-1-dev-fp8 (fireworks_ai)
flux-1-schnell (fireworks_ai)
flux-1-schnell-fp8 (fireworks_ai)
gemma-2b-it (fireworks_ai)
gemma-3-27b-it (fireworks_ai)
gemma-7b (fireworks_ai)
gemma-7b-it (fireworks_ai)
gemma2-9b-it (fireworks_ai)
glm-4p5v (fireworks_ai)
gpt-oss-safeguard-120b (fireworks_ai)
gpt-oss-safeguard-20b (fireworks_ai)
hermes-2-pro-mistral-7b (fireworks_ai)
internvl3-38b (fireworks_ai)
internvl3-78b (fireworks_ai)
internvl3-8b (fireworks_ai)
japanese-stable-diffusion-xl (fireworks_ai)
kat-coder (fireworks_ai)
kat-dev-32b (fireworks_ai)
kat-dev-72b-exp (fireworks_ai)
llama-guard-2-8b (fireworks_ai)
llama-guard-3-1b (fireworks_ai)
llama-guard-3-8b (fireworks_ai)
llama-v2-13b (fireworks_ai)
llama-v2-13b-chat (fireworks_ai)
llama-v2-70b (fireworks_ai)
llama-v2-70b-chat (fireworks_ai)
llama-v2-7b (fireworks_ai)
llama-v2-7b-chat (fireworks_ai)
llama-v3-70b-instruct (fireworks_ai)
llama-v3-70b-instruct-hf (fireworks_ai)
llama-v3-8b (fireworks_ai)
llama-v3-8b-instruct-hf (fireworks_ai)
llama-v3p1-405b-instruct-long (fireworks_ai)
llama-v3p1-70b-instruct (fireworks_ai)
llama-v3p1-70b-instruct-1b (fireworks_ai)
llama-v3p1-nemotron-70b-instruct (fireworks_ai)
llama-v3p2-1b (fireworks_ai)
llama-v3p2-3b (fireworks_ai)
llama-v3p3-70b-instruct (fireworks_ai)
llamaguard-7b (fireworks_ai)
llava-yi-34b (fireworks_ai)
minimax-m1-80k (fireworks_ai)
minimax-m2 (fireworks_ai)
ministral-3-14b-instruct-2512 (fireworks_ai)
ministral-3-3b-instruct-2512 (fireworks_ai)
ministral-3-8b-instruct-2512 (fireworks_ai)
mistral-7b (fireworks_ai)
mistral-7b-instruct-4k (fireworks_ai)
mistral-7b-instruct-v0p2 (fireworks_ai)
mistral-7b-instruct-v3 (fireworks_ai)
mistral-7b-v0p2 (fireworks_ai)
mistral-large-3-fp8 (fireworks_ai)
mistral-nemo-base-2407 (fireworks_ai)
mistral-nemo-instruct-2407 (fireworks_ai)
mistral-small-24b-instruct-2501 (fireworks_ai)
mixtral-8x22b (fireworks_ai)
mixtral-8x22b-instruct (fireworks_ai)
mixtral-8x7b (fireworks_ai)
mixtral-8x7b-instruct (fireworks_ai)
mixtral-8x7b-instruct-hf (fireworks_ai)
mythomax-l2-13b (fireworks_ai)
nemotron-nano-v2-12b-vl (fireworks_ai)
nous-capybara-7b-v1p9 (fireworks_ai)
nous-hermes-2-mixtral-8x7b-dpo (fireworks_ai)
nous-hermes-2-yi-34b (fireworks_ai)
nous-hermes-llama2-13b (fireworks_ai)
nous-hermes-llama2-70b (fireworks_ai)
nous-hermes-llama2-7b (fireworks_ai)
nvidia-nemotron-nano-12b-v2 (fireworks_ai)
nvidia-nemotron-nano-9b-v2 (fireworks_ai)
openchat-3p5-0106-7b (fireworks_ai)
openhermes-2-mistral-7b (fireworks_ai)
openhermes-2p5-mistral-7b (fireworks_ai)
openorca-7b (fireworks_ai)
phi-2-3b (fireworks_ai)
phi-3-mini-128k-instruct (fireworks_ai)
phi-3-vision-128k-instruct (fireworks_ai)
phind-code-llama-34b-python-v1 (fireworks_ai)
phind-code-llama-34b-v1 (fireworks_ai)
phind-code-llama-34b-v2 (fireworks_ai)
playground-v2-1024px-aesthetic (fireworks_ai)
playground-v2-5-1024px-aesthetic (fireworks_ai)
pythia-12b (fireworks_ai)
qwen-qwq-32b-preview (fireworks_ai)
qwen-v2p5-14b-instruct (fireworks_ai)
qwen-v2p5-7b (fireworks_ai)
qwen1p5-72b-chat (fireworks_ai)
qwen2-7b-instruct (fireworks_ai)
qwen2-vl-2b-instruct (fireworks_ai)
qwen2-vl-72b-instruct (fireworks_ai)
qwen2-vl-7b-instruct (fireworks_ai)
qwen2p5-0p5b-instruct (fireworks_ai)
qwen2p5-14b (fireworks_ai)
qwen2p5-1p5b-instruct (fireworks_ai)
qwen2p5-32b (fireworks_ai)
qwen2p5-32b-instruct (fireworks_ai)
qwen2p5-72b (fireworks_ai)
qwen2p5-72b-instruct (fireworks_ai)
qwen2p5-7b-instruct (fireworks_ai)
qwen2p5-coder-0p5b (fireworks_ai)
qwen2p5-coder-0p5b-instruct (fireworks_ai)
qwen2p5-coder-14b (fireworks_ai)
qwen2p5-coder-14b-instruct (fireworks_ai)
qwen2p5-coder-1p5b (fireworks_ai)
qwen2p5-coder-1p5b-instruct (fireworks_ai)
qwen2p5-coder-32b (fireworks_ai)
qwen2p5-coder-32b-instruct-128k (fireworks_ai)
qwen2p5-coder-32b-instruct-32k-rope (fireworks_ai)
qwen2p5-coder-32b-instruct-64k (fireworks_ai)
qwen2p5-coder-3b (fireworks_ai)
qwen2p5-coder-3b-instruct (fireworks_ai)
qwen2p5-coder-7b (fireworks_ai)
qwen2p5-coder-7b-instruct (fireworks_ai)
qwen2p5-math-72b-instruct (fireworks_ai)
qwen2p5-vl-32b-instruct (fireworks_ai)
qwen2p5-vl-3b-instruct (fireworks_ai)
qwen2p5-vl-72b-instruct (fireworks_ai)
qwen2p5-vl-7b-instruct (fireworks_ai)
qwen3-0p6b (fireworks_ai)
qwen3-14b (fireworks_ai)
qwen3-1p7b (fireworks_ai)
qwen3-1p7b-fp8-draft (fireworks_ai)
qwen3-1p7b-fp8-draft-131072 (fireworks_ai)
qwen3-1p7b-fp8-draft-40960 (fireworks_ai)
qwen3-235b-a22b (fireworks_ai)
qwen3-235b-a22b-instruct-2507 (fireworks_ai)
qwen3-235b-a22b-thinking-2507 (fireworks_ai)
qwen3-30b-a3b (fireworks_ai)
qwen3-30b-a3b-instruct-2507 (fireworks_ai)
qwen3-30b-a3b-thinking-2507 (fireworks_ai)
qwen3-32b (fireworks_ai)
qwen3-4b (fireworks_ai)
qwen3-4b-instruct-2507 (fireworks_ai)
qwen3-8b (fireworks_ai)
qwen3-coder-30b-a3b-instruct (fireworks_ai)
qwen3-coder-480b-instruct-bf16 (fireworks_ai)
models (fireworks_ai)
qwen3-next-80b-a3b-instruct (fireworks_ai)
qwen3-next-80b-a3b-thinking (fireworks_ai)
qwen3-vl-235b-a22b-instruct (fireworks_ai)
qwen3-vl-235b-a22b-thinking (fireworks_ai)
qwen3-vl-30b-a3b-instruct (fireworks_ai)
qwen3-vl-30b-a3b-thinking (fireworks_ai)
qwen3-vl-32b-instruct (fireworks_ai)
qwen3-vl-8b-instruct (fireworks_ai)
qwq-32b (fireworks_ai)
rolm-ocr (fireworks_ai)
snorkel-mistral-7b-pairrm-dpo (fireworks_ai)
stable-diffusion-xl-1024-v1-0 (fireworks_ai)
stablecode-3b (fireworks_ai)
starcoder-16b (fireworks_ai)
starcoder-7b (fireworks_ai)
starcoder2-15b (fireworks_ai)
starcoder2-3b (fireworks_ai)
starcoder2-7b (fireworks_ai)
toppy-m-7b (fireworks_ai)
yi-34b (fireworks_ai)
yi-34b-200k-capybara (fireworks_ai)
yi-34b-chat (fireworks_ai)
yi-6b (fireworks_ai)
zephyr-7b-beta (fireworks_ai)
deepseek-v3.2 (novita)
minimax-m2.1 (novita)
glm-4.7 (novita)
mimo-v2-flash (novita)
autoglm-phone-9b-multilingual (novita)
kimi-k2-thinking (novita)
minimax-m2 (novita)
paddleocr-vl (novita)
deepseek-v3.2-exp (novita)
qwen3-vl-235b-a22b-thinking (novita)
glm-4.6v (novita)
glm-4.6 (novita)
kat-coder-pro (novita)
qwen3-next-80b-a3b-instruct (novita)
qwen3-next-80b-a3b-thinking (novita)
deepseek-ocr (novita)
deepseek-v3.1-terminus (novita)
qwen3-vl-235b-a22b-instruct (novita)
qwen3-max (novita)
r1v4-lite (novita)
deepseek-v3.1 (novita)
kimi-k2-0905 (novita)
qwen3-coder-480b-a35b-instruct (novita)
qwen3-coder-30b-a3b-instruct (novita)
gpt-oss-120b (novita)
kimi-k2-instruct (novita)
deepseek-v3-0324 (novita)
glm-4.5 (novita)
qwen3-235b-a22b-thinking-2507 (novita)
llama-3.1-8b-instruct (novita)
gemma-3-12b-it (novita)
glm-4.5v (novita)
gpt-oss-20b (novita)
qwen3-235b-a22b-instruct-2507 (novita)
deepseek-r1-distill-qwen-14b (novita)
llama-3.3-70b-instruct (novita)
qwen-2.5-72b-instruct (novita)
mistral-nemo (novita)
minimax-m1-80k (novita)
deepseek-r1-0528 (novita)
deepseek-r1-distill-qwen-32b (novita)
llama-3-8b-instruct (novita)
wizardlm-2-8x22b (novita)
deepseek-r1-0528-qwen3-8b (novita)
deepseek-r1-distill-llama-70b (novita)
llama-3-70b-instruct (novita)
qwen3-235b-a22b-fp8 (novita)
llama-4-maverick-17b-128e-instruct-fp8 (novita)
llama-4-scout-17b-16e-instruct (novita)
hermes-2-pro-llama-3-8b (novita)
qwen2.5-vl-72b-instruct (novita)
l3-70b-euryale-v2.1 (novita)
ernie-4.5-21B-a3b-thinking (novita)
l3-8b-lunaris (novita)
baichuan-m2-32b (novita)
ernie-4.5-vl-424b-a47b (novita)
ernie-4.5-300b-a47b-paddle (novita)
deepseek-prover-v2-671b (novita)
qwen3-32b-fp8 (novita)
qwen3-30b-a3b-fp8 (novita)
gemma-3-27b-it (novita)
deepseek-v3-turbo (novita)
deepseek-r1-turbo (novita)
L3-8B-Stheno-v3.2 (novita)
mythomax-l2-13b (novita)
ernie-4.5-vl-28b-a3b-thinking (novita)
qwen3-vl-8b-instruct (novita)
glm-4.5-air (novita)
qwen3-vl-30b-a3b-instruct (novita)
qwen3-vl-30b-a3b-thinking (novita)
qwen3-omni-30b-a3b-thinking (novita)
qwen3-omni-30b-a3b-instruct (novita)
qwen-mt-plus (novita)
ernie-4.5-vl-28b-a3b (novita)
ernie-4.5-21B-a3b (novita)
qwen3-8b-fp8 (novita)
qwen3-4b-fp8 (novita)
qwen2.5-7b-instruct (novita)
llama-3.2-3b-instruct (novita)
l31-70b-euryale-v2.2 (novita)
qwen3-embedding-0.6b (novita)
qwen3-embedding-8b (novita)
bge-m3 (novita)
qwen3-reranker-8b (novita)
bge-reranker-v2-m3 (novita)
llama-3.1-8b (llamagate)
llama-3.2-3b (llamagate)
mistral-7b-v0.3 (llamagate)
qwen3-8b (llamagate)
dolphin3-8b (llamagate)
deepseek-r1-8b (llamagate)
deepseek-r1-7b-qwen (llamagate)
openthinker-7b (llamagate)
qwen2.5-coder-7b (llamagate)
deepseek-coder-6.7b (llamagate)
codellama-7b (llamagate)
qwen3-vl-8b (llamagate)
llava-7b (llamagate)
gemma3-4b (llamagate)
nomic-embed-text (llamagate)
qwen3-embedding-8b (llamagate)
gpt-4o-mini-tts-2025-03-20 (OpenAI)
gpt-4o-mini-tts-2025-12-15 (OpenAI)
gpt-4o-mini-transcribe-2025-03-20 (OpenAI)
gpt-4o-mini-transcribe-2025-12-15 (OpenAI)
gpt-5-search-api (OpenAI)
gpt-5-search-api-2025-10-14 (OpenAI)
gpt-realtime-mini-2025-10-06 (OpenAI)
gpt-realtime-mini-2025-12-15 (OpenAI)
chatgpt-image-latest (OpenAI)
gemini-2.0-flash-lite-001 (Google)
gemini-2.5-flash-native-audio-latest (Google)
gemini-2.5-flash-native-audio-preview-09-2025 (Google)
gemini-2.5-flash-native-audio-preview-12-2025 (Google)
gemini-3.1-flash-live-preview (Google)
gemini-2.5-flash-native-audio-latest (Google)
gemini-2.5-flash-native-audio-preview-09-2025 (Google)
gemini-2.5-flash-native-audio-preview-12-2025 (Google)
gemini-3.1-flash-live-preview (Google)
gemini-2.5-flash-preview-tts (Google)
gemini-flash-latest (Google)
gemini-flash-lite-latest (Google)
gemini-pro-latest (Google)
gemini-pro-latest (Google)
gemini-exp-1206 (Google)
claude-sonnet-4-6@default (vertex_ai-anthropic_models)
openai.gpt-oss-120b (bedrock_mantle)
openai.gpt-oss-20b (bedrock_mantle)
openai.gpt-oss-safeguard-120b (bedrock_mantle)
openai.gpt-oss-safeguard-20b (bedrock_mantle)
zai.glm-5 (Bedrock)
zai.glm-5 (Bedrock)
anthropic.claude-haiku-4-5-20251001-v1:0 (Bedrock)
anthropic.claude-haiku-4-5-20251001-v1:0 (Bedrock)
Input tokens / request
Output tokens / request
Requests / month
Advanced options
Cache hit rate (%)
Calculate
Results
Cost per request
-
Monthly cost
-
Model
-
Input cost
-
Output cost
-
Cache savings
-
Same usage with other models
Model
Monthly cost
Difference