Model marketplace

100+ models. One catalog.

Compare pricing, latency, context windows, and capabilities across every major LLM provider.

OpenAI
GPT-4.1
1M

Reasoning + coding — best for general + code.

Input
$5/M
Output
$15/M
Latency
420ms
Tools Vision Enterprise
Anthropic
Claude Sonnet 4
200K

Writing + agents — best for agents + writing.

Input
$3/M
Output
$15/M
Latency
480ms
Tools Enterprise
Google
Gemini 2.5 Pro
1M

Multimodal + long context — best for long context, vision.

Input
$1.25/M
Output
$5/M
Latency
510ms
Tools Vision Long ctx Enterprise
Meta
Llama 4 405B
256K

Open weights — best for customizable, self-host.

Input
$0.6/M
Output
$0.9/M
Latency
720ms
Tools Open
DeepSeek
DeepSeek R1
128K

Reasoning — best for math, logic, planning.

Input
$0.27/M
Output
$1.1/M
Latency
980ms
Open
Mistral
Mistral Large 2
128K

Enterprise EU — best for eu deployment.

Input
$2/M
Output
$6/M
Latency
460ms
Tools Enterprise
xAI
Grok 3
128K

Realtime + tools — best for search-grounded.

Input
$5/M
Output
$15/M
Latency
540ms
Tools Vision
Groq
Llama 3.3 70B
128K

Lowest latency — best for realtime ux.

Input
$0.59/M
Output
$0.79/M
Latency
110ms
Open
Cohere
Command R+
128K

RAG + tools — best for enterprise rag.

Input
$2.5/M
Output
$10/M
Latency
520ms
Tools Enterprise
Perplexity
Sonar Pro
200K

Web search grounded — best for live information.

Input
$3/M
Output
$15/M
Latency
880ms
Tools