Model marketplace

100+ models. One catalog.

Compare pricing, latency, context windows, and capabilities across every major LLM provider.

OpenAI

GPT-4.1

Reasoning + coding — best for general + code.

Input

$5/M

Output

$15/M

Latency

420ms

Tools Vision Enterprise

Anthropic

Claude Sonnet 4

200K

Writing + agents — best for agents + writing.

Input

$3/M

Output

$15/M

Latency

480ms

Tools Enterprise

Google

Gemini 2.5 Pro

Multimodal + long context — best for long context, vision.

Input

$1.25/M

Output

$5/M

Latency

510ms

Tools Vision Long ctx Enterprise

Meta

Llama 4 405B

256K

Open weights — best for customizable, self-host.

Input

$0.6/M

Output

$0.9/M

Latency

720ms

Tools Open

DeepSeek

DeepSeek R1

128K

Reasoning — best for math, logic, planning.

Input

$0.27/M

Output

$1.1/M

Latency

980ms

Open

Mistral

Mistral Large 2

128K

Enterprise EU — best for eu deployment.

Input

$2/M

Output

$6/M

Latency

460ms

Tools Enterprise

xAI

Grok 3

128K

Realtime + tools — best for search-grounded.

Input

$5/M

Output

$15/M

Latency

540ms

Tools Vision

Groq

Llama 3.3 70B

128K

Lowest latency — best for realtime ux.

Input

$0.59/M

Output

$0.79/M

Latency

110ms

Open

Cohere

Command R+

128K

RAG + tools — best for enterprise rag.

Input

$2.5/M

Output

$10/M

Latency

520ms

Tools Enterprise

Perplexity

Sonar Pro

200K

Web search grounded — best for live information.

Input

$3/M

Output

$15/M

Latency

880ms

Tools