HomeModel Market

Model Market

Access 50+ LLMs via API, trade token blocks on the spot market, or trade futures on model pricing.

Account Balance

$1,247.50USD

This Month

$342.18 used

API Calls

1.2M today

Platform fee: 2% per API call
|Unified endpoint: api.mercatus.ai/v1/chat/completions

GPT-4o

OpenAI

Live

Flagship multimodal model with vision, audio, and text capabilities.

CodingAnalysisMultimodalReasoning

Input / 1M tokens

$2.50

Output / 1M tokens

$10.00

128K ~500ms 98%

GPT-4o Mini

OpenAI

Live

Cost-efficient model for lightweight tasks.

FastAffordableChat

Input / 1M tokens

$0.15

Output / 1M tokens

$0.60

128K ~200ms 95%

Claude 3.5 Sonnet

Anthropic

Live

Best-in-class for complex reasoning and coding tasks.

CodingAnalysisLong ContextReasoning

Input / 1M tokens

$3.00

Output / 1M tokens

$15.00

200K ~600ms 94%

DeepSeek V3

DeepSeek

Live

High-performance open-source model with MoE architecture.

CodingAffordableOpen SourceMath

Input / 1M tokens

$0.14

Output / 1M tokens

$0.28

128K ~300ms 91%

DeepSeek R1

DeepSeek

Live

Reasoning-focused model with chain-of-thought capabilities.

ReasoningMathOpen Source

Input / 1M tokens

$0.55

Output / 1M tokens

$2.19

128K ~800ms 88%

Gemini 2.0 Flash

Google

Live

Ultra-fast model optimized for speed and efficiency.

FastAffordableLong ContextMultimodal

Input / 1M tokens

$0.10

Output / 1M tokens

$0.40

1M ~150ms 87%

Llama 3.1 405B

Yotta (Self-hosted)

Live

Meta's largest open model, self-hosted for cost efficiency.

Open SourceMultilingualResearch

Input / 1M tokens

$0.80

Output / 1M tokens

$0.80

128K ~400ms 85%

Claude 3 Opus

Anthropic

Live

Most capable Claude model for complex tasks.

Complex ReasoningResearchCreative Writing

Input / 1M tokens

$15.00

Output / 1M tokens

$75.00

200K ~1200ms 78%

Mixtral 8x22B

Yotta (Self-hosted)

Live

Mistral's MoE model with excellent cost-performance ratio.

Open SourceEfficientMultilingual

Input / 1M tokens

$0.65

Output / 1M tokens

$0.65

64K ~350ms 76%

Qwen 2.5 72B

Yotta (Self-hosted)

Live

Alibaba's flagship model with strong multilingual support.

Open SourceMultilingualCoding

Input / 1M tokens

$0.40

Output / 1M tokens

$0.40

128K ~350ms 74%

Grok 2

xAI

Live

xAI's latest model with real-time knowledge.

ChatAnalysisGrounding

Input / 1M tokens

$2.00

Output / 1M tokens

$10.00

128K ~500ms 72%

Command R+

Cohere

Live

Enterprise-grade model optimized for RAG and tool use.

ResearchGroundingMultilingual

Input / 1M tokens

$2.50

Output / 1M tokens

$10.00

128K ~600ms 68%

Quick Start — OpenAI-Compatible API

Use any OpenAI SDK. Just change the base URL and API key.

curl https://api.mercatus.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'