
DeepSeek V4 Pro

deepseek

deepseek/deepseek-v4-pro


DeepSeek V4 Pro reasoning model. Admin-gated by default.

Context

200K

Max output

66K

Tools

STANDARD

Reasoning

Supported

Pricing

Lane               Per 1M tokens
Input              $6.67
Output             $13.33
Cache read         $0.06
Cache write (5m)   $8.33
Cache write (1h)   $13.33

Billed per token. No minimums, no per-request fees. Caching applies on supported providers; misses fall back to standard input/output rates.
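As a rough illustration of per-token billing, the sketch below estimates the cost of one request from the listed rates. The function name and the way cached tokens are counted (as a subset of input tokens billed at the cache-read rate) are assumptions made for this example, not documented billing behavior. Cache-write charges are left out.

```python
# Rates in USD per 1M tokens, copied from the pricing list on this page.
RATES_PER_MILLION = {
    "input": 6.67,
    "output": 13.33,
    "cache_read": 0.06,
}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Return an estimated request cost in USD (hypothetical helper).

    ASSUMPTION: cached_tokens is the part of input_tokens served from cache
    and billed at the cache-read rate instead of the input rate.
    Cache-write fees are not modeled here.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * RATES_PER_MILLION["input"]
        + cached_tokens * RATES_PER_MILLION["cache_read"]
        + output_tokens * RATES_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens, nothing cached.
    print(f"${estimate_cost(10_000, 2_000):.5f}")
```

Under these assumptions, a request with no cache hits pays the full input rate on every prompt token, which is the fallback the billing note describes.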

Reasoning effort

off · low · medium (default) · high
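This page does not say how the effort level is passed in a request. The sketch below assumes the gateway reads a `reasoning_effort` field in the request body, a name borrowed from OpenAI's Chat Completions API, so check the API reference before relying on it. The helper `build_chat_kwargs` is hypothetical and only builds arguments; it makes no network call.

```python
# Effort levels as listed on this page; "medium" is the documented default.
EFFORT_LEVELS = ("off", "low", "medium", "high")


def build_chat_kwargs(prompt, effort=None):
    """Build keyword arguments for client.chat.completions.create().

    ASSUMPTION: the gateway accepts the effort level as a `reasoning_effort`
    field in the JSON body. It is sent through the OpenAI SDK's `extra_body`
    option so that values such as "off" are forwarded unchanged.
    """
    kwargs = {
        "model": "deepseek/deepseek-v4-pro",
        "messages": [{"role": "user", "content": prompt}],
    }
    if effort is not None:
        if effort not in EFFORT_LEVELS:
            raise ValueError(f"effort must be one of {EFFORT_LEVELS}")
        kwargs["extra_body"] = {"reasoning_effort": effort}
    # When effort is None the field is omitted and the server default applies.
    return kwargs
```

With a configured client, this could be used as `client.chat.completions.create(**build_chat_kwargs("Hello", effort="high"))`.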

Routed providers

  • anthropic

Requests are automatically routed to the highest-priority healthy upstream. Failures fall back to the next route transparently.
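Routing happens on the server, so clients need no extra code for it. To make the behavior concrete, here is a minimal sketch of priority-ordered fallback. It is not the gateway's implementation: the `Route` type, the "lower number is tried first" convention, and the route names in the usage note are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class Route:
    name: str
    priority: int                 # ASSUMPTION: lower number is tried first
    healthy: bool                 # result of some health check, not modeled here
    send: Callable[[str], str]    # forwards the payload to the upstream


def route_request(routes: Sequence[Route], payload: str) -> str:
    """Try healthy routes in priority order; fall back to the next on failure."""
    last_error: Optional[Exception] = None
    for route in sorted(routes, key=lambda r: r.priority):
        if not route.healthy:
            continue  # unhealthy upstreams are skipped entirely
        try:
            return route.send(payload)
        except Exception as exc:
            last_error = exc  # remember the failure and try the next route
    raise RuntimeError("no healthy route succeeded") from last_error
```

For example, given a healthy `primary` route whose `send` raises and a healthy `backup` route, `route_request` returns the backup's response and the caller never sees the first failure.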

Quickstart

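from openai import OpenAI

# Point the OpenAI SDK at the gateway's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.vecbase.com/v1",
    api_key="sk-vbc-...",
)

# Send a single-turn chat request to this model.
response = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",
    messages=[{"role": "user", "content": "Hello"}],
)
# Print the text of the first completion choice.
print(response.choices[0].message.content)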


Create a project, mint an API key, and call this model from any OpenAI-compatible client.
