DeepSeek: DeepSeek V4 Pro

deepseek-v4-pro
chatDeepSeek

Quick Reference

Input
Text
Output
Text
Context
1M
Max Output
393K
Input Price
$0.435/M
Output Price
$0.87/M
Author
DeepSeek
Version
main
Open Source
Yes

Overview

Flagship MoE large model with 1.6T total parameters and 49B activated parameters, natively supporting million-token ultra-long context. Backed by massive high-quality training data, it delivers top-tier mathematical logic, complex reasoning, professional coding, and deep long-text comprehension—well suited for advanced research, complex office workflows, and deep intelligent agent scenarios.

Input modalities

Text

Output modalities

Text

Capabilities

chat

Features

Function Calling
Structured Output
Caching
Batch Processing
Web Search

Pricing

Per-token prices for DeepSeek: DeepSeek V4 Pro.

Token TypePriceUnit
Input$0.435/Mper million tokens
Output$0.87/Mper million tokens
Cache Read$0.003625/Mper million tokens

Specifications

Context Window

1Mtokens

Max Input

607Ktokens

Max Output

393Ktokens

API Reference

OpenAI-compatible endpoint at https://api.inferoute.ai/v1.

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.inferoute.ai/v1",
    api_key=os.environ.get("INFEROUTE_API_KEY"),
)

try:
    response = client.chat.completions.create(
        model="deepseek-v4-pro",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Write a haiku about recursion."},
        ],
        max_tokens=512,
        temperature=0.7,
    )

    print(response.choices[0].message.content)
except Exception as e:
    print(f"Error: {e}")