DeepSeek: DeepSeek V4 Pro

deepseek-v4-pro

chatDeepSeek

Quick Reference

Input: Text
Output: Text

Context: 1M
Max Output: 393K

Input Price: $0.435/M
Output Price: $0.87/M

Author: DeepSeek
Version: main
Open Source: Yes

Overview

Flagship MoE large model with 1.6T total parameters and 49B activated parameters, natively supporting million-token ultra-long context. Backed by massive high-quality training data, it delivers top-tier mathematical logic, complex reasoning, professional coding, and deep long-text comprehension—well suited for advanced research, complex office workflows, and deep intelligent agent scenarios.

Input modalities

Text

Output modalities

Text

Capabilities

chat

Features

Function Calling

Structured Output

Caching

Batch Processing

Web Search

Pricing

Per-token prices for DeepSeek: DeepSeek V4 Pro.

Token Type	Price	Unit
Input	$0.435/M	per million tokens
Output	$0.87/M	per million tokens
Cache Read	$0.003625/M	per million tokens

Specifications

Context Window

1Mtokens

Max Input

607Ktokens

Max Output

393Ktokens

API Reference

OpenAI-compatible endpoint at https://api.inferoute.ai/v1.

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.inferoute.ai/v1",
    api_key=os.environ.get("INFEROUTE_API_KEY"),
)

try:
    response = client.chat.completions.create(
        model="deepseek-v4-pro",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Write a haiku about recursion."},
        ],
        max_tokens=512,
        temperature=0.7,
    )

    print(response.choices[0].message.content)
except Exception as e:
    print(f"Error: {e}")