DeepSeek: DeepSeek V4 Pro
deepseek-v4-pro
chat, DeepSeek
Quick Reference
- Input: Text
- Output: Text
- Context: 1M
- Max Output: 393K
- Input Price: $0.435/M
- Output Price: $0.87/M
- Author: DeepSeek
- Version: main
- Open Source: Yes
Overview
DeepSeek V4 Pro is a flagship mixture-of-experts (MoE) model with 1.6T total parameters, of which 49B are activated, and native support for a 1M-token context. Trained on a large, high-quality corpus, it targets mathematical logic, complex reasoning, professional coding, and long-text comprehension, which makes it well suited to advanced research, complex office workflows, and agent-based applications.
Input modalities
Text
Output modalities
Text
Capabilities
chat
Features
Function Calling
Structured Output
Caching
Batch Processing
Web Search
Pricing
Token prices for DeepSeek V4 Pro, in USD.
| Token Type | Price | Unit |
|---|---|---|
| Input | $0.435 | per million tokens |
| Output | $0.87 | per million tokens |
| Cache Read | $0.003625 | per million tokens |
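As a rough illustration of how these rates combine, the sketch below estimates the cost of a single request. The prices come from the table above; the function name `estimate_cost` and the billing rule it encodes (cached prompt tokens billed at the Cache Read rate instead of the Input rate) are assumptions made for this example, not confirmed billing behavior.

```python
# Prices in USD per million tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": 0.435,
    "output": 0.87,
    "cache_read": 0.003625,
}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached prompt tokens are billed at the cache-read rate
    and the remaining prompt tokens at the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K prompt tokens and 20K completion tokens, without and with caching.
    print(f"no cache:   ${estimate_cost(100_000, 20_000):.5f}")
    print(f"80K cached: ${estimate_cost(100_000, 20_000, cached_tokens=80_000):.5f}")
```

Under that assumption, most of the saving from caching comes from the prompt side, since the Cache Read rate is a small fraction of the Input rate.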
Specifications
Context Window
1M tokens
Max Input
607K tokens
Max Output
393K tokens
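The input and output limits add up to the context window (607K + 393K = 1M), so a prompt that fits within Max Input always leaves room for the full Max Output. A minimal sketch of a client-side check built on these figures follows; it assumes "K" means 1,000 tokens, and the helper name `clamp_max_tokens` is hypothetical rather than part of any SDK.

```python
# Limits from the specifications above, assuming K = 1,000 and M = 1,000,000.
CONTEXT_WINDOW = 1_000_000
MAX_INPUT_TOKENS = 607_000
MAX_OUTPUT_TOKENS = 393_000


def clamp_max_tokens(prompt_tokens, requested_output):
    """Return a max_tokens value that respects the published limits.

    Hypothetical helper: raises if the prompt exceeds the input limit,
    otherwise caps the requested output at the output limit.
    """
    if prompt_tokens > MAX_INPUT_TOKENS:
        raise ValueError(
            f"prompt has {prompt_tokens} tokens; limit is {MAX_INPUT_TOKENS}"
        )
    return min(requested_output, MAX_OUTPUT_TOKENS)


if __name__ == "__main__":
    # The two limits partition the context window exactly.
    print(MAX_INPUT_TOKENS + MAX_OUTPUT_TOKENS == CONTEXT_WINDOW)
    print(clamp_max_tokens(600_000, 500_000))
```

The token count of a prompt depends on the model's tokenizer, so `prompt_tokens` here would have to come from a real token count, not a character estimate.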
API Reference
OpenAI-compatible endpoint at https://api.inferoute.ai/v1.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.inferoute.ai/v1",
    api_key=os.environ.get("INFEROUTE_API_KEY"),
)

try:
    response = client.chat.completions.create(
        model="deepseek-v4-pro",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Write a haiku about recursion."},
        ],
        max_tokens=512,
        temperature=0.7,
    )
    print(response.choices[0].message.content)
except Exception as e:
    print(f"Error: {e}")