Qwen: Qwen3.7 Max
qwen3.7-max
chat, Qwen
Quick Reference
- Input: Text
- Output: Text
- Context: 1M
- Max Output: 65.5K
- Input Price: $1.25/M
- Output Price: $3.75/M
- Author: Alibaba
- Version: main
- Open Source: Yes
Overview
Qwen3.7 Max is the largest and most capable model in the Qwen3.7 series. It is currently offered in text-only form for early access. Qwen3.7 is a new-generation flagship model built for the agent era; its core strengths are the breadth and depth of its agent capabilities, and it excels at programming, office and productivity tasks, and long-horizon autonomous execution. This version is functionally equivalent to the snapshot model qwen3.7-max-2026-05-20.
Input modalities
Text
Output modalities
Text
Capabilities
chat, reasoning
Features
Function Calling
Structured Output
Caching
Batch Processing
Web Search
Prefix Completion
Pricing
Per-token prices for Qwen: Qwen3.7 Max.
Pricing tier: input <= 128K tokens
| Token Type | Price | Unit |
|---|---|---|
| Input | $1.25/M | per million tokens |
| Output | $3.75/M | per million tokens |
| Cache Read | $0.25/M | per million tokens |
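The table can be turned into a quick per-request cost estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, it assumes cached tokens are billed at the cache-read rate instead of the input rate, and it assumes the tier boundary is 128,000 tokens (the page gives only "128K"). No prices are listed for larger inputs, so the helper refuses them rather than guessing.

```python
# Prices in USD per million tokens, copied from the pricing table.
INPUT_PRICE = 1.25
OUTPUT_PRICE = 3.75
CACHE_READ_PRICE = 0.25

# Assumption: "128K" means 128,000 tokens (the stricter reading).
TIER_MAX_INPUT_TOKENS = 128_000


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request in the <= 128K input tier.

    Assumption: cached_tokens is the part of input_tokens served from
    cache, billed at the cache-read rate instead of the input rate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must not be negative")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if input_tokens > TIER_MAX_INPUT_TOKENS:
        raise ValueError("no price listed for inputs above the 128K tier")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens, 2K output tokens, nothing cached.
    print(f"${estimate_cost(100_000, 2_000):.4f}")  # $0.1325
    # Same request with 80K of the input served from cache.
    print(f"${estimate_cost(100_000, 2_000, cached_tokens=80_000):.4f}")  # $0.0525
```

Under these assumptions, a well-cached prompt cuts the input share of the bill by up to a factor of five, since cache reads cost $0.25/M against $1.25/M for fresh input.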
Specifications
Context Window
1M tokens
Max Input
934K tokens
Max Output
65.5K tokens
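A request can be checked against these limits before it is sent. The following is a minimal sketch, not part of the provider's API: `check_request` is a hypothetical helper, and the exact limits are assumptions, since the page gives only rounded figures. The constants use the conservative readings 934,000 and 65,500, which together stay just under the 1M context window. Counting prompt tokens is left to the caller, because the page does not name a tokenizer.

```python
# Assumed conservative readings of the rounded figures "934K" and "65.5K".
MAX_INPUT_TOKENS = 934_000
MAX_OUTPUT_TOKENS = 65_500


def check_request(prompt_tokens: int, max_tokens: int) -> list[str]:
    """Return a list of limit violations; an empty list means the request fits."""
    problems = []
    if prompt_tokens > MAX_INPUT_TOKENS:
        problems.append(
            f"prompt has {prompt_tokens} tokens, above the {MAX_INPUT_TOKENS} input limit"
        )
    if max_tokens > MAX_OUTPUT_TOKENS:
        problems.append(
            f"max_tokens={max_tokens} is above the {MAX_OUTPUT_TOKENS} output limit"
        )
    return problems


if __name__ == "__main__":
    print(check_request(prompt_tokens=50_000, max_tokens=512))   # []
    print(len(check_request(prompt_tokens=950_000, max_tokens=70_000)))  # 2
```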
API Reference
OpenAI-compatible endpoint at https://api.inferoute.ai/v1.
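import os

from openai import OpenAI

# Point the OpenAI SDK at the OpenAI-compatible endpoint.
# The API key is read from the INFEROUTE_API_KEY environment variable.
client = OpenAI(
    base_url="https://api.inferoute.ai/v1",
    api_key=os.environ.get("INFEROUTE_API_KEY"),
)

try:
    response = client.chat.completions.create(
        model="qwen3.7-max",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Write a haiku about recursion."},
        ],
        max_tokens=512,  # cap on generated tokens for this request
        temperature=0.7,
    )
    # Print the text of the first (and only) completion choice.
    print(response.choices[0].message.content)
except Exception as e:
    # Covers network, authentication, and API errors alike.
    print(f"Error: {e}")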
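Function Calling is listed among the model's features. Since the endpoint is described as OpenAI-compatible, a tool-calling request can plausibly be sketched with the standard `tools` parameter of the OpenAI SDK. Whether this endpoint honors that format exactly is an assumption, not something the page confirms. The `get_weather` tool, its schema, and the `run_tool_call` dispatcher are hypothetical and exist only for illustration.

```python
import json
import os


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"Sunny in {city}"


# Tool schema in the OpenAI chat-completions format (assumed to be accepted here).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Return a short weather report for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

LOCAL_FUNCTIONS = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch one tool call; the model sends arguments as a JSON string."""
    func = LOCAL_FUNCTIONS[name]
    return func(**json.loads(arguments_json))


def main() -> None:
    # Imported here so the helpers above can be used without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.inferoute.ai/v1",
        api_key=os.environ.get("INFEROUTE_API_KEY"),
    )
    response = client.chat.completions.create(
        model="qwen3.7-max",
        messages=[{"role": "user", "content": "What is the weather in Hangzhou?"}],
        tools=TOOLS,
    )
    message = response.choices[0].message
    # tool_calls is None when the model answers directly instead of calling a tool.
    for call in message.tool_calls or []:
        print(run_tool_call(call.function.name, call.function.arguments))


if __name__ == "__main__":
    main()
```

In a full exchange, each tool result would be appended to `messages` with role `tool` and the request repeated so the model can write its final answer; that second round trip is omitted here to keep the sketch short.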