Qwen: Qwen3.7 Max

qwen3.7-max
chatQwen

Quick Reference

Input
Text
Output
Text
Context
1M
Max Output
65.5K
Input Price
$1.25/M
Output Price
$3.75/M
Author
Alibaba
Version
main
Open Source
Yes

Overview

The largest and most capable Max model in the Qwen3.7 series, currently opened with pure-text capabilities for early experience. Qwen3.7 is a new-generation flagship model built for the agent era, with core advantages in the breadth and depth of agent capabilities—excelling in programming, office and productivity tasks, and long-horizon autonomous execution. This version is functionally equivalent to the snapshot model qwen3.7-max-2026-05-20.

Input modalities

Text

Output modalities

Text

Capabilities

chatreasoning

Features

Function Calling
Structured Output
Caching
Batch Processing
Web Search
Prefix Completion

Pricing

Per-token prices for Qwen: Qwen3.7 Max.

Input <= 128K

Token TypePriceUnit
Input$1.25/Mper million tokens
Output$3.75/Mper million tokens
Cache Read$0.25/Mper million tokens

Specifications

Context Window

1Mtokens

Max Input

934Ktokens

Max Output

65.5Ktokens

API Reference

OpenAI-compatible endpoint at https://api.inferoute.ai/v1.

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.inferoute.ai/v1",
    api_key=os.environ.get("INFEROUTE_API_KEY"),
)

try:
    response = client.chat.completions.create(
        model="qwen3.7-max",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Write a haiku about recursion."},
        ],
        max_tokens=512,
        temperature=0.7,
    )

    print(response.choices[0].message.content)
except Exception as e:
    print(f"Error: {e}")