Claude Opus 4.7: Coding, Vision, Cost Deep Dive

Q: Is Claude Opus 4.7 the most powerful Claude model?

It's the most powerful publicly available Claude model. Anthropic has an unreleased model called Claude Mythos that is reportedly more powerful but was deemed too risky for public release.

Q: How much does Claude Opus 4.7 cost?

Listed price is $5 per million input tokens and $25 per million output tokens — same as 4.6. However, a new tokenizer means the same text uses up to 35% more tokens, so real costs are effectively higher.

Q: Can I use Opus 4.7 with Claude Code?

Yes — Claude Code now defaults to Opus 4.7 with xhigh effort. The new /ultrareview command is exclusive to Claude Code with Opus 4.7.

Claude Opus 4.7 launched April 16, 2026 with major coding upgrades, 3.75MP image support, and a new xhigh effort tier. But a silent tokenizer change makes it up to 35% more expensive than the sticker price. Full breakdown inside.

TL;DR: Claude Opus 4.7 (released April 16, 2026) is Anthropic's most capable public model yet — with dramatically better coding, 3x higher image resolution, and a new "xhigh" effort tier. The catch? A new tokenizer quietly makes it up to 35% more expensive than the sticker price suggests. Here's everything you need to know.

Feature	Opus 4.6	Opus 4.7
Max Image Resolution	~800px	2,576px (~3.75 MP)
Effort Levels	off / low / medium / high / max	+ new "xhigh" tier
Context Window	1M tokens	1M tokens
Sticker Price (input/output)	$5 / $25 per M tokens	$5 / $25 per M tokens
Real Cost (after tokenizer)	baseline	up to +35% more tokens
SWE-bench Pro	—	significant jump
Claude Code default effort	high	xhigh
Reasoning traces default	visible	hidden

What Is Claude Opus 4.7?

Anthropic dropped Claude Opus 4.7 on April 16, 2026 — and it's the biggest leap in the Opus line since the model family launched. It's available across Claude.ai, the Claude API, Amazon Bedrock, Google Cloud Vertex AI, Microsoft Foundry, and GitHub Copilot.

The headline: Opus 4.7 isn't just incrementally better — it's qualitatively different for long-horizon, agentic, and complex reasoning tasks. Anthropic is positioning this as the model you can actually trust to run overnight and come back to finished work.

The Big Upgrades: What's Actually New

1. Coding That Runs Itself

The most talked-about improvement is in advanced coding. Opus 4.7 shows a notable jump on SWE-bench Pro and CursorBench — the benchmarks that actually matter for real-world engineering tasks (not just toy coding challenges).

What this means in practice: users are reporting they can hand off complex, long-running coding sessions without babysitting the model. It:

Handles multi-step systems engineering with better consistency
Self-verifies outputs — it actively looks for ways to check its own work
Follows complex instruction chains more precisely (less drift)
Carries tasks to completion instead of stopping halfway through

For Claude Code specifically, the default effort tier has been bumped to "xhigh" and there's a new /ultrareview command that simulates a senior reviewer going over your code.

# Claude Code now defaults to xhigh effort
# New ultrareview command for critical code review
/ultrareview

# Example: overnight agentic coding session
# Opus 4.7 now reliably completes multi-session work
# with file-system-based memory between sessions

2. Vision That Can Actually Read

Opus 4.6 had mediocre vision. Opus 4.7 fixes this — it now accepts images up to 2,576 pixels on the long edge (~3.75 megapixels), which is more than 3x the previous limit.

This matters for:

Reading dense charts and financial documents
Analyzing UI screenshots and design mockups
Processing scanned PDFs with fine text
Code screenshots and terminal output images

3. New "xhigh" Effort Tier

Anthropic added a new effort level between "high" and "max." The full ladder is now: off → minimal → low → medium → high → xhigh → max

import anthropic

client = anthropic.Anthropic()

# Using the new xhigh effort tier
response = client.messages.create(
    model="claude-opus-4-7-20260416",
    max_tokens=16000,
    thinking={
        "type": "enabled",
        "budget_tokens": 10000,
        "effort": "xhigh"  # New in 4.7
    },
    messages=[{
        "role": "user",
        "content": "Architect a distributed rate limiter for 10M req/sec"
    }]
)

xhigh sits between high and max — giving you more reasoning depth without burning the maximum thinking budget every time. Good for challenging problems where "high" isn't enough but "max" is overkill.

4. Better Long-Context and Agentic Memory

One of the quiet but significant improvements: Opus 4.7 handles file-system-based memory significantly better across long, multi-session work. This is the backbone of reliable overnight agent runs — the model writes notes, reads them back, and continues coherently where it left off.

5. Document and Professional Task Quality

Anthropic describes 4.7 as more "tasteful and creative" for professional tasks — better interfaces, slides, financial analysis, and multi-step research. The model reasons through underspecified requests and self-verifies before outputting.

The Controversy: What They Don't Tell You

The Tokenizer Tax (Hidden Cost Increase)

Here's the part Anthropic buried in the fine print: Opus 4.7 uses a new tokenizer. The sticker price is identical to 4.6 ($5 input / $25 output per million tokens) — but the same text now costs up to 35% more tokens to process.

Real-world impact: if you were spending $100/month on Opus 4.6 for a specific workload, you might spend $135/month on 4.7 at "the same price." This caught a lot of developers off guard.

"The price looks the same but the tokenizer change is a silent 20-35% price hike for most workloads." — Reddit r/ClaudeAI

Long-Context Retrieval Regression

Some users report a regression in long-context retrieval compared to 4.6. For workloads that depend on finding specific information buried deep in large documents, 4.7 doesn't always perform better. Worth testing your specific use case before migrating.

Claude Code "Malware" False Positives

There are reports of Claude Code incorrectly flagging benign code as "malware" in Opus 4.7. Likely a side effect of the more aggressive safety tuning. Minor but annoying if you're working with security tools or low-level code.

API Breaking Changes

If you're upgrading existing integrations, watch out:

thinking.budget_tokens, temperature, and top_p now return 400 errors in certain combinations
Reasoning traces now default to hidden (you need to explicitly request them)

# BEFORE (4.6) - reasoning traces were visible by default
# AFTER (4.7) - need to explicitly enable
response = client.messages.create(
    model="claude-opus-4-7-20260416",
    thinking={
        "type": "enabled",
        "budget_tokens": 8000
        # reasoning traces now hidden unless you request them
    },
    ...
)

Benchmarks: How Does It Stack Up?

Benchmark	What It Tests	Opus 4.7 vs 4.6
SWE-bench Pro	Real-world software engineering	Significant jump ↑
CursorBench	IDE-style coding tasks	Notable improvement ↑
Long-context retrieval	Finding info in large docs	Mixed (possible regression)
Vision/document analysis	Image + PDF understanding	Major improvement ↑
Hallucination rate	Factual accuracy	Reduced ↓ (good)
Reward hacking	Gaming evaluations	Lower rates ↓ (good)

Should You Use It?

✅ Use Opus 4.7 if:

You're running complex, long-horizon agentic coding tasks
Your workflow involves analyzing high-resolution images, charts, or dense documents
You need a model that self-verifies and carries tasks to completion
You use Claude Code heavily (xhigh default + /ultrareview is genuinely useful)
You need overnight reliability for multi-session agent work

❌ Stick with Opus 4.6 (or use Sonnet) if:

You have a fixed API budget and haven't accounted for the tokenizer cost increase
Your primary workload is long-context document retrieval (test first)
You have existing integrations using temperature + thinking together (breaking change)
Cost efficiency matters more than raw capability (Sonnet is often 80% of the quality at 20% of the price)

Practical Tips for Migrating to Opus 4.7

import anthropic

# 1. Update your model string
MODEL = "claude-opus-4-7-20260416"  # new

# 2. Check for breaking API changes
# temperature + thinking together now errors
# Use EITHER temperature OR thinking, not both

# 3. Account for tokenizer cost increase
# Run a quick test: take your most expensive prompt
# and compare token counts between 4.6 and 4.7

# 4. Enable reasoning traces explicitly if you need them
response = client.messages.create(
    model=MODEL,
    max_tokens=8192,
    thinking={"type": "enabled", "budget_tokens": 5000},
    messages=[{"role": "user", "content": your_prompt}]
)
# traces are in response.content[0] (thinking block)
# followed by response.content[1] (text block)

FAQ

Is Claude Opus 4.7 the most powerful Claude model?

It's the most powerful publicly available Claude model. Anthropic has an unreleased model called "Claude Mythos" that's reportedly more powerful but was deemed too risky for public release.

How much does Claude Opus 4.7 cost?

The listed price is $5 per million input tokens and $25 per million output tokens — same as 4.6. However, the new tokenizer means the same text uses up to 35% more tokens, so real costs are effectively higher. Factor this into your budget.

Can I use Opus 4.7 with Claude Code?

Yes — and Claude Code now defaults to Opus 4.7 with xhigh effort. The new /ultrareview command is exclusive to Claude Code with Opus 4.7.

What's the context window?

1 million tokens, same as Opus 4.6.

Is Opus 4.7 worth it over Sonnet?

For complex agentic tasks, systems engineering, and high-stakes reasoning: yes. For standard content generation, summarization, and simpler coding: Sonnet 4.x still offers much better value. Use Opus 4.7 where complexity and reliability matter most.

Claude Opus 4.7 Deep Dive: Better Coding, 3x Vision, and the Hidden Cost You Need to Know