GrabStack.

Signal, not noise — the living guide to AI tools, agents and stacks.

From The Wire

All dispatches →

Anthropic ships Claude Opus 4 — tops SWE-bench, adds extended thinking

Posted 2026-05-23 · Review by 2026-07-23

Claude Opus 4 takes #1 on SWE-bench Verified. Extended thinking and parallel tool use make it the strongest coding agent model available.

OpenAI deprecates Sora — exits the consumer video race

Posted 2026-04-26 · Review by 2026-09-26

Sora 2 is officially deprecated. The API shuts down 24 September 2026. OpenAI is refocusing on enterprise video tooling.

The Capitals

See all →

The key players — the dominant labs and flagship models.

Claude Code

Anthropic

Top Ranked

Terminal-native agent that consistently ranks #1 across independent benchmarks; works in terminal, IDE and browser.

Claude Cowork

Anthropic

Flagship

Desktop agent for macOS and Windows that edits files, organises folders, synthesises research and runs scheduled multi-step tasks in a privacy-first sandbox.

Claude Opus 4.8

Anthropic

Flagship

The current flagship (with Sonnet 4.6 and Haiku 4.5); at or near the top for complex multi-file coding and long-context technical work, and the most natural for prose.

Gemini 3.1 Pro

Google

Top Ranked

Leads most published reasoning benchmarks and has the cheapest output among the majors; paired with Gemini Spark for long cloud tasks.

HappyHorse

Alibaba

Top Ranked

Currently tops the Artificial Analysis leaderboard.

Nano Banana Pro

Google

Top Ranked

Gemini 3 Pro Image; leads the image-arena leaderboard, with the best multilingual text rendering, free in the Gemini app.

Seedance 2.0

ByteDance

Top Ranked

Currently tops the Artificial Analysis leaderboard.

The Cemetery

See all →

The dead and the zombies — tools that died or quietly stopped mattering.

Limitless / Bee

acquired

Meta acquired Limitless (Dec 2025); Amazon acquired Bee (mid-2025).

Sora 2

OpenAI

OpenAI stepped back from the consumer video race.

Explore