Gawk · 2026-05-09

Gawk — 2026-05-09 · 8 tool incidents

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

What moved

· Open-weight models hold 2 of the OpenRouter top 5 (vs 1 yesterday).
· claude-opus-4-6-thinking holds #1 on LMArena for the 18th consecutive snapshot.
· llamaindex downloads declined for the 14th consecutive snapshot.

Benchmark movers

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
LMArena · 2026-05-01
#2 claude-opus-4-6
anthropic · 1496 Elo
LMArena · 2026-05-01
#3 gemini-3.1-pro-preview
google · 1487 Elo
LMArena · 2026-05-01

Source: lmarena.ai

Anchor link

Share · LinkedIn Share · X

Tool Health

8 incidents in the last 24h (vs 2 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-09. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

Elevated errors on Claude Opus 4.1
started 07:57 UTC · resolved 08:22 UTC · minor impact
Anthropic status page
Claude Code IDE extension unable to load on Windows
started 22:30 UTC · resolved 00:24 UTC · major impact
Anthropic status page
Elevated errors on Claude Opus 4.7
started 17:01 UTC · resolved 17:25 UTC · minor impact
Anthropic status page
Elevated errors on Claude Sonnet 4.6
started 15:03 UTC · resolved 15:11 UTC · minor impact
Anthropic status page
Elevated errors across Claude Models
started 09:49 UTC · resolved 11:40 UTC · major impact
Anthropic status page
Elevated errors for Responses API
started 00:07 UTC · resolved 00:08 UTC · major impact
OpenAI status page
Degraded Performance with Codex Cloud Tasks
started 15:07 UTC · resolved 16:17 UTC · minor impact
OpenAI status page
Increased Error Rate for gpt-5.5 model in the API
started 12:31 UTC · resolved 14:14 UTC · minor impact
OpenAI status page
openai-api: Degraded → Operational
Statuspage
codex: Degraded → Operational
Statuspage

Source: status.anthropic.com, status.openai.com, status.openai-api.com, status.codex.com

Anchor link

Share · LinkedIn Share · X

SDK adoption

6 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

PyPI: torch
−212.3k 24h downloads day-over-day
PyPI · View on Gawk →
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: @anthropic-ai/sdk
−79.2k 24h downloads day-over-day
npm · View on Gawk →
crates.io: ort
+27.6k 90d downloads day-over-day
crates.io · View on Gawk →
Docker Hub: ollama/ollama
+333.9k all-time pulls day-over-day
Docker Hub · View on Gawk →
Homebrew: ollama
+189 30d downloads day-over-day
Homebrew · View on Gawk →
VS Code Marketplace: GitHub.copilot
+14.4k cumulative installs day-over-day
VS Code Marketplace · View on Gawk →

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com

Anchor link

Share · LinkedIn Share · X

Notable AI Lab activity

4 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

Microsoft Research
−36 events (now 106) · Redmond, US
Gawk · Labs
Google Research
+77 events (now 85) · Mountain View, US
Gawk · Labs
DeepSeek
−5 events (now 69) · Hangzhou, CN
Gawk · Labs
Google DeepMind
+10 events (now 20) · London, GB
Gawk · Labs

Source: gawk.dev

Anchor link

Share · LinkedIn Share · X

Model Usage

moonshotai/kimi-k2.6 holds #1 · biggest mover: deepseek/deepseek-v4-pro +16 to #8 · biggest drop: tencent/hy3-preview -37

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.

deepseek/deepseek-v4-pro climbed +16 to #8
was #24 on 2026-05-02
OpenRouter · View on Gawk →
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
tencent/hy3-preview slipped -37 to #38
was #1 on 2026-05-02
OpenRouter · View on Gawk →
#1 moonshotai/kimi-k2.6
OpenRouter · View on Gawk →
#2 anthropic/claude-sonnet-4.6
OpenRouter · View on Gawk →
#3 anthropic/claude-opus-4.7
OpenRouter · View on Gawk →

Source: openrouter.ai

Anchor link

Share · LinkedIn Share · X

Benchmark movers

Tool Health

Top HN stories

SDK adoption

Notable AI Lab activity

Model Usage