Gawk · 2026-05-08
Gawk — 2026-05-08 · 2 tool incidents
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · claude-opus-4-6-thinking holds #1 on LMArena for the 17th consecutive snapshot.
- · llamaindex downloads declined for the 13th consecutive snapshot.
- · openai downloads declined for the 5th consecutive snapshot.
Benchmark movers
#No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1496 Elo
#3 gemini-3.1-pro-preview
google · 1487 Elo
Source: lmarena.ai
Tool Health
#2 incidents in the last 24h (vs 3 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors on Claude Opus 4.7
started 12:10 UTC · resolved 12:45 UTC · minor impact
Elevated transcription failures affecting ChatGPT & Codex
started 17:15 UTC · resolved 08:31 UTC · minor impact
claude-code: Degraded → Operational
openai-api: Operational → Degraded
codex: Operational → Degraded
Source: status.anthropic.com, status.openai.com, status.claude-code.com, status.openai-api.com, status.codex.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Dirtyfrag: Universal Linux LPE
85 points · 37 comments
AlphaEvolve: Gemini-powered coding agent scaling impact across fields
44 points · 2 comments
Natural Language Autoencoders: Turning Claude's Thoughts into Text
34 points · 7 comments
Agents need control flow, not more prompts
24 points · 5 comments
Agentic Engineering
22 points · 5 comments
Source: news.ycombinator.com
SDK adoption
#6 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: transformers
−520.4k 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: @anthropic-ai/sdk
+146.0k 24h downloads day-over-day
crates.io: ort
+31.4k 90d downloads day-over-day
Docker Hub: ollama/ollama
+298.7k all-time pulls day-over-day
Homebrew: ollama
−2.0k 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+25.6k cumulative installs day-over-day
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Notable AI Lab activity
#7 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
OpenAI
−18 events (now 142) · San Francisco, US
Microsoft Research
+77 events (now 142) · Redmond, US
DeepSeek
+8 events (now 74) · Hangzhou, CN
Hugging Face
−33 events (now 63) · New York, US
Stanford CRFM
−17 events (now 11) · Stanford, US
Google DeepMind
−7 events (now 10) · London, GB
Google Research
−5 events (now 8) · Mountain View, US
Source: gawk.dev
Model Usage
#tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-pro +19 to #10 · biggest drop: anthropic/claude-sonnet-4.5 -13
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
deepseek/deepseek-v4-pro climbed +19 to #10
was #29 on 2026-05-01
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
anthropic/claude-sonnet-4.5 slipped -13 to #41
was #28 on 2026-05-01
#1 tencent/hy3-preview
#2 moonshotai/kimi-k2.6
#3 anthropic/claude-sonnet-4.6
Source: openrouter.ai