Gawk · 2026-05-05
Gawk — 2026-05-05 · 4 tool incidents
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · claude-opus-4-6-thinking holds #1 on LMArena for the 14th consecutive snapshot.
- · @anthropic-ai/sdk downloads grew for the 10th consecutive snapshot.
- · llamaindex downloads declined for the 10th consecutive snapshot.
Benchmark movers
#No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1496 Elo
#3 gemini-3.1-pro-preview
google · 1487 Elo
Source: lmarena.ai
Tool Health
#4 incidents in the last 24h (vs 1 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors on Claude Opus 4.5 and Sonnet 4.5
started 13:59 UTC · resolved 14:45 UTC · minor impact
Elevated errors on Claude Opus 4.7
started 14:07 UTC · resolved 14:33 UTC · minor impact
Issue affecting some pages on the ChatGPT website
started 15:29 UTC · resolved 17:10 UTC
Windsurf Connectivity Issues
started 15:08 UTC · resolved 16:00 UTC · minor impact
Source: status.anthropic.com, status.openai.com, status.windsurf.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
How OpenAI delivers low-latency voice AI at scale
81 points · 30 comments
Train Your Own LLM from Scratch
75 points · 9 comments
Lessons for Agentic Coding: What should we do when code is cheap?
29 points · 25 comments
Agent Skills
29 points · 3 comments
Ask HN: Best Embedding Models?
11 points · 3 comments
Source: news.ycombinator.com
SDK adoption
#6 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: openai
+3.8M 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
−259.2k 24h downloads day-over-day
crates.io: ort
+8.7k 90d downloads day-over-day
Docker Hub: ollama/ollama
+296.4k all-time pulls day-over-day
Homebrew: ollama
−3.3k 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+18.6k cumulative installs day-over-day
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Notable AI Lab activity
#9 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
OpenAI
+31 events (now 116) · San Francisco, US
Microsoft Research
+38 events (now 98) · Redmond, US
DeepSeek
+25 events (now 92) · Hangzhou, CN
Google Research
+34 events (now 43) · Mountain View, US
New mover: Stanford CRFM
28 events · Stanford, US
New mover: Meta FAIR
17 events · Menlo Park, US
Google DeepMind
−15 events (now 10) · London, GB
Dropped off: Moonshot AI
was 9 events · Beijing, CN
Dropped off: xAI
was 8 events · Palo Alto, US
Source: gawk.dev
Model Usage
#tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-flash +23 to #7 · biggest drop: openai/gpt-oss-120b -18
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
deepseek/deepseek-v4-flash climbed +23 to #7
was #30 on 2026-04-28
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
openai/gpt-oss-120b slipped -18 to #39
was #21 on 2026-04-28
#1 tencent/hy3-preview
#2 moonshotai/kimi-k2.6
#3 anthropic/claude-sonnet-4.6
Source: openrouter.ai