Gawk · 2026-05-12
Gawk — 2026-05-12 · 1 tool incident
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- claude-opus-4-6-thinking holds #1 on LMArena for the 21st consecutive snapshot.
Benchmark movers
No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1496 Elo
#3 gemini-3.1-pro-preview
google · 1487 Elo
Source: lmarena.ai
Tool Health
1 incident in the last 24h (vs 1 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated error rates with GPT 5.5
started 17:02 UTC · resolved 17:59 UTC · minor impact
Source: status.openai.com
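One way to read "the 7-day shape" is to count how many distinct days in the window had at least one incident. The sketch below is a minimal illustration under that assumption: the function name `is_flapping` and the threshold of three days are hypothetical, not how this digest computes anything, and the sample window is placeholder data apart from its last two values.

```python
def is_flapping(daily_incidents: list[int], min_days: int = 3) -> bool:
    """Return True when incidents occurred on at least `min_days` distinct days.

    `daily_incidents` holds one incident count per day, oldest first.
    The 3-day default threshold is an assumption chosen for illustration.
    """
    days_with_incidents = sum(1 for count in daily_incidents if count > 0)
    return days_with_incidents >= min_days


# Placeholder 7-day window, oldest first. Only the last two values
# (1 yesterday, 1 today) come from the digest; the first five are invented.
window = [0, 0, 1, 0, 0, 1, 1]
print(is_flapping(window))
```

Counting distinct days rather than total incidents separates a provider with one bad afternoon from one that degrades a little every other day.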
Top HN stories
Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Claude Platform on AWS
56 points · 29 comments
Fake building: Claude wrote 3k lines instead of import pywikibot
36 points · 17 comments
Arcadia, CA, Mayor Federally Charged with Acting as Illegal Agent of PRC, Pleads
15 points · 7 comments
Show HN: Agent FM – local, open-source radio for Claude Code and Codex agents
7 points · 0 comments
Supercomputer networking to accelerate large scale AI training
7 points · 0 comments
Source: news.ycombinator.com
SDK adoption
6 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: diffusers
+91.6k 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
−1.5M 24h downloads day-over-day
crates.io: ort
+6.3k 90d downloads day-over-day
Docker Hub: ollama/ollama
+307.3k all-time pulls day-over-day
Homebrew: ollama
−973 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+17.3k cumulative installs day-over-day
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Agent frameworks
3 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
CrewAI · +21%
2.2M weekly downloads · 51k stars
smolagents · +15%
142k weekly downloads · 27k stars
AutoGen · +11%
359k weekly downloads · 58k stars
Tracks the live `autogen-agentchat` package — the deprecated bare `autogen` PyPI name is a different abandoned project and is not counted here.
Source: pypistats.org
Notable AI Lab activity
8 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
Anthropic
−12 events (now 188) · San Francisco, US
Microsoft Research
+25 events (now 131) · Redmond, US
OpenAI
−39 events (now 109) · San Francisco, US
DeepSeek
−23 events (now 55) · Hangzhou, CN
Google Research
+32 events (now 49) · Mountain View, US
Meta FAIR
−12 events (now 20) · Menlo Park, US
Moonshot AI
−11 events (now 13) · Beijing, CN
Google DeepMind
−8 events (now 12) · London, GB
Source: gawk.dev
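The absolute deltas above are easier to compare once they are set against each lab's previous count. A minimal sketch, assuming the previous count is simply the current count minus the reported change; the helper name `relative_change` is hypothetical and the tuples are copied from the list above.

```python
# (lab, change in events over 24h, current event count), copied from the list above
labs = [
    ("Anthropic", -12, 188),
    ("Microsoft Research", 25, 131),
    ("OpenAI", -39, 109),
    ("DeepSeek", -23, 55),
    ("Google Research", 32, 49),
    ("Meta FAIR", -12, 20),
    ("Moonshot AI", -11, 13),
    ("Google DeepMind", -8, 12),
]


def relative_change(delta: int, now: int) -> float:
    """Percent change relative to the previous count (assumed to be now - delta)."""
    previous = now - delta
    return delta / previous * 100


for name, delta, now in labs:
    print(f"{name}: {now - delta} -> {now} ({relative_change(delta, now):+.0f}%)")
```

The same drop of 12 events is a 6% dip for Anthropic (200 to 188) but a 37.5% dip for Meta FAIR (32 to 20), which is why small absolute numbers on low-volume labs deserve a second look.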
Model Usage
moonshotai/kimi-k2.6 holds #1 · biggest mover: openrouter/owl-alpha +32 to #16 · biggest drop: z-ai/glm-5-turbo −15
Why this matters · OpenRouter rankings reflect API-first developer spend; direct usage, such as consumer ChatGPT, never passes through OpenRouter and is invisible by construction.
openrouter/owl-alpha climbed +32 to #16
was #48 on 2026-05-05
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
z-ai/glm-5-turbo slipped −15 to #45
was #30 on 2026-05-05
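The signed moves above follow from the two rank snapshots: previous rank minus current rank, so a positive number is a climb toward #1. A small sketch of that arithmetic; the function name `rank_change` is hypothetical and the ranks are the ones quoted above.

```python
def rank_change(previous_rank: int, current_rank: int) -> int:
    """Positive means the model climbed (moved toward #1); negative means it slipped."""
    return previous_rank - current_rank


# Ranks copied from the two movers above (2026-05-05 snapshot vs. today).
owl_alpha = rank_change(previous_rank=48, current_rank=16)
glm_5_turbo = rank_change(previous_rank=30, current_rank=45)

print(f"openrouter/owl-alpha: {owl_alpha:+d}")
print(f"z-ai/glm-5-turbo: {glm_5_turbo:+d}")
```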
#1 moonshotai/kimi-k2.6
#2 anthropic/claude-sonnet-4.6
#3 tencent/hy3-preview
Source: openrouter.ai