Gawk · 2026-05-16 · 1 tool incident
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- claude-opus-4-6-thinking holds #1 on LMArena for the 25th consecutive snapshot.
Benchmark movers
No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1498 Elo
#3 claude-opus-4-7-thinking
anthropic · 1485 Elo
Source: lmarena.ai
Tool Health
1 incident in the last 24h (vs 3 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
GPT5.5 Performance Degradation
started 16:11 UTC · ongoing
Source: status.openai.com
Top HN stories
Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Show HN: Find the best local LLM for your hardware, ranked by benchmarks
132 points · 15 comments
UK sovereign LLM inference
85 points · 83 comments
OpenAI is connecting ChatGPT to bank accounts via Plaid
57 points · 80 comments
Show HN: Claude Code vs. Codex Global Usage Leaderboard
10 points · 10 comments
Bug Archeology: Solving a decade-old Swift/C++ mystery with LLMs
9 points · 3 comments
Source: news.ycombinator.com
SDK adoption
5 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
npm: openai
−239.2k 24h downloads day-over-day
crates.io: ort
+30.4k 90d downloads day-over-day
Docker Hub: ollama/ollama
+336.0k all-time pulls day-over-day
Homebrew: ollama
+586 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+14.9k cumulative installs day-over-day
Source: npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Agent frameworks
4 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
CrewAI · +62%
3.2M weekly downloads · 52k stars
smolagents · +39%
170k weekly downloads · 27k stars
AutoGen · +11%
369k weekly downloads · 58k stars
Tracks the live `autogen-agentchat` package — the deprecated bare `autogen` PyPI name is a different abandoned project and is not counted here.
LangGraph · +11%
15M weekly downloads · 32k stars
Source: pypistats.org
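The >10% threshold above can be read as a week-over-week percent change in downloads. What follows is a minimal sketch under that assumption, not Gawk's actual pipeline: the function names `pct_change` and `weekly_movers`, the package names, and the sample figures are all hypothetical.

```python
def pct_change(previous: int, current: int) -> float:
    """Week-over-week change as a percentage of the previous week's total."""
    if previous <= 0:
        raise ValueError("previous week's total must be positive")
    return (current - previous) / previous * 100.0


def weekly_movers(
    totals: dict[str, tuple[int, int]], threshold: float = 10.0
) -> dict[str, float]:
    """Return packages whose absolute percent change exceeds the threshold.

    totals maps a package name to (previous_week, current_week) downloads.
    """
    changes = {name: pct_change(prev, cur) for name, (prev, cur) in totals.items()}
    return {
        name: round(change, 1)
        for name, change in changes.items()
        if abs(change) > threshold
    }


# Hypothetical figures for illustration only (not real download counts).
sample = {
    "pkg-a": (100_000, 162_000),  # up 62%
    "pkg-b": (200_000, 210_000),  # up 5%, below the threshold
    "pkg-c": (50_000, 40_000),    # down 20%
}
movers = weekly_movers(sample)
print(movers)
```

Using the absolute value means sharp declines are reported alongside gains, which seems consistent with "moved" in the headline, though the source does not say how declines are treated.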
Notable AI Lab activity
11 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
Anthropic
−26 events (now 174) · San Francisco, US
OpenAI
−8 events (now 93) · San Francisco, US
Ollama
−5 events (now 91) · San Francisco, US
Microsoft Research
−20 events (now 87) · Redmond, US
Hugging Face
−26 events (now 72) · New York, US
New mover: xAI
65 events · Palo Alto, US
New mover: AMD
65 events · Santa Clara, US
DeepSeek
−36 events (now 60) · Hangzhou, CN
Weights & Biases
−55 events (now 34) · San Francisco, US
Dropped off: Replicate
was 31 events · San Francisco, US
Dropped off: Meta FAIR
was 18 events · Menlo Park, US
Source: gawk.dev
Model Usage
tencent/hy3-preview holds #1 · biggest mover: tencent/hy3-preview +37 to #1 · biggest drop: z-ai/glm-5 −11
Why this matters · OpenRouter rankings reflect API-first developer spend; usage that never passes through OpenRouter, such as consumer ChatGPT, is invisible by construction.
tencent/hy3-preview climbed +37 to #1
was #38 on 2026-05-09
OpenRouter request volume reflects developer API spending, not end-user adoption. It is biased toward API-first workflows that route through OpenRouter; direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
z-ai/glm-5 slipped −11 to #36
was #25 on 2026-05-09
#1 tencent/hy3-preview
#2 deepseek/deepseek-v4-flash
#3 anthropic/claude-opus-4.7
Source: openrouter.ai
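The mover figures above are plain rank arithmetic: a model that was #38 and is now #1 has gained 37 places. Here is a minimal sketch of that calculation, assuming two snapshots that map each model to its rank. The helper names `rank_deltas` and `biggest_movers` are hypothetical, and only the two models whose earlier ranks are stated above are included.

```python
def rank_deltas(previous: dict[str, int], current: dict[str, int]) -> dict[str, int]:
    """Places gained (positive) or lost (negative) for models in both snapshots."""
    return {
        model: previous[model] - current[model]
        for model in current
        if model in previous
    }


def biggest_movers(deltas: dict[str, int]) -> tuple[str, str]:
    """Return (biggest climber, biggest faller) by rank delta."""
    climber = max(deltas, key=deltas.get)
    faller = min(deltas, key=deltas.get)
    return climber, faller


# Ranks as reported above for 2026-05-09 and 2026-05-16; other models omitted.
previous = {"tencent/hy3-preview": 38, "z-ai/glm-5": 25}
current = {"tencent/hy3-preview": 1, "z-ai/glm-5": 36}

deltas = rank_deltas(previous, current)
print(deltas)
print(biggest_movers(deltas))
```

Models that appear in only one snapshot are skipped here. A fuller version would need its own rule for new entrants and drop-offs, which the source does not specify.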