Gawk · 2026-06-04
Gawk — 2026-06-04 · 1 tool incident
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
- · anthropic downloads grew for the 4th consecutive snapshot.
- · transformers downloads grew for the 4th consecutive snapshot.
Benchmark movers
#No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1499 Elo
#2 claude-opus-4-6
anthropic · 1498 Elo
#3 claude-opus-4-7-thinking
anthropic · 1486 Elo
Source: lmarena.ai
Tool Health
#1 incident in the last 24h (vs 6 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Increased latency for Codex compaction for a subset of users
started 02:57 UTC · resolved 04:37 UTC · minor impact
Source: status.openai.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
The ways we contain Claude across products
31 points · 4 comments
I built a vulnerable app and spent $1,500 seeing if LLMs could hack it
25 points · 7 comments
Gemini Spark is the most impressive and terrifying AI experience I've had yet
7 points · 4 comments
Train your own LLM? Here's what happens
6 points · 0 comments
Reddit user creates DB and MCP to mine Polygon, finds patterns on Polymarket
5 points · 0 comments
Source: news.ycombinator.com
SDK adoption
#6 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: transformers
−294.9k 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
+1.8M 24h downloads day-over-day
crates.io: tch
+8.9k 90d downloads day-over-day
Docker Hub: ollama/ollama
+316.5k all-time pulls day-over-day
Homebrew: ollama
+872 30d downloads day-over-day
VS Code Marketplace: saoudrizwan.claude-dev
+18.3k cumulative installs day-over-day
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Notable AI Lab activity
#6 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
OpenAI
+25 events (now 156) · San Francisco, US
Microsoft Research
+14 events (now 100) · Redmond, US
DeepSeek
−5 events (now 80) · Hangzhou, CN
Weights & Biases
+14 events (now 80) · San Francisco, US
New mover: Stanford CRFM
36 events · Stanford, US
Dropped off: Replicate
was 29 events · San Francisco, US
Source: gawk.dev
Model Usage
#tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-flash +45 to #2 · biggest drop: moonshotai/kimi-k2.5 -24
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
deepseek/deepseek-v4-flash climbed +45 to #2
was #47 on 2026-04-26
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
moonshotai/kimi-k2.5 slipped -24 to #43
was #19 on 2026-04-26
#1 tencent/hy3-preview
#2 deepseek/deepseek-v4-flash
#3 xiaomi/mimo-v2.5
Source: openrouter.ai