Gawk · 2026-06-01
Gawk — 2026-06-01 · where things stand
First-day snapshot — where things stand now. Diff mode resumes tomorrow once we have two days to compare.
Benchmark movers
#Current LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1499 Elo
#2 claude-opus-4-6
anthropic · 1498 Elo
#3 claude-opus-4-7-thinking
anthropic · 1486 Elo
Source: lmarena.ai
Tool Health
#Current status of 5 tracked tools
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors for Claude Opus 4.7
started 12:15 UTC · ongoing · minor impact
Sonnet 4.5 elevated errors
started 09:08 UTC · resolved 09:28 UTC · minor impact
Opus 4.7 elevated errors
started 06:48 UTC · resolved 07:22 UTC · minor impact
claude-code
Degraded · 1 active incident
openai-api
Operational
codex
Operational
copilot
Operational
windsurf
Operational
Source: status.anthropic.com, status.claude-code.com, status.openai-api.com, status.codex.com, status.copilot.com, status.windsurf.com
Top HN stories
#Top 2 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
My client is replacing me with Claude for all DevOps/infra and most feature dev
6 points · 1 comments
Two LLM UI Patterns That Aren't Chat
5 points · 0 comments
Source: news.ycombinator.com
SDK adoption
#Current adoption leaders across six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: openai
7.0M 24h downloads
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: @anthropic-ai/sdk
2.7M 24h downloads
crates.io: ort
3.9M 90d downloads
Docker Hub: ollama/ollama
139.9M all-time pulls
Homebrew: ollama
57.4k 30d downloads
VS Code Marketplace: GitHub.copilot
73.7M cumulative installs
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Notable AI Lab activity
#Top 5 labs by 24h activity
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
LangChain
199 events · San Francisco, US
Anthropic
198 events · San Francisco, US
OpenAI
161 events · San Francisco, US
Ollama
97 events · San Francisco, US
Hugging Face
86 events · New York, US
Source: gawk.dev
Model Usage
#deepseek/deepseek-v4-flash holds #1 · biggest mover: xiaomi/mimo-v2.5 +37 to #6 · biggest drop: google/gemini-3.1-pro-preview -10
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
xiaomi/mimo-v2.5 climbed +37 to #6
was #43 on 2026-05-24
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
google/gemini-3.1-pro-preview slipped -10 to #26
was #16 on 2026-05-24
#1 deepseek/deepseek-v4-flash
#2 tencent/hy3-preview
#3 anthropic/claude-opus-4.7
Source: openrouter.ai