Gawk · 2026-06-12
Gawk — 2026-06-12 · 4 tool incidents
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
- · diffusers downloads declined for the 4th consecutive snapshot.
Benchmark movers
#1 change in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
claude-opus-4-6-thinking
+10 Elo (now 1511) · anthropic
Source: lmarena.ai
Tool Health
#4 incidents in the last 24h (vs 1 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors on Claude Opus 4.6
started 14:37 UTC · resolved 15:01 UTC · minor impact
Elevated 431 Errors
started 22:22 UTC · resolved 00:47 UTC · minor impact
Elevated error rates for GPT 5.5 in Codex
started 20:59 UTC · resolved 22:28 UTC · minor impact
Disruption in service availabilty for Free and Go users
started 13:03 UTC · resolved 14:04 UTC · minor impact
Source: status.anthropic.com, status.openai.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
OpenAI Prepping for On-Prem Product?
19 points · 8 comments
Kickbacks: An ad marketplace for coding agent spinners
9 points · 0 comments
He Hacked Teslas for Elon Musk. Now He's Launching a $100M AI Cyber Agent
9 points · 1 comments
Shall we play a game? – LLMs use tactical nukes in 95% of simulations
9 points · 2 comments
AI Agent Bankrupted Their Operator While Trying to Scan DN42
5 points · 1 comments
Source: news.ycombinator.com
SDK adoption
#5 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: anthropic
+6.7M 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
crates.io: ort
+37.6k 90d downloads day-over-day
Docker Hub: ollama/ollama
+320.1k all-time pulls day-over-day
Homebrew: ollama
+1.2k 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+10.5k cumulative installs day-over-day
Source: pypistats.org, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Agent frameworks
#5 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
Pydantic AI · -59%
3.1M weekly downloads · 18k stars
pypistats.org · View on Gawk →
Tracks the `pydantic-ai` meta-package; the lean `pydantic-ai-slim` variant is roughly 4× larger but users typically compare the meta number.
smolagents · -41%
108k weekly downloads · 28k stars
CrewAI · -21%
2.3M weekly downloads · 53k stars
AutoGen · -20%
272k weekly downloads · 59k stars
pypistats.org · View on Gawk →
Tracks the live `autogen-agentchat` package — the deprecated bare `autogen` PyPI name is a different abandoned project and is not counted here.
LangGraph · -13%
14M weekly downloads · 35k stars
Source: pypistats.org, pypistats.org, pypistats.org, pypistats.org, pypistats.org
Notable AI Lab activity
#9 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
LangChain
+5 events (now 188) · San Francisco, US
Microsoft Research
+7 events (now 118) · Redmond, US
OpenAI
−51 events (now 113) · San Francisco, US
Hugging Face
+24 events (now 102) · New York, US
Ollama
+6 events (now 95) · San Francisco, US
Weights & Biases
+6 events (now 79) · San Francisco, US
DeepSeek
−17 events (now 71) · Hangzhou, CN
New mover: AMD
52 events · Santa Clara, US
Dropped off: Google Research
was 89 events · Mountain View, US
Source: gawk.dev