Gawk · 2026-06-18
Gawk — 2026-06-18 · 4 tool incidents
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · Open-weight models hold 0 of the OpenRouter top 5 (vs 1 yesterday).
- · claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
- · ai downloads grew for the 6th consecutive snapshot.
Benchmark movers
#1 change in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
claude-opus-4-6-thinking
+10 Elo (now 1511) · anthropic
Source: lmarena.ai
Tool Health
#4 incidents in the last 24h (vs 7 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors for Claude Opus 4.8
started 15:34 UTC · resolved 16:28 UTC · minor impact
SSO login errors for some ChatGPT Enterprise workspaces
started 08:55 UTC · ongoing · minor impact
ChatGPT failing to load or save
started 03:26 UTC · resolved 03:55 UTC · minor impact
Errors with conversations on Android and iOS devices
started 17:20 UTC · resolved 18:23 UTC · major impact
Source: status.anthropic.com, status.openai.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Leaked financial docs show OpenAI is losing billions of dollars a year
121 points · 70 comments
Local Qwen isn't a worse Opus, it's a different tool
106 points · 34 comments
ChatGPT Spontaneously Generates Sexual Violence and Hardcore Snuff Imagery
47 points · 42 comments
A Robot Is Sprinting Towards You: Do You Want It Running on Claude or Grok?
35 points · 20 comments
Noam Shazeer is joining OpenAI
19 points · 1 comments
Source: news.ycombinator.com
SDK adoption
#5 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
npm: ai
+356.0k 24h downloads day-over-day
crates.io: ort
+37.0k 90d downloads day-over-day
Docker Hub: ollama/ollama
+297.9k all-time pulls day-over-day
Homebrew: ollama
+677 30d downloads day-over-day
VS Code Marketplace: GitHub.copilot
+15.1k cumulative installs day-over-day
Source: npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Agent frameworks
#5 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
Pydantic AI · +102%
6.2M weekly downloads · 18k stars
pypistats.org · View on Gawk →
Tracks the `pydantic-ai` meta-package; the lean `pydantic-ai-slim` variant is roughly 4× larger but users typically compare the meta number.
smolagents · +47%
162k weekly downloads · 28k stars
LangGraph · +26%
18M weekly downloads · 35k stars
AutoGen · +25%
340k weekly downloads · 59k stars
pypistats.org · View on Gawk →
Tracks the live `autogen-agentchat` package — the deprecated bare `autogen` PyPI name is a different abandoned project and is not counted here.
CrewAI · +24%
2.9M weekly downloads · 54k stars
Source: pypistats.org, pypistats.org, pypistats.org, pypistats.org, pypistats.org
Notable AI Lab activity
#10 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
LangChain
−6 events (now 182) · San Francisco, US
Anthropic
−31 events (now 163) · San Francisco, US
Microsoft Research
−62 events (now 104) · Redmond, US
Ollama
+6 events (now 98) · San Francisco, US
Hugging Face
−5 events (now 87) · New York, US
Weights & Biases
+6 events (now 76) · San Francisco, US
New mover: Intel AI
41 events · Santa Clara, US
New mover: INRIA
33 events · Paris, FR
Dropped off: Mistral AI
was 185 events · Paris, FR
Dropped off: Replicate
was 81 events · San Francisco, US
Source: gawk.dev
Model Usage
#google/gemini-3.1-flash-image holds #1 · biggest drop: tencent/hy3-preview -24
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
tencent/hy3-preview slipped -24 to #50
was #26 on 2026-04-26
#1 google/gemini-3.1-flash-image
#2 google/gemini-3-pro-image
#3 cohere/north-mini-code:free
Source: openrouter.ai