Gawk · 2026-05-18
Gawk — 2026-05-18 · 1 tool incident
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- · claude-opus-4-6-thinking holds #1 on LMArena for the 27th consecutive snapshot.
- · ai downloads grew for the 4th consecutive snapshot.
- · openai downloads grew for the 4th consecutive snapshot.
Benchmark movers
#No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1498 Elo
#3 claude-opus-4-7-thinking
anthropic · 1485 Elo
Source: lmarena.ai
Tool Health
#1 incident in the last 24h (vs 1 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated errors on Claude Haiku 4.5
started 06:12 UTC · resolved 08:07 UTC · minor impact
Source: status.anthropic.com
Top HN stories
#Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Mistral's CEO: Europe has 2 years to stop becoming America's AI 'vassal state'
76 points · 99 comments
Why Crouching Tiger, Hidden Dragon Is a Masterpiece
12 points · 1 comments
Agentic Trading with Safe Guardrails
11 points · 2 comments
I use LLMs as a staff engineer in 2026
6 points · 1 comments
How to use codex to get the most out of it
5 points · 0 comments
Source: news.ycombinator.com
SDK adoption
#6 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: langchain
−275.7k 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
−1.5M 24h downloads day-over-day
crates.io: ort
−11.4k 90d downloads day-over-day
Docker Hub: ollama/ollama
+314.8k all-time pulls day-over-day
Homebrew: ollama
−1.5k 30d downloads day-over-day
VS Code Marketplace: Continue.continue
+9.4k cumulative installs day-over-day
Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
Agent frameworks
#3 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
CrewAI · +94%
3.9M weekly downloads · 52k stars
smolagents · +29%
167k weekly downloads · 27k stars
OpenAI Agents · +14%
9.0M weekly downloads · 26k stars
Source: pypistats.org, pypistats.org, pypistats.org
Notable AI Lab activity
#5 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
LangChain
−5 events (now 190) · San Francisco, US
OpenAI
+28 events (now 123) · San Francisco, US
DeepSeek
−21 events (now 82) · Hangzhou, CN
xAI
−9 events (now 27) · Palo Alto, US
INRIA
−21 events (now 23) · Paris, FR
Source: gawk.dev
Model Usage
#tencent/hy3-preview holds #1 · biggest mover: z-ai/glm-4.7 +21 to #29 · biggest drop: z-ai/glm-5 -11
Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
z-ai/glm-4.7 climbed +21 to #29
was #50 on 2026-05-11
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
z-ai/glm-5 slipped -11 to #38
was #27 on 2026-05-11
#1 tencent/hy3-preview
#2 deepseek/deepseek-v4-flash
#3 anthropic/claude-sonnet-4.6
Source: openrouter.ai