Gawk · 2026-05-23

Gawk — 2026-05-23 · 2 tool incidents

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

What moved

· claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
· ai downloads grew for the 9th consecutive snapshot.
· openai downloads grew for the 9th consecutive snapshot.

Benchmark movers

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
LMArena · 2026-05-19
#2 claude-opus-4-6
anthropic · 1498 Elo
LMArena · 2026-05-19
#3 gemini-3.5-flash
google · 1486 Elo
LMArena · 2026-05-19

Source: lmarena.ai

Anchor link

Share · LinkedIn Share · X

Tool Health

2 incidents in the last 24h (vs 3 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-23. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

Elevated errors on Claude Opus 4.7
started 10:17 UTC · resolved 10:31 UTC · minor impact
Anthropic status page
Increase in users hitting Codex rate limits
started 16:37 UTC · ongoing · minor impact
OpenAI status page
claude-code: Degraded → Operational
Statuspage
openai-api: Operational → Degraded
Statuspage
codex: Operational → Degraded
Statuspage

Source: status.anthropic.com, status.openai.com, status.claude-code.com, status.openai-api.com, status.codex.com

Anchor link

Share · LinkedIn Share · X

SDK adoption

6 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

PyPI: openai
−1.4M 24h downloads day-over-day
PyPI · View on Gawk →
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
−162.0k 24h downloads day-over-day
npm · View on Gawk →
crates.io: ort
+30.1k 90d downloads day-over-day
crates.io · View on Gawk →
Docker Hub: ollama/ollama
+363.2k all-time pulls day-over-day
Docker Hub · View on Gawk →
Homebrew: ollama
−842 30d downloads day-over-day
Homebrew · View on Gawk →
VS Code Marketplace: GitHub.copilot
+16.7k cumulative installs day-over-day
VS Code Marketplace · View on Gawk →

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com

Anchor link

Share · LinkedIn Share · X

Agent frameworks

3 agent frameworks moved >10% in the last week

Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.

CrewAI · +32%
4.2M weekly downloads · 52k stars
pypistats.org · View on Gawk →
Pydantic AI · -20%
8.7M weekly downloads · 17k stars
pypistats.org · View on Gawk →
Tracks the `pydantic-ai` meta-package; the lean `pydantic-ai-slim` variant is roughly 4× larger but users typically compare the meta number.
smolagents · -14%
146k weekly downloads · 27k stars
pypistats.org · View on Gawk →

Source: pypistats.org, pypistats.org, pypistats.org

Anchor link

Share · LinkedIn Share · X

Notable AI Lab activity

10 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

LangChain
+17 events (now 189) · San Francisco, US
Gawk · Labs
OpenAI
+7 events (now 141) · San Francisco, US
Gawk · Labs
Hugging Face
−9 events (now 113) · New York, US
Gawk · Labs
DeepSeek
+6 events (now 73) · Hangzhou, CN
Gawk · Labs
Microsoft Research
−29 events (now 67) · Redmond, US
Gawk · Labs
Weights & Biases
−29 events (now 43) · San Francisco, US
Gawk · Labs
New mover: Google DeepMind
30 events · London, GB
Gawk · Labs
New mover: Alibaba Qwen
21 events · Hangzhou, CN
Gawk · Labs
Dropped off: AMD
was 55 events · Santa Clara, US
Gawk · Labs
Dropped off: Intel AI
was 27 events · Santa Clara, US
Gawk · Labs

Source: gawk.dev

Anchor link

Share · LinkedIn Share · X

Model Usage

deepseek/deepseek-v4-flash holds #1 · biggest mover: openai/gpt-5-mini +10 to #30 · biggest drop: qwen/qwen3.5-397b-a17b -16

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.

openai/gpt-5-mini climbed +10 to #30
was #40 on 2026-05-16
OpenRouter · View on Gawk →
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
qwen/qwen3.5-397b-a17b slipped -16 to #45
was #29 on 2026-05-16
OpenRouter · View on Gawk →
#1 deepseek/deepseek-v4-flash
OpenRouter · View on Gawk →
#2 tencent/hy3-preview
OpenRouter · View on Gawk →
#3 anthropic/claude-sonnet-4.6
OpenRouter · View on Gawk →

Source: openrouter.ai

Anchor link

Share · LinkedIn Share · X

Benchmark movers

Tool Health

Top HN stories

SDK adoption

Agent frameworks

Notable AI Lab activity

Model Usage