Gawk · 2026-05-03

Gawk — 2026-05-03 · 10 labs moved in the last 24h

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

Benchmark movers

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
LMArena · 2026-05-01
#2 claude-opus-4-6
anthropic · 1496 Elo
LMArena · 2026-05-01
#3 gemini-3.1-pro-preview
google · 1487 Elo
LMArena · 2026-05-01

Source: lmarena.ai

Anchor link

Share · LinkedIn Share · X

Tool Health

All tools operational, no incidents in the last 24h (vs 2 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-03. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

claude-code
Operational
Statuspage
openai-api
Degraded · 1 active incident
Statuspage
codex
Operational · 1 active incident
Statuspage
copilot
Operational
Statuspage
windsurf
Operational
Statuspage

Source: status.claude-code.com, status.openai-api.com, status.codex.com, status.copilot.com, status.windsurf.com

Anchor link

Share · LinkedIn Share · X

SDK adoption

5 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

PyPI: huggingface-hub
−2.4M 24h downloads day-over-day
PyPI · View on Gawk →
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
npm: openai
−453.2k 24h downloads day-over-day
npm · View on Gawk →
crates.io: tch
−8.1k 90d downloads day-over-day
crates.io · View on Gawk →
Docker Hub: ollama/ollama
+305.0k all-time pulls day-over-day
Docker Hub · View on Gawk →
VS Code Marketplace: Continue.continue
+9.5k cumulative installs day-over-day
VS Code Marketplace · View on Gawk →

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, marketplace.visualstudio.com

Anchor link

Share · LinkedIn Share · X

Notable AI Lab activity

10 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

Anthropic
+9 events (now 177) · San Francisco, US
Gawk · Labs
OpenAI
−18 events (now 81) · San Francisco, US
Gawk · Labs
DeepSeek
−18 events (now 66) · Hangzhou, CN
Gawk · Labs
Microsoft Research
−54 events (now 38) · Redmond, US
Gawk · Labs
Google DeepMind
−9 events (now 13) · London, GB
Gawk · Labs
New mover: Moonshot AI
9 events · Beijing, CN
Gawk · Labs
Meta FAIR
−14 events (now 8) · Menlo Park, US
Gawk · Labs
New mover: Mistral AI
8 events · Paris, FR
Gawk · Labs
Dropped off: Stanford CRFM
was 27 events · Stanford, US
Gawk · Labs
Dropped off: INRIA
was 21 events · Paris, FR
Gawk · Labs

Source: gawk.dev

Anchor link

Share · LinkedIn Share · X

Model Usage

tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-flash +38 to #9 · biggest drop: openai/gpt-oss-120b -24

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.

deepseek/deepseek-v4-flash climbed +38 to #9
was #47 on 2026-04-26
OpenRouter · View on Gawk →
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
openai/gpt-oss-120b slipped -24 to #40
was #16 on 2026-04-26
OpenRouter · View on Gawk →
#1 tencent/hy3-preview
OpenRouter · View on Gawk →
#2 moonshotai/kimi-k2.6
OpenRouter · View on Gawk →
#3 anthropic/claude-sonnet-4.6
OpenRouter · View on Gawk →

Source: openrouter.ai

Anchor link

Share · LinkedIn Share · X

Benchmark movers

Tool Health

Top HN stories

SDK adoption

Notable AI Lab activity

Model Usage