Gawk · 2026-05-03

Gawk — 2026-05-03 · 10 labs moved in the last 24h

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

Benchmark movers

#

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

Source: lmarena.ai

Tool Health

#

All tools operational, no incidents in the last 24h (vs 2 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-03. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

Source: status.claude-code.com, status.openai-api.com, status.codex.com, status.copilot.com, status.windsurf.com

Top HN stories

#

Top 5 on HN in the last 24h

Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.

  • VS Code inserting 'Co-Authored-by Copilot' into commits regardless of usage

    461 points · 209 comments

    news.ycombinator.com

  • LLMs consistently pick resumes they generate over ones by humans or other models

    256 points · 115 comments

    news.ycombinator.com

  • Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge

    252 points · 110 comments

    news.ycombinator.com

  • Open Design: Use Your Coding Agent as a Design Engine

    62 points · 27 comments

    news.ycombinator.com

  • The agent harness belongs outside the sandbox

    48 points · 33 comments

    news.ycombinator.com

Source: news.ycombinator.com

SDK adoption

#

5 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, marketplace.visualstudio.com

Notable AI Lab activity

#

10 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

Source: gawk.dev

Model Usage

#

tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-flash +38 to #9 · biggest drop: openai/gpt-oss-120b -24

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.

Source: openrouter.ai

Archived from . Every number traces to a public source.