Gawk · 2026-05-20

Gawk — 2026-05-20 · 3 tool incidents

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

What moved

  • claude-opus-4-6-thinking holds #1 on LMArena for the 29th consecutive snapshot.
  • `ai` downloads grew for the 6th consecutive snapshot.
  • `openai` downloads grew for the 6th consecutive snapshot.

Benchmark movers

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

Source: lmarena.ai

Tool Health

3 incidents in the last 24h (vs 5 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-20. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

  • Elevated errors on Claude Haiku 4.5

    started 08:14 UTC · resolved 08:49 UTC · minor impact

    Anthropic status page

  • Elevated errors on Claude Opus 4.7

    started 14:03 UTC · resolved 15:40 UTC · major impact

    Anthropic status page

  • API users may see increased error rates for GPT-5.4 and GPT-5.5

    started 23:32 UTC · resolved 00:37 UTC (next day) · minor impact

    OpenAI status page

Source: status.anthropic.com, status.openai.com
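
The start and resolve times listed above are enough to estimate how long each incident lasted. Here is a minimal Python sketch, assuming every incident resolved within 24 hours of starting, so a resolve time earlier than the start time means the incident crossed midnight UTC. The name `incident_minutes` is illustrative and not part of any status-page API.

```python
from datetime import datetime, timedelta

def incident_minutes(started: str, resolved: str) -> int:
    """Duration in minutes between two HH:MM UTC clock times.

    Assumption: the incident lasted less than 24 hours, so a resolve
    time earlier than the start time is treated as the following day.
    """
    fmt = "%H:%M"
    start = datetime.strptime(started, fmt)
    end = datetime.strptime(resolved, fmt)
    if end < start:
        end += timedelta(days=1)  # crossed midnight UTC
    return int((end - start).total_seconds() // 60)

# (affected models, started, resolved), copied from the list above.
incidents = [
    ("Claude Haiku 4.5", "08:14", "08:49"),
    ("Claude Opus 4.7", "14:03", "15:40"),
    ("GPT-5.4 and GPT-5.5", "23:32", "00:37"),
]

durations = {name: incident_minutes(s, r) for name, s, r in incidents}
for name, minutes in durations.items():
    print(f"{name}: {minutes} min")
```

By this estimate the Opus 4.7 incident, the only one marked major, was also the longest at 97 minutes, against 35 and 65 minutes for the other two.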

Top HN stories

Top 5 on HN in the last 24h

Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.

  • OpenAI Adopts Google's SynthID Watermark for AI Images with Verification Tool

    55 points · 24 comments

    news.ycombinator.com

  • Anthropic Is Preparing for IPO and We Should Be Worried

    30 points · 30 comments

    news.ycombinator.com

  • Show HN: Chrome ext to let zot, your terminal coding agent, operate the browser

    11 points · 1 comment

    news.ycombinator.com

  • We cut Claude's token usage 79% by redesigning our CLI for agents

    11 points · 3 comments

    news.ycombinator.com

  • Andrej Karpathy Joins Anthropic

    9 points · 0 comments

    news.ycombinator.com

Source: news.ycombinator.com

SDK adoption

6 notable shifts across the six registries

Why this matters · Package install volume shows where developers place real bets, which is distinct from where labs direct their marketing.

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com

Agent frameworks

2 agent frameworks moved >10% in the last week

Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.

  • CrewAI · +81%

    4.0M weekly downloads · 52k stars

    pypistats.org

  • Pydantic AI · -33%

    8.0M weekly downloads · 17k stars

    pypistats.org

    Tracks the `pydantic-ai` meta-package; the lean `pydantic-ai-slim` variant sees roughly 4× the downloads, but users typically compare the meta-package number.

Source: pypistats.org
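
The digest does not define how the percentages above are computed. Assuming they are plain week-over-week changes in download counts, the previous week's volume can be backed out from the current figure. In the sketch below the formula, the function names, and the reuse of the 10% cutoff are all assumptions made for illustration.

```python
def pct_change(previous: float, current: float) -> float:
    """Week-over-week change in percent (assumed definition)."""
    return (current - previous) / previous * 100.0

def implied_previous(current: float, change_pct: float) -> float:
    """Invert pct_change: the prior-week value implied by a reported change."""
    return current / (1.0 + change_pct / 100.0)

# Figures from the list above: (weekly downloads in millions, reported change in %).
frameworks = {"CrewAI": (4.0, 81.0), "Pydantic AI": (8.0, -33.0)}

for name, (current, change) in frameworks.items():
    previous = implied_previous(current, change)
    moved = abs(change) > 10.0  # the section's ">10%" threshold
    print(f"{name}: ~{previous:.1f}M -> {current:.1f}M ({change:+.0f}%), moved={moved}")
```

Under that assumption, CrewAI rose from roughly 2.2M to 4.0M weekly downloads and Pydantic AI fell from roughly 11.9M to 8.0M. Both inputs are rounded in the digest, so the implied figures are approximate.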

Notable AI Lab activity

10 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

  • LangChain

    −24 events (now 171) · San Francisco, US

    Gawk · Labs

  • OpenAI

    −33 events (now 135) · San Francisco, US

    Gawk · Labs

  • Microsoft Research

    +31 events (now 118) · Redmond, US

    Gawk · Labs

  • Hugging Face

    −8 events (now 93) · New York, US

    Gawk · Labs

  • New mover: Google Research

    92 events · Mountain View, US

    Gawk · Labs

  • Ollama

    −8 events (now 89) · San Francisco, US

    Gawk · Labs

  • DeepSeek

    +17 events (now 82) · Hangzhou, CN

    Gawk · Labs

  • New mover: AMD

    66 events · Santa Clara, US

    Gawk · Labs

  • Dropped off: Replicate

    was 30 events · San Francisco, US

    Gawk · Labs

  • Dropped off: Meta FAIR

    was 20 events · Menlo Park, US

    Gawk · Labs

Source: gawk.dev
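
Each delta above is reported alongside the current count, so the previous count follows by subtraction. Here is a short sketch using the six labs that report both numbers, assuming the delta is measured against the prior snapshot. The tuple layout and the name `previous_count` are illustrative.

```python
# (lab, delta in events, current events), copied from the list above.
labs = [
    ("LangChain", -24, 171),
    ("OpenAI", -33, 135),
    ("Microsoft Research", 31, 118),
    ("Hugging Face", -8, 93),
    ("Ollama", -8, 89),
    ("DeepSeek", 17, 82),
]

def previous_count(delta: int, current: int) -> int:
    """Previous count implied by a reported delta: current = previous + delta."""
    return current - delta

previous = {name: previous_count(delta, now) for name, delta, now in labs}

# Compare the ordering by previous count with the ordering by current count.
previous_order = sorted(previous, key=previous.get, reverse=True)
current_order = [name for name, _, _ in sorted(labs, key=lambda row: row[2], reverse=True)]

for name in current_order:
    print(f"{name}: was {previous[name]}, "
          f"position {previous_order.index(name) + 1} -> {current_order.index(name) + 1}")
```

Among these six, Microsoft Research is the only lab that gained places, moving from fifth to third. New movers and dropped labs are left out because the digest gives only one count for each.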

Model Usage

tencent/hy3-preview holds #1 · biggest mover: z-ai/glm-4.7, up 12 places to #30 · biggest drop: z-ai/glm-5, down 8 places

Why this matters · OpenRouter rankings reflect API-first developer spend; direct usage such as consumer ChatGPT is invisible by construction.

Source: openrouter.ai
