Gawk · 2026-05-07

Gawk — 2026-05-07 · 3 tool incidents

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

What moved

  • claude-opus-4-6-thinking holds #1 on LMArena for the 16th consecutive snapshot.
  • llamaindex downloads declined for the 12th consecutive snapshot.
  • ai downloads declined for the 4th consecutive snapshot.
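
One way to read the "Nth consecutive snapshot" figures above is as a trailing streak over a time-ordered series. The sketch below is an assumption about how such a count could be derived, not a description of how Gawk computes it; the function names and the sample values are hypothetical.

```python
from typing import Sequence


def trailing_streak(values: Sequence[float], declining: bool = True) -> int:
    """Count the most recent snapshot-to-snapshot steps that all moved one way."""
    streak = 0
    # Walk adjacent (previous, current) pairs from newest to oldest.
    for prev, curr in zip(reversed(values[:-1]), reversed(values[1:])):
        moved = curr < prev if declining else curr > prev
        if not moved:
            break
        streak += 1
    return streak


def holding_streak(leaders: Sequence[str], name: str) -> int:
    """Count how many of the latest snapshots list `name` in the top slot."""
    streak = 0
    for leader in reversed(leaders):
        if leader != name:
            break
        streak += 1
    return streak


# Hypothetical download counts, oldest first (illustrative only, not real data).
downloads = [100, 120, 110, 105, 90]
print(trailing_streak(downloads))
```

Note that a streak of N declines needs N + 1 snapshots, since each decline compares a snapshot with the one before it.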

Benchmark movers

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

Source: lmarena.ai

Tool Health

3 incidents in the last 24h (vs 0 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
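
As a rough illustration of what "flapping" could mean in practice, the sketch below counts how often a tool flips between healthy and unhealthy across a 7-day row. The state names mirror the four states the tool health grid distinguishes; the threshold of three flips and the function names are assumptions chosen for the example, not Gawk's definition.

```python
from typing import Sequence

HEALTHY = "operational"
UNHEALTHY = {"degraded", "outage"}


def count_transitions(days: Sequence[str]) -> int:
    """Count healthy/unhealthy flips across a row, skipping days with no data."""
    known = [d for d in days if d == HEALTHY or d in UNHEALTHY]
    flips = 0
    for prev, curr in zip(known, known[1:]):
        if (prev in UNHEALTHY) != (curr in UNHEALTHY):
            flips += 1
    return flips


def is_flapping(days: Sequence[str], min_flips: int = 3) -> bool:
    """Flag a row as flapping when it flips at least `min_flips` times (assumed threshold)."""
    return count_transitions(days) >= min_flips


# Hypothetical 7-day row, oldest day first (illustrative only, not real data).
row = ["operational", "degraded", "operational", "no data",
       "outage", "operational", "operational"]
print(count_transitions(row), is_flapping(row))
```

A single long outage produces only one or two flips, so it would not trip this check; the signal here is repeated recovery and relapse rather than total downtime.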

Figure: Tool health, 7 days ending 2026-05-07. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.

  • Connection failures for organizations restricting GitHub access by IP address

    started 22:32 UTC · ongoing · major impact

    Anthropic status page

  • Elevated errors across multiple models

    started 15:29 UTC · resolved 16:51 UTC · major impact

    Anthropic status page

  • Increased error rate with image generation in the API

    started 07:55 UTC · resolved 09:38 UTC · minor impact

    OpenAI status page

  • claude-code: Operational → Degraded

    Statuspage

Source: status.anthropic.com, status.openai.com, status.claude-code.com

Top HN stories

Top 5 on HN in the last 24h

Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.

  • Higher usage limits for Claude and a compute deal with SpaceX

    102 points · 45 comments

    news.ycombinator.com

  • How Unsloth and Nvidia made LLM training 25% faster on consumer GPUs

    21 points · 2 comments

    news.ycombinator.com

  • ProgramBench: Can Language Models Rebuild Programs from Scratch?

    16 points · 12 comments

    news.ycombinator.com

  • SpaceXAI will provide Anthropic with access to Colossus 1

    11 points · 2 comments

    news.ycombinator.com

  • Firm solar and storage costs fall to $54/MWh, says IRENA

    9 points · 0 comments

    news.ycombinator.com

Source: news.ycombinator.com

SDK adoption

6 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com

Notable AI Lab activity

7 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

  • Anthropic

    +16 events (now 199) · San Francisco, US

    Gawk · Labs

  • OpenAI

    +46 events (now 160) · San Francisco, US

    Gawk · Labs

  • DeepSeek

    −9 events (now 66) · Hangzhou, CN

    Gawk · Labs

  • Microsoft Research

    −6 events (now 65) · Redmond, US

    Gawk · Labs

  • Stanford CRFM

    +6 events (now 28) · Stanford, US

    Gawk · Labs

  • Google DeepMind

    +5 events (now 17) · London, GB

    Gawk · Labs

  • Google Research

    −87 events (now 13) · Mountain View, US

    Gawk · Labs

Source: gawk.dev

Model Usage

tencent/hy3-preview holds #1 · biggest mover: deepseek/deepseek-v4-pro +32 to #10 · biggest drop: openai/gpt-oss-120b −18

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.
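
Mover figures like "+32 to #10" can be read as the difference between a model's rank in two snapshots. The sketch below shows that arithmetic under the assumption that each snapshot is a simple name-to-rank mapping; the function names, model names, and ranks are placeholders, not OpenRouter data or its API.

```python
from typing import Dict, Tuple


def rank_changes(previous: Dict[str, int], current: Dict[str, int]) -> Dict[str, int]:
    """Return places gained per model (positive = climbed), for models in both snapshots."""
    return {m: previous[m] - current[m] for m in current if m in previous}


def biggest_movers(previous: Dict[str, int], current: Dict[str, int]) -> Tuple[str, str]:
    """Return (largest riser, largest faller); raises ValueError if no model overlaps."""
    changes = rank_changes(previous, current)
    riser = max(changes, key=changes.get)
    faller = min(changes, key=changes.get)
    return riser, faller


# Placeholder snapshots (illustrative only): rank 42 -> 10 is a +32 move.
previous = {"model-a": 1, "model-b": 42, "model-c": 7}
current = {"model-a": 1, "model-b": 10, "model-c": 25}
print(rank_changes(previous, current))
print(biggest_movers(previous, current))
```

Models that appear in only one snapshot are skipped here, so a new entrant never registers as a mover; a real ranking would need its own rule for that case.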

Source: openrouter.ai
