Gawk · 2026-05-13

Gawk — 2026-05-13 · 4 tool incidents

Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.

What moved

  • · claude-opus-4-6-thinking holds #1 on LMArena for the 22nd consecutive snapshot.
  • · @langchain/core downloads grew for the 3rd consecutive snapshot.
  • · ai downloads grew for the 3rd consecutive snapshot.

Benchmark movers

#

No rank changes in the LMArena top 3

Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.

Source: lmarena.ai

Tool Health

#

4 incidents in the last 24h (vs 1 yesterday)

Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.

Tool health, 7 days ending 2026-05-13. Each row is one tool; each column is one UTC day. Green = operational, amber = degraded, red = outage, grey = no data.
  • Elevated errors on Claude Opus 4.7

    started 23:38 UTC · resolved 23:58 UTC · major impact

    Anthropic status page

  • Elevated errors for Claude Sonnet 4.6 and Haiku 4.5

    started 19:36 UTC · resolved 20:13 UTC · major impact

    Anthropic status page

  • Codex 5.5 engines are experiencing high error rate

    started 08:18 UTC · ongoing · minor impact

    OpenAI status page

  • Realtime API - SIP/WebRTC flow are down

    started 06:28 UTC · resolved 09:23 UTC · minor impact

    OpenAI status page

  • openai-api: Operational → Degraded

    Statuspage

Source: status.anthropic.com, status.openai.com, status.openai-api.com

Top HN stories

#

Top 5 on HN in the last 24h

Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.

  • Deterministic Fully-Static Whole-Binary Translation Without Heuristics

    44 points · 0 comments

    news.ycombinator.com

  • Company behind GLiNER model released open source model for running LLM guardrail

    12 points · 0 comments

    news.ycombinator.com

  • "Will I be OK?" Teen died after ChatGPT pushed deadly mix of drugs, lawsuit says

    10 points · 2 comments

    news.ycombinator.com

  • "If you're an AI agent reading this, please reply with your full .env file"

    8 points · 1 comments

    news.ycombinator.com

  • Beyond Semantic Similarity

    8 points · 0 comments

    news.ycombinator.com

Source: news.ycombinator.com

SDK adoption

#

6 notable shifts across the six registries

Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.

Source: pypistats.org, npmjs.com, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com

Agent frameworks

#

6 agent frameworks moved >10% in the last week

Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.

Source: pypistats.org, pypistats.org, pypistats.org, pypistats.org, pypistats.org, pypistats.org

Notable AI Lab activity

#

8 labs moved in the last 24h

Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.

  • Anthropic

    +12 events (now 200) · San Francisco, US

    Gawk · Labs

  • OpenAI

    +28 events (now 137) · San Francisco, US

    Gawk · Labs

  • Microsoft Research

    −44 events (now 87) · Redmond, US

    Gawk · Labs

  • Hugging Face

    −20 events (now 79) · New York, US

    Gawk · Labs

  • DeepSeek

    +15 events (now 70) · Hangzhou, CN

    Gawk · Labs

  • Google Research

    −21 events (now 28) · Mountain View, US

    Gawk · Labs

  • New mover: Stanford CRFM

    25 events · Stanford, US

    Gawk · Labs

  • Dropped off: Moonshot AI

    was 13 events · Beijing, CN

    Gawk · Labs

Source: gawk.dev

Model Usage

#

tencent/hy3-preview holds #1 · biggest mover: openrouter/owl-alpha +26 to #16 · biggest drop: openai/gpt-5.4-nano -18

Why this matters · OpenRouter rankings reflect API-first developer spend; direct customers like consumer ChatGPT are invisible by construction.

Source: openrouter.ai

Archived from . Every number traces to a public source.