Gawk · 2026-07-01
Gawk — 2026-07-01 · 3 tool incidents
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
- langchain downloads declined for the 4th consecutive snapshot.
- torch downloads declined for the 4th consecutive snapshot.
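A "4th consecutive snapshot" claim like the ones above can be checked by counting the trailing run of declines in a download series. The following is a minimal sketch, not Gawk's actual pipeline: the function name `decline_streak` and the sample series are hypothetical and are not taken from any registry.

```python
from typing import Sequence


def decline_streak(values: Sequence[float]) -> int:
    """Count how many of the most recent snapshot-to-snapshot steps were declines.

    `values` is ordered oldest to newest. The count stops at the first
    step (walking backwards from the newest) that is not a strict decline.
    """
    streak = 0
    # Pair each snapshot with its predecessor, newest pair first.
    for prev, curr in zip(reversed(values[:-1]), reversed(values[1:])):
        if curr < prev:
            streak += 1
        else:
            break
    return streak


# Hypothetical series (illustrative numbers only): one rise, then four declines.
snapshots = [10, 12, 11, 9, 8, 7]
print(decline_streak(snapshots))  # prints 4
```

A flat step breaks the streak here because the comparison is strict; whether an unchanged count should break a "declined" streak is a design choice this sketch assumes.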
Benchmark movers
No rank changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
#1 claude-opus-4-6-thinking
anthropic · 1500 Elo
#2 claude-opus-4-6
anthropic · 1498 Elo
#3 claude-fable-5
anthropic · 1494 Elo
Source: lmarena.ai
Tool Health
3 incidents in the last 24h (vs 0 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
Elevated error rates on Opus 4.8
started 14:31 UTC · resolved 15:28 UTC · minor impact
Codex, workspace analytics, conversation search, searching for custom GPTs, ChatGPT user invites, and Compliance Log Platform download endpoint not working in FedRAMP workspaces
started 03:38 UTC · ongoing · minor impact
Delays in copilot budget limits resets for some users
started 10:51 UTC · ongoing · minor impact
openai-api: Operational → Degraded
Source: status.anthropic.com, status.openai.com, githubstatus.com, status.openai-api.com
Top HN stories
Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Claude Sonnet 5
381 points · 183 comments
From brain waves to words: a new path to communication without surgery
84 points · 44 comments
Words Are a Byproduct of Consciousness. For LLMs, It's Backwards
59 points · 87 comments
Redeploying Fable 5
41 points · 7 comments
Tell HN: Installing Cursor on iOS irreversibly changes your privacy settings
36 points · 7 comments
Source: news.ycombinator.com
SDK adoption
5 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: huggingface-hub
+3.4M 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
crates.io: ort
+36.1k 90d downloads day-over-day
Docker Hub: ollama/ollama
+340.2k all-time pulls day-over-day
Homebrew: ollama
+418 30d downloads day-over-day
VS Code Marketplace: saoudrizwan.claude-dev
+15.9k cumulative installs day-over-day
Source: pypistats.org, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
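The day-over-day figures above are differences between two snapshots of the same counter (a 24h count, a rolling 30d or 90d window, or an all-time total). As a rough sketch under that assumption, the helpers below compute such a delta and render it in the compact style used in this section. The names `day_over_day` and `format_delta` are hypothetical, and the snapshot values in the example are made up for illustration.

```python
def day_over_day(today: int, yesterday: int) -> int:
    """Difference between two snapshots of the same counter."""
    return today - yesterday


def format_delta(delta: int) -> str:
    """Render a signed delta compactly, e.g. 340200 -> '+340.2k'."""
    sign = "-" if delta < 0 else "+"
    magnitude = abs(delta)
    if magnitude >= 1_000_000:
        return f"{sign}{magnitude / 1_000_000:.1f}M"
    if magnitude >= 1_000:
        return f"{sign}{magnitude / 1_000:.1f}k"
    return f"{sign}{magnitude}"


# Hypothetical snapshot values, chosen only to illustrate the formatting.
yesterday_pulls = 1_000_000
today_pulls = 1_340_200
print(format_delta(day_over_day(today_pulls, yesterday_pulls)))  # prints +340.2k
```

Note that a delta on a rolling window (30d, 90d) can be negative even while the package keeps being downloaded, whereas a delta on an all-time counter should never be.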
Agent frameworks
2 agent frameworks moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
Pydantic AI · -63%
2.8M weekly downloads · 18k stars
pypistats.org
Tracks the `pydantic-ai` meta-package. The lean `pydantic-ai-slim` variant sees roughly 4× the downloads, but users typically compare the meta-package number.
AutoGen · -12%
250k weekly downloads · 59k stars
pypistats.org
Tracks the live `autogen-agentchat` package — the deprecated bare `autogen` PyPI name is a different abandoned project and is not counted here.
Source: pypistats.org
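The ">10% in the last week" filter used in this section can be read as a week-over-week percent change compared against a threshold. The sketch below assumes that reading; the function names, the default threshold parameter, and the sample numbers are hypothetical and do not come from pypistats.org.

```python
def weekly_change_pct(current: float, previous: float) -> float:
    """Percent change of this week's downloads relative to last week's."""
    if previous <= 0:
        raise ValueError("previous week's downloads must be positive")
    return (current - previous) * 100.0 / previous


def moved_more_than(current: float, previous: float, threshold_pct: float = 10.0) -> bool:
    """True when the absolute week-over-week change exceeds the threshold."""
    return abs(weekly_change_pct(current, previous)) > threshold_pct


# Hypothetical weekly totals (illustrative only).
print(weekly_change_pct(880, 1000))  # prints -12.0
print(moved_more_than(880, 1000))    # prints True
print(moved_more_than(950, 1000))    # prints False
```

Using the absolute value means both surges and drops count as "moved", which matches a list that shows only declines this week but would also surface gains.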
Notable AI Lab activity
7 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly verifiable proxy for engineering activity we have.
Anthropic
+6 events (now 189) · San Francisco, US
Microsoft Research
+28 events (now 137) · Redmond, US
OpenAI
−22 events (now 96) · San Francisco, US
Ollama
+17 events (now 92) · San Francisco, US
Hugging Face
−19 events (now 87) · New York, US
New mover: Replicate
29 events · San Francisco, US
Dropped off: Alibaba Qwen
was 24 events · Hangzhou, CN
Source: gawk.dev
Model Usage
deepseek/deepseek-v4-flash holds #1 · biggest mover: deepseek/deepseek-v4-flash, +49 to #1 · biggest drop: moonshotai/kimi-k2.7-code, -29 to #35
Why this matters · OpenRouter rankings reflect API-first developer spend; usage that never routes through OpenRouter, such as consumer ChatGPT, is invisible by construction.
deepseek/deepseek-v4-flash climbed +49 to #1
was #50 on 2026-06-24
OpenRouter request volume reflects developer API spending, not end-user adoption. It is biased toward API-first workflows that route through OpenRouter; direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
moonshotai/kimi-k2.7-code slipped -29 to #35
was #6 on 2026-06-24
#1 deepseek/deepseek-v4-flash
#2 xiaomi/mimo-v2.5
#3 tencent/hy3-preview
Source: openrouter.ai
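The "+49" and "-29" figures above follow from subtracting the current rank from the rank on 2026-06-24, so a positive number means a climb. A small sketch of that arithmetic, using the two rank pairs reported in this section; the function name `rank_moves` and the dictionary layout are assumptions, not OpenRouter's API.

```python
def rank_moves(previous: dict[str, int], current: dict[str, int]) -> dict[str, int]:
    """Rank change per model; positive means the model climbed.

    Only models present in both snapshots are compared.
    """
    return {
        model: previous[model] - current[model]
        for model in current
        if model in previous
    }


# Ranks as reported above: 2026-06-24 versus today.
previous = {"deepseek/deepseek-v4-flash": 50, "moonshotai/kimi-k2.7-code": 6}
current = {"deepseek/deepseek-v4-flash": 1, "moonshotai/kimi-k2.7-code": 35}

moves = rank_moves(previous, current)
biggest_mover = max(moves, key=moves.get)
biggest_drop = min(moves, key=moves.get)
print(biggest_mover, moves[biggest_mover])  # prints deepseek/deepseek-v4-flash 49
print(biggest_drop, moves[biggest_drop])    # prints moonshotai/kimi-k2.7-code -29
```

Models that appear in only one snapshot are skipped in this sketch; how new entrants and drop-offs should be scored is left open.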