Gawk · 2026-07-03
Gawk — 2026-07-03 · 2 changes in the LMArena top 3
Five verifiable things that moved in the AI ecosystem in the last 24h. Every number traces to a public source.
What moved
- claude-opus-4-6-thinking holds #1 on LMArena for the 30th consecutive snapshot.
Benchmark movers
2 changes in the LMArena top 3
Why this matters · Public benchmarks are gameable, but rank shuffles still hint at where the frontier is genuinely moving.
claude-opus-4-6-thinking
+1 Elo (now 1501) · anthropic
claude-fable-5
+1 Elo (now 1495) · anthropic
Source: lmarena.ai
Tool Health
4 of 5 tools operational, no new incidents in the last 24h (vs 2 yesterday)
Why this matters · Provider outages and degradations cause retry storms upstream. Tracking the 7-day shape catches flapping providers before they page you.
claude-code
Operational
openai-api
Degraded · 1 active incident
codex
Operational · 1 active incident
copilot
Operational
windsurf
Operational
Source: status.claude-code.com, status.openai-api.com, status.codex.com, status.copilot.com, status.windsurf.com
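One way to read the "7-day shape" mentioned above is to count how often a provider's status changes within the window. This is a minimal sketch, not Gawk's method: it assumes one status sample per day, and the function names, the threshold of three changes, and the sample histories are all hypothetical.

```python
def count_transitions(statuses):
    """Count how many times consecutive status samples differ."""
    return sum(1 for prev, cur in zip(statuses, statuses[1:]) if prev != cur)


def is_flapping(statuses, threshold=3):
    """Flag a provider whose status changed at least `threshold` times.

    The default threshold of 3 is an assumption chosen for illustration.
    """
    return count_transitions(statuses) >= threshold


# Hypothetical 7-day histories, oldest first (not real status data).
week = ["operational", "degraded", "operational", "degraded",
        "operational", "operational", "operational"]
steady = ["operational"] * 7

print(count_transitions(week), is_flapping(week), is_flapping(steady))
```

A provider that ends the week "Operational" can still be flagged here, which is the point of looking at the shape and not only at the latest status.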
Top HN stories
Top 5 on HN in the last 24h
Why this matters · What developers debate on Hacker News often previews which models, frameworks, and patterns they'll actually adopt next.
Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says
67 points · 28 comments
OpenAI: In early talks to give 5% stake to US Government
40 points · 46 comments
Claude's AskUserQuestion: "No response after 60s – continued without an answer"
37 points · 42 comments
14× faster embeddings: how we rebuilt the ONNX path in Manticore
33 points · 4 comments
AI coding is a nightmare. Am I the only one experiencing this?
21 points · 9 comments
Source: news.ycombinator.com
SDK adoption
5 notable shifts across the six registries
Why this matters · Package install volume is where developers place real bets — distinct from where labs are shipping marketing.
PyPI: diffusers
−5.1k 24h downloads day-over-day
pypistats.org is a third-party aggregator — counts include mirror hits, CI builds, and pip install retries.
crates.io: ort
+57.8k 90d downloads day-over-day
Docker Hub: ollama/ollama
+316.7k all-time pulls day-over-day
Homebrew: ollama
−1.1k 30d downloads day-over-day
VS Code Marketplace: saoudrizwan.claude-dev
+13.9k cumulative installs day-over-day
Source: pypistats.org, crates.io, hub.docker.com, formulae.brew.sh, marketplace.visualstudio.com
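Several of the figures above are day-over-day changes in a rolling-window total (24h, 30d, 90d). The sketch below shows the arithmetic, assuming the registry reports a plain sum over the trailing window. Under that assumption the change equals the newest day's count minus the count of the day that just left the window, so a negative figure can mean a busy day aged out, not that current usage fell. The function names and daily counts are hypothetical.

```python
def rolling_total(daily, window, end):
    """Sum of `window` daily counts ending at index `end` (inclusive)."""
    start = end - window + 1
    if start < 0:
        raise ValueError("not enough history for this window")
    return sum(daily[start:end + 1])


def day_over_day(daily, window):
    """Change in the rolling total between the last two days."""
    last = len(daily) - 1
    return rolling_total(daily, window, last) - rolling_total(daily, window, last - 1)


# Hypothetical daily downloads, oldest first (not real registry data).
daily = [900, 100, 120, 110, 130]
delta = day_over_day(daily, window=4)

# The delta is the newest day minus the day that dropped out of the window.
print(delta, daily[-1] - daily[0])
```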
Agent frameworks
1 agent framework moved >10% in the last week
Why this matters · Agent frameworks rise and fall on weekly download velocity and open-issue trajectory, not GitHub stars.
Pydantic AI · -52%
3.0M weekly downloads · 18k stars
Tracks the `pydantic-ai` meta-package. The lean `pydantic-ai-slim` variant sees roughly 4× the download volume, but users typically compare the meta-package number.
Source: pypistats.org
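The percentage shown for a framework is a week-over-week change in downloads. The sketch below assumes the conventional definition (current minus previous, divided by previous), which the source does not spell out. Under that assumption, a −52% move that lands at 3.0M implies a prior week of roughly 6.25M. That figure is derived here from rounded inputs and is not reported by the source. The function names are illustrative.

```python
def pct_change(previous, current):
    """Week-over-week change as a percentage of the previous week."""
    if previous == 0:
        raise ValueError("previous week must be non-zero")
    return (current - previous) / previous * 100.0


def implied_previous(current, pct):
    """Invert pct_change: recover the previous value from current and percent."""
    return current / (1.0 + pct / 100.0)


# Inputs taken from the entry above; both are rounded in the source.
current_week = 3_000_000
change_pct = -52.0

previous_week = implied_previous(current_week, change_pct)
print(round(previous_week), round(pct_change(previous_week, current_week), 1))
```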
Notable AI Lab activity
8 labs moved in the last 24h
Why this matters · GitHub event volume on a lab's own repos is the cleanest publicly-verifiable proxy for engineering activity we have.
Anthropic
+32 events (now 186) · San Francisco, US
Microsoft Research
+12 events (now 128) · Redmond, US
Ollama
+5 events (now 97) · San Francisco, US
Hugging Face
−6 events (now 95) · New York, US
DeepSeek
+6 events (now 83) · Hangzhou, CN
OpenAI
−18 events (now 74) · San Francisco, US
New mover: Google Research
25 events · Mountain View, US
Dropped off: Intel AI
was 26 events · Santa Clara, US
Source: gawk.dev
Model Usage
deepseek/deepseek-v4-flash holds #1 · biggest mover: deepseek/deepseek-v4-pro +44 to #6 · biggest drop: moonshotai/kimi-k2.7-code -28
Why this matters · OpenRouter rankings reflect API-first developer spend; direct usage, such as consumer ChatGPT, is invisible by construction.
deepseek/deepseek-v4-pro climbed +44 to #6
was #50 on 2026-06-26
OpenRouter request volume reflects developer API spending, not end-user adoption. Biased toward API-first workflows that route through OpenRouter — direct OpenAI / Anthropic / Google customers who never use OpenRouter are invisible.
moonshotai/kimi-k2.7-code slipped -28 to #35
was #7 on 2026-06-26
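The movement figures are differences between two ranking snapshots. The small sketch below assumes movement is defined as previous rank minus current rank, so a climb is positive and a slip is negative. The function name is illustrative; the ranks are the ones quoted above.

```python
def rank_movement(previous_rank, current_rank):
    """Positive when a model climbed (rank number fell), negative when it slipped."""
    return previous_rank - current_rank


# Ranks quoted above: snapshot on 2026-06-26 vs. the current snapshot.
moves = {
    "deepseek/deepseek-v4-pro": rank_movement(50, 6),
    "moonshotai/kimi-k2.7-code": rank_movement(7, 35),
}

for model, delta in moves.items():
    print(f"{model}: {delta:+d}")
```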
#1 deepseek/deepseek-v4-flash
#2 xiaomi/mimo-v2.5
#3 minimax/minimax-m3
Source: openrouter.ai