Transparency
Sources
Every number on Gawk traces to a public source. This page lists all 40 data sources currently feeding the dashboard, plus 38curated AI labs that compose the AI Labs layer. Each row shows what the source tracks, how often it’s polled, and when it was last seen live.
The rules behind the feed’s severity ranking live on /methodology. The full typed registry — sanity ranges, caveats, verifiedAt dates — is the public summary at /data-sources.md.
Tool Status (6)
Public status pages for the AI coding tools tracked on the dashboard. Polled every 5 minutes through `/api/status` with a last-known cache fallback.
Operational state + active incidents for Claude API and Claude Code (CLI).
- Cadence
- Every 1–5 minutes
- Last seen
- 9m ago
- Surfaces in
- Tools panel
Per-component status for ChatGPT, OpenAI API, Codex Web/API, CLI, VS Code extension.
- Cadence
- Every 1–5 minutes
- Last seen
- 9m ago
- Surfaces in
- Tools panel
Active OpenAI status-page incidents (investigating / identified / monitoring).
- Cadence
- Every 1–5 minutes
- Last seen
- 9m ago
- Surfaces in
- Tools panel
Open issue count on anthropics/claude-code, surfaced as community-pressure on the Claude Code card.
- Cadence
- Hourly
- Last seen
- 9m ago
- Surfaces in
- Tools panel · Claude Code card
GitHub platform components, filtered for the `Copilot` component used by the Copilot health card.
- Cadence
- Every 1–5 minutes
- Last seen
- 9m ago
- Surfaces in
- Tools panel · Copilot card
- Windsurf StatusLive
Overall page status + incidents for Windsurf (Cascade + Tab).
- Cadence
- Every 1–5 minutes
- Last seen
- 9m ago
- Surfaces in
- Tools panel · Windsurf card
Platform Infrastructure (4)
Public status pages for the four services Gawk itself runs on. Surfaced operator-side at /admin only — the public Tool Health card grid stays AI-focused.
- Vercel StatusOn demand
Vercel platform health (Dashboard, Builds, Edge Network, Functions). Surfaces on /admin only.
- Cadence
- Every 1–5 minutes
- Last seen
- Fetched per request — no scheduled poll
- Surfaces in
- /admin · Platform health
- Supabase StatusOn demand
Supabase platform health (Database, Auth, Storage, Realtime, Edge Functions). Surfaces on /admin only.
- Cadence
- Every 1–5 minutes
- Last seen
- Fetched per request — no scheduled poll
- Surfaces in
- /admin · Platform health
- Cloudflare StatusOn demand
Cloudflare top-line indicator. Per-datacenter components are intentionally not consumed component-by-component. Surfaces on /admin only.
- Cadence
- Every 1–5 minutes
- Last seen
- Fetched per request — no scheduled poll
- Surfaces in
- /admin · Platform health
- Upstash StatusOn demand
Upstash platform health by region (EU-CENTRAL-1, US-EAST-1, etc.) + product (Redis, QStash, Vector). Surfaces on /admin only.
- Cadence
- Every 1–5 minutes
- Last seen
- Fetched per request — no scheduled poll
- Surfaces in
- /admin · Platform health
Code Activity (6)
GitHub events, archive backfills, and discovery surfaces that drive the live globe and the curated repo registry.
Live GitHub PushEvent / PR / Issues / Releases / Forks / Stars across every public repo. Drives the globe pulse.
- Cadence
- Real-time (under 30s)
- Last seen
- 2h ago
- Surfaces in
- Live globe + Wire panel
Hourly complete archive of every public GitHub event, used to backfill the globe on cold start.
- Cadence
- Hourly
- Last seen
- 2h ago
- Surfaces in
- Globe cold-start backfill
Per-repo file-existence + first-500-bytes shape probe for AI-tool config files (CLAUDE.md, .cursorrules, ...).
- Cadence
- Event-driven
- Last seen
- 2h ago
- Surfaces in
- Globe colouring + Registry verifier
Filename discovery for the six AI-tool config formats; feeds the registry verifier.
- Cadence
- Every 6 hours
- Last seen
- 5h ago
- Surfaces in
- Repo registry
Topic-tag discovery (claude / cursor / aider / windsurf / llm / ai-agent / ...) for registry candidates.
- Cadence
- Hourly
- Last seen
- 3h ago
- Surfaces in
- Repo registry
Reverse-dependents for six AI npm packages (anthropic, openai, langchain, ...) feeding registry candidates.
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- Repo registry
Discussion (3)
Community chatter — currently Hacker News with the deterministic AI-keyword filter applied before ingest.
Recent HN stories that match a deterministic AI keyword + domain allowlist.
- Cadence
- Every 1–5 minutes
- Last seen
- 2h ago
- Surfaces in
- Wire panel · Feed
Top-of-day posts from r/LocalLLaMA via Reddit's Atom feed. AI-relevance presumed from the sub's charter (no keyword filter applied).
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Feed · NEWS cards
Top-of-day posts from r/ClaudeAI via Reddit's Atom feed. Anthropic-adjacent discussion; AI-relevance presumed from the sub's charter.
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Feed · NEWS cards
Models (3)
Where models live (HuggingFace), what they cost (OpenRouter usage), and how they rank head-to-head (Chatbot Arena).
Top text-generation models on HuggingFace ranked by 30-day downloads.
- Cadence
- Hourly
- Last seen
- Fetched per request — no scheduled poll
- Surfaces in
- Models panel
Top 20 models by Chatbot Arena Elo for the `text` overall split.
- Cadence
- Daily
- Last seen
- 7h ago
- Surfaces in
- Benchmarks panel · Feed
Weekly request-volume rankings for every OpenRouter-routed model. Reflects API spend, not end-user adoption.
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- Model Usage panel · Feed
SDK Adoption (6)
Package-registry download counters for the AI SDK slate. Daily snapshots feed within-package week-over-week deltas.
Rolling download counters for seven AI Python SDKs (anthropic, openai, langchain, transformers, torch, huggingface-hub, diffusers).
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- SDK Adoption panel · Feed
Rolling download counters for five AI JavaScript SDKs (@anthropic-ai/sdk, openai, @langchain/core, ai, llamaindex).
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- SDK Adoption panel · Feed
Recent (90d) + all-time download counters for four Rust ML crates (candle-core, burn, tch, ort).
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- SDK Adoption panel · Feed
All-time pull counts for two AI inference images (ollama/ollama, vllm/vllm-openai).
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- SDK Adoption panel · Feed
30/90/365-day install counters for the ollama Homebrew formula.
- Cadence
- Every 6 hours
- Last seen
- 4h ago
- Surfaces in
- SDK Adoption panel · Feed
Cumulative install counts for six AI coding-assistant VS Code extensions (Copilot, Continue, Cody, Codeium, Cline, TabNine).
- Cadence
- Every 6 hours
- Last seen
- 5h ago
- Surfaces in
- SDK Adoption panel · Feed
Agents (1)
Eight agent frameworks (LangGraph, CrewAI, smolagents, AutoGen, OpenAI Agents, Pydantic AI) plus two tombstones (AutoGPT legacy, Sweep dormant). Per-repo metadata feeds the Agents panel; PyPI + npm download counters listed under SDK Adoption are the same numbers, attributed twice because both panels consume them.
Per-repository scalar metadata (stars, open issues, last-pushed timestamp, archived flag) for the eight tracked agent frameworks. Powers the dormant/archived badges on the Agents panel.
- Cadence
- Daily
- Last seen
- 4h ago
- Surfaces in
- Agents panel · Feed
Research (2)
arXiv submissions in the cs.AI + cs.LG categories. Recency-only — Gawk does not re-rank papers.
Twenty most-recent cs.AI + cs.LG submissions on arXiv, newest first. No re-ranking.
- Cadence
- Daily
- Last seen
- 9m ago
- Surfaces in
- Research panel · Feed
Annual reference report on global AI investment, research output, adoption, and safety, published by the Stanford Institute for Human-Centered AI (HAI). Static reference data — not polled. Per-edition data + scripts mirrored to the public github.com/ai-index-hai-stanford repository.
- Cadence
- Weekly
- Last seen
- 21d ago
- Surfaces in
- External reference — cited where AI Index figures appear
AI Labs (2)
36 curated AI labs with verifiable HQ coordinates, sized on the globe by 7-day GitHub event activity across their flagship repos.
36 curated AI labs across 10+ countries with verifiable HQ coordinates.
- Cadence
- Weekly
- Last seen
- 9m ago
- Surfaces in
- AI Labs panel + globe layer
7-day activity counts on flagship repos for every lab in the registry.
- Cadence
- Every 6 hours
- Last seen
- 9m ago
- Surfaces in
- AI Labs panel · sizes lab dots
AI Publishers (7)
Editor-curated AI publishers with verifiable HQ coordinates. Mix of regional press (Heise DE, Synced CN, MarkTechPost IN, Analytics Vidhya IN, The Register UK, MIT TR US) and practitioner newsletters (latent.space SF). Each feed is parsed by the deterministic ingest pipeline; no LLM relevance scoring.
AI/ML headlines from The Register (UK tech press, London).
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
German-language AI headlines from Heise Online, filtered through the same AI-keyword allowlist.
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
English-language AI research coverage with strong China/global lab depth (editorial team, Beijing).
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
AI research news from MarkTechPost (India-based editorial team).
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
MIT Technology Review's AI topic feed (Cambridge MA, US editorial counterweight).
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
AI engineering newsletter + podcast by swyx and Alessio Fanelli (San Francisco). Practitioner-focused: model releases, agent architecture, eval methodology.
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map
Indian data-science / AI publisher (Gurgaon). Tutorial and news coverage of the AI / ML stack, complementary to MarkTechPost.
- Cadence
- Every 1–5 minutes
- Last seen
- 1h ago
- Surfaces in
- Wire panel + map