Transparency

Sources

Every number on Gawk traces to a public source. This page lists all 40 data sources currently feeding the dashboard, plus the 38 curated AI labs that make up the AI Labs layer. Each row shows what the source tracks, how often it’s polled, and when it was last seen live.

The rules behind the feed’s severity ranking live on /methodology. The full typed registry (sanity ranges, caveats, verifiedAt dates) is summarized publicly at /data-sources.md.

Sources: 40
Categories: 10
Curated labs: 38
Live now: 32

Tool Status (6)

Public status pages for the AI coding tools tracked on the dashboard. Polled every 5 minutes through `/api/status` with a last-known cache fallback.

  • Operational state + active incidents for Claude API and Claude Code (CLI).

    Cadence
    Every 1–5 minutes
    Last seen
    9m ago
    Surfaces in
    Tools panel
  • Per-component status for ChatGPT, OpenAI API, Codex Web/API, CLI, VS Code extension.

    Cadence
    Every 1–5 minutes
    Last seen
    9m ago
    Surfaces in
    Tools panel
  • Active OpenAI status-page incidents (investigating / identified / monitoring).

    Cadence
    Every 1–5 minutes
    Last seen
    9m ago
    Surfaces in
    Tools panel
  • Open issue count on anthropics/claude-code, surfaced as community-pressure on the Claude Code card.

    Cadence
    Hourly
    Last seen
    9m ago
    Surfaces in
    Tools panel · Claude Code card
  • GitHub platform components, filtered for the `Copilot` component used by the Copilot health card.

    Cadence
    Every 1–5 minutes
    Last seen
    9m ago
    Surfaces in
    Tools panel · Copilot card
  • Overall page status + incidents for Windsurf (Cascade + Tab).

    Cadence
    Every 1–5 minutes
    Last seen
    9m ago
    Surfaces in
    Tools panel · Windsurf card
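
A minimal sketch of what a "last-known cache fallback" could look like, assuming an in-memory store and a caller-supplied fetch function. The class name `LastKnownCache`, the `stale` flag, and the payload shape are all hypothetical; the actual `/api/status` implementation is not shown on this page.

```python
from typing import Callable, Optional


class LastKnownCache:
    """Keeps the most recent successful payload per source (illustrative only)."""

    def __init__(self) -> None:
        self._store: dict[str, dict] = {}

    def fetch(self, source_id: str, fetcher: Callable[[], dict]) -> Optional[dict]:
        try:
            payload = fetcher()
        except Exception:
            # Upstream failed: serve the last good payload, flagged as stale.
            cached = self._store.get(source_id)
            if cached is None:
                return None  # nothing cached yet, so there is nothing to fall back to
            return {**cached, "stale": True}
        # Upstream succeeded: remember the payload for future fallbacks.
        self._store[source_id] = payload
        return {**payload, "stale": False}


def healthy() -> dict:
    return {"status": "operational"}


def unreachable() -> dict:
    raise TimeoutError("status page unreachable")


cache = LastKnownCache()
first = cache.fetch("claude", healthy)       # fresh payload
second = cache.fetch("claude", unreachable)  # falls back to the cached payload
```

The point of the design is that a failed poll degrades to a flagged, older reading instead of an empty card.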

Platform Infrastructure (4)

Public status pages for the four services Gawk itself runs on. Surfaced operator-side at /admin only — the public Tool Health card grid stays AI-focused.

  • Vercel Status (on demand)

    Vercel platform health (Dashboard, Builds, Edge Network, Functions). Surfaces on /admin only.

    Cadence
    Every 1–5 minutes
    Last seen
    Fetched per request — no scheduled poll
    Surfaces in
    /admin · Platform health
  • Supabase platform health (Database, Auth, Storage, Realtime, Edge Functions). Surfaces on /admin only.

    Cadence
    Every 1–5 minutes
    Last seen
    Fetched per request — no scheduled poll
    Surfaces in
    /admin · Platform health
  • Cloudflare top-line indicator. Per-datacenter components are intentionally not consumed component-by-component. Surfaces on /admin only.

    Cadence
    Every 1–5 minutes
    Last seen
    Fetched per request — no scheduled poll
    Surfaces in
    /admin · Platform health
  • Upstash platform health by region (EU-CENTRAL-1, US-EAST-1, etc.) + product (Redis, QStash, Vector). Surfaces on /admin only.

    Cadence
    Every 1–5 minutes
    Last seen
    Fetched per request — no scheduled poll
    Surfaces in
    /admin · Platform health

Code Activity (6)

GitHub events, archive backfills, and discovery surfaces that drive the live globe and the curated repo registry.

  • Live GitHub PushEvent / PR / Issues / Releases / Forks / Stars across every public repo. Drives the globe pulse.

    Cadence
    Real-time (under 30s)
    Last seen
    2h ago
    Surfaces in
    Live globe + Wire panel
  • Hourly complete archive of every public GitHub event, used to backfill the globe on cold start.

    Cadence
    Hourly
    Last seen
    2h ago
    Surfaces in
    Globe cold-start backfill
  • Per-repo file-existence + first-500-bytes shape probe for AI-tool config files (CLAUDE.md, .cursorrules, ...).

    Cadence
    Event-driven
    Last seen
    2h ago
    Surfaces in
    Globe colouring + Registry verifier
  • Filename discovery for the six AI-tool config formats; feeds the registry verifier.

    Cadence
    Every 6 hours
    Last seen
    5h ago
    Surfaces in
    Repo registry
  • Topic-tag discovery (claude / cursor / aider / windsurf / llm / ai-agent / ...) for registry candidates.

    Cadence
    Hourly
    Last seen
    3h ago
    Surfaces in
    Repo registry
  • Reverse-dependents for six AI npm packages (anthropic, openai, langchain, ...) feeding registry candidates.

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    Repo registry

Discussion (3)

Community chatter — currently Hacker News with the deterministic AI-keyword filter applied before ingest.

  • Recent HN stories that match a deterministic AI keyword + domain allowlist.

    Cadence
    Every 1–5 minutes
    Last seen
    2h ago
    Surfaces in
    Wire panel · Feed
  • Top-of-day posts from r/LocalLLaMA via Reddit's Atom feed. AI-relevance presumed from the sub's charter (no keyword filter applied).

    Cadence
    Every 1–5 minutes
    Last seen
    1h ago
    Surfaces in
    Feed · NEWS cards
  • Top-of-day posts from r/ClaudeAI via Reddit's Atom feed. Anthropic-adjacent discussion; AI-relevance presumed from the sub's charter.

    Cadence
    Every 1–5 minutes
    Last seen
    1h ago
    Surfaces in
    Feed · NEWS cards
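
As an illustration of the deterministic keyword + domain allowlist filter described for Hacker News stories, here is a small sketch. The keyword list, the allowlisted domains, and the rule that either match is enough are assumptions made for this example, not Gawk's actual configuration.

```python
import re
from urllib.parse import urlparse

# Hypothetical lists for illustration only.
AI_KEYWORDS = ("llm", "gpt", "claude", "transformer")
DOMAIN_ALLOWLIST = {"arxiv.org", "huggingface.co", "anthropic.com", "openai.com"}

# Whole-word, case-insensitive match on any keyword.
_KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(k) for k in AI_KEYWORDS) + r")\b",
    re.IGNORECASE,
)


def is_ai_story(title: str, url: str) -> bool:
    """Return True if the URL's host is allowlisted or the title has a keyword."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    if host in DOMAIN_ALLOWLIST:
        return True
    return _KEYWORD_RE.search(title) is not None
```

Because the check uses only fixed lists and a regular expression, the same story always produces the same decision, which is what "deterministic" means here.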

Models (3)

Where models live (HuggingFace), what they cost (OpenRouter usage), and how they rank head-to-head (Chatbot Arena).

SDK Adoption (6)

Package-registry download counters for the AI SDK slate. Daily snapshots feed within-package week-over-week deltas.

  • Rolling download counters for seven AI Python SDKs (anthropic, openai, langchain, transformers, torch, huggingface-hub, diffusers).

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    SDK Adoption panel · Feed
  • Rolling download counters for five AI JavaScript SDKs (@anthropic-ai/sdk, openai, @langchain/core, ai, llamaindex).

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    SDK Adoption panel · Feed
  • Recent (90d) + all-time download counters for four Rust ML crates (candle-core, burn, tch, ort).

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    SDK Adoption panel · Feed
  • All-time pull counts for two AI inference images (ollama/ollama, vllm/vllm-openai).

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    SDK Adoption panel · Feed
  • 30/90/365-day install counters for the ollama Homebrew formula.

    Cadence
    Every 6 hours
    Last seen
    4h ago
    Surfaces in
    SDK Adoption panel · Feed
  • Cumulative install counts for six AI coding-assistant VS Code extensions (Copilot, Continue, Cody, Codeium, Cline, TabNine).

    Cadence
    Every 6 hours
    Last seen
    5h ago
    Surfaces in
    SDK Adoption panel · Feed
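
A within-package week-over-week delta can be sketched as follows, assuming one stored snapshot per day per package and a comparison against the snapshot taken exactly seven days earlier. The function name and the choice to return `None` when either snapshot is missing are assumptions for this example.

```python
from datetime import date, timedelta
from typing import Optional


def week_over_week(snapshots: dict[date, int], today: date) -> Optional[float]:
    """Percent change between today's snapshot and the one from 7 days earlier.

    Both values come from the same package, so counters with different
    windows (weekly, 90-day, all-time) are never compared with each other.
    """
    current = snapshots.get(today)
    previous = snapshots.get(today - timedelta(days=7))
    if current is None or previous is None or previous == 0:
        return None  # not enough history, or no baseline to divide by
    return (current - previous) * 100.0 / previous


history = {date(2025, 1, 1): 1000, date(2025, 1, 8): 1100}
delta = week_over_week(history, date(2025, 1, 8))
```

With the sample history above, `delta` is a 10% increase.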

Agents (1)

Eight agent frameworks: six active (LangGraph, CrewAI, smolagents, AutoGen, OpenAI Agents, Pydantic AI) and two tombstones (AutoGPT legacy, Sweep dormant). Per-repo metadata feeds the Agents panel. The PyPI + npm download counters listed under SDK Adoption are the same numbers, attributed twice because both panels consume them.

Research (2)

arXiv submissions in the cs.AI + cs.LG categories. Recency-only — Gawk does not re-rank papers.

  • Twenty most-recent cs.AI + cs.LG submissions on arXiv, newest first. No re-ranking.

    Cadence
    Daily
    Last seen
    9m ago
    Surfaces in
    Research panel · Feed
  • Annual reference report on global AI investment, research output, adoption, and safety, published by the Stanford Institute for Human-Centered AI (HAI). Static reference data — not polled. Per-edition data + scripts mirrored to the public github.com/ai-index-hai-stanford repository.

    Cadence
    Weekly
    Last seen
    21d ago
    Surfaces in
    External reference — cited where AI Index figures appear

AI Labs (2)

38 curated AI labs with verifiable HQ coordinates, sized on the globe by 7-day GitHub event activity across their flagship repos.
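
One way to turn 7-day event activity into a marker size is sketched below. The square-root scaling, the radius bounds, and the cap of 10,000 events are assumptions chosen for this example; the page does not state the actual mapping.

```python
import math
from datetime import datetime, timedelta, timezone


def seven_day_activity(event_times: list[datetime], now: datetime) -> int:
    """Count events from a lab's flagship repos that fall in the last 7 days."""
    cutoff = now - timedelta(days=7)
    return sum(1 for t in event_times if cutoff < t <= now)


def marker_radius(activity: int, min_r: float = 2.0, max_r: float = 12.0,
                  cap: int = 10_000) -> float:
    """Map an event count to a radius; sqrt keeps very busy labs from dominating."""
    frac = math.sqrt(min(max(activity, 0), cap) / cap)
    return min_r + (max_r - min_r) * frac


now = datetime(2025, 1, 8, tzinfo=timezone.utc)
events = [now - timedelta(days=1), now - timedelta(days=6), now - timedelta(days=9)]
count = seven_day_activity(events, now)  # the 9-day-old event is excluded
```

A quiet lab still gets the minimum radius, so every curated lab stays visible on the globe under this scheme.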

AI Publishers (7)

Editor-curated AI publishers with verifiable HQ coordinates. Mix of regional press (Heise DE, Synced CN, MarkTechPost IN, Analytics Vidhya IN, The Register UK, MIT TR US) and practitioner newsletters (latent.space SF). Each feed is parsed by the deterministic ingest pipeline; no LLM relevance scoring.
