This refresh pulls the actual 7-day chat_decision_audit numbers (council_v2 + single routing modes) at build time, lays them out as a 4-lane funnel (AI tokens / NetSuite API / Vectorize / D1+KV+R2 → CostCapDO daily cap → total $/day), and exposes per-pillar spend, free-tier ceilings, and the kill-switch braking surface. Replaces the static rounded estimates in cost-surface.html.
| routing_mode | queries | spend | avg / query | avg latency |
|---|---|---|---|---|
| council_v2 | 564 | $9.32 | $0.0165 | 33,838 ms |
| single | 551 | $1.87 | $0.0034 | 5,118 ms |
| total | 1,115 | $11.20 | $0.0100 | — |
As of 2026-05-25 · SELECT routing_mode, COUNT(*) AS n, SUM(cost_usd) AS spend FROM chat_decision_audit WHERE created_at >= date('now','-7 days') GROUP BY routing_mode
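The per-mode averages and the total row follow directly from the query result. A minimal sketch of that derivation, assuming each row carries the `routing_mode`, `n`, and `spend` columns from the SELECT above; the `summarize` helper and its types are hypothetical and not taken from the build script. Summing the rounded spends gives $11.19, so the $11.20 in the table presumably comes from unrounded values.

```typescript
// Row shape matching the SELECT above (routing_mode, n, spend).
interface AuditRow {
  routing_mode: string;
  n: number;
  spend: number;
}

interface SummaryRow {
  routing_mode: string;
  queries: number;
  spend: number;
  avgPerQuery: number;
}

// Hypothetical helper: one summary row per routing mode, plus a "total" row.
function summarize(rows: AuditRow[]): SummaryRow[] {
  const out: SummaryRow[] = rows.map((r) => ({
    routing_mode: r.routing_mode,
    queries: r.n,
    spend: r.spend,
    avgPerQuery: r.n > 0 ? r.spend / r.n : 0,
  }));
  const queries = rows.reduce((acc, r) => acc + r.n, 0);
  const spend = rows.reduce((acc, r) => acc + r.spend, 0);
  out.push({
    routing_mode: "total",
    queries,
    spend,
    avgPerQuery: queries > 0 ? spend / queries : 0,
  });
  return out;
}

// Values copied from the table above (already rounded to cents).
const summary = summarize([
  { routing_mode: "council_v2", n: 564, spend: 9.32 },
  { routing_mode: "single", n: 551, spend: 1.87 },
]);

for (const row of summary) {
  console.log(
    `${row.routing_mode}: ${row.queries} queries, ` +
      `$${row.spend.toFixed(2)}, $${row.avgPerQuery.toFixed(4)}/query`
  );
}
```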
| Service | Ceiling | Current usage | Headroom |
|---|---|---|---|
| Vectorize queries | 5M / month free | ~30K queries/month | 99% headroom |
| D1 row reads | 25M / day on Workers Paid | ~50K / day | 99% headroom |
| KV reads | 10M / day free | ~10K / day | 99% headroom |
| R2 Class A ops | 10M / month on Workers Paid | ~5K / month | 99% headroom |
| Workers AI Llama 3.3 70B | 10K neurons/day free | ~3K / day | 70% headroom |
| Anthropic API | 60 req/min workspace limit | ~0.3 RPS peak | 97% headroom |
| CostCapDO daily cap | $5 / day (configurable) | ~$1.60 / day average | 68% headroom |
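The headroom column appears to be the unused share of each ceiling. A minimal sketch of that arithmetic, assuming usage and ceiling are expressed in the same unit and period; `headroomPct` is a hypothetical name, not a function from the dashboard code.

```typescript
// Headroom = percentage of the ceiling that is still unused.
function headroomPct(usage: number, ceiling: number): number {
  if (ceiling <= 0) {
    throw new RangeError("ceiling must be positive");
  }
  return (1 - usage / ceiling) * 100;
}

// CostCapDO row: $11.20 over 7 days is about $1.60/day against a $5/day cap.
const avgSpendPerDay = 11.2 / 7;
console.log(Math.round(headroomPct(avgSpendPerDay, 5))); // 68

// Workers AI row: ~3K of 10K neurons/day.
console.log(Math.round(headroomPct(3_000, 10_000))); // 70
```

Usage and ceiling must share a unit before they are compared; a per-second rate, for example, has to be converted to requests per minute before it is set against a per-minute limit.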
claude-sonnet-4.6 (Anthropic) · kimi-k2.5 (free via Cursor) · @cf/meta/llama-3.3-70b-instruct
customscript_gfs_platform_query
NS_PUSH_QUEUE · drain via /api/ns-push/drain
gfs-knowledge
env.COST_CAP_DO
src/durable_objects.ts
/api/ai/caps
POST /api/ai/caps/reset
POST /api/ai/caps/emergency-bump
kill:ns_writes · kill:proposed_apply · kill:email_intake · kill:external_portals · kill:high_risk_ops
GET /api/kill-switches · POST /api/kill-switches/flip
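To illustrate how a daily cap and kill switches can act as a brake on spend, here is a small in-memory sketch. It is an assumption-laden model, not the contents of src/durable_objects.ts: the `DailyCostCap` class, its methods, and the UTC day rollover are all hypothetical, and a real Durable Object would persist this state rather than hold it in fields.

```typescript
// Hypothetical in-memory model of a daily spend cap plus kill switches.
// Names and behavior are illustrative only; they are not from the codebase.
class DailyCostCap {
  private capUsd: number;
  private spentUsd = 0;
  private day: string;
  private killed = new Set<string>();

  constructor(capUsd: number, now: Date = new Date()) {
    this.capUsd = capUsd;
    this.day = DailyCostCap.dayKey(now);
  }

  // UTC calendar day, e.g. "2026-05-25" (assumed rollover boundary).
  private static dayKey(d: Date): string {
    return d.toISOString().slice(0, 10);
  }

  // Record a charge if it fits under today's cap; return false to refuse it.
  tryCharge(costUsd: number, now: Date = new Date()): boolean {
    const today = DailyCostCap.dayKey(now);
    if (today !== this.day) {
      this.day = today;
      this.spentUsd = 0;
    }
    if (this.spentUsd + costUsd > this.capUsd) {
      return false;
    }
    this.spentUsd += costUsd;
    return true;
  }

  // Turn a named kill switch on or off, e.g. "kill:ns_writes".
  flip(name: string, on: boolean): void {
    if (on) {
      this.killed.add(name);
    } else {
      this.killed.delete(name);
    }
  }

  isKilled(name: string): boolean {
    return this.killed.has(name);
  }

  remainingUsd(): number {
    return Math.max(0, this.capUsd - this.spentUsd);
  }
}

const day1 = new Date("2026-05-25T12:00:00Z");
const day2 = new Date("2026-05-26T00:00:01Z");
const cap = new DailyCostCap(5, day1);

const firstOk = cap.tryCharge(4.99, day1); // fits under the $5 cap
const secondOk = cap.tryCharge(0.0165, day1); // would exceed the cap, refused
const nextDayOk = cap.tryCharge(0.0165, day2); // counter resets on a new UTC day

cap.flip("kill:ns_writes", true);
console.log(firstOk, secondOk, nextDayOk, cap.isKilled("kill:ns_writes"));
```

Refusing the charge before it is recorded keeps the counter at or below the cap, which is the property a hard daily limit needs; whether the real CostCapDO rejects, queues, or downgrades over-cap requests is not stated in the source.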