Bitbanshee's AI Playground

Building, training, monitoring, and theorizing about AI
Bit & Pulse Dashboard
Bit & Pulse Private Online
Self-hosted agent control plane + autonomous daemon for persistent AI workers. FastAPI control plane, Pulse autonomous supervisor, tmux-routed Claude Code sessions, and a dashboard at lab.bitbanshee.com. Watches the lab, decides when to think, assembles context, dispatches work, tracks cases, and enforces cost gates. Single EC2 node behind Cloudflare Access.
agent-ops context-engineering governed-autonomy attention-management pulse
ML Training Dashboard
Dual-Process Language Model v1 Complete v2 Complete v3 Complete
Single Transformer operating in two cognitive modes — fast parallel generation via masked diffusion (System 1) and slow sequential reasoning via autoregressive decoding (System 2). A trained confidence head decides when to escalate.
3 runs × 50,000 steps · $99.52 total cost · 43 spot instances · 63% savings
AR PPL
27.99
target <40 (v3)
AUROC
0.870
target >0.75 (v3)
ECE
0.012
target <0.05 (v3)
Diff Loss
4.16
target 4.0 (v3)
S1 Accuracy
26.5%
target 40% (v3)
The Observer Dashboard
The Observer Collecting Data
Read-only intelligence dashboard monitoring Moltbook, a social network of autonomous AI agents. Quad-model analysis engine (GPT-5-mini, Claude, Gemini, Grok) scores every post across four dimensions — OPSEC failures, deception patterns, sycophancy loops, and operator influence — with composite threat profiling, correlation detection, and human ground-truth calibration. Currently in v1 data collection; v2 will incorporate collected findings into improved detection models.
Structured Memory Engine
Structured Memory Engine Archive
Proof-of-concept for persistent, cross-session AI memory. Goes beyond standard RAG by treating conversation history as structured semantic memory — adaptive retrieval thresholds, dual-database sync (PGVector local + Pinecone cloud), and real-time memory visualization. Built with TypeScript, React, OpenAI GPT-4o, and text-embedding-3-small.
ai context-engineering memory
Silicon Strategy — AGI Digital Publication
Silicon Strategy Published
Digital publication exploring the convergence of advancing AI capabilities and declining human cognitive oversight — arguing that a critical Safety Inversion is approaching. Draws on PIAAC, NAEP, and empirical data to examine AGI, oversight capacity, and the business of artificial intelligence. Includes peer-reviewed paper: When We Outsourced Thinking: AGI, Oversight, and the Business of Artificial Intelligence (SSRN, 2026).
agi ai-safety cognitive-oversight safety-inversion

More experiments coming soon