⌬ AI-ENGINEER HACKATHON · DEEP-RESEARCH PLANNER · APR 2026

The Architect's
co-pilot.

PLAN.AI takes a one-line idea and returns a citation-grounded technical blueprint in 22 sections, a Mermaid C4 diagram, and a Shipping Floor that compiles the blueprint into Architect / Executor / Auditor tasks. Underneath: nine source integrations, a three-depth research engine including a STORM-lite multi-persona orchestrator, and a Vercel-ready deploy where every API key is supplied client-side.

ReactViteTypeScriptVercel Edge OpenAI / OpenRouterTavilyFirecrawlMermaid C4

· captured 10 May 2026 · unedited workspace screenshots

Source Coverage

Nine research surfaces.

Code, papers, threads, encyclopedias, and the open web — each surface is a separate integration with its own rate-limit, ranking, and citation format. The synthesizer treats them as typed evidence and inlines source anchors directly in the blueprint prose.

⚙

GitHub

Repo search, README extraction, language stats. Pulls real-world reference implementations into the synthesis.

📜

arXiv

Pre-print search by topic with abstract pull-through. Used as the primary academic substrate for novel methods.

🎓

Semantic Scholar

Citation graph + venue metadata. Disambiguates and ranks methods by influence.

💬

Community signal — what practitioners actually deploy and where the sharp edges are.

⚡

Stack Exchange

High-precision technical answers. Tag-filtered, score-weighted.

📚

Wikipedia

Disambiguation + canonical-definition substrate. Anchors specialist terms before the synthesizer expands them.

📰

Hacker News

Fresh discourse on tooling, deployment, and infra trade-offs at the engineering tier.

🧭

Tavily

Search-grade web retrieval with answer extraction. The general-web fallback when the specialist sources go quiet.

🔥

Firecrawl

Structured page-to-markdown extraction for arbitrary URLs. Used when the user pastes a link directly into a workspace.

Research Depths

Quick. Standard. STORM.

Three depths, three different cost / quality trade-offs. Quick is cheap and fast for spike research. Standard is the everyday one-call pipeline. Deep wakes up a STORM-lite orchestrator that recruits three personas and runs them in parallel against the source surfaces.

Tier 01

Quick

Single LLM pass against a curated 2-source mix. Useful for "what is X, who works on it, where do I read more" briefs.

~1 LLM call
2 source surfaces
Inline citations only
< 30 s typical

Tier 02

Standard

Plan → fan-out → synthesize. The agent first decomposes the topic into 4 — 6 questions, fans out to all 9 sources, then synthesizes a 22-section blueprint with cited prose.

4 — 6 fan-out queries
All 9 source surfaces
22-section blueprint
Mermaid C4 diagram inline

Tier 03

STORM-lite Deep

A multi-persona orchestrator (Engineer / PM / SRE) generates 6 — 9 parallel fan-out queries with role-specific framings, then a synthesizer reconciles the three perspectives into a single cited blueprint.

6 — 9 fan-out queries
Three persona viewpoints
Cross-persona reconciliation
Workspace-branchable findings

⌬ STORM-lite personas

ENGINEER

Asks how it's actually built. Pulls implementations, dependency graphs, perf trade-offs, gotchas.

PRODUCT

Asks who pays for it. Pulls competitive landscape, user pain, pricing signals, adoption traction.

SRE

Asks what breaks at 3 AM. Pulls deployment topology, observability requirements, failure modes, capacity planning.

The Output

22 sections. One blueprint.

Every successful run produces a single artefact: a 22-section technical blueprint. Sections cover problem framing, related work, system architecture, data layer, model layer, deployment, observability, threat model, roll-out plan, and explicit risks. Every claim is anchored by an inline citation back to a source surface.

Block · Diagram

Mermaid C4 inlined

The blueprint embeds a Mermaid C4 system diagram with components, containers, and data-flow arrows. Renders client-side; users can fork-and-edit in place.

Block · Citations

Inline source anchors

Every concrete claim points back to a numbered source from the run's evidence pool. No floating assertions; clicking a citation opens the underlying source in a side-pane.

Block · Branching

Workspace branching + merge

Right-click any reference to spawn a sibling workspace explored independently, then fold its findings back into the parent blueprint with a merge action.

Block · Topology

React-Flow + dagre layout

Workspace topology is rendered as a React-Flow graph with dagre auto-layout. The whole research session is itself a navigable, persistable structure.

Block · Voice

ElevenLabs narration

Optional voice-over of any section via ElevenLabs. Useful for solo review on a walk; the blueprint reads itself back to you, citations included.

Block · Persistence

localStorage workspaces

All workspaces and the current blueprint persist locally — no auth required, no server-side user data, no PII leaving the browser.

Real Artefacts

A sample run — GuardPilot.

These are unedited outputs from a real PLAN.AI run on 12–13 May 2026 — the topic was a 24-hour security-loop hackathon idea (CCTV → YOLO tracking → simulated PTZ → OpenAI Vision verification → audit trail). The diagram on the left is the live render of the system architecture; the document on the right is the full 22-section blueprint, served as a side artefact instead of being inlined into this page.

▶ Sample diagram · Mermaid C4 render

Recorded 13 May 2026 · 02:27. A live walkthrough of the rendered diagram and the planner UI — the same architecture that maps to the Architect / Executor / Auditor lanes in the Shipping Floor below.

▤ Sample plan · 22 sections · 853 lines

GuardPilot — 24-hour security-loop blueprint

STORM-lite depth, three personas, inline citations, embedded Mermaid C4. The unedited PLAN.AI output — not a hand-curated demo. Preview the full rendered document in-browser with diagrams, or download the raw markdown.

Executive Summary
Context & Problem
Goals
Non-Goals
Stakeholders & Personas
Constraints
Solution Strategy
Architecture
Quality Attributes — NFRs
Cross-Cutting Concerns
Security Considerations
Operational Considerations
Stack Decisions — Mini-ADRs
Phased Rollout & Milestones
Cost Estimate
Success Metrics
Risks & Technical Debt
Open Questions
Alternatives Considered
ADR Log
Glossary
References

▤ Preview rendered → ⬇ Download .md

From Plan To Build

The Shipping Floor.

A blueprint is only useful if it ships. The Shipping Floor compiles every successful run into a three-lane task list — Architect, Executor, Auditor — directly executable by an upstream agentic runner. The same Architect / Executor / Auditor pattern Aiko's factory implements, here exposed as a planning artefact rather than a runtime.

⌬ Lane 01 · Plan

Architect

Tasks that define and ratify scope: spec writing, interface contracts, decision records, Mermaid diagrams. The blueprint pre-populates this lane from sections 1 — 8.

⚙ Lane 02 · Build

Executor

Tasks that produce code: scaffold, implement, integrate, write fixtures. Pre-populated from sections 9 — 16 of the blueprint.

⚖ Lane 03 · Verify

Auditor

Tasks that gate-keep delivery: tests, lint, security scan, deployment validation, observability check. Pre-populated from sections 17 — 22.

Production Path

Vite. Vercel. Zero secrets.

The whole stack ships to Vercel as a static React build with a single serverless catch-all handler. No backend DB, no auth, no embedded API keys — every key the user wants to use is entered client-side and persisted only in localStorage. Public demos can be cloned and run with the cloner's own keys.

Build

Vite + React + TypeScript

Dev Proxy

Vite proxy → /api

Edge Function

api/[...path].ts catch-all

State

localStorage only

Streaming

SSE markdown chat

Render

Block-memoized markdown

Voice

ElevenLabs (optional)

Embedded keys

None — client-supplied

Full Walk-Through

Watch the full run.

Twelve minutes from one-line idea to a downloadable 22-section blueprint with rendered Mermaid C4. No cuts, no edits — the same path the Shipping Floor consumes upstream.

▶ PLAN.AI · Full End-to-End Walk-Through

Recorded 14 May 2026 — unedited end-to-end run: idea → research → blueprint → diagram → shipping floor.

The Architect's
co-pilot.

Nine research surfaces.

GitHub

arXiv

Semantic Scholar

Reddit

Stack Exchange

Wikipedia

Hacker News

Tavily

Firecrawl

Quick. Standard. STORM.

Quick

Standard

STORM-lite Deep

22 sections. One blueprint.

Mermaid C4 inlined

Inline source anchors

Workspace branching + merge

React-Flow + dagre layout

ElevenLabs narration

localStorage workspaces

A sample run — GuardPilot.

▶ Sample diagram · Mermaid C4 render

GuardPilot — 24-hour security-loop blueprint

The Shipping Floor.

Architect

Executor

Auditor

Vite. Vercel. Zero secrets.

Watch the full run.

▶ PLAN.AI · Full End-to-End Walk-Through

Portifolio

The Architect'sco-pilot.

Nine research surfaces.

GitHub

arXiv

Semantic Scholar

Reddit

Stack Exchange

Wikipedia

Hacker News

Tavily

Firecrawl

Quick. Standard. STORM.

Quick

Standard

STORM-lite Deep

22 sections. One blueprint.

Mermaid C4 inlined

Inline source anchors

Workspace branching + merge

React-Flow + dagre layout

ElevenLabs narration

localStorage workspaces

A sample run — GuardPilot.

▶ Sample diagram · Mermaid C4 render

GuardPilot — 24-hour security-loop blueprint

The Shipping Floor.

Architect

Executor

Auditor

Vite. Vercel. Zero secrets.

Watch the full run.

▶ PLAN.AI · Full End-to-End Walk-Through

Portifolio

The Architect's
co-pilot.