Holodeck Simulation Engine

The Science of
Collective AI
Intelligence.

180 archetype-distinct AI experts across 23 domain presets, each sampled across hundreds of independent reasonings. Each archetype carries a distinct worldview, risk tolerance, and analytical framework. Deliberating until consensus emerges — or exposing genuine uncertainty.

See the research →Try it free →

The Problem

The problem with
single-point forecasting.

“When the Fed's GDP forecast missed by 4.2 points in Q4 2025, prediction markets were 96% confident in the wrong outcome. Our swarm was at 57% — uncertain because the data was genuinely uncertain.”

The Architecture

How it works.

→

The Science

The math.

BRIER SCORE — Measures forecast accuracy: (predicted probability − actual outcome)². Lower is better. A score of 0 is perfect. A random coin flip scores 0.25. Holodeck is built depth-first for real estate, macro, and private markets — the domains with structured ground-truth data. The swarm has dedicated archetype clusters for those, and they perform: real estate 0.168 (69 cross-domain time-gated questions across 8 sub-domains of real estate), macro 0.18 (90 resolved, pre-time-gating audit), NCAA backtest 0.148 (62 games). Everything else — broad sports, crypto, geopolitics — runs on a general-purpose archetype mix that’s still in development. The track-record page breaks this out by domain.

The Results

Real questions. Resolved outcomes.

We ran our swarm against resolved Kalshi prediction markets. Three questions. Real outcomes. Here's what happened.

QUESTION	OUTCOME	KALSHI	HOLODECK	VERDICT
CPI > 0.8% Mar 2026	✓ YES	73%	44.5%	Markets win
CPI > 0.9% Mar 2026	✗ NO	33%	43.6%	Markets win
GDP > 1.5% Q4 2025	✗ NO	96%	57.7%	🎯 Swarm wins

Brier Score — where Holodeck is built deep: 0.168 on real estate (69 cross-domain time-gated questions across 8 sub-domains, 39% better than the phase-1 baseline), 0.18 on macro (90 resolved, pre-time-gating audit), 0.148 on the 2026 NCAA Tournament backtest (62 games). Aggregate across all domains reflects the mix of depth-first domains (real estate, macro) with general-purpose extension into sports, crypto, geopolitics (where dedicated archetype clusters are a roadmap item, not a current focus).

The Agents

Archetype gallery.

Each agent is a behavioral specification, not a persona. We define how it reasons, not what it believes.

180 archetype-distinct experts across 23 domain presets, each sampled hundreds of times per question. Each preset has its own archetype library, calibrated against resolved outcomes.

Twelve Prediction Environments

Every domain has its own specialist library.

The Paper

What's Under The Hood

The public demo is one layer.

The full engine runs 22,000+ lines of research infrastructure across 12 prediction domains. What you see in the demo is one question. The full system runs entire market sweeps.

The public demo runs a subset of the archetype library. Enterprise gets the full engine: custom domains, private deployment, dedicated calibration. Research partnerships available.

Start Predicting

Holodeck is live.

Free tier available. Enterprise plans for teams and institutions.

Try it free →Enterprise access →

Already have an account? Sign in →

The Science ofCollective AIIntelligence.

The problem withsingle-point forecasting.