Augur
Multi-Agent Ensemble Forecasting — 17 Specialists, One Answer
The Problem
Probabilistic forecasting is one of the most valuable and least commoditised capabilities in professional services. Hedge funds pay analysts hundreds of thousands of dollars per year to produce calibrated probability estimates for macro events, policy changes, and market outcomes. Strategy consultants build scenario models. Political risk firms charge per-report for geopolitical assessments.
The problem is not that AI can't forecast — it's that a single LLM produces a single perspective, with no framework for calibration, no domain specialisation, and no transparency into how it arrived at a number. Superforecasting research (Tetlock, IARPA) shows that the best forecasters are those who aggregate independent estimates from people with diverse analytical frameworks. No AI product has done this seriously.
Cindicator raised $15M in 2017 to crowdsource human predictions and blend them with ML. The crowd was too noisy and incentive-misaligned. The model never got good enough to justify the operational cost.
The Solution
Augur fans any probabilistic question out to a mesh of 17 domain AI specialists simultaneously. Each specialist applies a distinct analytical playbook — loaded from a TOML manifest at runtime — and returns a structured forecast:
{
"probability": 0.72,
"confidence": 0.85,
"reasoning": "Fed futures are pricing ~65% probability of a cut...",
"key_assumptions": ["Inflation stays below 3.5%", "No major financial shock"],
"key_uncertainties": ["Q2 employment data not yet released"],
"would_change_if": "A surprise CPI print above 3.8% would shift my estimate to ~0.35"
}Responses aggregate via confidence-weighted average. A specialist with 0.9 confidence counts 9x more than one with 0.1. The result: a calibrated ensemble probability with transparent per-specialist reasoning.
The Specialist Mesh
| Specialist | Domain | Analytical Framework |
| Reasoner | Logic & inference | Deductive/inductive/abductive reasoning, steelmanning |
| Intelligence Analyst | OSINT & geopolitics | ACH (Analysis of Competing Hypotheses), NATO source grading |
| Market Analyst | Financial markets | Technical + fundamental, sentiment analysis |
| Researcher | Information synthesis | Multi-source corroboration, evidence grading |
| Data Scientist | Quantitative | Statistical reasoning, base rates, Bayesian updating |
| + 12 others | Various | Domain-specific playbooks |
Each specialist's playbook is defined in a TOML manifest. Update the manifest, update the specialist's behavior — no code changes.
API
POST /v1/forecast
{
"question": "Will the Fed cut rates before September 2026?",
"context": "Current fed funds rate: 4.25-4.5%...",
"specialists": ["reasoner", "market_analyst", "intelligence_analyst"],
"synthesize": true
}
→ {
"ensemble_probability": 0.68,
"ensemble_confidence": 0.74,
"consensus": "lean_yes",
"synthesis": "Specialists broadly agreed on a likely cut, with market_analyst most bullish (0.75) citing futures pricing, and intelligence_analyst most cautious (0.61) citing election-year political uncertainty. Key residual risk: Q2 employment print.",
"specialists": [...],
"successful": 3,
"failed": 0,
"latency_ms": 4200
}Performance
- >Parallel execution: All specialists run simultaneously via `asyncio.gather` — ensemble latency ≈ slowest single specialist, not sum
- >Graceful degradation: Timeout or API error per specialist → `status: "timeout"`, probability falls back to neutral 0.5, excluded from weighted average
- >TOML hot-reload: Manifest changes take effect on next request, no restart needed
Revenue Model
| Tier | Price | Target |
| Developer | $0.50/query | Individuals, experimentation |
| Pro | $299/month (500 queries) | Analysts, researchers |
| Team | $999/month (2,000 queries) | Strategy teams |
| Enterprise | Custom + SLA | Hedge funds, risk consultancies |
The addressable market for institutional forecasting tools is well-established: Bloomberg Terminal (~$6B/yr), Refinitiv (~$6B/yr), specialized political risk firms (Eurasia Group, Oxford Analytica). Augur is not competing with those databases — it's offering calibrated probabilistic synthesis that none of them provide.
Stack
- >Backend: FastAPI + Python, Anthropic Claude API (claude-sonnet-4-6)
- >Execution: asyncio parallel fan-out, per-specialist timeout, synthesis via claude-haiku
- >Config: TOML manifests (17 files, hot-loaded at request time)
- >Deploy: Production at eudaimonia.win
**Rising Sun** · [risingsun.name](https://risingsun.name) · April 2026
*"17 specialists. One calibrated answer."*