About This Site

How This Documentation Was Created

This documentation was created rapidly—over the course of a few days in early 2026—with substantial assistance from large language models (primarily Claude). This has implications for how you should read it.

Revision History

The initial rapid draft has since been through substantial hardening (June 2026):

A full-corpus adversarial review (nine independent review passes covering every page), followed by a systematic fix pass against its findings.
Citation purge: every flagged fabricated or misattributed citation was removed or corrected; real-history claims (dates, court cases, nuclear/aviation safety figures) were verified against primary sources.
One set of numbers: the propagation math was re-derived independently, the canonical worked example is stated exactly once and referenced everywhere, and all signature figures were made consistent across pages.
Honest labeling: every hypothetical scenario and worked example carries an explicit banner; a ~25,000-word fictional case-study section was removed entirely.
Terminology canon: vocabulary was consolidated against a written canon that records collision checks against adjacent fields and credits prior art (reliability engineering’s common-cause-failure literature, AI Control, security’s least-privilege lineage) rather than renaming it.

The “limitations” below still apply—the framework remains untested in production and most numbers remain illustrative—but the specific failure modes of rapid LLM-assisted drafting (hallucinated citations, internal inconsistencies, unlabeled fiction) have now been hunted deliberately rather than left to chance. Reconciling estimates against reality is what this framework preaches; this page is where we practice it on ourselves.

What This Means

Strengths of LLM-assisted creation:

Rapid exploration of ideas and their implications
Consistent terminology across ~100,000 words
Systematic coverage of related concepts
Quick iteration on structure and framing

Limitations to be aware of:

Less vetting than traditional documentation: Most content has not been extensively reviewed by domain experts
Possible hallucinated details: Some specific claims (numbers, citations, examples) may be inaccurate
Inherited biases: The content reflects patterns in LLM training data, which may include errors or biases
Untested recommendations: The practical recommendations have not been validated in production systems

Epistemic Status

This documentation should be read as exploratory writing—an attempt to systematize ideas about delegation risk, not a vetted technical specification.

The core concepts (Delegation Risk quantification, structural safety through decomposition, cross-domain methods) are adapted from established fields and seem sound in principle. But:

The specific formulas and numbers are illustrative, not validated
The case studies (except Sydney and the documented human-systems histories) are hypothetical scenarios, and are labeled as such
The recommendations reflect our current thinking, which may change
The framework as a whole is untested at scale for AI systems

Why Publish It Anyway?

We believe the ideas are valuable enough to share even in this form because:

The core insight seems important: Safety through architecture, not just behavior
Early discussion is valuable: Getting feedback on the approach while it’s still malleable
Transparency about process: Being clear about how it was made helps readers calibrate
Iteration is possible: This is living documentation that can be improved

How to Help

If you find errors, unclear reasoning, or have suggestions:

Open an issue on GitHub
The documentation is open source under CC BY 4.0

We’re especially interested in:

Factual corrections
Logical inconsistencies
Missing considerations
Real-world examples (positive or negative)
Alternative framings that work better

Acknowledgments

This documentation was written primarily by Ozzie Gooen with substantial assistance from Claude (Anthropic). The framework draws on ideas from many sources, including Eric Drexler’s CAIS, Redwood Research’s AI Control work, and decades of nuclear/financial risk management practice.

The rapid creation was an experiment in using LLMs for technical writing at scale. We’ve tried to be transparent about the process and its limitations.