Architecture Comparator
Compare the risk profiles of different delegation architectures to make informed design decisions.
How It Works
Section titled “How It Works”This tool provides side-by-side comparison of delegation architectures:
- Component Breakdown: See which AI/human components each architecture uses
- Risk Visualization: Compare expected vs. worst-case risk
- Mitigation Impact: See how different safety measures affect overall risk
- Trade-off Analysis: Understand the cost-benefit of each approach
| Architecture | Expected Risk | Worst Case | Components | Mitigations |
|---|---|---|---|---|
| Baseline (Human Only) | $72/mo* | $2,000 | 1 | 2 |
| Simple AI Assist | $79/mo | $2,500 | 2 | 2 |
| Autonomous AI | $151/mo | $4,800 | 3 | 3 |
| Full Automation | $602/mo | $8,000 | 3 | 3 |
Default Architectures Explained
Section titled “Default Architectures Explained”Baseline (Human Only)
Section titled “Baseline (Human Only)”Traditional human-driven process with no AI delegation. Establishes the risk level you’re comparing against.
Characteristics:
- Single point of failure (human error)
- Predictable but expensive
- Limited scalability
- Well-understood failure modes
Simple AI Assist
Section titled “Simple AI Assist”AI provides suggestions and recommendations, but humans make all decisions. Common for high-stakes domains.
Characteristics:
- AI errors caught by human review
- Slower than autonomous but safer
- Good for building trust in AI systems
- Human bottleneck remains
Autonomous AI
Section titled “Autonomous AI”AI handles routine tasks independently; humans handle exceptions. Balances efficiency with safety.
Characteristics:
- Higher throughput for routine work
- Complex failure modes (routing errors)
- Requires robust exception handling
- Multiple points of mitigation
Full Automation
Section titled “Full Automation”End-to-end AI with minimal human intervention. Maximum efficiency, maximum risk.
Characteristics:
- Highest potential damage
- Requires extensive mitigation
- Suitable only for well-understood domains
- Fastest degradation if poorly designed
Interpreting the Comparison
Section titled “Interpreting the Comparison”Expected vs. Worst Case
Section titled “Expected vs. Worst Case”- Expected Risk (dark bar): Average monthly risk given probability distributions
- Worst Case (light bar): Maximum possible damage if everything fails
A wide gap indicates high tail risk.
Component Types
Section titled “Component Types”Risk varies significantly by component type:
| Type | Base Failure Rate | Typical Use |
|---|---|---|
| Deterministic | ~0.1% | Rule-based routing, validation |
| Narrow ML | ~3% | Classification, detection |
| General LLM | ~10% | Generation, reasoning |
| RL Agent | ~20% | Autonomous decision-making |
| Human | ~5% | Review, exception handling |
Mitigation Stacking
Section titled “Mitigation Stacking”Each mitigation reduces risk multiplicatively. With 3 mitigations at 85% effectiveness each:
final_risk = base_risk × 0.85³ = base_risk × 0.61More mitigations help, but with diminishing returns.
When to Choose Each Architecture
Section titled “When to Choose Each Architecture”Choose Human Only When:
Section titled “Choose Human Only When:”- Stakes are extremely high
- Decisions require nuanced judgment
- AI systems aren’t well-calibrated for your domain
- Regulatory requirements demand human oversight
Choose AI Assist When:
Section titled “Choose AI Assist When:”- AI can improve human decision quality
- Speed is important but not critical
- Building organizational trust in AI
- Failure costs are moderate
Choose Autonomous AI When:
Section titled “Choose Autonomous AI When:”- High volume of routine decisions
- Clear criteria for “routine” vs “exception”
- Good monitoring and fallback in place
- Moderate-to-high risk tolerance
Choose Full Automation When:
Section titled “Choose Full Automation When:”- Domain is well-understood with clear boundaries
- Extensive testing and validation completed
- Strong mitigation stack in place
- Benefits significantly outweigh risks
Migration Paths
Section titled “Migration Paths”Human → AI Assist
Section titled “Human → AI Assist”- Start with AI suggestions for low-stakes decisions
- Track AI accuracy vs. human decisions
- Gradually expand scope based on performance
- Maintain human review throughout
AI Assist → Autonomous
Section titled “AI Assist → Autonomous”- Identify truly routine tasks (over 95% predictable)
- Implement robust exception detection
- Add monitoring and alerting
- Pilot with limited scope, then expand
Autonomous → Full Automation
Section titled “Autonomous → Full Automation”- Achieve consistent performance metrics
- Reduce human exception handling rate to less than 5%
- Implement comprehensive mitigation stack
- Establish clear rollback procedures
Customization
Section titled “Customization”The default architectures serve as templates. To analyze your specific situation:
- Use the Risk Calculator to model each architecture
- Use the Sensitivity Dashboard to identify key parameters
- Use the Trust Updater to calibrate component reliability
- Document your analysis in the a decomposition worksheet