Architecture Comparator

Compare the risk profiles of different delegation architectures to make informed design decisions.

How It Works

This tool provides side-by-side comparison of delegation architectures:

Component Breakdown: See which AI/human components each architecture uses
Risk Visualization: Compare expected vs. worst-case risk
Mitigation Impact: See how different safety measures affect overall risk
Trade-off Analysis: Understand the cost-benefit of each approach

Architecture Comparison

Compare risk profiles of different delegation architectures. Darker bars show expected monthly risk; lighter bars show worst-case potential.

Baseline (Human Only): Traditional human-driven process with no AI delegation. Expected risk: $72/mo.
Simple AI Assist: AI provides suggestions, human makes all decisions. Expected risk: $79/mo.
Autonomous AI: AI handles routine tasks, human handles exceptions. Expected risk: $151/mo.
Full Automation: End-to-end AI with minimal human intervention. Expected risk: $602/mo.

Trade-off Analysis

Architecture	Expected Risk	Worst Case	Components	Mitigations
Baseline (Human Only)	$72/mo*	$2,000	1	2
Simple AI Assist	$79/mo	$2,500	2	2
Autonomous AI	$151/mo	$4,800	3	3
Full Automation	$602/mo	$8,000	3	3

Component Type Legend

Deterministic (~0.1% base)
Narrow ML (~3.0% base)
General LLM (~10.0% base)
RL Agent (~20.0% base)
Human (~5.0% base)

Default Architectures Explained

Baseline (Human Only)

Traditional human-driven process with no AI delegation. Establishes the risk level you’re comparing against.

Characteristics:

Single point of failure (human error)
Predictable but expensive
Limited scalability
Well-understood failure modes

Simple AI Assist

AI provides suggestions and recommendations, but humans make all decisions. Common for high-stakes domains.

Characteristics:

AI errors caught by human review
Slower than autonomous but safer
Good for building trust in AI systems
Human bottleneck remains

Autonomous AI

AI handles routine tasks independently; humans handle exceptions. Balances efficiency with safety.

Characteristics:

Higher throughput for routine work
Complex failure modes (routing errors)
Requires robust exception handling
Multiple points of mitigation

Full Automation

End-to-end AI with minimal human intervention. Maximum efficiency, maximum risk.

Characteristics:

Highest potential damage
Requires extensive mitigation
Suitable only for well-understood domains
Fastest degradation if poorly designed

Interpreting the Comparison

Expected vs. Worst Case

Expected Risk (dark bar): Probability-weighted average monthly loss across the modeled failure scenarios
Worst Case (light bar): Maximum possible damage if everything fails

A wide gap between the two bars indicates high tail risk: failures are rare but costly when they occur.
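One rough way to quantify the gap is the ratio of worst case to expected risk. The sketch below uses the figures from the Trade-off Analysis table. The `tail_ratio` helper is a hypothetical name, and comparing the worst-case figure directly with one month of expected risk is an assumption rather than something the tool documents.

```python
# Figures from the Trade-off Analysis table: (expected $/mo, worst case $).
ARCHITECTURES = {
    "Baseline (Human Only)": (72, 2000),
    "Simple AI Assist": (79, 2500),
    "Autonomous AI": (151, 4800),
    "Full Automation": (602, 8000),
}


def tail_ratio(expected: float, worst_case: float) -> float:
    """Worst case divided by expected risk (hypothetical helper)."""
    if expected <= 0:
        raise ValueError("expected risk must be positive")
    return worst_case / expected


ratios = {name: tail_ratio(e, w) for name, (e, w) in ARCHITECTURES.items()}
for name, ratio in ratios.items():
    print(f"{name}: worst case is about {ratio:.1f}x expected")
```

By this measure Full Automation has the narrowest relative gap even though both of its absolute figures are the highest, so the ratio is best read alongside the raw dollar amounts.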

Component Types

Risk varies significantly by component type:

Type	Base Failure Rate	Typical Use
Deterministic	~0.1%	Rule-based routing, validation
Narrow ML	~3%	Classification, detection
General LLM	~10%	Generation, reasoning
RL Agent	~20%	Autonomous decision-making
Human	~5%	Review, exception handling
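To see how base rates might combine, the sketch below computes the chance that at least one component in a chain fails. It assumes components fail independently and that any single failure fails the whole chain. Both assumptions, the `chain_failure_probability` name, and the example pipeline are illustrative; the comparator's actual aggregation is not described here.

```python
# Base failure rates from the component type table, as fractions.
BASE_FAILURE_RATE = {
    "deterministic": 0.001,
    "narrow_ml": 0.03,
    "general_llm": 0.10,
    "rl_agent": 0.20,
    "human": 0.05,
}


def chain_failure_probability(components):
    """P(at least one failure) for independent components in series."""
    survive = 1.0
    for kind in components:
        survive *= 1.0 - BASE_FAILURE_RATE[kind]
    return 1.0 - survive


# Hypothetical pipeline: rule-based router -> classifier -> LLM.
pipeline = ["deterministic", "narrow_ml", "general_llm"]
p_fail = chain_failure_probability(pipeline)
print(f"Chain failure probability: {p_fail:.3f}")
```

A series model like this only adds risk with each component. A reviewer that catches upstream errors would lower the combined figure, which this sketch does not capture.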

Mitigation Stacking

Each mitigation reduces risk multiplicatively. With 3 mitigations that each leave 85% of the risk in place (a 15% reduction each):

final_risk = base_risk × 0.85³ ≈ base_risk × 0.61

More mitigations help, but with diminishing returns: each additional mitigation removes a smaller absolute amount of risk than the one before.
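The sketch below reproduces the stacking formula, taking 0.85 as the per-mitigation risk multiplier exactly as it appears in the formula above. The `residual_factor` name is hypothetical, and applying the same multiplier to every mitigation is a simplifying assumption.

```python
def residual_factor(multiplier: float, count: int) -> float:
    """Fraction of base risk left after `count` stacked mitigations."""
    if not 0.0 <= multiplier <= 1.0:
        raise ValueError("multiplier must be between 0 and 1")
    if count < 0:
        raise ValueError("count must be non-negative")
    return multiplier ** count


gains = []      # share of base risk removed by each added mitigation
previous = 1.0
for n in range(1, 6):
    factor = residual_factor(0.85, n)
    gains.append(previous - factor)
    print(f"{n} mitigation(s): {factor:.3f} of base risk remains")
    previous = factor
```

Each entry in `gains` is smaller than the one before it, which is the diminishing-returns effect in numeric form.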

When to Choose Each Architecture

Choose Human Only When:

Stakes are extremely high
Decisions require nuanced judgment
AI systems aren’t well-calibrated for your domain
Regulatory requirements demand human oversight

Choose AI Assist When:

AI can improve human decision quality
Speed is important but not critical
Your organization is still building trust in AI
Failure costs are moderate

Choose Autonomous AI When:

High volume of routine decisions
Clear criteria for “routine” vs “exception”
Good monitoring and fallback in place
Moderate-to-high risk tolerance

Choose Full Automation When:

Domain is well-understood with clear boundaries
Extensive testing and validation completed
Strong mitigation stack in place
Benefits significantly outweigh risks

Migration Paths

Human → AI Assist

Start with AI suggestions for low-stakes decisions
Track AI accuracy vs. human decisions
Gradually expand scope based on performance
Maintain human review throughout

AI Assist → Autonomous

Identify truly routine tasks (over 95% predictable)
Implement robust exception detection
Add monitoring and alerting
Pilot with limited scope, then expand

Autonomous → Full Automation

Achieve consistent performance metrics
Reduce human exception handling rate to less than 5%
Implement comprehensive mitigation stack
Establish clear rollback procedures
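The numeric thresholds in these paths can be expressed as simple gate checks. This is a minimal sketch: the function names and the choice to measure predictability and exception rate as fractions are assumptions. Only the 95% and 5% thresholds come from the steps above.

```python
def ready_for_autonomous(routine_predictability: float) -> bool:
    """AI Assist -> Autonomous: routine tasks are over 95% predictable."""
    return routine_predictability > 0.95


def ready_for_full_automation(exception_rate: float) -> bool:
    """Autonomous -> Full Automation: human exception handling below 5%."""
    return exception_rate < 0.05


# Hypothetical pilot measurements, for illustration only.
print(ready_for_autonomous(0.97))        # True: above the 95% threshold
print(ready_for_full_automation(0.08))   # False: exceptions still above 5%
```

These checks cover only the measurable steps. Monitoring, mitigation coverage, and rollback procedures still need a separate review.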

Customization

The default architectures serve as templates. To analyze your specific situation:

Use the Risk Calculator to model each architecture
Use the Sensitivity Dashboard to identify key parameters
Use the Trust Updater to calibrate component reliability
Document your analysis in a decomposition worksheet