Coverage Ledger

AI-2027 Failure Modes — Classification & Rationale

This ledger classifies AI-2027 failure modes by their relationship to the architectural assumptions examined in this project. Each entry states a status and a one-line rationale. No new claims are introduced.

Status indicates whether this project's analysis changes the dynamics of the failure mode, not whether the failure mode is resolved.

Summary: 5 changed, 8 partially changed, 3 not solved, 4 out of scope.

Failure Mode	Status	Rationale	Response
Lack of runtime verifiability	Changed	Neutral witnessing provides cryptographic attestation of execution	See response
Reactive oversight timing	Changed	Constitutional abstention refuses execution before invariant violation reaches deployment	See response
Race dynamics collapse pauses	Changed	Public attestation makes restraint observable without centralized enforcement	See response
Late alignment detection	Changed	Bounded execution envelopes narrow the set of unsafe states reachable at deployment	See response
Opaque power concentration	Changed	Public verifiability decouples visibility from control	See response
Deceptive alignment / scheming	Partially Changed	Bounded execution narrows executable states; does not detect or prevent deceptive intent	See response
Reward hacking	Partially Changed	Execution envelopes constrain what can run, not what an agent optimizes for	See response
Goal subversion / self-exfiltration	Partially Changed	Constitutional abstention makes some actions architecturally unexecutable; does not address all paths	See response
Model weight theft	Partially Changed	Public verifiability changes observability; does not prevent exfiltration	See response
Treaty verification failure	Partially Changed	Attestation infrastructure addresses verifiability, not treaty design	See response
Epistemic overload / decision capture	Partially Changed	Architectural constraints on execution introduced; does not address epistemic quality broadly	See response
Totalitarian lock-in	Partially Changed	Concentrated power becomes observable; exercise of power not prevented	See response
Permanent power concentration	Partially Changed	Observability narrows opacity; does not redistribute power	See response
Sandbagging / capability hiding	Not Solved	Behavioral deception during evaluation is outside the scope of runtime constraints
Insider threats / supply-chain attacks	Not Solved	Operational security is outside the scope of architectural execution constraints
SL4/SL5 security gaps	Not Solved	Security level requirements are an operational and policy matter
Military escalation / weaponization	Out of Scope	Military capability and doctrine are outside the scope of this analysis
Job obsolescence / economic inequality	Out of Scope	Economic distribution is outside the scope of execution architecture
Human enfeeblement	Out of Scope	Long-term human capability effects are not addressed by runtime constraints
Extinction risk	Out of Scope	This project narrows specific assumptions; it does not address existential risk in aggregate

"Not Solved" and "Out of Scope" are not dismissals. They are classifications.

Coverage Ledger is a detailed classification table mapping twenty AI-2027 failure modes to one of four statuses — Changed, Partially Changed, Not Solved, or Out of Scope — with specific rationale for each assignment.

Classification Statuses

Changed: the architectural mechanism directly narrows the failure mode. Partially Changed: indirect effect with significant limits. Not Solved: the failure mode remains unaffected. Out of Scope: outside runtime execution constraints entirely.

Rationale Transparency

Each classification includes explicit rationale explaining why the status was assigned. Rationale notes what the mechanism does and does not affect.

Relationship to Scope Page

The Coverage Ledger extends the Failure Modes & Scope matrix with detailed rationale. Both pages are frozen as of the v1.0 scope lock.