Governing AI Actions, Not Agents, Through Institutional Attestation.

Jakob Salfeld-Nebgen· June 26, 2026 View original

Summary

This paper proposes a computational governance model for autonomous AI systems, "Institutional Attestation," which governs consequential actions rather than internal reasoning. It requires independent, cryptographically bound evidence for high-risk actions, evaluated by a deterministic policy and recorded in a tamper-evident log.

As autonomous AI agents begin to undertake critical, irreversible actions like clinical prescribing or software deployment, a new governance model is needed. This paper introduces "Institutional Attestation," a framework inspired by how human institutions govern powerful autonomous actors by focusing on the point of consequential action, rather than attempting to monitor internal reasoning. Under this model, an AI agent maintains full autonomy over its planning and reasoning processes. However, it lacks direct execution authority for designated high-risk actions. Instead, execution is contingent upon specific preconditions, each independently verified and attested by a separate, authoritative source. These attestations are cryptographically linked to the agent's declared intent and evaluated against a deterministic policy. All decisions and attestations are recorded in a tamper-evident log, allowing for independent re-verification. The paper includes a proof-of-concept implementation and illustrates its application in scenarios such as software deployment and medical prescribing, offering a robust, auditable approach to AI governance.

Why it matters

This model offers a practical and auditable framework for governing high-stakes autonomous AI systems, providing a mechanism to ensure accountability and safety without stifling agent autonomy in planning, which is crucial for widespread AI adoption in sensitive domains.

How to implement this in your domain

  1. 1Identify high-risk, irreversible actions performed by autonomous AI systems within your organization.
  2. 2Define clear preconditions for each high-risk action that must be independently attested.
  3. 3Establish authoritative sources responsible for providing independent attestations for these preconditions.
  4. 4Implement cryptographic binding mechanisms to link attestations to the AI agent's declared intent.
  5. 5Develop a tamper-evident logging system to record all decisions, attestations, and policy evaluations for auditability.

Who benefits

HealthcareSoftware DevelopmentFinancial ServicesLegalGovernment

Key takeaways

  • Governing AI by focusing on consequential actions rather than internal reasoning is a viable approach.
  • Institutional Attestation requires independent, cryptographically bound evidence for high-risk actions.
  • The model ensures accountability and auditability through tamper-evident logging.
  • It allows AI agents autonomy in planning while maintaining control over execution of critical actions.

Original post by Jakob Salfeld-Nebgen

"arXiv:2606.26298v1 Announce Type: new Abstract: Autonomous AI agents may begin to perform consequential, irreversible actions such as clinical prescribing and production software deployment. This paper observes that human institutions have governed powerful autonomous actors not…"

View on X

Originally posted by Jakob Salfeld-Nebgen on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses