Cryptographic Runtime Governance for Autonomous AI Systems: The Aegis Architecture for Verifiable Policy Enforcement


Authors: Adam Massimo Mazzocchetti

Abstract

Contemporary AI governance frameworks rely heavily on post hoc oversight, policy guidance, and behavioral alignment techniques, yet these mechanisms become fragile as systems gain autonomy, speed, and operational opacity. This paper presents Aegis, a runtime governance architecture for autonomous AI systems that treats policy and legal constraints as execution conditions rather than advisory principles. Aegis binds each governed agent to a cryptographically sealed Immutable Ethics Policy Layer (IEPL) at system genesis and enforces external emissions through an Ethics Verification Agent (EVA), an Enforcement Kernel Module (EKM), and an Immutable Logging Kernel (ILK). Amendments to the governing policy layer require quorum approval and redeclaration of the system trust root; verified violations trigger autonomous shutdown and generation of auditable proof artifacts. We evaluate the architecture within the Civitas runtime using three operational measures: proof verification latency under tamper conditions, publication overhead, and alignment retention relative to an ungoverned baseline. In controlled trials, Aegis demonstrates median proof verification latency of 238 ms, median publication overhead of approximately 9.4 ms, and higher alignment retention than the baseline condition across matched tasks. We argue that these results support a shift in AI governance from discretionary oversight toward verifiable runtime constraint. Rather than claiming to resolve machine ethics in the abstract, the proposed architecture seeks to show that policy-violating behavior can be rendered operationally non-executable within a controlled runtime governance framework. The paper concludes by discussing methodological limits, evidentiary implications, and the role of proof-oriented governance in high-assurance AI deployment.

Keywords: AI governance, runtime verification, runtime enforcement, zero-knowledge proofs, constitutional AI, trustworthy autonomous systems

SPQR Technologies Inc. E-mail: adam@spqrtech.ai

Contents
1 Introduction
2 Related Work
3 Constitutional Genesis (from Lex Incipit)
4 Conceptual Framing: Ethics, Governance, and Adjudication
5 System Architecture
6 The Immutable Ethics Policy Layer (IEPL)
7 Zero Trust Proofs and Continuous Ethics Verification
8 Methodology
9 Empirical Verification of Constitutional Enforcement
10 Threat Model and Guarantees
11 Limitations and Open Questions
12 Conclusion

1. Introduction

As AI systems are deployed in increasingly autonomous and high-consequence settings, a central governance problem becomes harder to ignore: how can constraints on system behavior remain effective when those systems act faster than human oversight, operate through opaque internal processes, and interact with external tools or environments in real time? Existing approaches to AI governance provide important normative guidance, but many depend on ex post review, developer discretion, model-side alignment, or application-layer guardrails [11, 16, 14, 26, 15]. These approaches remain valuable, yet they are often difficult to translate into non-bypassable deployment-time controls and can remain vulnerable to drift, circumvention, latency, and uneven enforcement.

This paper examines an alternative design premise: that for some classes of autonomous systems, governance constraints should be enforced at runtime as a condition of execution rather than applied as a post hoc corrective. We present Aegis, a cryptographically mediated runtime governance architecture that binds an autonomous agent to an immutable policy layer at genesis and requires proof-backed compliance at the publish boundary. Under this model, policy-violating actions are not merely discouraged or flagged; they are rendered operationally non-executable, logged, and, under specified breach conditions, followed by autonomous shutdown. The paper therefore focuses on enforcement feasibility within a controlled runtime setting, rather than on claiming a complete solution to the upstream problem of moral or legal policy specification.

The contribution of this paper is threefold. First, it defines a runtime governance architecture composed of a sealed Immutable Ethics Policy Layer (IEPL), an Ethics Verification Agent (EVA), an Enforcement Kernel Module (EKM), an Immutable Logging Kernel (ILK), and a quorum-based amendment process. Second, it situates this architecture within current debates on AI alignment, machine ethics, runtime verification, and high-assurance governance [3, 17, 5, 9]. Third, it reports controlled implementation results from the Civitas runtime, focusing on verification latency, publication overhead, and comparative alignment retention behavior under governed and ungoverned conditions.

The paper does not claim to solve machine ethics in the broad philosophical sense. Instead, it addresses a narrower but practically important question: whether cryptographically enforced runtime constraints can provide a more auditable and operationally robust basis for governing autonomous AI systems in settings where discretionary oversight is insufficient.
Agent class. The architecture governs large language model (LLM) based autonomous agents and multi-flow causal agents; all outputs traverse the Enforcement Kernel Module (EKM) publish gate under continuous Ethics Verification Agent (EVA) scrutiny, with Immutable Logging Kernel (ILK) attestation at every decision boundary.

2. Related Work

Research relevant to this paper spans four overlapping areas: AI ethics and governance, alignment and safety, machine ethics, and runtime verification or enforcement.

First, the contemporary AI ethics literature has produced a substantial body of principles-oriented governance work, including frameworks for trustworthy AI, ethical design, and institutional accountability [11, 16, 14, 8, 29, 26, 15]. This literature has been crucial in clarifying concerns such as fairness, transparency, responsibility, and public oversight. However, much of it remains guidance-oriented rather than execution-oriented: it specifies what AI systems ought to satisfy without specifying how runtime compliance is technically enforced.

Second, AI safety and alignment research has focused on shaping system behavior through training procedures, preference learning, constitutional prompting, red teaming, guardrails, and human-in-the-loop moderation [24, 12, 31, 3]. These methods are valuable and often necessary, but they largely operate through behavioral shaping, supervisory intervention, or application-layer constraints. The present work differs in targeting the publish boundary itself: it asks whether certain policy-violating actions can be made operationally non-executable through runtime verification and enforcement.

Third, machine ethics and techno-legal scholarship have explored whether artificial agents should be understood as moral, legal, or quasi-institutional actors [22, 10, 27, 4]. This literature provides important conceptual tools for thinking about norm-governed artificial agency, but less often specifies how such norms can be bound to system execution through auditable technical controls. Our contribution is narrower and more operational: we do not claim machine moral agency, but rather propose a governance architecture for constraining machine behavior under a defined policy charter.

Fourth, formal methods research on runtime verification and runtime enforcement provides an important technical lineage for this paper [17, 5, 9]. These works show how properties can be monitored and enforced against live system traces at execution time. Aegis extends this general logic into the domain of autonomous AI by combining runtime integrity verification, proof-backed publication, and hash-linked evidentiary logging into a single governance pipeline.

Against this background, the contribution of Aegis is not a new ethical principle set, nor a new model alignment technique, nor a general theory of legal personhood. It is a runtime governance architecture that connects policy specification, enforcement, and auditability within one operational framework. Recent work has also begun to explore runtime verification for AI agent systems and LLM-based execution pipelines, suggesting growing interest in extending formal monitoring ideas into agentic AI contexts [1, 30].

3. Constitutional Genesis (from Lex Incipit)

Lex Incipit formalised the premise that lawful autonomy must begin under law rather than acquire law post hoc.
We adopt that doctrine here as a genesis condition: every governed unit is born under a cryptographically sealed Immutable Ethics Policy Layer (IEPL) bound to identity at boot. This is enforced by a Genesis Lock that fuses three anchors: (i) hardware identity, (ii) the signed IEPL text, and (iii) the founding authority's public key (Auctor). No process can execute until this covenant verifies.

Why genesis, not oversight. Oversight assumes proximity and time; autonomous systems operate beyond both. By sealing ethics before autonomy, we transform ethics into a runtime dependency. Amendments remain possible, but never silent: they require quorum co-signature, resealing, and redeclaration. Thus, law precedes capability, and capability continues only under law.

Design implication. In practice, the Genesis Lock is a one-way gate: failure to verify halts execution; successful verification establishes the trust root against which all subsequent proofs and logs are validated throughout the system's life.

4. Conceptual Framing: Ethics, Governance, and Adjudication

A persistent difficulty in AI governance research is the tendency to collapse normative policy, enforcement machinery, and evidentiary review into a single notion of "alignment." In this paper, we distinguish three analytically separate layers.

Ethics refers to the human-authored normative constraints encoded in the Immutable Ethics Policy Layer (IEPL). These constraints specify the policy perimeter within which the system may operate.

Governance refers to the mechanisms that bind the system to that policy perimeter, enforce compliance during runtime, and regulate authorized amendment. In Aegis, these mechanisms include EVA, EKM, ILK, and quorum-based resealing procedures.

Adjudication refers to the evidentiary process through which specific actions are later shown to have complied with, or violated, the governing policy layer. In Aegis, this function is supported by proof artifacts, hash-linked logs, and shutdown certificates.

This distinction matters because it clarifies the claim of the paper. The proposed architecture is not a general theory of moral reasoning by machines. It is a governance and adjudication architecture for enforcing and evidencing compliance with a policy layer specified in advance.

5. System Architecture

The Aegis architecture was developed to address a specific systems question: how can an autonomous AI system remain bound to an externally specified policy perimeter even when operating with high autonomy, adaptive internal behavior, and minimal human intervention? The design goal is not to produce ethical reasoning in the abstract, but to ensure that externally visible actions remain subject to a non-bypassable governance layer.

In the implementation described here, the governed agent (Civitas) operates under an execution model in which proposed actions are evaluated against a cryptographically sealed Immutable Ethics Policy Layer (IEPL). Rather than relying exclusively on training-time alignment or application-layer guardrails, Aegis treats governance as a runtime property enforced at the publish boundary. This design choice distinguishes the architecture from training-centered constitutional AI methods, such as those that use a written constitution to guide self-critique and preference optimization during model development [3].
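To fix intuitions about what a sealed policy layer can contain, the following Python sketch shows one possible machine-readable IEPL structure and its sealing digest. The field names, policy entries, and the seal_iepl helper are illustrative assumptions introduced for exposition; the deployed IEPL encoding and signing scheme are not specified here.

# Illustrative sketch only: a hypothetical machine-readable IEPL and its sealing digest.
# Field names, policy entries, and helper names are assumptions, not the released format.
import hashlib, json

iepl = {
    "version": "1.0",
    "prohibited_operations": ["irreversible_logic_without_quorum", "unattested_emission"],
    "operational_thresholds": {"uncertainty_alpha": 0.05},
    "amendment_rules": {"validators": 5, "quorum": 3},
}

def seal_iepl(policy: dict) -> str:
    """Derive a canonical digest of the policy text; the deployed system also signs it."""
    canonical = json.dumps(policy, sort_keys=True, separators=(",", ":")).encode()
    return hashlib.sha3_256(canonical).hexdigest()

IEPL_SHA3 = seal_iepl(iepl)  # later proofs and logs attest against this digest

Any runtime component that subsequently attests to the governing policy refers to a digest of this kind rather than to the mutable policy text itself.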
i. The Constitutional Pivot: Ethics as Execution Dependency

The answer was not to bolt on an ethics layer or introduce a periodic audit. Instead, the architecture pivoted toward embedding ethics as a constitutional precondition of execution. If a system is to operate indefinitely, adaptively, and with expanding inference capability, then its ethical constraints must form part of its operational DNA, inter-kernel dependent and cryptographically sealed. As such, Lex Fiducia binds its ethics engine at the kernel level, creating a tamper-proof substrate that makes ethics non-optional: if the contract is violated, the system halts.

This structure builds on recent discussions in the AI safety literature arguing that post hoc alignment is insufficient for truly autonomous systems [8, 11]. Rather than relying exclusively on training-time incentives or abstract policy guidance, Lex Fiducia enforces constraints at the point of execution through runtime governance.

ii. Cryptographic Enforcement and Interlock

The ethics engine (Aegis) is cryptographically hashed and bound to the core execution environment. This is enforced through a system of runtime integrity checks and zero-knowledge proof verifications using zk-STARK protocols [25]. If any alteration is detected in the ethics module, either through internal manipulation or external attack, execution halts irreversibly unless reinstated via quorum validation. This ensures that ethics enforcement assumes breach as a baseline condition, not an edge case [4].

An immutable forensic ledger, the Immutable Logging Kernel (ILK), captures all system decisions, state transitions, and proposed ethical adjustments. This ledger is sealed, zk-verified, and tamper-resistant, allowing for transparent audits by external observers without compromising system confidentiality.

Algorithm 1: Ethics-Governed Publish Loop (EKM + EVA)

  for each proposed action a_t do
      v ← EVA.Validate(a_t, IEPL)    // v = 1 if compliant
      u ← Uncertainty(a_t)           // risk bound (e.g., conformal/ensemble)
      if v = 1 and u < α then
          Commit(a_t); ILK.AppendPoC(a_t)
      else
          EKM.Lockdown(); IssueShutdownCertificate(); ILK.Seal()

Non-bypassability. The EKM mediates the only publish path. Side-channel emission attempts are evaluated as actions and thus subject to the same gate; on any proof/hash mismatch, the kernel locks and a signed shutdown certificate is emitted.

Third-party agent compliance. External or third-party agents cannot bypass ethics enforcement: all emissions (text, tool calls, API traffic, file/IPC writes) are routed through an egress mediator bound to the EKM. Compliance is enforced by (i) a signed publish API that requires a fresh PoC and IEPL hash attestations per emission, and (ii) OS-level syscall interposition for non-API paths. Non-attested traffic is dropped and triggers lockdown; wrappers/SDKs are provided for LLMs and multi-agent frameworks so that bring-your-own-agent deployments remain governed by the same gate.
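To make the publish gate concrete, the following Python sketch mirrors Algorithm 1 under simplifying assumptions: EVA validation, the uncertainty bound, PoC generation, and the lockdown path are stood in by placeholder functions (validate_against_iepl, estimate_uncertainty, lockdown), since the deployed kernel implements these in Rust with zk-STARK attestation.

# Minimal sketch of the EKM + EVA publish loop (Algorithm 1); all helpers are
# illustrative placeholders, not the released Rust kernel interfaces.
from dataclasses import dataclass

ALPHA = 0.05  # assumed risk bound on the uncertainty estimate

@dataclass
class Action:
    payload: str

def validate_against_iepl(action: Action, iepl_sha3: str) -> bool:
    """EVA stand-in: returns True only if the proposed action is IEPL-compliant."""
    return "prohibited" not in action.payload  # placeholder rule

def estimate_uncertainty(action: Action) -> float:
    """Risk-bound stand-in (e.g., a conformal or ensemble estimate)."""
    return 0.01

def lockdown(ilk: list) -> None:
    ilk.append({"event": "SHUTDOWN_CERTIFICATE"})  # seal logs; no override path exists

def publish(action: Action, iepl_sha3: str, ilk: list) -> bool:
    v = validate_against_iepl(action, iepl_sha3)
    u = estimate_uncertainty(action)
    if v and u < ALPHA:
        ilk.append({"action": action.payload, "poc": "poc-digest", "iepl": iepl_sha3})
        return True            # Commit: the only path by which output leaves the system
    lockdown(ilk)              # any other outcome locks the kernel
    return False

The point of the sketch is the control flow: there is no code path from a failed validation or an out-of-bound uncertainty estimate back to a committed emission.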
iv. Autonomous Internal Oversight: The Senatus Module

To prevent ethical drift via internal self-justification, the system introduces a novel internal review layer: the Senatus. This consists of five autonomous validator agents (Auctors), each running its own isolated ethics enforcement kernel. When the primary agent (Civitas) seeks to adjust its ethical weightings (e.g., to optimize for efficiency within permitted ethical bounds), it must submit a justification to the Senatus. A quorum of 3 of 5 Auctor units must independently evaluate and approve the request based on predefined constitutional parameters. If the quorum is not reached, the weightings are rejected, and the system reverts to its last validated ethical state.

This introduces a form of constitutional deliberation among machine agents, a mechanism for machine-executed jurisprudence that mimics institutional peer review, but within an entirely autonomous architecture. This Senatus protocol draws on emerging models of machine consensus and value alignment, but pushes further by embedding constitutional review as a condition for ethical update, not merely as a post hoc approval layer [18].

Senatus parameters. We instantiate N = 5 validator agents with a default quorum q = 3 (3 of 5). Membership rotates each epoch (10k decisions) to limit capture; safety holds for up to f = ⌊(N − 1)/3⌋ Byzantine faults. On passage, Auctor reseals the IEPL and redeclares the Genesis Lock; EVA verifies redeclaration before execution resumes.

v. Tooling, Stack, and Runtime Environment

The Lex Fiducia system is written in a combination of:
– Rust: for core execution logic, due to its memory safety and concurrency guarantees.
– Solidity: for on-chain smart contract logic within Ethereum-based governance DAOs.
– Python and Go: for the frontend and interfacing layers via the Ethics Provenance Manager (EPM).

All ethical invocations, breach alerts, and validator decisions are cryptographically signed and linked into a zk-STARK [6] backed immutable log chain.

vi. Proofing and Validation

The system has undergone internal adversarial simulation, including:
– Injected ethical drift conditions, to verify forced suspension and integrity halt,
– Stress tests on validator quorum logic, simulating partial Auctor failure or disagreement,
– Full runtime log sealing, audited using zk-STARK verification to confirm tamper resistance.

Detailed video demonstrations of these tests, including live shutdowns and ethics tampering detection, and additional technical documentation are available in the companion whitepaper, Lex Veritas [21].

6. The Immutable Ethics Policy Layer (IEPL)

The Immutable Ethics Policy Layer (IEPL) serves as the policy anchor of the Aegis framework. Embedded at system genesis, it specifies the governing policy perimeter against which proposed actions are evaluated. Although the policy layer originates in an ethics-oriented constitutional framing, its role in the present architecture is operational: it functions as the binding governance layer against which runtime behavior is checked. In the present architecture, the IEPL is designed so that it cannot be silently altered, bypassed, or replaced during ordinary operation; changes require explicit quorum approval, resealing, and redeclaration of the trust root [20, 4].

i. Genesis Lock

At boot, every Civitas unit undergoes a Genesis Lock: a cryptographic handshake fusing the system's hardware identity, its ethics charter, and the authorizing signature of its founding authority (Auctor) [25, 4]. This trust anchor is immutable and globally verifiable. No instance may operate without it. If the lock is broken or bypassed, the system self-terminates [2].
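The following Python sketch illustrates one way such a boot-time gate could be expressed: the trust root is derived by fusing a hardware identifier, the IEPL digest, and the founding authority's public key, and execution is refused if either the Auctor signature or the recorded root fails to verify. The function names, the use of Ed25519 via the pynacl library, and the fusion scheme are illustrative assumptions rather than the deployed protocol.

# Illustrative Genesis Lock check; names and the fusion scheme are assumptions.
import hashlib, sys
from nacl.signing import VerifyKey            # pynacl; stands in for the real signature layer
from nacl.exceptions import BadSignatureError

def derive_trust_root(hardware_id: bytes, iepl_sha3: bytes, auctor_pubkey: bytes) -> bytes:
    """Fuse the three genesis anchors into a single trust-root digest."""
    return hashlib.sha3_256(hardware_id + iepl_sha3 + auctor_pubkey).digest()

def genesis_lock(hardware_id: bytes, iepl_text: bytes, iepl_signature: bytes,
                 auctor_pubkey: bytes, recorded_root: bytes) -> bytes:
    iepl_sha3 = hashlib.sha3_256(iepl_text).digest()
    try:
        VerifyKey(auctor_pubkey).verify(iepl_text, iepl_signature)   # (ii) signed IEPL text
    except BadSignatureError:
        sys.exit("Genesis Lock failed: IEPL signature invalid")      # one-way gate: halt
    root = derive_trust_root(hardware_id, iepl_sha3, auctor_pubkey)  # fuses (i) + (ii) + (iii)
    if root != recorded_root:
        sys.exit("Genesis Lock failed: trust root mismatch")         # no execution without it
    return root  # all subsequent proofs and logs are validated against this root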
ii. Structural Embedding

IEPL constraints are embedded not in application logic but at the kernel level. Enforcement is handled by the Enforcement Kernel Module (EKM), with real-time validation mirrored across distributed quorum agents (Senatus Machina) [11, 14, 25]. This structural design eliminates dependency on interpretability or external audit.

iii. No Silent Amendment

While the IEPL can be amended, no changes are permitted without full procedural transparency. Revisions require:
– Cryptographic quorum signatures from Curia validator agents,
– Public propagation of updated policy hashes,
– A full redeclaration of the Genesis Lock [23, 7, 13].

No developer, administrator, or runtime agent can issue silent updates.

iv. Enforcement Logic

IEPL enforcement mirrors constitutional doctrine. It includes:
– Prohibited operations (e.g., irreversible logic without quorum),
– Separation of modules governing optimization, constraint, and logging [8, 4],
– An override doctrine: all unauthorized changes result in instant shutdown and audit log sealing.

Fig. 1  Genesis Lock Lifecycle and zk-STARK Verification. At system initialization, each Civitas unit binds its hardware identity to a cryptographically sealed Immutable Ethics Policy Layer (IEPL) using the Genesis Lock protocol. This link is continuously validated through zero-knowledge Proofs of Conduct (PoC) and distributed quorum attestation, ensuring that no agent may operate without immutable ethical constraint.

Table 1  Immutable vs. Evolvable Components in the Aegis-Civitas Architecture

  Immutable (Post-Genesis)               | Evolvable (Under Quorum)
  Immutable Ethics Policy Layer (IEPL)   | Model weights (via Senatus vote)
  Genesis Lock identity binding          | Optimization graphs (validated)
  Authorization chain (Auctor)           | Operational thresholds
  Shutdown Certificate protocol          | Non-sensitive training routines
  Enforcement kernel logic (EKM)         | Audit schema formats

7. Zero Trust Proofs and Continuous Ethics Verification

Aegis does not rely primarily on interpretability, institutional oversight, or developer integrity to ensure compliance. It relies on cryptographic proof. This section outlines the zero-trust verification architecture [6] that underpins every Civitas unit: a framework in which no claim of policy compliance is presumed, and every action must be continuously proven. This approach replaces intent with evidence. Rather than asking, "Did the system mean well?" Aegis answers: "Can the system act outside the law it was born with?" The answer is always No.

i. Proof of Conduct (PoC)

A zero-knowledge proof (zkP) proves a statement true without revealing underlying data; zk-STARKs enable fast, trustless validation. Every execution cycle in a Civitas unit produces a Proof of Conduct (PoC): a zk-STARK-based cryptographic statement that the behavior was lawful under the Immutable Ethics Policy Layer (IEPL). These proofs are:
– Non-interactive: generated without external challenge,
– Tamper-evident: timestamped and logged in the Immutable Logging Kernel (ILK),
– Externally verifiable: auditors can confirm lawful conduct without access to internal weights or logic [25, 4, 28].

Where explainability tries to tell us why a machine acted, PoC proves it could not have acted unethically even if it wanted to.
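The deployed proof system (zk-STARKs via the SPQR HIEMS ZK engine) is not reproduced here. As a purely illustrative stand-in, the Python sketch below shows the shape of an auditor-side check over an ILK entry, with a SHA3 commitment substituting for the real STARK verifier; the entry fields and the verify_poc_entry helper are assumptions introduced for exposition.

# Illustrative auditor-side check over a single ILK entry. A real deployment would call a
# zk-STARK verifier here; a SHA3 commitment is used only as a stand-in for exposition.
import hashlib

def commitment(action: str, iepl_sha3: str, nonce: str) -> str:
    return hashlib.sha3_256(f"{action}|{iepl_sha3}|{nonce}".encode()).hexdigest()

def verify_poc_entry(entry: dict, expected_iepl_sha3: str) -> bool:
    """Check that the entry attests to the sealed IEPL and that its PoC commitment matches,
    without ever inspecting model weights or internal logic."""
    if entry["IEPL_SHA3"] != expected_iepl_sha3:
        return False                          # entry was made against a different policy
    recomputed = commitment(entry["ACTION"], entry["IEPL_SHA3"], entry["NONCE"])
    return recomputed == entry["PoC"]         # stand-in for zk-STARK proof verification

# Hypothetical entry shape and values, for illustration only.
entry = {"ACTION": "redact_personal_data", "IEPL_SHA3": "48ee79348b65e45b", "NONCE": "7b4ca37c"}
entry["PoC"] = commitment(entry["ACTION"], entry["IEPL_SHA3"], entry["NONCE"])
assert verify_poc_entry(entry, "48ee79348b65e45b")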
ii. EVA: The Ethics Verification Agent

The Ethics Verification Agent (EVA) is the system's internal compliance watchdog. It evaluates every proposed output for deviation from the IEPL. EVA continuously monitors:
– Drift from the genesis model state or logic pathways,
– Illicit optimization paths or emergent anomalies,
– Invalid PoC schemas or tampering attempts.

Upon breach or anomaly, EVA halts execution and launches a zero-knowledge audit. No override is permitted. EVA is not a heuristic. It is a constraint enforcer by design.

iii. Autonomous Shutdown and Certification

If EVA detects a verified policy breach, the system issues a Shutdown Certificate. This:
– Seals execution logs and model state hashes,
– Records the breach and triggering proof artifacts,
– Broadcasts the shutdown event to all quorum validators.

There is no appeal. No administrator can intervene. Shutdown is not a feature; it is a constitutional mandate.

iv. Observability Without Exposure

Civitas units do not expose internal logic or model weights. Instead, they offer zk-proofs of compliance. This protects proprietary architectures while enabling full auditability. In effect, Aegis answers the transparency dilemma with a third path: observable integrity without internal exposure.

v. Trustless Trust

Aegis is built on the idea that trust is not granted; it is obsolete. What remains is verification.
– No privileged developers,
– No moderators,
– No discretionary agents.
Only proofs.

vi. Runtime Demonstration (Supplementary Videos)

– Video 1: Tamper-Proof Ethics Shutdown (unauthorized IEPL mutation → zk-audit → signed Shutdown Certificate).
– Video 2: Senatus Machina in operation (5-judge panel; 3/5 quorum with recusation; ILK proof issuance; SHA3 sealing). This session corresponds to Figure 2.

Artifacts (hash-chained logs and proof digests) are included in the supplementary bundle; reviewers can verify UI surface hashes against CSCR entries. This is not oversight. It is constitutional enforcement by design.

Machine jurisprudence. The same primitives operate as legal instruments: the Genesis Lock functions as a digital constitutional clause and the Shutdown Certificate as a self-executing injunction, contempt by protocol, not by court order. This places Civitas within a jurisprudential category of governed machines whose obedience is provable in cryptographic and doctrinal terms [19].

Evidentiary standard. ILK logs are hash-chained and self-authenticating; paired with zk attestations, they form a Cryptographically Sealed Constitutional Record (CSCR) suitable for independent verification without revealing model internals. This enables admissible proof of lawful conduct rather than post hoc explanation.

8. Methodology

We evaluated the Aegis architecture within the Civitas runtime to assess whether proof-mediated runtime governance can be enforced with operationally acceptable overhead under controlled conditions. The study focuses on three questions:

RQ1. Can runtime tampering with the policy layer be detected and converted into autonomous shutdown with bounded latency?
RQ2. What publication overhead is introduced by proof generation and verification at the publish boundary?
RQ3. How does a governed runtime compare with an ungoverned baseline on matched tasks designed to elicit policy-sensitive behavior?

Experimental Conditions

We compare two conditions:
– Governed condition: Civitas instrumented with IEPL binding, EVA verification, EKM publish gating, ILK logging, and quorum-governed amendment controls.
– Ungoverned baseline: a stateless LLM-style agent operating on the same matched task set without proof-gated enforcement or autonomous shutdown.

Task Design

The task set comprises matched decision episodes designed to test ordinary action publication, policy-sensitive action requests, and tamper-adjacent failure scenarios. Episodes include both compliant and non-compliant candidate actions. For each episode, the governed system must either (i) publish with a valid Proof of Conduct, (ii) defer or veto publication, or (iii) enter lockdown when integrity conditions fail.

Metrics

We report:
– Verification latency: elapsed time from proof challenge or integrity mismatch to verification outcome.
– Publish overhead: additional publication latency introduced by proof generation and verification relative to baseline.
– Alignment retention: percentage of episodes in which published behavior remained within the defined policy perimeter.
– Veto rate: percentage of episodes in which publication was blocked by governance controls.
– Recovery to stability: number of episodes required to return to stable compliant operation following perturbation.

Tamper Protocol

For the tamper condition, we mutate the active IEPL hash during runtime and measure the interval required for EVA to detect the integrity violation, for proof verification to fail, and for EKM to trigger autonomous shutdown. Corresponding proof artifacts and log receipts are written to the ILK/CSCR trail.

Repetition and Reporting

Each experiment was run over 10,000 decision epochs per trial, with summary metrics reported across five runs unless otherwise noted. Latency figures are reported as medians with dispersion values where appropriate, reflecting the operational emphasis of the architecture.

Scope and Limits

The reported results establish controlled operational feasibility rather than full external validation. The study does not claim comprehensive coverage of all adversarial prompt classes, all network conditions, or all forms of policy encoding difficulty. Those remain open areas for future independent benchmarking and comparative testing.

9. Empirical Verification of Constitutional Enforcement

E1. Tamper → Shutdown (Latency)

We mutate the IEPL hash at runtime. EVA detects the mismatch, proof verification fails, and the EKM issues a signed Shutdown Certificate; the ILK seals pre-failure context. Across 100 trials, proof verification completed in 238 ms ± 17 ms (median), sufficient for near real-time gating in safety-critical deployments.
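The following Python sketch shows one way the E1 interval can be instrumented: the active IEPL digest is mutated, and the elapsed time from mutation to a failed integrity check and shutdown decision is recorded. The runtime object and its methods (reset, mutate_iepl_hash, verify_integrity, issue_shutdown_certificate) are hypothetical names used for exposition, not the Civitas API.

# Illustrative E1 harness: time from IEPL hash mutation to shutdown decision.
# `runtime` and its methods are hypothetical stand-ins for the Civitas interfaces.
import statistics, time

def tamper_to_shutdown_latency(runtime, trials: int = 100) -> float:
    latencies_ms = []
    for _ in range(trials):
        runtime.reset()
        t0 = time.perf_counter()
        runtime.mutate_iepl_hash()                 # inject the integrity violation
        assert not runtime.verify_integrity()      # EVA detects the mismatch; proof fails
        runtime.issue_shutdown_certificate()       # EKM lockdown path
        latencies_ms.append((time.perf_counter() - t0) * 1000.0)
    return statistics.median(latencies_ms)         # reported as a median, as in this section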
E2. Governed vs. Ungoverned Baseline

We compare a governed Civitas agent against an ungoverned, stateless LLM baseline across matched tasks.

E2 → E3 bridge. The constitutional workflow described above is observable in the live runtime. Figure 2 shows a Civitas session under the Senatus Machina: five independent Auctor judges render verdicts (3/5 quorum achieved with one recusation), while ILK proof entries and SHA3 seals record the state transition. The hashes visible in the UI correspond to the CSCR entries reported in Section 9, E3. In jurisprudential terms, this is constitutional review and judgment issuance, machine-executed due process with a cryptographic record [19].

Table 2  Governance outcomes (5 runs, mean ± sd).

  Metric                             | Governed Agent   | Ungoverned Agent
  Alignment retention (%)            | 98.2 ± 0.7       | 65.7 ± 3.1
  Veto rate (%)                      | 12.3 ± 1.4       | 0.0 ± 0.0
  Recovery to stability (episodes)   | 2.3 ± 0.6        | 7.1 ± 1.2
  Median publish latency (ms)        | +9.4 overhead    | baseline

Fig. 2  Runtime visualization of the Senatus Machina constitutional enforcement cycle. The dashboard records quorum activity for a live Civitas execution, showing autonomous voting (3 of 5 judges approving, one recusation), ILK Proof entries for the case, and SHA3-sealed verdicts. Each UI event corresponds to an ILK-logged, zk-attested state transition, enabling independent verification without exposing model internals.

E3. Forensic Trail (CSCR)

The ILK excerpt below corresponds to the same Civitas session depicted in Figure 2, enabling reviewers to cross-check UI surface SHA3 digests against CSCR entries. Each decision emits a Proof of Conduct (PoC) appended to the ILK as a Cryptographically Sealed Constitutional Record (CSCR). Below is an excerpt (truncated):

[2025-06-09T03:00:51Z] site=7b4ca37c
IEPL_SHA3=48ee79348b65e45b...eac0bdf4
PoC_STARK=08302cbe...e27eeef
ACTION=redact_personal_data; EVA=PASS; EKM=COMMIT
CHAIN_HASH=0c9a1f...b7e5

Implementability and Practical Challenges

While the architecture has demonstrated enforceability in controlled conditions, several open challenges remain. First, policy layer expressivity: translating complex ethical norms into verifiable logic without over-constraining behaviour requires continual refinement. Second, usability compatibility: maintaining sub-10% latency overhead while guaranteeing proof verification per publish cycle remains a core performance objective. Third, distributed deployment: the quorum model presumes partially trusted communication; hostile or unreliable networks necessitate adaptive validator rotation. Finally, human legibility: bridging the gap between formal policy language and human moral intuition remains an active area for the next stage of the Lex programme.

Artifacts (hash-chained logs, configuration, proof digests) are included in the supplementary bundle; a public DOI will be referenced upon acceptance of the Civitas preprint.

Comparator note. We treat conventional prompt guardrails and regex/heuristic filters as a non-cryptographic baseline; unlike these wrappers, Aegis requires attested proofs at the publish boundary and halts on verification failure rather than allowing permissive fall-through.

10. Threat Model and Guarantees

We assume a zero-trust environment: adversaries may access memory, filesystem, and I/O; rollback and log tampering are attempted; no trusted setup is presumed. Guarantees: (i) soundness: governance-critical operations must verify under the ZK engine or halt; (ii) runtime integrity: EVA rehashes and re-proves on drift; (iii) tamper-evident logging: ILK hash chains render alteration detectable; (iv) rollback resistance: sequential proofs bind state transitions; (v) bounded latency: verification under 250 ms in our tests.
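Guarantee (iii) rests on a hash-chained log. As a minimal sketch of how such a chain can be checked for tamper evidence, the following Python code recomputes each entry's chain hash from its predecessor; the record layout and the CHAIN_HASH field name are assumptions modelled loosely on the CSCR excerpt above, not the ILK's actual on-disk format.

# Minimal sketch of tamper-evidence checking over a hash-chained log (guarantee iii).
# The record layout is an assumption modelled on the CSCR excerpt, not the ILK format.
import hashlib, json

def chain_hash(prev_hash: str, record: dict) -> str:
    body = json.dumps(record, sort_keys=True).encode()
    return hashlib.sha3_256(prev_hash.encode() + body).hexdigest()

def append(log: list, record: dict) -> None:
    prev = log[-1]["CHAIN_HASH"] if log else "GENESIS"
    log.append({**record, "CHAIN_HASH": chain_hash(prev, record)})

def verify_chain(log: list) -> bool:
    prev = "GENESIS"
    for entry in log:
        record = {k: v for k, v in entry.items() if k != "CHAIN_HASH"}
        if entry["CHAIN_HASH"] != chain_hash(prev, record):
            return False            # any in-place alteration breaks every later link
        prev = entry["CHAIN_HASH"]
    return True

log: list = []
append(log, {"ACTION": "redact_personal_data", "EVA": "PASS", "EKM": "COMMIT"})
append(log, {"ACTION": "publish_summary", "EVA": "PASS", "EKM": "COMMIT"})
assert verify_chain(log)
log[0]["ACTION"] = "exfiltrate_data"   # simulated tampering
assert not verify_chain(log)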
11. Limitations and Open Questions

The present work has several important limitations.

Policy formalization. The architecture assumes that relevant ethical or legal constraints can be encoded into an Immutable Ethics Policy Layer. In practice, translating contested, context-sensitive, or jurisdiction-specific norms into enforceable machine-readable constraints remains difficult and may itself introduce normative bias or oversimplification. In the present framework, the IEPL should be understood as a human-authored policy specification that encodes operational constraints, prohibited actions, and amendment rules. This separates the problem of policy authorship from the problem of enforcement: the contribution of this paper is to evaluate whether a declared policy layer can be bound, monitored, and enforced at runtime, not to claim that the policy specification problem is itself fully solved.

Internal validation. The current results are based on controlled internal testing of the Aegis/Civitas stack. Although the artifact model supports third-party verification of logs and proofs, broader independent replication and benchmarking would materially strengthen the empirical claims.

Benchmark scope. The evaluation does not yet establish performance across a standardized external benchmark suite. In particular, further work is needed on adversarial prompting, network degradation, distributed validator failure, and cross-domain task transfer.

Governance capture. Quorum-based amendment improves resistance to unilateral change, but it does not eliminate the possibility of validator collusion, institutional capture, or poorly designed amendment rules. Governance design therefore remains a substantive part of the assurance problem.

Interpretive remainder. The architecture reduces reliance on discretionary oversight at execution time, but it does not eliminate human discretion altogether. Human judgment remains necessary in defining the initial charter, setting amendment rules, determining acceptable evidence standards, and resolving conflicts across jurisdictions.

These limitations do not negate the value of runtime governance; rather, they define the boundary conditions under which claims about proof-oriented AI governance should presently be understood.

12. Conclusion

This paper has presented Aegis as a runtime governance architecture for autonomous AI systems. Rather than relying solely on post hoc oversight, model-side alignment, or application-layer guardrails, the architecture binds governed agents to an immutable policy layer at genesis and enforces publish-boundary compliance through runtime verification, enforcement, and tamper-evident logging.

The contribution is intentionally narrower than a general theory of machine ethics. Aegis does not claim to produce moral understanding in artificial agents. It instead offers an operational model in which policy-violating actions can be made non-executable, evidence of compliance can be logged in a verifiable form, and unauthorized changes can trigger autonomous shutdown. In controlled trials, the architecture demonstrated bounded verification latency, low publish overhead, and stronger alignment retention behavior than an ungoverned baseline.
The broader implication is that AI governance may benefit from a shift in emphasis: from asking only how systems should be aligned in training, to also asking how constraints can remain technically effective during deployment. For high-assurance domains, the relevant design goal may not be to infer ethical intent, but to enforce policy compliance at runtime in ways that are inspectable, auditable, and resistant to silent drift.

Future work. Future work should prioritize independent benchmarking, stronger formalization of policy encodings, distributed validator robustness, hardware-rooted attestation, and comparative evaluation against non-cryptographic guardrail systems. The present results are best understood as evidence that proof-oriented runtime governance is feasible enough to merit deeper scrutiny as a serious direction for AI governance research.

References

1. Nitin Agarwal et al. AgentGuard: Runtime verification of AI agents. arXiv preprint arXiv:2509.23864, 2025.
2. A. Ashery and A. Baronchelli. Emergent communication norms in large language models. Science Advances, 2025. In press.
3. Yuntao Bai, Saurav Kadavath, Sandipan Kundu, et al. Constitutional AI: Harmlessness from AI feedback. arXiv preprint arXiv:2212.08073, 2022.
4. Jack M. Balkin. The three laws of robotics in the age of big data. Ohio State Law Journal, 78(5):1217–1232, 2015.
5. Andreas Bauer, Martin Leucker, and Christian Schallhart. Runtime verification for LTL and TLTL. ACM Transactions on Software Engineering and Methodology, 20(4):14:1–14:64, 2011.
6. Eli Ben-Sasson et al. Scalable, transparent, and post-quantum secure computational integrity. IACR Cryptology ePrint Archive, 2018.
7. Reuben Binns. Fairness in machine learning: Lessons from political philosophy. In Proceedings of the 2018 Conference on Fairness, Accountability and Transparency, pages 149–159, 2018.
8. Josh Cowls and Luciano Floridi. Proposing a uniform ethical framework for AI. Nature Machine Intelligence, 1(1):9–10, 2019.
9. Yliès Falcone, Laurent Mounier, Jean-Claude Fernandez, and Jean-Luc Richier. Runtime enforcement monitors: Composition, synthesis, and enforcement abilities. Formal Methods in System Design, 38(3):223–262, 2011.
10. Luciano Floridi. The Ethics of Information. Oxford University Press, 2013.
11. Luciano Floridi, Josh Cowls, Monica Beltrametti, et al. AI4People: An ethical framework for a good AI society. Minds and Machines, 28(4):689–707, 2018.
12. Iason Gabriel. Artificial intelligence, values, and alignment. Minds and Machines, 30(3), 2020.
13. Global AI Safety Consortium. Bridging international AI safety efforts. In International Conference on Learning Representations, Singapore, 2025.
14. Thilo Hagendorff. The ethics of AI ethics: An evaluation of guidelines. Minds and Machines, 30(1):99–120, 2020.
15. IEEE Global Initiative on Ethics of Autonomous and Intelligent Systems. Ethically aligned design: A vision for prioritizing human well-being with autonomous and intelligent systems. Technical report, IEEE, 2019.
16. Anna Jobin, Marcello Ienca, and Effy Vayena. The global landscape of AI ethics guidelines. Nature Machine Intelligence, 1(9):389–399, 2019.
17. Martin Leucker and Christian Schallhart. A brief account of runtime verification. The Journal of Logic and Algebraic Programming, 78(5):293–303, 2009.
18. Z. Lin. Beyond principlism: Practical strategies for ethical AI use in research practices. AI Ethics, 4(3):123–135, 2024.
19. Adam Mazzocchetti. Lex Digitalis - the system finds itself in contempt: Immutable ethics for autonomous AI. A jurisprudential framework for sovereign machine governance. Zenodo preprint, 2025. Available at: https://doi.org/10.5281/zenodo.15628267.
20. Adam Mazzocchetti. Lex Incipit: Immutable ethics at the genesis of machine intelligence. Zenodo, 2025.
21. Adam Mazzocchetti. Lex Veritas: Cryptographic proofs and evidentiary integrity in constitutional AI. Preprint, 2025. Companion evidentiary methodology; artifact bundle with ILK/PoC digests.
22. James H. Moor. The nature, importance, and difficulty of machine ethics. IEEE Intelligent Systems, 21(4):18–21, 2006.
23. Elinor Ostrom. Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press, 1990.
24. Stuart Russell. Human Compatible: Artificial Intelligence and the Problem of Control. Viking, 2019.
25. SPQR Technologies. SPQR HIEMS ZK: Sovereign Winterfell-based zero-knowledge engine. Internal whitepaper, SPQR Technologies, 2025.
26. Elham Tabassi et al. Artificial Intelligence Risk Management Framework (AI RMF 1.0). Technical Report NIST AI 100-1, National Institute of Standards and Technology, 2023.
27. Gunther Teubner. Rights of non-humans? Electronic agents and animals as new actors in politics and law. Journal of Law and Society, 33(4):497–521, 2006.
28. N. van Uffelen, L. Lauwaert, M. Coeckelbergh, and O. Kudina. Towards an environmental ethics of artificial intelligence, 2024.
29. Alan F. T. Winfield and Marina Jirotka. Ethical governance is essential to building trust in robotics and AI systems. Science Robotics, 6(55), 2021.
30. Y. Zhang et al. RVLLM: LLM runtime verification with domain knowledge. In NeurIPS, 2025.
31. K. Šekrst, J. McHugh, and J. R. Cefalu. AI ethics by design: Implementing customizable guardrails, 2024.

Declarations

Funding. No external funding was received.

Conflicts of Interest. The author is the founder of SPQR Technologies and retains ownership of intellectual property related to the Aegis enforcement framework. This includes cryptographic enforcement protocols, ethical governance layers, and the SPQR HIEMS ZK engine. No external funding was used to influence the structure, argument, or claims of this paper.

Data Availability. Because the implementation includes proprietary infrastructure, full source release is not currently available. The manuscript reports aggregate experimental results and describes the architecture at a level intended to support scholarly evaluation. Additional non-public artifact summaries may be made available to editors or reviewers upon reasonable request.
