How Aevum Governs Itself¶
Aevum provides cryptographically auditable, policy-governed AI agent execution. The most credible demonstration of that claim is governing its own development. Every change to Aevum — every security patch, every protocol update, every enhancement — flows through Aevum's own governed pipeline before landing in the codebase.
Live demo: demo.aevum.build — see the governance pipeline in action, replay any maintenance session, and try the sandbox workflow.
Source: aevum-labs/aevum-maintainer
Architecture¶
aevum-maintainer is a reference application built on top of Aevum that
governs the monthly maintenance workflow. It is a separate repository —
not part of the Aevum monorepo — because it demonstrates what adopters
build on top of Aevum, not what Aevum itself provides.
Aevum's five governed functions map directly onto the maintenance workflow:
GitHub Actions (scan results)
│
▼ ingest() — OIDC-verified provenance
Sigchain ──────────────────────────────────────────┐
│ │
▼ query() via MCP │
Claude (research session) │
│ Research Report │
▼ │
Maintainer approval │
│ review() + commit() → consent receipt │
▼ │
Claude Code (execution) │
│ A2A v1.0 task (consent receipt embedded) │
▼ │
Changes applied to codebase ─────────────────────► │
│
Any future auditor ──── replay() ──────────────────┘
cryptographic proof
The four principals¶
Each principal has exactly the permissions Cedar grants it — no more:
| Principal | Permitted actions |
|---|---|
github_actions |
ingest(scan_results) only — OIDC-verified |
research_agent (Claude) |
query(sigchain), ingest(research_findings) |
execution_agent (Claude Code) |
execute(approved_task) — consent receipt required |
maintainer |
All actions including constitutional operations |
The five absolute barriers¶
These are compiled-in invariants in aevum-maintainer. Cedar policies
can further restrict — they cannot relax these:
| Barrier | Effect in the maintenance pipeline |
|---|---|
| Crisis | Halts any action flagged as threatening project integrity |
| Classification Ceiling | Cedar limits which data each principal can access |
| Consent | No execution agent task without a sigchain-verified consent receipt |
| Audit Immutability | Sigchain entries cannot be modified after commit |
| Provenance | Every ingest requires OIDC-verified provenance claims |
The Monthly Maintenance Workflow¶
The workflow runs automatically on the 1st of every month:
-
GitHub Actions runs
pip-auditand a dependency freshness scan. Results are committed tomaintenance/scan_results.mdin the repo. This triggersaevum-maintainer's ingest endpoint, which verifies the GitHub OIDC token and writes a sigchain entry with the scan data and its provenance. -
Claude (research session) queries the sigchain via the MCP research interface — 6 read-only tools, Cedar-gated, every call logged as an audit entry. Claude produces a Research Report classifying findings by severity.
-
The maintainer reviews the Research Report and approves the proposed actions. This approval is captured as a consent receipt in the sigchain — a
commit()entry signed with the maintainer's identity, linking back to the scan data that justified it. -
Claude Code receives an A2A v1.0 task containing the consent receipt hash in a
DataPart. Theverify-taskCLI checks the hash against the sigchain before any action is taken. Without a valid consent receipt, execution is blocked by the Consent barrier. -
Any future auditor can run
GET /v1/replay/{session_id}against theaevum-maintainerAPI to retrieve every action in a session, verify the cryptographic chain, and confirm that human consent preceded every execution.
EU AI Act Article 12 Demonstration¶
Article 12 of the EU AI Act requires that high-risk AI systems "technically allow for the automatic recording of events (logs) over the lifetime of the system," supporting three purposes:
- (a) identifying situations that may result in risk
- (b) facilitating post-market monitoring
- (c) monitoring operation by deployers
The aevum-maintainer sigchain satisfies all three for its own
maintenance pipeline:
- Automatic recording: every scan, research query, consent decision, and execution is written to the sigchain automatically — no manual export required.
- (a) Risk identification:
reviewentries record what was flagged and the maintainer's risk assessment. - (b) Post-market monitoring:
replayentries provide a verifiable reconstruction of any session. - (c) Deployer oversight: consent entries prove that a human reviewed and approved each action before execution.
The GET /v1/compliance/{session_id} endpoint produces a structured
Article 12 report for any maintenance session. Try it at
demo.aevum.build under the Compliance tab.
Scope of this claim
The maintenance pipeline is not classified as a high-risk AI system under Annex III of the EU AI Act. This demonstration shows that Aevum provides the infrastructure needed for Article 12 compliance in systems that are high-risk — and proves it works by applying the same controls to its own development.
External Tamper Evidence¶
The sigchain head hash is anchored to the public record weekly via a
GitHub Actions workflow (rekor-anchor.yml). Anchor entries accumulate
in maintenance/rekor_anchors.jsonl — an append-only, git-tracked file
that provides third-party tamper evidence without requiring a running
service.
This means: even if the Fly.io deployment were compromised, the weekly anchor entries in the public git history prove what the sigchain state was at each anchor point.
The Break-Glass Path¶
A self-governing system must be able to fail gracefully. If
aevum-maintainer itself becomes unavailable — sigchain corruption,
Cedar lockout, key loss — the break-glass CLI creates a bypass entry
in a separate break_glass_log.jsonl (not the main sigchain), returns
a signed bypass receipt, and the maintainer includes that receipt in
the manual PR description as an audit trail of the bypass.
The break-glass path is documented, tested, and exercised before each major release. See the aevum-maintainer break-glass docs for the full procedure.
Progressive Autonomy¶
The trust ratchet tracks consecutive approved actions by class. After
10 consecutive approvals of the same action class (e.g. dep_bump)
with no modification, the system proposes promotion to auto_window —
automatic execution with a 24-hour undo window.
Promotion requires an explicit maintainer decision — it is never automatic. Demotion happens automatically on any override or Crisis trigger. Policy changes, schema changes, and key rotations are hardcoded as always requiring human approval — they cannot be promoted.
This implements the EU AI Act Article 14 human oversight requirement: the ability to monitor, intervene, and halt is always preserved even as routine actions become more automated.
OWASP Agentic AI Top 10 Coverage¶
| OWASP Risk | Aevum Barrier | Mechanism |
|---|---|---|
| ASI01 Agent Goal Hijack | Consent | No task executes without a sigchain-verified consent receipt |
| ASI02 Tool Misuse | Classification Ceiling | Cedar limits which tools each principal can invoke |
| ASI03 Identity/Privilege Abuse | Provenance | OIDC-verified principal identity on every sigchain entry |
| ASI04 Resource Overuse | Crisis | Crisis barrier halts any flagged action immediately |
| ASI05 Data Privacy Breach | Classification Ceiling | Cedar enforces data compartmentalization |
| ASI06 Cascading Hallucination | Consent + Audit | Human approval required; immutable audit trail |
| ASI07 Intent Misalignment | Consent | Proposed action is human-readable and human-approved |
| ASI08 Context Manipulation | Audit Immutability | Sigchain entries cannot be modified after commit |
| ASI09 Rogue Agent Behavior | Consent + Provenance | Verified identity and consent receipt required to act |
| ASI10 Supply Chain Exploit | Provenance + Audit | OIDC provenance on all ingested scan results |
See the OWASP tab at demo.aevum.build for an interactive view of how each risk was addressed in the most recent maintenance session.
See Also¶
- Architecture — Aevum's five governed functions and five barriers
- Audit Trails and Article 12 — compliance record-keeping
- Standards Alignment — regulatory and standards mapping
- demo.aevum.build — live interactive demonstration
- aevum-labs/aevum-maintainer — source code