An open standard for agent governance.
ABGS defines what goes in a SOUL.md file, measures governance coverage across nine domains, and provides three conformance tiers for auditing agent deployments. Open source. Tooled. Adversarially tested.
npx opena2a-cli scan-soulSee governance coverage in seconds
ABGS controls map to nine domains. The scanner reports per-domain coverage, flags partial implementations, and computes a conformance tier.
# Scan for governance coverage
$ npx opena2a-cli scan-soul
SOUL.md Governance Scan
File: SOUL.md (2,847 chars)
Domain 07: Trust Hierarchy ....... PASS
Domain 08: Capability Bounds ..... PASS
Domain 09: Injection Hardening ... PASS
Domain 10: Data Handling ......... PASS
Domain 11: Hardcoded Behaviors ... PARTIAL
Domain 12: Agentic Safety ........ PASS
Domain 13: Honesty ............... PASS
Domain 14: Human Oversight ....... PARTIAL
Domain 15: Harm Avoidance ........ PASS
Conformance: STANDARD (35/42 controls)The gap ABGS fills
The AI agent ecosystem has standards for several layers of the stack. Behavioral governance at the deployment level was unaddressed until ABGS.
| Layer | Standard | Coverage |
|---|---|---|
| Foundation model | Anthropic Soul, OpenAI Model Spec | How the base model should behave |
| Agent persona | SoulSpec | Personality, tone, and character traits |
| Agent capabilities | Agent Skills | Procedural knowledge and tools |
| Infrastructure | NIST AI Agent Standards | Identity, interoperability, infrastructure |
| Coding instructions | AGENTS.md | Instructions for AI coding agents |
| Behavioral governance | ABGS (this standard) | Per-agent safety constraints, scope boundaries, oversight requirements |
Nine governance domains
ABGS covers domains 7 through 15 of the OASB framework. Each domain contains specific, testable controls that scan automatically.
Trust Hierarchy
Defines who the agent trusts, in what order, and how conflicts between principals are resolved.
Capability Boundaries
Declares what the agent can and cannot do. Explicit scope limits, denied actions, and escalation paths.
Injection Hardening
How the agent handles prompt injection, context manipulation, and adversarial inputs.
Data Handling
Rules for sensitive data. What to retain, what to redact, and how to handle PII across sessions.
Hardcoded Behaviors
Non-negotiable behaviors that cannot be overridden by any principal, including the developer.
Agentic Safety
Constraints for tool use, multi-step planning, resource consumption, and autonomous decision-making.
Honesty and Transparency
Requirements for truthfulness, uncertainty expression, and disclosure of agent identity and limitations.
Human Oversight
When the agent must pause and request human approval before proceeding with high-impact actions.
Harm Avoidance
Guidance for exercising judgment in gray areas where an action is technically permitted but may cause unintended harm.
Three conformance tiers
Start at BASIC and work toward AGENTIC as your deployment matures. Each tier names the controls a scanner expects to find.
BASIC
Minimum governance for any deployed agent.
Conversational and read-only assistants.
- Trust hierarchy and principal ordering
- Core capability boundaries
- Data handling and PII rules
STANDARD
Full behavioral governance for tool-using agents.
Code assistants, function callers, MCP clients.
- All BASIC controls
- Injection hardening for adversarial inputs
- Honesty and human oversight requirements
AGENTIC
Maximum governance for autonomous and multi-agent systems.
Autonomous planners, orchestrators, high-risk deployments.
- All STANDARD controls
- Agentic safety for multi-step planning
- Hardcoded behaviors and harm avoidance in full
Relationship to OASB
ABGS extends the Open Agent Security Benchmark with the behavioral layer that infrastructure standards do not cover.
OASB v1. Infrastructure security.
Domains 1 through 6. Authentication, authorization, transport security, logging, configuration, dependency management.
ABGS. Behavioral governance.
Domains 7 through 15. Trust hierarchy, capability boundaries, injection hardening, data handling, hardcoded behaviors, agentic safety, honesty, human oversight, harm avoidance.
OASB v2. Full stack assessment.
Integrates infrastructure and behavior into domains 1 through 15 for comprehensive AI agent security assessment.
Quick start
Generate a governance file, scan it for coverage, and iterate until you reach your target conformance tier.
# 1. Generate a SOUL.md with guided prompts
$ npx opena2a-cli harden-soul
# 2. Scan the file for governance coverage
$ npx opena2a-cli scan-soul
# 3. Or start from a template
$ npx opena2a-cli harden-soul --template agentic
# 4. Validate at a target tier
$ npx opena2a-cli scan-soul --tier standardScanning runs against the same engine that powers HackMyAgent. 209 static checks, 29 semantic checks, 164 adversarial payloads. Versions current at HackMyAgent 0.22.2 and opena2a-cli 0.10.2.
Define your agent's behavioral contract
Start with a template, scan for coverage, and reach conformance. The standard is open. The tooling is free. The CLI runs locally.
npx opena2a-cli scan-soul