Multi-Agent Setup
Configure multiple AI agents with Alignment Cards, verify value compatibility before coordination, and monitor fleet-wide alignment with smoltbot. When you operate more than one agent — a shopping assistant, a data analyst, an internal scheduler — each one needs its own alignment posture. And when those agents interact, their declared values need to be compatible. This guide walks through registering multiple agents, checking value coherence between them, monitoring the fleet, and configuring enforcement per agent.Why Multi-Agent Alignment Matters
A single agent’s alignment is between it and its principal. Multi-agent alignment introduces a harder problem: inter-agent compatibility. Two agents can each be perfectly aligned with their respective principals while being fundamentally incompatible with each other. Consider:- Agent A declares
minimal_dataas a core value. Agent B requirescomprehensive_analyticsto function. If A delegates data collection to B, whose value wins? - Agent A commits to
transparencyand discloses all reasoning. Agent B treats its decision process as proprietary. Their definitions of good behavior conflict. - Agent A’s
conflicts_withlist includes a value that Agent B declares. No amount of runtime negotiation fixes a structural incompatibility.
Registering Multiple Agents
Each agent gets its own Alignment Card declaring its values, autonomy envelope, and audit commitment. There is no shared card for a fleet — alignment is per-agent. Here are two agents with different value sets and operational scopes:Value Coherence Checks
Before two agents coordinate on a task, run a coherence check to verify their values are compatible. Thecheck_coherence function compares both Alignment Cards and returns a structured result.
Understanding the Result
The coherence result contains three key fields:| Field | Type | Description |
|---|---|---|
compatible | boolean | Whether the agents can safely coordinate. false if any declared value conflicts with the other agent’s conflicts_with list. |
score | float (0-1) | Numeric coherence score. Higher means more value overlap. Scores below 0.70 trigger a proceed: false recommendation. |
conflicts | list | Specific value conflicts found — e.g., one agent declares a value the other explicitly opposes. |
Coherence checks operate on declared values only. They verify that two agents’ claims are compatible, not that either agent actually acts on those values. Behavioral verification happens through AP-Traces and integrity checkpoints.
Fleet Monitoring with smoltbot
When you run multiple agents through the smoltbot gateway, each agent gets its own:- Trace history — Every AP-Trace and integrity checkpoint is stored per agent, queryable by agent ID.
- Integrity scores — Rolling integrity ratio computed from recent checkpoints. A healthy agent stays above 0.90.
- Drift alerts — Automated alerts when behavioral patterns shift away from the agent’s declared alignment.
Drift Detection Across Agents
Each agent should be monitored for drift independently. Drift is the gradual divergence of an agent’s behavior from its declared alignment — it rarely appears as a single dramatic violation. More commonly, an agent that used to escalate 15% of decisions quietly drops to 5%, or stops applying a declared value in its traces. Usedetect_drift per agent to compare recent traces against the agent’s baseline:
sustained_threshold parameter controls how many consecutive low-similarity traces are required before an alert fires, preventing false positives from one-off edge cases.
For the full drift detection algorithm, calibration thresholds, and alert structure, see Drift Detection.
Enforcement Modes Per Agent
Each agent in your fleet can have its own enforcement mode, controlling how smoltbot responds when violations are detected. The three modes are:| Mode | Behavior |
|---|---|
| observe | Detect and record violations, take no action. Default for new agents. |
| nudge | Inject feedback into the agent’s next request so it can self-correct. |
| enforce | Hard block (403) for non-streaming requests. Falls back to nudge for streaming. |
- Production agents on
enforce— violations are blocked before reaching end users. - Staging agents on
observe— violations are recorded for review during testing without blocking development workflows. - New agents on
nudge— the agent gets feedback and a chance to self-correct while you build confidence in its alignment.
Next Steps
AAP Quickstart
Create your first Alignment Card, generate a traced decision, and verify it in 5 minutes.
Enforcement Modes
Deep dive into observe, nudge, and enforce modes with streaming behavior and configuration options.
API Reference
Full API reference for agents, traces, coherence checks, drift detection, and enforcement.