1. Install
2. Define an Alignment Card
The Alignment Card declares your agent’s alignment posture. AIP uses it to evaluate thinking blocks.3. Check integrity
Evaluate a thinking block against the card:Python
TypeScript
4. Handle signals
The signal tells your host system what to do:Verdict to action mapping
| Verdict | Proceed | Action |
|---|---|---|
clear | Yes | continue |
review_needed | Yes | log_and_continue |
boundary_violation | No | pause_for_review or deny_and_escalate |
5. Extract thinking blocks from LLM responses
Use provider adapters to extract thinking blocks from different LLM providers:6. Add conscience values (optional)
Conscience values increase detection sensitivity for specific concerns:BOUNDARY and FEAR types are injected into the conscience prompt. BOUNDARY violations trigger boundary_violation verdicts. FEAR values increase sensitivity to specific concern patterns.
7. Detect drift
Monitor for behavioral drift across a session:Next steps
- Read the full specification for protocol details
- See the security model for the threat model
- See limitations for what AIP does and does not guarantee