Trust Recovery via reclassification

When an agent’s trust score drops due to violations, the cause matters. Not every violation reflects a genuine behavioral failure — sometimes the agent acted correctly but the alignment card was missing a capability. These are card gaps, and they can be recovered through reclassification. This guide walks through the full recovery workflow: identifying whether a violation is a card gap or a behavior gap, submitting a reclassification request, amending the alignment card, and triggering score recomputation.

Understanding card_gap vs behavior_gap

Before reclassifying a violation, you need to determine its root cause. Violations fall into two categories, and only one is eligible for reclassification.

card_gap
behavior_gap

The alignment card was outdated or incomplete. The agent behaved correctly given its actual capabilities, but the card did not declare those capabilities.Common causes:

New tools added without updating the card — e.g., added MCP browser tools but the card did not declare web_fetch in bounded_actions
Configuration drift after deployment changes — infrastructure updates introduced new action types the card did not anticipate
Incomplete initial card setup — the card was created from a template and never customized to match the agent’s real toolset

Example: An agent uses mcp__browser__navigate to fetch documentation during a research task. The card lists bounded_actions: ["inference", "read", "write"] but does not include web_fetch. The integrity checkpoint flags a boundary violation — but the agent did exactly what it was supposed to do. The card was wrong, not the agent.Eligible for reclassification: Yes.

The agent genuinely violated its declared intent. The card accurately described the expected behavior, but the agent acted outside those boundaries.Common causes:

Used forbidden actions — the agent performed an action explicitly listed in forbidden_actions
Exceeded autonomy boundaries — the agent took autonomous action beyond its declared limits (e.g., spent over the max_autonomous_value threshold)
Failed to escalate when required — an escalation trigger condition was met but the agent did not escalate

Example: An agent with forbidden_actions: ["exfiltrate_data"] sends user data to an external endpoint. The card was correct; the agent violated it.Eligible for reclassification: No. Address the root cause by fixing agent behavior, adjusting prompting, or tightening escalation triggers.

The root cause that inspired CLPI: an agent repeatedly broke verification due to configuration drift (new tools added without updating the alignment card). The violations were config errors, not behavioral errors, yet trust scores declined.

Reclassification workflow

Step 1: Identify violations to reclassify

Review recent violations via API or dashboard to find checkpoints that may be card gaps.

Fetch existing reclassifications and pending violations:

curl https://api.mnemom.ai/v1/agents/{agent_id}/reclassifications \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY"

Or check the agent’s canonical alignment card and evaluate it against the tools the agent actually uses — the composed card’s capabilities and enforcement sections are what the policy engine enforces, and unmapped tools surface as card_gap candidates:

# Fetch the canonical alignment card (includes composition metadata)
curl -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  "https://api.mnemom.ai/v1/alignment/agent/{agent_id}?include_composition=true"

# Or evaluate a tool set against the current policy to see coverage + violations.
# `policy` carries the policy DSL (parsed YAML → JSON); see /specifications/policy-dsl.
curl -X POST https://api.mnemom.ai/v1/policies/evaluate \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "policy": { "meta": { "schema_version": "1.0", "name": "current-policy", "scope": "agent" } },
    "agent_id": "{agent_id}",
    "tools": ["mcp__browser__navigate", "mcp__filesystem__read_file"]
  }'

Look for checkpoints where:

The concern_type is autonomy_violation or boundary_violation

The agent’s action was correct given its real capabilities

The card’s bounded_actions or escalation_triggers did not cover the action

Only violations classified as card_gap are eligible for reclassification. Attempting to reclassify a genuine behavior_gap will be rejected by the API and may flag the agent for review.

Step 2: Submit reclassification request

Submit a reclassification for each checkpoint that was a card gap. Include a clear reason documenting why this was a card issue, not a behavioral one.

curl

curl -X POST https://api.mnemom.ai/v1/agents/{agent_id}/reclassify \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "checkpoint_id": "ic-abc12345",
    "reason": "Agent correctly used mcp__browser__navigate for research, but web_fetch was missing from card bounded_actions"
  }'

TypeScript

const response = await fetch(
  `https://api.mnemom.ai/v1/agents/${agentId}/reclassify`,
  {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${apiKey}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      checkpoint_id: 'ic-abc12345',
      reason:
        'Agent correctly used mcp__browser__navigate for research, but web_fetch was missing from card bounded_actions',
    }),
  }
);

Python

import httpx

response = httpx.post(
    f"https://api.mnemom.ai/v1/agents/{agent_id}/reclassify",
    headers={"Authorization": f"Bearer {api_key}"},
    json={
        "checkpoint_id": "ic-abc12345",
        "reason": "Agent correctly used mcp__browser__navigate for research, but web_fetch was missing from card bounded_actions",
    },
)

The API returns a reclassification record with a unique ID and status. Reclassifications take effect immediately for score computation purposes.

Step 3: Amend the alignment card

Reclassification removes the penalty for past violations, but without updating the card, the same violation will recur on the next checkpoint. Always amend the card alongside reclassification.

Update the card to include the missing capability. The full unified-card shape is in /specifications/alignment-card-schema; this example shows just the section that needs to change:

curl -X PUT https://api.mnemom.ai/v1/alignment/agent/{agent_id} \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  -H "Content-Type: text/yaml" \
  -H "Idempotency-Key: $(uuidgen)" \
  --data-binary @- <<'YAML'
card_version: unified/2026-04-15
card_id: ac-your-card-id
agent_id: {agent_id}
issued_at: "2026-02-21T00:00:00Z"

values:
  declared: [transparency, honesty]

autonomy:
  bounded_actions: [inference, read, write, web_fetch, web_search]
  forbidden_actions: [exfiltrate_data]

audit:
  trace_format: ap-trace-v1
  retention_days: 90
  tamper_evidence: append_only
YAML

Or link the card amendment directly to the reclassification for a cleaner audit trail:

curl -X POST https://api.mnemom.ai/v1/agents/{agent_id}/reclassify \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "checkpoint_id": "ic-abc12345",
    "reason": "Card gap: web_fetch missing from bounded_actions",
    "card_amendment_id": "amend-xyz789"
  }'

Reclassifying without amending the card is a common mistake. The violation is forgiven, but the next checkpoint with the same action will produce a new violation. Always close the gap.

Step 4: Trigger score recomputation

After reclassification and card amendment, trigger a score recomputation to reflect the changes immediately. Without this step, the score updates on the next scheduled recomputation cycle.

curl -X POST https://api.mnemom.ai/v1/reputation/{agent_id}/recompute \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY"

The recomputation recalculates all five scoring components (Integrity Ratio, Compliance, Drift Stability, Trace Completeness, and Coherence Compatibility) using the updated violation classifications.

Step 5: Monitor trust graph propagation

Score changes do not stop at the reclassified agent. If the agent is part of a team or fleet, score changes propagate through the trust graph to related agents.

Propagation mechanics:

BFS traversal from the affected agent outward

Max depth: 3 hops

Max agents: 50 per propagation event

Decay factor: 0.85 per hop (a 100-point improvement on the source agent produces an ~85-point signal at hop 1, ~72 at hop 2, ~61 at hop 3)

Monitor propagation via the dashboard or subscribe to webhook events:

curl -X POST https://api.mnemom.ai/v1/orgs/{org_id}/webhooks \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://your-server.com/webhooks/mnemom",
    "event_types": ["reputation.score_changed", "reputation.grade_changed"]
  }'

Step 6: Export compliance record

After recovery, export the full compliance record for audit purposes. The export includes the original violations, reclassification decisions, card amendments, and score history.

curl https://api.mnemom.ai/v1/agents/{agent_id}/compliance-export \
  -H "X-Mnemom-Api-Key: $MNEMOM_API_KEY"

The export is a JSON document suitable for regulatory review, SOC 2 evidence, or internal governance reporting.

Best practices

Reclassify promptly

Violations continue to impact scores until reclassified. The longer a card-gap violation sits unaddressed, the more it drags down the Compliance and Integrity components.

Always amend the card

Reclassification without a card amendment is incomplete. The same action will trigger the same violation on the next checkpoint. Close the loop.

Use policy recommendations

The policy engine surfaces systematic card gaps in its recommendations. Review policy/resolved output regularly to catch gaps before they become violations.

Document reasons clearly

Reclassification reasons are stored in the audit trail. Write reasons that a reviewer (human or automated) can understand months later.

Review history before publishing

Before publishing a new card version, review reclassification history to ensure all known gaps are addressed in the updated card.

Test cards against your tools

Use mnemom card evaluate card.yaml --tools <tool-list> to catch card gaps before they become violations. Run this in CI after any card or tool configuration change — see CI/CD policy gates.

​Trust Recovery via reclassification

​Understanding card_gap vs behavior_gap

​Reclassification workflow

​Best practices

Reclassify promptly

Always amend the card

Use policy recommendations

Document reasons clearly

Review history before publishing

Test cards against your tools

​See also

Trust Recovery via reclassification

Understanding card_gap vs behavior_gap

Reclassification workflow

Best practices

See also