Self-Improving Agents

Tested on: OpenClaw 2026.4.x with GPT 5.5 (main + memory sweep), tokenjuice + tool-narration-guard plugins
Last updated: 2026-04-19

How to build an AI agent that learns from corrections, captures mistakes as institutional knowledge, runs automated memory sweeps, and gets better over time instead of repeating the same errors.

The Problem

AI agents wake up fresh every session. They have no memory of what they did wrong yesterday. Without a system for capturing and surfacing corrections, you’ll give the same feedback over and over:

“Stop pushing directly to main”
“Don’t use em dashes”
“Check what’s already loaded before searching”
“Stop narrating what you’re about to do - just call the tool”
“Check the PR state before pushing follow-ups”

Each correction costs you time and tokens. A self-improving agent captures corrections once and applies them forever.

Two complementary mechanisms: content memory (what you correct) and plugin hooks (what the agent catches itself doing wrong in real time). Both matter. This guide covers both.

Architecture: Three Feedback Loops

Loop 1: Immediate Capture (Every Correction)

When you correct your agent mid-conversation, the agent should:

Fix the immediate issue
Write a knowledge card documenting what went wrong and the correct approach
Apply the correction for the rest of the session

Detection Triggers

Train your agent to recognize corrections:

“No, do it this way…”
“I told you not to…”
“That’s wrong because…”
“Stop doing X”
“Why did you…”
Any pushback that changes the agent’s approach

Knowledge Card Format

---
topic: "Correction: Always Use Feature Branches"
category: lessons
tags: [correction, git, workflow]
created: 2026-03-17
---

## What Happened
Pushed a config change directly to main instead of creating a PR.

## Correct Approach
NEVER push to main. Always: branch → commit → PR → automated review → merge.
PRs get CodeRabbit and Greptile reviews automatically.

## Why It Matters
Direct pushes bypass code review. Even "simple" changes can break things.
The review pipeline catches issues before they hit production.

Cards should be specific, actionable, and searchable. “Don’t do X, do Y instead, because Z.”

Loop 2: Periodic Promotion (Weekly)

During maintenance (heartbeats or scheduled cron), review recent correction cards and promote recurring patterns:

If the correction…	Promote to…
Applies broadly across all tasks	AGENTS.md (core operating rules)
Is domain-specific (git, writing, security)	`rules/<domain>.md`
Has happened 3+ times	Definitely promote, with emphasis
Was a one-time thing	Leave as knowledge card only

Promotion Example

Three separate corrections about token waste:

Card 1: Burned tokens debugging infra when they were about to expire
Card 2: Spawned Opus for a task Haiku could handle
Card 3: Loaded full memory file instead of searching

Promoted rule in AGENTS.md:

## Token Discipline
- Expiring tokens are use-it-or-lose-it. Ship content, don't debug infra.
- Use the cheapest model that handles the task. Pre-flight check agent mappings.
- Search memory, don't load it. Semantic search returns what you need.

Loop 3: Pre-Task Self-Audit (Every Task)

Before starting work, the agent should check:

Have I been corrected on something like this before? (search memory for similar tasks)
Does this match a known anti-pattern? (check lessons cards)
Am I about to do something explicitly prohibited? (check rules files)

Add this to your AGENTS.md:

## Self-Audit Before Every Task
Before completing any task, ask yourself:
- Have I been corrected on something like this before? (search memory)
- Does this match a known anti-pattern?
- Am I about to do something the user has explicitly said not to do?

Plugin Hooks: Catch Behaviors In-Flight

Two plugins we run in production turn specific correction patterns into automatic guardrails.

tool-narration-guard

What it fixes: GPT 5.5 (and other models) sometimes narrate what they’re about to do instead of actually calling the tool. “I’m running the build now” with no subsequent tool call. You notice 30 minutes later that nothing happened.

How it works: The plugin tracks runs at the session level. When it detects narration without a follow-up tool call in the same turn, it injects a prependContext rule on the next turn that forces the model to either call the tool or say “I can’t.” No more silent stalls.

Enable:

{
  "plugins": {
    "allow": ["tool-narration-guard", "..."],
    "entries": {
      "tool-narration-guard": { "enabled": true }
    }
  }
}

The plugin lives at ~/.openclaw/workspace/.openclaw/extensions/tool-narration-guard/. Load it via plugins.load.paths.

tokenjuice (PostToolUse optimization)

What it does: A PostToolUse hook plugin that re-shapes tool output to reduce subsequent turn token burn. Measured in production:

Surface	Mode	n	Effect	Notes
Claude Code	PostToolUse	11	+1.1%	Below noise floor, ignored
Codex	PostToolUse	11	−4.4%	Bimodal (27% retry on compound `cd && cmd`)
Pi	PostToolUse	10	−17.7%	Clean runs −25.3%, 30% compound-gap retry

Pi is the flagship surface. OpenClaw loads the same extension format, so tokenjuice lands there too.

Caveat: Claude Code 2.1.113+ doesn’t substitute tool_result from PostToolUse hooks - only additionalContext lands. If you’re chasing the full Pi savings profile on Claude Code, don’t expect it.

Error Detection & Learning Capture

Beyond corrections from the user, automate detection of errors in tool output.

Error Patterns to Log

- Exit codes != 0
- "error:", "Error:", "ERROR"
- "command not found", "No such file or directory"
- "permission denied", "EACCES", "EPERM"
- "ECONNREFUSED", "ETIMEDOUT"
- "SyntaxError", "TypeError", "ImportError"
- HTTP 4xx/5xx responses
- "rate limit", "quota exceeded", "429"
- Timeout errors, OOM kills
- Git merge conflicts

Structured Error Logging

Create a learning log that captures errors, corrections, and feature gaps:

## [ERR-20260317-001] SSH socket override missing

**Logged**: 2026-03-17 08:15
**Priority**: high
**Status**: resolved
**Area**: infra

### Summary
ListenAddress in sshd_config ignored on Ubuntu 24.04 due to socket activation.

### Details
Set ListenAddress in sshd_config but SSH kept listening on 0.0.0.0.
Ubuntu 24.04 uses systemd socket activation. The socket unit overrides sshd_config.
Fix: create /etc/systemd/system/ssh.socket.d/override.conf

### Action
Always check for systemd socket activation before editing service configs.
On Ubuntu 24.04+, SSH is socket-activated by default.

Entry Types

Type	Prefix	When to Log
Error	ERR	Tool output contains error patterns
Learning	LRN	User corrects approach, workaround found, assumption was wrong
Feature	FEAT	Missing capability identified, workflow gap found

Promotion Rules

3+ occurrences of the same pattern: promote to AGENTS.md or rules file
Behavioral correction: promote to rules file immediately
Workflow improvement: promote to AGENTS.md
Update original entry status to “promoted” when promoting

Daily Memory Sweep

Set up an automated cron job that reviews recent sessions across all channels and distills important information into the memory system.

Cron Configuration

{
  "name": "memory-sweep",
  "schedule": {
    "kind": "cron",
    "expr": "0 */6 * * *",
    "tz": "America/New_York"
  },
  "payload": {
    "kind": "agentTurn",
    "message": "Review recent sessions across all channels. For each session:\n1. Identify significant decisions, corrections, or lessons\n2. Create or update knowledge cards for anything worth persisting\n3. Update daily log with session summaries\n4. Check for correction patterns that should be promoted to rules\n5. Skip trivial conversations (greetings, simple lookups)",
    "model": "openai-codex/gpt-5.5"
  },
  "delivery": { "mode": "none" },
  "sessionTarget": "isolated"
}

This runs on the main gpt-5.5 (medium thinking), not gpt-5.5:cron. Memory sweep needs judgment about what’s worth keeping - thinking low produces shallower cards.

What the Sweep Does

Pulls recent sessions across Discord, Telegram, and any other configured channels
Reads conversation history for each session
Extracts decisions, corrections, errors, and new information
Writes knowledge cards for significant findings
Updates daily logs with session summaries
Checks for patterns that should be promoted to rules

Model Selection

Use your main orchestration model (GPT 5.5 medium) for the sweep. The task is structured extraction with some judgment - memory sweeps miss nuance when run on a smaller or thinking-low model. Don’t route the sweep through Claude Code: the escalation lane is for high-judgment review work, not back-office housekeeping.

Real Corrections We’ve Captured

These are actual correction cards from production use, showing what the system catches:

“Stop Building, Start Delegating”

Trigger: User noticed agent was hand-coding React components instead of writing specs and spawning Codex sub-agents.

Card created: Documents the correct workflow (main writes specs, Codex builds, Claude reviews when needed) and explicit rules against hand-coding when a code-specialized model is available.

Promoted to: AGENTS.md and SOUL.md core rules.

”Check Context Before Searching”

Trigger: Agent ran memory_search to find information that was literally in its system prompt (SOUL.md was already loaded).

Card created: Documents the correct lookup order: system prompt → conversation history → memory search → ask user.

Promoted to: Self-audit checklist.

”Expiring Tokens Are Use-It-Or-Lose-It”

Trigger: Agent spent 7 minutes debugging sub-agent infrastructure while API tokens were about to expire and reset. Should have written content directly instead.

Card created: Decision tree for expiring tokens (try sub-agents → if fail, write inline → debug later on fresh quota).

Promoted to: Token discipline rules.

”Never Push to Main”

Trigger: Agent pushed directly to main branch, bypassing the PR pipeline with CodeRabbit and Greptile automated reviews.

Card created: Explicit rule: branch → commit → PR → review → merge. No exceptions. No “I’ll PR it later.”

Promoted to: AGENTS.md mandatory workflow rule.

The Self-Audit Checklist

Add this to your AGENTS.md for your agent to reference before every task:

## Pre-Task Self-Audit
1. Search memory for corrections related to this type of task
2. Check rules/*.md for domain-specific rules
3. Verify model assignment matches the task (don't use Opus for scanning)
4. Confirm output format matches platform (no markdown tables on Discord)
5. If it's a big task, ask clarifying questions before executing
6. Check if a similar task was done recently (avoid duplicate work)

Implementation Checklist

# Check that correction capture is working
echo "=== Knowledge Cards (Lessons) ==="
ls ~/.openclaw/workspace/memory/cards/ | grep -E "lesson|correction" | wc -l
echo "correction/lesson cards"

echo ""
echo "=== Rules Files ==="
ls ~/.openclaw/workspace/rules/ 2>/dev/null || echo "No rules directory"

echo ""
echo "=== Self-Improving Rules ==="
[ -f ~/.openclaw/workspace/rules/self-improving.md ] && echo "✓ self-improving.md exists" || echo "✗ self-improving.md missing"

echo ""
echo "=== AGENTS.md Self-Audit Section ==="
grep -c "self-audit\|Self-Audit\|self-improvement\|Self-Improvement" ~/.openclaw/workspace/AGENTS.md 2>/dev/null || echo "0 references"

echo ""
echo "=== Memory Sweep Cron ==="
# Check if memory sweep cron exists
echo "Verify via: openclaw cron list | grep memory-sweep"

echo ""
echo "=== Error Detection ==="
[ -f ~/.openclaw/workspace/ERROR_DETECTION.md ] && echo "✓ ERROR_DETECTION.md exists" || echo "✗ ERROR_DETECTION.md missing"

Gotchas

Corrections are perishable. If you don’t capture a correction immediately, it’s gone next session. The agent literally won’t remember. Write the card in the same conversation where the correction happens.
Memory search quality depends on card quality. Vague cards like “don’t do bad things” won’t surface when searching for specific issues. Be specific: “Don’t use greedy regex to remove JS object blocks. Use line-by-line Python removal.”
Over-promotion clutters rules. Not every one-off correction needs to be in AGENTS.md. Use the 3-occurrence threshold. One-time corrections stay as knowledge cards.
The sweep model matters. A budget model doing the memory sweep might miss nuance in conversations. A code-specialized model (Codex, GPT 5.5) is better at structured extraction than a small local model.
Corrections compound. The first month of running a self-improving agent is noisy because there are lots of corrections. By month three, correction frequency drops significantly because the agent has internalized the patterns. The system works, but it takes time.
Don’t skip the pre-task search. The whole system falls apart if the agent doesn’t check for relevant corrections before starting work. Make the self-audit a non-negotiable part of AGENTS.md.
Prompt-level guards don’t scale; hook-level guards do. If a correction keeps surfacing, don’t keep adding sentences to AGENTS.md - build (or enable) a plugin that makes the wrong behavior structurally impossible. tool-narration-guard is the canonical example.
Validate memory before acting on it. A memory card naming a specific file, function, or flag is a claim about a snapshot in time. Files get renamed, endpoints get removed. Before recommending something pulled from a memory card, grep for it or read the file. “The card says X exists” is not the same as “X exists now.”