Chapter I

AI agent stack

Multi-model orchestration. One orchestrator, many models, escalation lanes. Most of these guides assume OpenClaw as the agent framework, but the patterns generalize.

Browser Chromium lane

Not every useful model path is an API.

This stack also uses a dedicated Chromium lane for browser-native AI workflows where logged-in sessions matter more than clean API abstractions. That includes things like:

Perplexity research from a logged-in Pro account
Gemini browser workflows
Claude web workflows
ChatGPT web workflows
any task where cookies, subscriptions, uploads, or persistent session state are the real dependency

The pattern is covered in browser-llm-stack.md:

keep a persistent Chromium profile for each provider or workflow
run it in a controlled environment
serialize fragile browser tasks with a lock when needed
treat browser automation as a first-class lane, not an embarrassing fallback

This matters because some of the highest-value agent workflows still live behind browser auth, subscription entitlements, or UI-only features. If your stack ignores that reality, you end up designing around an imaginary clean-room API world.

🦞 Per-guide format lives in ../automation/cron-patterns.md.

Recipes in this chapter

Browser LLM Stack

Browser-native model work without pretending every useful workflow has a clean API.

Claude Code via tmux Relay

How to let OpenClaw drive Claude Code through an interactive tmux session for second-opinion review, without using claude -p or treating Claude as a raw backend.

Updated 2026-06-05

Compaction & Context Tuning

How to configure OpenClaw's compaction, memory flush, context pruning, and session search so your agent doesn't lose its mind (or personality) during long sessions.

Updated 2026-06-05

Local LLM Fallback

Use local models for boring, bounded work so your paid models stay available for judgment.

Updated 2026-05-11

Migrating from claude-cli to ACP

How to move Claude Opus off the main-agent slot and onto an ACP escalation path after Anthropic blocked subscription OAuth from third-party harnesses in April 2026.

Updated 2026-06-05

Multi-Model Orchestration

How to run multiple AI models in one OpenClaw setup, assign each to the right task tier, and stop burning expensive tokens on work that doesn't need them.

Updated 2026-06-05

Orchestrating with GPT 5.5: Narration Guards and Strict-Agentic Gaps

Running GPT 5.5 (via OpenAI Codex Pro) as your main orchestrator is cheap and fast compared to frontier API billing, but it has specific failure modes that will quietly eat hours of your time until…

Updated 2026-04-20

Prompt Caching: Maximize Cache Hits, Minimize Token Costs

How OpenClaw's prompt caching works across providers, how to keep your cache hit rate high, and the anti-patterns that silently cost you money every turn.

Updated 2026-06-05

Running Claude Code in OpenClaw via ACP

In April 2026, Anthropic blocked subscription OAuth from third-party harnesses. The anthropic:claude-cli backend that most OpenClaw users had plugged a Max subscription into stopped working…

Updated 2026-06-05

Self-Improving Agents

How to build an AI agent that learns from corrections, captures mistakes as institutional knowledge, runs automated memory sweeps, and gets better over time instead of repeating the same errors.

Updated 2026-04-19

Session Management: Why Your Chat App Is Holding You Back

How to manage OpenClaw sessions effectively using Discord (or similar channel-based platforms) instead of single-thread messaging apps. The difference between productive multi-project orchestration…

Updated 2026-04-19

Skills Development

How to write custom OpenClaw skills, structure them for discoverability, and extend your agent's capabilities with reusable task-specific instructions.

Updated 2026-04-19

Sub-Agent Patterns: Orchestration, Spawning, and Gotchas

How to use OpenClaw sub-agents effectively. Spawn patterns, model assignment, error handling, and the lessons we learned from breaking things.

Updated 2026-06-05