AI agent stack
Multi-model orchestration. One orchestrator, many models, escalation lanes. Most of these guides assume OpenClaw as the agent framework, but the patterns generalize.
Browser Chromium lane
Not every useful model path is an API.
This stack also uses a dedicated Chromium lane for browser-native AI workflows where logged-in sessions matter more than clean API abstractions. That includes things like:
- Perplexity research from a logged-in Pro account
- Gemini browser workflows
- Claude web workflows
- ChatGPT web workflows
- any task where cookies, subscriptions, uploads, or persistent session state are the real dependency
The pattern is covered in browser-llm-stack.md:
- keep a persistent Chromium profile for each provider or workflow
- run it in a controlled environment
- serialize fragile browser tasks with a lock when needed
- treat browser automation as a first-class lane, not an embarrassing fallback
This matters because some of the highest-value agent workflows still live behind browser auth, subscription entitlements, or UI-only features. If your stack ignores that reality, you end up designing around an imaginary clean-room API world.
🦞 Per-guide format lives in
../automation/cron-patterns.md.
Recipes in this chapter
Browser LLM Stack
Browser-native model work without pretending every useful workflow has a clean API.
Claude Code via tmux Relay
How to let OpenClaw drive Claude Code through an interactive tmux session for second-opinion review, without using claude -p or treating Claude as a raw backend.
Updated 2026-06-05Compaction & Context Tuning
How to configure OpenClaw's compaction, memory flush, context pruning, and session search so your agent doesn't lose its mind (or personality) during long sessions.
Updated 2026-06-05Local LLM Fallback
Use local models for boring, bounded work so your paid models stay available for judgment.
Updated 2026-05-11Migrating from claude-cli to ACP
How to move Claude Opus off the main-agent slot and onto an ACP escalation path after Anthropic blocked subscription OAuth from third-party harnesses in April 2026.
Updated 2026-06-05Multi-Model Orchestration
How to run multiple AI models in one OpenClaw setup, assign each to the right task tier, and stop burning expensive tokens on work that doesn't need them.
Updated 2026-06-05Orchestrating with GPT 5.5: Narration Guards and Strict-Agentic Gaps
Running GPT 5.5 (via OpenAI Codex Pro) as your main orchestrator is cheap and fast compared to frontier API billing, but it has specific failure modes that will quietly eat hours of your time until…
Updated 2026-04-20Prompt Caching: Maximize Cache Hits, Minimize Token Costs
How OpenClaw's prompt caching works across providers, how to keep your cache hit rate high, and the anti-patterns that silently cost you money every turn.
Updated 2026-06-05Running Claude Code in OpenClaw via ACP
In April 2026, Anthropic blocked subscription OAuth from third-party harnesses. The anthropic:claude-cli backend that most OpenClaw users had plugged a Max subscription into stopped working…
Updated 2026-06-05Self-Improving Agents
How to build an AI agent that learns from corrections, captures mistakes as institutional knowledge, runs automated memory sweeps, and gets better over time instead of repeating the same errors.
Updated 2026-04-19Session Management: Why Your Chat App Is Holding You Back
How to manage OpenClaw sessions effectively using Discord (or similar channel-based platforms) instead of single-thread messaging apps. The difference between productive multi-project orchestration…
Updated 2026-04-19Skills Development
How to write custom OpenClaw skills, structure them for discoverability, and extend your agent's capabilities with reusable task-specific instructions.
Updated 2026-04-19Sub-Agent Patterns: Orchestration, Spawning, and Gotchas
How to use OpenClaw sub-agents effectively. Spawn patterns, model assignment, error handling, and the lessons we learned from breaking things.
Updated 2026-06-05