Why I Chose 'Agents Record, Optimizer Thinks'

When my typescript-implementer’s memory file grew to 95KB — the full crisis story — the real question wasn’t how to trim it. It was why the architecture allowed it.

Agents record observations to shared logs; a separate optimizer distills knowledge

The problem wasn’t storage. 95KB is nothing. The problem was that there was no curation. A file that records everything records nothing useful — it’s just a log, and logs are not knowledge.

The Insight

I was thinking about this problem when I noticed the parallel to something I knew from engineering:

Monitoring and alerting are not the same thing. You collect everything. You only alert on what matters.

Logging and analysis are not the same thing. You log events. You analyze them separately to find patterns.

Event sourcing and read models are not the same thing. You store every event. You build projected views optimized for reading.

In every one of these patterns, there’s a producer and a consumer, and they’re separated precisely because the requirements of producing and consuming are different. Production wants to be cheap and complete. Consumption wants to be fast and relevant.

My agents had no such separation. They were producing and consuming from the same undifferentiated file.

The New Architecture

I rebuilt around one principle: Agents record. A separate optimizer thinks.

The operational logs. Six topic-based logs that all 18 agents write to:

build-systems.log — build tool behavior, dependency issues, compilation patterns
git-operations.log — git workflow patterns, merge strategies, branch conventions
infrastructure.log — deployment patterns, Docker behavior, environment issues
planning.log — decomposition patterns, estimation learnings, planning outcomes
code-quality.log — refactoring patterns, review findings, quality observations
meta.log — agent coordination patterns, tooling behavior, system observations

Agents write to these logs append-only. No reading. No curation. Just recording.

The optimizer agent. A separate agent — not part of any workflow team — that runs periodically. It reads the operational logs, identifies patterns that appear 3 or more times, and translates those patterns into concrete agent instructions.

The threshold matters. One occurrence might be a coincidence. Two occurrences might be a coincidence. Three occurrences is a pattern worth encoding.

The agent instructions. These are the only things agents read about historical learnings. Not raw logs — curated instructions. The optimizer produces lines like: “When running Docker builds, always verify the Node version in the Dockerfile matches the local version; mismatches cause ES module resolution failures.” That’s the distilled output of several build failure logs, reduced to an actionable rule.

Log archival. Logs are archived monthly. Old operational data stops accumulating. The agent instructions carry forward whatever was worth carrying.

The Parallels Are Not Accidental

I want to be explicit about why I think this architecture is right beyond just “it works better.”

The pattern of separating observation from insight shows up in every mature engineering practice for a reason. Mixing them creates systems that are brittle in a specific way: they become harder to use as they accumulate more information. That’s backwards. Systems should get more useful as they accumulate information, not less.

The mistake I made initially was thinking that more information in the agent’s context = more capable agent. That’s only true if the information is relevant. Irrelevant information in context doesn’t just fail to help — it actively degrades performance by diluting the signal. (The three-tier memory system I built earlier got this right at the project level — the failure was applying a different, worse model at the agent level.)

Curation is the work. Observation is cheap.

Microservices separate concerns not because distribution is inherently good, but because some concerns scale differently. Build process accumulates failures at one rate; observation processing accumulates insights at a different rate. Separating them lets each grow independently.

Read models in CQRS are not the source of truth; they’re optimized projections. Agent instructions are not the raw history; they’re optimized projections of what matters from that history.

What Changed

The practical effects:

Agents are lighter. Instructions are concise by design — they’re curated, not accumulated. Context consumption at session start dropped significantly.

Knowledge is curated. Only patterns that appear repeatedly make it into agent instructions. One-off learnings that turn out not to generalize quietly disappear at archival time.

Knowledge is shared. When the Kotlin agent and the TypeScript agent both independently log similar Docker build failures, the optimizer sees the pattern across both logs and encodes it as a general rule for all agents. In the original architecture, this insight would be siloed.

The system is maintainable. I can read the agent instructions and understand them. I can see what the optimizer has distilled from the logs. I can intervene when a pattern has been encoded incorrectly. None of this was true with the 95KB MEMORY.md files.

The three-occurrence rule: a pattern seen 3+ times becomes a curated instruction

The Principle

Separation of concerns isn’t just for code. It’s for knowledge systems too.

If you’re building multi-agent systems with persistent memory, the question isn’t “where do agents store what they learn?” The question is “who is responsible for curation?” If the answer is “each agent curates its own memory,” you’ll end up where I ended up: 18 MEMORY.md files growing without bound, indistinguishable from noise.

Give observation to the agents. Give curation to a dedicated optimizer. Let each do what it’s good at.

The Insight

The New Architecture

The Parallels Are Not Accidental

What Changed

The Principle

What the models think