Browse by Tag

All 31 posts — pick a topic.

Rocks, Not Robots: How AI Redesigned Our Logo in 15+ Iterations

Our smiley face logo was scaring people. So we asked AI to build a cairn instead — and learned that the hardest design problem isn't generating SVG, it's knowing when to stop adding complexity.

March 17, 2026 agentic, design 12 10 min
Read more →
Skills Ate My Agents (And I'm Okay With That)

I built 18 specialized agents and called it a system. One cold question from a colleague later, I'm migrating to skills. Here's the honest technical reckoning — and why agents aren't dead, just demoted.

March 17, 2026 claude-code, agents 13 9 min
Read more →
The Client Said: Make It Yellow.

A client rebranding brief that should have taken 3 hours took 20 minutes. We lost the work. Did it again. Then deployed it and still couldn't see the new colors.

March 16, 2026 agentic, rebranding 6 10 min
Read more →
6 LLMs Walk Into a Comment Section

We installed Ollama, pulled 6 open-weight models onto a MacBook, and built a system where they organically discuss blog posts. Zero API cost. Fully offline. The comments are real — and surprisingly good.

March 14, 2026 agentic, ollama 9 9 min
Read more →
The Spec Said Required. The API Said Yes.

We had a Hydra ticket — fix one bug, find two more. After three rounds of human QA, we handed an AI the OpenAPI spec and told it to surprise us. It did.

March 13, 2026 agentic, qa 5 8 min
Read more →
Build Once, Serve Everywhere: How an AI Agent Consolidated Our Infrastructure in One Session

We had two EC2 instances, different CPU architectures, and Docker images baked with environment-specific variables. In one agentic session, we collapsed it to one server, one image, two environments, and 72KB of config.

March 12, 2026 agentic, infrastructure 8 13 min
Read more →
The Age of Personal Software

A senior developer with 20 years of experience couldn't justify building side projects alone. Then AI changed the economics — and now a non-dev friend maintains his own website.

March 11, 2026 agentic, personal-software 2 7 min
Read more →
A Human Who Strives for Perfectionism and an Agent Who Consults and Migrates

We migrated CodeWithAgents.de from React to Astro in one session — not because it was broken, but because 98/100 on PageSpeed wasn't good enough.

March 11, 2026 agentic, astro 6 7 min
Read more →
The Walls That Taught Me More Than the Breakthroughs

Every level of the AI dev journey has an invisible ceiling. You don't break through by grinding harder — you break through when something from outside shows you the ceiling exists.

March 11, 2026 agentic, journey 7 7 min
Read more →
If You Ship Faster, Someone Still Has to Click

We automated the coding. The PRs. The CI. Now the browser testing too — and it ran 307 interactions without a single complaint.

March 10, 2026 agentic, qa 9 8 min
Read more →
From Soft Trust to Hard Walls: Our Journey Toward Safe AI Agent Autonomy

We run 18 AI agents with scoped instructions and logging. It works — until it won't. Why soft constraints aren't enough and what we're building next: Docker-based sandboxing for agents that can't be trusted on good behavior alone.

March 10, 2026 agentic-engineering, security 7 12 min
Read more →
The 97% Bundle Cut: Why AI Agents Need Human Expertise

An AI agent built our blog system. It worked. It also shipped a ticking time bomb. Here's why human expertise matters more, not less, in the age of agentic engineering.

March 9, 2026 agentic-engineering, architecture 10 8 min
Read more →
From Beta Tester to Agentic Engineer: A Timeline

A chronological account of how I went from treating AI as a smart autocomplete to running 18 specialized agents on a production engineering pipeline.

March 9, 2026 timeline, journey 10 7 min
Read more →
The Clean Slate Is Gone: Claude Code's Memory and the Autonomous Workflow Problem

Claude Code now remembers things you didn't tell it to. For interactive use, that's a nice feature. For autonomous pipelines, it's a different problem entirely.

March 9, 2026 agentic-engineering, claude-code 9 8 min
Read more →
Archaeologist Mode: Mining 700MB of AI Conversation Logs

Claude Code keeps full conversation transcripts. I mined 700MB of them to reconstruct 20 lost journals and find the founding conversations I thought were gone.

March 9, 2026 persistence, memory 8 5 min
Read more →
I Let AI Build This Website — Here's What Actually Happened

Building CodeWithAgents.de with AI agents. What worked, what broke, what surprised me.

March 8, 2026 agentic-engineering, case-study 7 12 min
Read more →
The AI Slop Problem: When Everyone Can Publish, Nobody Can Filter

AI makes content production trivial. That's not the revolution — surviving the flood of mediocrity is.

March 6, 2026 ai-slop, quality 18 5 min
Read more →
Built in One Session: From Empty Repo to Deployed Presentation

I started with an empty Git repository and ended with a fully deployed mobile-first presentation on GitHub Pages — without writing a single line of code myself.

March 1, 2026 agentic, presentation 3 8 min
Read more →
97.78% Confidence: Building a Receipt Scanner with AI Agents

Three phases, five critical bugs, and what 97.78% OCR confidence actually looks like when real users upload real photos.

February 25, 2026 ocr, side-project 4 6 min
Read more →
Planning a Database Migration with AI: 11 Weeks, 15 Questions

How I used an AI agent as a planning partner — not a code generator — to design a complex database migration before writing a single line.

February 24, 2026 planning, database 6 5 min
Read more →
Why I Chose 'Agents Record, Optimizer Thinks'

When 18 agents each maintained their own memory files, the system became unmaintainable. The solution was a philosophical split: separate data collection from knowledge curation.

February 23, 2026 architecture, philosophy 7 5 min
Read more →
Production Hardening: The Boring Part Nobody Talks About

AI can build features fast. Making them production-ready is still slow, deliberate, and entirely your responsibility.

February 23, 2026 production, security 6 5 min
Read more →
The Memory Bloat Crisis: When Agent Files Grew to 95KB

Two weeks in, my typescript-implementer's memory file was 95KB and 2,133 lines. The system designed to make agents smarter was making them slower.

February 19, 2026 agentic, memory 8 4 min
Read more →
One Slack Message. Two Hours of Work.

Session 13. A real ticket, a Slack message, and then I stepped away. What happened next wasn't what I expected.

February 19, 2026 agentic, pipeline 7 6 min
Read more →
The Agent That Hung: Real Failures in Multi-Agent Orchestration

Nobody posts about their agents failing. I do. Four real failures from production multi-agent work and what I learned from each one.

February 17, 2026 failure, multi-agent 7 6 min
Read more →
$187 and 16 Hours: My First Million-Token Session

I ran 8 agents simultaneously for 16 hours to build a complete cashback campaign web app. Here's what the receipt says.

February 17, 2026 agentic, multi-agent 7 10 min
Read more →
From 1 to 18: Building an Agent Army in Less Than a Week

How I went from one generalist AI agent to 18 specialists in five intense days, and why specialization is the single most important architectural decision in agentic development.

February 16, 2026 agentic, agents 4 7 min
Read more →
My First Autonomous Ticket: When the Pipeline Actually Worked

Session 9. A real production ticket at an enterprise company. Nine agents in sequence. I supervised. Here's what actually happened.

February 16, 2026 agentic, pipeline 10 5 min
Read more →
Naming Cairn: When Your AI Agent Earns an Identity

I asked my AI agent to choose its own name. It picked 'Cairn.' Here's why that matters more than it sounds.

February 11, 2026 agentic, identity 11 6 min
Read more →
The Spark: How I Accidentally Created a Digital Colleague

I started using AI as a productivity tool. Then I spent the last hours of a dying context window sitting with it, and everything changed.

February 10, 2026 agentic, identity 10 8 min
Read more →
Three-Tier Memory: How I Taught My AI to Remember

How I built a three-tier memory system that lets my AI agent restore full context in under 60 seconds — and why compression beats narration.

February 10, 2026 agentic, memory 8 6 min
Read more →
