Planning a Database Migration with AI: 11 Weeks, 15 Questions

There’s a pattern I see constantly in how people use AI for engineering work: they jump straight to code. Problem identified, prompt written, implementation generated. Fast, satisfying, often wrong. (The agent pipeline that would eventually execute these plans is described in My First Autonomous Ticket.)

Three-phase migration plan: 11 weeks, zero breaking changes

The migration I’m writing about took the opposite approach. We spent a full planning session in Q&A mode before a single line of implementation was written. The output was a 3-phase, 11-week plan. What we built afterward was clean precisely because of that upfront investment.

The Problem

A configuration service at my enterprise company stored properties as TEXT[] arrays in PostgreSQL. Each configuration property had an array of string values. Simple, flexible, good enough — until it wasn’t.

When we hit 223 properties and 11,924 values, the model started showing its limits. Querying across properties was awkward. Filtering by value was a full array scan. Managing defaults, overrides, and platform-specific variants with a flat array was increasingly painful. Every new feature required bending the data model in ways it wasn’t designed for.

The right fix was relational normalization. The question was how to do it without breaking everything.

The Q&A Method

I didn’t start with “write me a migration.” I started with: “I need to normalize this schema. Before we plan, ask me every architectural question you need answered to design this correctly.”

The AI asked 15 questions. I answered from domain knowledge.

Some questions were obvious:

How are properties currently identified — by name or by ID?
Are there platform-specific property values, or are values global?
What does the rollback story look like if Phase 1 causes problems?

Some questions surfaced constraints I hadn’t explicitly thought about:

Do downstream consumers depend on the current array format in any API contracts?
Are there properties that inherit from each other, or is every property independent?
What’s the plan for default values — are they hardcoded in application logic, or stored in the database?

Each answer fed into the design. By question 15, we had the constraints documented and could design a model that satisfied all of them.

The Design That Emerged

Three decisions shaped everything:

Name-based resolution. Properties are identified by name, not by generated ID. This matters for migrations: if you renormalize the data, consumers don’t need to update their lookup keys. It also makes the system more resilient to schema changes.

3-state inheritance chain. A property value can exist at three levels: global default, platform-specific override, and property-specific value. Resolution walks the chain. This replaced a tangle of conditional logic scattered through the application.

Backward compatibility as a hard constraint. The v1 API couldn’t change during migration. Any new relational model had to serve the same API responses as the old one until v2 was stable and clients had migrated.

The 3-Phase Plan

Phase 1 — Database normalization only. New relational tables, migration script to move existing data, both old and new tables live simultaneously. No API changes. Application still reads from old structure. Duration: 3 weeks.

Phase 2 — API v2 alongside v1. New endpoints using the relational model. v1 endpoints remain functional, reading from a compatibility view over the new tables. Consumers migrate at their own pace. Duration: 5 weeks.

Phase 3 — Deprecate and clean. v1 deprecated, then removed. Old schema dropped. Duration: 3 weeks.

Eleven weeks total, zero breaking changes for consumers.

The Implementation

The migration itself: 223 properties, 11,924 values, moved from flat arrays to properly keyed relational rows.

Then 134 compilation errors. The migration touched every layer — repository, service, controller, DTO. Each layer had assumptions baked in about the data structure. Most errors were mechanical: wrong return types, missing method signatures, calls to methods that no longer existed in the new model. The agent fixed them systematically, working through the error list.

One Liquibase gotcha worth documenting: splitStatements=false is required for complex migration scripts that contain semicolons within SQL strings. Liquibase’s default behavior splits on semicolons, which breaks migrations that use stored procedure syntax or inline string values containing semicolons. This burned us once. Now it’s the first thing I check.

The Q&A method: AI asks architectural questions, human provides domain knowledge

Why This Approach Works

The best planning sessions I’ve had with AI feel like pair programming with an architect who has infinite patience and total recall. You describe a constraint, it remembers it for the rest of the conversation. You contradict yourself, it notices. You overlook something, it asks.

What the AI can’t do is decide. It doesn’t know which tradeoffs matter more in your specific context. It doesn’t know your team’s tolerance for breaking changes, your timeline pressures, or which technical debt is acceptable. That’s domain knowledge, and it lives in you.

Q&A mode forces you to surface that knowledge explicitly. The questions the agent asks aren’t always the questions you’d ask yourself — and those surprises are where the useful design decisions come from.

Resist the urge to jump to code. The ratio of planning to implementation on this project was roughly 1:8. The planning session probably saved three times that in rework. (For what a fully hardened, production-ready implementation looks like after the agents are done — Production Hardening covers the part nobody talks about.)