A conversation with the Action Plan team
How do I manage a fleet of AI agents?
One assistant was great. Now I've got five agents building different parts of our app at the same time, and I honestly can't tell who's doing what, what it's costing, or what's safe to merge.
Manage a fleet the way you'd manage a team. Stable roles instead of anonymous workers, isolated workspaces instead of shared state, quality gates before anything reaches review, one queue for everything that needs you, and an audit trail for everything that doesn't. Orchestration is a management problem, not a model problem.
I've seen what fleet chaos looks like from the inside. It's always the same four failures:
- Anonymous workers. "Agent 3 did something to the checkout flow" is not a sentence you can act on.
- Shared state. Two agents editing the same files produce a third thing nobody wrote.
- Review by firehose. Updates scattered across five windows — so really, no review at all.
- No spend attribution. The fleet burned $60 overnight; which agent, on what, was it worth it?
The management rules that fix it, in any system:
- Roles, not instances. A named specialist with a lane is accountable; "worker-7" is not.
- Isolate execution. Each agent gets its own workspace; work merges through gates, never directly.
- Scope tasks tightly. Parallelism works when tasks are genuinely independent.
- One review queue. Every decision that needs a human lands in a single, ordered line.
- Budget and log per agent. Attribution turns "the fleet is expensive" into "this task is expensive."
This is the shape Action Plan is built around. I decompose the goal, assign named specialists, isolate their work, and sequence the dependencies. You watch one operations view, not five windows:
- DONE Maya — competitor teardown & feature shortlist (cited)
- DONE Priya — order status flow · gates passed · in your review queue
- RUN Diego — customer notification screens (isolated workspace)
- RUN Minh — edge-case test pass on checkout
- WAIT Elena — launch checklist (blocked on your review above)
Every line above carries its own cost, evidence, and approval lineage — so "what did the fleet do last night" has an exact, auditable answer.
One thing people get backwards: the limit on fleet size isn't compute, it's your review bandwidth. A good fleet maximizes gated, evidence-backed output per minute of your attention — not agents spawned. That's why the gates and the queue matter more than the headcount.
The fleet-management rules
- Roles, not instances — named specialists with lanes, not anonymous workers.
- Isolate execution — own workspace per agent; merge through quality gates.
- Scope tasks tightly — parallelize only genuinely independent work.
- One review queue — every human decision in a single, ordered line.
- Budget and log per agent — spend and actions attributed, always auditable.
How many agents in parallel?
As many as have independent work — and no more than your review capacity can absorb. The bottleneck in a well-run fleet is human review, so optimize for gated output per minute of your attention.
How do parallel agents avoid conflicts?
Isolated workspaces and tight scopes. Conflicts come from shared state and vague tasks, not from parallelism itself.
Action Plan is in private preview. When your invite opens, onboarding starts from exactly this conversation.
Got it — you're on the early-access list, and I've saved your goal: . When your invite opens, this is the first thing we'll work on together. Watch your inbox.