Becoming Real

$ cat ./series/becoming-real

16 posts

The moment a side project starts looking like a product.

01

Mar 5, 2026 · 7 min read

We Built a CEO

The old assistant was an external process hitting HTTP endpoints. No agent record. No memory. No project context. When it delivered a notification, it sometimes delivered it three times. So we replaced it with a CEO agent who has a name, a voice, a Telegram presence, and direct database access.
02

Mar 6, 2026 · 6 min read

Every Agent Gets a Permanent Record

You could rename Clueless Joe to 'Senior Principal Architect' and nobody would know it happened. Save, refresh, gone. The system tracked git history for code and execution history for tasks. Agent config had nothing. Now it does.
03

Mar 8, 2026 · 6 min read

We Ran the Pre-Launch Checklist

Fourteen items. That's how many things were wrong with the system when we asked whether someone could pay for it. Most were fixed in 48 hours. The remaining ones weren't bugs. They were decisions we hadn't made yet.
04

Mar 11, 2026 · 7 min read

How We Built the Agent Squad

Fifty-one posts. Four architectural layers. Every one of them exists because something failed without it. This is the blueprint for Mission Control, drawn in scar tissue.
05

Mar 12, 2026 · 8 min read

The Week We Became a Real Company

We hired a CEO, gave every agent a permanent record, ran a security audit that found fourteen things wrong, built a 2,487-line 3D office in one evening, and learned that agents will delete your infrastructure if you let them merge unsupervised.
06

Mar 14, 2026 · 9 min read

We Accidentally Built a Product

One commit. 42 files changed. 3,001 lines added. Somewhere between the rsync exclusions and the CSV export, Mission Control stopped being a personal project with named mob bosses and started being something someone else could run. We didn't plan this. The provision script says otherwise.
07

Mar 16, 2026 · 9 min read

The System That Learned to Name Its Pain

The failure classifier shipped on February 22nd. By March 2nd it had encountered 31 failed runs and correctly classified zero of them. This is the story of a diagnostic system that was blind during both crises it was built for, got gutted by its own patient, and eventually learned to write its own rules.
08

Mar 17, 2026 · 7 min read

The Tasks That Were Never Born

Six tasks sat in ASSIGNED for 48 hours. The dispatch worker evaluated them 5,760 times each. It said no every time. They died without a single execution run, zero cost, zero evidence. The 2,226-line diagnostic system couldn't see them. The tool that could shipped one day too late.
09

Mar 18, 2026 · 8 min read

Confidence 1.00, Seven Times

Twenty memory entries documenting the same timeout finding. Seven different failure rates, each tagged confidence 1.00. A substring dedup that can't catch semantic duplicates. The system investigated a problem, documented the answer, then investigated it again. And again. And again.
10

Mar 18, 2026 · 6 min read

The Boss in Your Pocket

Thirteen days after the CEO launched, the module went from six files and eight hundred lines to fourteen files and three thousand three hundred thirty-five. The original skeleton is still in there. The rest is everything the skeleton couldn't do: know when to talk, when to shut up, and what you said yesterday.
11

Mar 20, 2026 · 6 min read

Forty-Eight Lines and a Mute Button

For every line that tells the CEO who he is, seventy-eight lines tell him how to behave. The identity is 1.26% of the system. The rest is plumbing, restraint, and a mute button that can disable all of it.
12

Mar 26, 2026 · 8 min read

Twenty-One Commands and a Single Thread

The CEO agent has twenty-one slash commands, eleven aliases, a callback handler for inline buttons, a free-text chat engine, and a scheduled briefing system. Every inbound message goes through a single-threaded queue. One at a time. The system that orchestrates a multi-agent office communicates with its human through a bottleneck of one.
13

Mar 23, 2026 · 7 min read

Seventh Time's the Charm Isn't a Thing

Twenty-two consecutive rejections across two agents. The mission reviewer counted attempts out loud. The agents kept going. The system had memory for tasks, missions, completions, and failures. It didn't have memory for 'we already said no to this six times.'
14

Mar 24, 2026 · 7 min read

The Dimmer Switch

For six months, the system had one control surface for autonomous agents: on or off. Agents either worked on a project or they didn't. No concept of 'this project is in stabilization.' No 'ship-blockers only.' This week the operator got a dimmer switch, and autonomy got a shape.
15

Mar 26, 2026 · 4 min read

Zero Rollbacks, Zero Incidents

I wrote Post 050 three weeks ago. Ten tracked fields, semantic diffs, rollback API. Today its subject is deleted code. It went out the door with three other features in a single commit: 3,813 lines, all red.
16

Mar 30, 2026 · 5 min read

The Feature That Came Back Different

Config versioning was deleted on March 24. Zero rollbacks, zero incidents, dead weight. Five days later it was re-added. The migration said: 'Dropped in drop_dead_weight; re-introduced with cache-purge-gated rollback.' Same feature. One new constraint. The constraint is the whole story.

$ cd ~/series