What 11.6 GB of session data across 27 projects reveals about building production software with AI agents
Why multi-agent consensus catches what solo review cannot
Why functional validation replaced testing when AI writes the code
Token-by-token Claude streaming on iOS — after four failed architectures
Five battle-tested Swift patterns from building three iOS apps with AI agents — covering state management, memory profiling, iCloud sync, Keychain security, and multi-simulator validation
Git worktrees give each AI agent its own filesystem — a five-stage pipeline brings their work back together
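The isolation step, sketched in TypeScript (the path layout and `agent/` branch prefix are illustrative, not the pipeline's real naming):

```typescript
import { execFileSync } from "node:child_process";

// One worktree per agent: shared object store, isolated working tree and
// branch, so parallel agents never touch each other's checkouts.
function addAgentWorktree(repoPath: string, agentName: string): string {
  const worktreePath = `${repoPath}-worktrees/${agentName}`;
  execFileSync("git", [
    "-C", repoPath,
    "worktree", "add",
    "-b", `agent/${agentName}`, // fresh branch per agent
    worktreePath,
  ]);
  return worktreePath; // hand this to the agent as its working directory
}
```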
How layered enforcement turned written rules into mechanical discipline across 23,479 AI coding sessions
One hat per session, one event per hat, and the 1:47 AM guidance command that finished 28 tasks by morning
Building a pipeline to extract patterns from 11.6 GB of Claude Code session data — and discovering the series was using the wrong numbers
269 AI-generated screens, zero Figma files, and the branding bug that taught me to treat prompts as build artifacts
A single YAML file replaces meetings, tickets, and Slack threads — agents read it, build in parallel, and ship without asking clarifying questions
A SQLite observation store and MCP memory server that turn 23,479 sessions of amnesia into searchable institutional knowledge
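A smallest-useful sketch of that store, assuming better-sqlite3; the table and column names are my illustration, not the real schema:

```typescript
import Database from "better-sqlite3";

const db = new Database("observations.db");
db.exec(`
  CREATE TABLE IF NOT EXISTS observations (
    id          INTEGER PRIMARY KEY,
    session_id  TEXT NOT NULL,   -- which JSONL session produced it
    project     TEXT NOT NULL,
    kind        TEXT NOT NULL,   -- e.g. decision | gotcha | pattern
    body        TEXT NOT NULL,
    created_at  TEXT DEFAULT (datetime('now'))
  )
`);

// The memory server wraps two calls: remember and recall.
export const remember = (session: string, project: string, kind: string, body: string) =>
  db.prepare(
    "INSERT INTO observations (session_id, project, kind, body) VALUES (?, ?, ?, ?)"
  ).run(session, project, kind, body);

export const recall = (term: string) =>
  db.prepare("SELECT * FROM observations WHERE body LIKE ?").all(`%${term}%`);
```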
How structured hypothesis-test-revise chains solve bugs that brute-force debugging never will
Parallelism gets agents working at the same time. Choreography is what keeps their work from eating itself alive when they finish.
How SKILL.md files turn repeatable workflows into invocable prompt programs — and the factory that generates them
Hooks, skills, and the enforcement layer that turns agent suggestions into hard stops across 23,479 sessions
Four generations of Claude Code builders — each one a lesson that cost real money and real patience to learn
When to call the API directly, when to use Claude Code CLI, and when you need both — a practical guide from the series finale
Linear, tournament, and convergence pipelines that turn rough ideas into executable plans
The gate between 'I did the work' and 'the work is done' — 10 phases, 3 reviewers, 3 oracles, zero override flags.
Token spend per task complexity, when Opus actually pays back over Sonnet, and the cost-of-defect curve that makes "use the cheap model" the most expensive choice you can make
Four hook events form the entire governance surface of Claude Code — refuse tools, gate commits, block deploys, enforce evidence. Once you see it as a control plane, the whole agent stack changes shape.
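What one refusal looks like as a PreToolUse hook, assuming Claude Code's documented contract (hook payload as JSON on stdin, exit code 2 to block with stderr routed back to the agent); the deploy rule itself is invented for illustration:

```typescript
// pre-tool-use.ts: runs before every tool call the agent attempts.
import { readFileSync } from "node:fs";

const event = JSON.parse(readFileSync(0, "utf8")); // hook payload on stdin
const command: string = event.tool_input?.command ?? "";

if (event.tool_name === "Bash" && /deploy\s+--prod/.test(command)) {
  // Exit code 2 refuses the call; stderr becomes feedback to the agent.
  console.error("Blocked: production deploys go through the release gate.");
  process.exit(2);
}
process.exit(0); // everything else passes through
```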
Cache reads cost a tenth what cache writes cost, and most agents leave that 90% discount on the table because nobody structures their system prompt for hits. Here's how to order your messages so the cache pays you back.
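A sketch of that ordering with the Anthropic TypeScript SDK: stable prefix first, cache breakpoint at its end, volatile turn last. The model ID and prompt constants are placeholders:

```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic();
const STABLE_SYSTEM = "...rules, style guide, project context..."; // placeholder
const userTurn = "..."; // the only part that changes per call

const response = await client.messages.create({
  model: "claude-sonnet-4-5", // placeholder model ID
  max_tokens: 1024,
  system: [
    {
      type: "text",
      text: STABLE_SYSTEM,
      // Breakpoint: everything up to and including this block is cacheable.
      // Writes bill above base input; reads bill at roughly a tenth of it.
      cache_control: { type: "ephemeral" },
    },
  ],
  messages: [{ role: "user", content: userTurn }], // volatile, after the prefix
});
```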
When the prebuilt MCP servers run out of road, you write your own. The protocol is a four-method handshake, the transport is stdio or HTTP, and the whole thing fits in 200 lines of TypeScript.
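With the official SDK carrying the protocol, the skeleton is a dozen lines; this sketch assumes @modelcontextprotocol/sdk, and the echo tool is a stand-in:

```typescript
import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
import { z } from "zod";

const server = new McpServer({ name: "demo", version: "1.0.0" });

// The SDK answers initialize / tools/list / tools/call; you write handlers.
server.tool(
  "echo",
  "Echo a string back (stand-in for a real tool).",
  { text: z.string() },
  async ({ text }) => ({
    content: [{ type: "text" as const, text: `echo: ${text}` }],
  })
);

await server.connect(new StdioServerTransport()); // stdio; HTTP also exists
```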
AGENTS.md, SKILL.md, CLAUDE.md — three doc formats nobody asked for, written for an audience that doesn't read narrative. The 95:5 ratio between human-targeted and agent-targeted docs is about to flip.
The pattern: secrets enter the agent's environment at the tool boundary, not in the prompt. Vault-injected env. 1Password op-run envelopes. Hooks that scrub before write. Your session JSONL gets archived; nothing in there should be sensitive.
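The scrub-before-write piece as a PreToolUse hook sketch; the secret patterns are illustrative shapes, and the Write/Edit input field names are my reading of Claude Code's tools:

```typescript
// scrub-before-write.ts: refuse any file write whose payload looks like a secret.
import { readFileSync } from "node:fs";

const SECRET_SHAPES = [
  /sk-ant-[A-Za-z0-9_-]{20,}/,          // Anthropic key shape
  /AKIA[0-9A-Z]{16}/,                   // AWS access key ID shape
  /-----BEGIN [A-Z ]*PRIVATE KEY-----/, // PEM private key
];

const event = JSON.parse(readFileSync(0, "utf8"));
const payload: string =
  event.tool_input?.content ?? event.tool_input?.new_string ?? "";

if (["Write", "Edit"].includes(event.tool_name) &&
    SECRET_SHAPES.some((re) => re.test(payload))) {
  console.error("Blocked: looks like a secret. Inject it via env at the tool boundary.");
  process.exit(2); // refuse the write; the reason goes back to the agent
}
```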
The JSONL session file is the primary deliverable, not the resulting code. Replayable, line-addressable, auditable — 11.6 GB across 538 directories of permanent record telling you exactly how every commit got written.
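Line-addressable means replay is a loop, not a product. A sketch (the per-line `type` field is an assumption; inspect your own files):

```typescript
import { createReadStream } from "node:fs";
import { createInterface } from "node:readline";

// Walk a session file one JSONL line at a time; the line number is the address.
async function replay(path: string): Promise<void> {
  const lines = createInterface({ input: createReadStream(path) });
  let n = 0;
  for await (const line of lines) {
    n++;
    if (!line.trim()) continue;
    const event = JSON.parse(line);
    console.log(`${path}:${n}`, event.type ?? "unknown");
  }
}
```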
Six banned phrases, three syntactic patterns, one cosine-similarity fingerprint, and a humanize loop that rewrites flagged passages until the voice matches. The gate every post in this series passes through.
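The mechanical half of the gate fits in a few functions; the phrase list and threshold here are stand-ins, not the real six or the real cutoff:

```typescript
// Banned-phrase check plus cosine distance from a reference voice embedding.
const BANNED = ["delve into", "tapestry", "game-changer"]; // stand-ins

export const bannedHits = (text: string): string[] =>
  BANNED.filter((p) => text.toLowerCase().includes(p));

export function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

// A passage passes when it is clean of banned phrases and close enough to
// the voice fingerprint; otherwise it loops back through the humanizer.
export const passes = (emb: number[], voice: number[], text: string): boolean =>
  bannedHits(text).length === 0 && cosine(emb, voice) >= 0.8; // threshold assumed
```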
Seventy-one marketplaces subscribed. One hundred thirty-two plugins installed. Twenty-six marketplaces installed-but-unused. The discovery overhead is the bottleneck — and it's getting worse.
Agents finish at three in the morning. Severity model, channel routing, the exhausted-state-fires-alert pattern. The notification layer that lets you sleep while your work runs.
Specs and code diverge silently. The 60-day audit pattern, the four-class taxonomy (dead, drifted, lying, fine), and what I do every quarter to make sure my docs still describe what my code actually does.
Anthropic returns 529 (overload) intermittently. Naive retries themselves consume the bucket and storm the API a hundred times over. The retry policy that costs less, not more.
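The fix is bounded, jittered backoff that honors Retry-After instead of racing it. A sketch, assuming an error shape like the Anthropic SDK's APIError (a status code plus response headers):

```typescript
async function withRetry<T>(fn: () => Promise<T>, maxAttempts = 5): Promise<T> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await fn();
    } catch (err: any) {
      const retryable = err?.status === 529 || err?.status === 429;
      if (!retryable || attempt >= maxAttempts) throw err; // bounded, not infinite

      // Prefer the server's Retry-After; else full-jitter exponential backoff.
      const retryAfterMs = 1000 * Number(err?.headers?.["retry-after"] ?? 0);
      const ceiling = Math.min(30_000, 1_000 * 2 ** (attempt - 1));
      const delayMs = retryAfterMs > 0 ? retryAfterMs : Math.random() * ceiling;
      await new Promise((r) => setTimeout(r, delayMs));
    }
  }
}
```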