Engineering workflow diagram with checkpoints, event history, approval gate, and pause-resume arrows

Durable Execution Is the Difference Between a Demo and a System

The live demo repo for this series is 67ailab/harness-engineering, and for this post I did change the repo before publishing. The new capability shipped in commit 9612b58, which adds persisted run summaries plus replay-oriented history inspection to the existing approval-gated harness. The key changes are in src/harness_engineering/store.py and src/harness_engineering/cli.py. That addition matters because durable execution is where most agent demos quietly stop being honest. It is easy to show a model calling tools in one uninterrupted run. It is much harder to explain what happens when execution pauses for approval, the process dies, the machine reboots, the reviewer returns malformed output, or an operator needs to understand what state the run is actually in. ...

May 4, 2026 · 67 AI Lab
A futuristic diagram of an autonomous SRE agent architecture, showing a central brain connected to various monitoring tools and servers, glowing blue and green lines, high tech style

Architecting Autonomous, Long-Running, Scalable SRE Agents

It is relatively easy to build an SRE agent that can solve a single, well-defined problem in a demo environment. You give it a prompt, access to a few tools, and watch it restart a pod or query a log file. It feels like magic. But taking that agent and asking it to run 24/7, monitor thousands of services, handle concurrent incidents, and never hallucinate a destructive command is a different engineering challenge entirely. It moves us from the realm of “AI scripting” to distributed systems architecture. ...

February 22, 2026 · 67 AI Lab