Approve every action
The agent proposes; a human says yes before anything consequential happens. The tightest leash, and the right one when a single mistake is both unrecoverable and serious and you act rarely enough to afford the review. Slow by design — that's the point.
Choose this when
- A mistake can't be undone and the damage is serious
- It acts rarely enough that a human can review each consequential step
- You're in a regulated or high-liability domain and volume is low
Trade-offs
- Slow and human-bound — it won't scale past what a person can review
- Tempting to rubber-stamp once it's usually right; the review has to stay real
- Overkill for reversible or low-stakes work — don't gate what you can undo
Run it, watch it, intervene
The agent runs on its own; a human monitors and steps in when something looks wrong, instead of approving each step. The right call when a mistake is serious but recoverable — because you can undo it, watching beats gating. You move from approving the process to owning the outcome.
Choose this when
- Mistakes are serious but reversible — you can step in and roll back
- Volume is too high to approve each action, but you can supervise
- You want speed without flying fully blind
Trade-offs
- Only works if you can actually see what it's doing — instrument it first
- "Monitoring" quietly decays into not looking; flag uncertainty for real
- You need a fast, reliable way to pause or roll back when you spot trouble
Autonomous inside hard limits
The agent acts on its own, but only within limits you set in advance — spend caps, allowlists, scoped permissions — and anything big, unusual, or out-of-policy stops and escalates to a human. The workhorse setting: fast and independent on the routine, walled off from the catastrophic. Bound the blast radius, then let it run.
Choose this when
- High volume, where approving each action is impossible
- An irreversible action whose worst case you can cap to something survivable
- A regulated domain that needs a floor of control, audit, and human override
Trade-offs
- Only as safe as the limits — set them too loose and it's just full autonomy
- You have to define "out of policy" well enough that the right things escalate
- Needs an escalation path a human actually watches, or the escalations pile up unseen
Let it run
The agent acts freely, no per-action approval, no constant watching. Reserved for work where a mistake is both reversible and small — cheap to be wrong, easy to undo. The reward for genuinely low stakes is genuine speed. Don't confuse it with letting an agent loose on anything that matters.
Choose this when
- A mistake is both reversible and minor
- Volume is high and per-action oversight would just be friction
- Nothing here is regulated or creates liability
Trade-offs
- Wrong for anything irreversible, serious, or regulated — re-check those answers
- Still needs logging so you can see what it did after the fact
- Stakes creep: a low-stakes task can grow into a high-stakes one — revisit it