EN

Notes

Short pieces about the methodology and architecture decisions behind the AI systems I ship — specs, evals, multi-agent orchestration, LLM integration, and the discipline of directing coding agents.

June 5, 2026
"Agent OS" is a buzzword — here's the boring checklist underneath
This month everyone shipped an 'Agent Operating System' — Fiserv, Experian, Microsoft, a dozen startups. The word 'OS' sounds like serious infrastructure, and sometimes it is. Often it's a wrapper with a grand name. The good news: there's a short, unglamorous checklist that tells the two apart — and it's the same stuff I keep writing about. Judge an Agent OS by what's inside the box, not the label on it.
- architecture
- agents
- business
June 5, 2026
Agents are arriving where a mistake is a lawsuit
This week Experian shipped an 'Agent OS' for lending — agents that decide credit, flag fraud, determine who's eligible. These are the rooms where a hallucination isn't an awkward chatbot reply; it's a denied loan, a wrong medical authorization, a court date. And one number sets the stakes: AI healthcare denials are overturned 80%+ of the time on appeal — but fewer than 1% of people appeal. Here's why regulated domains are where the whole agent argument becomes law.
- architecture
- business
- agents
June 5, 2026
Microsoft sent 100 agents to hunt bugs — AI vs AI security, honestly
This week Microsoft showed a security team made of AI: a pipeline of 100+ agents that found 16 new Windows vulnerabilities, four of them critical, plus the first AI to auto-convict malware. The defenders now run autonomous AI. So do the attackers — one ran 80–90% of a real intrusion on its own. 'AI vs AI security' stopped being a slogan this spring. Here's the honest read: it's real progress, and a faster stalemate.
- security
- agents
June 5, 2026
An agent in every laptop — and the end of the token bill
Everyone spent the spring panicking about the token bill. This week NVIDIA showed a structural answer arriving this fall: the agent moves onto your laptop. RTX Spark runs a 120-billion-parameter model with a million-token context locally — no per-token meter, your data never leaves the machine, and it's faster for the snappy stuff. It won't replace the frontier. But it quietly answers three of the year's biggest headaches at once.
- ai-native
- business
June 5, 2026
The best security AI is now gatekept — plan like you're not on the list
This spring AI crossed a line: Anthropic's Mythos found thousands of never-seen zero-days on its own, and OpenAI shipped a 'cyber' model that's more permissive for hacking-adjacent work. The same model that finds a thousand holes to fix them can find them to exploit them — so the labs put the best security models behind a velvet rope, open only to vetted partners and governments. That's defensible. It also means a vendor now decides who gets defended. Here's the honest read for everyone not on the list.
- security
- business
June 5, 2026
One federal AI standard over fifty states — what it means if you build
Yesterday a 269-page bipartisan bill dropped that could override every state's AI law, and the headlines are loud. If you build with AI, the useful question isn't the politics — it's whether this changes what you actually have to do. The honest answer: far less than the headline suggests, because the rules that bind your product were never the ones this bill touches. Here's the two-layer version, in plain terms.
- business
- ai-native