Notes
Short pieces about the methodology and architecture decisions behind the AI systems I ship — specs, evals, multi-agent orchestration, LLM integration, and the discipline of directing coding agents.
June 6, 2026
Copilot turned the meter on. The $20 tool was always a fiction.
On June 1, GitHub switched every Copilot plan to usage-based billing, and developers watched a single AI coding session eat a whole month's budget before lunch. The internet called it greed. It isn't — it's the meter finally being visible. Agentic AI was never a flat $20 a month; you just weren't shown the bill. Here's what that means for anyone building on these tools, and why cost has to be a design decision now, not a surprise.
- business
- agents
- ai-native
June 6, 2026
Three frontier models landed this month. I changed one line.
June 2026 is dropping three new frontier models in a single month — GPT-5.6, Gemini 3.5 Pro, and Claude Mythos — on top of two that shipped in May. For most teams that's a stressful treadmill of 'should we migrate?' For one that built the boring way, it's three free upgrades, each a one-line config change. The relentless model churn isn't a problem to keep up with. It's a gift you collect automatically — but only if you built the seam to catch it.
- ai-native
- architecture
- agents
June 6, 2026
12,520 agent tools are on the internet with no lock on the door
Security researchers scanned the internet and found 12,520 MCP servers — the tool connectors that let AI agents do real things — sitting in the open, most with no authentication at all. Some let any stranger run database queries or shell commands. The protocol just made auth mandatory in its June spec, but the damage shows the deeper bug: the default was 'no lock,' and people shipped the default. If your agent can reach a tool, so can everyone else — unless you decided otherwise on purpose.
- security
- architecture
- agents
June 6, 2026
Your IDE isn't where you build anymore
Mistral just shipped a coding mode where you start a build from Slack, the agent works in a cloud sandbox while your laptop is off, and you get back a pull request to review. Cursor and others are doing the same. The editor — the thing we treated as the place where software gets made — is quietly becoming one trigger among many. The work is moving from keystrokes in a window to tasks you delegate and diffs you judge. That's a bigger shift in the job than it sounds.
- methodology
- ai-native
- agents
June 6, 2026
Salesforce admits it: your workflow was built for humans, not agents
Salesforce just launched a whole product to fix the reason enterprise AI keeps stalling, and the diagnosis is the honest part: agents aren't failing because the models can't reason — they're failing because the workflows underneath were never built for a machine that follows instructions literally. Your processes are full of gaps that a human quietly fills and an agent walks straight off. That's the real work nobody wants to do, and no model upgrade fixes it.
- architecture
- methodology
- agents
June 5, 2026
"Agent OS" is a buzzword — here's the boring checklist underneath
This month everyone shipped an 'Agent Operating System' — Fiserv, Experian, Microsoft, a dozen startups. The word 'OS' sounds like serious infrastructure, and sometimes it is. Often it's a wrapper with a grand name. The good news: there's a short, unglamorous checklist that tells the two apart — and it's the same stuff I keep writing about. Judge an Agent OS by what's inside the box, not the label on it.
- architecture
- agents
- business