Notes
Short pieces about the methodology and architecture decisions behind the AI systems I ship — specs, evals, multi-agent orchestration, LLM integration, and the discipline of directing coding agents.
June 5, 2026
The AI just started profiling you in the background
Until this week, ChatGPT remembered only what you told it to remember. As of June 4 it 'dreams': a background process reads across all your past chats and quietly builds a model of you, keeping it current on its own. That's a genuinely useful upgrade, and it's also the moment a chatbot became a profiler. The EU's data regulator said exactly that today. Here's what actually changed, in plain terms, and why it's the grounding problem aimed at you.
- ai-native
- security
June 5, 2026
Four flagships in four weeks — "which model wins" is a design smell
This month a wave of flagship models is landing almost on top of each other — Gemini 3.5 Pro, a new Claude, Grok 5, with Opus 4.8 already out. Everyone's refreshing leaderboards. If that wave makes you anxious — are we on the best one, should we switch — the anxiety is telling you something about your architecture, not the models. Here's the honest read, and what 'stay swappable' actually takes.
- ai-native
- architecture
June 4, 2026
A token paywall is not SaaS
Founders are pricing AI products with SaaS instincts — flat monthly, per-seat — and quietly bleeding, because the thing that made SaaS magical is gone. Near-zero marginal cost is dead: every user burns tokens, forever, and cost scales with use. GitHub Copilot lost up to $80 a month per heavy user at a flat $10. AI products aren't software with great margins; they're closer to a utility with real cost of goods. Price like it.
- business
- ai-native
June 4, 2026
The labs are racing on price now, not IQ
For two years a flagship model reveal had one headline: we're the smartest, here's the benchmark we beat. At Microsoft Build 2026 the headline changed — same league as Opus, but ~10x more output per dollar and 60% fewer tokens. The boast moved from IQ to efficiency, and the whole industry is reorganizing around price, not peak capability. Here's why the axis flipped, and what it means if you build.
- ai-native
- business
- agents
June 4, 2026
The bottom rung is gone
The data is in, and it's specific: AI ate the junior, not the senior. Companies aren't firing juniors — they've quietly stopped hiring them, because the tasks juniors did are exactly what agents now do. It looks efficient. But seniors aren't hired, they're grown — a senior is a junior who survived a few thousand bugs. Cut the juniors to save money now, and you starve the pipeline that makes the seniors you depend on. We're eating the seed corn.
- careers
- ai-native
- business
June 4, 2026
The bill came due
For two years the cost of AI felt like a rounding error. In 2026 the invoice arrived: Uber burned a year's AI budget in four months, Microsoft yanked Claude Code from its own engineers, and JPMorgan says tokens are eating internet profits. This isn't a blip. Token billing inverts the economics software was built on, and the cruel twist is that a better agent costs more. Here's what actually changed, honestly.
- business
- ai-native
- agents