🔗 Link
Dogfooding Is the New Code Review
AI agents can ship code faster than humans can review it. Correct code still needs human time to become good software.
AI agents can ship code faster than humans can review it. Correct code still needs human time to become good software.
OpenAI explained why Codex kept talking about goblins. Of course it was reward hacking.
Codex GPT-5.5 apparently needed an explicit, repeated reminder not to talk about certain creatures unless they are actually relevant.
Anthropic's harness engineering article highlights what's missing from most AI coding: scrutiny and evaluation.