📝 Post
ML Monitoring: Keep It Simple, Keep It Compounding
You don't need drift detection to start. One dashboard, one runbook, and a process that gets a little better every time.
You don't need drift detection to start. One dashboard, one runbook, and a process that gets a little better every time.
Why would you make 1M context the default at no extra cost unless something changed architecturally?
What to expect when running local models on consumer hardware
The Muon optimization algorithm stablizes multi-million dollar GPu training run and does so on the strength of three "magic" numbers