📝 Post
Notes on Self-Hosting LLMs (Early 2026)
What to expect when running local models on consumer hardware
What to expect when running local models on consumer hardware
The Muon optimization algorithm stablizes multi-million dollar GPu training run and does so on the strength of three "magic" numbers