Thoughts, guides, and experiments in AI engineering.
June 22, 2026
Practical habits for reducing token waste when coding with local AI models.
June 21, 2026
Getting 90 tokens per second from Qwen 3.6 27B on a Windows 11 rig with an RTX 5090.