1 article tagged Local LLMs.
Claude as the architect. A cheap local model as the drafter. Claude QAs everything. The token bill drops as a side-effect of using each model for what it does best. Here is how I run it on my own rig.