Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?
1238 points by cloudking 2 days ago | 532 comments
tl;dr: A Hacker News user asks whether anyone has fully replaced Claude or GPT with a local model for their primary coding workflow, rather than just experimental use. They're requesting details on setups and performance metrics like tokens per second.
HN Discussion:
  • Successfully replaced cloud models with local Qwen/Gemma setups for privacy and cost savings
  • ~Local models handle most tasks, but users still fall back to frontier models for complex work
  • Local models aren't worth the effort given the opportunity cost versus frontier models
  • ~Open-source models served by third-party inference providers offer better speed and cost than running locally or using the big providers
  • Expectations matter; appropriately sized local models are good enough for well-scoped tasks