Back to News
Advertisement

Ask HN: Running local LLMs? What's your model and hardware

aalfiedotwtf about 5 hours ago 4 comments

ES version is available. Content is displayed in original English for accuracy.

Advertisement

⚡ Community Insights

Discussion Sentiment

100% Positive

Analyzed from 95 words in the discussion.

Trending Topics

#coder#laguna#looks#context#projects#codestral#codegemma#codellama#north#mini

Discussion (4 Comments)Read Original on HackerNews

roscasabout 4 hours ago
qwen3-coder:30b

codestral:22b

codegemma:7b

codellama:34b

north-mini-code-1.0:q8_0

laguna-xs.2:latest

Currently testing those above on AMD Ryzen 5 3600x with 48GB of RAM and a nVidia 3080 with 10GB of VRAM.

Favorite model is laguna-xs.2 because it is really fast on CPU and very good.

alfiedotwtfabout 4 hours ago
Oh! Looks like I’ve been sleeping on Laguna!
alfiedotwtfabout 4 hours ago
If you’re able to run qwen3-coder, have you thought about 3.6 27B or 35B? Looking at benchmarks, 3.6 looks its gained a lot over qwen3-coder
cyanydeezabout 2 hours ago
qwen 3.6 35B on 128GB strix halo.

perfect speed to not melt the brain and can extend context for well scoped projects.

need to work with dynamic context pruning to ensure full reuse in larger projects.

deer-flow seems. to work well for project scoping and high level evals. opencode for coding.