Back to News
Advertisement
Advertisement

⚑ Community Insights

Discussion Sentiment

67% Positive

Analyzed from 312 words in the discussion.

Trending Topics

#per#cost#user#gpu#electricity#rent#power#qwen#more#around

Discussion (9 Comments)Read Original on HackerNews

breputβ€’21 minutes ago
> We'll assume a 32B dense model, as they've have gotten quite good for production use and a B200 can comfortably serve them. This could be a Gemma, Qwen, DeepSeek, whatever.

That seems like a very consequential point to include halfway through the post. They aren't wrong that Qwen 3.6 26B or Gemma 4 31B are quite good, depending on the use case, but if we're doing napkin math, I'd want some more headroom in the assumptions.

They really ought to have Qwen parameterize their post's calculations and add sliders so a reader could play around with the values.

JBAnderson5β€’26 minutes ago
> Realistically, serving 300 users per GPU you'll spend a lifetime cost of about $133 per user, plus the datacenter/upkeep bill.

What is the operational cost and when does it become more expensive than the upfront capex?

The B200 tops out at 1000W and idles around 140W. It averages around 600W. https://www.lightly.ai/blog/nvidia-b200-vs-h100 U.S. average electricity cost is $.14 per kWh in March. https://www.eia.gov/electricity/monthly/epm_table_grapher.ph...

600/1000 *.14 =$0.084 per hour $2.01 per day $60.30 per month With 300 users, $.20 per user per month. Seems fairly cheap for the electricity.

Does anyone know how to estimate colo/data center rent costs? Where did I screw up my estimates?

BadBadJellyBeanβ€’8 minutes ago
I wonder what the power costs are when you put jet turbines in front of your DC to power it.
smalltorchβ€’about 2 hours ago
>This largely depends on whether you own or rent your hardware. At $40,000 per B200, your lifetime cost per user is 40_000/num_users. In the 100% duty cycle case (worst for cost), that's 6k$ per user. Realistically, serving 300 users per GPU you'll spend a lifetime cost of about $133 per user, plus the datacenter/upkeep bill. If you rent the GPU, the cost is more straightforward. At an hourly rate of $43, your hourly cost per user is 4/num_users. For num_users=300 you get an hourly rate of about $0.013 per user, or $9.36 per month.

This leads me to believe you can buy a GPU but leave it at a data center?

Do people do this? I don't understand. Or are you equating upkeep bill to electricity on premises?

__sβ€’about 2 hours ago
smalltorchβ€’about 2 hours ago
So what's the cost separating them from placing this box at their premise?

Network throughout?

namibjβ€’about 1 hour ago
Plus power and cooling.
BadBadJellyBeanβ€’32 minutes ago
I'd like to see a bit of the running costs inside the napkin math. Power, cooling, maintenance, rent, etc. are probably significant factors as well.