Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

83% Positive

Analyzed from 144 words in the discussion.

Trending Topics

#gemma#model#fast#speed#tps#question#opus#imagine#myself#okay

Discussion (6 Comments)Read Original on HackerNews

tmanchester•about 9 hours ago
Okay this is actually pretty cool. Gemma 4 is a nice little model and I've really enjoyed playing around with it. At 1800 tok/s turns are essentially instant, it's a bit of a trip
simianwords•about 1 hour ago
I just tried it on their website and it is extremely fast. I wonder what is the value prop of this? Where would I want

1. a smaller model

2. also non local, hosted on cloud

I can't think of any case.

anthonypasq•about 1 hour ago
speed is always better. if you have ever used a coding agent with 1000 tps going back to 50 seems like walking through sludge. for simple question i hate waiting 2 minutes for opus to loop 50 times just to read some files and answer a question.

its not necessarily specifically labout gemma 4, but in a year or 2 when we have opus class models at 2000 tps imagine the productivity.

simianwords•about 1 hour ago
Of course I think speed is preferable but I don’t see myself paying for a fast Gemma
anthonypasq•35 minutes ago
i mean, i can imagine a million different apps that use ai that want cheap multimodal capabilities with high latency.
simianwords•about 1 hour ago
Answering myself: fancy autocomplete in my IDE?

Text autocorrect on my phone? Like give it all the context about me and so on.