Discussion (5 Comments) (original thread on HackerNews)
Wow, bad idea. Domain-specific models simply don’t work. Ever. You should not be using some shoddy 3M model for medical purposes when you can spend just a few dollars extra and get GPT, which is miles and miles better. The local-language value proposition is also exaggerated.
This article keeps repeating the lie that network coverage is hard to find in India and that local models win. This is, on its face, ridiculous to anyone who has been to India. Almost everyone has access to a smartphone with a 4G connection. What they don’t have is the ability to afford a phone that can run a good model. Why would I, as a poor farmer in India, use an extremely underpowered 3B model on my 100-dollar smartphone when I can use the free version of ChatGPT, which is miles ahead in every dimension?
My 1000-dollar iPhone can barely run Gemma 4, which is hardly usable for serious questions anyway.
I do get the need for the Indian ecosystem to build internal competency so that it is prepared when the time comes. But for now, pursuing a distillation attack strategy like China's looks better. Or have companies that specialise in local integration, something the big model companies don’t have expertise in.
All these capitalist-funded AI models with bloated hardware requirements