Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

75% Positive

Analyzed from 5238 words in the discussion.

Trending Topics

#models#mistral#european#model#europe#small#money#don#data#company

Discussion (193 Comments)Read Original on HackerNews

trouve_search1 day ago
OK, I'm 100% rooting for both Mistral and task focused small models.

But Mistral has fall really far behind since 2025Q3. It seems they can't get good reasoning models working at even medium context sizes, which is necessary to be at the table right now.

Gemma4 and Qwen3.6 are currently best in the small size; Mistral's "small" model has ~4x the parameter count at 120B and isn't even competing with models a quarter its size.

Back one year ago with Mistral Small 3.1 they were keeping up, but they've fallen into irrelevancy right now.

If Mistral seriously wants to play the on-prem and small task-specific model game, a decent proxy would be to build models that get the r/localLlama crowd excited

ar01 day ago
I agree. I am a paying Le Chat Pro user, really rooting for a European alternative. But the quality difference between Mistral and the frontier labs is growing too big to ignore. It’s worrying to me that they didn’t talk much about new models at the conference, because that is really where their focus should be IMHO.

I am wondering what is keeping them back, though: Money? Compute? Skills? Training data? My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.

mattnewton1 day ago
My theory with no insider information: it’s a little of all of the above, but mostly money. To some extent, you can dig yourself out of a data hole with RL and a lot of compute. And you can buy a lot of compute and some data with a lot of money. Big labs have been operating in this regime for a while and it’s one of the drivers behind their costs beyond just scaling the weights and doing the actual training. Mistral just doesn’t have access to this level of compute or the money to try and muscle their way in.
MichaelZuoabout 23 hours ago
Don’t they supposedly have a huge amount of EU support?

Or at least there’s been a lot of noise about that.

teifererabout 21 hours ago
> I am wondering what is keeping them back, though: Money? Compute? Skills? Training data?

Not ruthless enough and no backing by a corrupt govt administration that has no morals but focuses on self-enrichment instead.

Might sound drastic but I think that's actually closer to the truth thn everbody likes to admit.

> My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.

Exactly.

steve_adams_8610 minutes ago
My hope is that people working with less resources will need to push on ingenuity to do more with less, leading to innovations. But there’s certainly no guarantee of it
miki123211about 8 hours ago
Should it, though?

I think an European company, taking Chinese models, perhaps doing its own post-training on them and training the Chinese-ness out, with a great chat service, enterprise API and coding agent, could be pretty valuable in itself.

pyvpxabout 2 hours ago
What does “training the Chinese-ness out” even mean?!
sofixaabout 12 hours ago
> I am wondering what is keeping them back, though: Money? Compute? Skills? Training data?

Considering all their talk about new DCs and compute, and a few offhand comments, it sounded to me that compute is a big limitation.

pembrookabout 21 hours ago
> what is keeping them back, though: Money? Compute? Skills? Training data?

All of the above and more. Everything holding Mistral back is the same thing that has held Europe back from competing in the entire digital revolution. See this 1991 article lamenting the loss of any viable European PC manufacturer: https://www.nytimes.com/1991/04/22/business/europe-stumbles-...

Mistral being in Europe is disadvantaged with:

1. Money: less diverse private pension fund environment = less LPs to invest in VC funds = less VC dollars to invest in new ventures. European money is vacuumed out of the private sector into state pension funds and dumped into low yielding government bonds. This starves the private sector of capital while inflating the % of GDP driven by government spending every year (government pension funds buying government bonds in circular fashion enable runaway deficit spending...just like circular AI infrastructure spending).

2. Talent & compute: due to #1, Silicon Valley can outbid Europe for the best talent and hardware. Watch an OpenAI launch video and listen to all the European accents.

3. Local market fragmentation: Europe is a collection of countries that pretend to work together while not even having a unified capital market. The average EU citizen can barely communicate with their neighbor in a common language beyond the level of a toddler (english fluency is massively overstated by Americans who only experience tourist capitals).

4. Regulatory disadvantages: In everything from company regs, employee regs, unions, privacy regs, data portability regs, etc.

It's not "culture" or Europeans being "lazy" as most people would claim. There's currently thousands of young french people working 80 hour weeks creating dumb consulting powerpoints or legacy investment banking deal memos as we speak. Ambitious people exist everywhere in equal proportion, they're just working on the wrong things.

Europe can't compete in the digital revolution the same way they could compete in the industrial revolution due to various system design choices. Culture is simply the aesthetically observed byproducts of system design.

cj00about 2 hours ago
> 4. Regulatory disadvantages: In everything from company regs, employee regs, unions, privacy regs, data portability regs, etc.

Agreed. My own anecdote: my company is global and for the past 6 months, we've been working on getting regulatory and legal approval for an LLM-based feature. The initial proposals of going live in all of our markets have been pared back to exclude Europe altogether due to the regulatory environment.

When I took part in company-wide gen AI councils that reviewed new product rollouts, it seemed like there was a definite hesitation from higher ups from pushing out any leading edge features to European markets. And it's not that the regulations would necessarily block these features from going live but that they'd increase implementation costs to the point where it wouldn't be worth it.

dash2about 18 hours ago
>The average EU citizen can barely communicate with their neighbor in a common language beyond the level of a toddler (english fluency is massively overstated by Americans who only experience tourist capitals).

Not true in my experience: even German waiters in small towns tend to have pretty fluent English.

PeterStuerabout 11 hours ago
1 and 2 are the same. Infinite money without barely any consequence because of 'reserve currency' privilege. To compete with that, the EU can't nuke the dollar because it would be suicide given the Eurodollar realities, and they can't anchor EU ip and talent because our politicians are too intertwined with globalist ideology and capital.
Fnoordabout 1 hour ago
> European money is vacuumed out of the private sector into state pension funds and dumped into low yielding government bonds.

Which countries do that? The ones in NL actually invest in US big tech.

Once Europe stops investing in USA, Europe will be better able to compete.

> Talent & compute: due to #1, Silicon Valley can outbid Europe for the best talent and hardware. Watch an OpenAI launch video and listen to all the European accents.

That just denotes European students are high quality.

Brain drain is happening due to bullying and fascism. The extend of longterm danage of current administration is unclear.

> Local market fragmentation: Europe is a collection of countries that pretend to work together while not even having a unified capital market. The average EU citizen can barely communicate with their neighbor in a common language beyond the level of a toddler (english fluency is massively overstated by Americans who only experience tourist capitals).

Bollocks. I have been in Berlin and Munich various times past decades, and people there speak English very well. Nowadays, translation is a profession which got hit by the AI club.

The people in the rural areas don't have to work together with other people from rural areas. They just need websites and tooling in their native language, or a major language.

Case in point: the French company Mistral has Dutch company ASML has one of their major investors. If you go to Eindhoven area (Netherlands mini SV called Brainport Eindhoven), you get away with English perfectly fine, and there too you will hear all kind if accents.

_fizz_buzz_about 13 hours ago
> 2. Talent & compute: due to #1, Silicon Valley can outbid Europe for the best talent and hardware. Watch an OpenAI launch video and listen to all the European accents.

There is definitely a lot of truth to that. Maybe a bit of an arbitrary measure, but these are the nationalites of the people that wrote the "Attention is all you need" paper. Pretty revealing I find:

Ashish Vaswani: India

Niki Parmar: India

Jakob Uszkoreit: Germany

Llion Jones: Wales (UK)

Aidan Gomez: Canada

Łukasz Kaiser: Poland

Illia Polosukhin: Ukraine

Noam Shazeer: USA

Shitty-kittyabout 16 hours ago
You say that as if the American version of maximalist Capitalism is good or desirable to most people.

Personally, I would much rather have good public pensions and health-care, than A.I agents.

WhyComboNadirabout 6 hours ago
re: #4 Maybe it’s easier if you grow up in the system and know how to navigate the written and unwritten rules, but as a dual Canadian-American who recently gained Austrian citizenship, the regulatory friction is absolutely real. I decided to launch a new venture through an Austrian GmbH.

There are supposedly streamlined paths for local residents, but I had to go through the standard corporate pipeline. I spent three months fighting a bizarre catch-22 between my notary (who cost €3k+) and the bank. To open the account, I had to prove I deposited €10k in capital. But I couldn't make the deposit without an active bank account. On top of that, the bank's compliance team kept arbitrarily canceling my application due to "incorrect answers"... refusing to tell me what the errors actually were and forcing me to restart the entire process ab initio.

I finally just gave up. I wrote off the €3,000 notary fee and €1,000 in registered office costs as a sunk cost, and incorporated a US LLC instead. It took under 10 minutes, no notary, fees of $25 since I did it myself, plus another 20 minutes to open the business bank account.

There was no commercial reason to choose Austria; it was purely sentimental. My ancestors were entrepreneurs in Linz and Vienna, and I loved the idea of renewing that legacy. But the sheer weight of the bureaucracy managed to kill about 99% of the early-stage startup enthusiasm you normally rely on to get a new project off the ground.

barrellabout 1 hour ago
I think it really depends on what you’re doing. I use mistral for many tasks in https://phrasing.app and they blow models many times their size out of the water.

None of my tasks use reasoning though (reasoning actually kills the performance) so perhaps that’s why. Still, I just had to rewrite my pipeline, and mistral was both faster, cheaper, and substantially better than any alternative

greyskull1 day ago
> task focused small models

This is tangential: and forgive my ignorance here, but is there an inherent reason why there aren't smaller, focused models from the frontier model providers?

I'm thinking something like a software-specific subset of Opus that is the default for use in Claude Code. Smaller, cheaper to deploy and consume, maybe faster.

pavpanchekha1 day ago
OpenAI used to make Codex-specific models, but they stopped. What I've gathered from interviews and similar is that training two models isn't worth the (small) lift from having a coding-specific model. You're pre-training on everything anyway, and coding RL is reasonably useful for general-purpose models too.
greyskull1 day ago
Interesting. I'd have guessed there would be meaningful opex benefits to serving smaller models.
baq1 day ago
agreed, the next price increase from frontier labs (and the inevitable limits decrease in subscription tiers) will have people thinking real hard about their model providers and that's when mistral should be ready. however, given their recent performance, I realistically don't have my hopes high up.
amunozo1 day ago
DeepSeek is both cheaper and better than Mistral.
barrellabout 1 hour ago
Not in many tasks. I use deepseek as a fallback in https://phrasing.app and it’s always very apparent when it happen (due to mistakes/clear performance drop off)
gregorygocabout 24 hours ago
Because they distill
djvdq1 day ago
Also, new Medium 3.5 is far more expensive than previous Mistral models, and much more expensive than e.g. Deepseek
KronisLVabout 22 hours ago
I tried it out on some dev tasks with their Mistral Vibe subscription, and the performance was pretty okay (okay, not great), both in regards to development and speed. Worse than Anthropic's models I'm used to but at 20 EUR per month it wasn't a bad deal - except that the 200k context size would more or less be a deal breaker in many cases.
bermudiabout 15 hours ago
Everything is more expensive than deepseek. They aren't frontier in intelligence but they are the frontier in cost per intelligence
raincoleabout 9 hours ago
> they've fallen into irrelevancy right now

It's a very charitable take, as Mistral has never really left the realm of irrelevancy.

It's only a matter of time before EU falls back to hosting Chinese models in EU datacenters.

rhdunnabout 24 hours ago
Yeah. I run LLM models locally and for me 22B-32B is the largest I'm willing to invest in trying out.

Even though Mistral 4 has 6B active parameters per token (allowing 3-3.5 per token parameters to be loaded on a 4090), the ~240GB download + storage is pushing the limits of being able to try this out locally, especially if you are downloading and evaluating multiple models.

It also makes it harder for other people to make downstream finetunes like with what happened with the older Mistral/Magistral models.

wolttamabout 21 hours ago
I think machines like the DGX Spark are about to become a lot more common/popular. It’s big enough to run sparse 150-250B MoEs with enough throughout for a single user. Deepseek v4 Flash is #1 (in terms of usage) on OpenRouter because it’s good enough to be useful. You can run it on a Spark (though it runs better across 2, which is getting up there in cost)
chartpathabout 19 hours ago
I find Mistral Medium 3.5 with OpenCode is perfectly fine if you're willing to talk to it in a more fine-grained way about actual code. For me that's fine because even with huge frontier models I don't like trying to vibe prompt like a product manager.
coredev_1 day ago
I don't agree that they are falling behind. Using both chat and cli I get what I need and it's comparable to "sota" when I compare.
arkhabout 10 hours ago
Mistral is entering the "let's extract has much money from EU taxpayers as we can" phase of European tech company which did not get bought by a US one.

They'll end like Dailymotion, just a zombie company.

echelon1 day ago
Nobody trying to compete with Google, OpenAI, and Anthropic should be playing the small models / local models game.

Foundation model labs should be building very large reasoning models, then leaving it to the community to distill them down.

You can't scale a small model up, but you can scale a small model down.

I'm convinced the only way we'll have a seat at the table in the future and avoid total runaway takeoff is if there are very large models within 80% of the capabilities of the frontier models. Tiny RTX models do diddly squat to remain competitive.

Build open weights models for running on H200s. I'll spin them up on RunPod or Lambda.

farley131 day ago
I do think there's a chance open weight models have a bit of a moment with the costs of frontier models growing on business balance sheets. It's unfortunate from my "privacy loving" PoV that it's mostly Chinese models filling the gap. ( the top models on openrouter for instance ).

I have used Mistral models out of pure ideology for web agents and the like which aren't doing a lot of heavy lifting.

theturtletalks1 day ago
Antirez’s Deepseek 4 Flash implementation that can run on MacBooks also was a revelation. It runs decently on M5 Max 128GB and it’s pointing out other bottlenecks like prefill speed which will improve.
ahnick1 day ago
I thought distillation meant small models don't have to compete with the big models and can always eventually achieve close parity, but it's just a matter of time to do the distillation? (i.e. how much lag do you want to live with) Am I oversimplifying?
gertlabs1 day ago
There is likely a theoretical limit to how much intelligence you can pack into a model of a given size (especially when stretching that over a large input context size).

Our evals are pretty complex so we only recently started testing ~30B class models, which are now becoming quite smart (on par with the frontier from 1 year ago). Mistral is far behind, but I'm rooting for them.

Data at https://gertlabs.com/rankings

lettergram1 day ago
We actually found the Mistral Small 4, quantized to 4bit was comparable to Qwen 3.6 27B and is roughly the same size. At least from our experience on our use cases, the quantization of the Mistral model worked far better than trying to quantize the Qwen family.

Fully agree to your point though, Mistral in general is far behind where I'd expect and Qwen in particular is crushing it at the smaller sizes.

Personally, I'd consider anything 20B params and above a "medium" model. Small being <20B and large >100B. I think obviously we can get to the huge 1-2T param models, but frankly the margin of accuracy improvement for the speed hit is kinda insane (1-2% for many metrics).

rhdunnabout 23 hours ago
It's all relative. For local use I'd classify it by hardware (VRAM size) using FP8 or Q6 quantization:

1. tiny <2-3B -- easily runnable on lower-spec hardware

2. small 4-8B -- runnable on 8GB GPUs

3. medium 9-12B -- runnable on 12GB GPUs

4. large 13-24B -- runnable on 16GB (for the lower end models) and 24GB GPUs

5. very large 25-32GB -- runnable on 32GB GPUs

6. huge >32GB -- not easily runnable on consumer GPUs without compromising performance (offloading layers to the CPU/RAM), quality (heavy quantization, esp. at <= Q4), or price (investing in multi-GPU setups and/or server-grade hardware).

You could possibly split huge down further, as 70GB models (e.g. llama 3) are easier to get working than >120GB models and 1TB models are completely intractable.

srousseyabout 23 hours ago
As a Mac user:

1. tiny <2-3B -- could run in a browser even, mac neo

2. small 4-8B -- last of browser options, MacBook Air base

3. medium 9-24B -- 32GB machine, air or pro notebook or mini

4. large 25-48B -- 64GB, pro notebook or mini

5. x-large 49-100B -- 128GB MacBook Pro or Studio

6. Huge > 100B -- 256/512GB Mac Studio

kergonathabout 22 hours ago
> a decent proxy would be to build models that get the r/localLlama crowd excited

I don’t really disagree with your post, but this is not exactly right. That subreddit seems to go from hype train to hype train every week, I haven’t found anything really insightful in it for quite a while now.

thatsadudeabout 15 hours ago
Nawh, they trained on test since Llama 2, no wonder.
dyauspitrabout 23 hours ago
Mistral is bad bad. For its use cases I feel like India’s Sarvam is doing better.
ctrlkctrlsabout 22 hours ago
channeling Rocky (extraterrestrial) there I see :)
antirez1 day ago
I really want Europe to be part of the AI development and research. And I strongly cheered for Mistral. But they are accumulating too much technological delay. This needs to be fixed, otherwise it will turn into yet another proof we are not able to run large tech with good results. Basically any Chinese lab is doing much better. It's not Mistral that created I don't want to say DeepSeek, but MiMo 2.5, Minimax 2.7, and so forth. There are only weaker and/or larger and slower (no MoE) models. Not good.
b65e8bee43c2ed01 day ago
https://en.wikipedia.org/wiki/Artificial_Intelligence_Act#Pe...

Europe shot itself in the dick with this hastily implemented at the height of mass hysteria bullshit and now no sane company will build anything there. an AI startup in the US or China can be a boy and his computer. in Europe, the boy needs a dozen lawyers.

Mistral's sinking into irrelevancy despite the head start they had, the very promising early models they released, and the funding they receive, might very well be the consequence of trying to comply with all that crap.

mhitzaabout 1 hour ago
So let me get this straight. You think that Europe "shot itself in the dick" by making it harder to deploy AI that:

- manipulates, including subliminally (hope you'll like your subliminal Ads mixed into your LLM output)

- profiling for social scoring

- automated thread labeling as an individual, with no human supervision

- facetracking databases

- emotional and "well-being" monitoring at work or in schools

- + many other kinds of surveillance tools.

I hope you are joking.

edit:

For context this was a snippet of prohibited use, which the fines listed on Wikipedia (theoretically apply to), https://artificialintelligenceact.eu/article/5/

Epa095about 14 hours ago
You don't compete with anthrophopic from the basement. For that you need either a shit loads of money, or a government which are not afraid of getting very very involved.

There is a lot of Europeans working on AI, it's just that a lot of them work for American companies. Because of money.

aleccoabout 11 hours ago
I think both of you are correct.
antirezabout 23 hours ago
Possibly yes but let me remember you that France, Italy Germany were against the AI act, so here something very odd is happening, that the EU funding nations are getting marginalized by the countries they welcomed on key topics for our future, and I believe corruption could be a big part of what is happening, both internal to those three countries and at an even more alarming rate in other countries.
gregorygocabout 22 hours ago
EU big nations getting marginalized: haha. The only reason there’s no US-like tariff on Chinese cars is because Germany was too scared it would lose its access to Chinese market.
neonstaticabout 22 hours ago
> the EU funding nations are getting marginalized by the countries they welcomed

Thank you for reminding us that all animals are equal, but some are more equal

aleccoabout 11 hours ago
Who put a nepo-baby lawyer in charge of the big €95bn AI fund? EU bureaucrats living the 6-figure high life with chauffeurs and private jets in a bubble completely isolated from reality.

I hate the fake European foreign-backed right-wing parties but they didn't cause the current situation.

But I'm afraid it might be too late as the cancer spread and did too much damage. Insane regulations, no energy, looming demographic/pension crisis, tax hell, and collapsing industries.

tardedmemeabout 18 hours ago
Way more important than this act are the police raids. Someone used your SaaS to send phishing (see today's front page HN)? They'll just take all your servers away. Goodbye business. Unless they think the general public would riot, so established companies are okay. You can't build a castle on a foundation of quicksand.
darkamaulabout 22 hours ago
Well , there isn’t also the opposite take from TechCrunch where they say: Why Paris may be the most important AI city outside Silicon Valley. [0]

While the EU loves its regulation, I still feel it’s too early to write it down in the AI race. It will not replace Anthropic or OpenAI any time soon, but even Google and Meta fail to do that.

If AI continue to grow and expand, there is enough space for many more unicorns.

[0] https://techcrunch.com/2026/05/28/why-paris-may-be-the-most-...

throw-the-towelabout 8 hours ago
As someone who has actually experienced the hiring market in Paris, I have a hard time believing this. The salaries are, unfortunately, pathetic.
djvdq1 day ago
It's yet another time when EU is killing our own possibilities to build real competition to US or Chinese tech.

And yet another time they will be thinking aloud in few year "what happened that we are fully dependent on USA?"

sofixaabout 12 hours ago
Did you read even a summary of the AI Act?

The gist of it is very simple - depending on the risk of what you're doing with AI, you have to document why it did what it did, and be able to explain it; or you can't use it at all. So if you're using AI for mass surveillance, you can't; if you're using it for treating loan applications you need to be able to explain why it approved/denied; if it's a customer service chatbot, do whatever, nobody cares.

Not only is burden of the legislation fairly low (and a lot of it hasn't come into force yet), it is extremely reasonable. No, sorry, we don't want a UnitedHealthcare using a broken algorithm on purpose to deny as much care as possible and hiding behind computer says no.

gspr1 day ago
So you're saying AI models should be allowed to freely "manipulate human behavior"?
cm2012about 23 hours ago
That is almost a meaningless sentence. Cats and traffic lights both manipulate human behavior.
xienze1 day ago
The problem is that statement is a bit too open to interpretation. Ever had Claude piss you off by being stupid and talking in circles? Sounds like manipulation of human behavior!
sbinneeabout 20 hours ago
When it comes to MoE, to me, I remember Mixtral model that showed the viability of MoE for the first time. I was impressed by their technical report. To be clear, MoE idea was already out there, if I am not mistaken. If they have pushed Mixtral model family further, who knows they might have achieved the reputation of what the current Qwen family has. A missed opportunity.
kubbabout 11 hours ago
> But they are accumulating too much technological delay.

How so? Catching up is easier and cheaper than spearheading the lead.

GaProgManabout 24 hours ago
Compared to the UK Government which recently announced 10 million GBP for AI research, which will likely be scooped up by consultants. I think Europe is doing fine considering.
antirezabout 23 hours ago
The first step would be indeed to join forces with UK, in order to don't be two entities, which is very unnatural to me.
kergonathabout 10 hours ago
That Brexit ship sailed. It’s very difficult to do anything with the UK currently.
gregorygocabout 22 hours ago
No, we don’t need US’s Trojan horse in the EU
simonw1 day ago
> BNP Paribas runs Mistral models on-prem for KYC in Belgium, with sensitive data staying within the bank's walls. Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app). For European companies in regulated industries, this is a good alternative to relying on US hyperscalers.

Mistral leaning into on-prem and European-hosted models is very smart.

throw140820201 day ago
Respectfully, I don't think it's "very" smart. It is a fair option given their limited options? Everyone is doing FDE or (customer engineering to be more transparent) because otherwise they will just be seen as markup on token cost. And the Neo-SaaS companies will take the money instead.

Who else will buy their AI?

and what other options do they have?

lucaspillerabout 2 hours ago
There are enough big banks in Europe that want to use AI, but almost all of them have very terrible software engineering (sorry to anyone who works there), so it's not like they are going to spin up their own cluster on top of open source models. If Mistral can filll this spot (provider and consultant) it could be a big win for them. Then repeat with other similar industries and governments in Europe.
port11about 13 hours ago
You don’t think it’s smart to get reliable funding this way? From Banking the Cash Cow?

Devstral is getting better, it’s the Vibe harness that’s holding it back (I think). I can see how that would drive some business as well.

Their chat thingie isn’t very well positioned, but gets results. Could be an euro or two per month, maybe bundled with some more features. It’s not like Mistral has no options, if anything they’re just a bit complacent and not ambitious with their plans.

bg241 day ago
Also Mistral did just the right thing by acquiring Koyeb, to beef up their deployment at scale expertise.
sbinneeabout 20 hours ago
My take is that Mistral is not focusing on generating contents such as code, images, or videos. They focus on multi-lingual models, OCR, voice, and others I believe. Their model intro page manifests that although it always confuses me because it's too colorful and there are too many categories, not to mention model names. I hope their decisions will pay off.
kergonathabout 10 hours ago
From what I understood they are retiring a lot of the specialised models in favour of the main family.
ElFitzabout 22 hours ago
Isn’t that the usual EU startup playbook once they give up on the B2C or world-scale SaaS markets? Refocusing on large (European) enterprise B2B and government contracts?

It always felt to me this (enterprise B2B) was where European startups went to die.

doctorpangloss1 day ago
Yeah but why use mistral on premises instead of Qwen?
kriro1 day ago
We're talking about enterprise customers. The trivial answer is Mistral has sales teams and consultants from the same company that builds the models and from the EU.
doctorpangloss1 day ago
i can invest in public markets in a lot of $10b sales and consultants businesses, who can also put mistral on premises (or do whatever the hell people ask for), it makes mistral sound like it is yet another one of those, not a growing $1T business.
simonw1 day ago
One reason might be that Mistral doesn't have a risk of weird training biases that were required by the Chinese government.
joe_mamba1 day ago
>weird training biases that were required by the Chinese government

What is "weird training biases" to us might not be weird to them and vice versa. Just ask the Chinese what they think about LGBTQ+, Japanese, pride parades, Islam and colored minorities.

Every nation has its own biases injected in its domestic LLMs at this point. Otherwise they risk getting in trouble for hate speech/disinformation in the jurisdiction where they operate.

Same how Google Maps cleverly biases the lines of disputed borders based on where you are viewing it from. Or how Google maps switched 'Gulf of Mexico' to 'Gulf of America' in an instant when the orange man signed the paper. Google won't want to anger the US administration the same way how Mistral won't want to anger France and the EU, so Mistral will have all the EU prime directives injected into its LLMs no matter if they're ludicrous or not. The law is the law whether you agree with it or not. Companies want to survive and will pander to whatever the whims the regime they live under are at the current moment regardless of what is right or wrong.

But if I'm using a LLM for personal projects or generating a photorealistic choreographed fight between Tom Cruise and Brad Pitt, I don't care what its political biases are, I care if it solves my problem better and cheaper than the competition, and here the Chinese models could end up winning the consumer market, which is why you see Mistral and other EU alternatives focusing exclusive on B-2-B corporate market.

plaidthunder1 day ago
Because the lab working on Mistral is in the European Union.
irusensei1 day ago
Please don't run Chinese models for KYC operations.
crimsoneer1 day ago
Based on what? Is there any evidence of risk at all?
neonstaticabout 22 hours ago
It may be very smart for them, but it also shows that the EU has no desire, therefore no chance, to change and lead anything. The only thing it has is regulation.
oblioabout 16 hours ago
The EU has just poured unfathomable amounts of money into continent wide infrastructure (https://reforms-investments.ec.europa.eu/recovery-and-resili...) - due to COVID, the military - due to Russia, etc. They can't do everything.
johnbarron1 day ago
Lets hope the models can do a better KYC than the humans have been doing..because they are well known.

Or is this a case of the humans, now preparing for the excuse it was the AI failure?

"BNP Paribas Sentenced for Conspiring to Violate the Trading with the Enemy Act" - https://www.justice.gov/archives/opa/pr/bnp-paribas-sentence...

"BNP Paribas caught up in French money laundering investigation" - https://www.reuters.com/business/finance/bnp-paribas-caught-...

"BNP Paribas faces $246m fine in currency scandal" - https://www.bbc.com/news/business-40635070

"BNP Paribas caught in a Cypriot money laundering investigation" - https://www.lemonde.fr/en/les-decodeurs/article/2023/12/26/b...

In Money Laundering their track record is unmatched: https://violationtracker.goodjobsfirst.org/parent/bnp-pariba...

pavlov1 day ago
When the humans have a track record of corruption, it might make sense for a company to seek parallel opinions from a LLM so they can at least flag suspicious human decisions.

Assuming BNP Paribas leadership wants to stop the corruption of course.

johnbarron1 day ago
psychoslave1 day ago
That's just one side of the story, not following it on details, but their own le chat explained to me that the company was a capitalist succubus starving to build data center in some north European country. Hilarious if you ask me.
tnolet1 day ago
Regardless of the business. Their website design is :chefs-kiss https://mistral.ai/
Waterluvianabout 21 hours ago
What specifically is good about it? I scrolled it on my phone and it seems pretty standard corporate website?
davey480161 day ago
I love everything about Mistral's branding.
ilja1 day ago
Le Chat was great, the rebrand to Vibe is meh
akkad33about 9 hours ago
It looks very crowded and the paragraphs are off
Eldodi1 day ago
I was at the event, and was impressed by the attendance, all the leaders from the major european listed companies were there.

Also interesting to note the number of partners they invited. Going from Microsoft, Accenture and EY to startups like alpic.ai or lingo.dev . Seems like they are ramping up their M&A game too

tomaskafkaabout 10 hours ago
I have been on a lecture from great government IT person, they are evaluating models and are very unhappy about the situation, because they’d love to use Mistral, in some cases it’s the only EU based model they can use … and they know it’s really bad and falling more behind.

It is well possible that Mistral can make a profitable business by being bad, but still the only possible model for EU uses. Sad story, sad to witness.

LucidLynx1 day ago
As an European: 100x YES!

I really like the direction and the transparency of Mistral, among those players.

dtang27181 day ago
Even as a non European, it's great to see some competition from Europe against the US/Chinese models.
Orasabout 23 hours ago
Sounds like they don’t have a moat at all. It’s like software consultancy with a data centre. And then the article mentions many customers using these models on prem (so data centre is not really a plus).

What’s stopping any country backed startup from fine-tuning small open source models?

vb-8448about 2 hours ago
No one in Europe will buy from a random startup, the consultancy part is a MUST to do businesses with big corps, banks, finances, insurances, governs, public administration ...
whiplash451about 15 hours ago
Maybe because distilling small models from bigger ones that you control gives you better small models than fine-tuning from bigger models you don't control?

(I am not claiming it is the case, but stating this as an assumption)

petcat1 day ago
> Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app).

Maybe my perspective is skewed on what "huge scale" means, but 2 million users? That's like a few hundred megabytes of data? Or a couple GBs if there's a lot of per-user data?

vnglst1 day ago
Maybe, but using state-of-the-art large language models to solve customer support queries with agentic can quickly use a lot of tokens. What I understood from the talk is that they used agents with limited responsibility and (assumption from me) smaller models, to the make sure the answers were quick, reliable and not too costly.
hadlock1 day ago
There are several payments processing companies that are already largely using AI for customer support queries. They still have an escape hatch to a human but at least one of those companies (on the smaller side) is reporting a ~99% success rate, they are down to a handful of human customer service employees now for cases where the customer can't find/produce the transaction ID.
fidotron1 day ago
European consumer focused businesses do not scale easily the same way US ones do, which is a major contributor to their problems developing tech businesses generally.

OTOH such things can be quite defensible, they just rarely become anything like as profitable.

kioleanuabout 13 hours ago
I just got an email from them saying that they’re retiring some (most?) of the dedicated models like devstral gradually through August and one should now use the general model. Cost grows exponentially

Devstral 2 (devstral-2512 and devstral-latest) → We recommend transitioning to Mistral Medium 3.5 (mistral-medium-3-5 with reasoning_effort set to "high"), a stronger model, priced $1.5/$7.5 per million input/output tokens (change from the previous $0.4/$2).

phillc73about 12 hours ago
I received the same email, although couldn’t quite figure out which retiring model I was still using, as I thought I’d already transitioned to Mistral-Medium-3.5 for everything. Anyway, after receiving the email, my hope was that it meant they were also planning on releasing some new, improved models in the next months.
zuzululuabout 24 hours ago
Wasn't even aware Mistral was around and I think that just shows you how irrelevant it has become and not a very good sign for EU in general when the best talent are working for American AI companies.

I saw Tibo's tweet a while back and it was basically a legitimate complaint about the extreme taxation he faced back in EU (France I think) and its pretty obvious how much of a hinderance top down centralized regulation is to innovation.

While I welcome competition and independence, nobody can argue with American innovation and its ability to attract the best of the best. Once it takes seat of the AI reigns there is very little chance for other countries to compete, very much similar to semiconductor field and how only a few select countries have the talent and monopoly over its particular supply chain.

It's clear to anyone looking in that whatever EU is doing is not working (not just AI) and will not work as they do not seem flexible or humble enough to steer itself.

gregorygocabout 22 hours ago
Big tech has remote offices in every major European economy, and they pay well above top 90th percentile of market rate. It basically has a talent sucking effect on the entire economy.
Advertisement
ramstar3000about 5 hours ago
Excited to see more about their partnership they with Alexa+. In agentic and tool-calling, Mistral’s model architecture excels at the exact structured JSON output Alexa needs to trigger APIs and smart home routines without breaking.
maxdo1 day ago
Oh most prominent eu ai company . Without reading an article predict next, will update after :

1. They give up on building competitive models. It’s time to drink wine not to struggle with competition

2. Because of #1 they will talk a bit about something around llms maybe coding agents , and after start talking about sovereignty.

lejalvabout 13 hours ago
Unlike you, who drank the wine before writing the comment.
FinnKuhn1 day ago
3. They are going to start focusing on B2B implementation and deployment.

See what happened to Aleph Alpha...

Imbissabout 7 hours ago
Really hope there is going to be some competition from Europe in the AI Space.
sbinneeabout 20 hours ago
I believe that Mistral team is doing the best they can do. I like the directions they push; open models for various tasks, on-prem has a lot of potential. Sure, I use Claude code mostly for coding. But there are so many tasks other than just coding. Even for coding, eventually, I am certain they will catch up and Vibe becomes tolerable soon.
ogou1 day ago
I've said it before that Mistral is underrated. They are looking at real world use of LLMs and tooling. Bespoke models are very appealing to lots of non-tech centered companies and state agencies. Also, Mistral's actual platform is useful. While others are watching performance leaderboards like this is some eSports stream, they are building real world uses.
stephantulabout 24 hours ago
I was also at the event and was pretty disappointed. Most of the talks were pretty low on information. I was at the “build” stage, which supposedly was the technical stage, but the talks there didn’t really go into technical specifics.

The papyrus talk was awesome though.

rvzabout 21 hours ago
We should be supporting and using local models that allow you to run whatever model you want.
t0loabout 14 hours ago
Not to be confused with the fantastic AI Now institute, run by Meredith Whittaker of Signal among others.

https://ainowinstitute.org/

Almost feels like name squatting

gameshot911about 18 hours ago
Does anyone else always read "Mistrial" instead of "Mistral"? Always think I'm about to read a juicy gossip piece, and let down when it's just a standard update on an AI company.

edit A lot of AI company names are really strange, actually. "Claude" is really the best a trillion+ dollar company could come up with? It sounds like the name of a grandpa or something.