But Mistral has fall really far behind since 2025Q3. It seems they can't get good reasoning models working at even medium context sizes, which is necessary to be at the table right now.
Gemma4 and Qwen3.6 are currently best in the small size; Mistral's "small" model has ~4x the parameter count at 120B and isn't even competing with models a quarter its size.
Back one year ago with Mistral Small 3.1 they were keeping up, but they've fallen into irrelevancy right now.
If Mistral seriously wants to play the on-prem and small task-specific model game, a decent proxy would be to build models that get the r/localLlama crowd excited
I am wondering what is keeping them back, though: Money? Compute? Skills? Training data? My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.
Not ruthless enough and no backing by a corrupt govt administration that has no morals but focuses on self-enrichment instead.
Might sound drastic but I think that's actually closer to the truth thn everbody likes to admit.
> My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.
Exactly.
All of the above and more. Everything holding Mistral back is the same thing that has held Europe back from competing in the entire digital revolution. See this 1991 article lamenting the loss of any viable European PC manufacturer: https://www.nytimes.com/1991/04/22/business/europe-stumbles-...
Mistral being in Europe is disadvantaged with:
1. Money: less diverse private pension fund environment = less LPs to invest in VC funds = less VC dollars to invest in new ventures. European money is vacuumed out of the private sector into state pension funds and dumped into low yielding government bonds. This starves the private sector of capital while inflating the % of GDP driven by government spending every year (government pension funds buying government bonds in circular fashion enable runaway deficit spending...just like circular AI infrastructure spending).
2. Talent & compute: due to #1, Silicon Valley can outbid Europe for the best talent and hardware. Watch an OpenAI launch video and listen to all the European accents.
3. Local market fragmentation: Europe is a collection of countries that pretend to work together while not even having a unified capital market. The average EU citizen can barely communicate with their neighbor in a common language beyond the level of a toddler (english fluency is massively overstated by Americans who only experience tourist capitals).
4. Regulatory disadvantages: In everything from company regs, employee regs, unions, privacy regs, data portability regs, etc.
It's not "culture" or Europeans being "lazy" as most people would claim. There's currently thousands of young french people working 80 hour weeks creating dumb consulting powerpoints or legacy investment banking deal memos as we speak. Ambitious people exist everywhere in equal proportion, they're just working on the wrong things.
Europe can't compete in the digital revolution the same way they could compete in the industrial revolution due to various system design choices. Culture is simply the aesthetically observed byproducts of system design.
Considering all their talk about new DCs and compute, and a few offhand comments, it sounded to me that compute is a big limitation.
I think an European company, taking Chinese models, perhaps doing its own post-training on them and training the Chinese-ness out, with a great chat service, enterprise API and coding agent, could be pretty valuable in itself.
This is tangential: and forgive my ignorance here, but is there an inherent reason why there aren't smaller, focused models from the frontier model providers?
I'm thinking something like a software-specific subset of Opus that is the default for use in Claude Code. Smaller, cheaper to deploy and consume, maybe faster.
Foundation model labs should be building very large reasoning models, then leaving it to the community to distill them down.
You can't scale a small model up, but you can scale a small model down.
I'm convinced the only way we'll have a seat at the table in the future and avoid total runaway takeoff is if there are very large models within 80% of the capabilities of the frontier models. Tiny RTX models do diddly squat to remain competitive.
Build open weights models for running on H200s. I'll spin them up on RunPod or Lambda.
I have used Mistral models out of pure ideology for web agents and the like which aren't doing a lot of heavy lifting.
It's a very charitable take, as Mistral has never really left the realm of irrelevancy.
It's only a matter of time before EU falls back to hosting Chinese models in EU datacenters.
Fully agree to your point though, Mistral in general is far behind where I'd expect and Qwen in particular is crushing it at the smaller sizes.
Personally, I'd consider anything 20B params and above a "medium" model. Small being <20B and large >100B. I think obviously we can get to the huge 1-2T param models, but frankly the margin of accuracy improvement for the speed hit is kinda insane (1-2% for many metrics).
1. tiny <2-3B -- easily runnable on lower-spec hardware
2. small 4-8B -- runnable on 8GB GPUs
3. medium 9-12B -- runnable on 12GB GPUs
4. large 13-24B -- runnable on 16GB (for the lower end models) and 24GB GPUs
5. very large 25-32GB -- runnable on 32GB GPUs
6. huge >32GB -- not easily runnable on consumer GPUs without compromising performance (offloading layers to the CPU/RAM), quality (heavy quantization, esp. at <= Q4), or price (investing in multi-GPU setups and/or server-grade hardware).
You could possibly split huge down further, as 70GB models (e.g. llama 3) are easier to get working than >120GB models and 1TB models are completely intractable.
None of my tasks use reasoning though (reasoning actually kills the performance) so perhaps that’s why. Still, I just had to rewrite my pipeline, and mistral was both faster, cheaper, and substantially better than any alternative
Even though Mistral 4 has 6B active parameters per token (allowing 3-3.5 per token parameters to be loaded on a 4090), the ~240GB download + storage is pushing the limits of being able to try this out locally, especially if you are downloading and evaluating multiple models.
It also makes it harder for other people to make downstream finetunes like with what happened with the older Mistral/Magistral models.
They'll end like Dailymotion, just a zombie company.
I don’t really disagree with your post, but this is not exactly right. That subreddit seems to go from hype train to hype train every week, I haven’t found anything really insightful in it for quite a while now.
Mistral leaning into on-prem and European-hosted models is very smart.
Who else will buy their AI?
and what other options do they have?
Devstral is getting better, it’s the Vibe harness that’s holding it back (I think). I can see how that would drive some business as well.
Their chat thingie isn’t very well positioned, but gets results. Could be an euro or two per month, maybe bundled with some more features. It’s not like Mistral has no options, if anything they’re just a bit complacent and not ambitious with their plans.
Or is this a case of the humans, now preparing for the excuse it was the AI failure?
"BNP Paribas Sentenced for Conspiring to Violate the Trading with the Enemy Act" - https://www.justice.gov/archives/opa/pr/bnp-paribas-sentence...
"BNP Paribas caught up in French money laundering investigation" - https://www.reuters.com/business/finance/bnp-paribas-caught-...
"BNP Paribas faces $246m fine in currency scandal" - https://www.bbc.com/news/business-40635070
"BNP Paribas caught in a Cypriot money laundering investigation" - https://www.lemonde.fr/en/les-decodeurs/article/2023/12/26/b...
In Money Laundering their track record is unmatched: https://violationtracker.goodjobsfirst.org/parent/bnp-pariba...
Assuming BNP Paribas leadership wants to stop the corruption of course.
It always felt to me this (enterprise B2B) was where European startups went to die.
How so? Catching up is easier and cheaper than spearheading the lead.
Europe shot itself in the dick with this hastily implemented at the height of mass hysteria bullshit and now no sane company will build anything there. an AI startup in the US or China can be a boy and his computer. in Europe, the boy needs a dozen lawyers.
Mistral's sinking into irrelevancy despite the head start they had, the very promising early models they released, and the funding they receive, might very well be the consequence of trying to comply with all that crap.
- manipulates, including subliminally (hope you'll like your subliminal Ads mixed into your LLM output)
- profiling for social scoring
- automated thread labeling as an individual, with no human supervision
- facetracking databases
- emotional and "well-being" monitoring at work or in schools
- + many other kinds of surveillance tools.
I hope you are joking.
edit:
For context this was a snippet of prohibited use, which the fines listed on Wikipedia (theoretically apply to), https://artificialintelligenceact.eu/article/5/
There is a lot of Europeans working on AI, it's just that a lot of them work for American companies. Because of money.
The gist of it is very simple - depending on the risk of what you're doing with AI, you have to document why it did what it did, and be able to explain it; or you can't use it at all. So if you're using AI for mass surveillance, you can't; if you're using it for treating loan applications you need to be able to explain why it approved/denied; if it's a customer service chatbot, do whatever, nobody cares.
Not only is burden of the legislation fairly low (and a lot of it hasn't come into force yet), it is extremely reasonable. No, sorry, we don't want a UnitedHealthcare using a broken algorithm on purpose to deny as much care as possible and hiding behind computer says no.
While the EU loves its regulation, I still feel it’s too early to write it down in the AI race. It will not replace Anthropic or OpenAI any time soon, but even Google and Meta fail to do that.
If AI continue to grow and expand, there is enough space for many more unicorns.
[0] https://techcrunch.com/2026/05/28/why-paris-may-be-the-most-...
And yet another time they will be thinking aloud in few year "what happened that we are fully dependent on USA?"
What’s stopping any country backed startup from fine-tuning small open source models?
(I am not claiming it is the case, but stating this as an assumption)
their model's efficacy for the mainstream comparisons may not be up to the task, but they are pivoting to their own lane for it. but the scope beyond the local market, it is yet to be seen.
Maybe my perspective is skewed on what "huge scale" means, but 2 million users? That's like a few hundred megabytes of data? Or a couple GBs if there's a lot of per-user data?
OTOH such things can be quite defensible, they just rarely become anything like as profitable.
1. They give up on building competitive models. It’s time to drink wine not to struggle with competition
2. Because of #1 they will talk a bit about something around llms maybe coding agents , and after start talking about sovereignty.
I saw Tibo's tweet a while back and it was basically a legitimate complaint about the extreme taxation he faced back in EU (France I think) and its pretty obvious how much of a hinderance top down centralized regulation is to innovation.
While I welcome competition and independence, nobody can argue with American innovation and its ability to attract the best of the best. Once it takes seat of the AI reigns there is very little chance for other countries to compete, very much similar to semiconductor field and how only a few select countries have the talent and monopoly over its particular supply chain.
It's clear to anyone looking in that whatever EU is doing is not working (not just AI) and will not work as they do not seem flexible or humble enough to steer itself.
I really like the direction and the transparency of Mistral, among those players.
Devstral 2 (devstral-2512 and devstral-latest) → We recommend transitioning to Mistral Medium 3.5 (mistral-medium-3-5 with reasoning_effort set to "high"), a stronger model, priced $1.5/$7.5 per million input/output tokens (change from the previous $0.4/$2).
edit A lot of AI company names are really strange, actually. "Claude" is really the best a trillion+ dollar company could come up with? It sounds like the name of a grandpa or something.
Also interesting to note the number of partners they invited. Going from Microsoft, Accenture and EY to startups like alpic.ai or lingo.dev . Seems like they are ramping up their M&A game too
It is well possible that Mistral can make a profitable business by being bad, but still the only possible model for EU uses. Sad story, sad to witness.
The papyrus talk was awesome though.
Almost feels like name squatting