Notes from the Mistral AI Now Summit (koenvangilst.nl)

449 points by vnglst a day ago

trouve_search a day ago

OK, I'm 100% rooting for both Mistral and task focused small models.

But Mistral has fall really far behind since 2025Q3. It seems they can't get good reasoning models working at even medium context sizes, which is necessary to be at the table right now.

Gemma4 and Qwen3.6 are currently best in the small size; Mistral's "small" model has ~4x the parameter count at 120B and isn't even competing with models a quarter its size.

Back one year ago with Mistral Small 3.1 they were keeping up, but they've fallen into irrelevancy right now.

If Mistral seriously wants to play the on-prem and small task-specific model game, a decent proxy would be to build models that get the r/localLlama crowd excited

ar0 a day ago

I agree. I am a paying Le Chat Pro user, really rooting for a European alternative. But the quality difference between Mistral and the frontier labs is growing too big to ignore. It’s worrying to me that they didn’t talk much about new models at the conference, because that is really where their focus should be IMHO.

I am wondering what is keeping them back, though: Money? Compute? Skills? Training data? My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.

mattnewton a day ago

My theory with no insider information: it’s a little of all of the above, but mostly money. To some extent, you can dig yourself out of a data hole with RL and a lot of compute. And you can buy a lot of compute and some data with a lot of money. Big labs have been operating in this regime for a while and it’s one of the drivers behind their costs beyond just scaling the weights and doing the actual training. Mistral just doesn’t have access to this level of compute or the money to try and muscle their way in.

MichaelZuo a day ago

teiferer a day ago

> I am wondering what is keeping them back, though: Money? Compute? Skills? Training data?

Not ruthless enough and no backing by a corrupt govt administration that has no morals but focuses on self-enrichment instead.

Might sound drastic but I think that's actually closer to the truth thn everbody likes to admit.

> My fear is that you are really only getting really good models by training on very dubious data (outputs from the frontier models etc) and that Mistral is too European and too enterprisey to take those risks.

Exactly.

steve_adams_86 2 hours ago

miki123211 9 hours ago

Should it, though?

I think an European company, taking Chinese models, perhaps doing its own post-training on them and training the Chinese-ness out, with a great chat service, enterprise API and coding agent, could be pretty valuable in itself.

pyvpx 4 hours ago

sofixa 14 hours ago

> I am wondering what is keeping them back, though: Money? Compute? Skills? Training data?

Considering all their talk about new DCs and compute, and a few offhand comments, it sounded to me that compute is a big limitation.

pembrook a day ago

> what is keeping them back, though: Money? Compute? Skills? Training data?

All of the above and more. Everything holding Mistral back is the same thing that has held Europe back from competing in the entire digital revolution. See this 1991 article lamenting the loss of any viable European PC manufacturer: https://www.nytimes.com/1991/04/22/business/europe-stumbles-...

Mistral being in Europe is disadvantaged with:

1. Money: less diverse private pension fund environment = less LPs to invest in VC funds = less VC dollars to invest in new ventures. European money is vacuumed out of the private sector into state pension funds and dumped into low yielding government bonds. This starves the private sector of capital while inflating the % of GDP driven by government spending every year (government pension funds buying government bonds in circular fashion enable runaway deficit spending...just like circular AI infrastructure spending).

2. Talent & compute: due to #1, Silicon Valley can outbid Europe for the best talent and hardware. Watch an OpenAI launch video and listen to all the European accents.

3. Local market fragmentation: Europe is a collection of countries that pretend to work together while not even having a unified capital market. The average EU citizen can barely communicate with their neighbor in a common language beyond the level of a toddler (english fluency is massively overstated by Americans who only experience tourist capitals).

4. Regulatory disadvantages: In everything from company regs, employee regs, unions, privacy regs, data portability regs, etc.

It's not "culture" or Europeans being "lazy" as most people would claim. There's currently thousands of young french people working 80 hour weeks creating dumb consulting powerpoints or legacy investment banking deal memos as we speak. Ambitious people exist everywhere in equal proportion, they're just working on the wrong things.

Europe can't compete in the digital revolution the same way they could compete in the industrial revolution due to various system design choices. Culture is simply the aesthetically observed byproducts of system design.

cj00 4 hours ago

dash2 20 hours ago

PeterStuer 13 hours ago

_fizz_buzz_ 15 hours ago

Fnoord 3 hours ago

Shitty-kitty 18 hours ago

WhyComboNadir 7 hours ago

barrell 3 hours ago

I think it really depends on what you’re doing. I use mistral for many tasks in https://phrasing.app and they blow models many times their size out of the water.

None of my tasks use reasoning though (reasoning actually kills the performance) so perhaps that’s why. Still, I just had to rewrite my pipeline, and mistral was both faster, cheaper, and substantially better than any alternative

greyskull a day ago

> task focused small models

This is tangential: and forgive my ignorance here, but is there an inherent reason why there aren't smaller, focused models from the frontier model providers?

I'm thinking something like a software-specific subset of Opus that is the default for use in Claude Code. Smaller, cheaper to deploy and consume, maybe faster.

pavpanchekha a day ago

OpenAI used to make Codex-specific models, but they stopped. What I've gathered from interviews and similar is that training two models isn't worth the (small) lift from having a coding-specific model. You're pre-training on everything anyway, and coding RL is reasonably useful for general-purpose models too.

greyskull a day ago

baq a day ago

agreed, the next price increase from frontier labs (and the inevitable limits decrease in subscription tiers) will have people thinking real hard about their model providers and that's when mistral should be ready. however, given their recent performance, I realistically don't have my hopes high up.

amunozo a day ago

DeepSeek is both cheaper and better than Mistral.

barrell 3 hours ago

gregorygoc a day ago

djvdq a day ago

Also, new Medium 3.5 is far more expensive than previous Mistral models, and much more expensive than e.g. Deepseek

KronisLV a day ago

bermudi 17 hours ago

raincole 11 hours ago

> they've fallen into irrelevancy right now

It's a very charitable take, as Mistral has never really left the realm of irrelevancy.

It's only a matter of time before EU falls back to hosting Chinese models in EU datacenters.

rhdunn a day ago

Yeah. I run LLM models locally and for me 22B-32B is the largest I'm willing to invest in trying out.

Even though Mistral 4 has 6B active parameters per token (allowing 3-3.5 per token parameters to be loaded on a 4090), the ~240GB download + storage is pushing the limits of being able to try this out locally, especially if you are downloading and evaluating multiple models.

It also makes it harder for other people to make downstream finetunes like with what happened with the older Mistral/Magistral models.

wolttam a day ago

I think machines like the DGX Spark are about to become a lot more common/popular. It’s big enough to run sparse 150-250B MoEs with enough throughout for a single user. Deepseek v4 Flash is #1 (in terms of usage) on OpenRouter because it’s good enough to be useful. You can run it on a Spark (though it runs better across 2, which is getting up there in cost)

chartpath 21 hours ago

I find Mistral Medium 3.5 with OpenCode is perfectly fine if you're willing to talk to it in a more fine-grained way about actual code. For me that's fine because even with huge frontier models I don't like trying to vibe prompt like a product manager.

coredev_ a day ago

I don't agree that they are falling behind. Using both chat and cli I get what I need and it's comparable to "sota" when I compare.

arkh 11 hours ago

Mistral is entering the "let's extract has much money from EU taxpayers as we can" phase of European tech company which did not get bought by a US one.

They'll end like Dailymotion, just a zombie company.

echelon a day ago

Nobody trying to compete with Google, OpenAI, and Anthropic should be playing the small models / local models game.

Foundation model labs should be building very large reasoning models, then leaving it to the community to distill them down.

You can't scale a small model up, but you can scale a small model down.

I'm convinced the only way we'll have a seat at the table in the future and avoid total runaway takeoff is if there are very large models within 80% of the capabilities of the frontier models. Tiny RTX models do diddly squat to remain competitive.

Build open weights models for running on H200s. I'll spin them up on RunPod or Lambda.

farley13 a day ago

I do think there's a chance open weight models have a bit of a moment with the costs of frontier models growing on business balance sheets. It's unfortunate from my "privacy loving" PoV that it's mostly Chinese models filling the gap. ( the top models on openrouter for instance ).

I have used Mistral models out of pure ideology for web agents and the like which aren't doing a lot of heavy lifting.

theturtletalks a day ago

ahnick a day ago

I thought distillation meant small models don't have to compete with the big models and can always eventually achieve close parity, but it's just a matter of time to do the distillation? (i.e. how much lag do you want to live with) Am I oversimplifying?

gertlabs a day ago

lettergram a day ago

We actually found the Mistral Small 4, quantized to 4bit was comparable to Qwen 3.6 27B and is roughly the same size. At least from our experience on our use cases, the quantization of the Mistral model worked far better than trying to quantize the Qwen family.

Fully agree to your point though, Mistral in general is far behind where I'd expect and Qwen in particular is crushing it at the smaller sizes.

Personally, I'd consider anything 20B params and above a "medium" model. Small being <20B and large >100B. I think obviously we can get to the huge 1-2T param models, but frankly the margin of accuracy improvement for the speed hit is kinda insane (1-2% for many metrics).

rhdunn a day ago

It's all relative. For local use I'd classify it by hardware (VRAM size) using FP8 or Q6 quantization:

1. tiny <2-3B -- easily runnable on lower-spec hardware

2. small 4-8B -- runnable on 8GB GPUs

3. medium 9-12B -- runnable on 12GB GPUs

4. large 13-24B -- runnable on 16GB (for the lower end models) and 24GB GPUs

5. very large 25-32GB -- runnable on 32GB GPUs

6. huge >32GB -- not easily runnable on consumer GPUs without compromising performance (offloading layers to the CPU/RAM), quality (heavy quantization, esp. at <= Q4), or price (investing in multi-GPU setups and/or server-grade hardware).

You could possibly split huge down further, as 70GB models (e.g. llama 3) are easier to get working than >120GB models and 1TB models are completely intractable.

sroussey a day ago

kergonath a day ago

> a decent proxy would be to build models that get the r/localLlama crowd excited

I don’t really disagree with your post, but this is not exactly right. That subreddit seems to go from hype train to hype train every week, I haven’t found anything really insightful in it for quite a while now.

thatsadude 17 hours ago

Nawh, they trained on test since Llama 2, no wonder.

dyauspitr a day ago

Mistral is bad bad. For its use cases I feel like India’s Sarvam is doing better.

ctrlkctrls a day ago

channeling Rocky (extraterrestrial) there I see :)

antirez a day ago

I really want Europe to be part of the AI development and research. And I strongly cheered for Mistral. But they are accumulating too much technological delay. This needs to be fixed, otherwise it will turn into yet another proof we are not able to run large tech with good results. Basically any Chinese lab is doing much better. It's not Mistral that created I don't want to say DeepSeek, but MiMo 2.5, Minimax 2.7, and so forth. There are only weaker and/or larger and slower (no MoE) models. Not good.

b65e8bee43c2ed0 a day ago

https://en.wikipedia.org/wiki/Artificial_Intelligence_Act#Pe...

Europe shot itself in the dick with this hastily implemented at the height of mass hysteria bullshit and now no sane company will build anything there. an AI startup in the US or China can be a boy and his computer. in Europe, the boy needs a dozen lawyers.

Mistral's sinking into irrelevancy despite the head start they had, the very promising early models they released, and the funding they receive, might very well be the consequence of trying to comply with all that crap.

mhitza 3 hours ago

So let me get this straight. You think that Europe "shot itself in the dick" by making it harder to deploy AI that:

- manipulates, including subliminally (hope you'll like your subliminal Ads mixed into your LLM output)

- profiling for social scoring

- automated thread labeling as an individual, with no human supervision

- facetracking databases

- emotional and "well-being" monitoring at work or in schools

- + many other kinds of surveillance tools.

I hope you are joking.

edit:

For context this was a snippet of prohibited use, which the fines listed on Wikipedia (theoretically apply to), https://artificialintelligenceact.eu/article/5/

Epa095 15 hours ago

You don't compete with anthrophopic from the basement. For that you need either a shit loads of money, or a government which are not afraid of getting very very involved.

There is a lot of Europeans working on AI, it's just that a lot of them work for American companies. Because of money.

alecco 13 hours ago

antirez a day ago

Possibly yes but let me remember you that France, Italy Germany were against the AI act, so here something very odd is happening, that the EU funding nations are getting marginalized by the countries they welcomed on key topics for our future, and I believe corruption could be a big part of what is happening, both internal to those three countries and at an even more alarming rate in other countries.

gregorygoc a day ago

neonstatic a day ago

alecco 12 hours ago

tardedmeme 20 hours ago

Way more important than this act are the police raids. Someone used your SaaS to send phishing (see today's front page HN)? They'll just take all your servers away. Goodbye business. Unless they think the general public would riot, so established companies are okay. You can't build a castle on a foundation of quicksand.

darkamaul a day ago

Well , there isn’t also the opposite take from TechCrunch where they say: Why Paris may be the most important AI city outside Silicon Valley. [0]

While the EU loves its regulation, I still feel it’s too early to write it down in the AI race. It will not replace Anthropic or OpenAI any time soon, but even Google and Meta fail to do that.

If AI continue to grow and expand, there is enough space for many more unicorns.

[0] https://techcrunch.com/2026/05/28/why-paris-may-be-the-most-...

throw-the-towel 10 hours ago

djvdq a day ago

It's yet another time when EU is killing our own possibilities to build real competition to US or Chinese tech.

And yet another time they will be thinking aloud in few year "what happened that we are fully dependent on USA?"

sofixa 14 hours ago

Did you read even a summary of the AI Act?

The gist of it is very simple - depending on the risk of what you're doing with AI, you have to document why it did what it did, and be able to explain it; or you can't use it at all. So if you're using AI for mass surveillance, you can't; if you're using it for treating loan applications you need to be able to explain why it approved/denied; if it's a customer service chatbot, do whatever, nobody cares.

Not only is burden of the legislation fairly low (and a lot of it hasn't come into force yet), it is extremely reasonable. No, sorry, we don't want a UnitedHealthcare using a broken algorithm on purpose to deny as much care as possible and hiding behind computer says no.

gspr a day ago

So you're saying AI models should be allowed to freely "manipulate human behavior"?

cm2012 a day ago

xienze a day ago

sbinnee a day ago

When it comes to MoE, to me, I remember Mixtral model that showed the viability of MoE for the first time. I was impressed by their technical report. To be clear, MoE idea was already out there, if I am not mistaken. If they have pushed Mixtral model family further, who knows they might have achieved the reputation of what the current Qwen family has. A missed opportunity.

kubb 13 hours ago

> But they are accumulating too much technological delay.

How so? Catching up is easier and cheaper than spearheading the lead.

GaProgMan a day ago

Compared to the UK Government which recently announced 10 million GBP for AI research, which will likely be scooped up by consultants. I think Europe is doing fine considering.

antirez a day ago

The first step would be indeed to join forces with UK, in order to don't be two entities, which is very unnatural to me.

kergonath 12 hours ago

gregorygoc a day ago

simonw a day ago

> BNP Paribas runs Mistral models on-prem for KYC in Belgium, with sensitive data staying within the bank's walls. Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app). For European companies in regulated industries, this is a good alternative to relying on US hyperscalers.

Mistral leaning into on-prem and European-hosted models is very smart.

throw14082020 a day ago

Respectfully, I don't think it's "very" smart. It is a fair option given their limited options? Everyone is doing FDE or (customer engineering to be more transparent) because otherwise they will just be seen as markup on token cost. And the Neo-SaaS companies will take the money instead.

Who else will buy their AI?

and what other options do they have?

lucaspiller 4 hours ago

There are enough big banks in Europe that want to use AI, but almost all of them have very terrible software engineering (sorry to anyone who works there), so it's not like they are going to spin up their own cluster on top of open source models. If Mistral can filll this spot (provider and consultant) it could be a big win for them. Then repeat with other similar industries and governments in Europe.

port11 15 hours ago

You don’t think it’s smart to get reliable funding this way? From Banking the Cash Cow?

Devstral is getting better, it’s the Vibe harness that’s holding it back (I think). I can see how that would drive some business as well.

Their chat thingie isn’t very well positioned, but gets results. Could be an euro or two per month, maybe bundled with some more features. It’s not like Mistral has no options, if anything they’re just a bit complacent and not ambitious with their plans.

bg24 a day ago

Also Mistral did just the right thing by acquiring Koyeb, to beef up their deployment at scale expertise.

sbinnee a day ago

My take is that Mistral is not focusing on generating contents such as code, images, or videos. They focus on multi-lingual models, OCR, voice, and others I believe. Their model intro page manifests that although it always confuses me because it's too colorful and there are too many categories, not to mention model names. I hope their decisions will pay off.

kergonath 12 hours ago

From what I understood they are retiring a lot of the specialised models in favour of the main family.

ElFitz a day ago

Isn’t that the usual EU startup playbook once they give up on the B2C or world-scale SaaS markets? Refocusing on large (European) enterprise B2B and government contracts?

It always felt to me this (enterprise B2B) was where European startups went to die.

doctorpangloss a day ago

Yeah but why use mistral on premises instead of Qwen?

kriro a day ago

We're talking about enterprise customers. The trivial answer is Mistral has sales teams and consultants from the same company that builds the models and from the EU.

doctorpangloss a day ago

simonw a day ago

One reason might be that Mistral doesn't have a risk of weird training biases that were required by the Chinese government.

joe_mamba a day ago

plaidthunder a day ago

Because the lab working on Mistral is in the European Union.

irusensei a day ago

Please don't run Chinese models for KYC operations.

crimsoneer a day ago

neonstatic a day ago

It may be very smart for them, but it also shows that the EU has no desire, therefore no chance, to change and lead anything. The only thing it has is regulation.

oblio 18 hours ago

The EU has just poured unfathomable amounts of money into continent wide infrastructure (https://reforms-investments.ec.europa.eu/recovery-and-resili...) - due to COVID, the military - due to Russia, etc. They can't do everything.

johnbarron a day ago

Lets hope the models can do a better KYC than the humans have been doing..because they are well known.

Or is this a case of the humans, now preparing for the excuse it was the AI failure?

"BNP Paribas Sentenced for Conspiring to Violate the Trading with the Enemy Act" - https://www.justice.gov/archives/opa/pr/bnp-paribas-sentence...

"BNP Paribas caught up in French money laundering investigation" - https://www.reuters.com/business/finance/bnp-paribas-caught-...

"BNP Paribas faces $246m fine in currency scandal" - https://www.bbc.com/news/business-40635070

"BNP Paribas caught in a Cypriot money laundering investigation" - https://www.lemonde.fr/en/les-decodeurs/article/2023/12/26/b...

In Money Laundering their track record is unmatched: https://violationtracker.goodjobsfirst.org/parent/bnp-pariba...

pavlov a day ago

When the humans have a track record of corruption, it might make sense for a company to seek parallel opinions from a LLM so they can at least flag suspicious human decisions.

Assuming BNP Paribas leadership wants to stop the corruption of course.

johnbarron a day ago

psychoslave a day ago

That's just one side of the story, not following it on details, but their own le chat explained to me that the company was a capitalist succubus starving to build data center in some north European country. Hilarious if you ask me.

tnolet a day ago

Regardless of the business. Their website design is :chefs-kiss https://mistral.ai/

Waterluvian a day ago

What specifically is good about it? I scrolled it on my phone and it seems pretty standard corporate website?

davey48016 a day ago

I love everything about Mistral's branding.

ilja a day ago

Le Chat was great, the rebrand to Vibe is meh

akkad33 11 hours ago

It looks very crowded and the paragraphs are off

Eldodi a day ago

I was at the event, and was impressed by the attendance, all the leaders from the major european listed companies were there.

Also interesting to note the number of partners they invited. Going from Microsoft, Accenture and EY to startups like alpic.ai or lingo.dev . Seems like they are ramping up their M&A game too

tomaskafka 11 hours ago

I have been on a lecture from great government IT person, they are evaluating models and are very unhappy about the situation, because they’d love to use Mistral, in some cases it’s the only EU based model they can use … and they know it’s really bad and falling more behind.

It is well possible that Mistral can make a profitable business by being bad, but still the only possible model for EU uses. Sad story, sad to witness.

LucidLynx a day ago

As an European: 100x YES!

I really like the direction and the transparency of Mistral, among those players.

dtang2718 a day ago

Even as a non European, it's great to see some competition from Europe against the US/Chinese models.

Oras a day ago

Sounds like they don’t have a moat at all. It’s like software consultancy with a data centre. And then the article mentions many customers using these models on prem (so data centre is not really a plus).

What’s stopping any country backed startup from fine-tuning small open source models?

vb-8448 4 hours ago

No one in Europe will buy from a random startup, the consultancy part is a MUST to do businesses with big corps, banks, finances, insurances, governs, public administration ...

Oras 2 hours ago

Mistral is a startup that happened to raise 100M. I said in my comment a “country backed startup”, which mistral is

vb-8448 an hour ago

whiplash451 17 hours ago

Maybe because distilling small models from bigger ones that you control gives you better small models than fine-tuning from bigger models you don't control?

(I am not claiming it is the case, but stating this as an assumption)

petcat a day ago

> Abanca is using agent orchestration to handle sensitive customer information at a huge scale (2 million customers in their app).

Maybe my perspective is skewed on what "huge scale" means, but 2 million users? That's like a few hundred megabytes of data? Or a couple GBs if there's a lot of per-user data?

vnglst a day ago

Maybe, but using state-of-the-art large language models to solve customer support queries with agentic can quickly use a lot of tokens. What I understood from the talk is that they used agents with limited responsibility and (assumption from me) smaller models, to the make sure the answers were quick, reliable and not too costly.

hadlock a day ago

There are several payments processing companies that are already largely using AI for customer support queries. They still have an escape hatch to a human but at least one of those companies (on the smaller side) is reporting a ~99% success rate, they are down to a handful of human customer service employees now for cases where the customer can't find/produce the transaction ID.

fidotron a day ago

European consumer focused businesses do not scale easily the same way US ones do, which is a major contributor to their problems developing tech businesses generally.

OTOH such things can be quite defensible, they just rarely become anything like as profitable.

kioleanu 15 hours ago

I just got an email from them saying that they’re retiring some (most?) of the dedicated models like devstral gradually through August and one should now use the general model. Cost grows exponentially

Devstral 2 (devstral-2512 and devstral-latest) → We recommend transitioning to Mistral Medium 3.5 (mistral-medium-3-5 with reasoning_effort set to "high"), a stronger model, priced $1.5/$7.5 per million input/output tokens (change from the previous $0.4/$2).

phillc73 14 hours ago

I received the same email, although couldn’t quite figure out which retiring model I was still using, as I thought I’d already transitioned to Mistral-Medium-3.5 for everything. Anyway, after receiving the email, my hope was that it meant they were also planning on releasing some new, improved models in the next months.

zuzululu a day ago

Wasn't even aware Mistral was around and I think that just shows you how irrelevant it has become and not a very good sign for EU in general when the best talent are working for American AI companies.

I saw Tibo's tweet a while back and it was basically a legitimate complaint about the extreme taxation he faced back in EU (France I think) and its pretty obvious how much of a hinderance top down centralized regulation is to innovation.

While I welcome competition and independence, nobody can argue with American innovation and its ability to attract the best of the best. Once it takes seat of the AI reigns there is very little chance for other countries to compete, very much similar to semiconductor field and how only a few select countries have the talent and monopoly over its particular supply chain.

It's clear to anyone looking in that whatever EU is doing is not working (not just AI) and will not work as they do not seem flexible or humble enough to steer itself.

gregorygoc a day ago

Big tech has remote offices in every major European economy, and they pay well above top 90th percentile of market rate. It basically has a talent sucking effect on the entire economy.

maxdo a day ago

Oh most prominent eu ai company . Without reading an article predict next, will update after :

1. They give up on building competitive models. It’s time to drink wine not to struggle with competition

2. Because of #1 they will talk a bit about something around llms maybe coding agents , and after start talking about sovereignty.

lejalv 15 hours ago

Unlike you, who drank the wine before writing the comment.

FinnKuhn a day ago

3. They are going to start focusing on B2B implementation and deployment.

See what happened to Aleph Alpha...

ramstar3000 6 hours ago

Excited to see more about their partnership they with Alexa+. In agentic and tool-calling, Mistral’s model architecture excels at the exact structured JSON output Alexa needs to trigger APIs and smart home routines without breaking.

sbinnee a day ago

I believe that Mistral team is doing the best they can do. I like the directions they push; open models for various tasks, on-prem has a lot of potential. Sure, I use Claude code mostly for coding. But there are so many tasks other than just coding. Even for coding, eventually, I am certain they will catch up and Vibe becomes tolerable soon.

Imbiss 9 hours ago

Really hope there is going to be some competition from Europe in the AI Space.

ogou a day ago

I've said it before that Mistral is underrated. They are looking at real world use of LLMs and tooling. Bespoke models are very appealing to lots of non-tech centered companies and state agencies. Also, Mistral's actual platform is useful. While others are watching performance leaderboards like this is some eSports stream, they are building real world uses.

stephantul a day ago

I was also at the event and was pretty disappointed. Most of the talks were pretty low on information. I was at the “build” stage, which supposedly was the technical stage, but the talks there didn’t really go into technical specifics.

The papyrus talk was awesome though.

rvz a day ago

We should be supporting and using local models that allow you to run whatever model you want.

t0lo 16 hours ago

Not to be confused with the fantastic AI Now institute, run by Meredith Whittaker of Signal among others.

https://ainowinstitute.org/

Almost feels like name squatting

gameshot911 20 hours ago

Does anyone else always read "Mistrial" instead of "Mistral"? Always think I'm about to read a juicy gossip piece, and let down when it's just a standard update on an AI company.

edit A lot of AI company names are really strange, actually. "Claude" is really the best a trillion+ dollar company could come up with? It sounds like the name of a grandpa or something.