Muse Spark – Meta Superintelligence Labs (meta.ai)
184 points by snowman647 3 hours ago
tty456 2 hours ago
I don't get the comments trashing this. If it slightly beats or even matches Opus 4.6, it means Meta is capable of building a model competitive with the leading AI company. Sure, they spent a lot of money and will have on-going costs. But how much more work would it take to turn that into a coding agent people are willing to try (and pay for) along side their usage of a collection of agents (Claude, Codex, etc)? Also means Meta doesn't have to pay another company to use a SATA model across all their products (including IG and WhatsApp, vr) which will matter to their balance sheet long term (despite the constant r&d spend).
prodigycorp 2 hours ago
Comments trashing this are rightly correct skeptics who remember the benchmaxxing of llama 4. This model was out in the woods as early as like a couple months ago but they didn't release it because it was at gemini 2.5 pro levels.
zozbot234 2 hours ago
The llama4 series was one of the earliest large MoE's to be made publically available. People just ignored it because they were focused on running smaller and denser models at the time, we should know better these days.
dilap an hour ago
prodigycorp 2 hours ago
gritspants 2 hours ago
I would like someone to tell me how stupid I am. If I were Meta/Zuck I'd open source a great model the moment my company developed it. This just looks like a pitch to investors, otherwise.
jamiequint 2 hours ago
"This just looks like a pitch to investors"
The goal of public companies is generally to generate profit for their investors.
samrus an hour ago
gritspants 27 minutes ago
SoftTalker an hour ago
redox99 2 hours ago
> If it slightly beats or even matches Opus 4.6
It doesn't though
ryeguy_24 2 hours ago
Curious on why you think this. Any data points that led you to this?
howdareme an hour ago
ChipopLeMoral an hour ago
> I don't get the comments trashing this.
People like to hate on Meta regardless of anything, and regardless of whether it's justified or not. Not saying it isn't, just that it's many people's default bias.
gallerdude 3 hours ago
This would have been an amazing release 6 months ago. But the industry moves so fast, this is a trite release. Maybe it’s best for Meta to sell their superintelligence division. I don’t think Zuck’s vision is particularly compelling.
gordonhart 3 hours ago
A new model comparable (ish) to the Claude/Gemini/GPT flagships is a big deal for the industry and for Meta even if it doesn't set the new frontier.
gallerdude 2 hours ago
I’m not sure. If it was open source, certainly. But 4th place doesn’t really matter if you have nothing different to add.
lairv 2 hours ago
datadrivenangel 2 hours ago
blahblaher 2 hours ago
Why would you use this instead of the other more proven models? Unless it's significantly cheaper. The general population mostly wants it free, and the more professional users are willing to pay for good/better responses.
NitpickLawyer 2 hours ago
gordonhart 2 hours ago
zozbot234 2 hours ago
Their new Contemplating mode gives this model a Deep Research ability (akin to existing models from GPT and Gemini) that might make it quite comparable to the just-announced Mythos.
temp_praneshp an hour ago
solenoid0937 2 hours ago
dgellow 2 hours ago
I never understood why meta decided to join the race. They don’t sell compute like Google or Microsoft. Why not let others do the hard work and integrate their LLMs in your systems if needed? I assume it’s because they have Instagram, Facebook, WhatsApp, Thread data and feel they should be the ones using them for training, but it’s really not obvious how having a frontier AI lab benefits their business
observationist 2 hours ago
Adtech Money. They've got GPUs, they've got the infrastructure, and they've got the advertisement platform, and the point is getting AI that can exploit the adtech and create a flywheel effect, maximizing return from the data they collect from Insta, WhatsApp, Facebook, etc.
It's not just about LLMs, it's about being able to model consumers and markets and psychology and so on. Meta is also big in the manipulation side of things, any sort of cynical technological exploitation of humans you can imagine but that is technically legal, they're doing it for profit.
bachmeier an hour ago
> I never understood why meta decided to join the race.
I can think of at least two reasons. Price and customizability. If they train their own models on their own data, they potentially have a better model at a better price, and they're not at the mercy of Anthropic's decisions when they decide to raise prices. Additionally, if you use someone else's model, you use it the way they create it and permit you to use it. In a couple years, who has any idea how these models are used. Arguably, a company the size of Meta should be in control of their AI models.
eldenring 2 hours ago
Because there's a realistic chance this is the only important software technology moving forward, and commoditizes Metas's entire business which is software.
dgellow an hour ago
SoftTalker an hour ago
LLMs/Chat-based systems will reach a point where Facebook, WhatsApp, Threads, Instagram, etc. are all unnecessary. The idea of opening a browser or a specific app to do a thing will seem antiquated. You can do it all with your chat-based agent. Meta wants to be part of that.
operatingthetan an hour ago
dgellow an hour ago
vinni2 2 hours ago
From what I heard Meta is spending hundreds of millions each month in Claude credits for developers. So that’s a huge saving if they have own models that match Opus.
spindump8930 32 minutes ago
chermi 2 hours ago
You basically have to be involved if you're meta. Even if there's only 5% chance this AI stuff is as disruptive as the labs claim it is, you can't afford to miss out. Even if you're lagging frontier, you must develop the competency internally. Otherwise you ignored a 5% chance of total annihilation, probably even exposing you to shareholder lawsuits.
storus an hour ago
Only thanks to Meta we have competitive local LLMs. Without LLama nothing decent would have been released. Commoditize your complements in action.
bee_rider 2 hours ago
I think they just want to be a winner in the “next thing.” They hit social networking, but missed mobile operating systems and didn’t compellingly win at social media. Eventually an ambitious person with a bazillion dollars wants a clear win, right?
xnx 2 hours ago
Zuck is trying to convince himself he's good, and not just lucky.
KaiserPro 2 hours ago
A few things:
1) meta was doing this at scale before openAI
2) decent ML is critical to catagorising content at scale, the more accurate and fast the category, the finer the recommendations can be (ie instead of woman, outside as a tag for a video, woman, age, hair colour, location, subjects in view, main subject of video, video style) doing that as fast as possible with as little energy as possible is mission critical
3) The llama leak basically evaporated the moat around openAI who _could_ have become a competitor
4) for the AR stuff, all of these models (and visual models) are required to make the platform work. They also need complete ownership so that it can be distilled to make it run on tiny hardware
5) dick swinging
6) they genuinely want to become a industrial behemoth, so robots, hardware, etc are now all in scope.
aylmao an hour ago
First and most importantly is the fact they have a lot of very valuable data they wouldn't want to siphon to a competitor. This data is a key strategic asset in the space where they do business.
Secondly though, I think it has to do with the fact Meta is big enough to worry about vertical integration and full control of their business.
The whole reason they've been trying to make AR/VR happen for over a decade now is the assumption of a worst case and best case scenario. The worst case is Apple and Google wants them gone. This isn't as far fetched as it seems, Google has historically been Meta's biggest competitor and even tried to release its own social network back when Meta was threatening them. If either pulls Meta apps from their respective stores, it'd be an immense blow to Meta; their whole trillion-dollar business depends on competitor's platforms.
Meta tried making inroads into the phone business but failed; it is a very crowded market after all. So they changed their strategy. Instead of playing catch-up, they'd invent "the next iPhone" and be the first to a brand new market. This is the best case scenario; they invent a new platform where they can be dominant from day 1 and stop depending on competitor's hardware, not only removing that risk factor for them, but also unlocking a new market they can control.
AI ties into all this because it appears to be key for this next platform to happen. You will communicate with these smart glasses via voice, hand gestures, or subtle movements that a model will have to interpret. The features that could make them stand out as more than just a screen on your face are all AI related; object detection, world understanding, context awareness, etc. If all this were done via a 3rd party Meta would effectively be back on square one: a competitor could easily yank away its model access, or sell it to a competitor. Meta would be again at the mercy of others.
Compared to other big-tech players, I think it's easy to see how Meta is in a riskier position. There's little Google or Microsoft can do to kill the iPhone. There's little Apple or Google can do to kill Amazon's online store. There's little Amazon or Apple can do to kill Microsoft's business deals. Google and Meta are primarily in the business of capturing people's data, attention, and selling ads, and both Google and Apple could do quite some damage to Meta. Beyond expanding it, it's important for them to invest in ways to protect their money-printing machine.
addandsubtract an hour ago
To download all those torrents, obviously.
yoz-y 2 hours ago
AI NPCs to fill in the empty Metaverse?
gallerdude 2 hours ago
I’m sure there’s more to it than this, but it feels like Zuck has pet interests like VR and now AI.
alex1138 2 hours ago
chairmansteve 2 hours ago
Pumps up the stock price.
swyx 2 hours ago
you dont understand why zuck, who paid $1B for instagram when they had no revenue and 7 employees because he is paranoid about platform shifts, decided to join the race for (what is seeming highly possibly) the biggest platform shift in human history?
oceansky 2 hours ago
prodigycorp 2 hours ago
awestroke 2 hours ago
Because Zuck has chronic FOMO, he's said as much himself
zeroonetwothree 2 hours ago
But then how will Zuck win the billionaire dick measuring contest?
throwaw12 2 hours ago
> I don’t think Zuck’s vision is particularly compelling.
But he has to do it anyways, otherwise Meta can be disrupted easily.
Google, Apple has hardware, distribution channels for their products
Amazon has the marketplace and cloud
Microsoft has enterprise and cloud
Meta is always looking for ways to stay afloat
xnx 2 hours ago
Meta has 3.5 billion daily active users
throwaw12 2 hours ago
creddit 2 hours ago
Ran some of my internal benchmarks against this and I'm very unimpressed. I don't think this moves them into the OAI v Anthropic v Gemini conversation at all.
Major analytical errors in their response to multiple of my technical questions.
creddit 2 hours ago
Playing with this some more and it's actively not good. Just basic mathematical errors riddling responses. Did some basic adversarial testing where its responses are analyzed by Gemini and Gemini is finding basic math errors across every relatively (relative to Opus, Gemini or GPT can handle) simple ask I make. Yikes.
yalogin 33 minutes ago
Meta is in a weird spot. They caught up late to the game and instead of releasing llama as a chat bot they open sourced it, precisely because they lost the mind share. They thought chatbot is not their product and I am sure they are regretting it now. Mark is obsessed with becoming the android of something and he poured billions into the metaverse thinking he is first and failed. He then open sourced llama and wanted to be the android of llms. He ended up enabling groq but it didn’t benefit meta directly at all. They have no revenue or mind share path from llms but continue to pour billions into it. The only 1-1 mapping is with the glasses but that is a tough fit for the company given they are extremely allergic to privqcy and security.
Not sure what this is now.
zozbot234 28 minutes ago
Llama is available as a chatbot in WhatsApp.
throwaw12 2 hours ago
How is that Meta spent so much money for talent and hardware, but the model barely matches Opus 4.6?
Especially, looking at these numbers after Claude Mythos, feels like either Anthropic has some secret sauce, or everyone else is dumber compared to the talent Anthropic has
strulovich 2 hours ago
Meta did a bunch of mistakes, and look like Zuckerberg spent a lot of money on talent and made big swings to change it (that happened about a year ago)
I think it’s unrealistic to expect them to come back from that pit to the top in one year, but I wouldn’t rule them out getting there with more time. That’s a possible future. They have the money and Zuckerberg’s drive at the helm. It can go a long way.
impulser_ 2 hours ago
It's not even on par with Sonnet. It's on par with open source models and it not even open source and sit behind a private preview API.
Might as well not release anything.
coffeebeqn 2 hours ago
Matching Opus 4.6 would be pretty good? It’s the SOTA actually available model
reissbaker 2 hours ago
Muse Spark doesn't even match GLM-5.1 on most benchmarks. And GLM is open source!
solenoid0937 2 hours ago
It's benchmaxxed.
If they actually matched Opus 4.6 on such a short timeline, it would have been mighty impressive. (Keep in mind this is a new lab and they are prohibited from doing distills.)
throwaw12 2 hours ago
how do you know it's benchmaxxed?
solenoid0937 2 hours ago
luma 2 hours ago
prodigycorp 2 hours ago
username223 2 hours ago
Facebook is working with the talent that can’t find a job at some other company. It doesn’t surprise me they ship mediocrity.
zozbot234 2 hours ago
> has some secret sauce
Yup, it's called test-time compute. Mythos is described as plenty slower than Opus, enough to seriously annoy users trying to use it for quick-feedback-loop agentic work. It is most properly compared with GPT Pro, Gemini DeepThink or this latest model's "Contemplating" mode. Otherwise you're just not comparing like for like.
throwaw12 2 hours ago
> it's called test-time compute.
Why can't others easily replicate it?
coder68 2 hours ago
bguberfain 2 hours ago
We all know it... but I think they were very bold in this warning about using your private messages to train public models. _Your messages with AIs will be used to improve AI at Meta. Don't share information, including sensitive topics, about others or yourself that you don't want the AI to retain and use_
discopicante 2 hours ago
meta doesn't exactly instill confidence on using personal data responsibly. hard pass
binaryturtle an hour ago
Looks like it needs a meta account? As soon you hit enter it wants to log-in. I guess I won't try this any time soon. :)
toddmorey 2 hours ago
Question: since they've rebooted their approach to AI... have they given up on open models? There's no mention of open source or open weights or access to the models beyond their hosted services.
thegeomaster 2 hours ago
Alexandr Wang on Twitter [0] mentioned open source plans:
"this is step one. bigger models are already in development with infrastructure scaling to match. private api preview open to select partners today, with plans to open-source future versions. incredibly proud of the MSL team. excited for what’s to come!"
prodigycorp 2 hours ago
So the answer is: no. lol. Remember Llama 4 Behemoth, and how we were supposed to get more great models from it?
wmf 2 hours ago
This may be too large to run locally anyway. Maybe they will distill down some smaller open versions later.
edwcross an hour ago
What is the "BioTIER-refuse" thing mentioned in the "Bioweapons Refusal" graph?
I Googled it and found absolutely nothing.
Well, to be honest, I got 100% of websites containing the French word "boîtier" (box) with a typo.
Even on Google Scholar, the closest match is "BioTiER (Biological Training in Education and Research) Scholars Program", which is at least 10 years old and has nothing to do with that.
Is that an AI-generated image with an AI-generated name that has no physical existence?
EnderWT an hour ago
tekacs an hour ago
https://meta.ai/share/pe4HxOfv2Bp
Finding a little bit tricky to evaluate because the harness is unfortunately very, very bad (e.g. search is awful). Can't wait to try this in some real external services where we can see how it performs for real.
Definitely getting ordinary high-quality results, overall. But hard to test agentic behavior and hard to test prose quality, even, when just working off of the default chat interface.
One thing that stands out is that _for_ the quality it feels very, very fast. Perhaps it's just only very lightly loaded right now, but irrespective it's lovely to feel.
I'm quite impressed with the tone overall. It definitely feels much more like Opus than it does, like, GPT or Grok in the sense that the style is conversational, natural and enjoyable.
sidcool 3 hours ago
Will experiment with the model. But I am scared of sharing any information with the Zuck ecosystem.
syntaxing 42 minutes ago
Kinda crazy, it really felt like Meta had the lead in LLMs, especially during the early LLaMa days. What happened for them to fall so far behind? I don’t get how LLaMa 4 was such a big train wreck and they couldn’t correct the course like Google.
eranation 2 hours ago
So this is why Anthropic rushed the weirdest "pre-responsible-disclosure-totally-not-for-marketing" announcement yesterday? To make sure Spark doesn't steal their thunder? (Spark beats Opus 4.6 on some benchmarks...). Or did I become a bitter cynical old man.
hnav 2 hours ago
It's giving "OpenAI says its new model GPT-2 is too dangerous to release (2019)"
reducesuffering 35 minutes ago
[because it would start an arms race]. The very arms race we're in... They were right
signatoremo 34 minutes ago
13 days ago.
spindump8930 29 minutes ago
Yes, it's far more certain that meta released this, which is less convincing on evals, as a result of the mythos previews.
zurfer 3 hours ago
> Muse Spark is available today at meta.ai and the Meta AI app. We’re opening a private API preview to select users.
m4r1k 3 hours ago
So no Open-weight .. why one would choose Muse Spark instead of Anthropic, OpenAI, or Google models all featuring from good to amazing harness?
oliver236 2 hours ago
so glad its beating all the others on bioweapons refusal. this is what i most wanted out of the latest SOTA model
wmf 2 hours ago
Zuck has a lot more experience being summoned before Congress than you.
khalic 3 hours ago
Oh good, if they built a lab, I’m sure they took the time the precisely define what they mean by super intelligence? Right? …
52-6F-62 2 hours ago
If this is super intelligence, then it follows we must all be super-duper intelligence.
napolux an hour ago
I can't login. It sends me always the same code and it's not correct for them
visioninmyblood 2 hours ago
https://meta.ai/ this is where you can try it seems like the API is not publicly accessable yet. I feel they are very late to the game and do not show value to customers over other models.
p_stuart82 2 hours ago
late isn't the problem. private preview api and no reason to switch. that's just another hosted model
Artgor 3 hours ago
I'm cautiously waiting for the feedback from the first users. Meta has produced a lot of great models (LLama), maybe this is a comeback... but I'm cautious, as the jump in the quality is almost too high.
Also, I think people aren't used that using such models requires meta.ai or meta ai app.
solenoid0937 3 hours ago
My Meta friends say it's benchmaxxed af
loeg 2 hours ago
We used to call this "overfitting," but I suppose everything has to be maxxed now. Fitmaxxed?
conradkay 2 hours ago
It doesn't seem benchmaxxed, ARC AGI 2 score is quite bad (42.5%, GPT 5.4 is 76.1%) and coding is okay. But maybe this is the best Meta can do even benchmaxxing
The impressive part is multimodality, very plausible since there's less focus there by other labs (especially Anthropic)
santiagobasulto 2 hours ago
This looks like a very interesting model and very promising, especially after llama lost so much ground recently. I hope they release the weights
chrsw 3 hours ago
So Meta is not releasing open source models anymore?
vinni2 2 hours ago
I have to create meta account to access. No thanks.
ensen 2 hours ago
ComputerGuru 2 hours ago
So does this confirm the end of llama?
warthog 3 hours ago
Hoping the benchmarks are correct this time...
htrp 3 hours ago
Anyone done vibe testing at meta ai yet?
jansport123 2 hours ago
did they just copy the chatgpt ui?
Kuyawa 2 hours ago
> Meta AI isn't available yet in your country
Not my loss, will keep using DeepSeek then. Wake me up when my country is no longer in the wrong/right side of history.
ChrisArchitect 2 hours ago
Some more discussion: https://news.ycombinator.com/item?id=47692043
rvz 2 hours ago
Until you actually try the model itself, assume any benchmark presented to you as being part of the marketing material of the model, as it is not independently verified and completely biased.
The same is true with any other model, unless otherwise stated.
In the next few days, we'll see who Meta has paid to promote this model on social media.
OsrsNeedsf2P 2 hours ago
The only benchmark they show against SOTA models is in bioweapons refusal.
Edit: nvm I can't read, regular benchmarks against SOTA are there
ge96 an hour ago
funny how websites do that thing where it looks like you can use the product but soon as you hit enter, nope login first
babelfish 3 hours ago
Probably a better link: https://ai.meta.com/blog/introducing-muse-spark-msl/
dang 3 hours ago
I've put that link in the top text - thanks!
1970-01-01 2 hours ago
I can remember when AOL was an unstoppable giant. Except it wasn't. People eventually realized they could get a better, cheaper, faster experience with ISPs and search engines. The same path is unfolding before Meta. People have much better options, and plethora of Meta users will slowly leave until the big moat is drained. Zuck, go retire to your NZ bunker before Meta is forced to merge with another media company.
ehutch79 2 hours ago
How's the metaverse doing? It was the next big thing and how we're all going to be working inside it in... was it like 3 months ago?
Maybe they need to mine more libra coin first? or is it diem now? is that even still part of meta?
I'm sure this new AI is super intelligent and super awesome and will be writing all the code, making all the blog posts, and generating all our youtube shorts in 6 months.
serf 2 hours ago
what's with the negativity?
yeah, the metaverse got abandoned. Also: Meta was the only one to try the concept for the past X-umpteen years even though everyone in the industry ga-gas over virtual reality worlds and workplaces at every opportunity. It's literally Meta and Linden Labs (which has been on life support for 10+ years.)
The alternative is : no one does it and nothing gets abandoned, which the industry has shown itself to be exceedingly good at w.r.t VR for the past 40+ years.
To be clear: I have no faith in meta as a company; my problem lies in kicking an entity because they attempted something different.. I don't think that's productive, and it produces stuff like the past AI winters because groups get afraid of touching experimental concepts ever again lest they incur the wrath of the shareholder.
ehutch79 2 hours ago
It's not the failure here or there, it's a pattern. It's not even the failing, it's the excessive hype cycle.
We keep seeing things being overhyped, with not much thought behind it. Meta is particularly bad about it. They changed their name for the hype of their VR product, when VR was still niche and had a long way to go, and still does. They couldn't even figure out legs for launch.
Now they have a 'superintellegence'? Yeah, that sounds like just the latest in a line of bullshit. Why would this be different.
sva_ 2 hours ago
> Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something.
ehutch79 2 hours ago
Establishing a pattern of over hyping of projects that then disappear isn't a shallow dismissal.
captn3m0 2 hours ago
Libra/Diem got sold to the bank they were partnering with (Silvergate) for $200M, which then filed for Bankruptcy.