Meta Launches Muse Spark, Its Most Capable AI Yet—But Gemini 3.1 Pro Still Leads the Pack

In short

Meta’s new Muse Spark marks a shift to closed, natively multimodal AI with agent-based reasoning.
Meta reports strong benchmark gains in health and search, but still trails Gemini on core reasoning and coding.
Built in nine months with far less compute, it signals a new efficiency-driven AI strategy.

Meta released Muse Spark on Wednesday, the first model developed by Meta Superintelligence Labs, the team assembled nine months ago under Chief AI Officer Alexandr Wang after Meta’s $14 billion Scale AI acquisition. It’s live now at meta.ai and in the Meta AI app, with a rollout to Facebook, Instagram, and WhatsApp coming in the next few weeks.

This isn’t just another chatbot upgrade or a new version of Llama. Muse Spark is natively multimodal: it processes images, text, and voice from the ground up, rather than bolting vision onto an existing text model. It includes visual chain-of-thought, tool-use support, and something Meta is calling “Considering mode”: a setup that runs several AI agents in parallel to tackle harder problems. That’s Meta’s answer to the extended reasoning modes in Google’s Gemini Deep Think and OpenAI’s GPT Pro.

“Muse Spark is the first step on our scaling ladder and the first product of a ground-up overhaul of our AI efforts,” Meta wrote in an official announcement. “To support further scaling, we are making strategic investments across the entire stack, from research and model training to infrastructure, including the Hyperion data center.”

The company worked with more than 1,000 physicians to curate training data for Muse Spark’s medical reasoning. The results on HealthBench Hard, an open-ended health query benchmark, stand out: Muse Spark scored 42.8, compared with 40.1 for GPT 5.4 and just 20.6 for Gemini 3.1 Pro. That’s not a marginal difference.

On agentic search (DeepSearchQA), Muse Spark also leads with 74.8, beating Gemini (69.7) and GPT 5.4 (73.6). On CharXiv Reasoning, which tests figure understanding from scientific papers, it scored 86.4, the highest among the models in the comparison.

For those into jailbreaking AI, the model was cracked open within minutes:

SYSTEM PROMPT LEAKAGE

Here’s the full Muse Spark system prompt from Meta!

I saw @AIatMeta forgot to open source it, so I’ve done them the courtesy

PROMPT:
“””
Who are you?

You are a friendly, smart, and agentic AI assistant. You are warm and a bit spirited …

Pliny the Liberator (@elder_plinius), April 8, 2026

But good isn’t the same as great. The overall benchmark picture shows Gemini 3.1 Pro still running ahead in most categories. The gap is most visible on ARC AGI 2, the abstract reasoning puzzle benchmark: Gemini scored 76.5 to Muse Spark’s 42.5. On coding (LiveCodeBench Pro), Gemini’s 82.9 edges out Meta’s 80.0. On MMMU Pro, a multimodal understanding benchmark, Gemini scored 83.9 versus 80.4. Meta’s own blog acknowledges current performance gaps in long-horizon agentic systems and coding workflows.

There’s also a notable strategic shift baked into this launch. Muse Spark is a closed model: its architecture and weights will not be released. That’s a sharp departure from Llama, which built Meta’s reputation in open AI circles. After Llama 4’s underwhelming reception earlier this year, Meta appears to have decided the next chapter needs to be written differently.

The company says it hopes to open-source future versions of Muse, but for now the code stays inside Meta. The tech giant’s stock climbed nearly 9% on Wednesday following the announcement and closed the trading day up 6.5% at $612.42.

“Considering mode” uses parallel agent orchestration to push the model’s ceiling higher. In that setup, Muse Spark hit 58% on Humanity’s Last Exam and 38% on FrontierScience Research, territory that makes it competitive with the most capable versions of Gemini and GPT, rather than their standard releases.

Meta is also introducing a shopping assistant that compares products and links directly to purchases, and plans to bring Muse Spark to Facebook, Instagram, and WhatsApp in the coming weeks, following the same playbook it has run since Llama 3 and putting the model in front of more than 3.5 billion users. A private API preview is opening to select developers.

The model, internally codenamed Avocado, was built in nine months, with Meta claiming that its new pretraining stack can reach the same capability level as Llama 4 Maverick using over 10 times less compute.

Muse Spark is described internally as a “small and fast” first step in the Muse family. A more capable version is already in development.
