Close Menu

    Stay Ahead with Exclusive Updates!

    Enter your email below and be the first to know what’s happening in the ever-evolving world of technology!

    What's Hot

    Grid-Responsive AI: How Nvidia Plans to Turn Data Centers Into Power Assets with Emerald AI

    April 16, 2026

    Cyber Retaliation: How Iran-Linked Hackers Paralyzed Medical Giant Stryker

    April 16, 2026

    The Trillion-Dollar Exit: Why a SpaceX IPO Would Reshape the Space Economy

    April 15, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter)
    PhronewsPhronews
    • Home
    • Big Tech & Startups

      The Trillion-Dollar Exit: Why a SpaceX IPO Would Reshape the Space Economy

      April 15, 2026

      The Sacramento Blueprint: How California is Writing the U.S. AI Rulebook

      April 14, 2026

      Silicon Sovereignty: Microsoft’s $10B Play to Secure Japan’s AI Future

      April 14, 2026

      The Rule-Breaker: Why DeepSeek V4 is China’s Defiant Bid for AI Supremacy

      April 13, 2026

      Scheduling Monopoly? Why Nvidia’s SchedMD Deal Alarms Open-Source Devs

      April 13, 2026
    • Crypto

      Quantum Computing Advances Force Coinbase and Institutional Custodians to Rethink Crypto Security

      March 8, 2026

      AI Assisted Hacking Groups Target Crypto Firms With Multi-Layered Social Engineering

      February 18, 2026

      Global Crypto Regulations Expand as 2026 Begins With New Data Collection Frameworks and National Laws

      January 16, 2026

      Coinbase Bets on Stablecoin and On-Chain Growth as Key Market Drivers in 2026 Strategy

      January 10, 2026

      Tether Faces Ongoing Transparency Questions and Reserve Scrutiny Amid Massive Bitcoin Accumulation

      January 5, 2026
    • Gadgets & Smart Tech
      Featured

      AirPods Max 2: USB-C, Live Translation, and the H2 Upgrade

      By preciousMarch 26, 2026
      Recent

      AirPods Max 2: USB-C, Live Translation, and the H2 Upgrade

      March 26, 2026

      How ABB and Nvidia are Perfecting Industrial Robotics using AI Simulation

      March 20, 2026

      Neura Robotics Reaches €4B Valuation With Tether Backing

      March 12, 2026
    • Cybersecurity & Online Safety

      Cyber Retaliation: How Iran-Linked Hackers Paralyzed Medical Giant Stryker

      April 16, 2026

      Your Company Could Be Iran’s Next Target: What U.S. Tech Firms Need to Do Right Now

      April 6, 2026

      Google Is Warning Us About The Encryption Protecting Your Data Today. It May Not Survive Quantum Computing

      April 5, 2026

      Accenture and Anthropic Team Up on AI-powered Cybersecurity

      April 4, 2026

      Your BVN, Passport, and Bank Account May Already Be on the Dark Web. What Every Nigerian Must Do Right Now After the Banking Breaches

      April 4, 2026
    PhronewsPhronews
    Home»Artificial Intelligence & The Future»Researchers Uncover Critical RCE Flaws in Meta, Nvidia & Microsoft Inference Engines
    Artificial Intelligence & The Future

    Researchers Uncover Critical RCE Flaws in Meta, Nvidia & Microsoft Inference Engines

    oluchiBy oluchiNovember 27, 2025Updated:November 29, 2025No Comments
    Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email
    Image Generated from Oligo Security

    Microsoft, Meta, and Nvidia just faced a major wake-up call. Security researchers found critical remote code execution flaws in their AI inference engines. In this article, we explore the recent discovery of serious security flaws in key AI inference engines. 

    We will cover how these remote code execution vulnerabilities spread through code reuse. Then, we detail the affected frameworks and risks. Finally, we look at patches and steps for better security in AI systems.

    The ShadowMQ Vulnerability Pattern

    Security researchers at Oligo Security found a dangerous pattern called ShadowMQ. It started in Meta’s Llama Stack framework. 

    The issue? The unsafe use of ZeroMQ sockets with Python’s pickle deserialization for data handling. Now, pickle can run any code during unpickling. And this opens doors for remote attacks over networks.

    The spread of the flaw was due to copied code. Developers took files from one project to another. This incident happened without full checks, so it moved from one repository to the next.

    ShadowMQ was first spotted in October 2024 during routine scans on Meta’s Llama Stack. Oligo noticed unauthenticated ZMQ sockets and deserializing untrusted data via pickle. Now, teams must watch for these hidden chains.

    Affected Frameworks and Real-World Impact

    The bugs hit major players. Meta’s Llama Stack got CVE-2024-50050 with a CVSS score of 8.0. Nvidia’s TensorRT-LLM has CVE-2025-23254 at 9.3 severity. Microsoft’s Sarathi-Serve remains vulnerable without a CVE yet.

    Open-source ones like vLLM and SGLang also suffer. These tools power AI in big setups. Users include xAI, AMD, Intel, and clouds like AWS and Azure. Universities such as MIT and Stanford rely on them too.

    Enterprises are already feeling the heat from ShadowMQ exposures. Oligo’s scan revealed over 4,200 publicly reachable ZeroMQ ports running the vulnerable code. Many belong to Fortune 500 companies and government clouds.

    Attackers could steal models or add miners. Exploits might lead to full takeovers. One bad node could spread harm across clusters. Stats show AI threats rose 40 percent in 2025. This fits a trend where flaws hit development pipelines hard.

    Real-world testing proved the danger in minutes. Oligo researchers gained remote shells on unpatched clusters with a single payload. They extracted full Llama-3.1-70B weights in under 20 minutes. This shows why inference engines now top the list of high-value targets for nation-state actors. 

    Patches Released and Security Lessons

    Good news came with quick fixes. Meta switched to JSON serialization. Nvidia added HMAC checks in version 0.18.2. vLLM now defaults to its safe V1 engine. Modular Max Server uses msgpack instead. 

    Microsoft’s Sarathi-Serve needs urgent review. It’s a research tool but runs in production spots. Oligo urges all users to update now. 

    To avoid repeats of ShadowmMQ, audit copied code. Use safe data formats like JSON or msgpack. Test network exposures often. Don’t use pickle with untrusted data. And educate dev teams on the importance of serialization.

    As engines scale, security must keep pace. Firms now push for vetted reuse. This could shape safer standards ahead.

    See Also: https://phronews.com/nvidia-q3-2025-earnings-ai-bubble-debate/

    AI cluster security AI code reuse risks AI cybersecurity AI inference engine security AI Inference engines AI infrastructure risks AI model security breach AI patch updates AI security flaws AI security vulnerabilities AI supply chain security AI threat rise 2025 AI Vulnerabilities AWS AI systems Azure AI systems cloud AI security CVE-2024-50050 CVE-2025-23254 cybersecurity best practices for AI enterprise AI security Fortune 500 cybersecurity government cloud security JSON vs pickle security Llama Stack vulnerability Meta Meta Llama Stack Meta security patch Microsoft Microsoft Sarathi-Serve model theft risks msgpack security nation-state cyberattacks Nvidia Nvidia security patch Nvidia TensorRT-LLM Oligo security pickle deserialization risk RCE flaws remote code execution remote shell exploit Sarathi-Serve vulnerability secure AI development secure serialization SGLang vulnerability ShadowMQ TensorRT-LLM vulnerability vLLM engine update vLLM vulnerability vulnerable AI frameworks ZeroMQ exposed ports ZeroMQ vulnerability
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email
    oluchi
    • X (Twitter)
    • LinkedIn

    I am a content writer with over three years of experience. I specialize in creating clear, engaging, and value-driven content across diverse niches, and I’m now focused on the tech and business space. My strong research skills, paired with a natural storytelling ability, enable me to break down complex topics into compelling, reader-friendly articles. As an avid reader and music lover, I bring creativity, insight, and a sharp eye for detail to every piece I write.

    Related Posts

    Grid-Responsive AI: How Nvidia Plans to Turn Data Centers Into Power Assets with Emerald AI

    April 16, 2026

    The Trillion-Dollar Exit: Why a SpaceX IPO Would Reshape the Space Economy

    April 15, 2026

    The Sacramento Blueprint: How California is Writing the U.S. AI Rulebook

    April 14, 2026

    Comments are closed.

    Top Posts

    Coinbase responds to hack: customer impact and official statement

    May 22, 2025

    MIT Study Reveals ChatGPT Impairs Brain Activity & Thinking

    June 29, 2025

    From Ally to Adversary: What Elon Musk’s Feud with Trump Means for the EV Industry

    June 6, 2025

    Anthropic Will Use Claude User Chats For Data Training

    October 16, 2025
    Don't Miss
    Uncategorized

    Grid-Responsive AI: How Nvidia Plans to Turn Data Centers Into Power Assets with Emerald AI

    By preciousApril 16, 2026

    Data centers are power-hungry by design, but Nvidia wants them to work in the opposite…

    Cyber Retaliation: How Iran-Linked Hackers Paralyzed Medical Giant Stryker

    April 16, 2026

    The Trillion-Dollar Exit: Why a SpaceX IPO Would Reshape the Space Economy

    April 15, 2026

    The Sacramento Blueprint: How California is Writing the U.S. AI Rulebook

    April 14, 2026
    Stay In Touch
    • Facebook
    • Twitter
    About Us
    About Us

    Evolving from Phronesis News, Phronews brings deep insight and smart analysis to the world of technology. Stay informed, stay ahead, and navigate tech with wisdom.
    We're accepting new partnerships right now.

    Email Us: info@phronews.com

    Facebook X (Twitter) Pinterest YouTube
    Our Picks
    Most Popular

    Coinbase responds to hack: customer impact and official statement

    May 22, 2025

    MIT Study Reveals ChatGPT Impairs Brain Activity & Thinking

    June 29, 2025

    From Ally to Adversary: What Elon Musk’s Feud with Trump Means for the EV Industry

    June 6, 2025
    © 2025. Phronews.
    • Home
    • About Us
    • Get In Touch
    • Privacy Policy
    • Terms and Conditions

    Type above and press Enter to search. Press Esc to cancel.