Close Menu

    Stay Ahead with Exclusive Updates!

    Enter your email below and be the first to know what’s happening in the ever-evolving world of technology!

    What's Hot

    Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

    June 30, 2026

    Why Gemini’s Multimodal Update Could Finally Close the Gap With GPT-5 — and Where It Still Falls Short

    June 30, 2026

    Anthropic Says Chinese Rival Alibaba Copied Claude at Scale. Here Is What Model Extraction Actually Means and Why It Matters

    June 30, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter)
    PhronewsPhronews
    • Home
    • Big Tech & Startups

      Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

      June 30, 2026

      Why Gemini’s Multimodal Update Could Finally Close the Gap With GPT-5 — and Where It Still Falls Short

      June 30, 2026

      Anthropic Says Chinese Rival Alibaba Copied Claude at Scale. Here Is What Model Extraction Actually Means and Why It Matters

      June 30, 2026

      OpenAI Just Built the One Thing That Could Make It Stop Depending on Nvidia. Here Is What Its First Custom Chip Does

      June 30, 2026

      Norway Just Banned AI in Elementary Schools. The Country That Already Removed Smartphones From Classrooms Is Now Drawing the Firmest Line Any Government Has Set Between AI and Children.

      June 26, 2026
    • Crypto

      Market Collapse: What Happened to NFTs?

      April 23, 2026

      Quantum Computing Advances Force Coinbase and Institutional Custodians to Rethink Crypto Security

      March 8, 2026

      AI Assisted Hacking Groups Target Crypto Firms With Multi-Layered Social Engineering

      February 18, 2026

      Global Crypto Regulations Expand as 2026 Begins With New Data Collection Frameworks and National Laws

      January 16, 2026

      Coinbase Bets on Stablecoin and On-Chain Growth as Key Market Drivers in 2026 Strategy

      January 10, 2026
    • Gadgets & Smart Tech
      Featured

      Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

      By fariehanJune 30, 2026
      Recent

      Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

      June 30, 2026

      Apple Just Rebuilt Siri With AI Across Every Device It Makes. WWDC 2026 Was Not a Software Update. It Was a Strategic Repositioning

      June 20, 2026

      The 1-Petaflop Superchip: How Nvidia RTX Spark Puts Local AI Agents Directly on Your Laptop.

      June 13, 2026
    • Cybersecurity & Online Safety

      Anthropic Says Chinese Rival Alibaba Copied Claude at Scale. Here Is What Model Extraction Actually Means and Why It Matters

      June 30, 2026

      Britain’s Cyber Agency Just Warned That AI-Generated Code Could Trigger the Next Wave of Catastrophic Security Failures. The Advisory Names Vibe Coding Directly and It Is Not a Mild Caution.

      June 26, 2026

      North Korea Compromised 144 AI Developer Packages in 88 Minutes Without Touching a Single Line of Source Code. The Mastra Attack Is the Most Targeted Supply Chain Strike Against AI Development Tools Ever Documented.

      June 26, 2026

      A Criminal Group Now Holds Working Credentials for More Than 70,000 Fortinet Firewalls Across 194 Countries and Is Still Active. Accenture, Oracle, Samsung and PwC Are Among the Named Victims of FortiBleed.

      June 24, 2026

      A Dataset of 24 Billion Stolen Usernames and Passwords Just Surfaced Online. Researchers Are Already Calling It the Largest Credential Exposure of 2026.

      June 24, 2026
    PhronewsPhronews
    Home»Artificial Intelligence & The Future»Why Gemini’s Multimodal Update Could Finally Close the Gap With GPT-5 — and Where It Still Falls Short
    Artificial Intelligence & The Future

    Why Gemini’s Multimodal Update Could Finally Close the Gap With GPT-5 — and Where It Still Falls Short

    fariehanBy fariehanJune 30, 2026No Comments
    Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Google Gemini’s multimodal update finally challenges GPT-5 on visual understanding. In June 2026, Google rolled out a major upgrade, targeting its longtime weak spot.

    Until now, OpenAI held a clear edge in interpreting images, video and mixed media. However, that edge is starting to narrow.

    What’s Actually New in Gemini’s Multimodal Update

    First, Google pushed Gemini-3.1-flash-image and Gemini-3-pro-image to general availability in June. The update adds video-to-image generation, a fresh capability inside Gemini-3.1-flash-image. 

    Developers can now upload a video file or paste a YouTube link. Then, Gemini generates thumbnails, movie posters, or summary infographics automatically. 

    In addition, Google launched gemini-embedding-2, its first multimodal embedding model. The model maps text, images, video, audio, and PDFs into one shared space. 

    This upgrade powers File Search, which returns visual citations alongside text. Meanwhile, Google retired Imagen completely, shifting every workflow toward Gemini models.

    How Gemini’s Multimodal Update Closes the Benchmark Gap

    Following the rollout, independent benchmark testing confirms real progress. Gemini 3.1 Pro scores 82.8 on multimodal and grounded tasks. GPT-5.5 trails significantly, posting only 70.4 in the same category. 

    Additionally, MMMU-Pro produces the widest gap between the two models. Gemini also edges ahead on the overall leaderboard, scoring 89 to 88. 

    However, the margin stays thin enough to avoid calling it a defeat. Gemini wins decisively in vision tasks, yet barely wins overall.

    GPT-5’s June Countermove

    In response, OpenAI answered swiftly with its own preview release. The company unveiled GPT-5.6 Sol, Terra, and Luna on June 26. 

    However, OpenAI chose depth over breadth for the release. Sol pushes coding performance further, setting a new benchmark record on Terminal-Bench 2.1. The model also strengthens biology research and shows measurable cybersecurity gains via ExploitBench. 

    Furthermore, Terra matches GPT-5.5’s performance while costing half as much. Consequently, OpenAI built GPT-5.6 for reasoning and agentic work, not visual tasks. The choice reveals where each company sees its strongest ground.

    Where the Gap Still Holds

    Despite Gemini’s gains, GPT-5.5 keeps a clear lead in pure reasoning tasks. It averages 85 points against Gemini’s 77.1. 

    However, CritPt shows the sharpest divide between the two models and pricing tells a similar story. GPT-5.5 costs $5 per million input tokens and $30 per million output tokens. Gemini 3.1 Pro charges only $2 and $12 for the same workload. 

    Therefore, cost favors Google, while reasoning power still favors OpenAI. Buyers simply cannot pick one model and check every box.

    Which Workflow Should Switch First

    Overall, teams running document, image, and video pipelines gain the most right now. File search across mixed media now works inside one unified API. However, coding teams and security researchers should wait before switching models. 

    GPT-5.6 remains available only through a limited partner preview. OpenAI plans a broader rollout within weeks, not immediately. Until then, GPT-5.5 keeps its edge in reasoning-heavy production work. 

    Most teams will likely run both models, routing tasks by strength. Ultimately, Gemini closes a real gap in vision, while OpenAI still holds reasoning ground tightly.

    AI benchmarks 2026 AI cost comparison AI image generation AI model comparison AI model performance AI model pricing AI reasoning AI vision models CritPt benchmark ExploitBench Frontier AI Models Gemini 3 Flash Gemini 3.1 Pro Gemini File Search Gemini multimodal update Gemini vs ChatGPT gemini-3-pro-image gemini-3.1-flash-image gemini-embedding-2 Google AI Google DeepMind Google Gemini June 2026 GPT-5 vs Gemini GPT-5.5 GPT-5.6 GPT-5.6 Sol GPT-5.6 Terra large language models MMMU-Pro multimodal AI multimodal benchmarks Nano Banana 2 OpenAI Terminal-Bench 2.1 video-to-image AI
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email
    fariehan

    Related Posts

    Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

    June 30, 2026

    Anthropic Says Chinese Rival Alibaba Copied Claude at Scale. Here Is What Model Extraction Actually Means and Why It Matters

    June 30, 2026

    OpenAI Just Built the One Thing That Could Make It Stop Depending on Nvidia. Here Is What Its First Custom Chip Does

    June 30, 2026

    Comments are closed.

    Top Posts

    Coinbase responds to hack: customer impact and official statement

    May 22, 2025

    Anthropic Will Use Claude User Chats For Data Training

    October 16, 2025

    Cursor AI Hits 1 Million Daily Users. Why Developers Are Switching to This Coding Tool

    March 23, 2026

    MIT Study Reveals ChatGPT Impairs Brain Activity & Thinking

    June 29, 2025
    Don't Miss
    Artificial Intelligence & The Future

    Tesla Is Teaching Its Self-Driving AI With Millions of Fake Crashes. Here Is Why That Might Make Real Roads Safer

    By fariehanJune 30, 2026

    Tesla’s self-driving AI learns from millions of simulated crashes before encountering similar risks on public…

    Why Gemini’s Multimodal Update Could Finally Close the Gap With GPT-5 — and Where It Still Falls Short

    June 30, 2026

    Anthropic Says Chinese Rival Alibaba Copied Claude at Scale. Here Is What Model Extraction Actually Means and Why It Matters

    June 30, 2026

    OpenAI Just Built the One Thing That Could Make It Stop Depending on Nvidia. Here Is What Its First Custom Chip Does

    June 30, 2026
    Stay In Touch
    • Facebook
    • Twitter
    About Us
    About Us

    Evolving from Phronesis News, Phronews brings deep insight and smart analysis to the world of technology. Stay informed, stay ahead, and navigate tech with wisdom.
    We're accepting new partnerships right now.

    Email Us: info@phronews.com

    Facebook X (Twitter) Pinterest YouTube
    Our Picks
    Most Popular

    Coinbase responds to hack: customer impact and official statement

    May 22, 2025

    Anthropic Will Use Claude User Chats For Data Training

    October 16, 2025

    Cursor AI Hits 1 Million Daily Users. Why Developers Are Switching to This Coding Tool

    March 23, 2026
    © 2025. Phronews.
    • Home
    • About Us
    • Get In Touch
    • Privacy Policy
    • Terms and Conditions

    Type above and press Enter to search. Press Esc to cancel.