Gemini Robotics 1.5: Enabling robots to plan, think and use tools to solve complex tasks
Sep 25, 2025
Data from YouTube Data API v3
Video Overview
Video Details
Published: 8 months ago
Duration: 4:14
Video ID: UObzWjPb6XM
Language: en
Category: Science & Technology
Privacy: Public
Made for Kids: No
Video Type: Regular Video
Performance Metrics
Views: 155.4K
Likes: 3.2K
Comments: 204
Engagement Rate: 2.17%
Likes per 100 views: 2.04
Comments per 1K views: 1.31
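The page does not say how the derived metrics are computed. The listed figures are consistent with a common definition in which the engagement rate is likes plus comments divided by views (2.04 likes per 100 views plus roughly 0.13 comments per 100 views gives 2.17%). The sketch below assumes that definition. The function name `engagement_metrics` and the dictionary keys are hypothetical, chosen only for illustration.

```python
def engagement_metrics(views: int, likes: int, comments: int) -> dict[str, float]:
    """Derive per-view ratios from raw counts.

    Assumption: engagement rate = (likes + comments) / views.
    The source page does not confirm this formula.
    """
    if views <= 0:
        raise ValueError("views must be positive")
    return {
        "engagement_rate_pct": round(100 * (likes + comments) / views, 2),
        "likes_per_100_views": round(100 * likes / views, 2),
        "comments_per_1k_views": round(1000 * comments / views, 2),
    }


# Inputs are the rounded counts shown on the page (155.4K views, 3.2K likes),
# so the results only approximate the listed values, which were presumably
# computed from exact counts.
metrics = engagement_metrics(views=155_400, likes=3_200, comments=204)
print(metrics)
```

Because the page shows views and likes rounded to the nearest hundred, feeding those rounded counts back in will not reproduce the listed likes ratio or engagement rate to the second decimal place.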
Description
We’re powering an era of physical agents with Gemini Robotics 1.5 — enabling robots to perceive, plan, think, use tools and act to better solve complex, multi-step tasks.
🤖 Gemini Robotics 1.5 is our most capable vision-language-action (VLA) model that turns visual information and instructions into motor commands for a robot to perform a task. This model thinks before taking action and shows its process, helping robots assess and complete complex tasks more transparently. It also learns across embodiments, accelerating skill learning.
🤖 Gemini Robotics-ER 1.5 is our most capable vision-language model (VLM) that reasons about the physical world, natively calls digital tools and creates detailed, multi-step plans to complete a mission. This model now achieves state-of-the-art performance across spatial understanding benchmarks.
We’re making Gemini Robotics-ER 1.5 available to developers via the Gemini API in Google AI Studio and Gemini Robotics 1.5 to select partners.
Learn more: https://deepmind.google/models/gemini-robotics/
------
Subscribe to our channel https://www.youtube.com/@googledeepmind
Find us on X https://twitter.com/GoogleDeepMind
Follow us on Instagram https://instagram.com/googledeepmind
Add us on LinkedIn https://www.linkedin.com/company/deepmind/