Google DeepMind recently unveiled two cutting-edge artificial intelligence models designed to assist developers in constructing robots capable of understanding their surroundings and executing intricate tasks. The company’s latest offerings expand on the earlier Gemini Robotics models introduced in March, incorporating advanced cognitive abilities that facilitate immersive experiences, as outlined in a blog post on Thursday, September 25th.
The first model, Gemini Robotics 1.5, is a vision-language-action (VLA) model that translates visual data and instructions into motor commands. The second model, Gemini Robotics-ER 1.5, is a vision-language model (VLM) that devises multi-step plans to accomplish a given mission. While Gemini Robotics-ER 1.5 was released to developers on Thursday, Gemini Robotics 1.5 is available only to selected partners.
According to Carolina Parada, the senior engineering manager at Google AI, these models represent a pivotal advancement toward the development of robots endowed with the intelligence and agility to navigate the complexities of the physical world. Parada highlighted the significance of Gemini Robotics 1.5 as a significant stride towards achieving Artificial General Intelligence (AGI) in real-world applications. By incorporating agentic capabilities, these systems are elevated beyond mere reactionary models, enabling them to reason, plan, engage with tools, and generalize solutions.
The rise of robots in Silicon Valley has gained momentum, with various models now equipped to comprehend natural language instructions and tackle complex duties. Apart from Google DeepMind’s Gemini Robotics, other players such as Meta’s PARTNR, Nvidia’s Isaac Groot N1, and Tesla’s Optimus are actively developing humanoid robots capable of executing a wide range of tasks. Meanwhile, startups like Figure AI, Cobot, and FieldAI are likewise contributing to the evolution of robotics by securing substantial investments and deploying versatile robots across industries like construction, manufacturing, and urban delivery.
Skild AI recently introduced an advanced AI model, the Skild Brain, which enables different types of robots to emulate human-like thinking, functioning, and responsiveness. This breakthrough underscores the ongoing efforts in the realm of artificial intelligence and robotics to push the boundaries of innovation and enhance the capabilities of automated systems.
For comprehensive coverage of AI developments, users can subscribe to PYMNTS’ daily AI Newsletter. Additionally, reports from various sectors, including procurement, finance, and technology, showcase the growing influence of AI and robotics in reshaping industries and driving technological progress. Partnering with innovators and pioneers remains a key focus for collaborative opportunities in this dynamic landscape.
