Google is training the robot with Gemini AI to improve its navigation and task-completion abilities. DeepMind’s robotics team explained in a new research paper that Gemini 1.5 Pro’s longer context window, which determines how much information an AI model can process, allows users to more easily interact with the RT-2 robot using natural language instructions.
It works by taking a video tour of a designated area, such as a home or office space. Researchers then use Gemini 1.5 Pro to have the robot “watch” the video and learn about the environment. The robot can then carry out commands based on what it has observed, using verbal and/or image outputs, such as directing a user to a power outlet after being shown a phone and asked, “Where can I charge it?” According to DeepMind, the Gemini-powered robot achieved a 90% success rate on more than 50 user instructions in an operating area of over 9,000 square feet.
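For a rough sense of how this kind of pipeline could be wired up, here is a minimal sketch using the public google-generativeai Python SDK rather than DeepMind’s actual robot stack; the tour video file, prompt wording, and query are illustrative assumptions, not details from the paper.

```python
# A minimal, hypothetical sketch of the workflow described above, using the public
# google-generativeai Python SDK rather than DeepMind's internal robot stack.
# File names, prompt wording, and the query are illustrative assumptions.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-1.5-pro")

# Upload the walkthrough video of the operating area (a home or office tour).
# In practice the uploaded video may need a short wait to finish processing.
tour_video = genai.upload_file("office_tour.mp4")

def answer_navigation_query(user_query: str) -> str:
    """Ground a natural-language request in the space shown in the tour video."""
    prompt = (
        "You have watched the attached tour video of this space. "
        "Given the user's request, name the place the robot should navigate to "
        "and briefly explain why.\n"
        f"User request: {user_query}"
    )
    response = model.generate_content([tour_video, prompt])
    return response.text

# e.g., should point toward a wall outlet or power strip seen in the tour
print(answer_navigation_query("Where can I charge my phone?"))
```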
The researchers also found “preliminary evidence” that Gemini 1.5 Pro enabled the droid to plan how to carry out instructions, not just navigate. For example, when a user with a bunch of Coke cans on their desk asked the droid whether their favorite drink was available, Gemini “recognized that the robot should navigate to the fridge, check if there was a Coke can, and return to the user to report the results,” the team said. DeepMind said it plans to investigate these results further.
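Extending the same hedged sketch, that kind of planning could plausibly be elicited by asking the model for an ordered list of steps instead of a single destination; again, the prompt and output format here are assumptions for illustration, not the paper’s method.

```python
# Hypothetical follow-on to the sketch above: ask for a multi-step plan rather than
# a single destination. Prompt wording and output format are assumptions.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-1.5-pro")
tour_video = genai.upload_file("office_tour.mp4")  # same walkthrough video as before

prompt = (
    "You have watched the attached tour video of this space. "
    "The user asks: 'Is my favorite drink available?' (they usually drink Coke). "
    "List, as numbered steps, what the robot should do: where to navigate, "
    "what to check, and what to report back to the user."
)
response = model.generate_content([tour_video, prompt])
print(response.text)  # e.g., 1. go to the fridge  2. look for a Coke  3. report back
```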
While the video demo provided by Google is impressive, the obvious cuts after the droid recognizes each request hide the 10 to 30 seconds it takes to process these instructions, according to the research paper. It may be a while before we see more advanced environment-mapping robots living in our homes, but at the very least these robots might be able to find lost keys or wallets.