Google is reportedly set to introduce its own version of Rabbit’s large action model concept, codenamed “Project Jarvis,” as early as December, according to a report from The Information. This new tool aims to assist users by automating various web-based tasks such as gathering research, making purchases, and booking flights. Insights from three individuals familiar with the project suggest that Jarvis will leverage a future iteration of Google’s Gemini technology.
Designed specifically for use with web browsers, particularly Chrome, Jarvis is intended to streamline the completion of everyday tasks by interpreting screenshots and executing commands such as clicking buttons or entering text. Currently, it takes a few seconds to perform actions, indicating that while it is functional, there may still be areas for improvement. This focus on automating online interactions positions Jarvis as a potentially valuable tool for enhancing user efficiency.
The initiative comes at a time when major AI companies are racing to develop similar capabilities. Microsoft is working on Copilot Vision, which allows users to engage with it regarding the webpages they are viewing. Apple is expected to introduce a multi-app awareness feature with its upcoming Intelligence, while Anthropic has released a beta update of its Claude model that can operate computers, albeit with limitations. OpenAI is also reportedly in the process of developing a comparable functionality.
Despite the anticipation surrounding Jarvis, The Information warns that Google’s plans to showcase the project in December may change. The company is contemplating a limited release to a select group of testers to identify and resolve any potential bugs, suggesting a cautious approach to rolling out this innovative tool.
Leave a Reply