Google Reportedly Working on AI Agent That Handles Daily Tasks

Why Trust Techopedia
Key Takeaways

  • Google may debut "Project Jarvis," a Rabbit-inspired LAM, in December.
  • The tool is expected to debut with the Gemini LLM and assist with various web tasks.
  • A limited release to testers may occur to identify and address any bugs.

Google may launch “Project Jarvis,” its Rabbit-inspired model, in December, enabling web task automation in Chrome.

The company is set to preview the computer-using agent alongside its primary Gemini large language model (LLM) launch, The Information reports.

“Project Jarvis,” named in reference to J.A.R.V.I.S. from Iron Man, would operate exclusively with a web browser, mainly Chrome. Sources state that the tool could help users automate everyday web tasks, including taking and analyzing screenshots, pressing buttons, entering text, scheduling flights, handling research, and online shopping. The article does not specify if this will be for mobile or desktop.

The report indicates that Jarvis takes “a few seconds” to perform actions, indicating it likely relies on the cloud instead of functioning on-device.

Google is reportedly considering a limited release to testers to uncover and fix bugs. The Information warns that the company’s plan to showcase Jarvis in December may change.

AI Companies Push Boundaries with LAMs

A LAM is an AI system that translates human intentions into actions, enabling tasks like booking rooms and making complex decisions. LAMs learn from extensive user action datasets for strategic planning and real-time responses.

Leading AI companies are creating LAMs similar to the one described in the report about Google. For instance, Anthropic recently unveiled AI agents that autonomously perform complex tasks on computers through its chatbot, Claude. Claude processes on-screen data and acts on users’ behalf with their consent. OpenAI is also reportedly working on a comparable version.

Microsoft’s Copilot Vision will enable users to interact with it regarding the web pages they view. Apple Intelligence is also expected to understand screen content and perform tasks across multiple apps next year.