Google’s latest AI model uses a web browser like you do

2 hours ago 3

Google is previewing a caller Gemini AI exemplary designed to navigate and interact with the web via a browser, letting AI agents bash things wrong interfaces designed for usage by radical and not robots. The model, called Gemini 2.5 Computer Use, uses “visual knowing and reasoning capabilities” to analyse a user’s petition and transportation retired a task, specified arsenic filling retired and submitting a form.

It tin beryllium utilized for UI investigating oregon navigating interfaces made for radical who don’t person an API oregon different nonstop transportation available. Other versions of this exemplary person been utilized for agentic features successful AI Mode and Project Mariner, a probe prototype that uses AI agents to transportation retired tasks connected its ain successful a browser, similar adding items to your cart based connected a database of ingredients.

Google’s announcement comes conscionable 1 time aft OpenAI revealed new apps for ChatGPT arsenic portion of its yearly Dev Day, and continues to absorption its attraction on its ChatGPT Agent feature that tin implicit analyzable tasks connected your behalf. Meanwhile, Anthropic had already released a mentation of its Claude AI exemplary with “computer use” past year. 

Google posted immoderate demo videos showing its machine usage instrumentality successful action, and notes that they are sped up 3x. 

Google says its machine usage exemplary “outperforms starring alternatives connected aggregate web and mobile benchmarks.” Unlike ChatGPT Agent and Anthropic’s machine usage tool, Google’s caller AI exemplary lone has entree to a browser — not an full machine environment. Google notes that it shows “it is not yet optimized for desktop OS-level control” and presently supports 13 actions, including opening a web browser, typing text, arsenic good arsenic dragging and dropping elements.

Gemini 2.5 Computer Use is disposable to developers done Google AI Studio and Vertex AI, but there’s besides a demo connected Browserbase, wherever you ticker arsenic it completes tasks, similar “Play a crippled of 2048” oregon “Browse Hacker News for trending debates.”

Read Entire Article