Google unveils Gemini 2.5 Computer Use model for smarter app and browser control

See how AI finally learns to click, type, and navigate like a human!

Jeeva Shanmugam
Highlights
  • Gemini 2.5 Computer Use lets AI perform real tasks on software interfaces without needing APIs.
  • It works in web browsers today, shows early promise on mobile apps, and keeps latency low.
  • Built-in safety features vet every action before it runs and help prevent risky operations.

Google has released the Gemini 2.5 Computer Use model, a new AI system designed to work directly with software interfaces. Most AI tools communicate with software through structured programming interfaces called APIs. But many real-world tasks, like filling out online forms or navigating websites, still require a human to click buttons and type text. This new model is built to handle exactly that.

Gemini 2.5: Google bridges the gap between AI and real-world tasks

Gemini 2.5 Computer Use works without APIs, operating directly on graphical interfaces. That means it can choose items from dropdown menus, scroll pages, log in step by step, or fill out forms automatically. Google says the model pairs strong performance with low latency, responding faster than comparable AI systems. Benchmarks such as Browserbase’s Online-Mind2Web place it among the top performers.

Image Credits: Google

How Gemini 2.5 Computer Use works

The model works through the new computer_use tool inside the Gemini API. The process is a simple loop, sketched in code after this list:

  • Input: The AI gets your request, a screenshot of the screen, and a list of recent actions.
  • Processing: It decides what action to take next, such as clicking or typing. If the action is sensitive, like buying something online, it waits for user confirmation.
  • Execution and Feedback: The client executes the action, captures a new screenshot and the current URL, and the loop repeats until the task is finished or stopped by safety rules or the user.
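
To make the loop concrete, here is a minimal sketch in Python. Playwright handles the browser side; propose_action is a hypothetical stand-in for the Gemini API call with the computer_use tool enabled, and the action dictionary format is an assumption for illustration, not the official schema.

    from playwright.sync_api import sync_playwright

    def propose_action(goal, screenshot, history):
        # Hypothetical stand-in for a Gemini API call with the
        # computer_use tool enabled: given the goal, the latest
        # screenshot, and recent actions, it would return the model's
        # next suggested step, e.g. {"type": "click", "x": 320, "y": 180},
        # {"type": "type", "text": "hello"}, or {"type": "done"}.
        raise NotImplementedError

    def run_agent(goal, start_url, max_steps=20):
        with sync_playwright() as p:
            browser = p.chromium.launch()
            page = browser.new_page()
            page.goto(start_url)
            history = []
            for _ in range(max_steps):
                screenshot = page.screenshot()  # Input: current screen
                action = propose_action(goal, screenshot, history)  # Processing
                if action["type"] == "done":
                    break
                if action["type"] == "click":  # Execution
                    page.mouse.click(action["x"], action["y"])
                elif action["type"] == "type":
                    page.keyboard.type(action["text"])
                history.append(action)  # Feedback for the next turn
            browser.close()

The key idea is that the model never touches the browser itself: it only proposes actions, and the client executes them and reports back with a fresh screenshot.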

Right now it works best with web browsers and shows early promise on mobile apps. It does not yet fully support desktop OS automation.

Built-in safety measures

Google built safety into the Gemini 2.5 Computer Use model because an AI that controls software can be risky. Key safety measures include:

  • Per-Step Safety Review: Every action is checked before it happens.
  • System Rules: Developers can block risky actions or require confirmation from the user.

Together, these checks help prevent mistakes or misuse of the AI.
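
As a rough illustration, a developer-side confirmation gate could look like the sketch below. The action schema and policy lists are assumptions made up for this example; Google's actual per-step safety review runs on the model side and is not shown here.

    # Hypothetical developer-side guardrail; the action schema and
    # policy lists are assumptions for illustration, not the official API.
    BLOCKED = {"delete_account", "send_payment"}         # never allowed
    NEEDS_CONFIRMATION = {"submit_order", "send_email"}  # ask the user first

    def allow_action(action, confirm):
        """Return True if the proposed action may run."""
        kind = action["type"]
        if kind in BLOCKED:
            return False            # system rules: hard block
        if kind in NEEDS_CONFIRMATION:
            return confirm(action)  # wait for explicit user approval
        return True                 # routine steps pass through

    # Usage: gate every step of the agent loop before executing it.
    # allow_action({"type": "submit_order"},
    #              confirm=lambda a: input("Allow? [y/N] ") == "y")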

Applications and availability

Google is already using the model internally, for example in UI testing to speed up software quality checks and to improve AI features in Search. Developers can access Gemini 2.5 Computer Use in public preview through Google AI Studio and Vertex AI. You can also try demos from Browserbase or integrate it using browser-automation tools like Playwright.
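
For orientation, here is roughly what enabling the tool looks like with the google-genai Python SDK. The model ID and configuration names reflect the public preview at the time of writing and may change, so treat them as assumptions and check the official docs.

    from google import genai
    from google.genai import types

    client = genai.Client()  # picks up the API key from the environment

    # Enable the computer_use tool for a browser environment. The model ID
    # and config names below reflect the public preview and may change.
    config = types.GenerateContentConfig(
        tools=[types.Tool(
            computer_use=types.ComputerUse(
                environment=types.Environment.ENVIRONMENT_BROWSER
            )
        )]
    )

    response = client.models.generate_content(
        model="gemini-2.5-computer-use-preview-10-2025",
        contents="Open example.com and read the page title.",
        config=config,
    )
    print(response.candidates[0])  # the model's proposed first action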

Overall, Gemini 2.5 Computer Use is a big step for AI agents. It lets them interact directly with interfaces the way humans do, which makes automation, testing, and AI assistants more capable. It is not perfect yet, but it shows a lot of promise.
