Opera has unveiled its “Browser Operator,” a native AI agent integrated directly into the browser to perform tasks on behalf of users.

Browser Operator is designed as an intrinsic part of the browser rather than a standalone tool. It aims to empower users by automating mundane tasks such as online shopping, form completion, and content gathering.

Unlike server-based AI integrations, which require sending sensitive data to third-party servers, Browser Operator processes tasks locally within the Opera browser, keeping that data on the user’s device.

A demonstration video by Opera illustrates how Browser Operator can simplify a common task like purchasing socks. Instead of manually navigating through product pages or filling out payment forms, users can assign the entire process to Browser Operator, freeing them to focus on more meaningful activities, such as spending time with family and friends.

Utilizing natural language processing powered by Opera’s AI Composer Engine, Browser Operator interprets user instructions and executes corresponding tasks within the browser. All operations are conducted locally on the user’s device, leveraging the browser’s infrastructure for safe and swift command execution.

If Browser Operator encounters a sensitive step, such as entering payment details or confirming an order, it pauses and requests user input. Additionally, users have the freedom to intervene and control the process at any time.
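
The pause-on-sensitive-step behaviour can be pictured as a simple approval gate. The following is a minimal Python sketch under stated assumptions, not Opera's implementation: the `Step` type, the `SENSITIVE_KINDS` set, and the `run_task` function are hypothetical names chosen for illustration, and the article does not describe how Browser Operator classifies a step as sensitive.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical categories of steps that should never run without the user.
SENSITIVE_KINDS = {"enter_payment", "confirm_order"}


@dataclass
class Step:
    kind: str          # e.g. "navigate", "fill_form", "confirm_order"
    description: str   # human-readable summary shown to the user


def run_task(steps: List[Step], ask_user: Callable[[Step], bool]) -> List[str]:
    """Run steps in order, pausing for approval before sensitive ones.

    Returns a log of what happened so every step stays reviewable.
    """
    log: List[str] = []
    for step in steps:
        if step.kind in SENSITIVE_KINDS and not ask_user(step):
            log.append(f"stopped before: {step.description}")
            break  # the user declined, so nothing further is executed
        log.append(f"done: {step.description}")
    return log


if __name__ == "__main__":
    plan = [
        Step("navigate", "open the sock shop"),
        Step("fill_form", "add socks to the basket"),
        Step("confirm_order", "place the order"),
    ]
    # Simulate a user who declines the final confirmation.
    for line in run_task(plan, ask_user=lambda step: False):
        print(line)
```

In this sketch the `ask_user` callback stands in for whatever prompt the browser would show. Declining stops the run before the sensitive step, and the returned log records each step that did run.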

Every step taken by Browser Operator is transparent and reviewable, providing users with a clear understanding of task execution. In the event of mistakes, such as incorrect orders, users can instruct the AI to rectify the issue, for example by cancelling the order or adjusting a form.

Key Differentiators: Privacy, Performance, and Precision

Browser Operator differs from other AI-integrated tools in its local, privacy-first architecture. Unlike competitors that rely on screenshots or video recordings to understand webpage content, Opera’s approach uses the Document Object Model (DOM) tree and browser layout data, which together form a textual representation of the webpage.

This approach offers several advantages:

  • Faster Task Completion: Browser Operator directly accesses web page elements instead of interpreting pixels or emulating mouse movements. This avoids unnecessary overhead and lets it process the whole page at once, without scrolling.
  • Enhanced Privacy: With all operations conducted within the browser, user data, including logins, cookies, and browsing history, remains secure on the local device. No screenshots, keystrokes, or personal information are sent to Opera’s servers.
  • Easier Interaction with Page Elements: The AI can interact with elements not visible to the user, such as those behind cookie popups or verification dialogs, facilitating seamless access to web page content.
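
To make the idea of a textual page representation concrete, here is a minimal Python sketch. It is an illustration only and not how Opera's engine works: it parses static HTML with the standard library's `html.parser` as a stand-in for a live DOM tree, it ignores layout data entirely, and the `DomOutline` class, the `outline` function, and the `INTERACTIVE_TAGS` set are hypothetical names.

```python
from html.parser import HTMLParser
from typing import List

# Elements without closing tags; they must not increase the nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}
# Hypothetical choice of which elements an agent could act on.
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


class DomOutline(HTMLParser):
    """Collects an indented text outline of a page plus its interactive elements."""

    def __init__(self) -> None:
        super().__init__()
        self.depth = 0
        self.lines: List[str] = []
        self.interactive: List[str] = []

    def handle_starttag(self, tag, attrs):
        attr_text = " ".join(f'{k}="{v}"' for k, v in attrs if v is not None)
        label = f"<{tag} {attr_text}>" if attr_text else f"<{tag}>"
        self.lines.append("  " * self.depth + label)
        if tag in INTERACTIVE_TAGS:
            self.interactive.append(label)
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS:
            self.depth = max(0, self.depth - 1)

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.lines.append("  " * self.depth + text)


def outline(html: str) -> DomOutline:
    """Parse an HTML string and return the populated outline."""
    parser = DomOutline()
    parser.feed(html)
    parser.close()
    return parser


if __name__ == "__main__":
    page = """
    <div id="cookie-popup"><button id="accept">Accept</button></div>
    <form id="checkout">
      <input name="size" value="M">
      <button id="add">Add to basket</button>
    </form>
    """
    result = outline(page)
    print("\n".join(result.lines))
    print(result.interactive)
```

The outline lists the form controls alongside the cookie popup's button, regardless of which one would be drawn on top. No pixels are interpreted, and the whole page is available in one pass.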

By enabling the browser to autonomously perform tasks, Opera is taking a significant step toward transforming browsers into “agentic” tools that not only access the internet but also actively enhance productivity.
