Enhance browser tool with modal dialog handling
TL;DR
The browser tool now surfaces pending modal dialogs in snapshots and returns `blockedByDialog` status. Users can interact with specific dialogs via the new `--dialog-id` flag.
## What changed OpenClaw added modal dialog support to its browser tool. Snapshots now display any pending dialogs, and the tool returns a blockedByDialog status when one appears. A new --dialog-id flag lets users target and act on specific dialogs directly from the command line.
The update rolled out on 18 May 2026 as part of the standard CLI release. No extra configuration or paid tier is required.
## Why it matters Modal dialogs break most browser agents on real sites such as login flows, cookie banners, and confirmation prompts. Handling them inside the existing snapshot and command structure removes a common failure point without adding new services.
This move strengthens OpenClaw against closed agents that still stall on pop-ups. It bets that reliable browser control will matter more than raw model size for daily automation work.
## How to use it Pull the latest OpenClaw binary or rebuild from the GitHub repo. Run your normal browser command and inspect the returned snapshot for any blockedByDialog flag.
When a dialog appears, append --dialog-id followed by the ID shown in the snapshot to accept, dismiss, or fill fields. Test first on a non-critical site.
## Watch for Success shows up as fewer manual interventions on sites that previously required human clicks. The bet breaks if dialog detection misses nested or JavaScript-triggered modals. Expect the next step to be file-upload and CAPTCHA handling in the same command surface.
Harsh’s take
For a solo operator the change cuts down on brittle browser scripts that die on the first popup. The trade-off is one more flag to remember and test whenever you script a new site.
Most Vibe Builders will still hit edge cases where the dialog ID changes between runs. Treat this as incremental reliability, not a finished browser layer.
Update your current browser tasks with the new flag this week and log every dialog that still escapes detection.
by Harsh Desai
About OpenClaw
View the full OpenClaw page →All OpenClaw updatesMore from OpenClaw
- FeatureExpand QA-Lab with runtime parity scenarios
Added comprehensive runtime parity tiers and token-efficiency artifacts to the QA-Lab, including specific checks for Codex-vs-Pi compatibility and tool fixture coverage.
- App UpdateUpdate Node.js requirement and Pi packages
Raised the minimum supported Node.js version to 22.19 and updated Pi packages to version 0.75.1 to ensure compatibility with the latest runtime features.
- App UpdateOptimize Gateway startup and restart latency
Reduced restart ready latency by overlapping startup logging and plugin-service initialization with channel sidecars while maintaining strict readiness gating.