2026-03-14 · Lab Notes ◆ Experimental
Browser Agents
Playwright MCP made browser automation a standard agent tool. Reliability is the remaining problem.
Success rate85–92%
Median task30–90s
5-step workflow59% compound
What MCP Changed
Before
DOM scraping · brittle selectors · per-framework integration
→
After
ARIA snapshots · typed actions · standardized MCP protocol
The agent sees semantic page structure, not raw HTML. It clicks based on what an element does, not where it sits.
Success by Task Type
Each step at ~90% reliability. A 5-step workflow compounds to 59%. This is the ceiling.
Three Failure Modes
Timing
SPA content loads after page event. Agent clicks too early.
×
Detection
Cloudflare/Datadome flag automated browsers. Arms race.
Noise
Cookie banners, modals, popups — agent can't dismiss reflexively.
×
Recovery
No human intuition to reroute when an action fails.
What's Converging
Vision modelsScreenshot + click fallback
Agent memorySite-specific patterns
Session replayRecord → replay → explore
Browser automation is becoming a standard agent capability, not a product category. The terminal equivalent for the web.