Westworld v1

Westworld v1

Benchmarking and Training Web Agents in Highly Realistic Simulators via Verifiable Rewards

BrowserBench

BrowserBench

The First Benchmark for Browser Infrastructure Stealth

Web Bench: The Current State of Browser Agents

Web Bench: The Current State of Browser Agents

Benchmarking browser agents on ~2.5k tasks across 452 websites