UIArena

v1
GitHub

Static UI Grounding Benchmark for AI Agents — Orthogonal testing across Visual, DOM, and Accessibility dimensions

15 environments·5 variants per page·3 dimensions (Visual, DOM, A11y)
c0Baseline (all optimal)
c1No Visual (mincss)
c2No A11y
c3Noisy DOM
c4Hard (all degraded)
UIArena v1·3 pages × 5 variants = 15 environments·GitHub