Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published 22 days ago • 51
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models Paper • 2511.11134 • Published 27 days ago • 31