Spa-bench: A comprehensive benchmark for smartphone agent evaluation
Jingxuan Chen, Derek Yuen, Bin Xie, Yuhao Yang, Gongwei Chen, Zhihao Wu, Li Yixing, Xurui Zhou, and 3 more authors
In The Thirteenth International Conference on Learning Representations (ICLR), Spotlight (5.1%) , 2025