Xurui Zhou

xurui.jpg

I am currently a Master student at the School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), supervised by Prof. Rui Shao and Prof. Gongwei Chen. Before that, I received my Bachelor’s degree in Computer Science and Technology from Harbin Institute of Technology (Shenzhen) in 2024. I am currently seeking suitable PhD opportunities or industry positions.

My research interests focus on the broad areas of multimodal learning, AI agent and RL. Recently, I focus on

  • Multimodal Large Language Models (MLLM)
  • MLLM-based Agent
  • Reinforcement Learning for MLLM

news

Dec 01, 2025 A new paper on GUI agents and reinforcement learning has been released on arXiv!
Jun 26, 2025 One paper about GUI Agent is accepted by ICCV 2025 as Highlight! :sparkles:
Feb 11, 2025 One paper about GUI Agent Benchmark is accepted by ICLR 2025 as Spotlight! :sparkles:

selected publications

  1. HiconAgent: History Context-aware Policy Optimization for GUI Agents
    Xurui Zhou, Gongwei Chen, Yuquan Xie, Zaijing Li, Kaiwen Zhou, Shuai Wang, Shuo Yang, Zhuotao Tian, and 1 more author
    arXiv preprint arXiv:2512.01763, 2025
  2. Less is More: Empowering GUI Agent with Context-Aware Simplification
    Gongwei Chen, Xurui Zhou, Rui Shao, Yibo Lyu, Kaiwen Zhou, Shuai Wang, Wentao Li, Yinchuan Li, and 2 more authors
    In International Conference on Computer Vision (ICCV), Highlight , 2025