Pengxiang Li avatar

Pengxiang Li (李鹏翔)

Ph.D. Student @ PolyU

Email  /  CV  /  Github  /  Google Scholar
About Me

I am currently a Ph.D. student at The Hong Kong Polytechnic University. Previously, I obtained my MSc from Dalian University of Technology under the supervision of Prof. Huchuan Lu. My research focuses on Large Language Models, Multimodal GUI Agents, and Diffusion Models for Video Generation.

My recent research interests include:

News
  • [2025.01] InfiGUIAgent and InfiGUI-R1 papers released - advancing multimodal GUI agents!
  • [2025.01] Two papers accepted to ACL 2025 Findings and ICLR 2025!
  • [2024.10] Two papers accepted to WACV 2025 on VLM evaluation and TrackDiffusion!
Selected Publications
Research Interests
  • Multimodal GUI Agents: Building intelligent agents for GUI understanding and automation
  • Large Language Models: Layer normalization, depth scaling, efficient fine-tuning
  • Diffusion Models: Video generation, motion transfer, controllable synthesis
  • Autonomous Driving: Corner case evaluation, trajectory generation, safety assessment
Education
Ph.D. in Computer Science
The Hong Kong Polytechnic University
2025 - Present | Advisor: Prof. Hongxia Yang
MSc in Artificial Intelligence
Dalian University of Technology
2022 - 2025 | Advisor: Prof. Huchuan Lu
B.Sc. in Geographic Information Systems
Dalian Maritime University
2018 - 2022