
Introduction
Master's student in Computer Science at the University of Electronic Science and Technology of China (UESTC) (GPA ranking: top 1.3%), specializing in:
- High-efficiency transfer learning for Vision-Language Models (VLMs)
- Training and test-time adaptation for Vision-Language-Action Models (VLAs)
Published two CVPR (CCF-A) papers as first or co-first author. Awarded the National Scholarship and Outstanding Graduate honors.
News
[2025.5.20] 🔥 Our paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning" is available!
[2025.5.19] 🔥 Our paper "Policy Contrastive Decoding for Robotic Foundation Models" is available!
Education & Experience
Master's student at the University of Electronic Science and Technology of China (UESTC), 2023 - Present
Bachelor's student at the University of Electronic Science and Technology of China (UESTC), 2019 - 2023
Publications
Note: * indicates equal contribution.

Policy Contrastive Decoding for Robotic Foundation Models
Shihan Wu*, Ji Zhang*, Xu Luo, Junlin Xie, Jingkuan Song, Heng Tao Shen, Lianli Gao
Robotics · Vision-Language-Action Models · Contrastive Decoding
2025.5
[Project Page] [PDF] [arXiv] [Code]

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning
Ji Zhang*, Shihan Wu*, Xu Luo, Hao Wu, Lianli Gao, Heng Tao Shen, Jingkuan Song
Robotics · Vision-Language-Action Models · Spurious Correlation
2025.5
[Project Page] [PDF] [arXiv] [Code]


Rethinking Conditional Prompt Tuning for Vision-Language Models
Ji Zhang, Shihan Wu, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen
Vision-Language Models · Transfer Learning · Prompt Tuning
2024.8

Awards & Honors
Outstanding Graduate Student, University of Electronic Science and Technology of China (UESTC), 2025
National Scholarship, Ministry of Education of the People's Republic of China, 2024
Outstanding Graduate, University of Electronic Science and Technology of China (UESTC), 2023