Research

Welcome to my research page! Here you can find my latest work and interests in multimodal large language models, reinforcement learning, and agents.

Reinforcement Learning | Agent | Vision-Language Models | Post-Training

Reinforcement Learning

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe. Wenjin Hou*, Shangpin Peng*, Weinong Wang, Zheng Ruan, et al. arXiv 2026 | paper | code
Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs. Shangpin Peng, Weinong Wang, Zhuotao Tian, Senqiao Yang, et al., Min Zhang. ICLR 2026 | paper | code

Vision-Language Models

HunyuanOCR Technical Report. Hunyuan Vision Team, Pengyuan Lyu, Xingyu Wan, Gengluo Li, Shangpin Peng, et al., Chengquan Zhang. Core contributor of the technical report | paper | code
Mitigating Object Hallucinations via Sentence-Level Early Intervention. Shangpin Peng, Senqiao Yang, Li Jiang, Zhuotao Tian. ICCV 2025 | paper | code
Chronicles-OCR: A Cross-Temporal Perception Benchmark for the Evolutionary Trajectory of Chinese Characters. Gengluo Li, Shangpin Peng, Xingyu Wan, Chengquan Zhang, et al., Han Hu. arXiv 2026 | paper | code

Agent

Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents. Zhengyang Tang, et al., Shangpin Peng, et al., Chengquan Zhang, Han Hu. arXiv 2026 | paper | code

Feel free to explore the projects above, or contact me for more information.