I am currently a research assistant at the Hong Kong University of Science and Technology (Guangzhou) campus (HKUST-GZ), under the supervision of Jiaheng Wei (魏嘉珩) and Zixin Zhong (钟梓昕) . At the same time, I am a master’s student at UIUC, currently pursuing my Master of Science degree while doing an internship. My graduate coursework focuses on Data Science and Finance.
My current research interests revolve around:
- Noisy Labels: Investigating methods to identify noisy labels (incorrect or missing labels) in various datasets (images, text, audio).
- LLM Unlearning: Exploring techniques to make large language models “forget” certain learned information.
- Multi-Agent Reinforcement Learning: I explore new approaches in MARL, particularly its applications in robot pathfinding problems.
Beyond these research endeavors, I am also exploring the potential of integrating LLM technology with the financial sector.
During my studies at UIUC, I was a researcher at the Irisk Lab, where I participated in and led a project that used machine learning models to help businesses assess risks.
I have a deep passion for music and enjoy the pleasure it brings me in my leisure time. I previously had a long-term collaboration with the Choir of Central University of Finance and Economics.
📝 Publications
🤖 LLM Unlearning

Label Smoothing Improves Gradient Ascent in LLM Unlearning
Zirui Pang, Hao Zheng, Zhijie Deng, Ling Li, Zixin Zhong, Jiaheng Wei
- We identify the instability of Gradient Ascent in LLM unlearning.
- We propose Smoothed Gradient Ascent (SGA) with a tunable smoothing rate.
- SGA achieves more stable and effective unlearning across benchmarks.

OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models
Hao Zheng, Zirui Pang, Ling li, Zhijie Deng, Yuhan Pu, Zhaowei Zhu, Xiaobo Xia, Jiaheng Wei
- We present OFFSIDE, a benchmark for multimodal unlearning based on football transfer rumors.
- It provides real-world, manually curated data and four evaluation settings to test forgetting, utility, and robustness.
- Our results reveal that current methods fail to unlearn visual rumors and are vulnerable to recovery and prompt attacks.

GUARD: Generation-time LLM Unlearning via Adaptive Restriction and Detection
Zhijie Deng, Chris Yuhao Liu, Zirui Pang, Xinlei He, Lei Feng, Qi Xuan, Zhaowei Zhu, Jiaheng Wei
- We propose GUARD, a generation-time unlearning framework for LLMs.
- It detects forget-related prompts and blocks forbidden tokens during generation.
- GUARD achieves effective forgetting without harming model utility.
🏞️ Label Noise

When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification
Zirui Pang, Haosheng Tan, Yuhan Pu, Zhijie Deng, Zhouan Shen, Keyu Hu, Jiaheng Wei
- We propose REVEAL, a framework that uses vision-language models to find and fix missing and noisy labels in image classification benchmarks.
- It ensembles multiple VLMs and human feedback to renovate test sets with soft, accurate labels.
- REVEAL greatly improves dataset quality and aligns closely with human judgments.
📖 Educations
- 2024.08 - 2025.12 (now), Master, University of Illinois Urbana-Champaign, IL, USA.
- 2019.09 - 2023.06, Undergraduate, Central University of Finance and Economics, Beijing, China.
💻 Internships
- 2022.02 - 2022.04, Assistant Software Development Engineer, Iflytek, China.
- 2024.12 - present, Research Assistant, HKUST-GZ, China.
🎵 Music
During my undergraduate studies, I maintained a long-term collaboration with the Central University of Finance and Economics Choir. Serving as both the choir’s piano accompanist and a tenor vocalist, I also composed several musical works.
- The Sun (太阳), the Choir of Central University of Finance and Economics’s Anthem - Silver Award, Beijing College Student Art Festival.
- Ruyuan·Dislocated Space-Time (如愿·错位时空), selected Work of 6th China College Students Network Culture Festival.
🌍 Visitor Map