I am a Ph.D. Candidate (2022.08-) in Computing and Information Sciences at Rochester Institute of Technology (RIT), advised by Prof. Weijie Zhao. I previously interned at Amazon (Agents & Foundation Models) and ByteDance (Audio & Speech Processing). My research interests span GenAI, Agents, LLMs, AI Security, and Scalable & Trustworthy Machine Learning.
Prior to RIT, I worked as a Research Assistant at the Institute of Computer Vision (ICV), Shenzhen University, supervised by Prof. Linlin Shen (Honorary Professor at the University of Nottingham, UK). I received my B.S. degree in Computer Science and Technology in July 2021. During my undergraduate studies, I led a National Innovation Project funded by the Ministry of Education of China and received several ACM-ICPC Medals.
Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything
VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation
UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks
Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning
ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Parity LLM Data Valuation
Token-wise Influential Training Data Retrieval for Large Language Models
- Huawei Lin, Yunzhi Shi, Tong Geng, Weijie Zhao, Wei Wang, Ravender Pal Singh, Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything. arXiv preprint, arXiv:2511.02834, 2025.
- Huawei Lin, Tong Geng, Zhaozhuo Xu, Weijie Zhao, VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation. arXiv preprint, arXiv:2505.13439, 2025.
- Huawei Lin, Yingjie Lao, Tong Geng, Tan Yu, Weijie Zhao, UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models. arXiv preprint, arXiv:2502.13141, 2025.
- Huawei Lin, Jun Woo Chung, Yingjie Lao, Weijie Zhao, Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data. arXiv preprint, arXiv:2502.01634, 2025.
- Guoshenghui Zhao, Huawei Lin, Weijie Zhao, RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning. arXiv preprint, arXiv:2512.04457, 2025.
- Jun Woo Chung, Huawei Lin, Weijie Zhao, Locality-Sensitive Indexing for Graph-Based Approximate Nearest Neighbor Search. SIGIR 2025.
- Yanzhou Pan, Huawei Lin, Yide Ran, Jiamin Chen, Xiaodong Yu, Weijie Zhao, Denghui Zhang, Zhaozhuo Xu, ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Parity LLM Data Valuation. NAACL 2025.
- Huawei Lin, Yingjie Lao, Weijie Zhao, DMin: Scalable Training Data Influence Estimation for Diffusion Models. arXiv preprint, arXiv:2412.08637, 2024.
- Huawei Lin, Jikai Long, Zhaozhuo Xu, Weijie Zhao, Token-wise Influential Training Data Retrieval for Large Language Models. ACL 2024.
- Huawei Lin, Jun Woo Chung, Yingjie Lao, Weijie Zhao, Machine Unlearning in Gradient Boosting Decision Trees. KDD 2023.
- Huawei Lin, Haozhe Liu, Qiufu Li, Linlin Shen, Activation Template Matching Loss for Explainable Face Recognition. FG 2023.
Advisor: Yunzhi Shi
- Present a novel agent-based omni agent that coordinates existing foundation models to reason over text, images, video, and audio.
- Design a flexible agent system that interprets user intent and delegates subtasks.
Advisor: Rajath Kumar, Raphael Petegrosso
- Proposed an unsupervised self-improving framework for LLMs inference that enhances generation quality.
- Implemented methods to detect potential hallucinations by certainty score.
- Responsible for interpretability and explainability on deep learning (Biometrics).
- Proposed Explainable Channel Loss (ECLoss) for explainable face recognition.
- Proposed Stream Audio Understanding Chain method for real-time processing.
- Optimized pipeline throughput by 65% using multiprocessing.