Shuo Cai (蔡硕) is a Master of Philosophy student in the Department of Computing at The Hong Kong Polytechnic University (PolyU), supervised by Prof. Hongxia Yang. His research interests currently focus on model fusion and agentic tool use for Large Language Models (LLMs).

📝 Publications

🧪 Shuo Cai, Yanggan Gu, Zihao Wang, Yuanyi Wang, Yibo Yan, Wenjun Wang, Yuhang Liu, Guanghao Zhu, Sirui Huang, Ming Li, Hongxia Yang. From Parameters to Behaviors: A Survey of Model Fusion for Large Language Models. In Preprints, 2026. 🔗[Preprint] 💻[Code]
Contribution: Defines model fusion for LLMs and organizes the literature across parameter-, representation-, and behavior-level fusion. It provides a structured taxonomy, compares representative methods, and highlights evaluation settings, applications, challenges, and future directions.
🧪 Yanggan Gu, Shuo Cai (Co-1st), Zihao Wang, Wenjun Wang, Yuanyi Wang, Pengkai Wang, Sirui Huang, Su Lu, Jianmin Wu, Hongxia Yang. FeatCal: Feature Calibration for Post-Merging Models. In arXiv 2026. 🔗[Paper] 💻[Code]
Contribution: Identifies feature drift as a key cause of performance degradation after model merging. It proposes layer-wise closed-form feature calibration using small calibration sets to repair merged models without expensive retraining.
🧪 Wenjun Wang, Yanggan Gu, Shuo Cai (Co-1st), Yuanyi Wang, Pengkai Wang, Jianmin Wu, Hongxia Yang. E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring. In arXiv 2026. 🔗[Paper] 💻[Code]
Contribution: Studies post-merge quantization, where quantization is applied after model merging rather than to individual source models. It introduces expert-guided targets and merged-weight anchoring to stabilize low-bit merged models and reduce quantization-induced capability loss.
🧪 Wenjun Wang, Shuo Cai (Co-1st), Congkai Xie, Mingfa Feng, Yiming Zhang, Zhen Li, Kejing Yang, Ming Li, Jiannong Cao, Hongxia Yang. InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models. In arXiv 2025. 🔗[Paper] 💻[Code]
Contribution: Builds an end-to-end FP8 training recipe for reasoning-enhanced language models. It combines continual pre-training and supervised fine-tuning to improve training efficiency while preserving reasoning performance.
🧪 Shuo Cai, Su Lu, Qi Zhou, Kejing Yang, Zhijie Sang, Congkai Xie, Hongxia Yang. InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities. In arXiv 2025. 🔗[PDF] 💻[Code]
Contribution: Proposes a scalable alignment framework that selects high-quality reasoning data through multidimensional criteria such as diversity, difficulty, and quality. It combines SFT and DPO to improve LLM reasoning while using substantially less training data.
🧪 Congkai Xie, Shuo Cai (Co-1st), Wenjun Wang, Pengxiang Li, Zhijie Sang, Kejing Yang, Yiming Zhang, Zhen Li, Guanghao Zhu, Zeyu Liu, Yang Yu, Yuhang Liu, Su Lu, Baoyi He, Qi Zhou, Xiaotian Han, Jianbo Yuan, Shengyu Zhang, Fei Wu, Hongxia Yang. InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning. In arXiv 2025. 🔗[PDF]
Contribution: Develops a low-cost training pipeline for small language models and multimodal small language models with strong reasoning ability. The work targets efficient deployment while maintaining competitive performance on reasoning tasks.

📖 Education

2025.09 - present, Master of Philosophy, Department of Computing, The Hong Kong Polytechnic University.
2021.09 - 2025.06, Undergraduate, Intelligent Science and Technology, School of Automation Science and Engineering, South China University of Technology.

💻 Internship

2025.06 - 2025.09, Research Intern, Infix.ai, Shenzhen.