Shuo Cai (蔡硕) is a Master of Philosophy student in the Department of Computing at The Hong Kong Polytechnic University (PolyU), supervised by Prof. Hongxia Yang. His research interests currently focus on model fusion and agentic tool use for Large Language Models (LLMs).

📝 Publications

  • 🧪 Shuo Cai, Yanggan Gu, Zihao Wang, Yuanyi Wang, Yibo Yan, Wenjun Wang, Yuhang Liu, Guanghao Zhu, Sirui Huang, Ming Li, Hongxia Yang. From Parameters to Behaviors: A Survey of Model Fusion for Large Language Models. In Preprints, 2026. 🔗[Preprint]
    Work: Surveys model fusion for LLMs from parameter-level methods to behavioral analysis.

  • 🧪 Yanggan Gu, Shuo Cai (Co-1st), Zihao Wang, Wenjun Wang, Yuanyi Wang, Pengkai Wang, Sirui Huang, Su Lu, Jianmin Wu, Hongxia Yang. FeatCal: Feature Calibration for Post-Merging Models. In arXiv 2026. 🔗[Paper] 💻[Code]
    Work: Calibrates features to improve post-merge model behavior.

  • 🧪 Wenjun Wang, Yanggan Gu (Co-1st), Shuo Cai (Co-1st), Yuanyi Wang, Pengkai Wang, Jianmin Wu, Hongxia Yang. E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring. In arXiv 2026. 🔗[Paper] 💻[Code]
    Work: Studies expert-guided quantization for post-merge models.

  • 🧪 Wenjun Wang, Shuo Cai (Co-1st), Congkai Xie, Mingfa Feng, Yiming Zhang, Zhen Li, Kejing Yang, Ming Li, Jiannong Cao, Hongxia Yang. InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models. In arXiv 2025 (withdrawn). 🔗[Paper]
    Work: Explores an FP8 training recipe for reasoning-enhanced language models.

  • 🧪 Shuo Cai, Su Lu, Qi Zhou, Kejing Yang, Zhijie Sang, Congkai Xie, Hongxia Yang. InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities. In arXiv 2025. 🔗[PDF]
    Work: Improves LLM reasoning with sample-efficient alignment data selection.

  • 🧪 Congkai Xie, Shuo Cai (Co-1st), Wenjun Wang, Pengxiang Li, Zhijie Sang, Kejing Yang, Yiming Zhang, Zhen Li, Guanghao Zhu, Zeyu Liu, Yang Yu, Yuhang Liu, Su Lu, Baoyi He, Qi Zhou, Xiaotian Han, Jianbo Yuan, Shengyu Zhang, Fei Wu, Hongxia Yang. InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning. In arXiv 2025. 🔗[PDF]
    Work: Builds compact language and multimodal models with stronger reasoning ability.

📖 Education

  • 2025.09 - present, Master of Philosophy, Department of Computing, The Hong Kong Polytechnic University.
  • 2021.09 - 2025.06, Undergraduate, Intelligent Science and Technology, School of Automation Science and Engineering, South China University of Technology.

💻 Internship

  • 2025.06 - 2025.09, Research Intern, Infix.ai, Shenzhen.