
Ruisi Cai

Ph.D. Student, UT Austin

About Me

I’m a first-year Ph.D. student in the VITA Group of the Department of Electrical and Computer Engineering at the University of Texas at Austin, advised by Prof. Zhangyang (Atlas) Wang. Before that, I obtained my B.E. degree from the University of Science and Technology of China (USTC). [CV]

I’m currently working on machine learning, with a research focus on:

  • Efficient Model Training: Mixture of Experts (MoE), Model Merging & Recycling
  • Trustworthy Machine Learning Systems: Adversarial Robustness, Data Poisoning, Federated Learning

NEWS

  • May 2024. My internship project at NVIDIA, “Flextron: Many-in-One Flexible Large Language Model,” was accepted to ICML2024!
  • May 2024. “Learning to Compress Long Contexts by Dropping-In Convolutions” was accepted to ICML2024!
  • Sep 2023. I’ve just begun my internship at NVIDIA!
  • Sep 2023. “$\mathrm{H_2O}$: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models” was accepted to NeurIPS2023!
  • Jul 2023. “Robust Mixture-of-Expert Training for Convolutional Neural Networks” was accepted to ICCV2023!
  • Apr 2023. “Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?” was accepted to ICML2023!
  • Feb 2023. “Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?” is now available on arXiv!
  • Sep 2022. “Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets” was accepted to NeurIPS2022!
  • Sep 2022. My personal webpage is live!

Publication List

(* denotes equal contribution)

Flextron: Many-in-One Flexible Large Language Model
Ruisi Cai, Saurav Muralidharan, Greg Heinrich, Hongxu Yin, Zhangyang Wang, Jan Kautz, Pavlo Molchanov
ICML2024: International Conference on Machine Learning

Learning to Compress Long Contexts by Dropping-In Convolutions
Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen
ICML2024: International Conference on Machine Learning

Robust Mixture-of-Expert Training for Convolutional Neural Networks
Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Huan Zhang, Pin-Yu Chen, Shiyu Chang, Zhangyang Wang, Sijia Liu
ICCV2023: International Conference on Computer Vision [Paper] [Code]

$\mathrm{H_2O}$: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen
NeurIPS2023: Conference on Neural Information Processing Systems [Paper] [Code]

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?
Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
ICML2023: International Conference on Machine Learning [Paper] [Code]

Many-Task Federated Learning: A New Problem Setting and a Simple Baseline
Ruisi Cai, Xiaohan Chen, Shiwei Liu, Jayanth Srinivasa, Myungjin Lee, Ramana Kompella, Zhangyang Wang
CVPRW: 2nd Workshop on Federated Learning for Computer Vision [Paper]

Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets
Ruisi Cai*, Zhenyu Zhang*, Tianlong Chen, Xiaohan Chen, Zhangyang Wang
NeurIPS2022: Conference on Neural Information Processing Systems [Paper] [Code]

Try everything.

Realizing the past cannot be mended, I know the future may yet be pursued.