Charlie Ruan

Charlie Ruan

MSCS @ CMU

Carnegie Mellon University

Biography

I am a second-year MSCS student at Carnegie Mellon University. I am fortunate to work with Prof. Tianqi Chen as part of the Catalyst Group, where I have also collaborated with Prof. Zhihao Jia. I am currently visiting UC Berkeley, working with Prof. Ion Stoica as part of the Sky Computing Lab.

I focus on the systems problems in machine learning systems, with various open-source development experience. I am the current lead of WebLLM and a core contributor to MLC-LLM.

Prior to coming to CMU, I obtained my B.S. degree in Computer Science and Operations Research from Cornell University, where I was fortunate to work with Prof. Christopher De Sa on distributed training and with Prof. Jim Dai on stochastic processes.

I am applying for Fall 2025 PhD programs in machine learning systems. Please feel free to contact me!

Interests
  • Machine Learning Systems
  • Distributed Systems
Education
  • MS in Computer Science, 2025

    Carnegie Mellon University

  • BS in Computer Science & Operations Research, 2023

    Cornell University

Publications

A System for Microserving of LLMs
WebLLM: A High-Performance In-Browser LLM Inference Engine
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Local deployment of large-scale music AI models on commodity hardware
Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development
Coordinating Distributed Example Orders for Provably Accelerated Training

Projects

 
 
 
 
 
MLC-LLM
Core Contributor
June 2023 – Present Pittsburgh, PA
Enable universal native deployment for LLMs through machine learning compilation techniques. GitHub (19.3k stars)
 
 
 
 
 
WebLLM
Project Lead
June 2023 – Present Pittsburgh, PA
Leading the project to bring LLMs to run locally in client-side browser with WebGPU acceleration. GitHub (13.9k stars); talk at Google WebAI Summit ‘24

Research Experience

 
 
 
 
 
Prof. Ion Stoica, UC Berkeley
Visiting Student Researcher
Prof. Ion Stoica, UC Berkeley
August 2024 – Present Berkeley, CA
Sky Computing Lab; Investigating GPU kernel language/compiler
 
 
 
 
 
Prof. Tianqi Chen & Prof. Zhihao Jia, Carnegie Mellon University
Research Assistant
Prof. Tianqi Chen & Prof. Zhihao Jia, Carnegie Mellon University
March 2024 – Present Pittsburgh, PA
Catalyst Group; Investigating distributed LLM serving systems
 
 
 
 
 
Prof. Christopher De Sa, Cornell University
Research Assistant
Prof. Christopher De Sa, Cornell University
September 2022 – June 2023 Ithaca, NY
Investigated finding provably better data permutations in distributed learning. CD-GraB was accepted by NeurIPS'23
 
 
 
 
 
Prof. Jim Dai, Cornell University
Research Assistant
Prof. Jim Dai, Cornell University
November 2021 – September 2022 Pittsburgh, PA
Investigated using variance-reduction method approximating martingale-process in reinforcement learning with large state space

Industry Experience

 
 
 
 
 
Google
Software Engineer Intern
June 2023 – August 2023 Sunnyvale, CA
Worked on Core ML’s Distributed Runtime team, optimizing TensorFlow’s checkpoint to reduce wasted TPU cycles
 
 
 
 
 
Google Cloud
Software Engineer Intern
August 2022 – October 2022 Sunnyvale, CA
Worked on Technical Infrastructure’s Platform team, deploying accelerators in Google data centers using OpenBMC, implementing Linux daemon and firmware update APIs
 
 
 
 
 
Amazon Robotics
Software Engineer Intern
May 2022 – July 2022 Greater Boston, MA
Worked on Robotic Storage Technologies team, improving worker’s interaction with autonomous warehouse robots
 
 
 
 
 
XPENG Motors
Software Engineer Intern
June 2021 – August 2021 Shanghai, China
Optimized sensor fusion algorithms for XPeng’s self-driving cars
 
 
 
 
 
Morgina Information Technology
Software Engineer Intern
June 2020 – July 2020 Shanghai, China
Optimized multi-object tracking algorithm with millimeter-wave radar

Contact

I am applying for Fall 2025 PhD programs in machine learning systems. Please feel free to contact me!