Charlie Ruan

MSCS @ CMU

Carnegie Mellon University

Biography

I am a first-year MSCS student at Carnegie Mellon University, where I am fortunate to work with Prof. Tianqi Chen as part of the Catalyst Group.

My focus is at the intersection of machine learning and systems, with various open-source development experience. I am the current lead of Web LLM, a core contributor to MLC LLM, and have been contributing to ApacheTVM. I also contributed to TensorFlow as part of Google’s TF Runtime team in Summer 2023.

Prior to coming to CMU, I obtained my B.S. degree in Computer Science and Operations Research from Cornell University, where I was fortunate to work with Prof. Christopher De Sa on distributed ML and with Prof. Jim Dai on reinforcement learning.

Interests

Machine Learning Systems
Distributed Systems

Education

MS in Computer Science, 2025
Carnegie Mellon University
BS in Computer Science & Operations Research, 2023
Cornell University

Publications

A. Feder Cooper, Wentao Guo, Khiem Pham, Tiancheng Yuan, Charlie F. Ruan, Yucheng Lu, Christopher De Sa (2023). Coordinating Distributed Example Orders for Provably Accelerated Training. In NeurIPS'23.

PDF Cite Code Poster

Projects

Core Contributor

MLC LLM

June 2023 – Present Pittsburgh, PA

Enable universal native deployment for LLMs through machine learning compilation techniques

Project Lead

Web LLM

June 2023 – Present Pittsburgh, PA

Leading the project to bring LLMs to run locally in client-side browser with WebGPU acceleration

Research Experience

Research Assistant

Prof. Tianqi Chen & Prof. Zhihao Jia, Carnegie Mellon University

March 2024 – Present Pittsburgh, PA

Investigating distributed LLM serving systems

Research Assistant

Prof. Christopher De Sa, Cornell University

September 2022 – June 2023 Ithaca, NY

Investigated finding provably better data permutations in distributed learning. CD-GraB was accepted by NeurIPS'23

Research Assistant

Prof. Jim Dai, Cornell University

November 2021 – September 2022 Pittsburgh, PA

Investigated using variance-reduction method approximating martingale-process in reinforcement learning with large state space

Industry Experience

Software Engineer Intern

Google

June 2023 – August 2023 Sunnyvale, CA

Worked on Core ML’s Distributed Runtime team, optimizing TensorFlow’s checkpoint to reduce wasted TPU cycles

Software Engineer Intern

Google Cloud

August 2022 – October 2022 Sunnyvale, CA

Worked on Technical Infrastructure’s Platform team, deploying accelerators in Google data centers using OpenBMC, implementing Linux daemon and firmware update APIs