Efficient ML Systems and LLM Infrastructure
Building scalable, practical, and trustworthy machine learning systems.
I am an Assistant Professor in the Department of Computer Science at Rutgers University. My research focuses on scalable and efficient machine learning algorithms and systems, with a current emphasis on LLMs.
Background
I was a Senior Project Scientist at the Machine Learning Department at CMU, working with Eric Xing. I obtained my PhD in Computer Science from UW-Madison, advised by Dimitris Papailiopoulos.
Research Direction
I study efficient training and serving of large-scale machine learning models, especially large language models under real system constraints.
Recent notes
News
- Our lab received an AMD University Program AI and HPC Cluster Allocation Award.
- Our work RedCoast won the NAACL 2024 Best Demo Paper Runner-Up.
- I received the Rising Star Award at CPAL 2024.
People
Research Group
Courses
Teaching
Community
Services
Area Chair: NeurIPS 2026, MLSys 2025, CPAL 2026
PC Member: DAC 2024, EuroSys 2024, SOSP 2023 (light PC), MLSys 2023-2026, SIGKDD 2022, AAAI 2021-2022
Reviewer (Journals): JMLR, TMLR, IEEE TNNLS, IEEE IoT-J, IEEE/ACM Transactions on Networking
Reviewer (Conferences): SC 2026, COLM 2026, ICML 2019-2026, NeurIPS 2019-2025, CVPR 2021-2023, ICCV 2021-2022, ICLR 2021-2025, AAAI 2021-2024, SIGKDD 2022-2023
