Efficient ML Systems and LLM Infrastructure
Building scalable, practical, and trustworthy machine learning systems.
I am an Assistant Professor in the Department of Computer Science at Rutgers University. My research focuses on scalable and efficient machine learning algorithms and systems, with a current emphasis on LLMs.
Background
I was a Senior Project Scientist at the Machine Learning Department at CMU, working with Eric Xing. I obtained my PhD in Computer Science from UW-Madison, advised by Dimitris Papailiopoulos.
Research Direction
I study efficient training and serving of large-scale machine learning models, especially large language models under real system constraints.
LLM360: Towards Fully Transparent Open-Source LLMs
Z. Liu, A. Qiao, W. Neiswanger, H. Wang, B. Tan, T. Tao, J. Li, Y. Wang, S. Sun, O. Pangarkar, R. Fan, Y. Gu, V. Miller, Y. Zhuang, G. He, H. Li, F. Koto, L. Tang, N. Ranjan, Z. Shen, R. Iriondo, C. Mu, Z. Hu, M. Schulze, P. Nakov, T. Baldwin, E. P. Xing, COLM 2024 [arXiv]
Recent notes
News
- Our lab received an AMD University Program AI and HPC Cluster Allocation Award.
- Our work RedCoast won the NAACL 2024 Best Demo Paper Runner-Up.
- I received the Rising Star Award at CPAL 2024.
People
Research Group
Courses
Teaching
Community
Services
Area Chair: NeurIPS 2026, MLSys 2025, CPAL 2026
PC Member: DAC 2024, EuroSys 2024, SOSP 2023 (light PC), MLSys 2023-2026, SIGKDD 2022, AAAI 2021-2022
Reviewer (Journals): JMLR, TMLR, IEEE TNNLS, IEEE IoT-J, IEEE/ACM Transactions on Networking
Reviewer (Conferences): SC 2026, COLM 2026, ICML 2019-2026, NeurIPS 2019-2025, CVPR 2021-2023, ICCV 2021-2022, ICLR 2021-2025, AAAI 2021-2024, SIGKDD 2022-2023
