The System for AI Lab (SAIL) at Georgia Tech, led by Prof. Alexey Tumanov, specializes in advancing systems support and resource management for machine learning (ML) to democratize large-scale AI systems. Our research encompasses the entire AI infrastructure stack, from foundational system design to the development of efficient ML training and inference algorithms. By focusing on managing the complete ML lifecycle, SAIL aims to enhance accessibility and efficiency in AI technologies.
Recent News
- Sarathi-Serve ☸️, our paper on efficient LLM inference, has been accepted at OSDI’24.
- Vidur 👳🏽, our paper on large-scale LLM inference cluster simulation, has been accepted at MLSys’24.
- Payman Behnam awarded NVIDIA Graduate Fellowship 2024 for advancing machine learning and systems with high-performance, low-latency, and energy-efficient hardware designs.
- Payman Behnam receives Qualcomm Innovation Fellowship 2023 for his work on Hardware-Software Co-Design for DNN inference systems.
- Amey Agrawal secures CRNCH PhD Fellowship 2023 for his research in LLM inference.