Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect StorageZiqi YuanHaoyang Zhanget al.2025NeurIPS 2025
To Virtualize or Not to Virtualize: Experiences from Building Two Generations of Virtualized Infrastructure for LLM TrainingApoorve MohanMing-Hung Chenet al.2025SC 2025
DPU-based Optimization of GPU Data Paths through Real-time Diagnostics and On-Demand ControlYongxuan HuangMing-Hung Chenet al.2025CCS 2025
To virtualize or not to virtualize AI Infrastructure: A perspectiveSeetharami SeelamApoorve Mohanet al.2023ISCA 2023