Saurabh Agarwal, Rahul Garg, et al.
ACM/IEEE SC 2004
The authors argue that the minimum cost of computing can be provided by consolidating real-time workloads onto relatively large servers, which can operate at high utilization while maintaining required response time, and then filling the remaining overhead capacity with batch-like workloads sold at a significantly reduced price. © 2006 IEEE.