Performance and energy savings trade-off with uncertainty-aware cloud workload forecasting

Thumbnail Image
CEC_workshop_paper-1_AV.pdf(266.6 KB)
Accepted version
Carraro, Diego
Rossi, Andrea
Visentin, Andrea
Prestwich , Steven D.
Brown, Kenneth N.
Journal Title
Journal ISSN
Volume Title
Published Version
Research Projects
Organizational Units
Journal Issue
Cloud managers typically leverage future workload predictions to make informed decisions on resource allocation, where the ultimate goal of the allocation is to meet customers’ demands while reducing the provisioning cost. Among several workload forecasting approaches proposed in the literature, uncertainty-aware time series analysis solutions are desirable in cloud scenarios because they can predict the distribution of future demand and provide bounds associated with a given service level set by the resource manager. The effectiveness of uncertainty-based workload predictions is normally assessed in terms of accuracy metrics (e.g. MAE) and service level (e.g. Success Rate), but the effect on the resource provisioning cost is under investigated. We propose an evaluation framework to assess the impact of uncertainty-aware predictions on the performance vs cost trade-off, where we express the cost in terms of energy savings. We illustrate the framework’s effectiveness by simulating two real-world cloud scenarios where an optimizer leverages workload predictions to allocate resources to satisfy a desired service level while minimizing energy waste. Offline experiments compare representative uncertainty-aware models and a new model (HBNN++) that we propose, which predict a cluster trace’s GPU demand. We show that more effective uncertainty modelling can save energy without violating desired service level targets and that model performance varies depending on the specific details of the allocation scheme, server and GPU energy costs.
Cloud computing , Energy saving , Deep learning , Workload prediction , Uncertainty , Time series forecasting
Carraro, D., Rossi, A., Visentin, A., Prestwich, S. and Brown, K. N. (2023) ‘Performance and Energy Savings Trade-Off with Uncertainty-Aware Cloud Workload Forecasting’, The Cloud-Edge Continuum Workshop 2023 (CEC'23), IEEE ICNP'23, the 31st IEEE International Conference on Network Protocols, 10 October, Reykjavik, Iceland. Forthcoming publication
Link to publisher’s version
© 2023