Forecasting workload in cloud computing: towards uncertainty-aware predictions and transfer learning

dc.contributor.authorRossi, Andreaen
dc.contributor.authorVisentin, Andreaen
dc.contributor.authorCarraro, Diegoen
dc.contributor.authorPrestwich, Steven D.en
dc.contributor.authorBrown, Kenneth N.en
dc.contributor.funderScience Foundation Irelanden
dc.contributor.funderEuropean Commissionen
dc.contributor.funderGoogle Clouden
dc.date.accessioned2025-10-24T14:23:04Z
dc.date.available2025-10-24T14:23:04Z
dc.date.issued2025-02-25en
dc.description.abstractAccurately forecasting workload demand in cloud computing environments is essential for optimizing resource allocation, minimizing costs, and ensuring reliable service quality. As cloud computing scales to meet the needs of diverse applications - from AI and machine learning to data-intensive analytics - predictive models play a critical role in dynamically managing multiple resources. Traditional models provide limited guidance for decision-makers since they are typically univariate models, ignoring the prediction of the interplay between multiple resources, and do not account for the uncertainty of their predictions, preventing resource management from acting promptly according to such uncertainty to ensure specific target service level requirements. To address these limitations, we introduce univariate and bivariate Bayesian deep learning models that predict future workload demand of one and multiple resources respectively, while quantifying the uncertainty of their predictions. In particular, our approach leverages Hybrid Bayesian Neural Networks and probabilistic Long Short-Term Memory models, enhanced with architecture modifications to handle complex, multivariate cloud workload patterns. Moreover, we investigate fine-tuning-based transfer learning methods to enhance their adaptability in real-world cloud scenarios where new data centres with different workload characteristics operate. We validate our models on extensive datasets from Google and Alibaba cloud clusters. Results show that modelling the uncertainty of predictions positively impacts performance, especially on service level metrics, because uncertainty quantification can be tailored to desired target service levels that are critical in cloud applications. Moreover, transfer learning benefits performance in scenarios where models are built on data from the same provider.en
dc.description.sponsorshipGoogle Cloud (Research Credits program GCP203677602)en
dc.description.statusPeer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.articleid258en
dc.identifier.citationRossi, A., Visentin, A., Carraro, D., Prestwich, S. and Brown, K.N. (2025) 'Forecasting workload in cloud computing: towards uncertainty-aware predictions and transfer learning', Cluster Computing, 28(4), 258 (20pp). https://doi.org/10.1007/s10586-024-04933-2en
dc.identifier.doi10.1007/s10586-024-04933-2en
dc.identifier.eissn1573-7543en
dc.identifier.endpage20en
dc.identifier.issn1386-7857en
dc.identifier.issued4en
dc.identifier.journaltitleCluster Computingen
dc.identifier.startpage1en
dc.identifier.urihttps://hdl.handle.net/10468/18076
dc.identifier.volume28en
dc.language.isoenen
dc.publisherSpringer Natureen
dc.relation.projectinfo:eu-repo/grantAgreement/SFI/Centres for Research Training (CRT) Programme/18/CRT/6223/IE/SFI Centre for Research Training in Artificial Intelligence/en
dc.relation.projectinfo:eu-repo/grantAgreement/SFI/Research Centres Programme::Phase 2/12/RC/2289_P2/IE/INSIGHT_Phase 2 /en
dc.relation.projectinfo:eu-repo/grantAgreement/EC/HE::HORIZON-RIA/101070141/EU/Green responsibLe privACy preservIng dAta operaTIONs/GLACIATIONen
dc.relation.projectinfo:eu-repo/grantAgreement/EC/H2020::RIA/952215/EU/Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization/TAILORen
dc.rights© 2024, the Authors. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons. org/licenses/by/4.0/.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectBayesian neural networksen
dc.subjectCloud computingen
dc.subjectWorkload predictionen
dc.subjectUncertaintyen
dc.subjectDeep learningen
dc.subjectTransfer learningen
dc.titleForecasting workload in cloud computing: towards uncertainty-aware predictions and transfer learningen
dc.typeArticle (peer-reviewed)en
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s10586-024-04933-2.pdf
Size:
1.22 MB
Format:
Adobe Portable Document Format
Description:
Published Version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: