Clustering-based numerosity reduction for cloud workload forecasting

dc.check.date2024-12-14en
dc.check.infoAccess to this item is restricted until 12 months after publication by request of the publisheren
dc.contributor.authorRossi, Andreaen
dc.contributor.authorVisentin, Andreaen
dc.contributor.authorPrestwich, Steven D.en
dc.contributor.authorBrown, Kenneth N.en
dc.date.accessioned2024-03-21T16:46:05Z
dc.date.available2024-03-21T16:46:05Z
dc.date.issued2023en
dc.description.abstractFinding smaller versions of large datasets that preserve the same characteristics as the original ones is becoming a central problem in Machine Learning, especially when computational resources are limited, and there is a need to reduce energy consumption. In this paper, we apply clustering techniques for wisely selecting a subset of datasets for training models for time series prediction of future workload in cloud computing. We train Bayesian Neural Networks (BNNs) and state-of-the-art probabilistic models to predict machine-level future resource demand distribution and evaluate them on unseen data from virtual machines in the Google Cloud data centre. Experiments show that selecting the training data via clustering approaches such as Self Organising Maps allows the model to achieve the same accuracy in less than half the time, requiring less than half the datasets rather than selecting more data at random. Moreover, BNNs can capture uncertainty aspects that can better inform scheduling decisions, which state-of-the-art time series forecasting methods cannot do. All the considered models achieve prediction time performance suitable for real-world scenarios.en
dc.description.statusPeer revieweden
dc.description.versionAccepted Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationRossi, A., Visentin, A., Prestwich, S. and Brown, K.N. (2024) ‘Clustering-based numerosity reduction for cloud workload forecasting’, in I. Chatzigiannakis and I. Karydis (eds) Algorithmic Aspects of Cloud Computing, Lecture Notes in Computer Science, vol 14503, Cham: Springer Nature Switzerland, pp. 115–132. Available at: https://doi.org/10.1007/978-3-031-49361-4_7en
dc.identifier.doihttps://doi.org/10.1007/978-3-031-49361-4_7en
dc.identifier.endpage132en
dc.identifier.isbn9783031493607en
dc.identifier.isbn9783031493614en
dc.identifier.issn0302-9743en
dc.identifier.issn1611-3349en
dc.identifier.journaltitleLecture Notes in Computer Scienceen
dc.identifier.startpage115en
dc.identifier.urihttps://hdl.handle.net/10468/15693
dc.identifier.volume14503en
dc.language.isoenen
dc.publisherSpringeren
dc.relation.ispartofAlgorithmic Aspects of Cloud Computingen
dc.relation.ispartofLecture Notes in Computer Scienceen
dc.rights© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG. This version of the contribution has been accepted for publication, after peer review but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-49361-4_7. Use of this Accepted Version is subject to the publisher’s Accepted Manuscript terms of use https://www.springernature.com/gp/open-research/policies/accepted-manuscript-termsen
dc.subjectCloud computingen
dc.subjectWorkload predictionen
dc.subjectClusteringen
dc.subjectBayesian neural networken
dc.subjectDeep learningen
dc.titleClustering-based numerosity reduction for cloud workload forecastingen
dc.typeArticle (peer-reviewed)en
dc.typebook-chapteren
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ALGOCLOUD___Clustering_based_Numerosity.pdf
Size:
320.87 KB
Format:
Adobe Portable Document Format
Description:
Accepted version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: