Optimizing Simultaneous Autoscaling for Serverless Cloud Computing

Harold Ship, Evgeny Shindin, Chen Wang, Diana Arroyo, Asser Tantawi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper explores resource allocation in server-less cloud computing platforms and proposes an optimization approach for autoscaling systems. Serverless computing relieves users from resource management tasks, enabling focus on application functions. However, dynamic resource allocation and function replication based on changing loads remain crucial. Typically, autoscalers in these platforms utilize threshold-based mechanisms to adjust function replicas independently. We model applications as interconnected graphs of functions, where requests probabilistically traverse the graph, triggering associated function execution. Our objective is to develop a control policy that optimally allocates resources on servers, minimizing failed requests and response time in reaction to load changes. Using a fluid approximation model and Separated Continuous Linear Programming (SCLP), we derive an optimal control policy that determines the number of resources per replica and the required number of replicas over time. We evaluate our approach using a simulation framework built with Python and simpy. Comparing against threshold-based autoscaling, our approach demonstrates significant improvements in average response times and failed requests, ranging from 15% to over 300% in most cases. We also explore the impact of system and workload parameters on performance, providing insights into the behavior of our optimization approach under different conditions. Overall, our study contributes to advancing resource allocation strategies, enhancing efficiency and reliability in serverless cloud computing platforms.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 17th International Conference on Cloud Computing, CLOUD 2024
EditorsRong N. Chang, Carl K. Chang, Jingwei Yang, Nimanthi Atukorala, Zhi Jin, Michael Sheng, Jing Fan, Kenneth Fletcher, Qiang He, Tevfik Kosar, Santonu Sarkar, Sreekrishnan Venkateswaran, Shangguang Wang, Xuanzhe Liu, Seetharami Seelam, Chandra Narayanaswami, Ziliang Zong
PublisherIEEE Computer Society
Pages105-114
Number of pages10
ISBN (Electronic)9798350368536
DOIs
StatePublished - 2024
Externally publishedYes
Event17th IEEE International Conference on Cloud Computing, CLOUD 2024 - Shenzhen, China
Duration: 7 Jul 202413 Jul 2024

Publication series

NameIEEE International Conference on Cloud Computing, CLOUD
ISSN (Print)2159-6182
ISSN (Electronic)2159-6190

Conference

Conference17th IEEE International Conference on Cloud Computing, CLOUD 2024
Country/TerritoryChina
CityShenzhen
Period7/07/2413/07/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems
  • Software

Fingerprint

Dive into the research topics of 'Optimizing Simultaneous Autoscaling for Serverless Cloud Computing'. Together they form a unique fingerprint.

Cite this