Abstract
The scalability of cloud functions makes them a convenient backend for elastic data analytics pipelines where parallelism changes drastically from one stage to the next. However, cloud functions require intermediate storage systems for communication, which limits the efficiency of stateful operations. Furthermore, cloud functions are expensive, which reduces the cost-effectiveness of pure serverless architectures. We propose a hybrid architecture for data analytics that uses cloud functions for embarrassingly parallel stages and virtual cloud instances for stateful operations under a unified serverless programming framework. Extending Lithops, a serverless programming library, we implement a parallel programming interface that proactively provisions serverless and serverful cloud resources with minimal user intervention. We validate the feasibility of a hybrid architecture, by comparing it to fully serverless and serverful versions of a production-level metabolomics pipeline. We show that mixing cloud functions with virtual instances increases the cost-effectiveness of the execution by up to 188.23% over the serverless implementation, while achieving a speedup of 3.64 compared to the serverful one.
Original language | English |
---|---|
Title of host publication | Middleware Industrial Track 2024 - Proceedings of the Middleware Industrial Track, Part of |
Subtitle of host publication | Middleware 2024 |
Publisher | Association for Computing Machinery, Inc |
Pages | 15-21 |
Number of pages | 7 |
ISBN (Electronic) | 9798400713194 |
DOIs | |
State | Published - 2 Dec 2024 |
Externally published | Yes |
Event | 2024 Middleware Industrial Track, Middleware Industrial Track 2024 - Hong Kong, Hong Kong Duration: 2 Dec 2024 → 6 Dec 2024 |
Publication series
Name | Middleware Industrial Track 2024 - Proceedings of the Middleware Industrial Track, Part of: Middleware 2024 |
---|
Conference
Conference | 2024 Middleware Industrial Track, Middleware Industrial Track 2024 |
---|---|
Country/Territory | Hong Kong |
City | Hong Kong |
Period | 2/12/24 → 6/12/24 |
Bibliographical note
Publisher Copyright:© 2024 Copyright held by the owner/author(s).
Keywords
- Cloud computing
- function-as-a-service
- resource allocation
- resource efficiency
- serverless computing
ASJC Scopus subject areas
- Information Systems
- Software