| | |
| | |
Stat |
Members: 3658 Articles: 2'599'751 Articles rated: 2609
02 November 2024 |
|
| | | |
|
Article overview
| |
|
A milestone for FaaS pipelines; object storage vs VM-driven data exchange | Germán T. Eizaguirre
; Marc Sánchez-Artigas
; Pedro García-López
; | Date: |
22 Jun 2022 | Abstract: | Serverless functions provide high levels of parallelism, short startup times,
and "pay-as-you-go" billing. These attributes make them a natural substrate for
data analytics workflows. However, the impossibility of direct communication
between functions makes the execution of workflows challenging. The current
practice to share intermediate data among functions is through remote object
storage (e.g., IBM COS). Contrary to conventional wisdom, the performance of
object storage is not well understood. For instance, object storage can even be
superior to other simpler approaches like the execution of shuffle stages
(e.g., GroupBy) inside powerful VMs to avoid all-to-all transfers between
functions. Leveraging a genomics pipeline, we show that object storage is a
reasonable choice for data passing when the appropriate number of functions is
used in shuffling stages. | Source: | arXiv, 2207.12083 | Services: | Forum | Review | PDF | Favorites |
|
|
No review found.
Did you like this article?
Note: answers to reviews or questions about the article must be posted in the forum section.
Authors are not allowed to review their own article. They can use the forum section.
|
| |
|
|
|