How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads

This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions runn...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-12
Hauptverfasser: Joosen, Artjom, Hassan, Ahmed, Asenov, Martin, Singh, Rajkarn, Darlow, Luke, Wang, Jianfeng, Barker, Adam
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Joosen, Artjom
Hassan, Ahmed
Asenov, Martin
Singh, Rajkarn
Darlow, Luke
Wang, Jianfeng
Barker, Adam
description This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave. Datasets and code available at https://github.com/sir-lab/data-release
doi_str_mv 10.48550/arxiv.2312.10127
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2312_10127</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2903732380</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-c454e8584e9483970b51ed73ab0d1f77e54b7659d3133ddbfae929e1e9814133</originalsourceid><addsrcrecordid>eNotj0tLAzEUhYMgWGp_gCsDrmfMs0lWItU-oKDQgu6GTHOnTm2TmszUx693bF1dzuXjcD6ErijJhZaS3Nr4VR9yxinLKaFMnaEe45xmWjB2gQYpbQghbKiYlLyHXqfhEz8ESHjW4HHrV00d_B0evdloVw3E-qf2azwPfp11aYeXEbxLuPb4OQbXHnG8gHiAuIWU8EuI79tgXbpE55XdJhj83z5ajB-Xo2k2f5rMRvfzzEpGspWQArTUAozQ3ChSSgpOcVsSRyulQIpSDaVxnHLuXFlZMMwABaOp6F59dH1qPVoX-1jvbPwu_uyLo31H3JyIfQwfLaSm2IQ2-m5SwQzhijOuCf8F7M9cog</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2903732380</pqid></control><display><type>article</type><title>How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Joosen, Artjom ; Hassan, Ahmed ; Asenov, Martin ; Singh, Rajkarn ; Darlow, Luke ; Wang, Jianfeng ; Barker, Adam</creator><creatorcontrib>Joosen, Artjom ; Hassan, Ahmed ; Asenov, Martin ; Singh, Rajkarn ; Darlow, Luke ; Wang, Jianfeng ; Barker, Adam</creatorcontrib><description>This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave. Datasets and code available at https://github.com/sir-lab/data-release</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2312.10127</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Cloud computing ; Computer Science - Distributed, Parallel, and Cluster Computing ; Computer Science - Learning ; Computer Science - Performance ; Data centers ; Programming languages ; Workload ; Workloads</subject><ispartof>arXiv.org, 2023-12</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/3620678.3624783$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.10127$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Joosen, Artjom</creatorcontrib><creatorcontrib>Hassan, Ahmed</creatorcontrib><creatorcontrib>Asenov, Martin</creatorcontrib><creatorcontrib>Singh, Rajkarn</creatorcontrib><creatorcontrib>Darlow, Luke</creatorcontrib><creatorcontrib>Wang, Jianfeng</creatorcontrib><creatorcontrib>Barker, Adam</creatorcontrib><title>How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads</title><title>arXiv.org</title><description>This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave. Datasets and code available at https://github.com/sir-lab/data-release</description><subject>Cloud computing</subject><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Performance</subject><subject>Data centers</subject><subject>Programming languages</subject><subject>Workload</subject><subject>Workloads</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotj0tLAzEUhYMgWGp_gCsDrmfMs0lWItU-oKDQgu6GTHOnTm2TmszUx693bF1dzuXjcD6ErijJhZaS3Nr4VR9yxinLKaFMnaEe45xmWjB2gQYpbQghbKiYlLyHXqfhEz8ESHjW4HHrV00d_B0evdloVw3E-qf2azwPfp11aYeXEbxLuPb4OQbXHnG8gHiAuIWU8EuI79tgXbpE55XdJhj83z5ajB-Xo2k2f5rMRvfzzEpGspWQArTUAozQ3ChSSgpOcVsSRyulQIpSDaVxnHLuXFlZMMwABaOp6F59dH1qPVoX-1jvbPwu_uyLo31H3JyIfQwfLaSm2IQ2-m5SwQzhijOuCf8F7M9cog</recordid><startdate>20231215</startdate><enddate>20231215</enddate><creator>Joosen, Artjom</creator><creator>Hassan, Ahmed</creator><creator>Asenov, Martin</creator><creator>Singh, Rajkarn</creator><creator>Darlow, Luke</creator><creator>Wang, Jianfeng</creator><creator>Barker, Adam</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231215</creationdate><title>How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads</title><author>Joosen, Artjom ; Hassan, Ahmed ; Asenov, Martin ; Singh, Rajkarn ; Darlow, Luke ; Wang, Jianfeng ; Barker, Adam</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-c454e8584e9483970b51ed73ab0d1f77e54b7659d3133ddbfae929e1e9814133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Cloud computing</topic><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Performance</topic><topic>Data centers</topic><topic>Programming languages</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>online_resources</toplevel><creatorcontrib>Joosen, Artjom</creatorcontrib><creatorcontrib>Hassan, Ahmed</creatorcontrib><creatorcontrib>Asenov, Martin</creatorcontrib><creatorcontrib>Singh, Rajkarn</creatorcontrib><creatorcontrib>Darlow, Luke</creatorcontrib><creatorcontrib>Wang, Jianfeng</creatorcontrib><creatorcontrib>Barker, Adam</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Joosen, Artjom</au><au>Hassan, Ahmed</au><au>Asenov, Martin</au><au>Singh, Rajkarn</au><au>Darlow, Luke</au><au>Wang, Jianfeng</au><au>Barker, Adam</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads</atitle><jtitle>arXiv.org</jtitle><date>2023-12-15</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave. Datasets and code available at https://github.com/sir-lab/data-release</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2312.10127</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-12
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2312_10127
source arXiv.org; Free E- Journals
subjects Cloud computing
Computer Science - Distributed, Parallel, and Cluster Computing
Computer Science - Learning
Computer Science - Performance
Data centers
Programming languages
Workload
Workloads
title How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T16%3A41%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=How%20Does%20It%20Function?%20Characterizing%20Long-term%20Trends%20in%20Production%20Serverless%20Workloads&rft.jtitle=arXiv.org&rft.au=Joosen,%20Artjom&rft.date=2023-12-15&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2312.10127&rft_dat=%3Cproquest_arxiv%3E2903732380%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2903732380&rft_id=info:pmid/&rfr_iscdi=true