RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping

Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Servic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on computers 2024-12, p.1-13
Hauptverfasser: Hua, Yusheng, Shi, Xuanhua, He, Ligang, He, Kang, Zhang, Teng, Jin, Hai, Chen, Yong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 13
container_issue
container_start_page 1
container_title IEEE transactions on computers
container_volume
creator Hua, Yusheng
Shi, Xuanhua
He, Ligang
He, Kang
Zhang, Teng
Jin, Hai
Chen, Yong
description Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.
doi_str_mv 10.1109/TC.2024.3510624
format Article
fullrecord <record><control><sourceid>crossref_ieee_</sourceid><recordid>TN_cdi_ieee_primary_10772616</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10772616</ieee_id><sourcerecordid>10_1109_TC_2024_3510624</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1024-2f8fc8d0f2390f0b0c77daca6e5cdf3a0fae25f3c5bec857fc587a89ea2dc42f3</originalsourceid><addsrcrecordid>eNpNkLFOwzAYhC0EEqEwszD4AXD7247jmK2NaEEqKoIyMEWuY7dGpInsZICnJ1U7sNwtd6fTh9AthTGloCbrYsyApWMuKGQsPUMJFUISpUR2jhIAmhPFU7hEVzF-AQwZUAl6f-s__QNetZ2v_a_fb_GsD7Eb1DkbcLcLTb_d4WnfNbXubHWP535vySLowSr8GhpjYyRdQ2Yz_KLbdpi4RhdOf0d7c_IR-pg_rosnslwtnovpkhg6HCXM5c7kFTjGFTjYgJGy0kZnVpjKcQ1OWyYcN2JjTS6kMyKXOldWs8qkzPERmhx3TWhiDNaVbfC1Dj8lhfLApFwX5YFJeWIyNO6ODW-t_ZeWkmU0438GQF4z</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><source>IEEE Xplore</source><creator>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</creator><creatorcontrib>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</creatorcontrib><description>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</description><identifier>ISSN: 0018-9340</identifier><identifier>EISSN: 1557-9956</identifier><identifier>DOI: 10.1109/TC.2024.3510624</identifier><identifier>CODEN: ITCOB4</identifier><language>eng</language><publisher>IEEE</publisher><subject>Accuracy ; Bandwidth ; Burst Buffer ; Degradation ; File System ; File systems ; HPC ; Interference ; Processor scheduling ; QoS ; Quality of service ; Reliability ; Resource management ; Supercomputers</subject><ispartof>IEEE transactions on computers, 2024-12, p.1-13</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10772616$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,796,27915,27916,54749</link.rule.ids></links><search><creatorcontrib>Hua, Yusheng</creatorcontrib><creatorcontrib>Shi, Xuanhua</creatorcontrib><creatorcontrib>He, Ligang</creatorcontrib><creatorcontrib>He, Kang</creatorcontrib><creatorcontrib>Zhang, Teng</creatorcontrib><creatorcontrib>Jin, Hai</creatorcontrib><creatorcontrib>Chen, Yong</creatorcontrib><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><title>IEEE transactions on computers</title><addtitle>TC</addtitle><description>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</description><subject>Accuracy</subject><subject>Bandwidth</subject><subject>Burst Buffer</subject><subject>Degradation</subject><subject>File System</subject><subject>File systems</subject><subject>HPC</subject><subject>Interference</subject><subject>Processor scheduling</subject><subject>QoS</subject><subject>Quality of service</subject><subject>Reliability</subject><subject>Resource management</subject><subject>Supercomputers</subject><issn>0018-9340</issn><issn>1557-9956</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><recordid>eNpNkLFOwzAYhC0EEqEwszD4AXD7247jmK2NaEEqKoIyMEWuY7dGpInsZICnJ1U7sNwtd6fTh9AthTGloCbrYsyApWMuKGQsPUMJFUISpUR2jhIAmhPFU7hEVzF-AQwZUAl6f-s__QNetZ2v_a_fb_GsD7Eb1DkbcLcLTb_d4WnfNbXubHWP535vySLowSr8GhpjYyRdQ2Yz_KLbdpi4RhdOf0d7c_IR-pg_rosnslwtnovpkhg6HCXM5c7kFTjGFTjYgJGy0kZnVpjKcQ1OWyYcN2JjTS6kMyKXOldWs8qkzPERmhx3TWhiDNaVbfC1Dj8lhfLApFwX5YFJeWIyNO6ODW-t_ZeWkmU0438GQF4z</recordid><startdate>20241202</startdate><enddate>20241202</enddate><creator>Hua, Yusheng</creator><creator>Shi, Xuanhua</creator><creator>He, Ligang</creator><creator>He, Kang</creator><creator>Zhang, Teng</creator><creator>Jin, Hai</creator><creator>Chen, Yong</creator><general>IEEE</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20241202</creationdate><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><author>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1024-2f8fc8d0f2390f0b0c77daca6e5cdf3a0fae25f3c5bec857fc587a89ea2dc42f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Bandwidth</topic><topic>Burst Buffer</topic><topic>Degradation</topic><topic>File System</topic><topic>File systems</topic><topic>HPC</topic><topic>Interference</topic><topic>Processor scheduling</topic><topic>QoS</topic><topic>Quality of service</topic><topic>Reliability</topic><topic>Resource management</topic><topic>Supercomputers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hua, Yusheng</creatorcontrib><creatorcontrib>Shi, Xuanhua</creatorcontrib><creatorcontrib>He, Ligang</creatorcontrib><creatorcontrib>He, Kang</creatorcontrib><creatorcontrib>Zhang, Teng</creatorcontrib><creatorcontrib>Jin, Hai</creatorcontrib><creatorcontrib>Chen, Yong</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><jtitle>IEEE transactions on computers</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hua, Yusheng</au><au>Shi, Xuanhua</au><au>He, Ligang</au><au>He, Kang</au><au>Zhang, Teng</au><au>Jin, Hai</au><au>Chen, Yong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</atitle><jtitle>IEEE transactions on computers</jtitle><stitle>TC</stitle><date>2024-12-02</date><risdate>2024</risdate><spage>1</spage><epage>13</epage><pages>1-13</pages><issn>0018-9340</issn><eissn>1557-9956</eissn><coden>ITCOB4</coden><abstract>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</abstract><pub>IEEE</pub><doi>10.1109/TC.2024.3510624</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0018-9340
ispartof IEEE transactions on computers, 2024-12, p.1-13
issn 0018-9340
1557-9956
language eng
recordid cdi_ieee_primary_10772616
source IEEE Xplore
subjects Accuracy
Bandwidth
Burst Buffer
Degradation
File System
File systems
HPC
Interference
Processor scheduling
QoS
Quality of service
Reliability
Resource management
Supercomputers
title RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T19%3A33%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RuYi:%20Optimizing%20Burst%20Buffer%20through%20Automated,%20Fine-Grained%20Process-to-BB%20Mapping&rft.jtitle=IEEE%20transactions%20on%20computers&rft.au=Hua,%20Yusheng&rft.date=2024-12-02&rft.spage=1&rft.epage=13&rft.pages=1-13&rft.issn=0018-9340&rft.eissn=1557-9956&rft.coden=ITCOB4&rft_id=info:doi/10.1109/TC.2024.3510624&rft_dat=%3Ccrossref_ieee_%3E10_1109_TC_2024_3510624%3C/crossref_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10772616&rfr_iscdi=true