RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping
Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Servic...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on computers 2024-12, p.1-13 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 13 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | IEEE transactions on computers |
container_volume | |
creator | Hua, Yusheng Shi, Xuanhua He, Ligang He, Kang Zhang, Teng Jin, Hai Chen, Yong |
description | Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping. |
doi_str_mv | 10.1109/TC.2024.3510624 |
format | Article |
fullrecord | <record><control><sourceid>crossref_ieee_</sourceid><recordid>TN_cdi_ieee_primary_10772616</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10772616</ieee_id><sourcerecordid>10_1109_TC_2024_3510624</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1024-2f8fc8d0f2390f0b0c77daca6e5cdf3a0fae25f3c5bec857fc587a89ea2dc42f3</originalsourceid><addsrcrecordid>eNpNkLFOwzAYhC0EEqEwszD4AXD7247jmK2NaEEqKoIyMEWuY7dGpInsZICnJ1U7sNwtd6fTh9AthTGloCbrYsyApWMuKGQsPUMJFUISpUR2jhIAmhPFU7hEVzF-AQwZUAl6f-s__QNetZ2v_a_fb_GsD7Eb1DkbcLcLTb_d4WnfNbXubHWP535vySLowSr8GhpjYyRdQ2Yz_KLbdpi4RhdOf0d7c_IR-pg_rosnslwtnovpkhg6HCXM5c7kFTjGFTjYgJGy0kZnVpjKcQ1OWyYcN2JjTS6kMyKXOldWs8qkzPERmhx3TWhiDNaVbfC1Dj8lhfLApFwX5YFJeWIyNO6ODW-t_ZeWkmU0438GQF4z</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><source>IEEE Xplore</source><creator>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</creator><creatorcontrib>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</creatorcontrib><description>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</description><identifier>ISSN: 0018-9340</identifier><identifier>EISSN: 1557-9956</identifier><identifier>DOI: 10.1109/TC.2024.3510624</identifier><identifier>CODEN: ITCOB4</identifier><language>eng</language><publisher>IEEE</publisher><subject>Accuracy ; Bandwidth ; Burst Buffer ; Degradation ; File System ; File systems ; HPC ; Interference ; Processor scheduling ; QoS ; Quality of service ; Reliability ; Resource management ; Supercomputers</subject><ispartof>IEEE transactions on computers, 2024-12, p.1-13</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10772616$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,796,27915,27916,54749</link.rule.ids></links><search><creatorcontrib>Hua, Yusheng</creatorcontrib><creatorcontrib>Shi, Xuanhua</creatorcontrib><creatorcontrib>He, Ligang</creatorcontrib><creatorcontrib>He, Kang</creatorcontrib><creatorcontrib>Zhang, Teng</creatorcontrib><creatorcontrib>Jin, Hai</creatorcontrib><creatorcontrib>Chen, Yong</creatorcontrib><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><title>IEEE transactions on computers</title><addtitle>TC</addtitle><description>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</description><subject>Accuracy</subject><subject>Bandwidth</subject><subject>Burst Buffer</subject><subject>Degradation</subject><subject>File System</subject><subject>File systems</subject><subject>HPC</subject><subject>Interference</subject><subject>Processor scheduling</subject><subject>QoS</subject><subject>Quality of service</subject><subject>Reliability</subject><subject>Resource management</subject><subject>Supercomputers</subject><issn>0018-9340</issn><issn>1557-9956</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><recordid>eNpNkLFOwzAYhC0EEqEwszD4AXD7247jmK2NaEEqKoIyMEWuY7dGpInsZICnJ1U7sNwtd6fTh9AthTGloCbrYsyApWMuKGQsPUMJFUISpUR2jhIAmhPFU7hEVzF-AQwZUAl6f-s__QNetZ2v_a_fb_GsD7Eb1DkbcLcLTb_d4WnfNbXubHWP535vySLowSr8GhpjYyRdQ2Yz_KLbdpi4RhdOf0d7c_IR-pg_rosnslwtnovpkhg6HCXM5c7kFTjGFTjYgJGy0kZnVpjKcQ1OWyYcN2JjTS6kMyKXOldWs8qkzPERmhx3TWhiDNaVbfC1Dj8lhfLApFwX5YFJeWIyNO6ODW-t_ZeWkmU0438GQF4z</recordid><startdate>20241202</startdate><enddate>20241202</enddate><creator>Hua, Yusheng</creator><creator>Shi, Xuanhua</creator><creator>He, Ligang</creator><creator>He, Kang</creator><creator>Zhang, Teng</creator><creator>Jin, Hai</creator><creator>Chen, Yong</creator><general>IEEE</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20241202</creationdate><title>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</title><author>Hua, Yusheng ; Shi, Xuanhua ; He, Ligang ; He, Kang ; Zhang, Teng ; Jin, Hai ; Chen, Yong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1024-2f8fc8d0f2390f0b0c77daca6e5cdf3a0fae25f3c5bec857fc587a89ea2dc42f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Bandwidth</topic><topic>Burst Buffer</topic><topic>Degradation</topic><topic>File System</topic><topic>File systems</topic><topic>HPC</topic><topic>Interference</topic><topic>Processor scheduling</topic><topic>QoS</topic><topic>Quality of service</topic><topic>Reliability</topic><topic>Resource management</topic><topic>Supercomputers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hua, Yusheng</creatorcontrib><creatorcontrib>Shi, Xuanhua</creatorcontrib><creatorcontrib>He, Ligang</creatorcontrib><creatorcontrib>He, Kang</creatorcontrib><creatorcontrib>Zhang, Teng</creatorcontrib><creatorcontrib>Jin, Hai</creatorcontrib><creatorcontrib>Chen, Yong</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><jtitle>IEEE transactions on computers</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hua, Yusheng</au><au>Shi, Xuanhua</au><au>He, Ligang</au><au>He, Kang</au><au>Zhang, Teng</au><au>Jin, Hai</au><au>Chen, Yong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping</atitle><jtitle>IEEE transactions on computers</jtitle><stitle>TC</stitle><date>2024-12-02</date><risdate>2024</risdate><spage>1</spage><epage>13</epage><pages>1-13</pages><issn>0018-9340</issn><eissn>1557-9956</eissn><coden>ITCOB4</coden><abstract>Current supercomputers use an SSD-based storage layer called Burst Buffer (BB) to provide I/O-intensive applications with accelerated storage access. However, efficiently utilizing this limited and expensive storage remains a critical issue, creating an urgent need for implementing Quality of Service (QoS) in BB. To address this, we propose RuYi, a QoS-aware method to provide applications with bandwidth guarantees in the BB file system. RuYi tackles two main issues. First, it quantitatively profiles available bandwidth resources in BB to ensure reliable QoS, a crucial aspect seldom studied in the literature. Second, RuYi offers fine-grained process-level QoS via an innovative process-to-BB mapping, maximizing resource utilization-something not achievable with conventional coarse-grained compute-to-BB mapping. We evaluated RuYi on a subsystem of the leading exascale supercomputer Sunway, consisting of 4,000 compute nodes and 200 BB nodes. The experimental results demonstrate that RuYi achieves an impressive end-to-end bandwidth control accuracy of 97%, while improving BB utilization by up to 116% compared to conventional coarse-grained compute-to-BB mapping.</abstract><pub>IEEE</pub><doi>10.1109/TC.2024.3510624</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0018-9340 |
ispartof | IEEE transactions on computers, 2024-12, p.1-13 |
issn | 0018-9340 1557-9956 |
language | eng |
recordid | cdi_ieee_primary_10772616 |
source | IEEE Xplore |
subjects | Accuracy Bandwidth Burst Buffer Degradation File System File systems HPC Interference Processor scheduling QoS Quality of service Reliability Resource management Supercomputers |
title | RuYi: Optimizing Burst Buffer through Automated, Fine-Grained Process-to-BB Mapping |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T19%3A33%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RuYi:%20Optimizing%20Burst%20Buffer%20through%20Automated,%20Fine-Grained%20Process-to-BB%20Mapping&rft.jtitle=IEEE%20transactions%20on%20computers&rft.au=Hua,%20Yusheng&rft.date=2024-12-02&rft.spage=1&rft.epage=13&rft.pages=1-13&rft.issn=0018-9340&rft.eissn=1557-9956&rft.coden=ITCOB4&rft_id=info:doi/10.1109/TC.2024.3510624&rft_dat=%3Ccrossref_ieee_%3E10_1109_TC_2024_3510624%3C/crossref_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10772616&rfr_iscdi=true |