ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES

The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a fir...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GULAVANI, Bhargav, SIVATHANU, Muthian, VISWANATHA, Srinidhi, WELANKAR, Kaustubh, SHUKLA, Dharma Kiritkumar, NEHME, Rimma Vladimirovna, RAMJEE, Ramachandran, AGRAWAL, Amey, ANUPINDI, Ravi Shreyas
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator GULAVANI, Bhargav
SIVATHANU, Muthian
VISWANATHA, Srinidhi
WELANKAR, Kaustubh
SHUKLA, Dharma Kiritkumar
NEHME, Rimma Vladimirovna
RAMJEE, Ramachandran
AGRAWAL, Amey
ANUPINDI, Ravi Shreyas
description The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2023236837A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2023236837A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2023236837A13</originalsourceid><addsrcrecordid>eNrjZPB19XEMDvF0dvTxiVTwdfRzdPf0c1cI9w_ydg0KVvB3U_AN9Qnx1IUIgMV9_B1dgDJ-Co7Ozq4-rkGOIf5BCi6uYZ7OrsE8DKxpiTnFqbxQmptB2c01xNlDN7UgPz61uCAxOTUvtSQ-NNjIwMjYyNjMwtjc0dCYOFUAyY4veA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><source>esp@cenet</source><creator>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</creator><creatorcontrib>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</creatorcontrib><description>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230727&amp;DB=EPODOC&amp;CC=US&amp;NR=2023236837A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230727&amp;DB=EPODOC&amp;CC=US&amp;NR=2023236837A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GULAVANI, Bhargav</creatorcontrib><creatorcontrib>SIVATHANU, Muthian</creatorcontrib><creatorcontrib>VISWANATHA, Srinidhi</creatorcontrib><creatorcontrib>WELANKAR, Kaustubh</creatorcontrib><creatorcontrib>SHUKLA, Dharma Kiritkumar</creatorcontrib><creatorcontrib>NEHME, Rimma Vladimirovna</creatorcontrib><creatorcontrib>RAMJEE, Ramachandran</creatorcontrib><creatorcontrib>AGRAWAL, Amey</creatorcontrib><creatorcontrib>ANUPINDI, Ravi Shreyas</creatorcontrib><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><description>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPB19XEMDvF0dvTxiVTwdfRzdPf0c1cI9w_ydg0KVvB3U_AN9Qnx1IUIgMV9_B1dgDJ-Co7Ozq4-rkGOIf5BCi6uYZ7OrsE8DKxpiTnFqbxQmptB2c01xNlDN7UgPz61uCAxOTUvtSQ-NNjIwMjYyNjMwtjc0dCYOFUAyY4veA</recordid><startdate>20230727</startdate><enddate>20230727</enddate><creator>GULAVANI, Bhargav</creator><creator>SIVATHANU, Muthian</creator><creator>VISWANATHA, Srinidhi</creator><creator>WELANKAR, Kaustubh</creator><creator>SHUKLA, Dharma Kiritkumar</creator><creator>NEHME, Rimma Vladimirovna</creator><creator>RAMJEE, Ramachandran</creator><creator>AGRAWAL, Amey</creator><creator>ANUPINDI, Ravi Shreyas</creator><scope>EVB</scope></search><sort><creationdate>20230727</creationdate><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><author>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2023236837A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>GULAVANI, Bhargav</creatorcontrib><creatorcontrib>SIVATHANU, Muthian</creatorcontrib><creatorcontrib>VISWANATHA, Srinidhi</creatorcontrib><creatorcontrib>WELANKAR, Kaustubh</creatorcontrib><creatorcontrib>SHUKLA, Dharma Kiritkumar</creatorcontrib><creatorcontrib>NEHME, Rimma Vladimirovna</creatorcontrib><creatorcontrib>RAMJEE, Ramachandran</creatorcontrib><creatorcontrib>AGRAWAL, Amey</creatorcontrib><creatorcontrib>ANUPINDI, Ravi Shreyas</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GULAVANI, Bhargav</au><au>SIVATHANU, Muthian</au><au>VISWANATHA, Srinidhi</au><au>WELANKAR, Kaustubh</au><au>SHUKLA, Dharma Kiritkumar</au><au>NEHME, Rimma Vladimirovna</au><au>RAMJEE, Ramachandran</au><au>AGRAWAL, Amey</au><au>ANUPINDI, Ravi Shreyas</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><date>2023-07-27</date><risdate>2023</risdate><abstract>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US2023236837A1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T23%3A53%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GULAVANI,%20Bhargav&rft.date=2023-07-27&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2023236837A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true