ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES
The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a fir...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | GULAVANI, Bhargav SIVATHANU, Muthian VISWANATHA, Srinidhi WELANKAR, Kaustubh SHUKLA, Dharma Kiritkumar NEHME, Rimma Vladimirovna RAMJEE, Ramachandran AGRAWAL, Amey ANUPINDI, Ravi Shreyas |
description | The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2023236837A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2023236837A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2023236837A13</originalsourceid><addsrcrecordid>eNrjZPB19XEMDvF0dvTxiVTwdfRzdPf0c1cI9w_ydg0KVvB3U_AN9Qnx1IUIgMV9_B1dgDJ-Co7Ozq4-rkGOIf5BCi6uYZ7OrsE8DKxpiTnFqbxQmptB2c01xNlDN7UgPz61uCAxOTUvtSQ-NNjIwMjYyNjMwtjc0dCYOFUAyY4veA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><source>esp@cenet</source><creator>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</creator><creatorcontrib>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</creatorcontrib><description>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230727&DB=EPODOC&CC=US&NR=2023236837A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230727&DB=EPODOC&CC=US&NR=2023236837A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GULAVANI, Bhargav</creatorcontrib><creatorcontrib>SIVATHANU, Muthian</creatorcontrib><creatorcontrib>VISWANATHA, Srinidhi</creatorcontrib><creatorcontrib>WELANKAR, Kaustubh</creatorcontrib><creatorcontrib>SHUKLA, Dharma Kiritkumar</creatorcontrib><creatorcontrib>NEHME, Rimma Vladimirovna</creatorcontrib><creatorcontrib>RAMJEE, Ramachandran</creatorcontrib><creatorcontrib>AGRAWAL, Amey</creatorcontrib><creatorcontrib>ANUPINDI, Ravi Shreyas</creatorcontrib><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><description>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPB19XEMDvF0dvTxiVTwdfRzdPf0c1cI9w_ydg0KVvB3U_AN9Qnx1IUIgMV9_B1dgDJ-Co7Ozq4-rkGOIf5BCi6uYZ7OrsE8DKxpiTnFqbxQmptB2c01xNlDN7UgPz61uCAxOTUvtSQ-NNjIwMjYyNjMwtjc0dCYOFUAyY4veA</recordid><startdate>20230727</startdate><enddate>20230727</enddate><creator>GULAVANI, Bhargav</creator><creator>SIVATHANU, Muthian</creator><creator>VISWANATHA, Srinidhi</creator><creator>WELANKAR, Kaustubh</creator><creator>SHUKLA, Dharma Kiritkumar</creator><creator>NEHME, Rimma Vladimirovna</creator><creator>RAMJEE, Ramachandran</creator><creator>AGRAWAL, Amey</creator><creator>ANUPINDI, Ravi Shreyas</creator><scope>EVB</scope></search><sort><creationdate>20230727</creationdate><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><author>GULAVANI, Bhargav ; SIVATHANU, Muthian ; VISWANATHA, Srinidhi ; WELANKAR, Kaustubh ; SHUKLA, Dharma Kiritkumar ; NEHME, Rimma Vladimirovna ; RAMJEE, Ramachandran ; AGRAWAL, Amey ; ANUPINDI, Ravi Shreyas</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2023236837A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>GULAVANI, Bhargav</creatorcontrib><creatorcontrib>SIVATHANU, Muthian</creatorcontrib><creatorcontrib>VISWANATHA, Srinidhi</creatorcontrib><creatorcontrib>WELANKAR, Kaustubh</creatorcontrib><creatorcontrib>SHUKLA, Dharma Kiritkumar</creatorcontrib><creatorcontrib>NEHME, Rimma Vladimirovna</creatorcontrib><creatorcontrib>RAMJEE, Ramachandran</creatorcontrib><creatorcontrib>AGRAWAL, Amey</creatorcontrib><creatorcontrib>ANUPINDI, Ravi Shreyas</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GULAVANI, Bhargav</au><au>SIVATHANU, Muthian</au><au>VISWANATHA, Srinidhi</au><au>WELANKAR, Kaustubh</au><au>SHUKLA, Dharma Kiritkumar</au><au>NEHME, Rimma Vladimirovna</au><au>RAMJEE, Ramachandran</au><au>AGRAWAL, Amey</au><au>ANUPINDI, Ravi Shreyas</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES</title><date>2023-07-27</date><risdate>2023</risdate><abstract>The disclosure herein describes elastically managing the execution of workers of multi-worker workloads on accelerator devices. A first worker of a workload is executed on an accelerator device during a first time interval. A first context switch point is identified when the first worker is in a first worker state. At the identified context switch point, a first memory state of the first worker is stored in a host memory and the accelerator device is configured to a second memory state of the second worker. The second worker is executed during a second time interval and a second context switch point is identified at the end of the second time interval when the second worker is in a state that is equivalent to the first worker state. During the intervals, collective communication operations between the workers are accumulated and, at the second context switch point, the accumulated operations are performed.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng |
recordid | cdi_epo_espacenet_US2023236837A1 |
source | esp@cenet |
subjects | CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
title | ELASTICALLY MANAGING WORKERS OF MULTI-WORKER WORKLOADS ON ACCELERATOR DEVICES |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T23%3A53%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GULAVANI,%20Bhargav&rft.date=2023-07-27&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2023236837A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |