Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure

A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Roy, Subir, Karpenko, Igor, Walker, Peter, Lu, Ranglin, Cheng, Yinhe, Gu, Yu
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Roy, Subir
Karpenko, Igor
Walker, Peter
Lu, Ranglin
Cheng, Yinhe
Gu, Yu
description A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2021224665A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2021224665A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2021224665A13</originalsourceid><addsrcrecordid>eNqNjcFKAzEURWfjQtR_uOC2gh21e2kVFQtK7Lo8k5cmkEmGlxdlPsj_1BE_wNWBwz3c4-5ryxqKW8BMVXlYgLLDugxjUxa8SDkIDTNdswpfBJsp0xAtpTTB2MCupZgP2JINMTOemSTP4jF7Fs6W8VTeKz6jBmyi_5WK10Yp6oTiYVg-ouWKkkEwgYTdXAtVlZ_XJnzaHXlKlc_-eNKd39-9rR8ueCx7riNZzqz7nekv-2XfX69WN7fLq_-tvgGqa1cm</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><source>esp@cenet</source><creator>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, Yu</creator><creatorcontrib>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, Yu</creatorcontrib><description>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210722&amp;DB=EPODOC&amp;CC=US&amp;NR=2021224665A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210722&amp;DB=EPODOC&amp;CC=US&amp;NR=2021224665A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Roy, Subir</creatorcontrib><creatorcontrib>Karpenko, Igor</creatorcontrib><creatorcontrib>Walker, Peter</creatorcontrib><creatorcontrib>Lu, Ranglin</creatorcontrib><creatorcontrib>Cheng, Yinhe</creatorcontrib><creatorcontrib>Gu, Yu</creatorcontrib><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><description>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNjcFKAzEURWfjQtR_uOC2gh21e2kVFQtK7Lo8k5cmkEmGlxdlPsj_1BE_wNWBwz3c4-5ryxqKW8BMVXlYgLLDugxjUxa8SDkIDTNdswpfBJsp0xAtpTTB2MCupZgP2JINMTOemSTP4jF7Fs6W8VTeKz6jBmyi_5WK10Yp6oTiYVg-ouWKkkEwgYTdXAtVlZ_XJnzaHXlKlc_-eNKd39-9rR8ueCx7riNZzqz7nekv-2XfX69WN7fLq_-tvgGqa1cm</recordid><startdate>20210722</startdate><enddate>20210722</enddate><creator>Roy, Subir</creator><creator>Karpenko, Igor</creator><creator>Walker, Peter</creator><creator>Lu, Ranglin</creator><creator>Cheng, Yinhe</creator><creator>Gu, Yu</creator><scope>EVB</scope></search><sort><creationdate>20210722</creationdate><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><author>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, Yu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2021224665A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Roy, Subir</creatorcontrib><creatorcontrib>Karpenko, Igor</creatorcontrib><creatorcontrib>Walker, Peter</creatorcontrib><creatorcontrib>Lu, Ranglin</creatorcontrib><creatorcontrib>Cheng, Yinhe</creatorcontrib><creatorcontrib>Gu, Yu</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Roy, Subir</au><au>Karpenko, Igor</au><au>Walker, Peter</au><au>Lu, Ranglin</au><au>Cheng, Yinhe</au><au>Gu, Yu</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><date>2021-07-22</date><risdate>2021</risdate><abstract>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US2021224665A1
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T14%3A30%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Roy,%20Subir&rft.date=2021-07-22&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2021224665A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true