Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure

A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a...



Bibliographic Details
Main authors:	Roy, Subir; Karpenko, Igor; Walker, Peter; Lu, Ranglin; Cheng, Yinhe; Gu, Yu
Format:	Patent
Language:	eng
Subjects:	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; PHYSICS

creator	Roy, Subir; Karpenko, Igor; Walker, Peter; Lu, Ranglin; Cheng, Yinhe; Gu, Yu
description	A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update, based on the result data, a performance profile of the plurality of performance profiles associated with the system resource and the machine learning model.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2021224665A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2021224665A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2021224665A13</originalsourceid><addsrcrecordid>eNqNjcFKAzEURWfjQtR_uOC2gh21e2kVFQtK7Lo8k5cmkEmGlxdlPsj_1BE_wNWBwz3c4-5ryxqKW8BMVXlYgLLDugxjUxa8SDkIDTNdswpfBJsp0xAtpTTB2MCupZgP2JINMTOemSTP4jF7Fs6W8VTeKz6jBmyi_5WK10Yp6oTiYVg-ouWKkkEwgYTdXAtVlZ_XJnzaHXlKlc_-eNKd39-9rR8ueCx7riNZzqz7nekv-2XfX69WN7fLq_-tvgGqa1cm</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><source>esp@cenet</source><creator>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, Yu</creator><creatorcontrib>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, Yu</creatorcontrib><description>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update 
based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210722&DB=EPODOC&CC=US&NR=2021224665A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210722&DB=EPODOC&CC=US&NR=2021224665A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Roy, Subir</creatorcontrib><creatorcontrib>Karpenko, Igor</creatorcontrib><creatorcontrib>Walker, Peter</creatorcontrib><creatorcontrib>Lu, Ranglin</creatorcontrib><creatorcontrib>Cheng, Yinhe</creatorcontrib><creatorcontrib>Gu, Yu</creatorcontrib><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><description>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the 
plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNjcFKAzEURWfjQtR_uOC2gh21e2kVFQtK7Lo8k5cmkEmGlxdlPsj_1BE_wNWBwz3c4-5ryxqKW8BMVXlYgLLDugxjUxa8SDkIDTNdswpfBJsp0xAtpTTB2MCupZgP2JINMTOemSTP4jF7Fs6W8VTeKz6jBmyi_5WK10Yp6oTiYVg-ouWKkkEwgYTdXAtVlZ_XJnzaHXlKlc_-eNKd39-9rR8ueCx7riNZzqz7nekv-2XfX69WN7fLq_-tvgGqa1cm</recordid><startdate>20210722</startdate><enddate>20210722</enddate><creator>Roy, Subir</creator><creator>Karpenko, Igor</creator><creator>Walker, Peter</creator><creator>Lu, Ranglin</creator><creator>Cheng, Yinhe</creator><creator>Gu, Yu</creator><scope>EVB</scope></search><sort><creationdate>20210722</creationdate><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><author>Roy, Subir ; Karpenko, Igor ; Walker, Peter ; Lu, Ranglin ; Cheng, Yinhe ; Gu, 
Yu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2021224665A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Roy, Subir</creatorcontrib><creatorcontrib>Karpenko, Igor</creatorcontrib><creatorcontrib>Walker, Peter</creatorcontrib><creatorcontrib>Lu, Ranglin</creatorcontrib><creatorcontrib>Cheng, Yinhe</creatorcontrib><creatorcontrib>Gu, Yu</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Roy, Subir</au><au>Karpenko, Igor</au><au>Walker, Peter</au><au>Lu, Ranglin</au><au>Cheng, Yinhe</au><au>Gu, Yu</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure</title><date>2021-07-22</date><risdate>2021</risdate><abstract>A method, system, and computer program product for dynamically scheduling machine learning inference jobs receive or determine a plurality of performance profiles associated with a plurality of system resources, wherein each performance profile is associated with a machine learning model; receive a request for system resources for an inference job associated with the machine learning model; determine a system resource of the plurality of system resources for processing the inference job associated with the machine learning model based on the plurality of performance profiles and a quality of service requirement associated with the inference job; assign the system 
resource to the inference job for processing the inference job; receive result data associated with processing of the inference job with the system resource; and update based on the result data, a performance profile of the plurality of the performance profiles associated with the system resource and the machine learning model.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
language	eng
recordid	cdi_epo_espacenet_US2021224665A1
source	esp@cenet
subjects	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; PHYSICS
title	Method, System, and Computer Program Product for Dynamically Scheduling Machine Learning Inference Jobs with Different Quality of Services on a Shared Infrastructure
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T14%3A30%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Roy,%20Subir&rft.date=2021-07-22&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2021224665A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
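
The scheduling loop summarized in the description field above (look up performance profiles, pick a resource that satisfies the quality of service requirement, then update the profile from result data) might be sketched as follows. This is only an illustration, not the patented method: the `Scheduler` class, its method names, the use of latency in milliseconds as the profiled metric, the lowest-latency selection rule, and the exponential-moving-average update with weight `alpha` are all assumptions that do not appear in the record.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class Scheduler:
    """Hypothetical scheduler keyed by (system resource, machine learning model)."""

    # (resource, model) -> predicted latency in ms; latency as the metric is an assumption.
    profiles: Dict[Tuple[str, str], float] = field(default_factory=dict)
    # Smoothing weight for profile updates; the moving-average rule is an assumption.
    alpha: float = 0.5

    def assign(self, model: str, max_latency_ms: float) -> Optional[str]:
        """Return the resource with the lowest predicted latency that meets the QoS bound."""
        candidates = [
            (latency, resource)
            for (resource, profiled_model), latency in self.profiles.items()
            if profiled_model == model and latency <= max_latency_ms
        ]
        if not candidates:
            return None  # no profiled resource satisfies the requirement
        return min(candidates)[1]

    def report(self, resource: str, model: str, observed_latency_ms: float) -> None:
        """Blend the observed latency (result data) into the stored profile."""
        key = (resource, model)
        previous = self.profiles.get(key, observed_latency_ms)
        self.profiles[key] = (1 - self.alpha) * previous + self.alpha * observed_latency_ms
```

A caller would construct `Scheduler` with initial profiles, call `assign` for each incoming inference job with that job's latency bound, and call `report` once the job finishes so that later assignments reflect the observed behavior of the shared infrastructure.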