SYSTEMS, APPARATUSES, AND METHODS FOR CHAINED FUSED MULTIPLY ADD

Embodiments of systems, apparatuses, and methods for chained fused multiply add. In some embodiments, an apparatus includes a decoder to decode a single instruction having an opcode, a destination field representing a destination operand, a first source field representing a plurality of packed data...

Detailed description

Bibliographic details
Main authors: CORBAL, JESUS; BRADFORD, DENNIS R; GROCHOWSKI, EDWARD T; DUBTSOV, ROMAN S; GIRKAR, MILIND B; FLETCHER, THOMAS D; CHARNEY, MARK J; VALENTINE, ROBERT; FERGUSON, WARREN E; SHUSTROV, NIKITA A
Format: Patent
Language: eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CORBAL, JESUS
BRADFORD, DENNIS R
GROCHOWSKI, EDWARD T
DUBTSOV, ROMAN S
GIRKAR, MILIND B
FLETCHER, THOMAS D
CHARNEY, MARK J
VALENTINE, ROBERT
FERGUSON, WARREN E
SHUSTROV, NIKITA A
description Embodiments of systems, apparatuses, and methods for chained fused multiply add. In some embodiments, an apparatus includes a decoder to decode a single instruction having an opcode, a destination field representing a destination operand, a first source field representing a plurality of packed data source operands of a first type that have packed data elements of a first size, a second source field representing a plurality of packed data source operands that have packed data elements of a second size, and a field for a memory location that stores a scalar value. A register file having a plurality of packed data registers includes registers for the plurality of packed data source operands that have packed data elements of a first size, the source operands that have packed data elements of a second size, and the destination operand. Execution circuitry executes the decoded single instruction to perform iterations of packed fused multiply accumulate operations by multiplying packed data elements of the sources of the first type by sub-elements of the scalar value, and adding results of these multiplications to an initial value in a first iteration and a result from a previous iteration in subsequent iterations.
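The iteration scheme in the description can be illustrated with a small software model. The following is a minimal sketch in Python, not the patented circuitry: the function name chained_fma, its parameter names, and the choice to pass the initial value as a separate list are all assumptions made for illustration. It models only the data flow (each iteration multiplies one packed source by one sub-element of the scalar value and adds the product to the running result). It does not model operand encoding, the second-size source operands, or the single rounding step of a true fused multiply add, since Python rounds after the multiply and again after the add.

```python
from typing import List, Sequence


def chained_fma(initial: Sequence[float],
                sources: Sequence[Sequence[float]],
                scalar_sub_elements: Sequence[float]) -> List[float]:
    """Software model of a chained packed multiply-accumulate (hypothetical helper).

    initial             -- packed accumulator used by the first iteration
    sources             -- one packed source vector per iteration
    scalar_sub_elements -- one sub-element of the scalar value per iteration
    """
    if len(sources) != len(scalar_sub_elements):
        raise ValueError("need exactly one scalar sub-element per packed source")

    acc = list(initial)
    for src, sub in zip(sources, scalar_sub_elements):
        if len(src) != len(acc):
            raise ValueError("every packed source must match the accumulator width")
        # Iteration 1 adds to the initial value; each later iteration adds to
        # the result of the previous one, which is what makes the chain.
        acc = [a + x * sub for a, x in zip(acc, src)]
    return acc


if __name__ == "__main__":
    # Two iterations over two-element packed sources.
    print(chained_fma([1.0, 2.0], [[1.0, 2.0], [3.0, 4.0]], [10.0, 100.0]))
```

With the inputs shown, the first iteration yields [11.0, 22.0] and the second yields [311.0, 422.0]. Passing an empty list of sources returns the initial value unchanged.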
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2021081198A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2021081198A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2021081198A13</originalsourceid><addsrcrecordid>eNrjZHAIjgwOcfUN1lFwDAhwDHIMCQ12BXH8XBR8XUM8_F2CFdz8gxScPRw9_VxdFNyA0kCZUJ8QzwCfSAVHFxceBta0xJziVF4ozc2g7OYa4uyhm1qQH59aXJCYnJqXWhIfGmxkYGRoYGFoaGnhaGhMnCoASEErbg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>SYSTEMS, APPARATUSES, AND METHODS FOR CHAINED FUSED MULTIPLY ADD</title><source>esp@cenet</source><creator>CORBAL, JESUS ; BRADFORD, DENNIS R ; GROCHOWSKI, EDWARD T ; DUBTSOV, ROMAN S ; GIRKAR, MILIND B ; FLETCHER, THOMAS D ; CHARNEY, MARK J ; VALENTINE, ROBERT ; FERGUSON, WARREN E ; SHUSTROV, NIKITA A</creator><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210318&amp;DB=EPODOC&amp;CC=US&amp;NR=2021081198A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210318&amp;DB=EPODOC&amp;CC=US&amp;NR=2021081198A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><format>patent</format><genre>patent</genre><ristype>GEN</ristype><date>2021-03-18</date><risdate>2021</risdate><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US2021081198A1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title SYSTEMS, APPARATUSES, AND METHODS FOR CHAINED FUSED MULTIPLY ADD
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T16%3A55%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CORBAL,%20JESUS&rft.date=2021-03-18&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2021081198A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true