VECTOR FRIENDLY INSTRUCTION FORMAT AND EXECUTION THEREOF

A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is to execute an instruction set. The instruction set includes a first instruction format, wherein the first instruction format includes a first plurality of templates, wherein the f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHARNEY, Mark, TOLL, Bret, L, OULD-AHMED-VALL, Elmoustapha, ABEL, James, C, CAVIN, Robert, D, WU, Lisa, SAIR, Suleyman, SAN ADRIAN, Jesus, Corbal, GROCHOWSKI, Edward, Thomas, YOUNT, Charles, VALENTINE, Robert, C, DURAN, Santiago, Galan, WIEDEMEIER, Jeffrey, G, GIRKAR, Milind, Baburao, ABRAHAM, Seth, SAMUDRALA, Sridhar, HALL, Jonathan, Cannon, BRADFORD, Dennis, R, SANS, Roger, Espasa, FORSYTH, Andrew, Thomas
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is to execute an instruction set. The instruction set includes a first instruction format, wherein the first instruction format includes a first plurality of templates, wherein the first instruction format has a plurality of fields including a base operation field, a data element width field, and a write mask field, wherein the first instruction format supports, through different values in the base operation field, specification of different vector operations, wherein each of the vector operations is to generate a destination vector operand including a plurality of data elements at different data element positions, wherein the first instruction format supports, through different values in the data element width field, specification of different data element widths, wherein the base operation field, the data element width field, and the write mask field may each store only one value on each occurrence of an instruction in the first instruction format in instruction streams. The processor includes a decode unit to decode the occurrences of the instructions in the first plurality of templates, including to: distinguish, for each of the occurrences, which one of the data element widths to use based on a value in the data element width field; and distinguish, for each of the occurrences, the data elements resulting from the occurrence's vector operation to be reflected in the destination vector operand's corresponding data element positions based on the write mask field's content and the data element width for the occurrence. Different values that may be stored in the write mask field distinguish different write mask registers, of a set of write mask registers, that are to store configurable write masks. The data element width for the occurrence distinguishes which of the data element positions of the destination vector operand correspond with which bits of the configurable write masks.