MULTI-PETASCALE HIGHLY EFFICIENT PARALLEL SUPERCOMPUTER

A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaflop-scale includes node architectures based upon System-On-a-Chip technology, where each processing node comprises a single Application Specific Integrated Circuit (ASIC). The ASIC nodes are interconnected by a five dimensional to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHIU GEORGE L, WAIT CHARLES D, ASAAD SAMEH, EICHENBERGER ALEXANDRE E, OHMACHT MARTIN, BELLOFATTO RALPH E, RATTERMAN JOSEPH D, O'BRIEN JOHN K, CHER CHEN-YONG, BOYLE PETER, MARCELLA JAMES A, COTEUS PAUL W, HEIDELBERGER PHILIP, ELLAVSKY MATTHEW R, VAN OOSTEN JAMES L, BLOCKSOME MICHAEL A, DOZSA GABOR J, MUFF ADAM J, KNUDSON BRANT L, TAKKEN TODD E, MUNDY MICHAEL B, SUGAVANAM KRISHNAN, EISLEY NOEL A, TRAGER BARRY M, BLUMRICH MATTHIAS A, GUNNELS JOHN A, MAMIDALA AMITH R, O'BRIEN KATHRYN M, BRUNHEROTO JOSE R, STEINMACHER-BUROW BURKHARD, WU PENG, GARA ALAN, CHEN DONG, KOPCSAY GERARD V, EVANS KAHN C, STOCKDELL WILLIAM M, SATTERFIELD DAVID L, DAVIS KRISTAN D, STUNKEL CRAIG B, POOLE RUTH J, GOODING THOMAS M, SENGER ROBERT M, GIAMPAPA MARK E, HARING RUDOLF A, INGLETT TODD A, WATSON ALFRED T, GSCHWIND MICHAEL K, FLEISCHER BRUCE M, MILLER DOUGLAS R, CHRIST NORMAN, PARKER JEFFREY J, SUGAWARA YUTAKA, MILLER SAMUEL J, WISNIEWSKI ROBERT W, SALAPURA VALENTINA, WALKUP ROBERT E, KUMAR SAMEER, HALL SHAWN A, FOX THOMAS W, MEGERIAN MARK G
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaflop-scale includes node architectures based upon System-On-a-Chip technology, where each processing node comprises a single Application Specific Integrated Circuit (ASIC). The ASIC nodes are interconnected by a five dimensional torus network that optimally maximize the throughput of packet communications between nodes and minimize latency. The network implements collective network and a global asynchronous network that provides global barrier and notification functions. Integrated in the node design include a list-based prefetcher. The memory system implements transaction memory, thread level speculation, and multiversioning cache that improves soft error rate at the same time and supports DMA functionality allowing for parallel processing message-passing.