A Cost-Efficient L1-L2 Multicore Interconnect: Performance, Power, and Area Considerations
Published in: | IEEE Transactions on Circuits and Systems I: Regular Papers, 2011-03, Vol. 58 (3), p. 529-538 |
---|---|
Main authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Abstract: | Processor manufacturers use advances in manufacturing technologies to increase the number of cores on chip in order to scale performance in a cost-efficient manner. As the number of cores scales up, not all cores can be directly connected to the main memory and there is a need for hierarchy, for example, by arranging them in clusters that share L2 caches. This paper focuses on designing cost-efficient L1-L2 interconnects. We discuss performance and power- and area-consumption considerations for a real processor designed in 45-nm technology. We explain the architectures and heuristics developed, including a smart floorplan with instance flips to address interconnect latency, customized decentralized arbitration schemes tailored per transaction type, and heterogeneous Vt device assignment to reduce overall power consumption, taking into account the expected switching factors. These and other methods worked together to achieve high throughput in a power-efficient interconnect that consumes less than 3% of the compute cluster area. |
---|---|
ISSN: | 1549-8328, 1558-0806 |
DOI: | 10.1109/TCSI.2010.2073832 |