Speculative incoherent cache protocols

Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE MICRO 2004-11, Vol.24 (6), p.104-109
Hauptverfasser: Jaehyuk Huh, Burger, D., Jichuan Chang, Sohi, G.S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 109
container_issue 6
container_start_page 104
container_title IEEE MICRO
container_volume 24
creator Jaehyuk Huh
Burger, D.
Jichuan Chang
Sohi, G.S.
description Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer
doi_str_mv 10.1109/MM.2004.88
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_28813230</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1388165</ieee_id><sourcerecordid>780246691</sourcerecordid><originalsourceid>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</originalsourceid><addsrcrecordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>196369717</pqid></control><display><type>article</type><title>Speculative incoherent cache protocols</title><source>IEEE Electronic Library (IEL)</source><creator>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creator><creatorcontrib>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creatorcontrib><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><identifier>ISSN: 0272-1732</identifier><identifier>EISSN: 1937-4143</identifier><identifier>DOI: 10.1109/MM.2004.88</identifier><identifier>CODEN: IEMIDZ</identifier><language>eng</language><publisher>Los Alamitos: IEEE</publisher><subject>Acceleration ; Access protocols ; Cache ; Coherence ; Computer programming ; Computer science ; Costs ; Decoupling ; Delay ; Design engineering ; Multiprocessing ; Multiprocessing (computers) ; Optimization ; Performance loss ; Permission ; Predictive models ; Programmers ; Registers ; Tuning</subject><ispartof>IEEE MICRO, 2004-11, Vol.24 (6), p.104-109</ispartof><rights>Copyright Institute of Electrical and Electronics Engineers, Inc. (IEEE) Nov/Dec 2004</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</citedby><cites>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><title>Speculative incoherent cache protocols</title><title>IEEE MICRO</title><addtitle>MM</addtitle><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><subject>Acceleration</subject><subject>Access protocols</subject><subject>Cache</subject><subject>Coherence</subject><subject>Computer programming</subject><subject>Computer science</subject><subject>Costs</subject><subject>Decoupling</subject><subject>Delay</subject><subject>Design engineering</subject><subject>Multiprocessing</subject><subject>Multiprocessing (computers)</subject><subject>Optimization</subject><subject>Performance loss</subject><subject>Permission</subject><subject>Predictive models</subject><subject>Programmers</subject><subject>Registers</subject><subject>Tuning</subject><issn>0272-1732</issn><issn>1937-4143</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</recordid><startdate>20041101</startdate><enddate>20041101</enddate><creator>Jaehyuk Huh</creator><creator>Burger, D.</creator><creator>Jichuan Chang</creator><creator>Sohi, G.S.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20041101</creationdate><title>Speculative incoherent cache protocols</title><author>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Acceleration</topic><topic>Access protocols</topic><topic>Cache</topic><topic>Coherence</topic><topic>Computer programming</topic><topic>Computer science</topic><topic>Costs</topic><topic>Decoupling</topic><topic>Delay</topic><topic>Design engineering</topic><topic>Multiprocessing</topic><topic>Multiprocessing (computers)</topic><topic>Optimization</topic><topic>Performance loss</topic><topic>Permission</topic><topic>Predictive models</topic><topic>Programmers</topic><topic>Registers</topic><topic>Tuning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE MICRO</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jaehyuk Huh</au><au>Burger, D.</au><au>Jichuan Chang</au><au>Sohi, G.S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Speculative incoherent cache protocols</atitle><jtitle>IEEE MICRO</jtitle><stitle>MM</stitle><date>2004-11-01</date><risdate>2004</risdate><volume>24</volume><issue>6</issue><spage>104</spage><epage>109</epage><pages>104-109</pages><issn>0272-1732</issn><eissn>1937-4143</eissn><coden>IEMIDZ</coden><abstract>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</abstract><cop>Los Alamitos</cop><pub>IEEE</pub><doi>10.1109/MM.2004.88</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0272-1732
ispartof IEEE MICRO, 2004-11, Vol.24 (6), p.104-109
issn 0272-1732
1937-4143
language eng
recordid cdi_proquest_miscellaneous_28813230
source IEEE Electronic Library (IEL)
subjects Acceleration
Access protocols
Cache
Coherence
Computer programming
Computer science
Costs
Decoupling
Delay
Design engineering
Multiprocessing
Multiprocessing (computers)
Optimization
Performance loss
Permission
Predictive models
Programmers
Registers
Tuning
title Speculative incoherent cache protocols
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T07%3A33%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Speculative%20incoherent%20cache%20protocols&rft.jtitle=IEEE%20MICRO&rft.au=Jaehyuk%20Huh&rft.date=2004-11-01&rft.volume=24&rft.issue=6&rft.spage=104&rft.epage=109&rft.pages=104-109&rft.issn=0272-1732&rft.eissn=1937-4143&rft.coden=IEMIDZ&rft_id=info:doi/10.1109/MM.2004.88&rft_dat=%3Cproquest_RIE%3E780246691%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=196369717&rft_id=info:pmid/&rft_ieee_id=1388165&rfr_iscdi=true