Speculative incoherent cache protocols
Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant...
Gespeichert in:
Veröffentlicht in: | IEEE MICRO 2004-11, Vol.24 (6), p.104-109 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 109 |
---|---|
container_issue | 6 |
container_start_page | 104 |
container_title | IEEE MICRO |
container_volume | 24 |
creator | Jaehyuk Huh Burger, D. Jichuan Chang Sohi, G.S. |
description | Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer |
doi_str_mv | 10.1109/MM.2004.88 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_28813230</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1388165</ieee_id><sourcerecordid>780246691</sourcerecordid><originalsourceid>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</originalsourceid><addsrcrecordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>196369717</pqid></control><display><type>article</type><title>Speculative incoherent cache protocols</title><source>IEEE Electronic Library (IEL)</source><creator>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creator><creatorcontrib>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creatorcontrib><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><identifier>ISSN: 0272-1732</identifier><identifier>EISSN: 1937-4143</identifier><identifier>DOI: 10.1109/MM.2004.88</identifier><identifier>CODEN: IEMIDZ</identifier><language>eng</language><publisher>Los Alamitos: IEEE</publisher><subject>Acceleration ; Access protocols ; Cache ; Coherence ; Computer programming ; Computer science ; Costs ; Decoupling ; Delay ; Design engineering ; Multiprocessing ; Multiprocessing (computers) ; Optimization ; Performance loss ; Permission ; Predictive models ; Programmers ; Registers ; Tuning</subject><ispartof>IEEE MICRO, 2004-11, Vol.24 (6), p.104-109</ispartof><rights>Copyright Institute of Electrical and Electronics Engineers, Inc. (IEEE) Nov/Dec 2004</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</citedby><cites>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><title>Speculative incoherent cache protocols</title><title>IEEE MICRO</title><addtitle>MM</addtitle><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><subject>Acceleration</subject><subject>Access protocols</subject><subject>Cache</subject><subject>Coherence</subject><subject>Computer programming</subject><subject>Computer science</subject><subject>Costs</subject><subject>Decoupling</subject><subject>Delay</subject><subject>Design engineering</subject><subject>Multiprocessing</subject><subject>Multiprocessing (computers)</subject><subject>Optimization</subject><subject>Performance loss</subject><subject>Permission</subject><subject>Predictive models</subject><subject>Programmers</subject><subject>Registers</subject><subject>Tuning</subject><issn>0272-1732</issn><issn>1937-4143</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</recordid><startdate>20041101</startdate><enddate>20041101</enddate><creator>Jaehyuk Huh</creator><creator>Burger, D.</creator><creator>Jichuan Chang</creator><creator>Sohi, G.S.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20041101</creationdate><title>Speculative incoherent cache protocols</title><author>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Acceleration</topic><topic>Access protocols</topic><topic>Cache</topic><topic>Coherence</topic><topic>Computer programming</topic><topic>Computer science</topic><topic>Costs</topic><topic>Decoupling</topic><topic>Delay</topic><topic>Design engineering</topic><topic>Multiprocessing</topic><topic>Multiprocessing (computers)</topic><topic>Optimization</topic><topic>Performance loss</topic><topic>Permission</topic><topic>Predictive models</topic><topic>Programmers</topic><topic>Registers</topic><topic>Tuning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE MICRO</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jaehyuk Huh</au><au>Burger, D.</au><au>Jichuan Chang</au><au>Sohi, G.S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Speculative incoherent cache protocols</atitle><jtitle>IEEE MICRO</jtitle><stitle>MM</stitle><date>2004-11-01</date><risdate>2004</risdate><volume>24</volume><issue>6</issue><spage>104</spage><epage>109</epage><pages>104-109</pages><issn>0272-1732</issn><eissn>1937-4143</eissn><coden>IEMIDZ</coden><abstract>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</abstract><cop>Los Alamitos</cop><pub>IEEE</pub><doi>10.1109/MM.2004.88</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0272-1732 |
ispartof | IEEE MICRO, 2004-11, Vol.24 (6), p.104-109 |
issn | 0272-1732 1937-4143 |
language | eng |
recordid | cdi_proquest_miscellaneous_28813230 |
source | IEEE Electronic Library (IEL) |
subjects | Acceleration Access protocols Cache Coherence Computer programming Computer science Costs Decoupling Delay Design engineering Multiprocessing Multiprocessing (computers) Optimization Performance loss Permission Predictive models Programmers Registers Tuning |
title | Speculative incoherent cache protocols |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T07%3A33%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Speculative%20incoherent%20cache%20protocols&rft.jtitle=IEEE%20MICRO&rft.au=Jaehyuk%20Huh&rft.date=2004-11-01&rft.volume=24&rft.issue=6&rft.spage=104&rft.epage=109&rft.pages=104-109&rft.issn=0272-1732&rft.eissn=1937-4143&rft.coden=IEMIDZ&rft_id=info:doi/10.1109/MM.2004.88&rft_dat=%3Cproquest_RIE%3E780246691%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=196369717&rft_id=info:pmid/&rft_ieee_id=1388165&rfr_iscdi=true |