Speculative incoherent cache protocols

Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE MICRO 2004-11, Vol.24 (6), p.104-109
Hauptverfasser:	Jaehyuk Huh, Burger, D., Jichuan Chang, Sohi, G.S.
Format:	Artikel
Sprache:	eng
Schlagworte:	Acceleration Access protocols Cache Coherence Computer programming Computer science Costs Decoupling Delay Design engineering Multiprocessing Multiprocessing (computers) Optimization Performance loss Permission Predictive models Programmers Registers Tuning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	109
container_issue	6
container_start_page	104
container_title	IEEE MICRO
container_volume	24
creator	Jaehyuk Huh Burger, D. Jichuan Chang Sohi, G.S.
description	Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer
doi_str_mv	10.1109/MM.2004.88
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_28813230</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1388165</ieee_id><sourcerecordid>780246691</sourcerecordid><originalsourceid>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</originalsourceid><addsrcrecordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>196369717</pqid></control><display><type>article</type><title>Speculative incoherent cache protocols</title><source>IEEE Electronic Library (IEL)</source><creator>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creator><creatorcontrib>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</creatorcontrib><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><identifier>ISSN: 0272-1732</identifier><identifier>EISSN: 1937-4143</identifier><identifier>DOI: 10.1109/MM.2004.88</identifier><identifier>CODEN: IEMIDZ</identifier><language>eng</language><publisher>Los Alamitos: IEEE</publisher><subject>Acceleration ; Access protocols ; Cache ; Coherence ; Computer programming ; Computer science ; Costs ; Decoupling ; Delay ; Design engineering ; Multiprocessing ; Multiprocessing (computers) ; Optimization ; Performance loss ; Permission ; Predictive models ; Programmers ; Registers ; Tuning</subject><ispartof>IEEE MICRO, 2004-11, Vol.24 (6), p.104-109</ispartof><rights>Copyright Institute of Electrical and Electronics Engineers, Inc. (IEEE) Nov/Dec 2004</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</citedby><cites>FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1388165$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><title>Speculative incoherent cache protocols</title><title>IEEE MICRO</title><addtitle>MM</addtitle><description>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</description><subject>Acceleration</subject><subject>Access protocols</subject><subject>Cache</subject><subject>Coherence</subject><subject>Computer programming</subject><subject>Computer science</subject><subject>Costs</subject><subject>Decoupling</subject><subject>Delay</subject><subject>Design engineering</subject><subject>Multiprocessing</subject><subject>Multiprocessing (computers)</subject><subject>Optimization</subject><subject>Performance loss</subject><subject>Permission</subject><subject>Predictive models</subject><subject>Programmers</subject><subject>Registers</subject><subject>Tuning</subject><issn>0272-1732</issn><issn>1937-4143</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNp90E1LAzEQBuAgCtbVi1cvxUMRYWuSydcepfgFXTyo55AmU7plu1uTXaH_3i0VBA-e5vIwM-9LyCWjU8ZocVeWU06pmBpzREasAJ0LJuCYjCjXPGca-Ck5S2lNKZWcmhGZvG3R97Xrqi8cV41vVxix6cbe-RWOt7HtWt_W6ZycLF2d8OJnZuTj8eF99pzPX59eZvfz3IM2Xa5RhEIwGkLgLkgFKJVYBLEQkmtAJYNyUgjuC6N44RQqwZ2DBfOBYQAJGZkc9g6XP3tMnd1UyWNduwbbPlluDAMOdIA3_0LGQAEYPaTPyPUfum772AwxLCsUqEIPvWTk9oB8bFOKuLTbWG1c3FlG7b5aW5Z2X601ZsBXB1wh4i-E4Tkl4RtE5HHj</recordid><startdate>20041101</startdate><enddate>20041101</enddate><creator>Jaehyuk Huh</creator><creator>Burger, D.</creator><creator>Jichuan Chang</creator><creator>Sohi, G.S.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20041101</creationdate><title>Speculative incoherent cache protocols</title><author>Jaehyuk Huh ; Burger, D. ; Jichuan Chang ; Sohi, G.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c378t-7e4d9410ddd2ad563e564bd4b45273e65d6a5442c98629a6e642aa3b1cd1ed353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Acceleration</topic><topic>Access protocols</topic><topic>Cache</topic><topic>Coherence</topic><topic>Computer programming</topic><topic>Computer science</topic><topic>Costs</topic><topic>Decoupling</topic><topic>Delay</topic><topic>Design engineering</topic><topic>Multiprocessing</topic><topic>Multiprocessing (computers)</topic><topic>Optimization</topic><topic>Performance loss</topic><topic>Permission</topic><topic>Predictive models</topic><topic>Programmers</topic><topic>Registers</topic><topic>Tuning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jaehyuk Huh</creatorcontrib><creatorcontrib>Burger, D.</creatorcontrib><creatorcontrib>Jichuan Chang</creatorcontrib><creatorcontrib>Sohi, G.S.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE MICRO</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jaehyuk Huh</au><au>Burger, D.</au><au>Jichuan Chang</au><au>Sohi, G.S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Speculative incoherent cache protocols</atitle><jtitle>IEEE MICRO</jtitle><stitle>MM</stitle><date>2004-11-01</date><risdate>2004</risdate><volume>24</volume><issue>6</issue><spage>104</spage><epage>109</epage><pages>104-109</pages><issn>0272-1732</issn><eissn>1937-4143</eissn><coden>IEMIDZ</coden><abstract>Multiprocessing and multithreading are becoming ubiquitous even on single chips. With increasing cache sizes, coherence misses in such systems will account for a larger fraction of all cache misses. As communication latencies increase, this larger fraction of coherence misses will cause significant and increased performance losses. Tuning coherence protocols for specific communication patterns and applications can reduce communication latencies. However, these optimizations increase a protocol's design complexity, making the protocol difficult to verify. A competing approach requires parallel programmers to tune applications to work well with simpler protocols. Speculative execution has successfully improved performance in various scenarios. We propose a new type of load speculation, called coherence decoupling. Coherence decoupling is a microarchitectural mechanism that implements separate protocols for speculative use and for the eventual verification of values. The technique reduces the effect of long communication latencies while mitigating the burdens on the coherence protocol designer and the parallel programmer</abstract><cop>Los Alamitos</cop><pub>IEEE</pub><doi>10.1109/MM.2004.88</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 0272-1732
ispartof	IEEE MICRO, 2004-11, Vol.24 (6), p.104-109
issn	0272-1732 1937-4143
language	eng
recordid	cdi_proquest_miscellaneous_28813230
source	IEEE Electronic Library (IEL)
subjects	Acceleration Access protocols Cache Coherence Computer programming Computer science Costs Decoupling Delay Design engineering Multiprocessing Multiprocessing (computers) Optimization Performance loss Permission Predictive models Programmers Registers Tuning
title	Speculative incoherent cache protocols
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T07%3A33%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Speculative%20incoherent%20cache%20protocols&rft.jtitle=IEEE%20MICRO&rft.au=Jaehyuk%20Huh&rft.date=2004-11-01&rft.volume=24&rft.issue=6&rft.spage=104&rft.epage=109&rft.pages=104-109&rft.issn=0272-1732&rft.eissn=1937-4143&rft.coden=IEMIDZ&rft_id=info:doi/10.1109/MM.2004.88&rft_dat=%3Cproquest_RIE%3E780246691%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=196369717&rft_id=info:pmid/&rft_ieee_id=1388165&rfr_iscdi=true