Twig: Profile-Guided BTB Prefetching for Data Center Applications
Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implem...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Buch |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Ahmed Khan, Tanvir Brown, Nathan Sriraman, Akshitha Soundararajan, Niranjan Kumar, Rakesh Devietti, Joseph Subramoney, Sreenivas Pokam, Gilles Litz, Heiner Kasikci, Baris |
description | Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior. |
format | Book |
fullrecord | <record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_2987667</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_2987667</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_29876673</originalsourceid><addsrcrecordid>eNrjZHAMKc9Mt1IIKMpPy8xJ1XUvzUxJTVFwCnECCqWmpZYkZ2TmpSuk5RcpuCSWJCo4p-aVpBYpOBYU5GQmJ5Zk5ucV8zCwpiXmFKfyQmluBkU31xBnD93kosziksy8-Lz8osR4Q0MjU4N4I0sLczMzc2Ni1AAAX4gwQw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>book</recordtype></control><display><type>book</type><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><source>NORA - Norwegian Open Research Archives</source><creator>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</creator><creatorcontrib>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</creatorcontrib><description>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</description><language>eng</language><publisher>Association for Computing Machinery (ACM)</publisher><ispartof>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021</ispartof><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,308,781,886,4049,26572</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/2987667$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Ahmed Khan, Tanvir</creatorcontrib><creatorcontrib>Brown, Nathan</creatorcontrib><creatorcontrib>Sriraman, Akshitha</creatorcontrib><creatorcontrib>Soundararajan, Niranjan</creatorcontrib><creatorcontrib>Kumar, Rakesh</creatorcontrib><creatorcontrib>Devietti, Joseph</creatorcontrib><creatorcontrib>Subramoney, Sreenivas</creatorcontrib><creatorcontrib>Pokam, Gilles</creatorcontrib><creatorcontrib>Litz, Heiner</creatorcontrib><creatorcontrib>Kasikci, Baris</creatorcontrib><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><title>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture</title><description>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</description><fulltext>true</fulltext><rsrctype>book</rsrctype><creationdate>2021</creationdate><recordtype>book</recordtype><sourceid>3HK</sourceid><recordid>eNrjZHAMKc9Mt1IIKMpPy8xJ1XUvzUxJTVFwCnECCqWmpZYkZ2TmpSuk5RcpuCSWJCo4p-aVpBYpOBYU5GQmJ5Zk5ucV8zCwpiXmFKfyQmluBkU31xBnD93kosziksy8-Lz8osR4Q0MjU4N4I0sLczMzc2Ni1AAAX4gwQw</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Ahmed Khan, Tanvir</creator><creator>Brown, Nathan</creator><creator>Sriraman, Akshitha</creator><creator>Soundararajan, Niranjan</creator><creator>Kumar, Rakesh</creator><creator>Devietti, Joseph</creator><creator>Subramoney, Sreenivas</creator><creator>Pokam, Gilles</creator><creator>Litz, Heiner</creator><creator>Kasikci, Baris</creator><general>Association for Computing Machinery (ACM)</general><scope>3HK</scope></search><sort><creationdate>2021</creationdate><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><author>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_29876673</frbrgroupid><rsrctype>books</rsrctype><prefilter>books</prefilter><language>eng</language><creationdate>2021</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Ahmed Khan, Tanvir</creatorcontrib><creatorcontrib>Brown, Nathan</creatorcontrib><creatorcontrib>Sriraman, Akshitha</creatorcontrib><creatorcontrib>Soundararajan, Niranjan</creatorcontrib><creatorcontrib>Kumar, Rakesh</creatorcontrib><creatorcontrib>Devietti, Joseph</creatorcontrib><creatorcontrib>Subramoney, Sreenivas</creatorcontrib><creatorcontrib>Pokam, Gilles</creatorcontrib><creatorcontrib>Litz, Heiner</creatorcontrib><creatorcontrib>Kasikci, Baris</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ahmed Khan, Tanvir</au><au>Brown, Nathan</au><au>Sriraman, Akshitha</au><au>Soundararajan, Niranjan</au><au>Kumar, Rakesh</au><au>Devietti, Joseph</au><au>Subramoney, Sreenivas</au><au>Pokam, Gilles</au><au>Litz, Heiner</au><au>Kasikci, Baris</au><format>book</format><genre>book</genre><ristype>BOOK</ristype><atitle>Twig: Profile-Guided BTB Prefetching for Data Center Applications</atitle><btitle>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture</btitle><date>2021</date><risdate>2021</risdate><abstract>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</abstract><pub>Association for Computing Machinery (ACM)</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021 |
issn | |
language | eng |
recordid | cdi_cristin_nora_11250_2987667 |
source | NORA - Norwegian Open Research Archives |
title | Twig: Profile-Guided BTB Prefetching for Data Center Applications |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T12%3A23%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=book&rft.atitle=Twig:%20Profile-Guided%20BTB%20Prefetching%20for%20Data%20Center%20Applications&rft.btitle=MICRO-54:%2054th%20Annual%20IEEE/ACM%20International%20Symposium%20on%20Microarchitecture&rft.au=Ahmed%20Khan,%20Tanvir&rft.date=2021&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_2987667%3C/cristin_3HK%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |