Twig: Profile-Guided BTB Prefetching for Data Center Applications

Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implem...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ahmed Khan, Tanvir, Brown, Nathan, Sriraman, Akshitha, Soundararajan, Niranjan, Kumar, Rakesh, Devietti, Joseph, Subramoney, Sreenivas, Pokam, Gilles, Litz, Heiner, Kasikci, Baris
Format: Buch
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Ahmed Khan, Tanvir
Brown, Nathan
Sriraman, Akshitha
Soundararajan, Niranjan
Kumar, Rakesh
Devietti, Joseph
Subramoney, Sreenivas
Pokam, Gilles
Litz, Heiner
Kasikci, Baris
description Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.
format Book
fullrecord <record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_2987667</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_2987667</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_29876673</originalsourceid><addsrcrecordid>eNrjZHAMKc9Mt1IIKMpPy8xJ1XUvzUxJTVFwCnECCqWmpZYkZ2TmpSuk5RcpuCSWJCo4p-aVpBYpOBYU5GQmJ5Zk5ucV8zCwpiXmFKfyQmluBkU31xBnD93kosziksy8-Lz8osR4Q0MjU4N4I0sLczMzc2Ni1AAAX4gwQw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>book</recordtype></control><display><type>book</type><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><source>NORA - Norwegian Open Research Archives</source><creator>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</creator><creatorcontrib>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</creatorcontrib><description>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</description><language>eng</language><publisher>Association for Computing Machinery (ACM)</publisher><ispartof>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021</ispartof><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,308,781,886,4049,26572</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/2987667$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Ahmed Khan, Tanvir</creatorcontrib><creatorcontrib>Brown, Nathan</creatorcontrib><creatorcontrib>Sriraman, Akshitha</creatorcontrib><creatorcontrib>Soundararajan, Niranjan</creatorcontrib><creatorcontrib>Kumar, Rakesh</creatorcontrib><creatorcontrib>Devietti, Joseph</creatorcontrib><creatorcontrib>Subramoney, Sreenivas</creatorcontrib><creatorcontrib>Pokam, Gilles</creatorcontrib><creatorcontrib>Litz, Heiner</creatorcontrib><creatorcontrib>Kasikci, Baris</creatorcontrib><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><title>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture</title><description>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</description><fulltext>true</fulltext><rsrctype>book</rsrctype><creationdate>2021</creationdate><recordtype>book</recordtype><sourceid>3HK</sourceid><recordid>eNrjZHAMKc9Mt1IIKMpPy8xJ1XUvzUxJTVFwCnECCqWmpZYkZ2TmpSuk5RcpuCSWJCo4p-aVpBYpOBYU5GQmJ5Zk5ucV8zCwpiXmFKfyQmluBkU31xBnD93kosziksy8-Lz8osR4Q0MjU4N4I0sLczMzc2Ni1AAAX4gwQw</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Ahmed Khan, Tanvir</creator><creator>Brown, Nathan</creator><creator>Sriraman, Akshitha</creator><creator>Soundararajan, Niranjan</creator><creator>Kumar, Rakesh</creator><creator>Devietti, Joseph</creator><creator>Subramoney, Sreenivas</creator><creator>Pokam, Gilles</creator><creator>Litz, Heiner</creator><creator>Kasikci, Baris</creator><general>Association for Computing Machinery (ACM)</general><scope>3HK</scope></search><sort><creationdate>2021</creationdate><title>Twig: Profile-Guided BTB Prefetching for Data Center Applications</title><author>Ahmed Khan, Tanvir ; Brown, Nathan ; Sriraman, Akshitha ; Soundararajan, Niranjan ; Kumar, Rakesh ; Devietti, Joseph ; Subramoney, Sreenivas ; Pokam, Gilles ; Litz, Heiner ; Kasikci, Baris</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_29876673</frbrgroupid><rsrctype>books</rsrctype><prefilter>books</prefilter><language>eng</language><creationdate>2021</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Ahmed Khan, Tanvir</creatorcontrib><creatorcontrib>Brown, Nathan</creatorcontrib><creatorcontrib>Sriraman, Akshitha</creatorcontrib><creatorcontrib>Soundararajan, Niranjan</creatorcontrib><creatorcontrib>Kumar, Rakesh</creatorcontrib><creatorcontrib>Devietti, Joseph</creatorcontrib><creatorcontrib>Subramoney, Sreenivas</creatorcontrib><creatorcontrib>Pokam, Gilles</creatorcontrib><creatorcontrib>Litz, Heiner</creatorcontrib><creatorcontrib>Kasikci, Baris</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ahmed Khan, Tanvir</au><au>Brown, Nathan</au><au>Sriraman, Akshitha</au><au>Soundararajan, Niranjan</au><au>Kumar, Rakesh</au><au>Devietti, Joseph</au><au>Subramoney, Sreenivas</au><au>Pokam, Gilles</au><au>Litz, Heiner</au><au>Kasikci, Baris</au><format>book</format><genre>book</genre><ristype>BOOK</ristype><atitle>Twig: Profile-Guided BTB Prefetching for Data Center Applications</atitle><btitle>MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture</btitle><date>2021</date><risdate>2021</risdate><abstract>Modern data center applications have deep software stacks, with instruction footprints that are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently prefetch instructions into the I-cache despite large application footprints, modern server-class processors implement a decoupled frontend with Fetch Directed Instruction Prefetching (FDIP). In this work, we first characterize the limitations of a decoupled frontend processor with FDIP and find that FDIP suffers from significant Branch Target Buffer (BTB) misses. We also find that existing techniques (e.g., stream prefetchers and predecoders) are unable to mitigate these misses, as they rely on an incomplete understanding of a program’s branching behavior.</abstract><pub>Association for Computing Machinery (ACM)</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021
issn
language eng
recordid cdi_cristin_nora_11250_2987667
source NORA - Norwegian Open Research Archives
title Twig: Profile-Guided BTB Prefetching for Data Center Applications
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T12%3A23%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=book&rft.atitle=Twig:%20Profile-Guided%20BTB%20Prefetching%20for%20Data%20Center%20Applications&rft.btitle=MICRO-54:%2054th%20Annual%20IEEE/ACM%20International%20Symposium%20on%20Microarchitecture&rft.au=Ahmed%20Khan,%20Tanvir&rft.date=2021&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_2987667%3C/cristin_3HK%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true