TrackFormers: In Search of Transformer-Based Particle Tracking for the High-Luminosity LHC Era

High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements force revisions to almost every step of the data processing pipeline. One such step in...

Bibliographic details
Published in:	arXiv.org 2024-12
Main authors:	Caron, Sascha; Dobreva, Nadezhda; Antonio Ferrer Sánchez; Martín-Guerrero, José D; Odyurt, Uraz; Roberto Ruiz de Austri Bazan; Wolffs, Zef; Zhao, Yue
Format:	Article
Language:	eng
Subjects:	Accuracy; Data processing; Datasets; Large language models; Luminosity; Machine learning; Particle tracking; Performance prediction
Online access:	Full text

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Caron, Sascha; Dobreva, Nadezhda; Antonio Ferrer Sánchez; Martín-Guerrero, José D; Odyurt, Uraz; Roberto Ruiz de Austri Bazan; Wolffs, Zef; Zhao, Yue
description	High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements force revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a. tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier-based Transformer solution as a practical approach for the task of tracking.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3078836712</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3078836712</sourcerecordid><originalsourceid>FETCH-proquest_journals_30788367123</originalsourceid><addsrcrecordid>eNqNy08LgjAcxvERBEn5Hn7QeTC3_EPHRDHwENQ5GTZ1pltt89C7T6QX0Ok5fJ7vCnmUsQAnB0o3yLe2J4TQKKZhyDx0vxleP3NtRmHsEc4KroKbugPdwEzKNgvhE7fiARdunKwHAUslVQszg-sEFLLtcDmNUmkr3QfKIoXM8B1aN3ywwv_tFu3z7JYW-GX0exLWVb2ejJqpYiROEhbFAWX_vb6-eULz</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3078836712</pqid></control><display><type>article</type><title>TrackFormers: In Search of Transformer-Based Particle Tracking for the High-Luminosity LHC Era</title><source>Free E- Journals</source><creator>Caron, Sascha ; Dobreva, Nadezhda ; Antonio Ferrer Sánchez ; Martín-Guerrero, José D ; Odyurt, Uraz ; Roberto Ruiz de Austri Bazan ; Wolffs, Zef ; Zhao, Yue</creator><creatorcontrib>Caron, Sascha ; Dobreva, Nadezhda ; Antonio Ferrer Sánchez ; Martín-Guerrero, José D ; Odyurt, Uraz ; Roberto Ruiz de Austri Bazan ; Wolffs, Zef ; Zhao, Yue</creatorcontrib><description>High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements force revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a. tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier-based Transformer solution as a practical approach for the task of tracking.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accuracy ; Data processing ; Datasets ; Large language models ; Luminosity ; Machine learning ; Particle tracking ; Performance prediction</subject><ispartof>arXiv.org, 2024-12</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><addata><format>book</format><genre>document</genre><ristype>GEN</ristype><jtitle>arXiv.org</jtitle><date>2024-12-16</date><risdate>2024</risdate><eissn>2331-8422</eissn><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-12
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_3078836712
source	Free E- Journals
subjects	Accuracy; Data processing; Datasets; Large language models; Luminosity; Machine learning; Particle tracking; Performance prediction
title	TrackFormers: In Search of Transformer-Based Particle Tracking for the High-Luminosity LHC Era
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T14%3A23%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=TrackFormers:%20In%20Search%20of%20Transformer-Based%20Particle%20Tracking%20for%20the%20High-Luminosity%20LHC%20Era&rft.jtitle=arXiv.org&rft.au=Caron,%20Sascha&rft.date=2024-12-16&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3078836712%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3078836712&rft_id=info:pmid/&rfr_iscdi=true