Minimizing speculation overhead in a parallel recognizer for regular texts

Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Borsotti, Angelo, Breveglieri, Luca, Reghizzi, Stefano Crespi, Morzenti, Angelo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Borsotti, Angelo
Breveglieri, Luca
Reghizzi, Stefano Crespi
Morzenti, Angelo
description Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.
doi_str_mv 10.48550/arxiv.2412.14975
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2412_14975</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412_14975</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2412_149753</originalsourceid><addsrcrecordid>eNqFzbEKwkAQBNBrLET9ACv3B4xJvKDWoohgZx-WuIkLl7tjc4aYrzcGe6thYIan1DKJI73PsniD0nEbpTpJo0QfdtlUXW9sueaebQWNp-JlMLCz4FqSJ-ED2AKCR0FjyIBQ4SrLPQmUToZaDQeBQF1o5mpSomlo8cuZWp1P9-NlPaq5F65R3vlXz0d9-3_xAdoNO0U</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><source>arXiv.org</source><creator>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</creator><creatorcontrib>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</creatorcontrib><description>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</description><identifier>DOI: 10.48550/arxiv.2412.14975</identifier><language>eng</language><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><creationdate>2024-12</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2412.14975$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2412.14975$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Borsotti, Angelo</creatorcontrib><creatorcontrib>Breveglieri, Luca</creatorcontrib><creatorcontrib>Reghizzi, Stefano Crespi</creatorcontrib><creatorcontrib>Morzenti, Angelo</creatorcontrib><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><description>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</description><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFzbEKwkAQBNBrLET9ACv3B4xJvKDWoohgZx-WuIkLl7tjc4aYrzcGe6thYIan1DKJI73PsniD0nEbpTpJo0QfdtlUXW9sueaebQWNp-JlMLCz4FqSJ-ED2AKCR0FjyIBQ4SrLPQmUToZaDQeBQF1o5mpSomlo8cuZWp1P9-NlPaq5F65R3vlXz0d9-3_xAdoNO0U</recordid><startdate>20241219</startdate><enddate>20241219</enddate><creator>Borsotti, Angelo</creator><creator>Breveglieri, Luca</creator><creator>Reghizzi, Stefano Crespi</creator><creator>Morzenti, Angelo</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241219</creationdate><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><author>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2412_149753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><toplevel>online_resources</toplevel><creatorcontrib>Borsotti, Angelo</creatorcontrib><creatorcontrib>Breveglieri, Luca</creatorcontrib><creatorcontrib>Reghizzi, Stefano Crespi</creatorcontrib><creatorcontrib>Morzenti, Angelo</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Borsotti, Angelo</au><au>Breveglieri, Luca</au><au>Reghizzi, Stefano Crespi</au><au>Morzenti, Angelo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Minimizing speculation overhead in a parallel recognizer for regular texts</atitle><date>2024-12-19</date><risdate>2024</risdate><abstract>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</abstract><doi>10.48550/arxiv.2412.14975</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2412.14975
ispartof
issn
language eng
recordid cdi_arxiv_primary_2412_14975
source arXiv.org
subjects Computer Science - Distributed, Parallel, and Cluster Computing
title Minimizing speculation overhead in a parallel recognizer for regular texts
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T18%3A30%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Minimizing%20speculation%20overhead%20in%20a%20parallel%20recognizer%20for%20regular%20texts&rft.au=Borsotti,%20Angelo&rft.date=2024-12-19&rft_id=info:doi/10.48550/arxiv.2412.14975&rft_dat=%3Carxiv_GOX%3E2412_14975%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true