Minimizing speculation overhead in a parallel recognizer for regular texts

Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Borsotti, Angelo, Breveglieri, Luca, Reghizzi, Stefano Crespi, Morzenti, Angelo
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Distributed, Parallel, and Cluster Computing
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Borsotti, Angelo Breveglieri, Luca Reghizzi, Stefano Crespi Morzenti, Angelo
description	Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.
doi_str_mv	10.48550/arxiv.2412.14975
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2412_14975</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412_14975</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2412_149753</originalsourceid><addsrcrecordid>eNqFzbEKwkAQBNBrLET9ACv3B4xJvKDWoohgZx-WuIkLl7tjc4aYrzcGe6thYIan1DKJI73PsniD0nEbpTpJo0QfdtlUXW9sueaebQWNp-JlMLCz4FqSJ-ED2AKCR0FjyIBQ4SrLPQmUToZaDQeBQF1o5mpSomlo8cuZWp1P9-NlPaq5F65R3vlXz0d9-3_xAdoNO0U</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><source>arXiv.org</source><creator>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</creator><creatorcontrib>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</creatorcontrib><description>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</description><identifier>DOI: 10.48550/arxiv.2412.14975</identifier><language>eng</language><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><creationdate>2024-12</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2412.14975$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2412.14975$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Borsotti, Angelo</creatorcontrib><creatorcontrib>Breveglieri, Luca</creatorcontrib><creatorcontrib>Reghizzi, Stefano Crespi</creatorcontrib><creatorcontrib>Morzenti, Angelo</creatorcontrib><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><description>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</description><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFzbEKwkAQBNBrLET9ACv3B4xJvKDWoohgZx-WuIkLl7tjc4aYrzcGe6thYIan1DKJI73PsniD0nEbpTpJo0QfdtlUXW9sueaebQWNp-JlMLCz4FqSJ-ED2AKCR0FjyIBQ4SrLPQmUToZaDQeBQF1o5mpSomlo8cuZWp1P9-NlPaq5F65R3vlXz0d9-3_xAdoNO0U</recordid><startdate>20241219</startdate><enddate>20241219</enddate><creator>Borsotti, Angelo</creator><creator>Breveglieri, Luca</creator><creator>Reghizzi, Stefano Crespi</creator><creator>Morzenti, Angelo</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241219</creationdate><title>Minimizing speculation overhead in a parallel recognizer for regular texts</title><author>Borsotti, Angelo ; Breveglieri, Luca ; Reghizzi, Stefano Crespi ; Morzenti, Angelo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2412_149753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><toplevel>online_resources</toplevel><creatorcontrib>Borsotti, Angelo</creatorcontrib><creatorcontrib>Breveglieri, Luca</creatorcontrib><creatorcontrib>Reghizzi, Stefano Crespi</creatorcontrib><creatorcontrib>Morzenti, Angelo</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Borsotti, Angelo</au><au>Breveglieri, Luca</au><au>Reghizzi, Stefano Crespi</au><au>Morzenti, Angelo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Minimizing speculation overhead in a parallel recognizer for regular texts</atitle><date>2024-12-19</date><risdate>2024</risdate><abstract>Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation overhead without incurring in the penalty of nondeterministic transitions or of impractically enlarged DFA machines. The algorithm is proved to be correct and theoretically efficient, because it combines the state-reduction of an NFA with the speed of deterministic transitions, thus improving on both DFA-based and NFA-based existing implementations. The practical applicability of the RI-DFA approach is confirmed by a quantitative comparison of the number of starting states for a large public benchmark of complex FAs. On multi-core computing architectures, the RI-DFA recognizer is much faster than the NFA-based one on all benchmarks, while it matches the DFA-based one on some benchmarks and performs much better on some others. The extra time cost needed to construct an RI-DFA compared to a DFA is moderate and is compatible with a practical use.</abstract><doi>10.48550/arxiv.2412.14975</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2412.14975
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2412_14975
source	arXiv.org
subjects	Computer Science - Distributed, Parallel, and Cluster Computing
title	Minimizing speculation overhead in a parallel recognizer for regular texts
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T18%3A30%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Minimizing%20speculation%20overhead%20in%20a%20parallel%20recognizer%20for%20regular%20texts&rft.au=Borsotti,%20Angelo&rft.date=2024-12-19&rft_id=info:doi/10.48550/arxiv.2412.14975&rft_dat=%3Carxiv_GOX%3E2412_14975%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true