Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm

Multiple string matching is often completed under the presence of U- or V-uncertain-strings,or combinations thereof.Recognizing large numbers of strings with U-, V-,and U-V-uncertain-strings,including the interleaving of two or more uncertain strings,is important to thoroughly gathering useful infor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Science China. Information sciences 2011-08, Vol.54 (8), p.1562-1571
Hauptverfasser: Hu, Yue, Gao, QingShi, Guo, Li, Wang, PeiFeng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1571
container_issue 8
container_start_page 1562
container_title Science China. Information sciences
container_volume 54
creator Hu, Yue
Gao, QingShi
Guo, Li
Wang, PeiFeng
description Multiple string matching is often completed under the presence of U- or V-uncertain-strings,or combinations thereof.Recognizing large numbers of strings with U-, V-,and U-V-uncertain-strings,including the interleaving of two or more uncertain strings,is important to thoroughly gathering useful information and detecting harmful information.This paper proposes a complete automaton and its high-speed construction algorithm for large-scale U-, V-,and U-V-uncertain multiple strings,including two or more uncertain strings interlaced with one another.The maximum number of parallel complete automation of the V-uncertain string is also given.This paper reveals that there are two kinds of pretermissions,i.e.,similarly-connected and interlaced-string pretermissions,and that mistake may appear in the matching of the regular expressions,or states in the automaton may increase in number,if the intersection of the U-uncertain strings sets and the homologous subsequent special point in the U-uncertain strings sets are not eliminated from the whole system.
doi_str_mv 10.1007/s11432-011-4363-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1671488737</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>38698935</cqvip_id><sourcerecordid>1671488737</sourcerecordid><originalsourceid>FETCH-LOGICAL-c327t-1e35d8fcf3247555f7e31a3bec68a88314f0fde189b6f9eae9eb7877defb84733</originalsourceid><addsrcrecordid>eNp9kbFO3TAUhiNEJRDwAN1cdemSEucksTNWCCgSUpcisVm-zvGNUWIH2xnK03Oii4rEgBd7-P7_2J-L4iuvfvKqEpeJ8wbqsuK8bKCD8uWoOOWy60ve8_6Yzp1oSgHweFJcpPRU0QKoaiFPC3frtM_MhHmZMCPTaw6zzsEzGyJbvcGYtfNsXqfsCGEpR-f3jBgzbgftB-ZyYqPbjywtiAOVeaJWkx3V6GkfosvjfF58sXpKePG2nxUPN9d_r36X939u765-3ZcGapFLjtAO0hoLdSPatrUCgWvYoemklhJ4Yys7IJf9rrM9auxxJ6QQA9qdbOiRZ8WPQ-8Sw_OKKavZJYPTpD2GNSmSwRspBQhCv39An8IaPd1O1T0J3LiOKH6gTAwpRbRqiW7W8Z_ildr8q4N_Rf7V5l-9UKY-ZNKy6cL43vxZ6NvboDH4_TPl_k8C-k3ZQwuvzZuV6g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918614886</pqid></control><display><type>article</type><title>Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm</title><source>Springer Nature - Complete Springer Journals</source><source>ProQuest Central UK/Ireland</source><source>Alma/SFX Local Collection</source><source>ProQuest Central</source><creator>Hu, Yue ; Gao, QingShi ; Guo, Li ; Wang, PeiFeng</creator><creatorcontrib>Hu, Yue ; Gao, QingShi ; Guo, Li ; Wang, PeiFeng</creatorcontrib><description>Multiple string matching is often completed under the presence of U- or V-uncertain-strings,or combinations thereof.Recognizing large numbers of strings with U-, V-,and U-V-uncertain-strings,including the interleaving of two or more uncertain strings,is important to thoroughly gathering useful information and detecting harmful information.This paper proposes a complete automaton and its high-speed construction algorithm for large-scale U-, V-,and U-V-uncertain multiple strings,including two or more uncertain strings interlaced with one another.The maximum number of parallel complete automation of the V-uncertain string is also given.This paper reveals that there are two kinds of pretermissions,i.e.,similarly-connected and interlaced-string pretermissions,and that mistake may appear in the matching of the regular expressions,or states in the automaton may increase in number,if the intersection of the U-uncertain strings sets and the homologous subsequent special point in the U-uncertain strings sets are not eliminated from the whole system.</description><identifier>ISSN: 1674-733X</identifier><identifier>EISSN: 1869-1919</identifier><identifier>DOI: 10.1007/s11432-011-4363-z</identifier><language>eng</language><publisher>Heidelberg: SP Science China Press</publisher><subject>Algorithms ; Automation ; Computer Science ; Construction ; High speed ; Information Systems and Communication Service ; Research Papers ; String matching ; Strings ; 信息检测 ; 字符串匹配 ; 巨人 ; 正则表达式 ; 算法 ; 自动机</subject><ispartof>Science China. Information sciences, 2011-08, Vol.54 (8), p.1562-1571</ispartof><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2011</rights><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2011.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c327t-1e35d8fcf3247555f7e31a3bec68a88314f0fde189b6f9eae9eb7877defb84733</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84009A/84009A.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11432-011-4363-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2918614886?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,777,781,21369,27905,27906,33725,33726,41469,42538,43786,51300,64364,64366,64368,72218</link.rule.ids></links><search><creatorcontrib>Hu, Yue</creatorcontrib><creatorcontrib>Gao, QingShi</creatorcontrib><creatorcontrib>Guo, Li</creatorcontrib><creatorcontrib>Wang, PeiFeng</creatorcontrib><title>Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm</title><title>Science China. Information sciences</title><addtitle>Sci. China Inf. Sci</addtitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><description>Multiple string matching is often completed under the presence of U- or V-uncertain-strings,or combinations thereof.Recognizing large numbers of strings with U-, V-,and U-V-uncertain-strings,including the interleaving of two or more uncertain strings,is important to thoroughly gathering useful information and detecting harmful information.This paper proposes a complete automaton and its high-speed construction algorithm for large-scale U-, V-,and U-V-uncertain multiple strings,including two or more uncertain strings interlaced with one another.The maximum number of parallel complete automation of the V-uncertain string is also given.This paper reveals that there are two kinds of pretermissions,i.e.,similarly-connected and interlaced-string pretermissions,and that mistake may appear in the matching of the regular expressions,or states in the automaton may increase in number,if the intersection of the U-uncertain strings sets and the homologous subsequent special point in the U-uncertain strings sets are not eliminated from the whole system.</description><subject>Algorithms</subject><subject>Automation</subject><subject>Computer Science</subject><subject>Construction</subject><subject>High speed</subject><subject>Information Systems and Communication Service</subject><subject>Research Papers</subject><subject>String matching</subject><subject>Strings</subject><subject>信息检测</subject><subject>字符串匹配</subject><subject>巨人</subject><subject>正则表达式</subject><subject>算法</subject><subject>自动机</subject><issn>1674-733X</issn><issn>1869-1919</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kbFO3TAUhiNEJRDwAN1cdemSEucksTNWCCgSUpcisVm-zvGNUWIH2xnK03Oii4rEgBd7-P7_2J-L4iuvfvKqEpeJ8wbqsuK8bKCD8uWoOOWy60ve8_6Yzp1oSgHweFJcpPRU0QKoaiFPC3frtM_MhHmZMCPTaw6zzsEzGyJbvcGYtfNsXqfsCGEpR-f3jBgzbgftB-ZyYqPbjywtiAOVeaJWkx3V6GkfosvjfF58sXpKePG2nxUPN9d_r36X939u765-3ZcGapFLjtAO0hoLdSPatrUCgWvYoemklhJ4Yys7IJf9rrM9auxxJ6QQA9qdbOiRZ8WPQ-8Sw_OKKavZJYPTpD2GNSmSwRspBQhCv39An8IaPd1O1T0J3LiOKH6gTAwpRbRqiW7W8Z_ildr8q4N_Rf7V5l-9UKY-ZNKy6cL43vxZ6NvboDH4_TPl_k8C-k3ZQwuvzZuV6g</recordid><startdate>20110801</startdate><enddate>20110801</enddate><creator>Hu, Yue</creator><creator>Gao, QingShi</creator><creator>Guo, Li</creator><creator>Wang, PeiFeng</creator><general>SP Science China Press</general><general>Springer Nature B.V</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>7SC</scope><scope>8FD</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20110801</creationdate><title>Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm</title><author>Hu, Yue ; Gao, QingShi ; Guo, Li ; Wang, PeiFeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c327t-1e35d8fcf3247555f7e31a3bec68a88314f0fde189b6f9eae9eb7877defb84733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Algorithms</topic><topic>Automation</topic><topic>Computer Science</topic><topic>Construction</topic><topic>High speed</topic><topic>Information Systems and Communication Service</topic><topic>Research Papers</topic><topic>String matching</topic><topic>Strings</topic><topic>信息检测</topic><topic>字符串匹配</topic><topic>巨人</topic><topic>正则表达式</topic><topic>算法</topic><topic>自动机</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yue</creatorcontrib><creatorcontrib>Gao, QingShi</creatorcontrib><creatorcontrib>Guo, Li</creatorcontrib><creatorcontrib>Wang, PeiFeng</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Science China. Information sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Yue</au><au>Gao, QingShi</au><au>Guo, Li</au><au>Wang, PeiFeng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm</atitle><jtitle>Science China. Information sciences</jtitle><stitle>Sci. China Inf. Sci</stitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><date>2011-08-01</date><risdate>2011</risdate><volume>54</volume><issue>8</issue><spage>1562</spage><epage>1571</epage><pages>1562-1571</pages><issn>1674-733X</issn><eissn>1869-1919</eissn><abstract>Multiple string matching is often completed under the presence of U- or V-uncertain-strings,or combinations thereof.Recognizing large numbers of strings with U-, V-,and U-V-uncertain-strings,including the interleaving of two or more uncertain strings,is important to thoroughly gathering useful information and detecting harmful information.This paper proposes a complete automaton and its high-speed construction algorithm for large-scale U-, V-,and U-V-uncertain multiple strings,including two or more uncertain strings interlaced with one another.The maximum number of parallel complete automation of the V-uncertain string is also given.This paper reveals that there are two kinds of pretermissions,i.e.,similarly-connected and interlaced-string pretermissions,and that mistake may appear in the matching of the regular expressions,or states in the automaton may increase in number,if the intersection of the U-uncertain strings sets and the homologous subsequent special point in the U-uncertain strings sets are not eliminated from the whole system.</abstract><cop>Heidelberg</cop><pub>SP Science China Press</pub><doi>10.1007/s11432-011-4363-z</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1674-733X
ispartof Science China. Information sciences, 2011-08, Vol.54 (8), p.1562-1571
issn 1674-733X
1869-1919
language eng
recordid cdi_proquest_miscellaneous_1671488737
source Springer Nature - Complete Springer Journals; ProQuest Central UK/Ireland; Alma/SFX Local Collection; ProQuest Central
subjects Algorithms
Automation
Computer Science
Construction
High speed
Information Systems and Communication Service
Research Papers
String matching
Strings
信息检测
字符串匹配
巨人
正则表达式
算法
自动机
title Giant complete automaton for uncertain multiple string matching and its high speed construction algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T14%3A04%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Giant%20complete%20automaton%20for%20uncertain%20multiple%20string%20matching%20and%20its%20high%20speed%20construction%20algorithm&rft.jtitle=Science%20China.%20Information%20sciences&rft.au=Hu,%20Yue&rft.date=2011-08-01&rft.volume=54&rft.issue=8&rft.spage=1562&rft.epage=1571&rft.pages=1562-1571&rft.issn=1674-733X&rft.eissn=1869-1919&rft_id=info:doi/10.1007/s11432-011-4363-z&rft_dat=%3Cproquest_cross%3E1671488737%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2918614886&rft_id=info:pmid/&rft_cqvip_id=38698935&rfr_iscdi=true