Noisy Bloom Filters for Multi-Set Membership Testing

This paper is on designing a compact data structure for multi-set membership testing allowing fast set querying. Multi-set membership testing is a fundamental operation for computing systems and networking applications. Most existing schemes for multi-set membership testing are built upon Bloom filt...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Performance evaluation review 2016-06, Vol.44 (1), p.139-151
Hauptverfasser: Dai, Haipeng, Zhong, Yuankun, Liu, Alex X., Wang, Wei, Li, Meng
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 151
container_issue 1
container_start_page 139
container_title Performance evaluation review
container_volume 44
creator Dai, Haipeng
Zhong, Yuankun
Liu, Alex X.
Wang, Wei
Li, Meng
description This paper is on designing a compact data structure for multi-set membership testing allowing fast set querying. Multi-set membership testing is a fundamental operation for computing systems and networking applications. Most existing schemes for multi-set membership testing are built upon Bloom filter, and fall short in either storage space cost or query speed. To address this issue, in this paper we propose Noisy Bloom Filter (NBF) and Error Corrected Noisy Bloom Filter (NBF-E) for multi-set membership testing. For theoretical analysis, we optimize their classification failure rate and false positive rate, and present criteria for selection between NBF and NBF-E. The key novelty of NBF and NBF-E is to store set ID information in a compact but noisy way that allows fast recording and querying, and use denoising method for querying. Especially, NBF-E incorporates asymmetric error-correcting coding technique into NBF to enhance the resilience of query results to noise by revealing and leveraging the asymmetric error nature of query results. To evaluate NBF and NBF-E in comparison with prior art, we conducted experiments using real-world network traces. The results show that NBF and NBF-E significantly advance the state-of-the-art on multi-set membership testing.
doi_str_mv 10.1145/2964791.2901451
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_2964791_2901451</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_2964791_2901451</sourcerecordid><originalsourceid>FETCH-LOGICAL-c861-3bc0b08ece7faca1a9a6782baf0379f4483c29caa22f153c07aa58797fcc91413</originalsourceid><addsrcrecordid>eNotzzFPwzAUBGAPIFFKZ1b_gbR-thP7jVBRQGphaHbrxbLBKCGVHYb-e4rIdLobTvoYuwexBtD1RmKjDcJaorhUuGILAY2qakS8YbelfAkBRoJdMP02pnLmj_04DnyX-inkwuOY-eGnn1J1DBM_hKG7rJ_pxNtQpvT9cceuI_UlrOZcsnb31G5fqv378-v2YV9520ClOi86YYMPJpInIKTGWNlRFMpg1NoqL9ETSRmhVl4YotoaNNF7BA1qyTb_tz6PpeQQ3SmngfLZgXB_Tjc73exUv51sRt8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Noisy Bloom Filters for Multi-Set Membership Testing</title><source>ACM Digital Library</source><creator>Dai, Haipeng ; Zhong, Yuankun ; Liu, Alex X. ; Wang, Wei ; Li, Meng</creator><creatorcontrib>Dai, Haipeng ; Zhong, Yuankun ; Liu, Alex X. ; Wang, Wei ; Li, Meng</creatorcontrib><description>This paper is on designing a compact data structure for multi-set membership testing allowing fast set querying. Multi-set membership testing is a fundamental operation for computing systems and networking applications. Most existing schemes for multi-set membership testing are built upon Bloom filter, and fall short in either storage space cost or query speed. To address this issue, in this paper we propose Noisy Bloom Filter (NBF) and Error Corrected Noisy Bloom Filter (NBF-E) for multi-set membership testing. For theoretical analysis, we optimize their classification failure rate and false positive rate, and present criteria for selection between NBF and NBF-E. The key novelty of NBF and NBF-E is to store set ID information in a compact but noisy way that allows fast recording and querying, and use denoising method for querying. Especially, NBF-E incorporates asymmetric error-correcting coding technique into NBF to enhance the resilience of query results to noise by revealing and leveraging the asymmetric error nature of query results. To evaluate NBF and NBF-E in comparison with prior art, we conducted experiments using real-world network traces. The results show that NBF and NBF-E significantly advance the state-of-the-art on multi-set membership testing.</description><identifier>ISSN: 0163-5999</identifier><identifier>DOI: 10.1145/2964791.2901451</identifier><language>eng</language><ispartof>Performance evaluation review, 2016-06, Vol.44 (1), p.139-151</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c861-3bc0b08ece7faca1a9a6782baf0379f4483c29caa22f153c07aa58797fcc91413</citedby><cites>FETCH-LOGICAL-c861-3bc0b08ece7faca1a9a6782baf0379f4483c29caa22f153c07aa58797fcc91413</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Dai, Haipeng</creatorcontrib><creatorcontrib>Zhong, Yuankun</creatorcontrib><creatorcontrib>Liu, Alex X.</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><creatorcontrib>Li, Meng</creatorcontrib><title>Noisy Bloom Filters for Multi-Set Membership Testing</title><title>Performance evaluation review</title><description>This paper is on designing a compact data structure for multi-set membership testing allowing fast set querying. Multi-set membership testing is a fundamental operation for computing systems and networking applications. Most existing schemes for multi-set membership testing are built upon Bloom filter, and fall short in either storage space cost or query speed. To address this issue, in this paper we propose Noisy Bloom Filter (NBF) and Error Corrected Noisy Bloom Filter (NBF-E) for multi-set membership testing. For theoretical analysis, we optimize their classification failure rate and false positive rate, and present criteria for selection between NBF and NBF-E. The key novelty of NBF and NBF-E is to store set ID information in a compact but noisy way that allows fast recording and querying, and use denoising method for querying. Especially, NBF-E incorporates asymmetric error-correcting coding technique into NBF to enhance the resilience of query results to noise by revealing and leveraging the asymmetric error nature of query results. To evaluate NBF and NBF-E in comparison with prior art, we conducted experiments using real-world network traces. The results show that NBF and NBF-E significantly advance the state-of-the-art on multi-set membership testing.</description><issn>0163-5999</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNotzzFPwzAUBGAPIFFKZ1b_gbR-thP7jVBRQGphaHbrxbLBKCGVHYb-e4rIdLobTvoYuwexBtD1RmKjDcJaorhUuGILAY2qakS8YbelfAkBRoJdMP02pnLmj_04DnyX-inkwuOY-eGnn1J1DBM_hKG7rJ_pxNtQpvT9cceuI_UlrOZcsnb31G5fqv378-v2YV9520ClOi86YYMPJpInIKTGWNlRFMpg1NoqL9ETSRmhVl4YotoaNNF7BA1qyTb_tz6PpeQQ3SmngfLZgXB_Tjc73exUv51sRt8</recordid><startdate>20160630</startdate><enddate>20160630</enddate><creator>Dai, Haipeng</creator><creator>Zhong, Yuankun</creator><creator>Liu, Alex X.</creator><creator>Wang, Wei</creator><creator>Li, Meng</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20160630</creationdate><title>Noisy Bloom Filters for Multi-Set Membership Testing</title><author>Dai, Haipeng ; Zhong, Yuankun ; Liu, Alex X. ; Wang, Wei ; Li, Meng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c861-3bc0b08ece7faca1a9a6782baf0379f4483c29caa22f153c07aa58797fcc91413</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Dai, Haipeng</creatorcontrib><creatorcontrib>Zhong, Yuankun</creatorcontrib><creatorcontrib>Liu, Alex X.</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><creatorcontrib>Li, Meng</creatorcontrib><collection>CrossRef</collection><jtitle>Performance evaluation review</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dai, Haipeng</au><au>Zhong, Yuankun</au><au>Liu, Alex X.</au><au>Wang, Wei</au><au>Li, Meng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Noisy Bloom Filters for Multi-Set Membership Testing</atitle><jtitle>Performance evaluation review</jtitle><date>2016-06-30</date><risdate>2016</risdate><volume>44</volume><issue>1</issue><spage>139</spage><epage>151</epage><pages>139-151</pages><issn>0163-5999</issn><abstract>This paper is on designing a compact data structure for multi-set membership testing allowing fast set querying. Multi-set membership testing is a fundamental operation for computing systems and networking applications. Most existing schemes for multi-set membership testing are built upon Bloom filter, and fall short in either storage space cost or query speed. To address this issue, in this paper we propose Noisy Bloom Filter (NBF) and Error Corrected Noisy Bloom Filter (NBF-E) for multi-set membership testing. For theoretical analysis, we optimize their classification failure rate and false positive rate, and present criteria for selection between NBF and NBF-E. The key novelty of NBF and NBF-E is to store set ID information in a compact but noisy way that allows fast recording and querying, and use denoising method for querying. Especially, NBF-E incorporates asymmetric error-correcting coding technique into NBF to enhance the resilience of query results to noise by revealing and leveraging the asymmetric error nature of query results. To evaluate NBF and NBF-E in comparison with prior art, we conducted experiments using real-world network traces. The results show that NBF and NBF-E significantly advance the state-of-the-art on multi-set membership testing.</abstract><doi>10.1145/2964791.2901451</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0163-5999
ispartof Performance evaluation review, 2016-06, Vol.44 (1), p.139-151
issn 0163-5999
language eng
recordid cdi_crossref_primary_10_1145_2964791_2901451
source ACM Digital Library
title Noisy Bloom Filters for Multi-Set Membership Testing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T05%3A20%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Noisy%20Bloom%20Filters%20for%20Multi-Set%20Membership%20Testing&rft.jtitle=Performance%20evaluation%20review&rft.au=Dai,%20Haipeng&rft.date=2016-06-30&rft.volume=44&rft.issue=1&rft.spage=139&rft.epage=151&rft.pages=139-151&rft.issn=0163-5999&rft_id=info:doi/10.1145/2964791.2901451&rft_dat=%3Ccrossref%3E10_1145_2964791_2901451%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true