Indexing Uncertain Data in General Metric Spaces

In this study, we deal with the problem of efficiently answering range queries over uncertain objects in a general metric space. In this study, an uncertain object is an object that always exists but its actual value is uncertain and modeled by a multivariate probability density function. As a major...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on knowledge and data engineering 2012-09, Vol.24 (9), p.1640-1657
Hauptverfasser: Angiulli, Fabrizio, Fassetti, Fabio
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1657
container_issue 9
container_start_page 1640
container_title IEEE transactions on knowledge and data engineering
container_volume 24
creator Angiulli, Fabrizio
Fassetti, Fabio
description In this study, we deal with the problem of efficiently answering range queries over uncertain objects in a general metric space. In this study, an uncertain object is an object that always exists but its actual value is uncertain and modeled by a multivariate probability density function. As a major contribution, this is the first work providing an effective technique for indexing uncertain objects coming from general metric spaces. We generalize the reverse triangle inequality to the probabilistic setting in order to exploit it as a discard condition. Then, we introduce a novel pivot-based indexing technique, called UP-index, and show how it can be employed to speed up range query computation. Importantly, the candidate selection phase of our technique is able to noticeably reduce the set of candidates with little time requirements. Finally, we provide a criterion to measure the quality of a set of pivots and study the problem of selecting a good set of pivots according to the introduced criterion. We report some intractability results and then design an approximate algorithm with statistical guarantees for selecting pivots. Experimental results validate the effectiveness of the proposed approach and reveal that the introduced technique may be even preferable to indexing techniques specifically designed for the euclidean space.
doi_str_mv 10.1109/TKDE.2011.93
format Article
fullrecord <record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_ieee_primary_6247408</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6247408</ieee_id><sourcerecordid>10_1109_TKDE_2011_93</sourcerecordid><originalsourceid>FETCH-LOGICAL-c255t-bbe76285d1a649bcbefdff390a46d33e37e8a6ac422b89b72675bc45516dcc903</originalsourceid><addsrcrecordid>eNo9jztPwzAURi0EEqWwsbHkB5Bwr98eUV9UFDHQzpHt3KCgEio7A_x7UhUxnW84-qTD2C1ChQjuYfs8X1QcECsnztgElbIlR4fn4waJpRTSXLKrnD8AwBqLEwbrvqHvrn8vdn2kNPiuL-Z-8MXIFfWU_L54oSF1sXg7-Ej5ml20fp_p5o9TtlsutrOncvO6Ws8eN2XkSg1lCGQ0t6pBr6ULMVDbtK1w4KVuhCBhyHrto-Q8WBcM10aFKJVC3cToQEzZ_ek3pq-cE7X1IXWfPv3UCPWxtj7W1sfa2olRvzvpHRH9q5pLI8GKX_UFT5s</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Indexing Uncertain Data in General Metric Spaces</title><source>IEEE Electronic Library (IEL)</source><creator>Angiulli, Fabrizio ; Fassetti, Fabio</creator><creatorcontrib>Angiulli, Fabrizio ; Fassetti, Fabio</creatorcontrib><description>In this study, we deal with the problem of efficiently answering range queries over uncertain objects in a general metric space. In this study, an uncertain object is an object that always exists but its actual value is uncertain and modeled by a multivariate probability density function. As a major contribution, this is the first work providing an effective technique for indexing uncertain objects coming from general metric spaces. We generalize the reverse triangle inequality to the probabilistic setting in order to exploit it as a discard condition. Then, we introduce a novel pivot-based indexing technique, called UP-index, and show how it can be employed to speed up range query computation. Importantly, the candidate selection phase of our technique is able to noticeably reduce the set of candidates with little time requirements. Finally, we provide a criterion to measure the quality of a set of pivots and study the problem of selecting a good set of pivots according to the introduced criterion. We report some intractability results and then design an approximate algorithm with statistical guarantees for selecting pivots. Experimental results validate the effectiveness of the proposed approach and reveal that the introduced technique may be even preferable to indexing techniques specifically designed for the euclidean space.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2011.93</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>IEEE</publisher><subject>Extraterrestrial measurements ; Histograms ; Indexing ; metric spaces ; Probability density function ; uncertain data ; Upper bound</subject><ispartof>IEEE transactions on knowledge and data engineering, 2012-09, Vol.24 (9), p.1640-1657</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c255t-bbe76285d1a649bcbefdff390a46d33e37e8a6ac422b89b72675bc45516dcc903</citedby><cites>FETCH-LOGICAL-c255t-bbe76285d1a649bcbefdff390a46d33e37e8a6ac422b89b72675bc45516dcc903</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6247408$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6247408$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Angiulli, Fabrizio</creatorcontrib><creatorcontrib>Fassetti, Fabio</creatorcontrib><title>Indexing Uncertain Data in General Metric Spaces</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>In this study, we deal with the problem of efficiently answering range queries over uncertain objects in a general metric space. In this study, an uncertain object is an object that always exists but its actual value is uncertain and modeled by a multivariate probability density function. As a major contribution, this is the first work providing an effective technique for indexing uncertain objects coming from general metric spaces. We generalize the reverse triangle inequality to the probabilistic setting in order to exploit it as a discard condition. Then, we introduce a novel pivot-based indexing technique, called UP-index, and show how it can be employed to speed up range query computation. Importantly, the candidate selection phase of our technique is able to noticeably reduce the set of candidates with little time requirements. Finally, we provide a criterion to measure the quality of a set of pivots and study the problem of selecting a good set of pivots according to the introduced criterion. We report some intractability results and then design an approximate algorithm with statistical guarantees for selecting pivots. Experimental results validate the effectiveness of the proposed approach and reveal that the introduced technique may be even preferable to indexing techniques specifically designed for the euclidean space.</description><subject>Extraterrestrial measurements</subject><subject>Histograms</subject><subject>Indexing</subject><subject>metric spaces</subject><subject>Probability density function</subject><subject>uncertain data</subject><subject>Upper bound</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9jztPwzAURi0EEqWwsbHkB5Bwr98eUV9UFDHQzpHt3KCgEio7A_x7UhUxnW84-qTD2C1ChQjuYfs8X1QcECsnztgElbIlR4fn4waJpRTSXLKrnD8AwBqLEwbrvqHvrn8vdn2kNPiuL-Z-8MXIFfWU_L54oSF1sXg7-Ej5ml20fp_p5o9TtlsutrOncvO6Ws8eN2XkSg1lCGQ0t6pBr6ULMVDbtK1w4KVuhCBhyHrto-Q8WBcM10aFKJVC3cToQEzZ_ek3pq-cE7X1IXWfPv3UCPWxtj7W1sfa2olRvzvpHRH9q5pLI8GKX_UFT5s</recordid><startdate>20120901</startdate><enddate>20120901</enddate><creator>Angiulli, Fabrizio</creator><creator>Fassetti, Fabio</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20120901</creationdate><title>Indexing Uncertain Data in General Metric Spaces</title><author>Angiulli, Fabrizio ; Fassetti, Fabio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c255t-bbe76285d1a649bcbefdff390a46d33e37e8a6ac422b89b72675bc45516dcc903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Extraterrestrial measurements</topic><topic>Histograms</topic><topic>Indexing</topic><topic>metric spaces</topic><topic>Probability density function</topic><topic>uncertain data</topic><topic>Upper bound</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Angiulli, Fabrizio</creatorcontrib><creatorcontrib>Fassetti, Fabio</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Angiulli, Fabrizio</au><au>Fassetti, Fabio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Indexing Uncertain Data in General Metric Spaces</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2012-09-01</date><risdate>2012</risdate><volume>24</volume><issue>9</issue><spage>1640</spage><epage>1657</epage><pages>1640-1657</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>In this study, we deal with the problem of efficiently answering range queries over uncertain objects in a general metric space. In this study, an uncertain object is an object that always exists but its actual value is uncertain and modeled by a multivariate probability density function. As a major contribution, this is the first work providing an effective technique for indexing uncertain objects coming from general metric spaces. We generalize the reverse triangle inequality to the probabilistic setting in order to exploit it as a discard condition. Then, we introduce a novel pivot-based indexing technique, called UP-index, and show how it can be employed to speed up range query computation. Importantly, the candidate selection phase of our technique is able to noticeably reduce the set of candidates with little time requirements. Finally, we provide a criterion to measure the quality of a set of pivots and study the problem of selecting a good set of pivots according to the introduced criterion. We report some intractability results and then design an approximate algorithm with statistical guarantees for selecting pivots. Experimental results validate the effectiveness of the proposed approach and reveal that the introduced technique may be even preferable to indexing techniques specifically designed for the euclidean space.</abstract><pub>IEEE</pub><doi>10.1109/TKDE.2011.93</doi><tpages>18</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1041-4347
ispartof IEEE transactions on knowledge and data engineering, 2012-09, Vol.24 (9), p.1640-1657
issn 1041-4347
1558-2191
language eng
recordid cdi_ieee_primary_6247408
source IEEE Electronic Library (IEL)
subjects Extraterrestrial measurements
Histograms
Indexing
metric spaces
Probability density function
uncertain data
Upper bound
title Indexing Uncertain Data in General Metric Spaces
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T00%3A01%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Indexing%20Uncertain%20Data%20in%20General%20Metric%20Spaces&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Angiulli,%20Fabrizio&rft.date=2012-09-01&rft.volume=24&rft.issue=9&rft.spage=1640&rft.epage=1657&rft.pages=1640-1657&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2011.93&rft_dat=%3Ccrossref_RIE%3E10_1109_TKDE_2011_93%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6247408&rfr_iscdi=true