Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic

In the ( 1 + ε , r ) - approximate near-neighbor problem for curves (ANNC) under some similarity measure δ , the goal is to construct a data structure for a given set C of curves that supports approximate near-neighbor queries: Given a query curve Q , if there exists a curve C ∈ C such that δ ( Q ,...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Algorithmica 2023-05, Vol.85 (5), p.1490-1519
Hauptverfasser:	Filtser, Arnold, Filtser, Omrit, Katz, Matthew J.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithm Analysis and Problem Complexity Algorithms Computer Science Computer Systems Organization and Communication Networks Data structures Data Structures and Information Theory Mathematics of Computing Queries Similarity Similarity measures Theory of Computation
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1519
container_issue	5
container_start_page	1490
container_title	Algorithmica
container_volume	85
creator	Filtser, Arnold Filtser, Omrit Katz, Matthew J.
description	In the ( 1 + ε , r ) - approximate near-neighbor problem for curves (ANNC) under some similarity measure δ , the goal is to construct a data structure for a given set C of curves that supports approximate near-neighbor queries: Given a query curve Q , if there exists a curve C ∈ C such that δ ( Q , C ) ≤ r , then return a curve C ′ ∈ C with δ ( Q , C ′ ) ≤ ( 1 + ε ) r . There exists an efficient reduction from the ( 1 + ε ) - approximate nearest-neighbor problem to ANNC, where in the former problem the answer to a query is a curve C ∈ C with δ ( Q , C ) ≤ ( 1 + ε ) · δ ( Q , C ∗ ) , where C ∗ is the curve of C most similar to Q . Given a set C of n curves, each consisting of m points in d dimensions, we construct a data structure for ANNC that uses n · O ( 1 ε ) md storage space and has O ( md ) query time (for a query curve of length m ), where the similarity measure between two curves is their discrete Fréchet or dynamic time warping distance. Our method is simple to implement, deterministic, and results in an exponential improvement in both query time and storage space compared to all previous bounds. Further, we also consider the asymmetric version of ANNC, where the length of the query curves is k ≪ m , and obtain essentially the same storage and query bounds as above, except that m is replaced by k . Finally, we apply our method to a version of approximate range counting for curves and achieve similar bounds.
doi_str_mv	10.1007/s00453-022-01080-1
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2805488155</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2805488155</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-e264724b7ccab9b5c7be4debd8c006821851ef4b95f70a80cd81f035da504a443</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIlMIPcIrEtYG1Y8cut6qUh1TgAJwtx1kXV20S7BTB32MIEjcOq9FKM7szQ8gphXMKIC8iABdFDozlQEFBTvfIiPIirYLTfTICKlXOSyoPyVGMawDK5LQckftZ14X2w29Nj9kDmoCxT-hXr1UbMpdmvgvvGC-zJ7_tNjjJFs5567HpJ5lp6uwKewxb3_jYe3tMDpzZRDz5xTF5uV48z2_z5ePN3Xy2zC2T0OfISi4Zr6S1pppWwsoKeY1VrSxAqRhVgqLj1VQ4CUaBrRV1UIjaCOCG82JMzoa7yfvbLlnW63YXmvRSM5USK0WFSCw2sGxoYwzodBdS0PCpKejv2vRQm0616Z_aNE2iYhDFRG5WGP5O_6P6Agdib1w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2805488155</pqid></control><display><type>article</type><title>Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic</title><source>SpringerLink Journals - AutoHoldings</source><creator>Filtser, Arnold ; Filtser, Omrit ; Katz, Matthew J.</creator><creatorcontrib>Filtser, Arnold ; Filtser, Omrit ; Katz, Matthew J.</creatorcontrib><description>In the ( 1 + ε , r ) - approximate near-neighbor problem for curves (ANNC) under some similarity measure δ , the goal is to construct a data structure for a given set C of curves that supports approximate near-neighbor queries: Given a query curve Q , if there exists a curve C ∈ C such that δ ( Q , C ) ≤ r , then return a curve C ′ ∈ C with δ ( Q , C ′ ) ≤ ( 1 + ε ) r . There exists an efficient reduction from the ( 1 + ε ) - approximate nearest-neighbor problem to ANNC, where in the former problem the answer to a query is a curve C ∈ C with δ ( Q , C ) ≤ ( 1 + ε ) · δ ( Q , C ∗ ) , where C ∗ is the curve of C most similar to Q . Given a set C of n curves, each consisting of m points in d dimensions, we construct a data structure for ANNC that uses n · O ( 1 ε ) md storage space and has O ( md ) query time (for a query curve of length m ), where the similarity measure between two curves is their discrete Fréchet or dynamic time warping distance. Our method is simple to implement, deterministic, and results in an exponential improvement in both query time and storage space compared to all previous bounds. Further, we also consider the asymmetric version of ANNC, where the length of the query curves is k ≪ m , and obtain essentially the same storage and query bounds as above, except that m is replaced by k . Finally, we apply our method to a version of approximate range counting for curves and achieve similar bounds.</description><identifier>ISSN: 0178-4617</identifier><identifier>EISSN: 1432-0541</identifier><identifier>DOI: 10.1007/s00453-022-01080-1</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithm Analysis and Problem Complexity ; Algorithms ; Computer Science ; Computer Systems Organization and Communication Networks ; Data structures ; Data Structures and Information Theory ; Mathematics of Computing ; Queries ; Similarity ; Similarity measures ; Theory of Computation</subject><ispartof>Algorithmica, 2023-05, Vol.85 (5), p.1490-1519</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-e264724b7ccab9b5c7be4debd8c006821851ef4b95f70a80cd81f035da504a443</cites><orcidid>0000-0002-3978-1428</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00453-022-01080-1$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00453-022-01080-1$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Filtser, Arnold</creatorcontrib><creatorcontrib>Filtser, Omrit</creatorcontrib><creatorcontrib>Katz, Matthew J.</creatorcontrib><title>Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic</title><title>Algorithmica</title><addtitle>Algorithmica</addtitle><description>In the ( 1 + ε , r ) - approximate near-neighbor problem for curves (ANNC) under some similarity measure δ , the goal is to construct a data structure for a given set C of curves that supports approximate near-neighbor queries: Given a query curve Q , if there exists a curve C ∈ C such that δ ( Q , C ) ≤ r , then return a curve C ′ ∈ C with δ ( Q , C ′ ) ≤ ( 1 + ε ) r . There exists an efficient reduction from the ( 1 + ε ) - approximate nearest-neighbor problem to ANNC, where in the former problem the answer to a query is a curve C ∈ C with δ ( Q , C ) ≤ ( 1 + ε ) · δ ( Q , C ∗ ) , where C ∗ is the curve of C most similar to Q . Given a set C of n curves, each consisting of m points in d dimensions, we construct a data structure for ANNC that uses n · O ( 1 ε ) md storage space and has O ( md ) query time (for a query curve of length m ), where the similarity measure between two curves is their discrete Fréchet or dynamic time warping distance. Our method is simple to implement, deterministic, and results in an exponential improvement in both query time and storage space compared to all previous bounds. Further, we also consider the asymmetric version of ANNC, where the length of the query curves is k ≪ m , and obtain essentially the same storage and query bounds as above, except that m is replaced by k . Finally, we apply our method to a version of approximate range counting for curves and achieve similar bounds.</description><subject>Algorithm Analysis and Problem Complexity</subject><subject>Algorithms</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Data structures</subject><subject>Data Structures and Information Theory</subject><subject>Mathematics of Computing</subject><subject>Queries</subject><subject>Similarity</subject><subject>Similarity measures</subject><subject>Theory of Computation</subject><issn>0178-4617</issn><issn>1432-0541</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIlMIPcIrEtYG1Y8cut6qUh1TgAJwtx1kXV20S7BTB32MIEjcOq9FKM7szQ8gphXMKIC8iABdFDozlQEFBTvfIiPIirYLTfTICKlXOSyoPyVGMawDK5LQckftZ14X2w29Nj9kDmoCxT-hXr1UbMpdmvgvvGC-zJ7_tNjjJFs5567HpJ5lp6uwKewxb3_jYe3tMDpzZRDz5xTF5uV48z2_z5ePN3Xy2zC2T0OfISi4Zr6S1pppWwsoKeY1VrSxAqRhVgqLj1VQ4CUaBrRV1UIjaCOCG82JMzoa7yfvbLlnW63YXmvRSM5USK0WFSCw2sGxoYwzodBdS0PCpKejv2vRQm0616Z_aNE2iYhDFRG5WGP5O_6P6Agdib1w</recordid><startdate>20230501</startdate><enddate>20230501</enddate><creator>Filtser, Arnold</creator><creator>Filtser, Omrit</creator><creator>Katz, Matthew J.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-3978-1428</orcidid></search><sort><creationdate>20230501</creationdate><title>Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic</title><author>Filtser, Arnold ; Filtser, Omrit ; Katz, Matthew J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-e264724b7ccab9b5c7be4debd8c006821851ef4b95f70a80cd81f035da504a443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithm Analysis and Problem Complexity</topic><topic>Algorithms</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Data structures</topic><topic>Data Structures and Information Theory</topic><topic>Mathematics of Computing</topic><topic>Queries</topic><topic>Similarity</topic><topic>Similarity measures</topic><topic>Theory of Computation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Filtser, Arnold</creatorcontrib><creatorcontrib>Filtser, Omrit</creatorcontrib><creatorcontrib>Katz, Matthew J.</creatorcontrib><collection>CrossRef</collection><jtitle>Algorithmica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Filtser, Arnold</au><au>Filtser, Omrit</au><au>Katz, Matthew J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic</atitle><jtitle>Algorithmica</jtitle><stitle>Algorithmica</stitle><date>2023-05-01</date><risdate>2023</risdate><volume>85</volume><issue>5</issue><spage>1490</spage><epage>1519</epage><pages>1490-1519</pages><issn>0178-4617</issn><eissn>1432-0541</eissn><abstract>In the ( 1 + ε , r ) - approximate near-neighbor problem for curves (ANNC) under some similarity measure δ , the goal is to construct a data structure for a given set C of curves that supports approximate near-neighbor queries: Given a query curve Q , if there exists a curve C ∈ C such that δ ( Q , C ) ≤ r , then return a curve C ′ ∈ C with δ ( Q , C ′ ) ≤ ( 1 + ε ) r . There exists an efficient reduction from the ( 1 + ε ) - approximate nearest-neighbor problem to ANNC, where in the former problem the answer to a query is a curve C ∈ C with δ ( Q , C ) ≤ ( 1 + ε ) · δ ( Q , C ∗ ) , where C ∗ is the curve of C most similar to Q . Given a set C of n curves, each consisting of m points in d dimensions, we construct a data structure for ANNC that uses n · O ( 1 ε ) md storage space and has O ( md ) query time (for a query curve of length m ), where the similarity measure between two curves is their discrete Fréchet or dynamic time warping distance. Our method is simple to implement, deterministic, and results in an exponential improvement in both query time and storage space compared to all previous bounds. Further, we also consider the asymmetric version of ANNC, where the length of the query curves is k ≪ m , and obtain essentially the same storage and query bounds as above, except that m is replaced by k . Finally, we apply our method to a version of approximate range counting for curves and achieve similar bounds.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s00453-022-01080-1</doi><tpages>30</tpages><orcidid>https://orcid.org/0000-0002-3978-1428</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0178-4617
ispartof	Algorithmica, 2023-05, Vol.85 (5), p.1490-1519
issn	0178-4617 1432-0541
language	eng
recordid	cdi_proquest_journals_2805488155
source	SpringerLink Journals - AutoHoldings
subjects	Algorithm Analysis and Problem Complexity Algorithms Computer Science Computer Systems Organization and Communication Networks Data structures Data Structures and Information Theory Mathematics of Computing Queries Similarity Similarity measures Theory of Computation
title	Approximate Nearest Neighbor for Curves: Simple, Efficient, and Deterministic
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T23%3A04%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Approximate%20Nearest%20Neighbor%20for%20Curves:%20Simple,%20Efficient,%20and%20Deterministic&rft.jtitle=Algorithmica&rft.au=Filtser,%20Arnold&rft.date=2023-05-01&rft.volume=85&rft.issue=5&rft.spage=1490&rft.epage=1519&rft.pages=1490-1519&rft.issn=0178-4617&rft.eissn=1432-0541&rft_id=info:doi/10.1007/s00453-022-01080-1&rft_dat=%3Cproquest_cross%3E2805488155%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2805488155&rft_id=info:pmid/&rfr_iscdi=true