α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling
In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object q is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects Q = { q 0 , ⋯ , q M - 1 } and fi...
Gespeichert in:
Veröffentlicht in: | The Journal of supercomputing 2021-02, Vol.77 (2), p.2138-2153 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2153 |
---|---|
container_issue | 2 |
container_start_page | 2138 |
container_title | The Journal of supercomputing |
container_volume | 77 |
creator | Chung, Moonyoung Loh, Woong-Kee |
description | In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object
q
is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects
Q
=
{
q
0
,
⋯
,
q
M
-
1
}
and finds the object
p
∗
that minimizes
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
}
, where
g
(max or sum) is an aggregate function and
d
() is a distance function between two objects. Flexible aggregate nearest neighbor (FANN) search is an extension of the ANN search with the introduction of a flexibility factor
ϕ
(
0
<
ϕ
≤
1
)
and finds the object
p
∗
and the set of query objects
Q
ϕ
∗
that minimize
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
ϕ
∗
}
, where
Q
ϕ
∗
can be any subset of
Q
of size
ϕ
|
Q
|
. This study proposes an efficient
α
-probabilistic FANN search algorithm in road networks. The state-of-the-art FANN search algorithm in road networks, which is known as IER-
k
NN
, used the Euclidean distance based on the two-dimensional coordinates of objects when choosing an R-tree node that most potentially contains
p
∗
. However, since the Euclidean distance is significantly different from the actual shortest-path distance between objects, IER-
k
NN
looks up many unnecessary nodes, thereby incurring many calculations of ‘expensive’ shortest-path distances and eventually performance degradation. The proposed algorithm transforms road network objects into
k
-dimensional Euclidean space objects while preserving the distances between them as much as possible using
landmark multidimensional scaling (LMDS)
. Since the Euclidean distance after LMDS transformation is very close to the shortest-path distance, the lookup of unnecessary R-tree nodes and the calculation of expensive shortest-path distances are reduced significantly, thereby greatly improving the search performance. As a result of performance comparison experiments conducted for various real road networks and parameters, the proposed algorithm always achieved higher performance than IER-
k
NN
; the performance (execution time) of the proposed algorithm was improved by up to 10.87 times without loss of accuracy. |
doi_str_mv | 10.1007/s11227-020-03521-6 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2480787400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2480787400</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-39bfec62dbeb176b24740db838d26055d28e0aa6ad1ca98a57272e98c093fad3</originalsourceid><addsrcrecordid>eNp9UEtOwzAUtBBIlM8FWFliHXi2k9hZooqfVAkW3Vt27KRu3bjYqYBjcRHOhCFI7FiNNG9mNG8QuiBwRQD4dSKEUl4AhQJYRUlRH6AZqTgroBTlIZpBk0-iKukxOklpDQAl42yGxs-P4jkGrbTzLo2uxZ23b057i1XfR9ur0eLBqmjTmNH1Kx0iTploV9gNOAZlMj--hrhJeJ_c0GOvBrNVcYO3ez8647Z2SC4MyuPUKp8VZ-ioUz7Z8188Rcu72-X8oVg83T_ObxZFy2o2FqzRnW1rarTVhNealrwEowUThtZQVYYKC0rVypBWNUJVnHJqG9FCwzpl2Cm6nGJ3Mbzs8wNyHfYx90iSlgK4yHGQVXRStTGkFG0nd9Hl-u-SgPweV07jyjyu_BlX1tnEJlPK4qG38S_6H9cXE92AMw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2480787400</pqid></control><display><type>article</type><title>α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling</title><source>SpringerLink (Online service)</source><creator>Chung, Moonyoung ; Loh, Woong-Kee</creator><creatorcontrib>Chung, Moonyoung ; Loh, Woong-Kee</creatorcontrib><description>In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object
q
is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects
Q
=
{
q
0
,
⋯
,
q
M
-
1
}
and finds the object
p
∗
that minimizes
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
}
, where
g
(max or sum) is an aggregate function and
d
() is a distance function between two objects. Flexible aggregate nearest neighbor (FANN) search is an extension of the ANN search with the introduction of a flexibility factor
ϕ
(
0
<
ϕ
≤
1
)
and finds the object
p
∗
and the set of query objects
Q
ϕ
∗
that minimize
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
ϕ
∗
}
, where
Q
ϕ
∗
can be any subset of
Q
of size
ϕ
|
Q
|
. This study proposes an efficient
α
-probabilistic FANN search algorithm in road networks. The state-of-the-art FANN search algorithm in road networks, which is known as IER-
k
NN
, used the Euclidean distance based on the two-dimensional coordinates of objects when choosing an R-tree node that most potentially contains
p
∗
. However, since the Euclidean distance is significantly different from the actual shortest-path distance between objects, IER-
k
NN
looks up many unnecessary nodes, thereby incurring many calculations of ‘expensive’ shortest-path distances and eventually performance degradation. The proposed algorithm transforms road network objects into
k
-dimensional Euclidean space objects while preserving the distances between them as much as possible using
landmark multidimensional scaling (LMDS)
. Since the Euclidean distance after LMDS transformation is very close to the shortest-path distance, the lookup of unnecessary R-tree nodes and the calculation of expensive shortest-path distances are reduced significantly, thereby greatly improving the search performance. As a result of performance comparison experiments conducted for various real road networks and parameters, the proposed algorithm always achieved higher performance than IER-
k
NN
; the performance (execution time) of the proposed algorithm was improved by up to 10.87 times without loss of accuracy.</description><identifier>ISSN: 0920-8542</identifier><identifier>EISSN: 1573-0484</identifier><identifier>DOI: 10.1007/s11227-020-03521-6</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Advances in Big Data and Deep Learning ; Algorithms ; Compilers ; Computer Science ; Euclidean geometry ; Euclidean space ; Interpreters ; Landmarks ; Mathematical analysis ; Networks ; Nodes ; Performance degradation ; Processor Architectures ; Programming Languages ; Queries ; Roads ; Search algorithms ; Shortest-path problems</subject><ispartof>The Journal of supercomputing, 2021-02, Vol.77 (2), p.2138-2153</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-39bfec62dbeb176b24740db838d26055d28e0aa6ad1ca98a57272e98c093fad3</citedby><cites>FETCH-LOGICAL-c363t-39bfec62dbeb176b24740db838d26055d28e0aa6ad1ca98a57272e98c093fad3</cites><orcidid>0000-0002-3161-6479</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11227-020-03521-6$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11227-020-03521-6$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Chung, Moonyoung</creatorcontrib><creatorcontrib>Loh, Woong-Kee</creatorcontrib><title>α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling</title><title>The Journal of supercomputing</title><addtitle>J Supercomput</addtitle><description>In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object
q
is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects
Q
=
{
q
0
,
⋯
,
q
M
-
1
}
and finds the object
p
∗
that minimizes
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
}
, where
g
(max or sum) is an aggregate function and
d
() is a distance function between two objects. Flexible aggregate nearest neighbor (FANN) search is an extension of the ANN search with the introduction of a flexibility factor
ϕ
(
0
<
ϕ
≤
1
)
and finds the object
p
∗
and the set of query objects
Q
ϕ
∗
that minimize
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
ϕ
∗
}
, where
Q
ϕ
∗
can be any subset of
Q
of size
ϕ
|
Q
|
. This study proposes an efficient
α
-probabilistic FANN search algorithm in road networks. The state-of-the-art FANN search algorithm in road networks, which is known as IER-
k
NN
, used the Euclidean distance based on the two-dimensional coordinates of objects when choosing an R-tree node that most potentially contains
p
∗
. However, since the Euclidean distance is significantly different from the actual shortest-path distance between objects, IER-
k
NN
looks up many unnecessary nodes, thereby incurring many calculations of ‘expensive’ shortest-path distances and eventually performance degradation. The proposed algorithm transforms road network objects into
k
-dimensional Euclidean space objects while preserving the distances between them as much as possible using
landmark multidimensional scaling (LMDS)
. Since the Euclidean distance after LMDS transformation is very close to the shortest-path distance, the lookup of unnecessary R-tree nodes and the calculation of expensive shortest-path distances are reduced significantly, thereby greatly improving the search performance. As a result of performance comparison experiments conducted for various real road networks and parameters, the proposed algorithm always achieved higher performance than IER-
k
NN
; the performance (execution time) of the proposed algorithm was improved by up to 10.87 times without loss of accuracy.</description><subject>Advances in Big Data and Deep Learning</subject><subject>Algorithms</subject><subject>Compilers</subject><subject>Computer Science</subject><subject>Euclidean geometry</subject><subject>Euclidean space</subject><subject>Interpreters</subject><subject>Landmarks</subject><subject>Mathematical analysis</subject><subject>Networks</subject><subject>Nodes</subject><subject>Performance degradation</subject><subject>Processor Architectures</subject><subject>Programming Languages</subject><subject>Queries</subject><subject>Roads</subject><subject>Search algorithms</subject><subject>Shortest-path problems</subject><issn>0920-8542</issn><issn>1573-0484</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9UEtOwzAUtBBIlM8FWFliHXi2k9hZooqfVAkW3Vt27KRu3bjYqYBjcRHOhCFI7FiNNG9mNG8QuiBwRQD4dSKEUl4AhQJYRUlRH6AZqTgroBTlIZpBk0-iKukxOklpDQAl42yGxs-P4jkGrbTzLo2uxZ23b057i1XfR9ur0eLBqmjTmNH1Kx0iTploV9gNOAZlMj--hrhJeJ_c0GOvBrNVcYO3ez8647Z2SC4MyuPUKp8VZ-ioUz7Z8188Rcu72-X8oVg83T_ObxZFy2o2FqzRnW1rarTVhNealrwEowUThtZQVYYKC0rVypBWNUJVnHJqG9FCwzpl2Cm6nGJ3Mbzs8wNyHfYx90iSlgK4yHGQVXRStTGkFG0nd9Hl-u-SgPweV07jyjyu_BlX1tnEJlPK4qG38S_6H9cXE92AMw</recordid><startdate>20210201</startdate><enddate>20210201</enddate><creator>Chung, Moonyoung</creator><creator>Loh, Woong-Kee</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-3161-6479</orcidid></search><sort><creationdate>20210201</creationdate><title>α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling</title><author>Chung, Moonyoung ; Loh, Woong-Kee</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-39bfec62dbeb176b24740db838d26055d28e0aa6ad1ca98a57272e98c093fad3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Advances in Big Data and Deep Learning</topic><topic>Algorithms</topic><topic>Compilers</topic><topic>Computer Science</topic><topic>Euclidean geometry</topic><topic>Euclidean space</topic><topic>Interpreters</topic><topic>Landmarks</topic><topic>Mathematical analysis</topic><topic>Networks</topic><topic>Nodes</topic><topic>Performance degradation</topic><topic>Processor Architectures</topic><topic>Programming Languages</topic><topic>Queries</topic><topic>Roads</topic><topic>Search algorithms</topic><topic>Shortest-path problems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chung, Moonyoung</creatorcontrib><creatorcontrib>Loh, Woong-Kee</creatorcontrib><collection>SpringerOpen</collection><collection>CrossRef</collection><jtitle>The Journal of supercomputing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chung, Moonyoung</au><au>Loh, Woong-Kee</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling</atitle><jtitle>The Journal of supercomputing</jtitle><stitle>J Supercomput</stitle><date>2021-02-01</date><risdate>2021</risdate><volume>77</volume><issue>2</issue><spage>2138</spage><epage>2153</epage><pages>2138-2153</pages><issn>0920-8542</issn><eissn>1573-0484</eissn><abstract>In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object
q
is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects
Q
=
{
q
0
,
⋯
,
q
M
-
1
}
and finds the object
p
∗
that minimizes
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
}
, where
g
(max or sum) is an aggregate function and
d
() is a distance function between two objects. Flexible aggregate nearest neighbor (FANN) search is an extension of the ANN search with the introduction of a flexibility factor
ϕ
(
0
<
ϕ
≤
1
)
and finds the object
p
∗
and the set of query objects
Q
ϕ
∗
that minimize
g
{
d
(
p
∗
,
q
i
)
,
q
i
∈
Q
ϕ
∗
}
, where
Q
ϕ
∗
can be any subset of
Q
of size
ϕ
|
Q
|
. This study proposes an efficient
α
-probabilistic FANN search algorithm in road networks. The state-of-the-art FANN search algorithm in road networks, which is known as IER-
k
NN
, used the Euclidean distance based on the two-dimensional coordinates of objects when choosing an R-tree node that most potentially contains
p
∗
. However, since the Euclidean distance is significantly different from the actual shortest-path distance between objects, IER-
k
NN
looks up many unnecessary nodes, thereby incurring many calculations of ‘expensive’ shortest-path distances and eventually performance degradation. The proposed algorithm transforms road network objects into
k
-dimensional Euclidean space objects while preserving the distances between them as much as possible using
landmark multidimensional scaling (LMDS)
. Since the Euclidean distance after LMDS transformation is very close to the shortest-path distance, the lookup of unnecessary R-tree nodes and the calculation of expensive shortest-path distances are reduced significantly, thereby greatly improving the search performance. As a result of performance comparison experiments conducted for various real road networks and parameters, the proposed algorithm always achieved higher performance than IER-
k
NN
; the performance (execution time) of the proposed algorithm was improved by up to 10.87 times without loss of accuracy.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11227-020-03521-6</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0002-3161-6479</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0920-8542 |
ispartof | The Journal of supercomputing, 2021-02, Vol.77 (2), p.2138-2153 |
issn | 0920-8542 1573-0484 |
language | eng |
recordid | cdi_proquest_journals_2480787400 |
source | SpringerLink (Online service) |
subjects | Advances in Big Data and Deep Learning Algorithms Compilers Computer Science Euclidean geometry Euclidean space Interpreters Landmarks Mathematical analysis Networks Nodes Performance degradation Processor Architectures Programming Languages Queries Roads Search algorithms Shortest-path problems |
title | α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T06%3A35%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=%CE%B1-Probabilistic%20flexible%20aggregate%20nearest%20neighbor%20search%20in%20road%20networks%20using%20landmark%20multidimensional%20scaling&rft.jtitle=The%20Journal%20of%20supercomputing&rft.au=Chung,%20Moonyoung&rft.date=2021-02-01&rft.volume=77&rft.issue=2&rft.spage=2138&rft.epage=2153&rft.pages=2138-2153&rft.issn=0920-8542&rft.eissn=1573-0484&rft_id=info:doi/10.1007/s11227-020-03521-6&rft_dat=%3Cproquest_cross%3E2480787400%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2480787400&rft_id=info:pmid/&rfr_iscdi=true |