α-Probabilistic flexible aggregate nearest neighbor search in road networks using landmark multidimensional scaling

In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object q is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects Q = { q 0 , ⋯ , q M - 1 } and fi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of supercomputing 2021-02, Vol.77 (2), p.2138-2153
Hauptverfasser: Chung, Moonyoung, Loh, Woong-Kee
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In spatial database and road network applications, the search for the nearest neighbor (NN) from a given query object q is the most fundamental and important problem. Aggregate nearest neighbor (ANN) search is an extension of the NN search with a set of query objects Q = { q 0 , ⋯ , q M - 1 } and finds the object p ∗ that minimizes g { d ( p ∗ , q i ) , q i ∈ Q } , where g (max or sum) is an aggregate function and d () is a distance function between two objects. Flexible aggregate nearest neighbor (FANN) search is an extension of the ANN search with the introduction of a flexibility factor ϕ ( 0 < ϕ ≤ 1 ) and finds the object p ∗ and the set of query objects Q ϕ ∗ that minimize g { d ( p ∗ , q i ) , q i ∈ Q ϕ ∗ } , where Q ϕ ∗ can be any subset of Q of size ϕ | Q | . This study proposes an efficient α -probabilistic FANN search algorithm in road networks. The state-of-the-art FANN search algorithm in road networks, which is known as IER- k NN , used the Euclidean distance based on the two-dimensional coordinates of objects when choosing an R-tree node that most potentially contains p ∗ . However, since the Euclidean distance is significantly different from the actual shortest-path distance between objects, IER- k NN looks up many unnecessary nodes, thereby incurring many calculations of ‘expensive’ shortest-path distances and eventually performance degradation. The proposed algorithm transforms road network objects into k -dimensional Euclidean space objects while preserving the distances between them as much as possible using landmark multidimensional scaling (LMDS) . Since the Euclidean distance after LMDS transformation is very close to the shortest-path distance, the lookup of unnecessary R-tree nodes and the calculation of expensive shortest-path distances are reduced significantly, thereby greatly improving the search performance. As a result of performance comparison experiments conducted for various real road networks and parameters, the proposed algorithm always achieved higher performance than IER- k NN ; the performance (execution time) of the proposed algorithm was improved by up to 10.87 times without loss of accuracy.
ISSN:0920-8542
1573-0484
DOI:10.1007/s11227-020-03521-6