REDUCING THE DIMENSIONAL DEPENDENCE FOR RANK-BASED SIMILARITY SEARCH

In this paper a data structure for k-NN search, the Rank Cover Tree (RCT) is implemented. The pruning tests for RCT rely on the comparison of similarity values not on the other properties of the underlying space, such as the triangle inequality. Objects are selected according to their ranks with res...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International Journal of Advanced Engineering and Science 2016-05, Vol.5 (1), p.10-10
Hauptverfasser: Shelke, Jeevan Arjun, Humnabadkar, Viresh Shivshankar, Kale, Tushar Tinajirao, Kulkarni, Girish Gundopant, Kulkarni, P S
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper a data structure for k-NN search, the Rank Cover Tree (RCT) is implemented. The pruning tests for RCT rely on the comparison of similarity values not on the other properties of the underlying space, such as the triangle inequality. Objects are selected according to their ranks with respect to the query object, allowing much tighter control on the overall execution costs. Theoretical analysis shows that with very high probability, the RCT returns a correct query result in time that depends very competitively on a measure of the intrinsic dimensionality of the data set. The experimental results for the RCT show that non-metric pruning strategies for similarity search can be practical even when the representational dimension of the data is extremely high. They also show that the RCT is capable of meeting or exceeding the level of performance of state-of-the-art methods that make use of metric pruning or other selection tests involving numerical constraints on distance values.
ISSN:2304-7712
2304-7720