Weighted top-k dominating queries on highly incomplete data

Top-k dominating (TKD) query retrieves the top k items that dominate other objects in the dataset. This is a key decision-making tool for any organization since it allows data analysts to discover dominant objects that can be used for recommendation. Incomplete data is a regular occurrence in real-w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information systems (Oxford) 2022-07, Vol.107, p.102008, Article 102008
Hauptverfasser: Fattah, H.M. Abdul, Hasan, K.M. Azharul, Tsuji, Tatsuo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Top-k dominating (TKD) query retrieves the top k items that dominate other objects in the dataset. This is a key decision-making tool for any organization since it allows data analysts to discover dominant objects that can be used for recommendation. Incomplete data is a regular occurrence in real-world applications which occurs in many ways such as system failure, privacy protection, data loss, unavailability of data, and other issues. In this paper, we introduce a new approach for answering the top-k dominating queries over incomplete data. In many scenarios, the dominating object is one which has very high average rating but the number of rating is very low. We apply a weighted factor to calculate the score for dominating object. Hence realistic recommendation is possible. The idea of data bucketing is used to prune the non-candidate objects. The buckets are built using the B+ tree that makes the processing faster for high retrieval performance. In terms of top-k dominating query performance with incomplete data, the proposed model outperforms previous methods.
ISSN:0306-4379
1873-6076
DOI:10.1016/j.is.2022.102008