Weighted top-k dominating queries on highly incomplete data

Top-k dominating (TKD) query retrieves the top k items that dominate other objects in the dataset. This is a key decision-making tool for any organization since it allows data analysts to discover dominant objects that can be used for recommendation. Incomplete data is a regular occurrence in real-w...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Information systems (Oxford) 2022-07, Vol.107, p.102008, Article 102008
Hauptverfasser:	Fattah, H.M. Abdul, Hasan, K.M. Azharul, Tsuji, Tatsuo
Format:	Artikel
Sprache:	eng
Schlagworte:	B+ tree Buckets Data loss Decision analysis Decision making Dominance relationship Incomplete data Information systems Object recognition Queries Query processing Skyline Top-k dominating query
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Top-k dominating (TKD) query retrieves the top k items that dominate other objects in the dataset. This is a key decision-making tool for any organization since it allows data analysts to discover dominant objects that can be used for recommendation. Incomplete data is a regular occurrence in real-world applications which occurs in many ways such as system failure, privacy protection, data loss, unavailability of data, and other issues. In this paper, we introduce a new approach for answering the top-k dominating queries over incomplete data. In many scenarios, the dominating object is one which has very high average rating but the number of rating is very low. We apply a weighted factor to calculate the score for dominating object. Hence realistic recommendation is possible. The idea of data bucketing is used to prune the non-candidate objects. The buckets are built using the B+ tree that makes the processing faster for high retrieval performance. In terms of top-k dominating query performance with incomplete data, the proposed model outperforms previous methods.
ISSN:	0306-4379 1873-6076
DOI:	10.1016/j.is.2022.102008