Formal Approach to Data Accuracy Evaluation

Usually, data quality is defined by multiple attributes that allow classifying the output data (such as completeness, freshness, and accuracy) or the methods exploiting these data (such as dependability, performance, and protection). Among the suggested quality attributes, we will discuss one of the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Informatica (Ljubljana) 2022-06, Vol.46 (2), p.243-258
Hauptverfasser: Belkacem, Athamena, Houhamdi, Zina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Usually, data quality is defined by multiple attributes that allow classifying the output data (such as completeness, freshness, and accuracy) or the methods exploiting these data (such as dependability, performance, and protection). Among the suggested quality attributes, we will discuss one of the principal categories: data accuracy. Scientific experiments, decision-making, and data retrieval are examples of situations that require a formal evaluation approach to data accuracy. The evaluation approach should be adaptable to distinct understandings of data accuracy and distinct end-user expectations. This study investigates data accuracy and defines dimensions and metrics that affect its evaluation. The investigation of data accuracy generates problems in the user expectation specification and database quality models. This work describes our proposed approach for data accuracy evaluation by defining an evaluation algorithm that considers the distribution of inaccuracies in database relations. The approach decomposes the query output in accordance with data accuracy, labels every part with its accuracy value, and addresses the possibility of enforcing data accuracy by using these values. This study mainly contributes by proposing an explicit evaluation of quality attributes of data accuracy, a formal evaluation approach to data accuracy, and suggesting some improvement actions to reinforce data accuracy.
ISSN:0350-5596
1854-3871
DOI:10.31449/inf.v46i2.3027