User-Oriented Approach to Data Quality Evaluation

The paper proposes a new data object-driven approach to data quality evaluation. It consists of three main components: (1) a data object, (2) data quality requirements, and (3) data quality evaluation process. As data quality is of relative nature, the data object and quality requirements are (a) us...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:J.UCS (Annual print and CD-ROM archive ed.) 2020-01, Vol.26 (1), p.107-126
Hauptverfasser: Nikiforova, Anastasija, Bicevskis, Janis, Bicevska, Zane, Oditis, Ivo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 126
container_issue 1
container_start_page 107
container_title J.UCS (Annual print and CD-ROM archive ed.)
container_volume 26
creator Nikiforova, Anastasija
Bicevskis, Janis
Bicevska, Zane
Oditis, Ivo
description The paper proposes a new data object-driven approach to data quality evaluation. It consists of three main components: (1) a data object, (2) data quality requirements, and (3) data quality evaluation process. As data quality is of relative nature, the data object and quality requirements are (a) use-case dependent and (b) defined by the user in accordance with his needs. All three components of the presented data quality model are described using graphical Domain Specific Languages (DSLs). In accordance with Model-Driven Architecture (MDA), the data quality model is built in two steps: (1) creating a platform-independent model (PIM), and (2) converting the created PIM into a platform-specific model (PSM). The PIM comprises informal specifications of data quality. The PSM describes the implementation of a data quality model, thus making it executable, enabling data object scanning and detecting data quality defects and anomalies. The proposed approach was applied to open data sets, analysing their quality. At least 3 advantages were highlighted: (1) a graphical data quality model allows the definition of data quality by non-IT and non-data quality experts as the presented diagrams are easy to read, create and modify, (2) the data quality model allows an analysis of "third-party" data without deeper knowledge on how the data were accrued and processed, (3) the quality of the data can be described at least at two levels of abstraction - informally using natural language or formally by including executable artefacts such as SQL statements.
doi_str_mv 10.3897/jucs.2020.007
format Article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_gale_infotracacademiconefile_A777671522</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A777671522</galeid><doaj_id>oai_doaj_org_article_ab91624be58c49da9d9226b4bd555c63</doaj_id><sourcerecordid>A777671522</sourcerecordid><originalsourceid>FETCH-LOGICAL-c381t-7be6f4d71cf7010335590dfbb0bd9a942954b31397a4ef4a3b46ac74912ffcef3</originalsourceid><addsrcrecordid>eNo9kFtLw0AQhYMoWKuPvucPJO59s4-lVi0UimDBt2X2Vrek2bJJhf57UysyDzMc5hwOX1E8YlTTRsmn3dH2NUEE1QjJq2KCFGsqoURz_X_zz9viru93CBEhVDMp8Kb3uVrn6LvBu3J2OOQE9qscUvkMA5TvR2jjcCoX39AeYYipuy9uArS9f_jb02LzsviYv1Wr9etyPltVljZ4qKTxIjAnsQ0SYUQp5wq5YAwyToFiRHFmKKZKAvOBATVMgJVMYRKC9YFOi-Ul1yXY6UOOe8gnnSDqXyHlrYY8RNt6DUZhQZjxvLFMOVBOESIMM45zbgUds-pL1hbG99iFNGSw4zi_jzZ1PsRRn0kphcSckNFQXQw2p77PPvwXwEifYeszbH2GrUfY9Ac1mHHs</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>User-Oriented Approach to Data Quality Evaluation</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Nikiforova, Anastasija ; Bicevskis, Janis ; Bicevska, Zane ; Oditis, Ivo</creator><creatorcontrib>Nikiforova, Anastasija ; Bicevskis, Janis ; Bicevska, Zane ; Oditis, Ivo</creatorcontrib><description>The paper proposes a new data object-driven approach to data quality evaluation. It consists of three main components: (1) a data object, (2) data quality requirements, and (3) data quality evaluation process. As data quality is of relative nature, the data object and quality requirements are (a) use-case dependent and (b) defined by the user in accordance with his needs. All three components of the presented data quality model are described using graphical Domain Specific Languages (DSLs). In accordance with Model-Driven Architecture (MDA), the data quality model is built in two steps: (1) creating a platform-independent model (PIM), and (2) converting the created PIM into a platform-specific model (PSM). The PIM comprises informal specifications of data quality. The PSM describes the implementation of a data quality model, thus making it executable, enabling data object scanning and detecting data quality defects and anomalies. The proposed approach was applied to open data sets, analysing their quality. At least 3 advantages were highlighted: (1) a graphical data quality model allows the definition of data quality by non-IT and non-data quality experts as the presented diagrams are easy to read, create and modify, (2) the data quality model allows an analysis of "third-party" data without deeper knowledge on how the data were accrued and processed, (3) the quality of the data can be described at least at two levels of abstraction - informally using natural language or formally by including executable artefacts such as SQL statements.</description><identifier>ISSN: 0948-695X</identifier><identifier>EISSN: 0948-6968</identifier><identifier>DOI: 10.3897/jucs.2020.007</identifier><language>eng</language><publisher>Pensoft Publishers</publisher><subject>Analysis ; data object ; data quality ; domain-specific languag ; Information management</subject><ispartof>J.UCS (Annual print and CD-ROM archive ed.), 2020-01, Vol.26 (1), p.107-126</ispartof><rights>COPYRIGHT 2020 Pensoft Publishers</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c381t-7be6f4d71cf7010335590dfbb0bd9a942954b31397a4ef4a3b46ac74912ffcef3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,860,27903,27904</link.rule.ids></links><search><creatorcontrib>Nikiforova, Anastasija</creatorcontrib><creatorcontrib>Bicevskis, Janis</creatorcontrib><creatorcontrib>Bicevska, Zane</creatorcontrib><creatorcontrib>Oditis, Ivo</creatorcontrib><title>User-Oriented Approach to Data Quality Evaluation</title><title>J.UCS (Annual print and CD-ROM archive ed.)</title><description>The paper proposes a new data object-driven approach to data quality evaluation. It consists of three main components: (1) a data object, (2) data quality requirements, and (3) data quality evaluation process. As data quality is of relative nature, the data object and quality requirements are (a) use-case dependent and (b) defined by the user in accordance with his needs. All three components of the presented data quality model are described using graphical Domain Specific Languages (DSLs). In accordance with Model-Driven Architecture (MDA), the data quality model is built in two steps: (1) creating a platform-independent model (PIM), and (2) converting the created PIM into a platform-specific model (PSM). The PIM comprises informal specifications of data quality. The PSM describes the implementation of a data quality model, thus making it executable, enabling data object scanning and detecting data quality defects and anomalies. The proposed approach was applied to open data sets, analysing their quality. At least 3 advantages were highlighted: (1) a graphical data quality model allows the definition of data quality by non-IT and non-data quality experts as the presented diagrams are easy to read, create and modify, (2) the data quality model allows an analysis of "third-party" data without deeper knowledge on how the data were accrued and processed, (3) the quality of the data can be described at least at two levels of abstraction - informally using natural language or formally by including executable artefacts such as SQL statements.</description><subject>Analysis</subject><subject>data object</subject><subject>data quality</subject><subject>domain-specific languag</subject><subject>Information management</subject><issn>0948-695X</issn><issn>0948-6968</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNo9kFtLw0AQhYMoWKuPvucPJO59s4-lVi0UimDBt2X2Vrek2bJJhf57UysyDzMc5hwOX1E8YlTTRsmn3dH2NUEE1QjJq2KCFGsqoURz_X_zz9viru93CBEhVDMp8Kb3uVrn6LvBu3J2OOQE9qscUvkMA5TvR2jjcCoX39AeYYipuy9uArS9f_jb02LzsviYv1Wr9etyPltVljZ4qKTxIjAnsQ0SYUQp5wq5YAwyToFiRHFmKKZKAvOBATVMgJVMYRKC9YFOi-Ul1yXY6UOOe8gnnSDqXyHlrYY8RNt6DUZhQZjxvLFMOVBOESIMM45zbgUds-pL1hbG99iFNGSw4zi_jzZ1PsRRn0kphcSckNFQXQw2p77PPvwXwEifYeszbH2GrUfY9Ac1mHHs</recordid><startdate>20200101</startdate><enddate>20200101</enddate><creator>Nikiforova, Anastasija</creator><creator>Bicevskis, Janis</creator><creator>Bicevska, Zane</creator><creator>Oditis, Ivo</creator><general>Pensoft Publishers</general><general>Graz University of Technology</general><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope></search><sort><creationdate>20200101</creationdate><title>User-Oriented Approach to Data Quality Evaluation</title><author>Nikiforova, Anastasija ; Bicevskis, Janis ; Bicevska, Zane ; Oditis, Ivo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c381t-7be6f4d71cf7010335590dfbb0bd9a942954b31397a4ef4a3b46ac74912ffcef3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Analysis</topic><topic>data object</topic><topic>data quality</topic><topic>domain-specific languag</topic><topic>Information management</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nikiforova, Anastasija</creatorcontrib><creatorcontrib>Bicevskis, Janis</creatorcontrib><creatorcontrib>Bicevska, Zane</creatorcontrib><creatorcontrib>Oditis, Ivo</creatorcontrib><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>J.UCS (Annual print and CD-ROM archive ed.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nikiforova, Anastasija</au><au>Bicevskis, Janis</au><au>Bicevska, Zane</au><au>Oditis, Ivo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>User-Oriented Approach to Data Quality Evaluation</atitle><jtitle>J.UCS (Annual print and CD-ROM archive ed.)</jtitle><date>2020-01-01</date><risdate>2020</risdate><volume>26</volume><issue>1</issue><spage>107</spage><epage>126</epage><pages>107-126</pages><issn>0948-695X</issn><eissn>0948-6968</eissn><abstract>The paper proposes a new data object-driven approach to data quality evaluation. It consists of three main components: (1) a data object, (2) data quality requirements, and (3) data quality evaluation process. As data quality is of relative nature, the data object and quality requirements are (a) use-case dependent and (b) defined by the user in accordance with his needs. All three components of the presented data quality model are described using graphical Domain Specific Languages (DSLs). In accordance with Model-Driven Architecture (MDA), the data quality model is built in two steps: (1) creating a platform-independent model (PIM), and (2) converting the created PIM into a platform-specific model (PSM). The PIM comprises informal specifications of data quality. The PSM describes the implementation of a data quality model, thus making it executable, enabling data object scanning and detecting data quality defects and anomalies. The proposed approach was applied to open data sets, analysing their quality. At least 3 advantages were highlighted: (1) a graphical data quality model allows the definition of data quality by non-IT and non-data quality experts as the presented diagrams are easy to read, create and modify, (2) the data quality model allows an analysis of "third-party" data without deeper knowledge on how the data were accrued and processed, (3) the quality of the data can be described at least at two levels of abstraction - informally using natural language or formally by including executable artefacts such as SQL statements.</abstract><pub>Pensoft Publishers</pub><doi>10.3897/jucs.2020.007</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0948-695X
ispartof J.UCS (Annual print and CD-ROM archive ed.), 2020-01, Vol.26 (1), p.107-126
issn 0948-695X
0948-6968
language eng
recordid cdi_gale_infotracacademiconefile_A777671522
source DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Analysis
data object
data quality
domain-specific languag
Information management
title User-Oriented Approach to Data Quality Evaluation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T22%3A47%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=User-Oriented%20Approach%20to%20Data%20Quality%20Evaluation&rft.jtitle=J.UCS%20(Annual%20print%20and%20CD-ROM%20archive%20ed.)&rft.au=Nikiforova,%20Anastasija&rft.date=2020-01-01&rft.volume=26&rft.issue=1&rft.spage=107&rft.epage=126&rft.pages=107-126&rft.issn=0948-695X&rft.eissn=0948-6968&rft_id=info:doi/10.3897/jucs.2020.007&rft_dat=%3Cgale_doaj_%3EA777671522%3C/gale_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A777671522&rft_doaj_id=oai_doaj_org_article_ab91624be58c49da9d9226b4bd555c63&rfr_iscdi=true