Managing validation models and rules to apply to data sets

Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: AGRAWAL GEETIKA, SAILLET YANNICK, ROTH MARY ANN, LABRIE JACQUES JOSEPH
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator AGRAWAL GEETIKA
SAILLET YANNICK
ROTH MARY ANN
LABRIE JACQUES JOSEPH
description Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received. At least one model is generated, wherein each model asserts conditions for at least one column in a record of the first data set. The schema definition and the at least one model are stored in a data quality model. Selection is received of a second data set and the data quality model. A determination is made as to whether a structure of the second data set is compatible with the schema definition in the selected data quality model. Each model in the data quality model is applied to the records in the second data set to validate the records in the second data set in response to determining that the structure of the second data set and the schema definition are compatible.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US8401987B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US8401987B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US8401987B23</originalsourceid><addsrcrecordid>eNrjZLDyTcxLTM_MS1coS8zJTEksyczPU8jNT0nNKVZIzEtRKCrNSS1WKMlXSCwoyKkEMYBqEhWKU0uKeRhY0xJzilN5oTQ3g4Kba4izh25qQX58anFBYnJqXmpJfGiwhYmBoaWFuZORMRFKAAQrLpE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Managing validation models and rules to apply to data sets</title><source>esp@cenet</source><creator>AGRAWAL GEETIKA ; SAILLET YANNICK ; ROTH MARY ANN ; LABRIE JACQUES JOSEPH</creator><creatorcontrib>AGRAWAL GEETIKA ; SAILLET YANNICK ; ROTH MARY ANN ; LABRIE JACQUES JOSEPH</creatorcontrib><description>Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received. At least one model is generated, wherein each model asserts conditions for at least one column in a record of the first data set. The schema definition and the at least one model are stored in a data quality model. Selection is received of a second data set and the data quality model. A determination is made as to whether a structure of the second data set is compatible with the schema definition in the selected data quality model. Each model in the data quality model is applied to the records in the second data set to validate the records in the second data set in response to determining that the structure of the second data set and the schema definition are compatible.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2013</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20130319&amp;DB=EPODOC&amp;CC=US&amp;NR=8401987B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,309,781,886,25566,76549</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20130319&amp;DB=EPODOC&amp;CC=US&amp;NR=8401987B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>AGRAWAL GEETIKA</creatorcontrib><creatorcontrib>SAILLET YANNICK</creatorcontrib><creatorcontrib>ROTH MARY ANN</creatorcontrib><creatorcontrib>LABRIE JACQUES JOSEPH</creatorcontrib><title>Managing validation models and rules to apply to data sets</title><description>Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received. At least one model is generated, wherein each model asserts conditions for at least one column in a record of the first data set. The schema definition and the at least one model are stored in a data quality model. Selection is received of a second data set and the data quality model. A determination is made as to whether a structure of the second data set is compatible with the schema definition in the selected data quality model. Each model in the data quality model is applied to the records in the second data set to validate the records in the second data set in response to determining that the structure of the second data set and the schema definition are compatible.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2013</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLDyTcxLTM_MS1coS8zJTEksyczPU8jNT0nNKVZIzEtRKCrNSS1WKMlXSCwoyKkEMYBqEhWKU0uKeRhY0xJzilN5oTQ3g4Kba4izh25qQX58anFBYnJqXmpJfGiwhYmBoaWFuZORMRFKAAQrLpE</recordid><startdate>20130319</startdate><enddate>20130319</enddate><creator>AGRAWAL GEETIKA</creator><creator>SAILLET YANNICK</creator><creator>ROTH MARY ANN</creator><creator>LABRIE JACQUES JOSEPH</creator><scope>EVB</scope></search><sort><creationdate>20130319</creationdate><title>Managing validation models and rules to apply to data sets</title><author>AGRAWAL GEETIKA ; SAILLET YANNICK ; ROTH MARY ANN ; LABRIE JACQUES JOSEPH</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US8401987B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2013</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>AGRAWAL GEETIKA</creatorcontrib><creatorcontrib>SAILLET YANNICK</creatorcontrib><creatorcontrib>ROTH MARY ANN</creatorcontrib><creatorcontrib>LABRIE JACQUES JOSEPH</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>AGRAWAL GEETIKA</au><au>SAILLET YANNICK</au><au>ROTH MARY ANN</au><au>LABRIE JACQUES JOSEPH</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Managing validation models and rules to apply to data sets</title><date>2013-03-19</date><risdate>2013</risdate><abstract>Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received. At least one model is generated, wherein each model asserts conditions for at least one column in a record of the first data set. The schema definition and the at least one model are stored in a data quality model. Selection is received of a second data set and the data quality model. A determination is made as to whether a structure of the second data set is compatible with the schema definition in the selected data quality model. Each model in the data quality model is applied to the records in the second data set to validate the records in the second data set in response to determining that the structure of the second data set and the schema definition are compatible.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US8401987B2
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Managing validation models and rules to apply to data sets
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T22%3A59%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=AGRAWAL%20GEETIKA&rft.date=2013-03-19&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS8401987B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true