Computing a Data Dividend

Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Bax, Eric
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Bax, Eric
description Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples, and of sets of samples, to apportion value among different data contributors. We use Shapley values for individual samples and Owen values for combined samples, and show that these values can be computed in polynomial time in spite of their definitions having numbers of terms that are exponential in the number of samples.
doi_str_mv 10.48550/arxiv.1905.01805
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1905_01805</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1905_01805</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-b44549829d2de40037c640dd646126f44ece745aabf23eb42fa63b7c816e1a6e3</originalsourceid><addsrcrecordid>eNotzrsOgjAYhuEuDga9ACa5AbCHvwVGg8eExIWd_KXFNBEkiETvXkWX792-PIT4jEaQSEnX2D_dGLGUyoiyhMo58bNb0z0G114CDLY4fMaNztjWLMisxuvdLv_1SLHfFdkxzM-HU7bJQ1SxDDWAhDThqeHGAqUirhRQYxQoxlUNYCsbg0TUNRdWA69RCR1XCVOWobLCI6vf7WQru9412L_Kr7GcjOINXsA0rw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Computing a Data Dividend</title><source>arXiv.org</source><creator>Bax, Eric</creator><creatorcontrib>Bax, Eric</creatorcontrib><description>Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples, and of sets of samples, to apportion value among different data contributors. We use Shapley values for individual samples and Owen values for combined samples, and show that these values can be computed in polynomial time in spite of their definitions having numbers of terms that are exponential in the number of samples.</description><identifier>DOI: 10.48550/arxiv.1905.01805</identifier><language>eng</language><subject>Computer Science - Computer Science and Game Theory ; Computer Science - Computers and Society ; Quantitative Finance - Economics ; Statistics - Computation</subject><creationdate>2019-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1905.01805$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1905.01805$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bax, Eric</creatorcontrib><title>Computing a Data Dividend</title><description>Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples, and of sets of samples, to apportion value among different data contributors. We use Shapley values for individual samples and Owen values for combined samples, and show that these values can be computed in polynomial time in spite of their definitions having numbers of terms that are exponential in the number of samples.</description><subject>Computer Science - Computer Science and Game Theory</subject><subject>Computer Science - Computers and Society</subject><subject>Quantitative Finance - Economics</subject><subject>Statistics - Computation</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrsOgjAYhuEuDga9ACa5AbCHvwVGg8eExIWd_KXFNBEkiETvXkWX792-PIT4jEaQSEnX2D_dGLGUyoiyhMo58bNb0z0G114CDLY4fMaNztjWLMisxuvdLv_1SLHfFdkxzM-HU7bJQ1SxDDWAhDThqeHGAqUirhRQYxQoxlUNYCsbg0TUNRdWA69RCR1XCVOWobLCI6vf7WQru9412L_Kr7GcjOINXsA0rw</recordid><startdate>20190505</startdate><enddate>20190505</enddate><creator>Bax, Eric</creator><scope>ADEOX</scope><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20190505</creationdate><title>Computing a Data Dividend</title><author>Bax, Eric</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-b44549829d2de40037c640dd646126f44ece745aabf23eb42fa63b7c816e1a6e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Science and Game Theory</topic><topic>Computer Science - Computers and Society</topic><topic>Quantitative Finance - Economics</topic><topic>Statistics - Computation</topic><toplevel>online_resources</toplevel><creatorcontrib>Bax, Eric</creatorcontrib><collection>arXiv Economics</collection><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bax, Eric</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computing a Data Dividend</atitle><date>2019-05-05</date><risdate>2019</risdate><abstract>Quality data is a fundamental contributor to success in statistics and machine learning. If a statistical assessment or machine learning leads to decisions that create value, data contributors may want a share of that value. This paper presents methods to assess the value of individual data samples, and of sets of samples, to apportion value among different data contributors. We use Shapley values for individual samples and Owen values for combined samples, and show that these values can be computed in polynomial time in spite of their definitions having numbers of terms that are exponential in the number of samples.</abstract><doi>10.48550/arxiv.1905.01805</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1905.01805
ispartof
issn
language eng
recordid cdi_arxiv_primary_1905_01805
source arXiv.org
subjects Computer Science - Computer Science and Game Theory
Computer Science - Computers and Society
Quantitative Finance - Economics
Statistics - Computation
title Computing a Data Dividend
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-31T00%3A10%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computing%20a%20Data%20Dividend&rft.au=Bax,%20Eric&rft.date=2019-05-05&rft_id=info:doi/10.48550/arxiv.1905.01805&rft_dat=%3Carxiv_GOX%3E1905_01805%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true