A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation

The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different techn...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Acharya, Jayadev, Das, Hirakendu, Orlitsky, Alon, Suresh, Ananda Theertha
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Acharya, Jayadev
Das, Hirakendu
Orlitsky, Alon
Suresh, Ananda Theertha
description The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different technique and often intricate analysis tools. We prove that for all these properties, a single, simple, plug-in estimator---profile maximum likelihood (PML)---performs as well as the best specialized techniques. This raises the possibility that PML may optimally estimate many other symmetric properties.
doi_str_mv 10.48550/arxiv.1611.02960
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1611_02960</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1611_02960</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-fd28b94da1b5aa367c47a9bc7b2e60d26e80aca8322152d35844b20a54e3e7aa3</originalsourceid><addsrcrecordid>eNotj0tqwzAYhLXpoqQ9QFfVBezqLXlp0vQBLiklWZtflkxE7MjITklu3zjNamCGmeFD6ImSXBgpyQukU_jNqaI0J6xQ5B79lHh7CG3wDn_BKfTHHldh77uwi9HhchhShGaH25jwephCDx1-DeOUgj1OIR7wd4qDT9MZr8Y5nb0HdNdCN_rHmy7Q5m21WX5k1fr9c1lWGShNstYxYwvhgFoJwJVuhIbCNtoyr4hjyhsCDRjOGJXMcWmEsIyAFJ57fWks0PP_7BWqHtLlPp3rGa6-wvE_MUJK5Q</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation</title><source>arXiv.org</source><creator>Acharya, Jayadev ; Das, Hirakendu ; Orlitsky, Alon ; Suresh, Ananda Theertha</creator><creatorcontrib>Acharya, Jayadev ; Das, Hirakendu ; Orlitsky, Alon ; Suresh, Ananda Theertha</creatorcontrib><description>The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different technique and often intricate analysis tools. We prove that for all these properties, a single, simple, plug-in estimator---profile maximum likelihood (PML)---performs as well as the best specialized techniques. This raises the possibility that PML may optimally estimate many other symmetric properties.</description><identifier>DOI: 10.48550/arxiv.1611.02960</identifier><language>eng</language><subject>Computer Science - Data Structures and Algorithms ; Computer Science - Information Theory ; Computer Science - Learning ; Mathematics - Information Theory</subject><creationdate>2016-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1611.02960$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1611.02960$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Acharya, Jayadev</creatorcontrib><creatorcontrib>Das, Hirakendu</creatorcontrib><creatorcontrib>Orlitsky, Alon</creatorcontrib><creatorcontrib>Suresh, Ananda Theertha</creatorcontrib><title>A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation</title><description>The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different technique and often intricate analysis tools. We prove that for all these properties, a single, simple, plug-in estimator---profile maximum likelihood (PML)---performs as well as the best specialized techniques. This raises the possibility that PML may optimally estimate many other symmetric properties.</description><subject>Computer Science - Data Structures and Algorithms</subject><subject>Computer Science - Information Theory</subject><subject>Computer Science - Learning</subject><subject>Mathematics - Information Theory</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj0tqwzAYhLXpoqQ9QFfVBezqLXlp0vQBLiklWZtflkxE7MjITklu3zjNamCGmeFD6ImSXBgpyQukU_jNqaI0J6xQ5B79lHh7CG3wDn_BKfTHHldh77uwi9HhchhShGaH25jwephCDx1-DeOUgj1OIR7wd4qDT9MZr8Y5nb0HdNdCN_rHmy7Q5m21WX5k1fr9c1lWGShNstYxYwvhgFoJwJVuhIbCNtoyr4hjyhsCDRjOGJXMcWmEsIyAFJ57fWks0PP_7BWqHtLlPp3rGa6-wvE_MUJK5Q</recordid><startdate>20161109</startdate><enddate>20161109</enddate><creator>Acharya, Jayadev</creator><creator>Das, Hirakendu</creator><creator>Orlitsky, Alon</creator><creator>Suresh, Ananda Theertha</creator><scope>AKY</scope><scope>AKZ</scope><scope>GOX</scope></search><sort><creationdate>20161109</creationdate><title>A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation</title><author>Acharya, Jayadev ; Das, Hirakendu ; Orlitsky, Alon ; Suresh, Ananda Theertha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-fd28b94da1b5aa367c47a9bc7b2e60d26e80aca8322152d35844b20a54e3e7aa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Computer Science - Data Structures and Algorithms</topic><topic>Computer Science - Information Theory</topic><topic>Computer Science - Learning</topic><topic>Mathematics - Information Theory</topic><toplevel>online_resources</toplevel><creatorcontrib>Acharya, Jayadev</creatorcontrib><creatorcontrib>Das, Hirakendu</creatorcontrib><creatorcontrib>Orlitsky, Alon</creatorcontrib><creatorcontrib>Suresh, Ananda Theertha</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Mathematics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Acharya, Jayadev</au><au>Das, Hirakendu</au><au>Orlitsky, Alon</au><au>Suresh, Ananda Theertha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation</atitle><date>2016-11-09</date><risdate>2016</risdate><abstract>The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different technique and often intricate analysis tools. We prove that for all these properties, a single, simple, plug-in estimator---profile maximum likelihood (PML)---performs as well as the best specialized techniques. This raises the possibility that PML may optimally estimate many other symmetric properties.</abstract><doi>10.48550/arxiv.1611.02960</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1611.02960
ispartof
issn
language eng
recordid cdi_arxiv_primary_1611_02960
source arXiv.org
subjects Computer Science - Data Structures and Algorithms
Computer Science - Information Theory
Computer Science - Learning
Mathematics - Information Theory
title A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T11%3A43%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Unified%20Maximum%20Likelihood%20Approach%20for%20Optimal%20Distribution%20Property%20Estimation&rft.au=Acharya,%20Jayadev&rft.date=2016-11-09&rft_id=info:doi/10.48550/arxiv.1611.02960&rft_dat=%3Carxiv_GOX%3E1611_02960%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true