The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality

The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this fiel...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern analysis and applications : PAA 2017-08, Vol.20 (3), p.797-808
Hauptverfasser: Zhang, Xuan, Oommen, B. John, Granmo, Ole-Christoffer
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 808
container_issue 3
container_start_page 797
container_title Pattern analysis and applications : PAA
container_volume 20
creator Zhang, Xuan
Oommen, B. John
Granmo, Ole-Christoffer
description The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.
doi_str_mv 10.1007/s10044-016-0535-1
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1919129174</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1919129174</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</originalsourceid><addsrcrecordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1919129174</pqid></control><display><type>article</type><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><source>SpringerLink Journals - AutoHoldings</source><creator>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creator><creatorcontrib>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creatorcontrib><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><identifier>ISSN: 1433-7541</identifier><identifier>EISSN: 1433-755X</identifier><identifier>DOI: 10.1007/s10044-016-0535-1</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Algorithms ; Bayesian analysis ; Computer Science ; Convergence ; Estimates ; Maximum likelihood estimates ; Optimization ; Pattern Recognition ; Theoretical Advances</subject><ispartof>Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808</ispartof><rights>Springer-Verlag London 2016</rights><rights>Copyright Springer Science &amp; Business Media 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</citedby><cites>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10044-016-0535-1$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10044-016-0535-1$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><title>Pattern analysis and applications : PAA</title><addtitle>Pattern Anal Applic</addtitle><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><subject>Algorithms</subject><subject>Bayesian analysis</subject><subject>Computer Science</subject><subject>Convergence</subject><subject>Estimates</subject><subject>Maximum likelihood estimates</subject><subject>Optimization</subject><subject>Pattern Recognition</subject><subject>Theoretical Advances</subject><issn>1433-7541</issn><issn>1433-755X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</recordid><startdate>20170801</startdate><enddate>20170801</enddate><creator>Zhang, Xuan</creator><creator>Oommen, B. John</creator><creator>Granmo, Ole-Christoffer</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20170801</creationdate><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><author>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Algorithms</topic><topic>Bayesian analysis</topic><topic>Computer Science</topic><topic>Convergence</topic><topic>Estimates</topic><topic>Maximum likelihood estimates</topic><topic>Optimization</topic><topic>Pattern Recognition</topic><topic>Theoretical Advances</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern analysis and applications : PAA</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Xuan</au><au>Oommen, B. John</au><au>Granmo, Ole-Christoffer</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</atitle><jtitle>Pattern analysis and applications : PAA</jtitle><stitle>Pattern Anal Applic</stitle><date>2017-08-01</date><risdate>2017</risdate><volume>20</volume><issue>3</issue><spage>797</spage><epage>808</epage><pages>797-808</pages><issn>1433-7541</issn><eissn>1433-755X</eissn><abstract>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10044-016-0535-1</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1433-7541
ispartof Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808
issn 1433-7541
1433-755X
language eng
recordid cdi_proquest_journals_1919129174
source SpringerLink Journals - AutoHoldings
subjects Algorithms
Bayesian analysis
Computer Science
Convergence
Estimates
Maximum likelihood estimates
Optimization
Pattern Recognition
Theoretical Advances
title The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A17%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20design%20of%20absorbing%20Bayesian%20pursuit%20algorithms%20and%20the%20formal%20analyses%20of%20their%20%CE%B5-optimality&rft.jtitle=Pattern%20analysis%20and%20applications%20:%20PAA&rft.au=Zhang,%20Xuan&rft.date=2017-08-01&rft.volume=20&rft.issue=3&rft.spage=797&rft.epage=808&rft.pages=797-808&rft.issn=1433-7541&rft.eissn=1433-755X&rft_id=info:doi/10.1007/s10044-016-0535-1&rft_dat=%3Cproquest_cross%3E1919129174%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1919129174&rft_id=info:pmid/&rfr_iscdi=true