The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality

The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this fiel...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Pattern analysis and applications : PAA 2017-08, Vol.20 (3), p.797-808
Hauptverfasser:	Zhang, Xuan, Oommen, B. John, Granmo, Ole-Christoffer
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Bayesian analysis Computer Science Convergence Estimates Maximum likelihood estimates Optimization Pattern Recognition Theoretical Advances
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	808
container_issue	3
container_start_page	797
container_title	Pattern analysis and applications : PAA
container_volume	20
creator	Zhang, Xuan Oommen, B. John Granmo, Ole-Christoffer
description	The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.
doi_str_mv	10.1007/s10044-016-0535-1
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1919129174</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1919129174</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</originalsourceid><addsrcrecordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1919129174</pqid></control><display><type>article</type><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><source>SpringerLink Journals - AutoHoldings</source><creator>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creator><creatorcontrib>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creatorcontrib><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><identifier>ISSN: 1433-7541</identifier><identifier>EISSN: 1433-755X</identifier><identifier>DOI: 10.1007/s10044-016-0535-1</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Algorithms ; Bayesian analysis ; Computer Science ; Convergence ; Estimates ; Maximum likelihood estimates ; Optimization ; Pattern Recognition ; Theoretical Advances</subject><ispartof>Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808</ispartof><rights>Springer-Verlag London 2016</rights><rights>Copyright Springer Science & Business Media 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</citedby><cites>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10044-016-0535-1$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10044-016-0535-1$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><title>Pattern analysis and applications : PAA</title><addtitle>Pattern Anal Applic</addtitle><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><subject>Algorithms</subject><subject>Bayesian analysis</subject><subject>Computer Science</subject><subject>Convergence</subject><subject>Estimates</subject><subject>Maximum likelihood estimates</subject><subject>Optimization</subject><subject>Pattern Recognition</subject><subject>Theoretical Advances</subject><issn>1433-7541</issn><issn>1433-755X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</recordid><startdate>20170801</startdate><enddate>20170801</enddate><creator>Zhang, Xuan</creator><creator>Oommen, B. John</creator><creator>Granmo, Ole-Christoffer</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20170801</creationdate><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><author>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Algorithms</topic><topic>Bayesian analysis</topic><topic>Computer Science</topic><topic>Convergence</topic><topic>Estimates</topic><topic>Maximum likelihood estimates</topic><topic>Optimization</topic><topic>Pattern Recognition</topic><topic>Theoretical Advances</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern analysis and applications : PAA</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Xuan</au><au>Oommen, B. John</au><au>Granmo, Ole-Christoffer</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</atitle><jtitle>Pattern analysis and applications : PAA</jtitle><stitle>Pattern Anal Applic</stitle><date>2017-08-01</date><risdate>2017</risdate><volume>20</volume><issue>3</issue><spage>797</spage><epage>808</epage><pages>797-808</pages><issn>1433-7541</issn><eissn>1433-755X</eissn><abstract>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their ϵ -optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10044-016-0535-1</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1433-7541
ispartof	Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808
issn	1433-7541 1433-755X
language	eng
recordid	cdi_proquest_journals_1919129174
source	SpringerLink Journals - AutoHoldings
subjects	Algorithms Bayesian analysis Computer Science Convergence Estimates Maximum likelihood estimates Optimization Pattern Recognition Theoretical Advances
title	The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A17%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20design%20of%20absorbing%20Bayesian%20pursuit%20algorithms%20and%20the%20formal%20analyses%20of%20their%20%CE%B5-optimality&rft.jtitle=Pattern%20analysis%20and%20applications%20:%20PAA&rft.au=Zhang,%20Xuan&rft.date=2017-08-01&rft.volume=20&rft.issue=3&rft.spage=797&rft.epage=808&rft.pages=797-808&rft.issn=1433-7541&rft.eissn=1433-755X&rft_id=info:doi/10.1007/s10044-016-0535-1&rft_dat=%3Cproquest_cross%3E1919129174%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1919129174&rft_id=info:pmid/&rfr_iscdi=true