The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality
The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this fiel...
Gespeichert in:
Veröffentlicht in: | Pattern analysis and applications : PAA 2017-08, Vol.20 (3), p.797-808 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 808 |
---|---|
container_issue | 3 |
container_start_page | 797 |
container_title | Pattern analysis and applications : PAA |
container_volume | 20 |
creator | Zhang, Xuan Oommen, B. John Granmo, Ole-Christoffer |
description | The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their
ϵ
-optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel. |
doi_str_mv | 10.1007/s10044-016-0535-1 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1919129174</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1919129174</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</originalsourceid><addsrcrecordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1919129174</pqid></control><display><type>article</type><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><source>SpringerLink Journals - AutoHoldings</source><creator>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creator><creatorcontrib>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</creatorcontrib><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their
ϵ
-optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><identifier>ISSN: 1433-7541</identifier><identifier>EISSN: 1433-755X</identifier><identifier>DOI: 10.1007/s10044-016-0535-1</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Algorithms ; Bayesian analysis ; Computer Science ; Convergence ; Estimates ; Maximum likelihood estimates ; Optimization ; Pattern Recognition ; Theoretical Advances</subject><ispartof>Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808</ispartof><rights>Springer-Verlag London 2016</rights><rights>Copyright Springer Science & Business Media 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</citedby><cites>FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10044-016-0535-1$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10044-016-0535-1$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><title>Pattern analysis and applications : PAA</title><addtitle>Pattern Anal Applic</addtitle><description>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their
ϵ
-optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</description><subject>Algorithms</subject><subject>Bayesian analysis</subject><subject>Computer Science</subject><subject>Convergence</subject><subject>Estimates</subject><subject>Maximum likelihood estimates</subject><subject>Optimization</subject><subject>Pattern Recognition</subject><subject>Theoretical Advances</subject><issn>1433-7541</issn><issn>1433-755X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp1UM1KxDAYDKLguvoA3gKeo_mapGmPuvgHgpcVvIU0Tbpduk1N2sM-mK_hM5mlIl4kkHzfZGYYBqFLoNdAqbyJ6eacUMgJFUwQOEIL4IwRKcT78e_M4RSdxbillDGWFQtUrTcW1za2TY-9w7qKPlRt3-A7vU-o7vEwhTi1I9Zd40M7bnYR677GY9I5H3a6S6vu9tHGg0GC24C_PokfxjZ9tuP-HJ043UV78fMu0dvD_Xr1RF5eH59Xty_EMFGOxNW5BF3YXBem0DWTxvEqxQYApp3gGa-drKuaai7A6TwzRjtjykwwKbmr2BJdzb5D8B-TjaPa-imkbFFBmU5WguSJBTPLBB9jsE4NIQUNewVUHapUc5UqVakOVSpImmzWxMTtGxv-OP8r-gYwOnjV</recordid><startdate>20170801</startdate><enddate>20170801</enddate><creator>Zhang, Xuan</creator><creator>Oommen, B. John</creator><creator>Granmo, Ole-Christoffer</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20170801</creationdate><title>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</title><author>Zhang, Xuan ; Oommen, B. John ; Granmo, Ole-Christoffer</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-fd671a8e6a8c8ad37cf4b1431113af5424df7dbd0a451fa62ccafcc9253774fb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Algorithms</topic><topic>Bayesian analysis</topic><topic>Computer Science</topic><topic>Convergence</topic><topic>Estimates</topic><topic>Maximum likelihood estimates</topic><topic>Optimization</topic><topic>Pattern Recognition</topic><topic>Theoretical Advances</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Xuan</creatorcontrib><creatorcontrib>Oommen, B. John</creatorcontrib><creatorcontrib>Granmo, Ole-Christoffer</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern analysis and applications : PAA</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Xuan</au><au>Oommen, B. John</au><au>Granmo, Ole-Christoffer</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality</atitle><jtitle>Pattern analysis and applications : PAA</jtitle><stitle>Pattern Anal Applic</stitle><date>2017-08-01</date><risdate>2017</risdate><volume>20</volume><issue>3</issue><spage>797</spage><epage>808</epage><pages>797-808</pages><issn>1433-7541</issn><eissn>1433-755X</eissn><abstract>The fundamental phenomenon that has been used to enhance the convergence speed of learning automata (LA) is that of incorporating the running maximum likelihood (ML) estimates of the action reward probabilities into the probability updating rules for selecting the actions. The frontiers of this field have been recently expanded by replacing the ML estimates with their corresponding Bayesian counterparts that incorporate the properties of the conjugate priors. These constitute the Bayesian pursuit algorithm (BPA), and the discretized Bayesian pursuit algorithm. Although these algorithms have been designed and efficiently implemented, and are, arguably, the fastest and most accurate LA reported in the literature, the proofs of their
ϵ
-optimal convergence has been unsolved. This is precisely the intent of this paper. In this paper, we present a single unifying analysis by which the proofs of both the continuous and discretized schemes are proven. We emphasize that unlike the ML-based pursuit schemes, the Bayesian schemes have to not only consider the estimates themselves but also the distributional forms of their conjugate posteriors and their higher order moments—all of which render the proofs to be particularly challenging. As far as we know, apart from the results themselves, the methodologies of this proof have been unreported in the literature—they are both pioneering and novel.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10044-016-0535-1</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1433-7541 |
ispartof | Pattern analysis and applications : PAA, 2017-08, Vol.20 (3), p.797-808 |
issn | 1433-7541 1433-755X |
language | eng |
recordid | cdi_proquest_journals_1919129174 |
source | SpringerLink Journals - AutoHoldings |
subjects | Algorithms Bayesian analysis Computer Science Convergence Estimates Maximum likelihood estimates Optimization Pattern Recognition Theoretical Advances |
title | The design of absorbing Bayesian pursuit algorithms and the formal analyses of their ε-optimality |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A17%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20design%20of%20absorbing%20Bayesian%20pursuit%20algorithms%20and%20the%20formal%20analyses%20of%20their%20%CE%B5-optimality&rft.jtitle=Pattern%20analysis%20and%20applications%20:%20PAA&rft.au=Zhang,%20Xuan&rft.date=2017-08-01&rft.volume=20&rft.issue=3&rft.spage=797&rft.epage=808&rft.pages=797-808&rft.issn=1433-7541&rft.eissn=1433-755X&rft_id=info:doi/10.1007/s10044-016-0535-1&rft_dat=%3Cproquest_cross%3E1919129174%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1919129174&rft_id=info:pmid/&rfr_iscdi=true |