Logistic Regression, AdaBoost and Bregman Distances

We give a unified account of boosting and logistic regression in which each learning problem is cast in terms of optimization of Bregman distances. The striking similarity of the two problems in this framework allows us to design and analyze algorithms for both simultaneously, and to easily adapt al...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Machine learning 2002-07, Vol.48 (1-3), p.253-285
Hauptverfasser:	Collins, Michael, Schapire, Robert E, Singer, Yoram
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Studies
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	285
container_issue	1-3
container_start_page	253
container_title	Machine learning
container_volume	48
creator	Collins, Michael Schapire, Robert E Singer, Yoram
description	We give a unified account of boosting and logistic regression in which each learning problem is cast in terms of optimization of Bregman distances. The striking similarity of the two problems in this framework allows us to design and analyze algorithms for both simultaneously, and to easily adapt algorithms designed for one problem to the other. For both problems, we give new algorithms and explain their potential advantages over existing methods. These algorithms are iterative and can be divided into two types based on whether the parameters are updated sequentially (one at a time) or in parallel (all at once). We also describe a parameterized family of algorithms that includes both a sequential- and a parallel-update algorithm as special cases, thus showing how the sequential and parallel approaches can themselves be unified. For all of the algorithms, we give convergence proofs using a general formalization of the auxiliary-function proof technique. As one of our sequential-update algorithms is equivalent to AdaBoost, this provides the first general proof of convergence for AdaBoost. We show that all of our algorithms generalize easily to the multiclass case, and we contrast the new algorithms with the iterative scaling algorithm. We conclude with a few experimental results with synthetic data that highlight the behavior of the old and newly proposed algorithms in different settings.[PUBLICATION ABSTRACT]
doi_str_mv	10.1023/A:1013912006537
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_27185377</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2165704591</sourcerecordid><originalsourceid>FETCH-LOGICAL-p211t-e5683d97d765f6eabe37dbedf9070a94fff42b6da648b9eab5179ba917c758903</originalsourceid><addsrcrecordid>eNpdj0tLxDAUhYMoWEfXbosLV1bvTZqXu874hIIgui5pk5QOnWRsOv_fiq5cncX5-DiHkEuEWwTK7qp7BGQaKYDgTB6RDLlkBXDBj0kGSvFCIOWn5CylLQBQoURGWB37Ic1Dl7-7fnIpDTHc5JU16xjTnJtg8_Xk-p0J-cPCmdC5dE5OvBmTu_jLFfl8evzYvBT12_PrpqqLPUWcC8eFYlZLKwX3wpnWMWlbZ70GCUaX3vuStsIaUapWLz1HqVujUXaSKw1sRa5_vfspfh1cmpvdkDo3jia4eEgNlaiWp3IBr_6B23iYwrKt-TFRDqjZN1dkU3Y</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>758925019</pqid></control><display><type>article</type><title>Logistic Regression, AdaBoost and Bregman Distances</title><source>SpringerLink Journals</source><creator>Collins, Michael ; Schapire, Robert E ; Singer, Yoram</creator><creatorcontrib>Collins, Michael ; Schapire, Robert E ; Singer, Yoram</creatorcontrib><description>We give a unified account of boosting and logistic regression in which each learning problem is cast in terms of optimization of Bregman distances. The striking similarity of the two problems in this framework allows us to design and analyze algorithms for both simultaneously, and to easily adapt algorithms designed for one problem to the other. For both problems, we give new algorithms and explain their potential advantages over existing methods. These algorithms are iterative and can be divided into two types based on whether the parameters are updated sequentially (one at a time) or in parallel (all at once). We also describe a parameterized family of algorithms that includes both a sequential- and a parallel-update algorithm as special cases, thus showing how the sequential and parallel approaches can themselves be unified. For all of the algorithms, we give convergence proofs using a general formalization of the auxiliary-function proof technique. As one of our sequential-update algorithms is equivalent to AdaBoost, this provides the first general proof of convergence for AdaBoost. We show that all of our algorithms generalize easily to the multiclass case, and we contrast the new algorithms with the iterative scaling algorithm. We conclude with a few experimental results with synthetic data that highlight the behavior of the old and newly proposed algorithms in different settings.[PUBLICATION ABSTRACT]</description><identifier>ISSN: 0885-6125</identifier><identifier>EISSN: 1573-0565</identifier><identifier>DOI: 10.1023/A:1013912006537</identifier><language>eng</language><publisher>Dordrecht: Springer Nature B.V</publisher><subject>Algorithms ; Studies</subject><ispartof>Machine learning, 2002-07, Vol.48 (1-3), p.253-285</ispartof><rights>Kluwer Academic Publishers 2002</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Collins, Michael</creatorcontrib><creatorcontrib>Schapire, Robert E</creatorcontrib><creatorcontrib>Singer, Yoram</creatorcontrib><title>Logistic Regression, AdaBoost and Bregman Distances</title><title>Machine learning</title><description>We give a unified account of boosting and logistic regression in which each learning problem is cast in terms of optimization of Bregman distances. The striking similarity of the two problems in this framework allows us to design and analyze algorithms for both simultaneously, and to easily adapt algorithms designed for one problem to the other. For both problems, we give new algorithms and explain their potential advantages over existing methods. These algorithms are iterative and can be divided into two types based on whether the parameters are updated sequentially (one at a time) or in parallel (all at once). We also describe a parameterized family of algorithms that includes both a sequential- and a parallel-update algorithm as special cases, thus showing how the sequential and parallel approaches can themselves be unified. For all of the algorithms, we give convergence proofs using a general formalization of the auxiliary-function proof technique. As one of our sequential-update algorithms is equivalent to AdaBoost, this provides the first general proof of convergence for AdaBoost. We show that all of our algorithms generalize easily to the multiclass case, and we contrast the new algorithms with the iterative scaling algorithm. We conclude with a few experimental results with synthetic data that highlight the behavior of the old and newly proposed algorithms in different settings.[PUBLICATION ABSTRACT]</description><subject>Algorithms</subject><subject>Studies</subject><issn>0885-6125</issn><issn>1573-0565</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNpdj0tLxDAUhYMoWEfXbosLV1bvTZqXu874hIIgui5pk5QOnWRsOv_fiq5cncX5-DiHkEuEWwTK7qp7BGQaKYDgTB6RDLlkBXDBj0kGSvFCIOWn5CylLQBQoURGWB37Ic1Dl7-7fnIpDTHc5JU16xjTnJtg8_Xk-p0J-cPCmdC5dE5OvBmTu_jLFfl8evzYvBT12_PrpqqLPUWcC8eFYlZLKwX3wpnWMWlbZ70GCUaX3vuStsIaUapWLz1HqVujUXaSKw1sRa5_vfspfh1cmpvdkDo3jia4eEgNlaiWp3IBr_6B23iYwrKt-TFRDqjZN1dkU3Y</recordid><startdate>20020701</startdate><enddate>20020701</enddate><creator>Collins, Michael</creator><creator>Schapire, Robert E</creator><creator>Singer, Yoram</creator><general>Springer Nature B.V</general><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20020701</creationdate><title>Logistic Regression, AdaBoost and Bregman Distances</title><author>Collins, Michael ; Schapire, Robert E ; Singer, Yoram</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p211t-e5683d97d765f6eabe37dbedf9070a94fff42b6da648b9eab5179ba917c758903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><topic>Algorithms</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Collins, Michael</creatorcontrib><creatorcontrib>Schapire, Robert E</creatorcontrib><creatorcontrib>Singer, Yoram</creatorcontrib><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Machine learning</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Collins, Michael</au><au>Schapire, Robert E</au><au>Singer, Yoram</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Logistic Regression, AdaBoost and Bregman Distances</atitle><jtitle>Machine learning</jtitle><date>2002-07-01</date><risdate>2002</risdate><volume>48</volume><issue>1-3</issue><spage>253</spage><epage>285</epage><pages>253-285</pages><issn>0885-6125</issn><eissn>1573-0565</eissn><abstract>We give a unified account of boosting and logistic regression in which each learning problem is cast in terms of optimization of Bregman distances. The striking similarity of the two problems in this framework allows us to design and analyze algorithms for both simultaneously, and to easily adapt algorithms designed for one problem to the other. For both problems, we give new algorithms and explain their potential advantages over existing methods. These algorithms are iterative and can be divided into two types based on whether the parameters are updated sequentially (one at a time) or in parallel (all at once). We also describe a parameterized family of algorithms that includes both a sequential- and a parallel-update algorithm as special cases, thus showing how the sequential and parallel approaches can themselves be unified. For all of the algorithms, we give convergence proofs using a general formalization of the auxiliary-function proof technique. As one of our sequential-update algorithms is equivalent to AdaBoost, this provides the first general proof of convergence for AdaBoost. We show that all of our algorithms generalize easily to the multiclass case, and we contrast the new algorithms with the iterative scaling algorithm. We conclude with a few experimental results with synthetic data that highlight the behavior of the old and newly proposed algorithms in different settings.[PUBLICATION ABSTRACT]</abstract><cop>Dordrecht</cop><pub>Springer Nature B.V</pub><doi>10.1023/A:1013912006537</doi><tpages>33</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0885-6125
ispartof	Machine learning, 2002-07, Vol.48 (1-3), p.253-285
issn	0885-6125 1573-0565
language	eng
recordid	cdi_proquest_miscellaneous_27185377
source	SpringerLink Journals
subjects	Algorithms Studies
title	Logistic Regression, AdaBoost and Bregman Distances
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T03%3A06%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Logistic%20Regression,%20AdaBoost%20and%20Bregman%20Distances&rft.jtitle=Machine%20learning&rft.au=Collins,%20Michael&rft.date=2002-07-01&rft.volume=48&rft.issue=1-3&rft.spage=253&rft.epage=285&rft.pages=253-285&rft.issn=0885-6125&rft.eissn=1573-0565&rft_id=info:doi/10.1023/A:1013912006537&rft_dat=%3Cproquest%3E2165704591%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=758925019&rft_id=info:pmid/&rfr_iscdi=true