ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS
In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data al...
Gespeichert in:
Veröffentlicht in: | The annals of applied statistics 2009-03, Vol.3 (1), p.292-318 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 318 |
---|---|
container_issue | 1 |
container_start_page | 292 |
container_title | The annals of applied statistics |
container_volume | 3 |
creator | CULP, Mark MICHAILIDIS, George JOHNSON, Kjell |
description | In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques. |
doi_str_mv | 10.1214/08-AOAS202 |
format | Article |
fullrecord | <record><control><sourceid>pascalfrancis_proje</sourceid><recordid>TN_cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1239888372</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>21496827</sourcerecordid><originalsourceid>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</originalsourceid><addsrcrecordid>eNo9UE9LwzAcDaLgnF78BL14UaL5JW2THMtWt0C3guu2Y0izFDqqHc08-O1tWdnpPR7vDzyEnoG8A4Xwgwic5MmGEnqDJiBDwJwxcjtwRnEMEb9HD94fCYlCEcIEveXrYLXNCoV3Kt0HWZp8rdV6EexVsQyS-VwVapcGq3yeZptHdFeZxrunEado-5kWsyXO8oWaJRm2LIrOmAqglFtXGUK4EdZSKBmtrIx56eBATC9K6UIuem4FACldaAWHqARJK8qmKLn0nrr26OzZ_dqmPuhTV3-b7k-3ptazbTaqI5jWeA2USSEE40PH66XDdq33nauucSB6uEoTocerevPLOGi8NU3VmR9b-2uid8tYUM7-Af2AY8o</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><source>JSTOR Mathematics and Statistics</source><source>Project Euclid</source><source>Alma/SFX Local Collection</source><source>JSTOR</source><source>EZB Electronic Journals Library</source><creator>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</creator><creatorcontrib>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</creatorcontrib><description>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</description><identifier>ISSN: 1932-6157</identifier><identifier>EISSN: 1941-7330</identifier><identifier>DOI: 10.1214/08-AOAS202</identifier><language>eng</language><publisher>Cleveland, OH: Institute of Mathematical Statistics</publisher><subject>Applications ; Biology, psychology, social sciences ; Exact sciences and technology ; generalized additive model ; Global analysis, analysis on manifolds ; Markov processes ; Mathematics ; model selection ; Multi-view learning ; Multivariate analysis ; Probability and statistics ; Probability theory and stochastic processes ; Sciences and techniques of general use ; semi-supervised learning ; smoothing ; Statistics ; Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</subject><ispartof>The annals of applied statistics, 2009-03, Vol.3 (1), p.292-318</ispartof><rights>2015 INIST-CNRS</rights><rights>Copyright 2009 Institute of Mathematical Statistics</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</citedby><cites>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,926,27922,27923</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=21496827$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>CULP, Mark</creatorcontrib><creatorcontrib>MICHAILIDIS, George</creatorcontrib><creatorcontrib>JOHNSON, Kjell</creatorcontrib><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><title>The annals of applied statistics</title><description>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</description><subject>Applications</subject><subject>Biology, psychology, social sciences</subject><subject>Exact sciences and technology</subject><subject>generalized additive model</subject><subject>Global analysis, analysis on manifolds</subject><subject>Markov processes</subject><subject>Mathematics</subject><subject>model selection</subject><subject>Multi-view learning</subject><subject>Multivariate analysis</subject><subject>Probability and statistics</subject><subject>Probability theory and stochastic processes</subject><subject>Sciences and techniques of general use</subject><subject>semi-supervised learning</subject><subject>smoothing</subject><subject>Statistics</subject><subject>Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</subject><issn>1932-6157</issn><issn>1941-7330</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNo9UE9LwzAcDaLgnF78BL14UaL5JW2THMtWt0C3guu2Y0izFDqqHc08-O1tWdnpPR7vDzyEnoG8A4Xwgwic5MmGEnqDJiBDwJwxcjtwRnEMEb9HD94fCYlCEcIEveXrYLXNCoV3Kt0HWZp8rdV6EexVsQyS-VwVapcGq3yeZptHdFeZxrunEado-5kWsyXO8oWaJRm2LIrOmAqglFtXGUK4EdZSKBmtrIx56eBATC9K6UIuem4FACldaAWHqARJK8qmKLn0nrr26OzZ_dqmPuhTV3-b7k-3ptazbTaqI5jWeA2USSEE40PH66XDdq33nauucSB6uEoTocerevPLOGi8NU3VmR9b-2uid8tYUM7-Af2AY8o</recordid><startdate>20090301</startdate><enddate>20090301</enddate><creator>CULP, Mark</creator><creator>MICHAILIDIS, George</creator><creator>JOHNSON, Kjell</creator><general>Institute of Mathematical Statistics</general><general>The Institute of Mathematical Statistics</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20090301</creationdate><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><author>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Applications</topic><topic>Biology, psychology, social sciences</topic><topic>Exact sciences and technology</topic><topic>generalized additive model</topic><topic>Global analysis, analysis on manifolds</topic><topic>Markov processes</topic><topic>Mathematics</topic><topic>model selection</topic><topic>Multi-view learning</topic><topic>Multivariate analysis</topic><topic>Probability and statistics</topic><topic>Probability theory and stochastic processes</topic><topic>Sciences and techniques of general use</topic><topic>semi-supervised learning</topic><topic>smoothing</topic><topic>Statistics</topic><topic>Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>CULP, Mark</creatorcontrib><creatorcontrib>MICHAILIDIS, George</creatorcontrib><creatorcontrib>JOHNSON, Kjell</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>The annals of applied statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>CULP, Mark</au><au>MICHAILIDIS, George</au><au>JOHNSON, Kjell</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</atitle><jtitle>The annals of applied statistics</jtitle><date>2009-03-01</date><risdate>2009</risdate><volume>3</volume><issue>1</issue><spage>292</spage><epage>318</epage><pages>292-318</pages><issn>1932-6157</issn><eissn>1941-7330</eissn><abstract>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</abstract><cop>Cleveland, OH</cop><pub>Institute of Mathematical Statistics</pub><doi>10.1214/08-AOAS202</doi><tpages>27</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6157 |
ispartof | The annals of applied statistics, 2009-03, Vol.3 (1), p.292-318 |
issn | 1932-6157 1941-7330 |
language | eng |
recordid | cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1239888372 |
source | JSTOR Mathematics and Statistics; Project Euclid; Alma/SFX Local Collection; JSTOR; EZB Electronic Journals Library |
subjects | Applications Biology, psychology, social sciences Exact sciences and technology generalized additive model Global analysis, analysis on manifolds Markov processes Mathematics model selection Multi-view learning Multivariate analysis Probability and statistics Probability theory and stochastic processes Sciences and techniques of general use semi-supervised learning smoothing Statistics Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds |
title | ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T16%3A39%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_proje&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ON%20MULTI-VIEW%20LEARNING%20WITH%20ADDITIVE%20MODELS&rft.jtitle=The%20annals%20of%20applied%20statistics&rft.au=CULP,%20Mark&rft.date=2009-03-01&rft.volume=3&rft.issue=1&rft.spage=292&rft.epage=318&rft.pages=292-318&rft.issn=1932-6157&rft.eissn=1941-7330&rft_id=info:doi/10.1214/08-AOAS202&rft_dat=%3Cpascalfrancis_proje%3E21496827%3C/pascalfrancis_proje%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |