ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS

In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data al...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The annals of applied statistics 2009-03, Vol.3 (1), p.292-318
Hauptverfasser: CULP, Mark, MICHAILIDIS, George, JOHNSON, Kjell
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 318
container_issue 1
container_start_page 292
container_title The annals of applied statistics
container_volume 3
creator CULP, Mark
MICHAILIDIS, George
JOHNSON, Kjell
description In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.
doi_str_mv 10.1214/08-AOAS202
format Article
fullrecord <record><control><sourceid>pascalfrancis_proje</sourceid><recordid>TN_cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1239888372</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>21496827</sourcerecordid><originalsourceid>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</originalsourceid><addsrcrecordid>eNo9UE9LwzAcDaLgnF78BL14UaL5JW2THMtWt0C3guu2Y0izFDqqHc08-O1tWdnpPR7vDzyEnoG8A4Xwgwic5MmGEnqDJiBDwJwxcjtwRnEMEb9HD94fCYlCEcIEveXrYLXNCoV3Kt0HWZp8rdV6EexVsQyS-VwVapcGq3yeZptHdFeZxrunEado-5kWsyXO8oWaJRm2LIrOmAqglFtXGUK4EdZSKBmtrIx56eBATC9K6UIuem4FACldaAWHqARJK8qmKLn0nrr26OzZ_dqmPuhTV3-b7k-3ptazbTaqI5jWeA2USSEE40PH66XDdq33nauucSB6uEoTocerevPLOGi8NU3VmR9b-2uid8tYUM7-Af2AY8o</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><source>JSTOR Mathematics and Statistics</source><source>Project Euclid</source><source>Alma/SFX Local Collection</source><source>JSTOR</source><source>EZB Electronic Journals Library</source><creator>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</creator><creatorcontrib>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</creatorcontrib><description>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</description><identifier>ISSN: 1932-6157</identifier><identifier>EISSN: 1941-7330</identifier><identifier>DOI: 10.1214/08-AOAS202</identifier><language>eng</language><publisher>Cleveland, OH: Institute of Mathematical Statistics</publisher><subject>Applications ; Biology, psychology, social sciences ; Exact sciences and technology ; generalized additive model ; Global analysis, analysis on manifolds ; Markov processes ; Mathematics ; model selection ; Multi-view learning ; Multivariate analysis ; Probability and statistics ; Probability theory and stochastic processes ; Sciences and techniques of general use ; semi-supervised learning ; smoothing ; Statistics ; Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</subject><ispartof>The annals of applied statistics, 2009-03, Vol.3 (1), p.292-318</ispartof><rights>2015 INIST-CNRS</rights><rights>Copyright 2009 Institute of Mathematical Statistics</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</citedby><cites>FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,926,27922,27923</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=21496827$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>CULP, Mark</creatorcontrib><creatorcontrib>MICHAILIDIS, George</creatorcontrib><creatorcontrib>JOHNSON, Kjell</creatorcontrib><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><title>The annals of applied statistics</title><description>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</description><subject>Applications</subject><subject>Biology, psychology, social sciences</subject><subject>Exact sciences and technology</subject><subject>generalized additive model</subject><subject>Global analysis, analysis on manifolds</subject><subject>Markov processes</subject><subject>Mathematics</subject><subject>model selection</subject><subject>Multi-view learning</subject><subject>Multivariate analysis</subject><subject>Probability and statistics</subject><subject>Probability theory and stochastic processes</subject><subject>Sciences and techniques of general use</subject><subject>semi-supervised learning</subject><subject>smoothing</subject><subject>Statistics</subject><subject>Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</subject><issn>1932-6157</issn><issn>1941-7330</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNo9UE9LwzAcDaLgnF78BL14UaL5JW2THMtWt0C3guu2Y0izFDqqHc08-O1tWdnpPR7vDzyEnoG8A4Xwgwic5MmGEnqDJiBDwJwxcjtwRnEMEb9HD94fCYlCEcIEveXrYLXNCoV3Kt0HWZp8rdV6EexVsQyS-VwVapcGq3yeZptHdFeZxrunEado-5kWsyXO8oWaJRm2LIrOmAqglFtXGUK4EdZSKBmtrIx56eBATC9K6UIuem4FACldaAWHqARJK8qmKLn0nrr26OzZ_dqmPuhTV3-b7k-3ptazbTaqI5jWeA2USSEE40PH66XDdq33nauucSB6uEoTocerevPLOGi8NU3VmR9b-2uid8tYUM7-Af2AY8o</recordid><startdate>20090301</startdate><enddate>20090301</enddate><creator>CULP, Mark</creator><creator>MICHAILIDIS, George</creator><creator>JOHNSON, Kjell</creator><general>Institute of Mathematical Statistics</general><general>The Institute of Mathematical Statistics</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20090301</creationdate><title>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</title><author>CULP, Mark ; MICHAILIDIS, George ; JOHNSON, Kjell</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c355t-281227cefa007a8cc21b32fc967be1d0a7a899e478d0ac8110be4c8715b192f23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Applications</topic><topic>Biology, psychology, social sciences</topic><topic>Exact sciences and technology</topic><topic>generalized additive model</topic><topic>Global analysis, analysis on manifolds</topic><topic>Markov processes</topic><topic>Mathematics</topic><topic>model selection</topic><topic>Multi-view learning</topic><topic>Multivariate analysis</topic><topic>Probability and statistics</topic><topic>Probability theory and stochastic processes</topic><topic>Sciences and techniques of general use</topic><topic>semi-supervised learning</topic><topic>smoothing</topic><topic>Statistics</topic><topic>Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>CULP, Mark</creatorcontrib><creatorcontrib>MICHAILIDIS, George</creatorcontrib><creatorcontrib>JOHNSON, Kjell</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>The annals of applied statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>CULP, Mark</au><au>MICHAILIDIS, George</au><au>JOHNSON, Kjell</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS</atitle><jtitle>The annals of applied statistics</jtitle><date>2009-03-01</date><risdate>2009</risdate><volume>3</volume><issue>1</issue><spage>292</spage><epage>318</epage><pages>292-318</pages><issn>1932-6157</issn><eissn>1941-7330</eissn><abstract>In many scientific settings data can be naturally partitioned into variable groupings called views. Common examples include environmental (1st view) and genetic information (2nd view) in ecological applications, chemical (1st view) and biological (2nd view) data in drug discovery. Multi-view data also occur in text analysis and proteomics applications where one view consists of a graph with observations as the vertices and a weighted measure of pairwise similarity between observations as the edges. Further, in several of these applications the observations can be partitioned into two sets, one where the response is observed (labeled) and the other where the response is not (unlabeled). The problem for simultaneously addressing viewed data and incorporating unlabeled observations in training is referred to as multi-view transductive learning. In this work we introduce and study a comprehensive generalized fixed point additive modeling framework for multi-view transductive learning, where any view is represented by a linear smoother. The problem of view selection is discussed using a generalized Akaike Information Criterion, which provides an approach for testing the contribution of each view. An efficient implementation is provided for fitting these models with both backfitting and local-scoring type algorithms adjusted to semi-supervised graph-based learning. The proposed technique is assessed on both synthetic and real data sets and is shown to be competitive to state-of-the-art co-training and graph-based techniques.</abstract><cop>Cleveland, OH</cop><pub>Institute of Mathematical Statistics</pub><doi>10.1214/08-AOAS202</doi><tpages>27</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6157
ispartof The annals of applied statistics, 2009-03, Vol.3 (1), p.292-318
issn 1932-6157
1941-7330
language eng
recordid cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1239888372
source JSTOR Mathematics and Statistics; Project Euclid; Alma/SFX Local Collection; JSTOR; EZB Electronic Journals Library
subjects Applications
Biology, psychology, social sciences
Exact sciences and technology
generalized additive model
Global analysis, analysis on manifolds
Markov processes
Mathematics
model selection
Multi-view learning
Multivariate analysis
Probability and statistics
Probability theory and stochastic processes
Sciences and techniques of general use
semi-supervised learning
smoothing
Statistics
Topology. Manifolds and cell complexes. Global analysis and analysis on manifolds
title ON MULTI-VIEW LEARNING WITH ADDITIVE MODELS
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T16%3A39%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_proje&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ON%20MULTI-VIEW%20LEARNING%20WITH%20ADDITIVE%20MODELS&rft.jtitle=The%20annals%20of%20applied%20statistics&rft.au=CULP,%20Mark&rft.date=2009-03-01&rft.volume=3&rft.issue=1&rft.spage=292&rft.epage=318&rft.pages=292-318&rft.issn=1932-6157&rft.eissn=1941-7330&rft_id=info:doi/10.1214/08-AOAS202&rft_dat=%3Cpascalfrancis_proje%3E21496827%3C/pascalfrancis_proje%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true