Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data

•TCATA data explored and visualized using PCA.•Bootstrap procedures used to investigate uncertainty in scores.•Uncertainty in scores visualized in different ways.•Index proposed to evaluate discriminability of scores.•New graphical features proposed for PCA biplots. Temporal check-all-that-apply (TC...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Food quality and preference 2022-03, Vol.96, p.104370, Article 104370
Hauptverfasser: Castura, J.C., Rutledge, D.N., Ross, C.F., Næs, T.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 104370
container_title Food quality and preference
container_volume 96
creator Castura, J.C.
Rutledge, D.N.
Ross, C.F.
Næs, T.
description •TCATA data explored and visualized using PCA.•Bootstrap procedures used to investigate uncertainty in scores.•Uncertainty in scores visualized in different ways.•Index proposed to evaluate discriminability of scores.•New graphical features proposed for PCA biplots. Temporal check-all-that-apply (TCATA) data can be summarized and explored using principal component analysis (PCA). Here we analyze TCATA data on Syrah wines obtained from a trained sensory panel. We evaluate new and existing methods to explore the uncertainty in the PCA scores. To do so, we use the bootstrap procedure to obtain many virtual panels from the real panel’s data. Virtual-panel PCA scores are obtained using two methods. The first method, called the partial bootstrap (PB), obtains virtual-panel scores from regression. The second method, called the truncated total bootstrap (TTB), applies PCA to the virtual-panel results to obtain scores, which are truncated and superimposed on the real-panel scores by Procrustes rotation. We use the virtual scores from each method to investigate uncertainty in the real-panel PCA scores visually and numerically. To understand the uncertainty of the scores, we obtain confidence ellipses (CEs) and their areas, as well as confidence intervals (CIs) and their widths. Next, to determine whether PCA scores for different samples are well separated, we propose a procedure for approximating the standard errors of sample differences and correcting for multiple comparisons. We propose a discriminability index, and show that it can enhance the interpretability of PCA results. We incorporate graphical features into our PCA biplots to visualize discriminability. We did not find a large difference between the PB and TTB methods for understanding the uncertainty and discriminability in PCA scores. Although the TCATA data that we analyzed have a special structure, the methodological approaches presented here can be readily adapted to other applications of PCA.
doi_str_mv 10.1016/j.foodqual.2021.104370
format Article
fullrecord <record><control><sourceid>elsevier_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_03469143v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950329321002524</els_id><sourcerecordid>S0950329321002524</sourcerecordid><originalsourceid>FETCH-LOGICAL-c394t-92db077a867ecd8bfc723b676c1a1e11a6122f022b884ee48e4e82289f51f1b03</originalsourceid><addsrcrecordid>eNqFkMtOwzAQRS0EEuXxCyjLdpHisdM8dkThUaRKsChra-I4qoubBNutlL_HVYEtq9HMnDvSHELugM6BQnq_nbd933zt0cwZZRCGCc_oGZlAnvE45Qk_JxNaLGjMWcEvyZVzW0oho8AmxD9qJ63e6Q5rbbQfI-yaaN9JZT3qLvS6iwarO6kHNJHsd0Pfqc4HDM3otIum71U5i_o28irs7BHaKPkZozGx36CPcRjMGE3XVbkOYIMeb8hFi8ap2596TT6en9bVMl69vbxW5SqWvEh8XLCmplmGeZop2eR1KzPG6zRLJSAoAEyBsZYyVud5olSSq0TljOVFu4AWasqvyex0d4NGhCd2aEfRoxbLciWOM8qTtICEHyCw6YmVtnfOqvYvAFQcPYut-PUsjp7FyXMIPpyCKnxy0MoKJ7UKAhttlfSi6fV_J74Boj2J1w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Castura, J.C. ; Rutledge, D.N. ; Ross, C.F. ; Næs, T.</creator><creatorcontrib>Castura, J.C. ; Rutledge, D.N. ; Ross, C.F. ; Næs, T.</creatorcontrib><description>•TCATA data explored and visualized using PCA.•Bootstrap procedures used to investigate uncertainty in scores.•Uncertainty in scores visualized in different ways.•Index proposed to evaluate discriminability of scores.•New graphical features proposed for PCA biplots. Temporal check-all-that-apply (TCATA) data can be summarized and explored using principal component analysis (PCA). Here we analyze TCATA data on Syrah wines obtained from a trained sensory panel. We evaluate new and existing methods to explore the uncertainty in the PCA scores. To do so, we use the bootstrap procedure to obtain many virtual panels from the real panel’s data. Virtual-panel PCA scores are obtained using two methods. The first method, called the partial bootstrap (PB), obtains virtual-panel scores from regression. The second method, called the truncated total bootstrap (TTB), applies PCA to the virtual-panel results to obtain scores, which are truncated and superimposed on the real-panel scores by Procrustes rotation. We use the virtual scores from each method to investigate uncertainty in the real-panel PCA scores visually and numerically. To understand the uncertainty of the scores, we obtain confidence ellipses (CEs) and their areas, as well as confidence intervals (CIs) and their widths. Next, to determine whether PCA scores for different samples are well separated, we propose a procedure for approximating the standard errors of sample differences and correcting for multiple comparisons. We propose a discriminability index, and show that it can enhance the interpretability of PCA results. We incorporate graphical features into our PCA biplots to visualize discriminability. We did not find a large difference between the PB and TTB methods for understanding the uncertainty and discriminability in PCA scores. Although the TCATA data that we analyzed have a special structure, the methodological approaches presented here can be readily adapted to other applications of PCA.</description><identifier>ISSN: 0950-3293</identifier><identifier>EISSN: 1873-6343</identifier><identifier>DOI: 10.1016/j.foodqual.2021.104370</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Bootstrap ; Discriminability ; Mathematics ; Principal component analysis (PCA) ; Statistics ; Temporal check-all-that-apply (TCATA) ; Uncertainty ; Wine</subject><ispartof>Food quality and preference, 2022-03, Vol.96, p.104370, Article 104370</ispartof><rights>2021 Elsevier Ltd</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c394t-92db077a867ecd8bfc723b676c1a1e11a6122f022b884ee48e4e82289f51f1b03</citedby><cites>FETCH-LOGICAL-c394t-92db077a867ecd8bfc723b676c1a1e11a6122f022b884ee48e4e82289f51f1b03</cites><orcidid>0000-0002-1640-833X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.foodqual.2021.104370$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://hal.science/hal-03469143$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Castura, J.C.</creatorcontrib><creatorcontrib>Rutledge, D.N.</creatorcontrib><creatorcontrib>Ross, C.F.</creatorcontrib><creatorcontrib>Næs, T.</creatorcontrib><title>Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data</title><title>Food quality and preference</title><description>•TCATA data explored and visualized using PCA.•Bootstrap procedures used to investigate uncertainty in scores.•Uncertainty in scores visualized in different ways.•Index proposed to evaluate discriminability of scores.•New graphical features proposed for PCA biplots. Temporal check-all-that-apply (TCATA) data can be summarized and explored using principal component analysis (PCA). Here we analyze TCATA data on Syrah wines obtained from a trained sensory panel. We evaluate new and existing methods to explore the uncertainty in the PCA scores. To do so, we use the bootstrap procedure to obtain many virtual panels from the real panel’s data. Virtual-panel PCA scores are obtained using two methods. The first method, called the partial bootstrap (PB), obtains virtual-panel scores from regression. The second method, called the truncated total bootstrap (TTB), applies PCA to the virtual-panel results to obtain scores, which are truncated and superimposed on the real-panel scores by Procrustes rotation. We use the virtual scores from each method to investigate uncertainty in the real-panel PCA scores visually and numerically. To understand the uncertainty of the scores, we obtain confidence ellipses (CEs) and their areas, as well as confidence intervals (CIs) and their widths. Next, to determine whether PCA scores for different samples are well separated, we propose a procedure for approximating the standard errors of sample differences and correcting for multiple comparisons. We propose a discriminability index, and show that it can enhance the interpretability of PCA results. We incorporate graphical features into our PCA biplots to visualize discriminability. We did not find a large difference between the PB and TTB methods for understanding the uncertainty and discriminability in PCA scores. Although the TCATA data that we analyzed have a special structure, the methodological approaches presented here can be readily adapted to other applications of PCA.</description><subject>Bootstrap</subject><subject>Discriminability</subject><subject>Mathematics</subject><subject>Principal component analysis (PCA)</subject><subject>Statistics</subject><subject>Temporal check-all-that-apply (TCATA)</subject><subject>Uncertainty</subject><subject>Wine</subject><issn>0950-3293</issn><issn>1873-6343</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNqFkMtOwzAQRS0EEuXxCyjLdpHisdM8dkThUaRKsChra-I4qoubBNutlL_HVYEtq9HMnDvSHELugM6BQnq_nbd933zt0cwZZRCGCc_oGZlAnvE45Qk_JxNaLGjMWcEvyZVzW0oho8AmxD9qJ63e6Q5rbbQfI-yaaN9JZT3qLvS6iwarO6kHNJHsd0Pfqc4HDM3otIum71U5i_o28irs7BHaKPkZozGx36CPcRjMGE3XVbkOYIMeb8hFi8ap2596TT6en9bVMl69vbxW5SqWvEh8XLCmplmGeZop2eR1KzPG6zRLJSAoAEyBsZYyVud5olSSq0TljOVFu4AWasqvyex0d4NGhCd2aEfRoxbLciWOM8qTtICEHyCw6YmVtnfOqvYvAFQcPYut-PUsjp7FyXMIPpyCKnxy0MoKJ7UKAhttlfSi6fV_J74Boj2J1w</recordid><startdate>202203</startdate><enddate>202203</enddate><creator>Castura, J.C.</creator><creator>Rutledge, D.N.</creator><creator>Ross, C.F.</creator><creator>Næs, T.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>AAYXX</scope><scope>CITATION</scope><scope>1XC</scope><orcidid>https://orcid.org/0000-0002-1640-833X</orcidid></search><sort><creationdate>202203</creationdate><title>Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data</title><author>Castura, J.C. ; Rutledge, D.N. ; Ross, C.F. ; Næs, T.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c394t-92db077a867ecd8bfc723b676c1a1e11a6122f022b884ee48e4e82289f51f1b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Bootstrap</topic><topic>Discriminability</topic><topic>Mathematics</topic><topic>Principal component analysis (PCA)</topic><topic>Statistics</topic><topic>Temporal check-all-that-apply (TCATA)</topic><topic>Uncertainty</topic><topic>Wine</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Castura, J.C.</creatorcontrib><creatorcontrib>Rutledge, D.N.</creatorcontrib><creatorcontrib>Ross, C.F.</creatorcontrib><creatorcontrib>Næs, T.</creatorcontrib><collection>CrossRef</collection><collection>Hyper Article en Ligne (HAL)</collection><jtitle>Food quality and preference</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Castura, J.C.</au><au>Rutledge, D.N.</au><au>Ross, C.F.</au><au>Næs, T.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data</atitle><jtitle>Food quality and preference</jtitle><date>2022-03</date><risdate>2022</risdate><volume>96</volume><spage>104370</spage><pages>104370-</pages><artnum>104370</artnum><issn>0950-3293</issn><eissn>1873-6343</eissn><abstract>•TCATA data explored and visualized using PCA.•Bootstrap procedures used to investigate uncertainty in scores.•Uncertainty in scores visualized in different ways.•Index proposed to evaluate discriminability of scores.•New graphical features proposed for PCA biplots. Temporal check-all-that-apply (TCATA) data can be summarized and explored using principal component analysis (PCA). Here we analyze TCATA data on Syrah wines obtained from a trained sensory panel. We evaluate new and existing methods to explore the uncertainty in the PCA scores. To do so, we use the bootstrap procedure to obtain many virtual panels from the real panel’s data. Virtual-panel PCA scores are obtained using two methods. The first method, called the partial bootstrap (PB), obtains virtual-panel scores from regression. The second method, called the truncated total bootstrap (TTB), applies PCA to the virtual-panel results to obtain scores, which are truncated and superimposed on the real-panel scores by Procrustes rotation. We use the virtual scores from each method to investigate uncertainty in the real-panel PCA scores visually and numerically. To understand the uncertainty of the scores, we obtain confidence ellipses (CEs) and their areas, as well as confidence intervals (CIs) and their widths. Next, to determine whether PCA scores for different samples are well separated, we propose a procedure for approximating the standard errors of sample differences and correcting for multiple comparisons. We propose a discriminability index, and show that it can enhance the interpretability of PCA results. We incorporate graphical features into our PCA biplots to visualize discriminability. We did not find a large difference between the PB and TTB methods for understanding the uncertainty and discriminability in PCA scores. Although the TCATA data that we analyzed have a special structure, the methodological approaches presented here can be readily adapted to other applications of PCA.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.foodqual.2021.104370</doi><orcidid>https://orcid.org/0000-0002-1640-833X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0950-3293
ispartof Food quality and preference, 2022-03, Vol.96, p.104370, Article 104370
issn 0950-3293
1873-6343
language eng
recordid cdi_hal_primary_oai_HAL_hal_03469143v1
source Elsevier ScienceDirect Journals Complete
subjects Bootstrap
Discriminability
Mathematics
Principal component analysis (PCA)
Statistics
Temporal check-all-that-apply (TCATA)
Uncertainty
Wine
title Discriminability and uncertainty in principal component analysis (PCA) of temporal check-all-that-apply (TCATA) data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T07%3A17%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Discriminability%20and%20uncertainty%20in%20principal%20component%20analysis%20(PCA)%20of%20temporal%20check-all-that-apply%20(TCATA)%20data&rft.jtitle=Food%20quality%20and%20preference&rft.au=Castura,%20J.C.&rft.date=2022-03&rft.volume=96&rft.spage=104370&rft.pages=104370-&rft.artnum=104370&rft.issn=0950-3293&rft.eissn=1873-6343&rft_id=info:doi/10.1016/j.foodqual.2021.104370&rft_dat=%3Celsevier_hal_p%3ES0950329321002524%3C/elsevier_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_els_id=S0950329321002524&rfr_iscdi=true