Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat

The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat sa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Food and bioprocess technology 2015-01, Vol.8 (1), p.31-40
Hauptverfasser: Mahesh, S, Jayas, D. S, Paliwal, J, White, N. D. G
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 40
container_issue 1
container_start_page 31
container_title Food and bioprocess technology
container_volume 8
creator Mahesh, S
Jayas, D. S
Paliwal, J
White, N. D. G
description The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.
doi_str_mv 10.1007/s11947-014-1381-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2410781557</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2410781557</sourcerecordid><originalsourceid>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</originalsourceid><addsrcrecordid>eNp9kc9u1DAQxiMEEqXwAJywxKUcAp78cZIjRIVdaSmrLhVHa5LYWZddO7WdQ_u2vAmzBIHEgYM1Huv3fTPylyQvgb8Fzqt3AaApqpRDkUJeQ_rwKDmDJi_TEorm8Z97zp8mz0K45VzwAvKz5EfrjhN6E5xlTrMt-mjwwDYKQ2S7uxm9CuxajVSCIeZiu9ldv2FoB7b1xvZmIvrk4ayy8R-0JfKzins3BKadJ4WLythf6hX6wRJJj2owfSRFYHMwdmRxr9iVQp-urfa0wMAurtZktbqflA-T6qOnoesjjrQbLf1hPnxnOzxOh6Vv0eJg0LJve4XxefJE4yGoF7_reXLz8fJru0o3Xz6t2_ebtC8qEVPdVY0SedN3uh4KwRtVNQUMWJaYoRYZfbKAEusBqwwzLQTUXSa6jouGl5lu8vPk9eI7eXc3qxDlrZu9pZEyK4BXNZRlRRQsVO9dCF5pOXlzRH8vgctTknJJUlKS8pSkfCBNtmgCsXZU_q_z_0SvFpFGJ3GkhOXNLuNQcjqCC8h_AkDRrN8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2410781557</pqid></control><display><type>article</type><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><source>SpringerLink Journals - AutoHoldings</source><creator>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</creator><creatorcontrib>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</creatorcontrib><description>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</description><identifier>ISSN: 1935-5130</identifier><identifier>EISSN: 1935-5149</identifier><identifier>DOI: 10.1007/s11947-014-1381-z</identifier><language>eng</language><publisher>Boston: Springer-Verlag</publisher><subject>Agriculture ; bioprocessing ; Biotechnology ; Bulk sampling ; Cereal crops ; Chemistry ; Chemistry and Materials Science ; Chemistry/Food Science ; correlation ; Correlation coefficient ; Correlation coefficients ; Cubes ; data collection ; Datasets ; Farming ; Farms ; Food Science ; Hardness ; hyperspectral imagery ; Hyperspectral imaging ; I.R. radiation ; Infrared imagery ; least squares ; Least squares method ; Near infrared radiation ; Original Paper ; Performance prediction ; polymerase chain reaction ; prediction ; Predictions ; Proteins ; Regression analysis ; Regression models ; seeds ; Standard error ; Standard error of estimate ; wavelengths ; Wheat ; wheat classes</subject><ispartof>Food and bioprocess technology, 2015-01, Vol.8 (1), p.31-40</ispartof><rights>Springer Science+Business Media New York 2014</rights><rights>Springer Science+Business Media New York 2014.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</citedby><cites>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11947-014-1381-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11947-014-1381-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Mahesh, S</creatorcontrib><creatorcontrib>Jayas, D. S</creatorcontrib><creatorcontrib>Paliwal, J</creatorcontrib><creatorcontrib>White, N. D. G</creatorcontrib><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><title>Food and bioprocess technology</title><addtitle>Food Bioprocess Technol</addtitle><description>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</description><subject>Agriculture</subject><subject>bioprocessing</subject><subject>Biotechnology</subject><subject>Bulk sampling</subject><subject>Cereal crops</subject><subject>Chemistry</subject><subject>Chemistry and Materials Science</subject><subject>Chemistry/Food Science</subject><subject>correlation</subject><subject>Correlation coefficient</subject><subject>Correlation coefficients</subject><subject>Cubes</subject><subject>data collection</subject><subject>Datasets</subject><subject>Farming</subject><subject>Farms</subject><subject>Food Science</subject><subject>Hardness</subject><subject>hyperspectral imagery</subject><subject>Hyperspectral imaging</subject><subject>I.R. radiation</subject><subject>Infrared imagery</subject><subject>least squares</subject><subject>Least squares method</subject><subject>Near infrared radiation</subject><subject>Original Paper</subject><subject>Performance prediction</subject><subject>polymerase chain reaction</subject><subject>prediction</subject><subject>Predictions</subject><subject>Proteins</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>seeds</subject><subject>Standard error</subject><subject>Standard error of estimate</subject><subject>wavelengths</subject><subject>Wheat</subject><subject>wheat classes</subject><issn>1935-5130</issn><issn>1935-5149</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kc9u1DAQxiMEEqXwAJywxKUcAp78cZIjRIVdaSmrLhVHa5LYWZddO7WdQ_u2vAmzBIHEgYM1Huv3fTPylyQvgb8Fzqt3AaApqpRDkUJeQ_rwKDmDJi_TEorm8Z97zp8mz0K45VzwAvKz5EfrjhN6E5xlTrMt-mjwwDYKQ2S7uxm9CuxajVSCIeZiu9ldv2FoB7b1xvZmIvrk4ayy8R-0JfKzins3BKadJ4WLythf6hX6wRJJj2owfSRFYHMwdmRxr9iVQp-urfa0wMAurtZktbqflA-T6qOnoesjjrQbLf1hPnxnOzxOh6Vv0eJg0LJve4XxefJE4yGoF7_reXLz8fJru0o3Xz6t2_ebtC8qEVPdVY0SedN3uh4KwRtVNQUMWJaYoRYZfbKAEusBqwwzLQTUXSa6jouGl5lu8vPk9eI7eXc3qxDlrZu9pZEyK4BXNZRlRRQsVO9dCF5pOXlzRH8vgctTknJJUlKS8pSkfCBNtmgCsXZU_q_z_0SvFpFGJ3GkhOXNLuNQcjqCC8h_AkDRrN8</recordid><startdate>20150101</startdate><enddate>20150101</enddate><creator>Mahesh, S</creator><creator>Jayas, D. S</creator><creator>Paliwal, J</creator><creator>White, N. D. G</creator><general>Springer-Verlag</general><general>Springer US</general><general>Springer Nature B.V</general><scope>FBQ</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X2</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FK</scope><scope>ABJCF</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ATCPS</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M0K</scope><scope>M7S</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope></search><sort><creationdate>20150101</creationdate><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><author>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Agriculture</topic><topic>bioprocessing</topic><topic>Biotechnology</topic><topic>Bulk sampling</topic><topic>Cereal crops</topic><topic>Chemistry</topic><topic>Chemistry and Materials Science</topic><topic>Chemistry/Food Science</topic><topic>correlation</topic><topic>Correlation coefficient</topic><topic>Correlation coefficients</topic><topic>Cubes</topic><topic>data collection</topic><topic>Datasets</topic><topic>Farming</topic><topic>Farms</topic><topic>Food Science</topic><topic>Hardness</topic><topic>hyperspectral imagery</topic><topic>Hyperspectral imaging</topic><topic>I.R. radiation</topic><topic>Infrared imagery</topic><topic>least squares</topic><topic>Least squares method</topic><topic>Near infrared radiation</topic><topic>Original Paper</topic><topic>Performance prediction</topic><topic>polymerase chain reaction</topic><topic>prediction</topic><topic>Predictions</topic><topic>Proteins</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>seeds</topic><topic>Standard error</topic><topic>Standard error of estimate</topic><topic>wavelengths</topic><topic>Wheat</topic><topic>wheat classes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mahesh, S</creatorcontrib><creatorcontrib>Jayas, D. S</creatorcontrib><creatorcontrib>Paliwal, J</creatorcontrib><creatorcontrib>White, N. D. G</creatorcontrib><collection>AGRIS</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Agricultural Science Collection</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Agricultural Science Database</collection><collection>Engineering Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><jtitle>Food and bioprocess technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mahesh, S</au><au>Jayas, D. S</au><au>Paliwal, J</au><au>White, N. D. G</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</atitle><jtitle>Food and bioprocess technology</jtitle><stitle>Food Bioprocess Technol</stitle><date>2015-01-01</date><risdate>2015</risdate><volume>8</volume><issue>1</issue><spage>31</spage><epage>40</epage><pages>31-40</pages><issn>1935-5130</issn><eissn>1935-5149</eissn><abstract>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</abstract><cop>Boston</cop><pub>Springer-Verlag</pub><doi>10.1007/s11947-014-1381-z</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1935-5130
ispartof Food and bioprocess technology, 2015-01, Vol.8 (1), p.31-40
issn 1935-5130
1935-5149
language eng
recordid cdi_proquest_journals_2410781557
source SpringerLink Journals - AutoHoldings
subjects Agriculture
bioprocessing
Biotechnology
Bulk sampling
Cereal crops
Chemistry
Chemistry and Materials Science
Chemistry/Food Science
correlation
Correlation coefficient
Correlation coefficients
Cubes
data collection
Datasets
Farming
Farms
Food Science
Hardness
hyperspectral imagery
Hyperspectral imaging
I.R. radiation
Infrared imagery
least squares
Least squares method
Near infrared radiation
Original Paper
Performance prediction
polymerase chain reaction
prediction
Predictions
Proteins
Regression analysis
Regression models
seeds
Standard error
Standard error of estimate
wavelengths
Wheat
wheat classes
title Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T00%3A24%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Partial%20Least%20Squares%20Regression%20(PLSR)%20and%20Principal%20Components%20Regression%20(PCR)%20Methods%20for%20Protein%20and%20Hardness%20Predictions%20using%20the%20Near-Infrared%20(NIR)%20Hyperspectral%20Images%20of%20Bulk%20Samples%20of%20Canadian%20Wheat&rft.jtitle=Food%20and%20bioprocess%20technology&rft.au=Mahesh,%20S&rft.date=2015-01-01&rft.volume=8&rft.issue=1&rft.spage=31&rft.epage=40&rft.pages=31-40&rft.issn=1935-5130&rft.eissn=1935-5149&rft_id=info:doi/10.1007/s11947-014-1381-z&rft_dat=%3Cproquest_cross%3E2410781557%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2410781557&rft_id=info:pmid/&rfr_iscdi=true