Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat
The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat sa...
Gespeichert in:
Veröffentlicht in: | Food and bioprocess technology 2015-01, Vol.8 (1), p.31-40 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 40 |
---|---|
container_issue | 1 |
container_start_page | 31 |
container_title | Food and bioprocess technology |
container_volume | 8 |
creator | Mahesh, S Jayas, D. S Paliwal, J White, N. D. G |
description | The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat. |
doi_str_mv | 10.1007/s11947-014-1381-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2410781557</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2410781557</sourcerecordid><originalsourceid>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</originalsourceid><addsrcrecordid>eNp9kc9u1DAQxiMEEqXwAJywxKUcAp78cZIjRIVdaSmrLhVHa5LYWZddO7WdQ_u2vAmzBIHEgYM1Huv3fTPylyQvgb8Fzqt3AaApqpRDkUJeQ_rwKDmDJi_TEorm8Z97zp8mz0K45VzwAvKz5EfrjhN6E5xlTrMt-mjwwDYKQ2S7uxm9CuxajVSCIeZiu9ldv2FoB7b1xvZmIvrk4ayy8R-0JfKzins3BKadJ4WLythf6hX6wRJJj2owfSRFYHMwdmRxr9iVQp-urfa0wMAurtZktbqflA-T6qOnoesjjrQbLf1hPnxnOzxOh6Vv0eJg0LJve4XxefJE4yGoF7_reXLz8fJru0o3Xz6t2_ebtC8qEVPdVY0SedN3uh4KwRtVNQUMWJaYoRYZfbKAEusBqwwzLQTUXSa6jouGl5lu8vPk9eI7eXc3qxDlrZu9pZEyK4BXNZRlRRQsVO9dCF5pOXlzRH8vgctTknJJUlKS8pSkfCBNtmgCsXZU_q_z_0SvFpFGJ3GkhOXNLuNQcjqCC8h_AkDRrN8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2410781557</pqid></control><display><type>article</type><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><source>SpringerLink Journals - AutoHoldings</source><creator>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</creator><creatorcontrib>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</creatorcontrib><description>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</description><identifier>ISSN: 1935-5130</identifier><identifier>EISSN: 1935-5149</identifier><identifier>DOI: 10.1007/s11947-014-1381-z</identifier><language>eng</language><publisher>Boston: Springer-Verlag</publisher><subject>Agriculture ; bioprocessing ; Biotechnology ; Bulk sampling ; Cereal crops ; Chemistry ; Chemistry and Materials Science ; Chemistry/Food Science ; correlation ; Correlation coefficient ; Correlation coefficients ; Cubes ; data collection ; Datasets ; Farming ; Farms ; Food Science ; Hardness ; hyperspectral imagery ; Hyperspectral imaging ; I.R. radiation ; Infrared imagery ; least squares ; Least squares method ; Near infrared radiation ; Original Paper ; Performance prediction ; polymerase chain reaction ; prediction ; Predictions ; Proteins ; Regression analysis ; Regression models ; seeds ; Standard error ; Standard error of estimate ; wavelengths ; Wheat ; wheat classes</subject><ispartof>Food and bioprocess technology, 2015-01, Vol.8 (1), p.31-40</ispartof><rights>Springer Science+Business Media New York 2014</rights><rights>Springer Science+Business Media New York 2014.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</citedby><cites>FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11947-014-1381-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11947-014-1381-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Mahesh, S</creatorcontrib><creatorcontrib>Jayas, D. S</creatorcontrib><creatorcontrib>Paliwal, J</creatorcontrib><creatorcontrib>White, N. D. G</creatorcontrib><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><title>Food and bioprocess technology</title><addtitle>Food Bioprocess Technol</addtitle><description>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</description><subject>Agriculture</subject><subject>bioprocessing</subject><subject>Biotechnology</subject><subject>Bulk sampling</subject><subject>Cereal crops</subject><subject>Chemistry</subject><subject>Chemistry and Materials Science</subject><subject>Chemistry/Food Science</subject><subject>correlation</subject><subject>Correlation coefficient</subject><subject>Correlation coefficients</subject><subject>Cubes</subject><subject>data collection</subject><subject>Datasets</subject><subject>Farming</subject><subject>Farms</subject><subject>Food Science</subject><subject>Hardness</subject><subject>hyperspectral imagery</subject><subject>Hyperspectral imaging</subject><subject>I.R. radiation</subject><subject>Infrared imagery</subject><subject>least squares</subject><subject>Least squares method</subject><subject>Near infrared radiation</subject><subject>Original Paper</subject><subject>Performance prediction</subject><subject>polymerase chain reaction</subject><subject>prediction</subject><subject>Predictions</subject><subject>Proteins</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>seeds</subject><subject>Standard error</subject><subject>Standard error of estimate</subject><subject>wavelengths</subject><subject>Wheat</subject><subject>wheat classes</subject><issn>1935-5130</issn><issn>1935-5149</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kc9u1DAQxiMEEqXwAJywxKUcAp78cZIjRIVdaSmrLhVHa5LYWZddO7WdQ_u2vAmzBIHEgYM1Huv3fTPylyQvgb8Fzqt3AaApqpRDkUJeQ_rwKDmDJi_TEorm8Z97zp8mz0K45VzwAvKz5EfrjhN6E5xlTrMt-mjwwDYKQ2S7uxm9CuxajVSCIeZiu9ldv2FoB7b1xvZmIvrk4ayy8R-0JfKzins3BKadJ4WLythf6hX6wRJJj2owfSRFYHMwdmRxr9iVQp-urfa0wMAurtZktbqflA-T6qOnoesjjrQbLf1hPnxnOzxOh6Vv0eJg0LJve4XxefJE4yGoF7_reXLz8fJru0o3Xz6t2_ebtC8qEVPdVY0SedN3uh4KwRtVNQUMWJaYoRYZfbKAEusBqwwzLQTUXSa6jouGl5lu8vPk9eI7eXc3qxDlrZu9pZEyK4BXNZRlRRQsVO9dCF5pOXlzRH8vgctTknJJUlKS8pSkfCBNtmgCsXZU_q_z_0SvFpFGJ3GkhOXNLuNQcjqCC8h_AkDRrN8</recordid><startdate>20150101</startdate><enddate>20150101</enddate><creator>Mahesh, S</creator><creator>Jayas, D. S</creator><creator>Paliwal, J</creator><creator>White, N. D. G</creator><general>Springer-Verlag</general><general>Springer US</general><general>Springer Nature B.V</general><scope>FBQ</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X2</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FK</scope><scope>ABJCF</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ATCPS</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M0K</scope><scope>M7S</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope></search><sort><creationdate>20150101</creationdate><title>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</title><author>Mahesh, S ; Jayas, D. S ; Paliwal, J ; White, N. D. G</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c476t-fb79e639cbf8d4609e7941da55a2af62007615a8da72a2f6618b26bb069052f93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Agriculture</topic><topic>bioprocessing</topic><topic>Biotechnology</topic><topic>Bulk sampling</topic><topic>Cereal crops</topic><topic>Chemistry</topic><topic>Chemistry and Materials Science</topic><topic>Chemistry/Food Science</topic><topic>correlation</topic><topic>Correlation coefficient</topic><topic>Correlation coefficients</topic><topic>Cubes</topic><topic>data collection</topic><topic>Datasets</topic><topic>Farming</topic><topic>Farms</topic><topic>Food Science</topic><topic>Hardness</topic><topic>hyperspectral imagery</topic><topic>Hyperspectral imaging</topic><topic>I.R. radiation</topic><topic>Infrared imagery</topic><topic>least squares</topic><topic>Least squares method</topic><topic>Near infrared radiation</topic><topic>Original Paper</topic><topic>Performance prediction</topic><topic>polymerase chain reaction</topic><topic>prediction</topic><topic>Predictions</topic><topic>Proteins</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>seeds</topic><topic>Standard error</topic><topic>Standard error of estimate</topic><topic>wavelengths</topic><topic>Wheat</topic><topic>wheat classes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mahesh, S</creatorcontrib><creatorcontrib>Jayas, D. S</creatorcontrib><creatorcontrib>Paliwal, J</creatorcontrib><creatorcontrib>White, N. D. G</creatorcontrib><collection>AGRIS</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Agricultural Science Collection</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Agricultural Science Database</collection><collection>Engineering Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><jtitle>Food and bioprocess technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mahesh, S</au><au>Jayas, D. S</au><au>Paliwal, J</au><au>White, N. D. G</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat</atitle><jtitle>Food and bioprocess technology</jtitle><stitle>Food Bioprocess Technol</stitle><date>2015-01-01</date><risdate>2015</risdate><volume>8</volume><issue>1</issue><spage>31</spage><epage>40</epage><pages>31-40</pages><issn>1935-5130</issn><eissn>1935-5149</eissn><abstract>The objective of this study was to compare the predictions of the protein contents and hardness values by partial least squares regression (PLSR) and principal components regression (PCR) models for bulk samples of Canadian wheat, which were obtained from different locations and crop years. Wheat samples of Canada Western Red Spring (CWRS), Canada Western Hard White Spring (CWHWS), Canada Western Soft White Spring (CWSWS), and Canada Prairie Spring Red (CPSR) classes were obtained from nearby agricultural farms in the main wheat growing locations in the Provinces of Alberta, Saskatchewan, and Manitoba from 2007, 2008, and 2009 crop years. Wheat samples were conditioned to moisture levels of 13, 16, and 19 % (wet basis) and pooled together for developing the regression models. A database of the near-infrared (NIR) hyperspectral image cubes of bulk samples of wheat classes was created in the wavelength region of 960–1,700 nm with 10 nm intervals. Reference protein contents and hardness values were determined using the Dumatherm method and single kernel characterization system (SKCS), respectively. A tenfold cross-validation was used for the ten-factor partial least squares regression (PLSR) and principal components regression (PCR) models for prediction purposes. Prediction performances of regression models were assessed by calculating the estimated mean square errors of prediction (MSEP), standard error of cross-validation (SECV), and correlation coefficient (r). Using the full data set in the protein prediction study, the ten-component PLSR model gave 1.76, 1.33, and 0.68 for the estimated MSEP, SECV, and r, respectively, which were better than the results for the ten-component PCR model (2.02, 1.42, and 0.62, respectively). For the hardness prediction, the estimated MSEP, SECV, and r values were 147.7, 12.15, and 0.82, respectively, for the ten-component PLSR model using the full data set. The PLSR models prediction performances outperformed the PCR models for predicting protein contents and hardness of wheat.</abstract><cop>Boston</cop><pub>Springer-Verlag</pub><doi>10.1007/s11947-014-1381-z</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1935-5130 |
ispartof | Food and bioprocess technology, 2015-01, Vol.8 (1), p.31-40 |
issn | 1935-5130 1935-5149 |
language | eng |
recordid | cdi_proquest_journals_2410781557 |
source | SpringerLink Journals - AutoHoldings |
subjects | Agriculture bioprocessing Biotechnology Bulk sampling Cereal crops Chemistry Chemistry and Materials Science Chemistry/Food Science correlation Correlation coefficient Correlation coefficients Cubes data collection Datasets Farming Farms Food Science Hardness hyperspectral imagery Hyperspectral imaging I.R. radiation Infrared imagery least squares Least squares method Near infrared radiation Original Paper Performance prediction polymerase chain reaction prediction Predictions Proteins Regression analysis Regression models seeds Standard error Standard error of estimate wavelengths Wheat wheat classes |
title | Comparison of Partial Least Squares Regression (PLSR) and Principal Components Regression (PCR) Methods for Protein and Hardness Predictions using the Near-Infrared (NIR) Hyperspectral Images of Bulk Samples of Canadian Wheat |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T00%3A24%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Partial%20Least%20Squares%20Regression%20(PLSR)%20and%20Principal%20Components%20Regression%20(PCR)%20Methods%20for%20Protein%20and%20Hardness%20Predictions%20using%20the%20Near-Infrared%20(NIR)%20Hyperspectral%20Images%20of%20Bulk%20Samples%20of%20Canadian%20Wheat&rft.jtitle=Food%20and%20bioprocess%20technology&rft.au=Mahesh,%20S&rft.date=2015-01-01&rft.volume=8&rft.issue=1&rft.spage=31&rft.epage=40&rft.pages=31-40&rft.issn=1935-5130&rft.eissn=1935-5149&rft_id=info:doi/10.1007/s11947-014-1381-z&rft_dat=%3Cproquest_cross%3E2410781557%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2410781557&rft_id=info:pmid/&rfr_iscdi=true |