Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression

Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice betw...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Global journal of health science 2015-11, Vol.8 (7), p.41-46
Hauptverfasser: Shayan, Zahra, Mohammad Gholi Mezerji, Naser, Shayan, Leila, Naseri, Parisa
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 46
container_issue 7
container_start_page 41
container_title Global journal of health science
container_volume 8
creator Shayan, Zahra
Mohammad Gholi Mezerji, Naser
Shayan, Leila
Naseri, Parisa
description Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.
doi_str_mv 10.5539/gjhs.v8n7p41
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4965639</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1770223704</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-c8a2f045688bdd978d3367039aa0d4173ba72377346789527c49028eeb5609cc3</originalsourceid><addsrcrecordid>eNpVkU1PGzEQhi1EVSjtjTPykQMBe_21viChhX5IkYqqVj1ajnc2Mdp4g2cTid_An65TQgSn8cjPvPOOXkJOObtUStir-cMCLzd1MivJD8gxt1xPrJHycP8W-oh8QnxgTGvF1UdyVGlbKcvYMXm-z9DGMMYh0aGjt7DKgLjtYqKNTwEyvfdjhDQi_RvHBb2NXQe59LTpfUG7GPz_8SbHEXL0F3QaE_hcSAw5LmPyBb5Jvn_CiHQDGddIp8M84hgD_QXz3crP5EPne4Qvu3pC_ny9-918n0x_fvvR3EwnodJ8nITaVx2TStf1rG2tqVshtGHCes9ayY2YeVMJY4TUpraqMkFaVtUAM6WZDUGckOsX3dV6toQ2lFuy792qePX5yQ0-uvc_KS7cfNg4abXSwhaB851AHh7XgKNbllOh732CYY2OG8OqYoHJgl68oCEPiBm6_RrO3DY_t83P7fIr-Nlba3v4NTDxD9cHm2s</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1770223704</pqid></control><display><type>article</type><title>Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression</title><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Shayan, Zahra ; Mohammad Gholi Mezerji, Naser ; Shayan, Leila ; Naseri, Parisa</creator><creatorcontrib>Shayan, Zahra ; Mohammad Gholi Mezerji, Naser ; Shayan, Leila ; Naseri, Parisa</creatorcontrib><description>Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.</description><identifier>ISSN: 1916-9736</identifier><identifier>EISSN: 1916-9744</identifier><identifier>DOI: 10.5539/gjhs.v8n7p41</identifier><identifier>PMID: 26925900</identifier><language>eng</language><publisher>Canada: Canadian Center of Science and Education</publisher><subject>Cross-Sectional Studies ; Demography ; Depression - etiology ; Discriminant Analysis ; Female ; Humans ; Logistic Models ; Male ; Neoplasms - psychology ; Predictive Value of Tests</subject><ispartof>Global journal of health science, 2015-11, Vol.8 (7), p.41-46</ispartof><rights>Copyright: © Canadian Center of Science and Education 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-c8a2f045688bdd978d3367039aa0d4173ba72377346789527c49028eeb5609cc3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965639/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965639/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26925900$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Shayan, Zahra</creatorcontrib><creatorcontrib>Mohammad Gholi Mezerji, Naser</creatorcontrib><creatorcontrib>Shayan, Leila</creatorcontrib><creatorcontrib>Naseri, Parisa</creatorcontrib><title>Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression</title><title>Global journal of health science</title><addtitle>Glob J Health Sci</addtitle><description>Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.</description><subject>Cross-Sectional Studies</subject><subject>Demography</subject><subject>Depression - etiology</subject><subject>Discriminant Analysis</subject><subject>Female</subject><subject>Humans</subject><subject>Logistic Models</subject><subject>Male</subject><subject>Neoplasms - psychology</subject><subject>Predictive Value of Tests</subject><issn>1916-9736</issn><issn>1916-9744</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkU1PGzEQhi1EVSjtjTPykQMBe_21viChhX5IkYqqVj1ajnc2Mdp4g2cTid_An65TQgSn8cjPvPOOXkJOObtUStir-cMCLzd1MivJD8gxt1xPrJHycP8W-oh8QnxgTGvF1UdyVGlbKcvYMXm-z9DGMMYh0aGjt7DKgLjtYqKNTwEyvfdjhDQi_RvHBb2NXQe59LTpfUG7GPz_8SbHEXL0F3QaE_hcSAw5LmPyBb5Jvn_CiHQDGddIp8M84hgD_QXz3crP5EPne4Qvu3pC_ny9-918n0x_fvvR3EwnodJ8nITaVx2TStf1rG2tqVshtGHCes9ayY2YeVMJY4TUpraqMkFaVtUAM6WZDUGckOsX3dV6toQ2lFuy792qePX5yQ0-uvc_KS7cfNg4abXSwhaB851AHh7XgKNbllOh732CYY2OG8OqYoHJgl68oCEPiBm6_RrO3DY_t83P7fIr-Nlba3v4NTDxD9cHm2s</recordid><startdate>20151103</startdate><enddate>20151103</enddate><creator>Shayan, Zahra</creator><creator>Mohammad Gholi Mezerji, Naser</creator><creator>Shayan, Leila</creator><creator>Naseri, Parisa</creator><general>Canadian Center of Science and Education</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20151103</creationdate><title>Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression</title><author>Shayan, Zahra ; Mohammad Gholi Mezerji, Naser ; Shayan, Leila ; Naseri, Parisa</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-c8a2f045688bdd978d3367039aa0d4173ba72377346789527c49028eeb5609cc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Cross-Sectional Studies</topic><topic>Demography</topic><topic>Depression - etiology</topic><topic>Discriminant Analysis</topic><topic>Female</topic><topic>Humans</topic><topic>Logistic Models</topic><topic>Male</topic><topic>Neoplasms - psychology</topic><topic>Predictive Value of Tests</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shayan, Zahra</creatorcontrib><creatorcontrib>Mohammad Gholi Mezerji, Naser</creatorcontrib><creatorcontrib>Shayan, Leila</creatorcontrib><creatorcontrib>Naseri, Parisa</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Global journal of health science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shayan, Zahra</au><au>Mohammad Gholi Mezerji, Naser</au><au>Shayan, Leila</au><au>Naseri, Parisa</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression</atitle><jtitle>Global journal of health science</jtitle><addtitle>Glob J Health Sci</addtitle><date>2015-11-03</date><risdate>2015</risdate><volume>8</volume><issue>7</issue><spage>41</spage><epage>46</epage><pages>41-46</pages><issn>1916-9736</issn><eissn>1916-9744</eissn><abstract>Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.</abstract><cop>Canada</cop><pub>Canadian Center of Science and Education</pub><pmid>26925900</pmid><doi>10.5539/gjhs.v8n7p41</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1916-9736
ispartof Global journal of health science, 2015-11, Vol.8 (7), p.41-46
issn 1916-9736
1916-9744
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4965639
source MEDLINE; EZB-FREE-00999 freely available EZB journals; PubMed Central; PubMed Central Open Access
subjects Cross-Sectional Studies
Demography
Depression - etiology
Discriminant Analysis
Female
Humans
Logistic Models
Male
Neoplasms - psychology
Predictive Value of Tests
title Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T22%3A46%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20Depression%20in%20Cancer%20Patients%20With%20Different%20Classification%20Criteria,%20Linear%20Discriminant%20Analysis%20versus%20Logistic%20Regression&rft.jtitle=Global%20journal%20of%20health%20science&rft.au=Shayan,%20Zahra&rft.date=2015-11-03&rft.volume=8&rft.issue=7&rft.spage=41&rft.epage=46&rft.pages=41-46&rft.issn=1916-9736&rft.eissn=1916-9744&rft_id=info:doi/10.5539/gjhs.v8n7p41&rft_dat=%3Cproquest_pubme%3E1770223704%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1770223704&rft_id=info:pmid/26925900&rfr_iscdi=true