Obtaining a threshold for the stewart index and its extension to ridge regression

The linear regression model is widely applied to measure the relationship between a dependent variable and a set of independent variables. When the independent variables are related to each other, it is said that the model presents collinearity. If the relationship is between the intercept and at le...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational statistics 2021-06, Vol.36 (2), p.1011-1029
Hauptverfasser: Sánchez, Ainara Rodríguez, Gómez, Román Salmerón, García, Catalina García
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1029
container_issue 2
container_start_page 1011
container_title Computational statistics
container_volume 36
creator Sánchez, Ainara Rodríguez
Gómez, Román Salmerón
García, Catalina García
description The linear regression model is widely applied to measure the relationship between a dependent variable and a set of independent variables. When the independent variables are related to each other, it is said that the model presents collinearity. If the relationship is between the intercept and at least one of the independent variables, the collinearity is nonessential, while if the relationship is between the independent variables (excluding the intercept), the collinearity is essential. The Stewart index allows the detection of both types of near multicollinearity. However, to the best of our knowledge, there are no established thresholds for this measure from which to consider that the multicollinearity is worrying. This is the main goal of this paper, which presents a Monte Carlo simulation to relate this measure to the condition number. An additional goal of this paper is to extend the Stewart index for its application after the estimation by ridge regression that is widely applied to estimate model with multicollinearity as an alternative to ordinary least squares (OLS). This extension could be also applied to determine the appropriate value for the ridge factor.
doi_str_mv 10.1007/s00180-020-01047-2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2524567929</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2524567929</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-2f4c025f06b581e6e9bf8f3b81f1361927513a071e717a8fcc7b1d83596c913c3</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMouH78AU8Bz9WZpE2aoyx-wcIi6DmkbdLtsqZrksX135u1gjcPwzDD-8zAQ8gVwg0CyNsIgDUUwHIhlLJgR2SGAnmhRFUfkxmokhclCHZKzmJcAzAmGc7Iy7JJZvCD76mhaRVsXI2bjrox5MnSmOynCYkOvrN7anxHhxSp3Sfr4zB6mkYahq63NNg-s4fdBTlxZhPt5W8_J28P96_zp2KxfHye3y2KlqNKBXNlC6xyIJqqRiusalzteFOjQy5QMVkhNyDRSpSmdm0rG-xqXinRKuQtPyfX091tGD92Nia9HnfB55eaVayshFRM5RSbUm0YYwzW6W0Y3k340gj6oE5P6nRWp3_UaZYhPkExh31vw9_pf6hvOixwtg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2524567929</pqid></control><display><type>article</type><title>Obtaining a threshold for the stewart index and its extension to ridge regression</title><source>Springer Online Journals Complete</source><creator>Sánchez, Ainara Rodríguez ; Gómez, Román Salmerón ; García, Catalina García</creator><creatorcontrib>Sánchez, Ainara Rodríguez ; Gómez, Román Salmerón ; García, Catalina García</creatorcontrib><description>The linear regression model is widely applied to measure the relationship between a dependent variable and a set of independent variables. When the independent variables are related to each other, it is said that the model presents collinearity. If the relationship is between the intercept and at least one of the independent variables, the collinearity is nonessential, while if the relationship is between the independent variables (excluding the intercept), the collinearity is essential. The Stewart index allows the detection of both types of near multicollinearity. However, to the best of our knowledge, there are no established thresholds for this measure from which to consider that the multicollinearity is worrying. This is the main goal of this paper, which presents a Monte Carlo simulation to relate this measure to the condition number. An additional goal of this paper is to extend the Stewart index for its application after the estimation by ridge regression that is widely applied to estimate model with multicollinearity as an alternative to ordinary least squares (OLS). This extension could be also applied to determine the appropriate value for the ridge factor.</description><identifier>ISSN: 0943-4062</identifier><identifier>EISSN: 1613-9658</identifier><identifier>DOI: 10.1007/s00180-020-01047-2</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Collinearity ; Dependent variables ; Economic Theory/Quantitative Economics/Mathematical Methods ; Independent variables ; Mathematics and Statistics ; Monte Carlo simulation ; Original Paper ; Probability and Statistics in Computer Science ; Probability Theory and Stochastic Processes ; Regression models ; Statistics ; Variables</subject><ispartof>Computational statistics, 2021-06, Vol.36 (2), p.1011-1029</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2020</rights><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-2f4c025f06b581e6e9bf8f3b81f1361927513a071e717a8fcc7b1d83596c913c3</citedby><cites>FETCH-LOGICAL-c319t-2f4c025f06b581e6e9bf8f3b81f1361927513a071e717a8fcc7b1d83596c913c3</cites><orcidid>0000-0001-9925-9802</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00180-020-01047-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00180-020-01047-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Sánchez, Ainara Rodríguez</creatorcontrib><creatorcontrib>Gómez, Román Salmerón</creatorcontrib><creatorcontrib>García, Catalina García</creatorcontrib><title>Obtaining a threshold for the stewart index and its extension to ridge regression</title><title>Computational statistics</title><addtitle>Comput Stat</addtitle><description>The linear regression model is widely applied to measure the relationship between a dependent variable and a set of independent variables. When the independent variables are related to each other, it is said that the model presents collinearity. If the relationship is between the intercept and at least one of the independent variables, the collinearity is nonessential, while if the relationship is between the independent variables (excluding the intercept), the collinearity is essential. The Stewart index allows the detection of both types of near multicollinearity. However, to the best of our knowledge, there are no established thresholds for this measure from which to consider that the multicollinearity is worrying. This is the main goal of this paper, which presents a Monte Carlo simulation to relate this measure to the condition number. An additional goal of this paper is to extend the Stewart index for its application after the estimation by ridge regression that is widely applied to estimate model with multicollinearity as an alternative to ordinary least squares (OLS). This extension could be also applied to determine the appropriate value for the ridge factor.</description><subject>Collinearity</subject><subject>Dependent variables</subject><subject>Economic Theory/Quantitative Economics/Mathematical Methods</subject><subject>Independent variables</subject><subject>Mathematics and Statistics</subject><subject>Monte Carlo simulation</subject><subject>Original Paper</subject><subject>Probability and Statistics in Computer Science</subject><subject>Probability Theory and Stochastic Processes</subject><subject>Regression models</subject><subject>Statistics</subject><subject>Variables</subject><issn>0943-4062</issn><issn>1613-9658</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kE1LxDAQhoMouH78AU8Bz9WZpE2aoyx-wcIi6DmkbdLtsqZrksX135u1gjcPwzDD-8zAQ8gVwg0CyNsIgDUUwHIhlLJgR2SGAnmhRFUfkxmokhclCHZKzmJcAzAmGc7Iy7JJZvCD76mhaRVsXI2bjrox5MnSmOynCYkOvrN7anxHhxSp3Sfr4zB6mkYahq63NNg-s4fdBTlxZhPt5W8_J28P96_zp2KxfHye3y2KlqNKBXNlC6xyIJqqRiusalzteFOjQy5QMVkhNyDRSpSmdm0rG-xqXinRKuQtPyfX091tGD92Nia9HnfB55eaVayshFRM5RSbUm0YYwzW6W0Y3k340gj6oE5P6nRWp3_UaZYhPkExh31vw9_pf6hvOixwtg</recordid><startdate>20210601</startdate><enddate>20210601</enddate><creator>Sánchez, Ainara Rodríguez</creator><creator>Gómez, Román Salmerón</creator><creator>García, Catalina García</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7TB</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>88I</scope><scope>8AL</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FRNLG</scope><scope>FYUFA</scope><scope>F~G</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>KR7</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>M2P</scope><scope>M7S</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-9925-9802</orcidid></search><sort><creationdate>20210601</creationdate><title>Obtaining a threshold for the stewart index and its extension to ridge regression</title><author>Sánchez, Ainara Rodríguez ; Gómez, Román Salmerón ; García, Catalina García</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-2f4c025f06b581e6e9bf8f3b81f1361927513a071e717a8fcc7b1d83596c913c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Collinearity</topic><topic>Dependent variables</topic><topic>Economic Theory/Quantitative Economics/Mathematical Methods</topic><topic>Independent variables</topic><topic>Mathematics and Statistics</topic><topic>Monte Carlo simulation</topic><topic>Original Paper</topic><topic>Probability and Statistics in Computer Science</topic><topic>Probability Theory and Stochastic Processes</topic><topic>Regression models</topic><topic>Statistics</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sánchez, Ainara Rodríguez</creatorcontrib><creatorcontrib>Gómez, Román Salmerón</creatorcontrib><creatorcontrib>García, Catalina García</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Business Premium Collection (Alumni)</collection><collection>Health Research Premium Collection</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>Civil Engineering Abstracts</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Science Database</collection><collection>Engineering Database</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><jtitle>Computational statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sánchez, Ainara Rodríguez</au><au>Gómez, Román Salmerón</au><au>García, Catalina García</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Obtaining a threshold for the stewart index and its extension to ridge regression</atitle><jtitle>Computational statistics</jtitle><stitle>Comput Stat</stitle><date>2021-06-01</date><risdate>2021</risdate><volume>36</volume><issue>2</issue><spage>1011</spage><epage>1029</epage><pages>1011-1029</pages><issn>0943-4062</issn><eissn>1613-9658</eissn><abstract>The linear regression model is widely applied to measure the relationship between a dependent variable and a set of independent variables. When the independent variables are related to each other, it is said that the model presents collinearity. If the relationship is between the intercept and at least one of the independent variables, the collinearity is nonessential, while if the relationship is between the independent variables (excluding the intercept), the collinearity is essential. The Stewart index allows the detection of both types of near multicollinearity. However, to the best of our knowledge, there are no established thresholds for this measure from which to consider that the multicollinearity is worrying. This is the main goal of this paper, which presents a Monte Carlo simulation to relate this measure to the condition number. An additional goal of this paper is to extend the Stewart index for its application after the estimation by ridge regression that is widely applied to estimate model with multicollinearity as an alternative to ordinary least squares (OLS). This extension could be also applied to determine the appropriate value for the ridge factor.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00180-020-01047-2</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0001-9925-9802</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0943-4062
ispartof Computational statistics, 2021-06, Vol.36 (2), p.1011-1029
issn 0943-4062
1613-9658
language eng
recordid cdi_proquest_journals_2524567929
source Springer Online Journals Complete
subjects Collinearity
Dependent variables
Economic Theory/Quantitative Economics/Mathematical Methods
Independent variables
Mathematics and Statistics
Monte Carlo simulation
Original Paper
Probability and Statistics in Computer Science
Probability Theory and Stochastic Processes
Regression models
Statistics
Variables
title Obtaining a threshold for the stewart index and its extension to ridge regression
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T17%3A50%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Obtaining%20a%20threshold%20for%20the%20stewart%20index%20and%20its%20extension%20to%20ridge%20regression&rft.jtitle=Computational%20statistics&rft.au=S%C3%A1nchez,%20Ainara%20Rodr%C3%ADguez&rft.date=2021-06-01&rft.volume=36&rft.issue=2&rft.spage=1011&rft.epage=1029&rft.pages=1011-1029&rft.issn=0943-4062&rft.eissn=1613-9658&rft_id=info:doi/10.1007/s00180-020-01047-2&rft_dat=%3Cproquest_cross%3E2524567929%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2524567929&rft_id=info:pmid/&rfr_iscdi=true