Calculating the Relative Importance of Multiple Regression Predictor Variables Using Dominance Analysis and Random Forests

Bibliographic details
Published in: Language Learning, 2023-03, Vol. 73 (1), p. 161-196
First author: Mizumoto, Atsushi
Format: Article
Language: English
Description
Abstract: Researchers often make claims regarding the importance of predictor variables in multiple regression analysis by comparing standardized regression coefficients (standardized beta coefficients). This practice has been criticized as a misuse of multiple regression analysis. As a remedy, I highlight the use of dominance analysis and random forests, a machine learning technique, in this method showcase article for accurately determining predictor importance in multiple regression analysis. To demonstrate the utility of dominance analysis and random forests, I reproduced the results of an empirical study and applied these analytical procedures. The results reconfirmed that multiple regression analysis should always be accompanied by dominance analysis and random forests to identify the unique contribution of individual predictors while considering correlations among predictors. I also introduce a web application for facilitating the use of dominance analysis and random forests among second language researchers.
ISSN: 0023-8333, 1467-9922
DOI: 10.1111/lang.12518
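
Illustrative sketch: the abstract names dominance analysis as a way to apportion a regression model's explained variance among correlated predictors. The Python below is a minimal sketch of one common form of that idea, general dominance weights, in which each predictor's weight is its incremental R² averaged first within and then across all subset sizes of the other predictors. It is not the article's code or its web application; the function names (`solve`, `r_squared`, `general_dominance`) and the synthetic data are assumptions made purely for illustration.

```python
import random
from itertools import combinations
from statistics import fmean


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        x[i] = (m[i][n] - sum(m[i][c] * x[c] for c in range(i + 1, n))) / m[i][i]
    return x


def r_squared(rows, y, cols):
    """R^2 of an ordinary least squares fit of y on the chosen columns plus an intercept."""
    if not cols:
        return 0.0  # the intercept-only model explains no variance
    design = [[1.0] + [row[c] for c in cols] for row in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    beta = solve(xtx, xty)  # normal equations
    y_bar = fmean(y)
    ss_res = sum((yi - sum(b * v for b, v in zip(beta, d))) ** 2
                 for d, yi in zip(design, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


def general_dominance(rows, y):
    """General dominance weight of each predictor (hypothetical helper name)."""
    p = len(rows[0])
    weights = []
    for j in range(p):
        others = [c for c in range(p) if c != j]
        size_means = []
        for k in range(p):  # subset sizes 0 .. p-1 of the other predictors
            gains = [r_squared(rows, y, s + (j,)) - r_squared(rows, y, s)
                     for s in combinations(others, k)]
            size_means.append(fmean(gains))
        weights.append(fmean(size_means))
    return weights


# Synthetic data (an assumption for illustration): x2 is correlated with x1.
random.seed(0)
rows, y = [], []
for _ in range(200):
    x1 = random.gauss(0, 1)
    x2 = 0.7 * x1 + random.gauss(0, 1)
    x3 = random.gauss(0, 1)
    rows.append([x1, x2, x3])
    y.append(1.0 * x1 + 0.5 * x2 + 0.2 * x3 + random.gauss(0, 1))

weights = general_dominance(rows, y)
full_r2 = r_squared(rows, y, (0, 1, 2))
print("general dominance weights:", weights)
print("full-model R^2:", full_r2)
```

The weights sum to the full-model R², so each can be read as that predictor's share of the explained variance. The sketch fits every subset of predictors, which becomes expensive as the number of predictors grows; the random forest approach the abstract also mentions is not shown here.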