Boosting for statistical modelling-A non-technical introduction
Boosting algorithms were originally developed for machine learning but were later adapted to estimate statistical models—offering various practical advantages such as automated variable selection and implicit regularization of effect estimates. The interpretation of the resulting models, however, re...
Gespeichert in:
Veröffentlicht in: | Statistical modelling 2018-06, Vol.18 (3-4), p.365-384 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Boosting algorithms were originally developed for machine learning but were later
adapted to estimate statistical models—offering various practical advantages
such as automated variable selection and implicit regularization of effect
estimates. The interpretation of the resulting models, however, remains the same
as if they had been fitted by classical methods. Boosting, hence, allows to use
an advanced machine learning scheme to estimate various types of statistical
models. This tutorial aims to highlight how boosting can be used for
semi-parametric modelling, what practical implications follow from the design of
the algorithm and what kind of drawbacks data analysts have to expect. We
illustrate the application of boosting in the analysis of a stunting score from
children in India and a high-dimensional dataset of tumour DNA to develop a
biomarker for the occurrence of metastases in breast cancer patients. |
---|---|
ISSN: | 1471-082X 1477-0342 |
DOI: | 10.1177/1471082X17748086 |