Linear Aggregation in Tree-based Estimators

Regression trees and their ensemble methods are popular methods for nonparametric regression: they combine strong predictive performance with interpretable estimators. To improve their utility for locally smooth response surfaces, we study regression trees and random forests with linear aggregation...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Künzel, Sören R., Saarinen, Theo F., Liu, Edward W., Sekhon, Jasjeet S.
Format:	Dataset
Sprache:	eng
Schlagworte:	Biotechnology Cancer Ecology FOS: Biological sciences FOS: Chemical sciences FOS: Computer and information sciences FOS: Mathematics Information Systems not elsewhere classified Inorganic Chemistry Mathematical Sciences not elsewhere classified Neuroscience Pharmacology Space Science
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Regression trees and their ensemble methods are popular methods for nonparametric regression: they combine strong predictive performance with interpretable estimators. To improve their utility for locally smooth response surfaces, we study regression trees and random forests with linear aggregation functions. We introduce a new algorithm that finds the best axis-aligned split to fit linear aggregation functions on the corresponding nodes, and we offer a quasilinear time implementation. We demonstrate the algorithm’s favorable performance on real-world benchmarks and in an extensive simulation study, and we demonstrate its improved interpretability using a large get-out-the-vote experiment. We provide an open-source software package that implements several tree-based estimators with linear aggregation functions.
DOI:	10.6084/m9.figshare.18131169