A performance comparison of machine learning methods to estimate the fast-growing forest plantation yield based on laser scanning metrics

•Machine learning models tackle high-dimensional problems.•The machine learning models are not limited to a subset of predictor variables.•The random forest had the best RMSE compared to neural network and support vector regression.•The coefficient of determination and bias were similar to all model...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers and electronics in agriculture 2015-08, Vol.116, p.221-227
Hauptverfasser: Görgens, Eric Bastos, Montaghi, Alessandro, Rodriguez, Luiz Carlos Estraviz
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Machine learning models tackle high-dimensional problems.•The machine learning models are not limited to a subset of predictor variables.•The random forest had the best RMSE compared to neural network and support vector regression.•The coefficient of determination and bias were similar to all modeling techniques. Machine learning models appear to be an attractive route towards tackling high-dimensional problems, particularly in areas where a lack of knowledge exists regarding the development of effective algorithms, and where programs must dynamically adapt to changing conditions. The objective of this study was to evaluate the performance of three machine learning tools for predicting stand volume of fast-growing forest plantations, based on statistical vegetation metrics extracted from an Airborne Laser Scanning (ALS) survey. The forests used in this study were composed of 1138ha of commercial plantations that consisted of hybrids of Eucalyptus grandis and Eucalyptus urophylla, managed for pulp production. Three machine learning tools were implemented: neural network (NN), random forest (RF) and support vector regression (SV); and their performance was compared to a regression model (RM). The RF and the RM presented an RMSE in the leave-one-out cross-validation of 31.80 and 30.56m3ha−1 respectively. The NN and SV presented a higher RMSE than the others, equal to 64.44 and 65.30m3ha−1. The coefficient of determination and bias were similar to all modeling techniques. The ranking of ALS metrics based on their relative importance for the estimation of stand volume showed some differences. Rather than being limited to a subset of predictor variables, machine learning techniques explored the complete metrics set, looking for patterns between them and the dependent variable.
ISSN:0168-1699
1872-7107
DOI:10.1016/j.compag.2015.07.004