Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping

Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns u...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Macintyre, Paul D, Van Niekerk, Adriaan, Dobrowolski, Mark P, Tsakalos, James L, Mucina, Ladislav
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Macintyre, Paul D
Van Niekerk, Adriaan
Dobrowolski, Mark P
Tsakalos, James L
Mucina, Ladislav
description Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.,Underlying environmental matrix and training sites used for the machine learning predictionsEnvironmental matrices.zip, | Associated Persons Paul D. Macintyre (Creator); Adriaan Van Niekerk (Creator); James L. Tsakalos (Creator); Ladislav Mucina (Creator)
doi_str_mv 10.5061/dryad.1m8tg17
format Dataset
fullrecord <record><control><sourceid>ands_AACKF</sourceid><recordid>TN_cdi_datacite_primary_10_5061_dryad_1m8tg17</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>oai_ands_org_au_1609404</sourcerecordid><originalsourceid>FETCH-LOGICAL-a707-f0f85b75f5927cd9f11dbf43e6ed51afd08c0a7f6ef06be586c8dbcc0ab3a9e83</originalsourceid><addsrcrecordid>eNotkD1rwzAQhr10KGnH7to6OZVqW7K7lfQrEOiSXZylO0dgSUZWCvn3dT6m416e9-CeongSfN1wKV5sOoFdC9_mQaj7Yv6ADIxS9G9s6ycwmUViaOIYB2dgZAntMVgI5sRiYPmAbMJEMfklwjPrwRxcQDYipODCwMwI8-zIYZqZC-wPB8yQ3dL2ME0L8VDcEYwzPt7mqth_fe43P-Xu93u7ed-VoLgqiVPb9KqhpntVxnYkhO2prlCibQSQ5a3hoEgicdlj00rT2t4sWV9Bh221KsrrWbu8aFxGPSXnIZ204PrsQl9c6JuLhX--8hDsrJMFHcHpyxLToOGoheRdzevqH9VdayY</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping</title><source>Research Data Australia (RDA)</source><creator>Macintyre, Paul D ; Van Niekerk, Adriaan ; Dobrowolski, Mark P ; Tsakalos, James L ; Mucina, Ladislav</creator><creatorcontrib>Macintyre, Paul D ; Van Niekerk, Adriaan ; Dobrowolski, Mark P ; Tsakalos, James L ; Mucina, Ladislav</creatorcontrib><description>Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.,Underlying environmental matrix and training sites used for the machine learning predictionsEnvironmental matrices.zip, | Associated Persons Paul D. Macintyre (Creator); Adriaan Van Niekerk (Creator); James L. Tsakalos (Creator); Ladislav Mucina (Creator)</description><identifier>DOI: 10.5061/dryad.1m8tg17</identifier><language>eng</language><publisher>DRYAD</publisher><subject>Environmental matrix ; prediction mapping</subject><creationdate>2019</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps:/researchdata.edu.au/1609404$$EHTML$$P50$$Gands$$Hfree_for_read</linktohtml><link.rule.ids>780,1894,27279,76229</link.rule.ids><linktorsrc>$$Uhttp://researchdata.ands.org.au/1609404$$EView_record_in_Australian_National_Data_Service$$FView_record_in_$$GAustralian_National_Data_Service$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Macintyre, Paul D</creatorcontrib><creatorcontrib>Van Niekerk, Adriaan</creatorcontrib><creatorcontrib>Dobrowolski, Mark P</creatorcontrib><creatorcontrib>Tsakalos, James L</creatorcontrib><creatorcontrib>Mucina, Ladislav</creatorcontrib><title>Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping</title><description>Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.,Underlying environmental matrix and training sites used for the machine learning predictionsEnvironmental matrices.zip, | Associated Persons Paul D. Macintyre (Creator); Adriaan Van Niekerk (Creator); James L. Tsakalos (Creator); Ladislav Mucina (Creator)</description><subject>Environmental matrix</subject><subject>prediction mapping</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2019</creationdate><recordtype>dataset</recordtype><sourceid>AACKF</sourceid><sourceid>PQ8</sourceid><recordid>eNotkD1rwzAQhr10KGnH7to6OZVqW7K7lfQrEOiSXZylO0dgSUZWCvn3dT6m416e9-CeongSfN1wKV5sOoFdC9_mQaj7Yv6ADIxS9G9s6ycwmUViaOIYB2dgZAntMVgI5sRiYPmAbMJEMfklwjPrwRxcQDYipODCwMwI8-zIYZqZC-wPB8yQ3dL2ME0L8VDcEYwzPt7mqth_fe43P-Xu93u7ed-VoLgqiVPb9KqhpntVxnYkhO2prlCibQSQ5a3hoEgicdlj00rT2t4sWV9Bh221KsrrWbu8aFxGPSXnIZ204PrsQl9c6JuLhX--8hDsrJMFHcHpyxLToOGoheRdzevqH9VdayY</recordid><startdate>2019</startdate><enddate>2019</enddate><creator>Macintyre, Paul D</creator><creator>Van Niekerk, Adriaan</creator><creator>Dobrowolski, Mark P</creator><creator>Tsakalos, James L</creator><creator>Mucina, Ladislav</creator><general>DRYAD</general><general>Dryad</general><scope>AACKF</scope><scope>ADJYW</scope><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>2019</creationdate><title>Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping</title><author>Macintyre, Paul D ; Van Niekerk, Adriaan ; Dobrowolski, Mark P ; Tsakalos, James L ; Mucina, Ladislav</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a707-f0f85b75f5927cd9f11dbf43e6ed51afd08c0a7f6ef06be586c8dbcc0ab3a9e83</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Environmental matrix</topic><topic>prediction mapping</topic><toplevel>online_resources</toplevel><creatorcontrib>Macintyre, Paul D</creatorcontrib><creatorcontrib>Van Niekerk, Adriaan</creatorcontrib><creatorcontrib>Dobrowolski, Mark P</creatorcontrib><creatorcontrib>Tsakalos, James L</creatorcontrib><creatorcontrib>Mucina, Ladislav</creatorcontrib><collection>Research Data Australia (RDA)</collection><collection>Research Data Australia (RDA) Full Text</collection><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Macintyre, Paul D</au><au>Van Niekerk, Adriaan</au><au>Dobrowolski, Mark P</au><au>Tsakalos, James L</au><au>Mucina, Ladislav</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping</title><date>2019</date><risdate>2019</risdate><abstract>Vegetation maps are models of the real vegetation patterns and are considered important tools in conservation and management planning. Maps created through traditional methods can be expensive and time‐consuming, thus, new more efficient approaches are needed. The prediction of vegetation patterns using machine learning shows promise, but many factors may impact on its performance. One important factor is the nature of the vegetation–environment relationship assessed and ecological redundancy. We used two datasets with known ecological redundancy levels (strength of the vegetation–environment relationship) to evaluate the performance of four machine learning (ML) classifiers (classification trees, random forests, support vector machines, and nearest neighbor). These models used climatic and soil variables as environmental predictors with pretreatment of the datasets (principal component analysis and feature selection) and involved three spatial scales. We show that the ML classifiers produced more reliable results in regions where the vegetation–environment relationship is stronger as opposed to regions characterized by redundant vegetation patterns. The pretreatment of datasets and reduction in prediction scale had a substantial influence on the predictive performance of the classifiers. The use of ML classifiers to create potential vegetation maps shows promise as a more efficient way of vegetation modeling. The difference in performance between areas with poorly versus well‐structured vegetation–environment relationships shows that some level of understanding of the ecology of the target region is required prior to their application. Even in areas with poorly structured vegetation–environment relationships, it is possible to improve classifier performance by either pretreating the dataset or reducing the spatial scale of the predictions.,Underlying environmental matrix and training sites used for the machine learning predictionsEnvironmental matrices.zip, | Associated Persons Paul D. Macintyre (Creator); Adriaan Van Niekerk (Creator); James L. Tsakalos (Creator); Ladislav Mucina (Creator)</abstract><pub>DRYAD</pub><doi>10.5061/dryad.1m8tg17</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5061/dryad.1m8tg17
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5061_dryad_1m8tg17
source Research Data Australia (RDA)
subjects Environmental matrix
prediction mapping
title Data from: Impact of ecological redundancy on the performance of machine learning classifiers in vegetation mapping
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T14%3A28%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ands_AACKF&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Macintyre,%20Paul%20D&rft.date=2019&rft_id=info:doi/10.5061/dryad.1m8tg17&rft_dat=%3Cands_AACKF%3Eoai_ands_org_au_1609404%3C/ands_AACKF%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true