How much is enough? Influence of number of presence observations on the performance of species distribution models
Species distribution modeling (SDM) can be useful for many applied purposes, e.g., mapping and monitoring of rare and endangered species. Sparse presence data are a recurrent, major obstacle to precise modeling of species distributions. Thus, knowing the minimum number of presences required to obtai...
Gespeichert in:
Veröffentlicht in: | Sommerfeltia 2019-01, Vol.39 (1), p.1-28 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Species distribution modeling (SDM) can be useful for many applied purposes, e.g., mapping and monitoring of rare and endangered species. Sparse presence data are a recurrent, major obstacle to precise modeling of species distributions. Thus, knowing the minimum number of presences required to obtain reliable distribution models is of fundamental importance for applied use of SDM. This study uses a novel approach to assess the critical sample size (CSS) sufficient for an accurate prediction of species distributions with Maximum Entropy Modeling (MaxEnt). Large presence datasets for thirty insect species, ranging from generalists to specialists regarding their responses to main bioclimatic gradients, were used to produce reference distribution models. Models based on replicated subsamples of different size drawn randomly from the full dataset were compared to the reference model using the index of vector similarity distribution models. Models based on replicated subsamples of different size drawn randomly from the full dataset were compared to the reference model using the index of vector similarity ( IVS ). Two thresholds for IVS were determined based on comparison of nine reference models to random null models. The threshold values correspond to 0.95 and 0.99 probability that a model outperforms a random null model in terms of similarity to the reference dataset. For 90% of the species, clearly nonrandom models were obtained with less than 10 presence observations, and for 97% of the species with less than 15 presence observations. We conclude that the number of presence observations required to produce nonrandom models is generally low and, accordingly, that even sparse datasets may be useful for distribution modelling. |
---|---|
ISSN: | 2084-0098 0800-6865 2084-0098 |
DOI: | 10.2478/som-2019-0001 |