Use of data-driven model to analyse the occurrence patterns of an indicator fish species in river: A case study for Alburnoides eichwaldii (De Filippi, 1863) in Shafaroud River, north of Iran

•We integrated logistic regression with an optimizer for prediction of Alburnoides eichwaldii in river.•The prediction was more reliable after variable selection.•The water quality variables had more contribution to the prediction than the physical-habitat ones.•The prediction would be more reliable...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Ecological engineering 2019-08, Vol.133, p.10-19
Hauptverfasser: Zarkami, Rahmat, Darizin, Zeinab, Sadeghi Pasvisheh, Roghayeh, Bani, Ali, Ghane, Ahmad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•We integrated logistic regression with an optimizer for prediction of Alburnoides eichwaldii in river.•The prediction was more reliable after variable selection.•The water quality variables had more contribution to the prediction than the physical-habitat ones.•The prediction would be more reliable by adding more data and pertinent variables to the model. The present study aims to integrate multinomial logistic regression with an input variable selection method, genetic algorithm, GA, to select the most important explanatory variables for evaluating the occurrence patterns of the bleak (Alburnoides eichwaldii) in river. Seven different sampling sites (from the source to the mouth of the Shafaroud River, north of Iran) were considered to analyse the probability of occurrence of the fish during one year sampling campaign. The abundance of bleak (based on 42 fish presence and 42 fish absence data, as outputs of model) together with a set of physical-chemical water characteristics and river morphology (84 instances as inputs of model) were monthly and repeatedly recorded at each sampling site. Two-third of instances (56) was used for training and the remaining of instances (28) as test set. The results of paired Student’s t-test showed that the predictive performances of model (% correctly classified instance and Kappa statistics) were improved after variable selection method. GA selected 9 of 18 input variables including dissolved oxygen, pH, water temperature, river depth, electric conductivity, total hardness, nitrite, orthophosphate and sulphate. The curves of binary logistic regression confirmed that increasing three of the selected variables (dissolved oxygen, water temperature and pH) might increase the probability of bleak presence while increasing concentration of other selected variables might decrease the probability of fish occurrence in the river basin (p 
ISSN:0925-8574
1872-6992
DOI:10.1016/j.ecoleng.2019.04.018