GWAS-assisted genomic prediction of cadmium accumulation in maize kernel with machine learning and linear statistical methods

The production and use of many heavy meal contained materials almost inevitably release cadmium (Cd) into environment, generating Cd pollutants with adverse impacts on food and human health. Developing an effective method for Cd concentration evaluation in food crops could be an effective approach f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of hazardous materials 2023-01, Vol.441, p.129929, Article 129929
Hauptverfasser: Yan, Huili, Guo, Hanyao, Xu, Wenxiu, Dai, Changhua, Kimani, Wilson, Xie, Jianyin, Zhang, Hezifan, Li, Ting, Wang, Feng, Yu, Yijun, Ma, Mi, Hao, Zhuanfang, He, Zhenyan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The production and use of many heavy meal contained materials almost inevitably release cadmium (Cd) into environment, generating Cd pollutants with adverse impacts on food and human health. Developing an effective method for Cd concentration evaluation in food crops could be an effective approach for toxicity prediction and pollution control. Here, we exploited the genotype-to-phenotype relationship of maize kernel Cd accumulation at whole-genome level, and developed genome-wide association study (GWAS) assisted genomic-enabled prediction (GP) models using machine learning and linear statistical methods. In benchmark tests, marker density and training populations were key parameters in determining GP baseline precision. With optimized parameters, three statistical methods, including Bayes A, ridge regression–best linear unbiased prediction (rrBLUP) and random forest (RF), showed the highest prediction accuracy (Bayes A, 0.83; rrBLUP, 0.89; RF, 0.75) with 100 iterations of cross-validation. In field trial, GP models with rrBLUP performed better than Bayes A and RF, with a higher GP accuracy (rMG) and lower mean absolute error value. Integrating GP with GWAS can be implemented as an effective strategy for accurate evaluation of Cd concentration, which could provide useful guidelines for accelerating the selection and breeding cycle of low-Cd food crops and addressing the environmental Cd contamination problem. [Display omitted] •An innovative approach was developed for food crop Cd contamination management.•GWAS-assisted genomic prediction provides an effective strategy to predict ZmKCd.•GP model demonstrated superior prediction accuracy (rMG=0.89) for ZmKCd prediction.
ISSN:0304-3894
1873-3336
DOI:10.1016/j.jhazmat.2022.129929