DA: Population structure inference using discriminant analysis

Genetic variations in a species across geographic areas typically exhibit spatial clines. There is increasing interest in inferring population genetic structure to understand the patterns of genetic variation and the evolution of a species. Here, we present the da package and propose to infer popula...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Methods in ecology and evolution 2022-02, Vol.13 (2), p.485-499
Hauptverfasser: Qin, Xinghu, Lock, Thomas Ryan, Kallenbach, Robert L.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Genetic variations in a species across geographic areas typically exhibit spatial clines. There is increasing interest in inferring population genetic structure to understand the patterns of genetic variation and the evolution of a species. Here, we present the da package and propose to infer population structure using discriminant analysis (DA). We incorporate five supervised learning approaches (DAPC, LDAKPC, LFDA, LFDAKPC and KLFDA) into da package within the same DA family, but with different linear and nonlinear properties. We tested the performance and properties of these five approaches for population structure inference using both simulated and empirical data. Results showed that these five approaches preserved the same global genetic structure under each genetic scenario. Notably, genetic features produced from KLFDA and LFDA had higher correlations with FST under isolation‐by‐distance model and higher discriminatory power in population structure identification, with KLFDA achieving the best performance. The applications to empirical data indicated that all these methods could intuitively capture the continuous genetic gradients while LFDA and KLFDA could discriminate nuanced population structures that the other approaches cannot. These DA methods can be applied to other statistical inferences in genetics and beyond. The da package is available at https://cran.r‐project.org/web/packages/DA/index.html. We recommend users choosing these approaches appropriately depending on their scientific questions and target data.
ISSN:2041-210X
2041-210X
DOI:10.1111/2041-210X.13748