Genotyping-by-sequencing and ecological niche modeling illuminate phylogeography, admixture, and Pleistocene range dynamics in quaking aspen (Populus tremuloides)
- Published
- Accepted
- Subject Areas
- Biodiversity, Biogeography, Genetics, Genomics, Plant Science
- Keywords
- admixture, North America, quaking aspen, phylogeography, population structure, single nucleotide polymorphisms (SNPs)
- Copyright
- © 2018 Bagley et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
- Cite this article
- 2018. Genotyping-by-sequencing and ecological niche modeling illuminate phylogeography, admixture, and Pleistocene range dynamics in quaking aspen (Populus tremuloides) PeerJ Preprints 6:e27162v1 https://doi.org/10.7287/peerj.preprints.27162v1
Abstract
Populus tremuloides is the widest-ranging tree species in North America, and an ecologically important component of mesic forest ecosystems displaced by the Pleistocene glaciations. Using phylogeographic analyses of genome-wide SNPs (34,796 SNPs, 183 individuals) and ecological niche modeling, we inferred population structure, admixture, and Pleistocene range dynamics of P. tremuloides, and tested several historical biogeographical hypotheses. We found three genetic lineages located in coastal (cluster 1), Cascadian/Northern Rocky Mountains (cluster 2), and Southern Rocky Mountains to northern regions (cluster 3) of the P. tremuloidesrange, with phylogenomic relationships of the form ((cluster 1, cluster 2), cluster 3). The main vector of admixture was from cluster 3 into cluster 2, with the admixture zone trending northwest through the Rocky Mountains along a recognized phenotypic cline (Utah to Idaho). Clusters 1 and 2 provided mixed support for the ‘stable-edge hypothesis’ that unglaciated southwestern populations persisted in situ since the last glaciation. By contrast, cluster 3 exhibited ‘trailing-edge’ dynamics, e.g. clinal genetic variation and niche suitability predictions signifying complete northward post-glacial expansion. Results were also consistent with the ‘inland dispersal hypothesis’ predicting post-glacial assembly of Pacific Northwestern forest ecosystems, but rejected the hypothesis that Pacific-coastal populations were colonized during outburst flooding from glacial Lake Missoula. Overall, congruent patterns between our phylogeographic and ecological niche modeling results and fossil pollen data demonstrate complex mixtures of stable- versus trailing-edge dynamics and refugial locations within P. tremuloides. These findings confirm and refine previous genetic studies, while strongly supporting a distinct Pacific-coastal genetic lineage of quaking aspen.
Author Comment
This is a preprint submission to PeerJ Preprints.
Supplemental Information
Data S1: Sample list, collection site details, and summary of individual assignments to genetic clusters in ADMIXTURE and DAPC analyses
Data S2: ENM modeling results summary, including model selection results and diagnostic statistics from ENMeval (Muscarella et al.2014) and MaxEnt model metrics from ENMwizard (Heming et al.2018)
Table S1: Per-locus genetic summary statistics for P. tremuloides and P. trichocarpa individuals in this study
Table S2: Pairwise FST estimates between P. tremuloides genetic clusters, and their 95% confidence intervals (CIs)
Table S3: Hierarchical analysis of genetic variance among P. tremuloides SNPs
Results are given for variance partitioning within genetic clusters (Fclust/total), within subpopulations by cluster (Fpop/clust), and within individual trees relative to populations. Ranges given in parentheses are 95% confidence intervals.
Figure S1: Plot of ADMIXTURE cross-validation error versus K, showing that K= 3 is the best fit for the full P. tremuloidesand P. trichocarpa dataset of 34,796 SNPs (sensuAlexander & Lange 2011)
Figure S2: Plot of Bayesian information criterion (BIC) scores for k-means clustering solutions over a range of K, from the first step of DAPC, with K = 3 being the best solution
Figure S3: Results of DAPC cross-validation in Restablishing that the appropriate number of principal components to retain ranges from 20–100 with similarly high (>90%) prediction success
Figure S4: DAPC loading values plotted for all 34,796 SNPs, with SNP name labels beside the SNPs with the highest loadings
Figure S5: Genetic patterns of heterozygote and singleton alleles within and among P. tremuloides genetic clusters, calculated while excluding putatively admixed edge populations (Qmax < 0.75)
Results are analogous to corresponding panels of Fig. 3 (see Fig. 3 caption for details), but based on different sampling.
Figure S6: Heatmap of interindividual Nei’s D estimates
Results were reordered by row and column means, and flanked by dendrograms of the values. Color key and histogram at top left show the distribution of mean D-values.
Figure S7: Heatmap of unordered interpopulation FST estimates flanked by clustering dendrograms from the distances
Color key and histogram at top left show the distribution of mean FSTvalues.
Figure S8: Results of isolation by distance tests
Results are based on generalized linear modeling analyses of linearized FST versus log[geographic distance (km)] of P. tremuloides (top left) and its genetic clusters 1 (top right), 2 (bottom left), and 3 (bottom right).
Figure S9: Unrooted maximum-likelihood tree topology from the ingroup-only TreeMix analysis
Results are show the tree graph estimated when a single migration event was allowed (A), and residual plot of the graph (B). Scale bars and legends same as in Fig. 2.
Figure S10:
Results of TreeMix analyses on the full dataset conducted over varying levels of k block sizes (10 to 5000 bp) accounting for linkage disequilibrium. Scale bars and legends same as in Figs. 2 and S9.
Figure S11: Projection of the final present-day ecological niche model of P. tremuloides onto three late Pleistocene climatic scenarios
The ENM was built with MaxEnt and bioclimatic variables obtained from WorldClim v1, and the three Pleistocene time-slices (B–D) correspond to scenarios described in the Fig. 4 caption and Table 1. Model projections show continuous suitability values obtained using the cloglog format of MaxEnt after the application of a 10th-percentile threshold. Extent of LGM ice sheets is indicated in white.