Intragenomic polymorphisms among high-copy loci: A genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae)

Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA
Department of Biology, Hobart and William Smith Colleges, Geneva, NY, USA
Department of Botany, Oklahoma State University, Stillwater, OK, USA
DOI
10.7287/peerj.preprints.512v1
Subject Areas
Bioinformatics, Evolutionary Studies, Genetics, Genomics, Plant Science
Keywords
concerted evolution, genome skimming, high-copy, intragenomic polymorphism, partial SNP (pSNP), nuclear ribosomal DNA (nrDNA), intra-individual site polymorphism, 2ISP, Asclepias, ITS
Copyright
© 2014 Weitemier et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
Cite this article
Weitemier K, Straub SCK, Fishbein M, Liston A. 2014. Intragenomic polymorphisms among high-copy loci: A genus-wide study of nuclear ribosomal DNA in Asclepias (Apocynaceae) PeerJ PrePrints 2:e512v1

Abstract

Despite knowledge that concerted evolution of high-copy loci is often imperfect, few studies investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual's consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the “noncoding” ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming).

Author Comment

This is a submission to PeerJ for review.

Supplemental Information

Command pipeline for tallying polymorphic positions

DOI: 10.7287/peerj.preprints.512v1/supp-1