Targeted NGS for species level phylogenomics: “made to measure” or “one size fits all”?
A peer-reviewed article of this Preprint also exists.
Author and article information
Abstract
Targeted high-throughput sequencing using hybrid-enrichment offers a promising source of data for inferring multiple, meaningfully resolved, independent gene trees suitable to address challenging phylogenetic problems in species complexes and rapid radiations. The targets in question can either be adopted directly from more or less universal tools, or custom made for particular clades at considerably greater effort. We applied custom made scripts to select sets of homologous sequence markers from transcriptome and WGS data for use in the flowering plant genus Erica (Ericaceae). We compared the resulting targets to those that would be selected both using different available tools (Hyb-Seq; MarkerMiner), and when optimising for broader clades of more distantly related taxa (Ericales; eudicots). Approaches comparing more divergent genomes (including MarkerMiner, irrespective of input data) delivered fewer and shorter potential markers than those targeted for Erica. The latter may nevertheless be effective for sequence capture across the wider family Ericaceae. We tested the targets delivered by our scripts by obtaining an empirical dataset. The resulting sequence variation was lower than that of standard nuclear ribosomal markers (that in Erica fail to deliver a well resolved gene tree), confirming the importance of maximising the lengths of individual markers. We conclude that rather than searching for “one size fits all” universal markers, we should improve and make more accessible the tools necessary for developing “made to measure” ones.
Cite this as
2017. Targeted NGS for species level phylogenomics: “made to measure” or “one size fits all”? PeerJ Preprints 5:e2763v2 https://doi.org/10.7287/peerj.preprints.2763v2Author comment
There is a manuscript draft for submission to a peer reviewed journal. It has been modified from the previous version by addition of details to the bioinformatics part of the methods and further results to the supplementary material.
Sections
Supplemental Information
Supplementary data 1: Exons sequences corresponding to the 134 markers selected for the empirical study and the complete pools of marker selected using each of the methods compared (fasta format)
Supplementary data 4: Table documenting markers as represented in Supplementary data 1-3
Additional Information
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
Malvina Kadlec conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables.
Dirk U Bellstedt contributed reagents/materials/analysis tools, reviewed drafts of the paper.
Nicholas C Le Maitre contributed reagents/materials/analysis tools, reviewed drafts of the paper.
Michael D Pirie conceived and designed the experiments, wrote the paper.
Field Study Permissions
The following information was supplied relating to field study approvals (i.e., approving body and any reference numbers):
We thank Cape Nature and South Africa National Parks for assistance with permits (Cape Nature: 0028-AAA008-00134; South Africa National Parks: CRC-2009/007-2014)
DNA Deposition
The following information was supplied regarding the deposition of DNA sequences:
Sequence alignments are available in a supplemental file.
Data Deposition
The following information was supplied regarding data availability:
GitHub
9765d11
https://github.com/MaKadlec/Select-Markers/tree/AllMarkers
GitHub
31409eb
https://github.com/MaKadlec/Select-Markers/tree/BestMarkers.py
Funding
The authors acknowledge funding from the Deutsche Forschungsgemeinschaft (DFG; PI1169/1-1 to MDP); South African National Research Foundation (NRF; DUB and MDP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.