A guide to carrying out a phylogenomic target sequence capture project

Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
Institute of Entomology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic
Programa de Pós-Graduação em Genética, Conservação e Biologia Evolutiva, Instituto Nacional de Pesquisas da Amazônia, Manaus, Brazil
Coordenação de Zoologia, Museu Paraense Emílio Goeldi, Belém, Brazil
INRA - Centre Nouvelle-Aquitaine-Poitiers, Lusignan, France
Natural History Museum, University of Oslo, Oslo, Norway
Department of Anthropology, National Museum of Natural History, Smithsonian Institution, Washington, United States of America
Royal Botanic Gardens, Kew, Richmond, United Kingdom
DOI
10.7287/peerj.preprints.27968v1
Subject Areas
Bioinformatics, Computational Biology, Genetics, Genomics, Molecular Biology
Keywords
Phylogenetics, NGS, Hyb-Seq, Hybrid capture, Bait, Probe, Enrichment, Exon, Anchored, Illumina
Copyright
© 2019 Andermann et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Andermann T, Torres Jimenez MF, Matos-Maraví P, Batista R, Blanco-Pastor JL, Gustafsson ALS, Kistler L, Liberal IM, Oxelman B, Bacon CD, Antonelli A. 2019. A guide to carrying out a phylogenomic target sequence capture project. PeerJ Preprints 7:e27968v1

Abstract

High-throughput DNA sequencing techniques enable time- and cost-effective sequencing of large portions of the genome. Instead of sequencing and annotating whole genomes, many phylogenetic studies focus sequencing efforts on large sets of pre-selected loci, which further reduces costs and bioinformatic challenges while increasing sequencing depth. One common approach that enriches loci before sequencing is often referred to as target sequence capture. This technique has been shown to be applicable to phylogenetic studies of greatly varying evolutionary depth and has proven to produce powerful, large multi-locus DNA sequence datasets of selected loci, suitable for phylogenetic analyses. However, target capture requires careful theoretical and practical considerations, which will greatly affect the success of the experiment. Here we provide an easy-to-follow flowchart for adequately designing phylogenomic target capture experiments, and we discuss necessary considerations and decisions from the first steps in the lab to the final bioinformatic processing of the sequence data. We particularly discuss issues and challenges related to the taxonomic scope, sample quality, and available genomic resources of target capture projects and how these issues affect all steps from bait design to the bioinformatic processing of the data. Altogether this review outlines a roadmap for future target capture experiments and is intended to assist researchers with making informed decisions for designing and carrying out successful phylogenetic target capture studies

Author Comment

This manuscript provides an easy-to-follow workflow for adequately designing phylogenomic target capture experiments. We discuss issues and challenges related to target capture design, lab protocols, and data analysis.