ANGSD-wrapper: utilities for analyzing next generation sequencing data

Department of Plant Sciences, University of California, Davis, Davis, California, United States
Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, United States
Center for Population Biology and Genome Center, University of California, Davis, Davis, California, United States
DOI
10.7287/peerj.preprints.1472v2
Subject Areas
Bioinformatics, Evolutionary Studies, Genetics, Genomics
Keywords
software, population genetics, genotype likelihood
Copyright
© 2016 Durvasula et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
Cite this article
Durvasula A, Hoffman PJ, Kent TV, Liu C, Kono TJY, Morrell PL, Ross-Ibarra J. 2016. ANGSD-wrapper: utilities for analyzing next generation sequencing data. PeerJ PrePrints 4:e1472v2

Abstract

High throughput sequencing has changed many aspects of population genetics, molecular ecology, and related fields, affecting both experimental design and data analysis. The software package ANGSD allows users to perform a number of population genetic analyses on high-throughput sequencing data. ANGSD uses probabilistic approaches to calculate genome-wide descriptive statistics. The package makes use of genotype likelihood estimates rather than SNP calls and is specifically designed to produce more accurate results for samples with low sequencing depth. ANGSD makes use of full genome data while handling a wide array of sampling and experimental designs. Here we present ANGSD-wrapper, a set of wrapper scripts that provide a user-friendly interface for running ANGSD and visualizing results. ANGSD-wrapper supports multiple types of analyses including esti- mates of nucleotide sequence diversity and performing neutrality tests, principal component analysis, estimation of admixture proportions for individuals samples, and calculation of statistics that quantify recent introgression. ANGSD-wrapper also provides interactive graphing of ANGSD results to enhance data exploration. We demonstrate the usefulness of ANGSD-wrapper by analyzing resequencing data from populations of wild and domesticated Zea. ANGSD-wrapper is freely available from https://github.com/mojaveazure/angsd-wrapper.

Author Comment

We updated manuscript to reflect changes in the program and redid the example analysis with a new set of data.