No budget mitogenomics: Assembling 14 new mitogenomes for the ant subfamily Pseudomyrmecinae from public data

Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro, Brazil
DOI
10.7287/peerj.preprints.27342v1
Subject Areas
Biodiversity, Bioinformatics, Entomology, Evolutionary Studies, Genomics
Keywords
Pseudomyrmecinae, Mitogenomics, Data mining, Bioinformatics, Phylogenomics, Ant evolutionary biology, Next Generation Sequencing, Public data
Copyright
© 2018 Vieira et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Vieira GA, Prosdocimi F. 2018. No budget mitogenomics: Assembling 14 new mitogenomes for the ant subfamily Pseudomyrmecinae from public data. PeerJ Preprints 6:e27342v1

Abstract

The advent of Next Generation Sequencing has reduced sequencing costs and increased genomic projects from a huge amount of organismal taxa, generating an unprecedented amount of genomic datasets publicly available. Often, only a tiny fraction of outstanding relevance of the genome data produced by researchers is used in their works. This fact allows the data generated to be recycled in further projects worldwide. The assembly of complete mitogenomes is frequently overlooked though it is useful to understand evolutionary relationships among taxa, especially those presenting poor mtDNA sampling at the level of genera and families. This is exactly the case for ants (Hymenoptera:Formicidae) and more specifically for the subfamily Pseudomyrmecinae, a group of arboreal ants with several cases of convergent coevolution without any complete mitochondrial sequence available. In this work, we assembled, annotated and performed comparative genomics analyses of 14 new complete mitochondria from Pseudomyrmecinae species relying solely on public datasets available from the Sequence Read Archive (SRA). We used all complete mitogenomes available for ants to study the gene order conservation and also to generate two phylogenetic trees using both (i) concatenated set of 13 mitochondrial genes and (ii) the whole mitochondrial sequences. Even though the tree topologies diverged subtly from each other (and from previous studies), our results confirm several known relationships and generate new evidences for sister clade classification inside Pseudomyrmecinae clade. We also performed a synteny analysis for Formcidae and identified possible sites in which nucleotidic insertions happened in mitogenomes of pseudomyrmecine ants. Using a data mining/bioinformatics approach, the current work increased the number of complete mitochondrial genomes available for ants from 15 to 29, demonstrating the unique potential of public databases for mitogenomics studies. The wide applications of mitogenomes in research and presence of mitochondrial data in different public dataset types makes the “no budget mitogenomics” approach ideal for comprehensive molecular studies, especially for subsampled taxa.

Author Comment

This is a submission to PeerJ for review.

Supplemental Information

Assembly coverage for all 14 Pseudomyrmecinae mitogenomes

DOI: 10.7287/peerj.preprints.27342v1/supp-1

Multifasta containing all 14 mitochondrial genome sequences assembled for Myrmecinae using public data

DOI: 10.7287/peerj.preprints.27342v1/supp-2

Information on the fourteen publicly available genomic datasets for Pseudomirmecinae ants downloaded from SRA database

DOI: 10.7287/peerj.preprints.27342v1/supp-3

Information on the fourteen publicly available genomic datasets for Pseudomirmecinae ants downloaded from SRA database

DOI: 10.7287/peerj.preprints.27342v1/supp-4

Accession number, species name and reference for all mitochondria used in phylogenetic trees

DOI: 10.7287/peerj.preprints.27342v1/supp-5

Direct links for download of the 14 Pseudomyrmecinae datasets used in this study to assemble complete mitochondrial genome sequences

DOI: 10.7287/peerj.preprints.27342v1/supp-6