MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data

Richard Allen White III; Ajay Panyala; Kevin Glass; Sean Colby; Kurt R Glaesemann; Christer Jansson; Janet K Jansson

doi:10.7287/peerj.preprints.2825v1

Richard Allen White III¹, Ajay Panyala², Kevin Glass³, Sean Colby¹, Kurt R Glaesemann², Christer Jansson³, Janet K Jansson¹

1 Biological Sciences Division, Pacific Northwest National Laboratory (PNNL), Richland, Washington, USA

2 Information Technology, High Performance Computing (HPC) and Cloud Services, Pacific Northwest National Laboratory (PNNL), Richland, Washington, USA

3 Environmental and Molecular Sciences Laboratory (EMSL), Pacific Northwest National Laboratory (PNNL), Richland, Washington, USA

Published: 2017-02-21
Accepted: 2017-02-21

Subject Areas: Bioinformatics, Computational Biology, Data Science, Scientific Computing and Simulation, Software Engineering
Keywords: K-mer counting, Database-independent property analysis (DIPA), Metagenomic analysis, Metatranscriptomic analysis, Diversity-estimation

Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.

Cite this article: White III RA, Panyala A, Glass K, Colby S, Glaesemann KR, Jansson C, Jansson JK. 2017. MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data. PeerJ Preprints 5:e2825v1 https://doi.org/10.7287/peerj.preprints.2825v1

Abstract

MerCat (“Mer-Catenate”) is a parallel, highly scalable and modular property software package for robust analysis of features in next-generation sequencing data. Using assembled contigs and raw sequence reads from any platform as input, MerCat performs k-mer counting for any length k and produces feature abundance count tables. MerCat allows direct analysis of data properties without depending on a reference sequence database, as search tools such as BLAST commonly do, for compositional analysis of whole-community shotgun sequencing data (e.g., metagenomes and metatranscriptomes).
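
To make the idea of k-mer counting concrete, the following is a minimal Python sketch of how overlapping k-mers of length k can be tallied into an abundance table. It is an illustration only and not MerCat's implementation: the function name `count_kmers`, the example reads, and the choice to skip windows containing bases other than A, C, G, or T are all assumptions made here for the example.

```python
from collections import Counter


def count_kmers(sequences, k):
    """Count every overlapping k-mer across an iterable of sequences.

    Hypothetical helper for illustration; not part of MerCat.
    """
    if k < 1:
        raise ValueError("k must be a positive integer")
    valid_bases = set("ACGT")
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width k; sequences shorter than k contribute nothing.
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Assumption: windows with ambiguous bases (e.g. N) are skipped.
            if set(kmer) <= valid_bases:
                counts[kmer] += 1
    return counts


# Two made-up reads, used only to show the shape of the output table.
reads = ["ACGTAC", "GTACGN"]
table = count_kmers(reads, 3)
for kmer, n in sorted(table.items()):
    print(f"{kmer}\t{n}")
```

Each row of the printed table pairs one k-mer with its count across all input sequences, which is the kind of feature abundance table the abstract refers to. A production tool would additionally need parallelism and memory-efficient storage, neither of which this sketch attempts.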

Author Comment

Initial version of our manuscript submitted for peer review to Bioinformatics
