Culture-independent detection and characterisation of Mycobacterium tuberculosis and M. africanum in sputum samples using shotgun metagenomics on a benchtop sequencer

Microbiology and Infection Unit, Warwick Medical School, University of Warwick, Coventry, United Kingdom
Disease Control and Elimination Theme, Medical Research Council Unit, Fajara, The Gambia
Vaccinology, Medical Research Council Unit, Fajara, The Gambia
DOI
10.7287/peerj.preprints.482v1
Subject Areas
Bioinformatics, Genomics, Microbiology, Infectious Diseases, Respiratory Medicine
Keywords
Tuberculosis, sputum, diagnosis, metagenomics, Mycobacterium tuberculosis
Copyright
© 2014 Doughty et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ PrePrints) and either DOI or URL of the article must be cited.
Cite this article
Doughty EL, Sergeant MJ, Adetifa IMO, Antonio M, Pallen MJ. 2014. Culture-independent detection and characterisation of Mycobacterium tuberculosis and M. africanum in sputum samples using shotgun metagenomics on a benchtop sequencer. PeerJ PrePrints 2:e482v1

Abstract

Tuberculosis remains a major global health problem. Laboratory diagnostic methods that allow effective, early detection of cases are central to management of tuberculosis in the individual patient and in the community. Since the 1880s, laboratory diagnosis of tuberculosis has relied primarily on microscopy and culture. However, microscopy fails to provide species- or lineage-level identification and culture-based workflows for diagnosis of tuberculosis remain complex, expensive, slow, technically demanding and poorly able to handle mixed infections. We therefore explored the potential of shotgun metagenomics, sequencing of DNA from samples without culture or target-specific amplification or capture, to detect and characterise strains from the Mycobacterium tuberculosis complex in smear-positive sputum samples obtained from The Gambia in West Africa. Eight smear- and culture-positive sputum samples were investigated using a differential-lysis protocol followed by a kit-based DNA extraction method, with sequencing performed on a benchtop sequencing instrument, the Illumina MiSeq. The number of sequence reads in each sputum-derived metagenome ranged from 989,442 to 2,818,238. The proportion of reads in each metagenome mapping against the human genome ranged from 20% to 99%. We were able to detect sequences from the M. tuberculosis complex in all eight samples, with coverage of the H37Rv reference genome ranging from 0.002X to 0.7X. By analysing the distribution of large sequence polymorphisms (deletions and the locations of the insertion element IS6110) and single nucleotide polymorphisms (SNPs), we were able to assign seven of eight metagenome-derived genomes to a species and lineage within the M. tuberculosis complex. Two metagenome-derived mycobacterial genomes were assigned to M. africanum, a species largely confined to West Africa; the others that could be assigned belonged to lineages T, H or LAM within the clade of “modern” M. tuberculosis strains. We have provided proof of principle that shotgun metagenomics can be used to detect and characterise M. tuberculosis sequences from sputum samples without culture or target-specific amplification or capture, using an accessible benchtop-sequencing platform, the Illumina MiSeq, and relatively simple DNA extraction, sequencing and bioinformatics protocols. In our hands, sputum metagenomics does not yet deliver sufficient depth of coverage to allow sequence-based sensitivity testing; it remains to be determined whether improvements in DNA extraction protocols alone can deliver this or whether culture, capture or amplification steps will be required. Nonetheless, we can foresee a tipping point when a unified automated metagenomics-based workflow might start to compete with the plethora of methods currently in use in the diagnostic microbiology laboratory.

Author Comment

This submission has been accepted for publication at PeerJ.

Supplemental Information

Detection and characterisation of Mycobacterium tuberculosis in sputum samples using shotgun metagenomics Figure S1 Detailed phylogenetic placement of metagenome-derived genomes

For each sample, the majority bases at each reference SNP position (or gaps if there was no coverage at that position) were concatenated and the sequence was placed in the reference tree using pplacer (see Methods). The output from pplacer (jplace file) was parsed and a new file produced, such that each alternative placement for a sample could be displayed in the tree along with the posterior probability of that placement. Trees were generated from the modified place file, using guppy from the pplacer suite of programs. Only the top 3 placements for each sample are shown. The combined pp values for each alternative placement of sample in a clade were used to ascertain the likelihood of that sample belonging to that clade.

DOI: 10.7287/peerj.preprints.482v1/supp-1

SNP matrix used to generate tree shown in Figure 1

DOI: 10.7287/peerj.preprints.482v1/supp-2

Repetitive genes excluded from SNP calling

DOI: 10.7287/peerj.preprints.482v1/supp-3