"PeerJ Preprints" is a venue for early communication or feedback before peer review. Data may be preliminary.

Supplemental Information

Supplementary Figure 1. Mock community datasets analyzed in this study

DOI: 10.7287/peerj.preprints.934v2/supp-1

Supplementary Figure 2. Mock community A composition

DOI: 10.7287/peerj.preprints.934v2/supp-2

Supplementary Figure 3. Mock community B composition

DOI: 10.7287/peerj.preprints.934v2/supp-3

Supplementary Figure 4. Mock community C composition

DOI: 10.7287/peerj.preprints.934v2/supp-4

Supplementary Figure 5. Mock community D composition

DOI: 10.7287/peerj.preprints.934v2/supp-5

Supplementary Figure 6

Taxonomy classifier configuration and mock community composition alter assignment accuracy at the family level.

DOI: 10.7287/peerj.preprints.934v2/supp-6

Supplementary Figure 7

Taxonomy classifier configuration and mock community composition alter assignment accuracy at the species level.

DOI: 10.7287/peerj.preprints.934v2/supp-7

Supplementary Figure 8

Taxonomy classifier selection critically shapes assignment accuracy of simulated communities. Violin plots illustrate the distribution of precision, recall, and F-measure values across all simulated communities and all parameter configurations for a given method, for family-level (left), genus-level (middle), or species-level (right) taxonomy assignments. Heavy dashed lines indicate median values; fine dashed lines indicate quartiles.

DOI: 10.7287/peerj.preprints.934v2/supp-8

Supplementary Figure 9

Taxonomic lineages represented in reference databases

DOI: 10.7287/peerj.preprints.934v2/supp-9

Supplementary Figure 10

Evaluation of the mothur taxonomy classifier. A, Distribution of F-measure scores across all partial-reference simulated communities and all parameter configurations for each method for species-level taxonomy assignments (right). Heavy dashed lines indicate median values; fine dashed lines indicate quartiles. SM = SortMeRNA. B, Confidence configuration and simulated community composition alter assignment accuracy at the species level. See Figure 4 for a full description of the analysis and a comparison to other classifiers and configurations.

DOI: 10.7287/peerj.preprints.934v2/supp-10

Additional Information

Competing Interests

The authors declare they have no competing interests.

Author Contributions

Nicholas A Bokulich conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Jai Ram Rideout conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Evguenia Kopylova performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Evan Bolyen performed the experiments, analyzed the data, prepared figures and/or tables, reviewed drafts of the paper.

Jessica Patnode performed the experiments, analyzed the data, reviewed drafts of the paper.

Zach Ellett performed the experiments, analyzed the data, reviewed drafts of the paper.

Daniel McDonald performed the experiments, analyzed the data, reviewed drafts of the paper.

Benjamin Wolfe contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Corinne F Maurice contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Rachel J Dutton contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Peter J Turnbaugh conceived and designed the experiments, contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Rob Knight conceived and designed the experiments, reviewed drafts of the paper.

J Gregory Caporaso conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Data Deposition

The following information was supplied regarding the deposition of related data:


Funding

This work was supported in part by the 2014 Kinsella Memorial Award (NAB), NIH P50 GM068763 (PJT and CFM), NSF IGERT grant number 1144807 (DM), the NIH and the Howard Hughes Medical Institute (RK), and the Arizona Board of Regents TRIF (JGC). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
