QIIME 2: Reproducible, interactive, scalable, and extensible microbiome data science

Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA
Metabolic Epidemiology Branch, National Cancer Institute, Rockville, MD, USA
Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, USA
Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA
Department of Population Health and Reproduction, University of California, Davis, CA, USA
Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
Center for Microbiome Informatics and Therapeutics, Massachusetts Institute of Technology, Cambridge, MA, USA
University of Copenhagen, Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Basic Metabolic Research, Copenhagen, Denmark
Centre for Integrative Biology, University of Trento, Trento, Italy
State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
Centre of Excellence for Plant and Microbial Sciences (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences & John Innes Centre, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Department of Microbiology and Immunology, University of California, San Francisco, CA, USA
Division of Gastroenterology and Nutrition, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Hepatology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
Department of Population Health & Pathobiology, North Carolina State University, Raleigh, NC, USA
Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
Collaborative mass spectrometry innovation center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
Irving K. Barber School of Arts and Sciences, University of British Columbia, Kelowna, British Columbia, Canada
A. Watson Armour III Center for Animal Health and Welfare, Aquarium Microbiome Project, John G. Shedd Aquarium, Chicago, IL, USA
Department of Biology, University of British Columbia Okanagan, Okanagan, BC, Canada
Computational Bioscience Graduate Program, University of Colorado Denver Anschutz Medical Campus, Aurora, Colorado, USA
Department of Medicine, Division of Biomedical Informatics and Personalized Medicine, University of Colorado Denver Anschutz Medical Campus, Aurora, Colorado, USA
Irving K. Barber School of Arts and Sciences, Department of Biology, The University of British Columbia, Kelowna, BC, Canada
Department of Medicine, The University of British Columbia, Kelowna, BC, Canada
Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, USA
Stanford University, Statistics Department, Palo Alto, CA, USA
Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Research School of Biology, The Australian National University, Canberra, ACT, Australia
Department of Pediatric Oncology, Hematology and Clinical Immunology, Heinrich-Heine University Dusseldorf, Dusseldorf, Germany
Department of Family Medicine and Public Health, University of California San Diego, La Jolla, CA, USA
College of Pharmacy, Sookmyung Women's University, Seoul, Republic of Korea
San Diego State University, Department of Biology, San Diego, CA, USA
Biotechnology Institute, University of Minnesota, Saint Paul, MN, USA
Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
Department of Pharmacology, Dalhousie University, Halifax, Nova Scotia, Canada
Science Education, Howard Hughes Medical Institute, Ashburn, VA, USA
Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, Germany
Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
Department of Medicine, Division of Biomedical Informatics and Personalized Medicine, University of Colorado Denver Anschutz Medical Campus, Aurora, CO, USA
Department of Computer Science & Engineering, University of California San Diego, La Jolla, CA, USA
Department of Statistics, University of Washington, Seattle, WA, USA
Department of Animal Science, Colorado State University, Fort Collins, CO, USA
Irving K. Barber School of Arts and Sciences, Unit 2 (Biology), University of British Columbia, Kelowna, BC, Canada
Mountain View, Google LLC, Mountain View, CA, USA
Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
School of Information Studies, Syracuse University, Syracuse, NY, USA
School of STEM, University of Washington Bothell, Bothell, WA, USA
Department of Biological Sciences, Webster University, St Louis, MO, USA
Agricultural Research Service, Genomics and Bioinformatics Research Unit, United States Department of Agriculture, Gainesville, FL, USA
College of Medicine, Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA
Computational Bioscience Program, University of Colorado Denver Anschutz Medical Campus, Aurora, CO, USA
Department of Civil and Environmental Engineering, Colorado School of Mines, Golden, CO, USA
Department of Biological Sciences and Northern Gulf Institute, University of Southern Mississippi, Hattiesburg, Mississippi, USA
Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
Department of Biology, San Diego State University, San Diego, CA, USA
Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA, USA
Division of Biological Sciences, University of California San Diego, San Diego, CA, USA
Department of Microbiology and Immunology, University of California San Francisco, San Francisco, CA, USA
Quantitative and Systems Biology Graduate Program, University of California Merced, Merced, CA, USA
Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
Department of Mathematics, University of Arizona, Tucson, AZ, USA
National Laboratory Service, Environment Agency, Starcross, UK
College of Agriculture and Life Sciences, University of Florida, Gainesville, FL, USA
Department of Biostatistics, University of Washington, Seattle, WA, USA
University of Washington Bothell, School of STEM, Division of Biological Sciences, Bothell, WA, USA
Merck & Co. Inc., Kenilworth, NJ, USA
Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA
DOI
10.7287/peerj.preprints.27295v2
Subject Areas
Bioinformatics, Ecology, Microbiology, Data Science
Keywords
microbiome, bioinformatics, data science, software, microbial ecology, reproducibility
Copyright
© 2018 Bolyen et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet C, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F, Bai Y, Bisanz JE, Bittinger K, Brejnrod A, Brislawn CJ, Brown CT, Callahan BJ, Caraballo-Rodríguez AM, Chase J, Cope E, Da Silva R, Dorrestein PC, Douglas GM, Durall DM, Duvallet C, Edwardson CF, Ernst M, Estaki M, Fouquier J, Gauglitz JM, Gibson DL, Gonzalez A, Gorlick K, Guo J, Hillmann B, Holmes S, Holste H, Huttenhower C, Huttley G, Janssen S, Jarmusch AK, Jiang L, Kaehler B, Kang KB, Keefe CR, Keim P, Kelley ST, Knights D, Koester I, Kosciolek T, Kreps J, Langille MG, Lee J, Ley R, Liu Y, Loftfield E, Lozupone C, Maher M, Marotz C, Martin BD, McDonald D, McIver LJ, Melnik AV, Metcalf JL, Morgan SC, Morton J, Naimey AT, Navas-Molina JA, Nothias LF, Orchanian SB, Pearson T, Peoples SL, Petras D, Preuss ML, Pruesse E, Rasmussen LB, Rivers A, Robeson, II MS, Rosenthal P, Segata N, Shaffer M, Shiffer A, Sinha R, Song SJ, Spear JR, Swafford AD, Thompson LR, Torres PJ, Trinh P, Tripathi A, Turnbaugh PJ, Ul-Hasan S, van der Hooft JJ, Vargas F, Vázquez-Baeza Y, Vogtmann E, von Hippel M, Walters W, Wan Y, Wang M, Warren J, Weber KC, Williamson CH, Willis AD, Xu ZZ, Zaneveld JR, Zhang Y, Zhu Q, Knight R, Caporaso JG. 2018. QIIME 2: Reproducible, interactive, scalable, and extensible microbiome data science. PeerJ Preprints 6:e27295v2

Abstract

We present QIIME 2, an open-source microbiome data science platform accessible to users spanning the microbiome research ecosystem, from scientists and engineers to clinicians and policy makers. QIIME 2 provides new features that will drive the next generation of microbiome research. These include interactive spatial and temporal analysis and visualization tools, support for metabolomics and shotgun metagenomics analysis, and automated data provenance tracking to ensure reproducible, transparent microbiome data science.

Author Comment

This version integrates editor feedback received prior to peer review at another journal. It adds discussion of other tools being used for microbiome data science, worked examples illustrating how QIIME 2 works (Supplementary File 2), and feedback on QIIME 2 from 15 users (Supplementary File 3).

Supplemental Information

Supplementary File 1 contains the QIIME 2 .qzv files corresponding to Figure 1a-d

These are also accessible at https://github.com/qiime2/paper1 and can be viewed using QIIME 2 View ( https://view.qiime2.org ) where readers can interact with the results, and explore the methods used to generate them (see the Provenance tab after loading a .qzv file with QIIME 2 View).

DOI: 10.7287/peerj.preprints.27295v2/supp-1