Digestiflow: from BCL to FASTQ with ease
- Published
- Accepted
- Subject Areas
- Bioinformatics, Computational Science
- Keywords
- next-generation sequencing, demultiplexing, scientific data management
- Copyright
- © 2019 Holtgrewe et al.
- Licence
- This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
- Cite this article
- 2019. Digestiflow: from BCL to FASTQ with ease. PeerJ Preprints 7:e27717v4 https://doi.org/10.7287/peerj.preprints.27717v4
Abstract
Management raw sequencing data and its preprocessing (conversion into sequences and demultiplexing) remains a challenging topic for groups running sequencing devices. They face many challenges in such efforts and solutions ranging from manual management of spreadsheets to very complex and customized LIMS systems handling much more than just sequencing raw data. In this manuscript, we describe the software package DigestiFlow that focuses on the management of Illumina flow cell sample sheets and raw data. It allows for automated extraction of information from flow cell data and management of sample sheets. Furthermore, it allows for the automated and reproducible conversion of Illumina base calls to sequences and the demultiplexing thereof using bcl2fastq and Picard Tools, followed by quality control report generation.
Author Comment
Adjusting preprint to resubmission.
Supplemental Information
Digestiflow Server Documentation
The documentation of Digestiflow at the time of publication. An up-to-date version is available online. The link can be found on the project page on GitHub.