Review History


All reviews of published articles are made public. This includes manuscript files, peer review comments, author rebuttals and revised materials. Note: This was optional for articles submitted before 13 February 2023.

Peer reviewers are encouraged (but not required) to provide their names to the authors when submitting their peer review. If they agree to provide their name, then their personal profile page will reflect a public acknowledgment that they performed a review (even if the article is rejected). If the article is accepted, then reviewers who provided their name will be associated with the article itself.

Summary

  • The initial submission of this article was received on April 25th, 2014 and was peer-reviewed by 2 reviewers and the Academic Editor.
  • The Academic Editor made their initial decision on May 12th, 2014.
  • The first revision was submitted on May 19th, 2014 and was reviewed by the Academic Editor.
  • The article was Accepted by the Academic Editor on May 19th, 2014.

Version 0.2 (accepted)

· May 19, 2014 · Academic Editor

Accept

Thank you for addressing the reviewers' comments. I am now happy to accept your manuscript for publication at PeerJ.

Version 0.1 (original submission)

· May 12, 2014 · Academic Editor

Minor Revisions

As you can see, both reviewers were very positive about your work and have requested only very minor changes. Based on their feedback, I would ask you to address the following points:

- One reviewer finds the Mendelian violation rate in the trio analysis higher than expected for both BALSA and GATK. Are there any reasons why this could be the case?

- Explain in the main text whether filtering was used in the trio study.

- Describe the size distribution of indels detected by BALSA and simulated indels in the main text.

- Move the summarized description of the workflow and the SNAPSHOT format to the main text from the supplementary material.

Reviewer 1 ·

Basic reporting

The authors present a new whole-genome and exome sequencing analysis pipeline that uses code optimized for GPUs to run much faster than most current methods. Importantly, they optimize every step from raw reads to variant calls, including homozygous reference calls. Although the pipeline is much faster, one drawback is that it requires more memory than other methods.

Experimental design

It would be useful to describe what annotations the authors use in their random forest model in the main text of the paper.

It would be useful to describe the size distribution of indels detected by BALSA in the main text.

It would be useful to describe the size distribution of simulated indels in the main text.
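
One way to tabulate such a size distribution is sketched below. This is only an illustration, assuming each variant is available as a (REF, ALT) allele pair as in a VCF record; the function name `indel_size_distribution` is hypothetical and is not part of BALSA.

```python
from collections import Counter

def indel_size_distribution(variants):
    """Count indels by signed size: positive = insertion, negative = deletion.

    `variants` is an iterable of (ref, alt) allele strings (assumed input format).
    Records where ref and alt have equal length (e.g. SNPs) are skipped.
    """
    sizes = Counter()
    for ref, alt in variants:
        delta = len(alt) - len(ref)
        if delta != 0:
            sizes[delta] += 1
    return sizes

# Two 1-bp insertions, one 2-bp deletion, one SNP (ignored).
example = [("A", "AT"), ("ACG", "A"), ("A", "G"), ("A", "AT")]
distribution = indel_size_distribution(example)
```

Running the same tabulation on the detected and the simulated indels would allow the two distributions to be compared side by side.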

For the trio study, it would be useful to state in the main text whether filtering was used. The Mendelian violation rate seems higher than I’d expect for both BALSA and GATK. I recommend that the authors manually investigate a subset of these errors to determine what might be causing them.
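
For reference, a Mendelian violation at a site means the child's genotype cannot be formed by taking one allele from each parent. A minimal sketch of that check follows, assuming diploid, unphased genotypes encoded as pairs of allele indices (0 = reference), with no handling of missing calls, sex chromosomes, or de novo mutations; the function names are hypothetical and do not describe how BALSA or GATK compute this rate.

```python
def is_mendelian_consistent(child, mother, father):
    """Return True if the child's two alleles can be split one per parent."""
    a, b = child
    return (a in mother and b in father) or (a in father and b in mother)

def mendelian_violation_rate(trio_genotypes):
    """Fraction of sites whose (child, mother, father) genotypes violate inheritance."""
    total = 0
    violations = 0
    for child, mother, father in trio_genotypes:
        total += 1
        if not is_mendelian_consistent(child, mother, father):
            violations += 1
    return violations / total if total else 0.0

# Site 1: child 0/1 from parents 0/0 and 1/1 (consistent).
# Site 2: child 1/1 from parents 0/0 and 0/1 (violation: mother has no alt allele).
sites = [((0, 1), (0, 0), (1, 1)), ((1, 1), (0, 0), (0, 1))]
rate = mendelian_violation_rate(sites)
```

Listing the sites that fail this check is a natural starting point for the manual inspection suggested above.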

Since the NA12878 sample the authors analyzed is part of the Genome in a Bottle Consortium effort, the authors may find it useful to compare their calls to the high-confidence SNP, indel, and homozygous reference genotypes from that effort (see the paper at http://www.nature.com/nbt/journal/v32/n3/full/nbt.2835.html and the most recent calls at http://genomeinabottle.org/blog-entry/new-high-confidence-na12878-genotypes-integrating-phased-pedigree-calls). This could help the authors estimate sensitivity and specificity in the high-confidence regions. It would also be useful for the authors to manually inspect the alignments around a subset of the discordant calls.
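
The comparison suggested above could be computed roughly as follows. This is a sketch under stated assumptions: variants are represented as (chrom, pos, ref, alt) tuples already restricted to the high-confidence regions, high-confidence homozygous reference sites are (chrom, pos) tuples, and matching is by exact representation without variant normalization. The function name is hypothetical.

```python
def sensitivity_specificity(calls, truth_variants, truth_homref_sites):
    """Estimate sensitivity and specificity of a call set against a truth set.

    calls, truth_variants: sets of (chrom, pos, ref, alt) tuples.
    truth_homref_sites: set of (chrom, pos) tuples known to be homozygous reference.
    """
    tp = len(calls & truth_variants)   # true variants that were called
    fn = len(truth_variants - calls)   # true variants that were missed
    called_sites = {(chrom, pos) for chrom, pos, _ref, _alt in calls}
    fp = len(called_sites & truth_homref_sites)  # variant called at a hom-ref site
    tn = len(truth_homref_sites - called_sites)  # hom-ref site with no variant call
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    specificity = tn / (tn + fp) if tn + fp else float("nan")
    return sensitivity, specificity

calls = {("1", 100, "A", "G"), ("1", 200, "C", "T")}
truth_variants = {("1", 100, "A", "G"), ("1", 300, "G", "A")}
truth_homref_sites = {("1", 200), ("1", 400)}
sens, spec = sensitivity_specificity(calls, truth_variants, truth_homref_sites)
```

The false-positive and false-negative sets produced along the way are the discordant calls whose alignments would be worth inspecting.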

Validity of the findings

The conclusions are well-written, justified, and useful.

Reviewer 2 ·

Basic reporting

The article is well written and meets all standards. Prior literature is appropriately referenced. I would prefer that the supplementary figures were included in the main text (unless there is a particular length restriction which prohibits this).
I would also like to see a summarized description of the workflow and the SNAPSHOT format in the main text, rather than the reader having to read this in the supplementary material.

Experimental design

The experimental design is adequate.

Validity of the findings

The data presented are robust and statistically sound.

Additional comments

This is an important piece of work, and is a genuine advance in the field.

All text and materials provided via this peer-review history page are made available under a Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.