Javascript is disabled in your browser. Please enable Javascript to view PeerJ.

Review History
Biotea: semantics for Pubmed Central

All reviews of published articles are made public. This includes manuscript files, peer review comments, author rebuttals and revised materials. Note: This was optional for articles submitted before 13 February 2023.

Peer reviewers are encouraged (but not required) to provide their names to the authors when submitting their peer review. If they agree to provide their name, then their personal profile page will reflect a public acknowledgment that they performed a review (even if the article is rejected). If the article is accepted, then reviewers who provided their name will be associated with the article itself.

View examples of open peer review.

Summary

The initial submission of this article was received on August 9th, 2017 and was peer-reviewed by 3 reviewers and the Academic Editor.
The Academic Editor made their initial decision on September 25th, 2017.
The first revision was submitted on December 7th, 2017 and was reviewed by the Academic Editor.
The article was Accepted by the Academic Editor on December 7th, 2017.

Version 0.2 (accepted)

Ludmil Alexandrov · Dec 7, 2017 · Academic Editor

You have addressed all of the reviewers' comments. Your manuscript has been accepted for publication.

Download Version 0.2 (PDF) Download author's response letter (v0.2) - submitted Dec 7, 2017

Version 0.1 (original submission)

Ludmil Alexandrov · Sep 25, 2017 · Academic Editor

Minor Revisions

Please address all comments raised by the reviewers. Please add two more examples as suggested by the third reviewer.

Reviewer 1 · Aug 27, 2017

Basic reporting

I enjoyed reading the manuscript. The presented work is sound and highly practical. The manuscript is well written.

I commend the authors for semantically processing articles from >7K journals and making all the data and software code available. In my opinion the submitted manuscript should be accepted for a publication after suggested minor revisions:

- ‘the NER service provided by the NCBO Annotator’ The accuracy of many other NER tools is higher than of this one, see for example tools provided by NaCTeM. Those other tools also are using ontologies. Please explain your choice.
- Overall, there is no discussion of text mining (TM) as a closely related area of research. Of course, there are principle differences with what the authors are producing, but many TM steps can be re-used by Biotea.
- I would prefer to see more examples and discussion o how Biotea can be used.
- Line 64: the first mentioning of Biotea – you need to provide more explanation what it is. The same about hypothes.is
- Proofread the manuscript, examples: Lines 48-49: add missing gaps; Line 65: “representation, idem. that “, etc. etc.

Experimental design

no comment

Validity of the findings

no comment

Additional comments

no comment

Cite this review as

Anonymous Reviewer (2018) Peer Review #1 of "Biotea: semantics for Pubmed Central (v0.1)". PeerJ https://doi.org/10.7287/peerj.4201v0.1/reviews/1

Reviewer 2 · Sep 12, 2017

Basic reporting

The authors do an excellent job of explaining what they mean by their term "RDFize". However, I do not see the need to coin a new verb, especially considering the authors later turn the this new verb into a noun: RDFization. I think it best editorial practice if they refrain from coining a new term and instead refer to RDF generation or RDF creation as is appropriate.

Suggested grammatical changes:

Lines 280-281. "We are using hierarchical ... using the cosine distance as the metric"
Line 409. "The resulting dataset is over 150 Gigabytes in size"
Line 421. Parameterize (actually suggest rewriting this sentence - it's not completely clear)

Experimental design

No comment

Validity of the findings

No comment

Additional comments

Just a suggestion. You note several software dependencies for using Biotea - Maven, Java and Eclipse. These particular programs tend to have very specific versions for various OS platforms. I think it would be helpful to your audience and potentially increase the usage of Biotea if you were to provide a preconfigured Virtual Machine image using Ubuntu. Virtualbox provides such customized VM's for various purposes (http://www.oracle.com/technetwork/community/developer-vm/index.html), as well as Bitcurator (https://www.bitcurator.net/)

Cite this review as

Anonymous Reviewer (2018) Peer Review #2 of "Biotea: semantics for Pubmed Central (v0.1)". PeerJ https://doi.org/10.7287/peerj.4201v0.1/reviews/2

Reviewer 3 · Sep 25, 2017

Basic reporting

The subject of the manuscript is one that is important to the future of science, an one that is close to my heart: the improved reporting of scientific knowledge through the use of semantic technology. There is a clear and urgent need to improve the reporting of scientific knowledge, and it is scandalous that so much public and charitable money is spent on science that cannot properly be used because it is inaccessible to computers.

The Introduction and background are clear, and the literature is well referenced and relevant. Clear, unambiguous, professional English language is used throughout. The figures are relevant, high quality, well labelled and described.

Experimental design

The models, services, software and datasets are available.

Validity of the findings

The authors demonstrate the utility of Biotea in two examples. This is the weakest part of the manuscript:
• Example 1 concerns the retrieval and clustering of papers annotated with the SNOMED CT term ‘renal  cell carcinoma’. Unfortunately, the three papers investigated have little to do with ‘renal  cell carcinoma’, although it is true that this phrase occurs in all of them. However, the authors do a good job of describing why papers 3862691 and 3862582 are more similar with each other than with 3899087.
• Example 2 involves the creation of a very long SPARQL query, but the results of the query are not described. The SPARQL query would be very hard for a domain scientist to generate without the use of some tool.
I recommend that two other examples are used to demonstrate the utility of Biotea.

Around line 255 the manuscript describes the mapping between Biotea and SIO concepts: ‘encapsulating the original data type property value; thus, a bibo:pmid with the value “28300141” is  mapped to the object property sio:has_identifier, this is linked to the class sio:identifier  that is related by means of sio:has_value to the actual PMID “28300141”.’ I don’t see how this mapping captures the key information that the identifier comes from PubMed .

Cite this review as

Anonymous Reviewer (2018) Peer Review #3 of "Biotea: semantics for Pubmed Central (v0.1)". PeerJ https://doi.org/10.7287/peerj.4201v0.1/reviews/3

Download Original Submission (PDF) - submitted Aug 9, 2017

All text and materials provided via this peer-review history page are made available under a Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Review History Biotea: semantics for Pubmed Central

Summary

Version 0.2 (accepted)

Ludmil Alexandrov · Dec 7, 2017 · Academic Editor

Version 0.1 (original submission)

Ludmil Alexandrov · Sep 25, 2017 · Academic Editor

Reviewer 1 · Aug 27, 2017

Basic reporting

Experimental design

Validity of the findings

Additional comments

Reviewer 2 · Sep 12, 2017

Basic reporting

Experimental design

Validity of the findings

Additional comments

Reviewer 3 · Sep 25, 2017

Basic reporting

Experimental design

Validity of the findings

Review History
Biotea: semantics for Pubmed Central