Javascript is disabled in your browser. Please enable Javascript to view PeerJ.

Review History
A hybrid extraction model for semantic knowledge discovery of water conservancy big data

All reviews of published articles are made public. This includes manuscript files, peer review comments, author rebuttals and revised materials. Note: This was optional for articles submitted before 13 February 2023.

Peer reviewers are encouraged (but not required) to provide their names to the authors when submitting their peer review. If they agree to provide their name, then their personal profile page will reflect a public acknowledgment that they performed a review (even if the article is rejected). If the article is accepted, then reviewers who provided their name will be associated with the article itself.

View examples of open peer review.

Summary

The initial submission of this article was received on March 25th, 2025 and was peer-reviewed by 2 reviewers and the Academic Editor.
The Academic Editor made their initial decision on April 17th, 2025.
The first revision was submitted on May 12th, 2025 and was reviewed by 2 reviewers and the Academic Editor.
The article was Accepted by the Academic Editor on May 26th, 2025.

Version 0.2 (accepted)

Syed Hassan Shah · May 26, 2025 · Academic Editor

Accept

In the opinions of original reviewers and mine, this revised paper can be accepted now.

[# PeerJ Staff Note - this decision was reviewed and approved by Xiangjie Kong, a PeerJ Section Editor covering this Section #]

Reviewer 1 · May 26, 2025

Basic reporting

In this version of the manuscript, I don't have any further comments. The authors have solved all my comments.

Experimental design

N/A

Validity of the findings

N/A

Additional comments

N/A

Cite this review as

Anonymous Reviewer (2025) Peer Review #1 of "A hybrid extraction model for semantic knowledge discovery of water conservancy big data (v0.2)". PeerJ Computer Science

Fawad Naseer · May 16, 2025

Basic reporting

Authors have incorporated all the suggested points. I consider it goog to accept it for publish

Experimental design

Authors have incorporated all the suggested points. I consider it goog to accept it for publish

Validity of the findings

Authors have incorporated all the suggested points. I consider it goog to accept it for publish

Additional comments

Authors have incorporated all the suggested points. I consider it goog to accept it for publish

Cite this review as

Naseer F (2025) Peer Review #2 of "A hybrid extraction model for semantic knowledge discovery of water conservancy big data (v0.2)". PeerJ Computer Science

Download Version 0.2 (PDF) Download author's response letter (v0.2) - submitted May 12, 2025

Version 0.1 (original submission)

Syed Hassan Shah · Apr 17, 2025 · Academic Editor

Major Revisions

In the opinions of reviewers and mine, this paper should undergo a major revision.

**PeerJ Staff Note:** Please ensure that all review and editorial comments are addressed in a response letter and that any edits or clarifications mentioned in the letter are also inserted into the revised manuscript where appropriate.

**Language Note:** The review process has identified that the English language must be improved. PeerJ can provide language editing services - please contact us at [email protected] for pricing (be sure to provide your manuscript number and title). Alternatively, you should make your own arrangements to improve the language quality and provide details in your response letter. – PeerJ Staff

Reviewer 1 · Apr 13, 2025

Basic reporting

This manuscript presents a novel deep learning-based model (WIEM-DL) designed to perform cross-website information extraction in water conservancy public opinion analysis. The authors propose an integrated framework combining BERT embeddings, BiLSTM, attention mechanisms, and CRF for entity recognition and sentiment extraction, demonstrating superior performance over existing models like BERT-CRF and BiLSTM-CRF. The work addresses the need for scalable, transferable web information extraction in domain-specific big data environments. The strengths of the manuscript are a followss:

- The problem is well-motivated and relevant to applied NLP and domain-specific public opinion monitoring.

- The WIEM-DL model is reasonably designed using current deep learning components.

- Empirical results show strong performance and the model's ability to generalize across different website structures.

However, there are some issues also available in the literature:

- The manuscript requires revision for grammar, clarity, and fluency. Sentence phrasing and inconsistent terminologies are present in the manuscript.

- The manuscript includes excessive theoretical background and repetitive explanations in methodology without integrating them into the core contribution.

Experimental design

- The training set appears to be based on small dataset, which is not appropriate given the complexity of the task. More detail is needed on annotation procedures, dataset availability, and validation methods to support the reported performance.

- While the WIEM-DL outperforms baselines, the novelty over existing hybrid models is not well-demonstrated.

Validity of the findings

- A deeper analysis of why this configuration works better, including ablation studies and more extensive cross-domain benchmarks, is required.

- Figures and tables are referenced but not clearly described. It would be better if the authors discussed the figures and the results clearly in the text.

Additional comments

The manuscript needs a significant revision before acceptance! There are many major changes required.

Cite this review as

Anonymous Reviewer (2025) Peer Review #1 of "A hybrid extraction model for semantic knowledge discovery of water conservancy big data (v0.1)". PeerJ Computer Science

Fawad Naseer · Apr 16, 2025

Basic reporting

1Poor English grammar and awkward phrasing throughout requires thorough editing
2Introduction fails to clearly establish the research gap and significance
3Literature review lacks logical organization
4Figures have inadequate captions and explanations of symbols/abbreviations
55Methodology presentation is redundant and fragmented
6Missing citations for key statements and inconsistent reference formatting

Experimental design

Training hyperparameters (learning rate, batch size, etc.) are mentioned only briefly at lines 660-667 without justification for these choices.
How the 10 manually annotated pages were selected (lines 642-647).
Discuss statistical significance of performance differences, Include confidence intervals for results.

Validity of the findings

no comment

Additional comments

Lines 39-42: Expand on specific challenges in traditional semantic knowledge extraction techniques with concrete examples.
Lines 79-90: This paragraph repeats information from the introduction. Consider restructuring to avoid redundancy.
Lines 124-132: The contribution statements would be stronger if they were more specific about the technical innovations rather than general advantages.
Section on BERT Embedding (lines 414-427): Provide more technical details on how sentiment analysis is integrated with BERT embeddings.
The experimental results section would benefit from ablation studies to show the contribution of each component of the WIEM-DL model.
Tables 3 and 4: Some values appear inconsistent or missing. Review and ensure all data is accurately presented.

Consider adding a limitations and future work section that honestly addresses current shortcomings and potential improvements.

Cite this review as

Naseer F (2025) Peer Review #2 of "A hybrid extraction model for semantic knowledge discovery of water conservancy big data (v0.1)". PeerJ Computer Science

Download Original Submission (PDF) - submitted Mar 25, 2025

All text and materials provided via this peer-review history page are made available under a Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Review History A hybrid extraction model for semantic knowledge discovery of water conservancy big data

Summary

Version 0.2 (accepted)

Syed Hassan Shah · May 26, 2025 · Academic Editor

Reviewer 1 · May 26, 2025

Basic reporting

Experimental design

Validity of the findings

Additional comments

Fawad Naseer · May 16, 2025

Basic reporting

Experimental design

Validity of the findings

Additional comments

Version 0.1 (original submission)

Syed Hassan Shah · Apr 17, 2025 · Academic Editor

Reviewer 1 · Apr 13, 2025

Basic reporting

Experimental design

Validity of the findings

Additional comments

Fawad Naseer · Apr 16, 2025

Basic reporting

Experimental design

Validity of the findings

Additional comments

Review History
A hybrid extraction model for semantic knowledge discovery of water conservancy big data