All reviews of published articles are made public. This includes manuscript files, peer review comments, author rebuttals and revised materials. Note: This was optional for articles submitted before 13 February 2023.
Peer reviewers are encouraged (but not required) to provide their names to the authors when submitting their peer review. If they agree to provide their name, then their personal profile page will reflect a public acknowledgment that they performed a review (even if the article is rejected). If the article is accepted, then reviewers who provided their name will be associated with the article itself.
I have evaluated the reviewers' comments and your responses, and I have decided to accept the article. Congratulations on the acceptance of your manuscript!
[# PeerJ Staff Note - this decision was reviewed and approved by Daniel Katz, a 'PeerJ Computer Science' Section Editor covering this Section #]
Please read the reviewers' comments carefully and provide a point-by-point response addressing their concerns.
**PeerJ Staff Note:** Please ensure that all review, editorial, and staff comments are addressed in a response letter and that any edits or clarifications mentioned in the letter are also inserted into the revised manuscript where appropriate.
This manuscript proposes a new feature reduction scheme that achieves higher accuracy with respect to FP error while using fewer computing resources. In addition, the manuscript proposes a new effective semantic-based dimensionality reduction algorithm for binary classification problems (e-SDRS). Experimental results show that, using this dimensionality reduction algorithm, classifiers achieve better accuracy with lower computational requirements.
Unfortunately, there are still some issues with this manuscript.
1. The comparative dimension of the experiments is insufficient. The authors should compare computing-resource usage against more methods to demonstrate the advancement of the algorithm.
2. There are many syntax errors, for example: (i) "The study includes experiments using two datasets (small and medium sizes) and a detailed comparison with two previous optimization based approaches" → "approaches"; (ii) "These techniques have been successfully applied to solve a wide variety of problems with specific constraints including spam filtering, language detection, or news categorization (Kowsari et al., 2019)" → "or" should be "and".
All comments are written in the "Additional comments" section.
In this study, spam mail detection, one of the text classification problems, is discussed. One of the difficult tasks in text classification is dimensionality reduction/feature selection/feature reduction. The authors propose a semantic-feature-based approach for dimensionality reduction, developed for two-class problems. My comments on the work are as follows:
- The literature review in Section 2 should be expanded, with a focus on recent studies.
- No details are given regarding the experimental setup. More detail is needed. For example, what was done in the pre-processing steps? Which tools were used for pre-processing? Was a term-weighting strategy used, and if so, which term-weighting method? Which document vector representation was used? Details like this should be provided.
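To illustrate why the term-weighting choice matters for reproducibility, here is a minimal sketch of one common option, TF-IDF with smoothed inverse document frequency. This is a generic illustration, not the manuscript's actual pipeline; the function name and smoothing variant are assumptions.

```python
import math
from collections import Counter

def tfidf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    Illustrative sketch: raw term frequency times smoothed IDF.
    Other term-weighting schemes (binary, log TF, BM25) would give
    different document vectors, which is why the paper should state
    which one was used.
    """
    n = len(docs)
    df = Counter()                       # document frequency per term
    for doc in docs:
        df.update(set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({t: tf[t] * (math.log((1 + n) / (1 + df[t])) + 1)
                        for t in tf})
    return weights

docs = [["free", "money", "now"], ["meeting", "tomorrow", "now"]]
w = tfidf(docs)
# "free" (rare) is weighted higher than "now" (appears in every document)
```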
- The Baseline, Real, and Theoretical scenarios described in the article are not fully understood. What these are should be explained more clearly.
- What is the total number of words (vocabulary size) in the datasets used? How do the classification algorithms perform when all words are used for classification? The proposed technique reduces the number of words; what results are achieved with the reduced vocabulary? Providing detailed results like this would better reveal the effect of the proposed method.
- The proposed approach could be compared with some of the filter and wrapper feature-selection techniques used in text classification problems. Although these are not dimensionality reduction approaches with the same characteristics as the proposed feature reduction method, reporting their effect on classification performance would be very valuable for researchers.
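As context for this comparison, a typical filter method scores each term by its statistical dependence on the class label and keeps the top-ranked terms. Below is a minimal sketch of a chi-square score for one term in a binary task; the function name and toy data are assumptions for illustration only.

```python
def chi2_score(docs, labels, term):
    """Chi-square statistic for one term in a binary classification task.

    Filter-style feature selection: build a 2x2 contingency table
    (term present/absent vs. class 1/0) and measure how strongly the
    term's presence depends on the class. Terms are ranked by this
    score and the top-k are kept.
    """
    n = len(docs)
    a = sum(1 for d, y in zip(docs, labels) if term in d and y == 1)
    b = sum(1 for d, y in zip(docs, labels) if term in d and y == 0)
    c = sum(1 for d, y in zip(docs, labels) if term not in d and y == 1)
    d_ = n - a - b - c
    num = n * (a * d_ - b * c) ** 2
    den = (a + b) * (c + d_) * (a + c) * (b + d_)
    return num / den if den else 0.0

docs = [{"free", "money"}, {"free", "offer"}, {"meeting"}, {"agenda"}]
labels = [1, 1, 0, 0]  # 1 = spam, 0 = ham (toy data)
# "free" occurs only in spam, so it scores higher than "money",
# which occurs in only one spam document.
```

Wrapper methods would instead evaluate candidate feature subsets by actually training the classifier, which is more expensive but accounts for feature interactions.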
- I recommend comparing the obtained results with those reported in the literature for the same data sets.
All text and materials provided via this peer-review history page are made available under a Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.