
Review History
Transformers and capsule networks vs classical ML on clinical data for alzheimer classification

All reviews of published articles are made public. This includes manuscript files, peer review comments, author rebuttals and revised materials. Note: This was optional for articles submitted before 13 February 2023.

Peer reviewers are encouraged (but not required) to provide their names to the authors when submitting their peer review. If they agree to provide their name, then their personal profile page will reflect a public acknowledgment that they performed a review (even if the article is rejected). If the article is accepted, then reviewers who provided their name will be associated with the article itself.


Summary

The initial submission of this article was received on February 25th, 2025 and was peer-reviewed by 2 reviewers and the Academic Editor.
The Academic Editor made their initial decision on May 2nd, 2025.
The first revision was submitted on June 5th, 2025 and was reviewed by 2 reviewers and the Academic Editor.
A further revision was submitted on August 18th, 2025 and was reviewed by the Academic Editor.
The article was Accepted by the Academic Editor on August 20th, 2025.

Version 0.3 (accepted)

Shibiao Wan · Aug 20, 2025 · Academic Editor

Accept

The authors have addressed the remaining concerns. I recommend accepting this manuscript.

[# PeerJ Staff Note - this decision was reviewed and approved by Jyotismita Chaki, a PeerJ Section Editor covering this Section #]

Download Version 0.3 (PDF) Download author's response letter (v0.3) - submitted Aug 18, 2025

Version 0.2

Shibiao Wan · Jun 25, 2025 · Academic Editor

Major Revisions

There are some major concerns that need to be addressed.

Reviewer 1 · Jun 10, 2025

Basic reporting

Remaining concerns: the small LMCI sample size and generalization to external datasets.

Experimental design

Please change the title of the paper. Make it shorter while retaining the main idea.

Validity of the findings

The manuscript is well-structured and written in clear scientific English.

Annotated reviews are not available for download in order to protect the identity of reviewers who chose to remain anonymous.

Cite this review as

Anonymous Reviewer (2025) Peer Review #1 of "Transformers and capsule networks vs classical ML on clinical data for alzheimer classification (v0.2)". PeerJ Computer Science

Reviewer 2 · Jun 19, 2025

Basic reporting

No comment.

Experimental design

No comment.

Validity of the findings

No comment.

Additional comments

The authors have properly addressed all my concerns. Looking forward to trying the last version of your code.

Cite this review as

Anonymous Reviewer (2025) Peer Review #2 of "Transformers and capsule networks vs classical ML on clinical data for alzheimer classification (v0.2)". PeerJ Computer Science

Download Version 0.2 (PDF) Download author's response letter (v0.2) - submitted Jun 5, 2025

Version 0.1 (original submission)

Shibiao Wan · May 2, 2025 · Academic Editor

Major Revisions

The reviewers have substantial concerns about this manuscript. The authors should provide point-by-point responses that address all the concerns, and a revised manuscript with the revised parts marked in a different color.

**Language Note:** The review process has identified that the English language must be improved. PeerJ can provide language editing services - please contact us at [email protected] for pricing (be sure to provide your manuscript number and title). Alternatively, you should make your own arrangements to improve the language quality and provide details in your response letter. – PeerJ Staff

Reviewer 1 · Mar 31, 2025

Basic reporting

Improve English clarity and abstract.
Add statistical significance testing of results.
Explain model architectures and the training process in more detail.

Experimental design

Improve section organization and figure references.
Discuss overfitting and model generalization more explicitly.
Visualize feature importance and include ablation studies.

Validity of the findings

Include stratified k-fold cross-validation results if available.
Add a brief model interpretability discussion per model type.
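As an illustration of the stratified split requested above, here is a minimal sketch in plain Python. The function name `stratified_kfold`, the toy labels, and the class counts are hypothetical and are not taken from the manuscript. In practice a library routine such as scikit-learn's `StratifiedKFold` would normally be used instead.

```python
import random
from collections import defaultdict


def stratified_kfold(labels, k=5, seed=42):
    """Yield (train_idx, test_idx) pairs whose class mix mirrors the full data.

    Illustrative helper (hypothetical, not the authors' code).
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    # Shuffle within each class, then deal the indices round-robin into folds.
    folds = [[] for _ in range(k)]
    for y in sorted(by_class):
        idx = by_class[y]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)

    # Each fold serves once as the test set; the rest form the training set.
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        yield train, test


# Toy, made-up class counts with a deliberately small LMCI group.
toy_labels = ["CN"] * 50 + ["LMCI"] * 10 + ["AD"] * 40
splits = list(stratified_kfold(toy_labels, k=5, seed=42))
for train, test in splits:
    n_lmci = sum(toy_labels[i] == "LMCI" for i in test)
    print(len(train), len(test), n_lmci)
```

With these toy counts every test fold receives the same number of LMCI cases, which is the property that matters when one class is small.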

Additional comments

Please see the attached report.

Annotated reviews are not available for download in order to protect the identity of reviewers who chose to remain anonymous.

Cite this review as

Anonymous Reviewer (2025) Peer Review #1 of "Transformers and capsule networks vs classical ML on clinical data for alzheimer classification (v0.1)". PeerJ Computer Science

Reviewer 2 · Apr 7, 2025

Basic reporting

- English language usage needs revision. Numerous instances of awkward phrasing, minor grammatical errors, and typographical issues (e.g., “classiûcation” instead of “classification”, “Trasnformer” instead of “Transformer”) detract from the clarity. A thorough language edit by a native speaker or a professional editing service is highly recommended.

- The text occasionally presents long, complex sentences that could be broken up for better readability. Consider simplifying sentence structure where possible.

- The introduction provides a reasonable background on Alzheimer’s disease and the challenges of diagnosis, as well as a discussion of machine learning and deep learning approaches. However, while the context is set, the motivation for comparing specific models (e.g., CNN+DigitCapsule-Net, CNN+Transformer Encoder, traditional ML models) could be strengthened. Explicitly state the knowledge gap and the potential impact of the comparative study.

- There are several formatting inconsistencies (e.g., the placement of figures and tables, numbering of sections) that should be aligned with PeerJ’s standards.

- Section headings and subheadings are generally clear, but it would help to ensure that each section begins with a concise statement of its purpose.

- Overall, the figures are relevant to the content, but several images suffer from quality issues (low resolution or unclear labeling). It is strongly recommended that all figures be produced at high resolution and that axis labels, legends, and annotations are legible in both digital and print formats.

- Also, consider revising the color schemes of the plots to ensure that they are accessible (e.g., for color-blind readers) and conform to journal standards.

Experimental design

- The research question is well-defined: to compare the performance of advanced deep learning models (including a Transformer Encoder and DigitCapsule-Net) with traditional ML methods for classifying Alzheimer’s disease stages using clinical data. The rationale for focusing on clinical data, as opposed to more commonly used imaging data, is clearly stated and represents a novel approach. Regarding the ADNI database, reference it properly, either with a link or by citing a paper by the database providers.

- The methods section is comprehensive, describing data acquisition from the ADNI database, data pre-processing (including feature selection and handling missing values), and the application of several oversampling techniques (SMOTE, ADASYN, etc.) to balance the dataset. The description of each machine learning and deep learning model is also detailed, including the mathematical formulations (e.g., for SVM, gradient boosting, CNN, and Capsule-Net). This level of detail is commendable, but some parts could be streamlined to improve readability.

- Regarding the oversampling methods, please discuss the following points clearly: 1) SMOTE and ADASYN may not be advisable for high-dimensional data, noisy or mislabeled instances, significant class overlap, very small datasets, time series or sequential data, and datasets dominated by categorical variables. 2) In such cases, these techniques can generate unrealistic or noisy synthetic samples, amplify class confusion, or lead to overfitting. 3) Alternative strategies such as ensemble methods, cost-sensitive learning, anomaly detection, or domain-specific data augmentation may be more suitable depending on the context.

- The manuscript provides sufficient details about hyperparameters, model architectures, and the experimental pipeline (including the use of grid search and cross-validation). However, additional clarity on the random seed settings and software versions (beyond the brief mention of Google Colab specifications) would be beneficial for replication.

- The use of multiple evaluation metrics (accuracy, precision, recall, F1 score, confusion matrices, and ROC curves) demonstrates a rigorous approach to model validation. It would also be helpful to include statistical tests to compare model performances formally. For example, provide p-values or confidence intervals when claiming superiority of one model over another.
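As an illustration of the formal comparison suggested in the last point, the following minimal sketch pairs an exact McNemar test with a percentile bootstrap confidence interval for the accuracy difference between two classifiers evaluated on the same test set. The function names, the toy predictions, and the choice of these two procedures are assumptions for illustration only. They are not the authors' method, and other tests may suit the study design better. The bootstrap takes an explicit `seed`, in line with the reproducibility point above.

```python
import random
from math import comb


def mcnemar_exact(y_true, pred_a, pred_b):
    """Two-sided exact McNemar test on paired predictions (illustrative helper)."""
    # b: model A correct, model B wrong; c: model A wrong, model B correct.
    b = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a == t and p != t)
    c = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a != t and p == t)
    n = b + c
    if n == 0:
        return b, c, 1.0  # the models never disagree on correctness
    # Under H0 the discordant pairs follow Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(min(b, c) + 1)) / 2 ** n
    return b, c, min(1.0, 2 * tail)


def bootstrap_accuracy_diff(y_true, pred_a, pred_b, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for accuracy(A) - accuracy(B) on paired data."""
    rng = random.Random(seed)  # explicit seed for reproducibility
    n = len(y_true)
    diffs = []
    for _ in range(n_boot):
        # Resample test cases with replacement, keeping the pairing intact.
        idx = [rng.randrange(n) for _ in range(n)]
        acc_a = sum(pred_a[i] == y_true[i] for i in idx) / n
        acc_b = sum(pred_b[i] == y_true[i] for i in idx) / n
        diffs.append(acc_a - acc_b)
    diffs.sort()
    lower = diffs[int((alpha / 2) * n_boot)]
    upper = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Toy, made-up predictions: A errs on one case, B errs on six other cases.
y_true = [0, 1] * 10
pred_a = list(y_true)
pred_a[0] = 1 - pred_a[0]
pred_b = list(y_true)
for i in range(1, 7):
    pred_b[i] = 1 - pred_b[i]

b, c, p_value = mcnemar_exact(y_true, pred_a, pred_b)
ci_low, ci_high = bootstrap_accuracy_diff(y_true, pred_a, pred_b, seed=0)
print(b, c, p_value)
print(ci_low, ci_high)
```

McNemar's test uses only the cases on which the two models disagree, so it is appropriate when both models are scored on the same held-out samples. The bootstrap interval complements it by giving a range for the size of the accuracy difference.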

Validity of the findings

- The results section presents an extensive evaluation of the models on both balanced and unbalanced datasets. The inclusion of cross-validation helps to assess the robustness of the findings. The discussion of results is mostly quantitative. However, the manuscript would benefit from a more in-depth qualitative interpretation. Specifically, the implications of the differences between deep learning and traditional ML models should be further explored, including potential limitations of using clinical data exclusively.

- The comparative analysis is a strong aspect of the study. The novel incorporation of Transformer Encoders and Capsule Networks in this context is innovative. Nonetheless, the authors should discuss potential reasons why traditional models might still outperform advanced deep learning models in certain scenarios with structured clinical data, and consider including a discussion of computational efficiency or interpretability.

- While the study is comprehensive, the authors should acknowledge limitations more clearly. For example, the generalizability of the findings may be limited by the dataset size or the inherent biases in clinical data.

- The manuscript follows PeerJ’s guidelines on data sharing and reproducibility. Ensure that all raw data and code are accessible in the final published version.

Additional comments

The following is a general assessment of the manuscript: The paper addresses a timely and important challenge—enhancing early Alzheimer’s disease diagnosis through machine learning, offering a comprehensive methodological framework and a valuable comparison between deep learning and traditional techniques. However, the manuscript requires extensive language editing to improve clarity, along with revisions to several figures for better quality and accessibility. Some method sections, particularly those with complex mathematical formulations, could be simplified or moved to supplementary materials to maintain narrative flow. The discussion should be expanded to better interpret the results, highlight clinical implications, and address limitations. Additionally, consistency in terminology and notation throughout the text needs to be ensured.

Cite this review as

Anonymous Reviewer (2025) Peer Review #2 of "Transformers and capsule networks vs classical ML on clinical data for alzheimer classification (v0.1)". PeerJ Computer Science

Download Original Submission (PDF) - submitted Feb 25, 2025

All text and materials provided via this peer-review history page are made available under a Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
