Challenges issues and future recommendations of deep learning techniques for SARS-CoV-2 detection utilising X-ray and CT images: a comprehensive review

View article
Loading...
PeerJ Computer Science

Main article text

 

Introduction

Relevant review articles

  • To investigate systematically on state-of-the-art deep learning breakthroughs on imaging models in SARS-CoV-2 medical diagnostics, including categorization, recent applications, methodology, data, and obstacles.

  • Discuss the overall findings, performances, uses, and research gaps, together with pertinent recent research references, basic datatype operations, deep learning models with fully illustrated structure and formulas, and the SARS-CoV-2 evaluation algorithm.

  • Critical comparison of all modern SARS-CoV-2 analysis methods based on deep neural networks including advantages, weaknesses, limitations, research directions, and concern about some of the most successful methods.

  • Experimentally analyze the twelve most sophisticated deep learning techniques for SARS-CoV-2 analysis, with a focus on the most useful technique.

  • To discuss the challenges, privacy, benefits and drawbacks of deep learning models and recommend the best deep learning tools for successful SARS-CoV-2 detection.

Rationale and intended audience of the research

Article selection process

Overview of deep learning approaches for sars-cov-2 analysis

Open source data for SARS-CoV-2 analysis

Data pre-processiong

Types of deep learning approaches

Performance metrics used in COVID-19 detection

Precision=(TP)TP+FP
Recall=(TP)TP+FN
F1=2(PrecessionRecall)Precession+Recall
Loss(y)=yilog(ˉyi)+(1yi)+log(1ˉyi)

Deep learning methods

Basic operating step in deep learning based COVID-19 detection

CNN method

Transfer learning method

  • LeNet: In 1989, LeCun et al. (1989) developed the LeNet CNN structure. A recent method for analysing SARS-CoV-2 using the LeNet model yielded an accuracy of 56.06 percent (Islam & Matin, 2020). Figure 11 depicts the fundamental structure of LeNet for analysing SARS-CoV-2 from the image.

  • AlexNet: AlexNet (Krizhevsky, Sutskever & Hinton, 2012) contained eight layers, with the first five being convolution layers, followed by max-pooling layers in some settings, and the last three being completely linked layers, as seen in Fig. 11. Which used the non-saturating ReLU activation function, which had better training accuracy than tanh and sigmoid. AlexNetcite19 for image-based SARS-CoV-2 analysis. Figure 11 shows the fundamental structure of AlexNet (Ardakani et al., 2020), which is used to analyse SARS-CoV-2 from images.

  • VGG: The Visual Geometry Group Network (VGG) (Simonyan & Zisserman, 2014) model does a good job at detecting SARS-CoV-2. ResNet (Ardakani et al., 2020) has observed recently SARS-CoV-2. Figure 11 shows the fundamental structure of VGG-16, which may be used to analyse SARS-CoV-2 from a picture.

  • DenseNet: The “Dense Convolutional Net” (Huang et al., 2017) (DenseNet) is a relatively new concept in neural nets for visual object recognition. SARS-CoV-2 detection is accomplished with success using DenseNet (Horry et al., 2020). Figure 11 shows the fundamental structure of VGG-16, which may be used to analyse SARS-CoV-2 from a picture.

  • InceptionNet: An Inception Module (Szegedy et al., 2016) in SARS-CoV-2 detection is accomplished with success utilizing Inception (Horry et al., 2020). Figure 11 shows the fundamental structure of VGG-16, which may be used to analyse SARS-CoV-2 from a picture.

  • ResNet: ResNet (Szegedy et al., 2016) is comparable to VGGNet, but it is eight times more profound (Too et al., 2019). SARS-CoV-2 was successfully evaluated with around 95 percent correctness by ResNet (Ardakani et al., 2020). Figure 11 depicts the fundamental structure of ResNet for analysing SARS-CoV-2 from an image.

  • Inception-ResNet-V2: Inception V3 (Szegedy et al., 2017) in SARS-CoV-2 detection has achieved favorable performance using the Inception as well as ReNet model together (Horry et al., 2020). Figure 11 depicts Inception-ResNet basic model.

  • XceptionNet: The Xception framework (Chollet, 2017) in SARS-CoV-2 detection with Xception yields a satisfactory result (Horry et al., 2020). Figure 11 shows the fundamental structure of XceptionNet, which is used to analyse SARS-CoV-2 from images.

  • GoogleNet: GoogleNet utilises Inception modules, which facilitate the network to choose from a wide range of convolutional filter sizes within every block. Figure 11 shows the GoogleNet convolutional neural network, which is based on the Inception architecture and is a type of convolutional neural network. GoogleNet is now detecting SARS-CoV-2 successfully. Figure 8 depicts the fundamental structure of GoogleNet for analysing SARS-CoV-2 from an image. Detection of SARS-CoV-2 from an X-ray image by utilising GoogLeNet-COD got 87.5% accuracy (Yu et al., 2020).

  • MobileNet: MobileNet is a reduced design that uses depth-wise separable CNN layers to build basic deep neural networks (Howard et al., 2017) in many standard deeper learning systems. Figure 11 shows the fundamental operating sequence of MobileNet to analyse SARS-CoV-2 from the image in a new application of MobileNet to identify SARS-CoV-2 (Ardakani et al., 2020).

RNN method

Attention method

Capsule network method

Hybrid method

Result analysis

Comparative analysis

Experimental analysis for model recommendation

Discussion and recommendation

Potential of the proposed method across different demographics and healthcare settings

Limitation of deep learning model

  • 1.

    Data quantity prerequisites for training (Subramaniam et al., 2023): Achieving significant performance gains over alternative approaches demands a substantial volume of data. Deep learning thrives on extensive datasets for effective training.

  • 2.

    Computational and space complexity (Asif et al., 2023): Time and space complexity issues in SARS-CoV-2 detection using X-ray image deep learning are multifaceted. Training’s extensive duration due to complex architectures and large datasets delays model development. Deployment encounters high inference time, hampering real-time diagnosis in clinical settings. Resource-intensive computations require robust hardware, limiting deployment in resource-constrained environments. Large model and dataset sizes raise storage and infrastructure costs. Complex model interpretation hinders understanding of prediction rationale. Balancing overfitting prevention and predictive accuracy is delicate. Ensuring model generalization across diverse conditions adds complexity. Addressing these challenges is vital to harnessing deep learning’s potential for effective SARS-CoV-2 detection via X-ray images.

  • 3.

    Interpretability challenges (De Falco, De Pietro & Sannino, 2023; Islam et al., 2023b): Deciphering output solely based on training is intricate, often requiring the implementation of classifiers. Techniques rooted in convolutional neural networks are commonly employed to address this challenge.

  • 4.

    Black box problems (De Falco, De Pietro & Sannino, 2023): Deep learning models learn complex patterns from extensive image datasets that are not easily discernible by humans. However, their opaque decision-making process, known as the black box problem, creates trust, refinement, and application challenges, especially in clinical settings. Strategies like interpretable machine learning, model distillation, adversarial training, self-explaining neural networks, and visualization techniques aim to address this issue. Ongoing research seeks to enhance deep learning model reliability and usefulness, particularly in clinical contexts. The black box problem is particularly significant in deep learning due to its reliance on concealed patterns in large image datasets. This opacity undermines confidence in predictions, vital in healthcare where understanding model operations is key for informed decisions. Debugging and improving models are complicated by this problem, as identifying issues without comprehending internal workings hampers performance enhancements.

  • 5.

    Overfitting problems (Meedeniya et al., 2022): Overfitting is a concern in machine learning, including deep learning for SARS-CoV-2 detection via X-ray images. It happens when a model becomes overly attuned to training data, hindering its ability to generalize to new data. Consequently, this can result in inaccurate predictions. Overfitting arises from factors like a small training dataset, limiting the model’s exposure to genuine patterns. Excessive model complexity exacerbates this issue by causing the model to memorize data rather than learn meaningful insights. Regularization, a technique to prevent overfitting, introduces a penalty to the model’s loss function, promoting simplicity and curbing the model’s tendency to grasp irrelevant training data patterns.

  • 6.

    Lack of contextual understanding (Hu et al., 2022): Deep learning models used for SARS-CoV-2 detection from X-ray images learn patterns associated with the disease but lack contextual understanding, leading to potential inaccuracies in predictions and challenges in explanation. For instance, the model might identify SARS-CoV-2-related features without comprehending their significance within the overall image. This hinders its ability to explain predictions and poses trust issues in clinical settings. To tackle this limitation multimodal data, transfer learning methods and interpretable machine learning can be employed. Real-time SARS-CoV-2 data problems:

  • 7.

    Real-life data problems (Srivastava et al., 2022): Real time data helps to detect and predict disease precisely. In SARS-CoV-2 detection using X-ray image deep learning includes the following real time data problems:

  • (a)

    Data scarcity (Ali, Grönlund & Shah, 2023): Limited SARS-CoV-2 data due to the disease’s novelty hampers accurate model training, reducing comparability with diseases possessing more data; (b) Data imbalance (Calderon-Ramirez et al., 2021; Chamseddine et al., 2022): Unequal representation of SARS-CoV-2 cases vs non-SARS-CoV-2 cases makes it challenging for models to correctly identify the disease; (c) Data Noise (Momeny et al., 2021): Noisy data containing artifacts interferes with model learning, impacting accuracy; (d) Data bias (Catalá et al., 2021): Biased data not representing diverse populations reduces model accuracy for certain groups.

Drawbacks of different deep learning models

Ethical considerations and privacy issues in SARS-CoV-2 analysis

Challenges in SARS-CoV-2 analysis

  1. SARS-CoV-2 data generation (Phukan et al., 2022): Manual data generation is challenging in the field of health-related concern. We should be assisted by health or clinical institutions to get patient data. Permission and government allowance to make available for analysis is the big challenging issues in data generation.

  2. Data availability (Tsai et al., 2021): Most of data set for SARS-CoV-2 analysis is public and some are private. To analyse all these data sets, we have to be aware about its verification of labelling, augmentation and balancing (Tsai et al., 2021).

  3. SARS-CoV-2 data labelling legislation and verification (Yang et al., 2021): SARS-CoV-2 results are exchanged according to strict regulations in countries all over the world. One of the primary procedures specifically states that the bare minimal amount of specimens and data from infected individuals must be gathered in the shortest amount of time. As a result, analysis is becoming more and more difficult to perform. Before using data to analyse, proper data labelling and its correctness need to be confirmed by the source and its released licence.

  4. SARS-CoV-2 data augmentation (Nishio et al., 2020): SARS-CoV-2 data augmentation approaches are becoming more popular for analysing output quality. The data augmentation field demands new research and study in order to provide fresh/synthetic data with sophisticated applications. Artificial Intelligence tough to use GANs to create high-resolution images, for example. If there are biases in the original dataset, there will be biases in the data supplemented from it. As a result, figuring out the optimal data augmentation strategy is essential.

  5. Noisy SARS-CoV-2 data (Momeny et al., 2021): The presence of noisy SARS-CoV-2 data in a data set can have a major impact on the accuracy of any useful information prediction. Much empirical research has demonstrated that data set noise significantly reduced classification accuracy and resulted in poor prediction outcomes.

  6. Quality and dimension of SARS-CoV-2 image data (Rajinikanth et al., 2020): SARS-CoV-2 image quality should be good quality other it may lead deep learning classifier to predict wrong classification. Normally, most of the raw data from medical institutes are in 3D form and deep learning models are not suitable to be trained with 3D data, it can deal with 2D data. So it should be a concern and solved.

  7. Imbalanced SARS-CoV-2 data (Calderon-Ramirez et al., 2021; Chamseddine et al., 2022): Because of the significantly lopsided class distribution and disproportionate misclassification costs, balanced categorization is particularly difficult. Properties including dataset size, label noise, and data dispersion add to the difficulty of imbalanced classification. Resampling SARS-CoV-2 data and other tools can handle imbalanced data.

  8. SARS-CoV-2 data preprocessing (Maity, Nair & Chandra, 2020): There are some challenges in SARS-CoV-2 dat preprocessing like: missing data, manual input, data inconsistency, regional formats, wrong data types, file manipulation, and missing anonymization. Data preprocessing with data cleaning, data integration, data transformation, and data reduction are all examples of data preprocessing. Data cleaning is a technique for removing noise and correcting data discrepancies.

  9. Model development, improvement (Chauhan, Palivela & Tiwari, 2021) and tuning (Lee et al., 2020) for SARS-CoV-2 detection: Because of the significantly lopsided SARS-CoV-2 class distribution and disproportionate misclassification costs, balanced categorization is particularly difficult. Properties including dataset size, label noise, and data dispersion add to the difficulty of imbalanced classification. Deep learning models have challenges if there are the following issues like: not enough training data, poor quality of data, irrelevant features, non-representative training data, overfitting and underfitting.

  10. Intensive training expenses (Gupta et al., 2023): The complexity of SARS-CoV-2 data models results in resource-intensive training processes. The computational requirements necessitate the use of numerous workstations and costly GPUs, leading to elevated operational expenses.

  11. Skill and expertise demand (Attallah, 2023): Deep learning lacks a standardized model selection process. Optimal selection demands an understanding of topology, training methodologies, and other intricate characteristics, often posing challenges for individuals with limited expertise.

Future scopes

Conclusion

  • Implications of this research:

    Through this research and analysis of SARS-CoV-2 detection with deep learning methods has broad implications. This research suggests that deep learning analysis is crucial for accurate and versatile detection, addressing knowledge gaps, benchmarking performance, and improving clinical applications. It guides how the DL method reduces workload, accelerates medical decisions, aids remote diagnosis, and presents ethical AI use and data privacy. It also enhances public awareness of AI’s healthcare role. Overall, deep learning analysis drives research, informs practice, and aids pandemic control.

    Furthermore, the study delves into highly performing models and proposes potential enhancements to current deep learning models, thereby inviting academic engagement in this domain. It is imperative to recognize that image-based deep learning systems offer limited insights into afflicted individuals. However, this assertion does not necessarily imply that deep learning algorithms can supplant the role of physicians or clinicians in clinical diagnosis. The lack of a standardized benchmark, data balance, and accurate labelling represents a notable drawback in SARS-CoV-2 deep learning diagnostic systems. In the near future, deep learning researchers are anticipated to collaborate closely with radiologists and medical specialists, forging efficient support systems for detecting SARS-CoV-2 infections, especially during initial diagnosis and determination of infection severity.

    The collaboration between deep learning experts and medical professionals is projected to yield substantial progress in identifying SARS-CoV-2-infected patients. For robust practice, image-based data analysis of SARS-CoV-2 demands ample volume and meticulous labelling. Prospective endeavours encompass real-time data processing, multimodal analysis, and predictive assistance for medical practitioners. A more comprehensive analysis involving diverse models on distinct image datasets is envisioned in forthcoming investigations. This will yield clearer insights into the performance and recommendations of the most adept models. Deep learning specialists, in collaboration with radiologists, will play a pivotal role in identifying SARS-CoV-2 infections. This review underscores the potential for building adaptive and high-performing SARS-CoV-2 analysis systems through multimodal approaches, particularly integrating image-based data analysis.

Appendix

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests related to this article.

Author Contributions

Md Shofiqul Islam conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Fahmid Al Farid conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, authored or reviewed drafts of the article, and approved the final draft.

F. M. Javed Mehedi Shamrat conceived and designed the experiments, performed the experiments, prepared figures and/or tables, and approved the final draft.

Md Nahidul Islam conceived and designed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Mamunur Rashid conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Bifta Sama Bari conceived and designed the experiments, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Junaidi Abdullah conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Muhammad Nazrul Islam conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Md Akhtaruzzaman conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Muhammad Nomani Kabir conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Sarina Mansor conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Hezerul Abdul Karim conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

This is a literature review.

Funding

This work was supported by Multimedia University, Malaysia, Grant No. MMUI/230023. There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

498 Visitors 527 Views 16 Downloads