Integrating whole genome sequencing and machine learning for predicting antimicrobial resistance in critical pathogens: a systematic review of antimicrobial susceptibility tests

Carlos M. Ardila; Pradeep K. Yadalam; Daniel González-Arroyave

doi:10.7717/peerj.18213

Integrating whole genome sequencing and machine learning for predicting antimicrobial resistance in critical pathogens: a systematic review of antimicrobial susceptibility tests

Carlos M. Ardila ^1,2, Pradeep K. Yadalam³, Daniel González-Arroyave⁴

1Basic Sciences Department, Faculty of Dentistry, Universidad de Antioquia, Medellin, Colombia

2CIFE University Center, Cuernavaca, Mexico

3Periodontics, Saveetha University, Saveetha, India

4Surgery, Universidad Pontificia Bolivariana, Medellin, Colombia

DOI: 10.7717/peerj.18213

Published: 2024-10-09
Accepted: 2024-09-11
Received: 2024-07-15

Academic Editor: Vladimir Uversky

Subject Areas: Computational Biology, Microbiology, Infectious Diseases, Data Mining and Machine Learning
Keywords: Antimicrobial resistance, Machine learning, Whole genome sequencing, Prediction models, Risk score

Copyright: © 2024 Ardila et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Ardila CM, Yadalam PK, González-Arroyave D. 2024. Integrating whole genome sequencing and machine learning for predicting antimicrobial resistance in critical pathogens: a systematic review of antimicrobial susceptibility tests. PeerJ 12:e18213 https://doi.org/10.7717/peerj.18213

The authors have chosen to make the review history of this article public.

Abstract

Background

Infections caused by antibiotic-resistant bacteria pose a major challenge to modern healthcare. This systematic review evaluates the efficacy of machine learning (ML) approaches in predicting antimicrobial resistance (AMR) in critical pathogens (CP), considering Whole Genome Sequencing (WGS) and antimicrobial susceptibility testing (AST).

Methods

The search covered databases including PubMed/MEDLINE, EMBASE, Web of Science, SCOPUS, and SCIELO, from their inception until June 2024. The review protocol was officially registered on PROSPERO (CRD42024543099).

Results

The review included 26 papers, analyzing data from 104,141 microbial samples. Random Forest (RF), XGBoost, and logistic regression (LR) emerged as the top-performing models, with mean Area Under the Receiver Operating Characteristic (AUC) values of 0.89, 0.87, and 0.87, respectively. RF showed superior performance with AUC values ranging from 0.66 to 0.97, while XGBoost and LR showed similar performance with AUC values ranging from 0.83 to 0.91 and 0.76 to 0.96, respectively. Most studies indicate that integrating WGS and AST data into ML models enhances predictive performance, improves antibiotic stewardship, and provides valuable clinical decision support. ML shows significant promise for predicting AMR by integrating WGS and AST data in CP. Standardized guidelines are needed to ensure consistency in future research.

Introduction

Antimicrobial resistance (AMR) is the ability of bacteria to withstand antimicrobial treatments, particularly antibiotics. Infections caused by antibiotic-resistant bacteria are a significant concern for modern healthcare, posing a serious public health risk (Ahmad et al., 2023). Projections estimate that bacterial infections could result in approximately 10 million deaths annually by 2050 (Lüftinger et al., 2023). A recent meta-analysis on the impact of resistant bacteria on human health revealed that in 2019, antibiotic-resistant bacteria (ARBs) directly caused 1.27 million deaths, with an additional 4.95 million deaths associated with ARBs (Antimicrobial Resistance Collaborators, 2022). Moreover, ARBs are identified as a leading cause of mortality in low-income countries (Antimicrobial Resistance Collaborators, 2022; Ruiz-Blanco et al., 2022). Infections with Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa, all classified as critical and high-priority pathogens (CP) by the World Health Organization (WHO) (Ahmad et al., 2023; Antimicrobial Resistance Collaborators, 2022), account for a significant portion of these deaths. In 2019, over 100,000 deaths were attributed to antimicrobial resistance (AMR) caused by a single pathogen-drug combination: methicillin-resistant S. aureus (MRSA). Six other combinations caused 50,000 to 100,000 deaths each, including multidrug-resistant E. coli, fluoroquinolone-resistant E. coli, carbapenem-resistant A. baumannii, carbapenem-resistant K. pneumoniae, and third-generation cephalosporin-resistant K. pneumoniae. Additionally, a recent assessment of the clinical pipeline revealed the development of 50 antibiotics, but only 12 showed efficacy against certain priority Gram-negative bacteria (Butler & Paterson, 2020; Pormohammad, Nasiri & Azimi, 2019).

Research indicates that the rapid administration of appropriate antimicrobials significantly improves patient outcomes. For instance, in cases of bacteremia, the risk of death doubles if effective antibiotics are not administered within 24 h. Globally, only about half of antibiotic prescriptions are accurate. Consequently, quick point-of-care diagnostic tests are crucial for addressing this issue (Ahmad et al., 2023; Milani et al., 2019).

The current culture-based methods for detecting and diagnosing pathogenic diseases are insufficient. Most culturable bacteria associated with diseases can be detected after 24-48 h of incubation. Additionally, pathogen identification often requires an extra 2-4 h, and if AMR is suspected, antibiotic susceptibility testing (AST) adds another 18-24 h. Consequently, the total time to collect patient samples and obtain information on antibiotic susceptibility patterns in clinical practice can range from 2 to 4 days at best (Ahmad et al., 2023; Taxt et al., 2020).

Emerging micro- and nanotechnologies for bacterial identification and AST include phenotypic methods like microfluidic-based bacterial culture, and molecular techniques such as multiplex PCR, hybridization probes, nanoparticles, synthetic biology, and mass spectrometry. While PCR and mass spectrometry have improved bacterial detection in positive cultures, they have limitations. PCR requires a predetermined target, and MALDI-TOF mass spectrometry is costly (Ahmad et al., 2023; Taxt et al., 2020; Humphries et al., 2023). Additionally, these methods, which assess one or more resistance genes, are inadequate for predicting antimicrobial susceptibility because resistance often results from a complex interplay of resistance genes, regulatory factors, and mutations that together produce a phenotypic susceptibility profile (Humphries et al., 2023).

Whole genome sequencing (WGS) offers a solution to some of these challenges by obviating the requirement for specialized primers or probes. Additionally, with the increasing affordability of real-time sequencing, WGS has emerged as a feasible alternative to the laborious, culture-dependent methods of the past (Ahmad et al., 2023). Moreover, genome sequencing data offer an additional dimension to AMR research, enabling the analysis of genetic pathways underlying AMR in individual strains (Su, Satola & Read, 2019; VanOeffelen et al., 2021). Several publicly accessible services have been established to assist in identifying AMR indications based on the presence of resistance-associated single nucleotide polymorphisms (SNPs) and genes (VanOeffelen et al., 2021; Bortolaia et al., 2020). Integration of AST data with genome sequences also holds promise in uncovering genomic regions directly involved in resistance, influenced by epistasis, or linked to the emergence of AMR (VanOeffelen et al., 2021; Hendriksen et al., 2019).

The utilization of machine learning (ML) methods for forecasting AMR indicators and pinpointing genetic regions associated with resistance has garnered considerable interest in recent literature (VanOeffelen et al., 2021; Anahtar, Yang & Kanjilal, 2021). ML technologies leverage a wide array of variables inherent in genomic data to construct nonlinear models that forecast phenotypic AST outcomes (Humphries et al., 2023). ML offers an alternative method for predicting AMR from sequence data without necessitating prior knowledge of chromosomal alterations or mobilizable genes (Aytan-Aktug et al., 2020; Moradigaravand et al., 2018a). Numerous ML approaches can consider the implications of multiple mutations and/or mobilizable genes. Various studies have employed different ML techniques to forecast AMR profiles for diverse bacterial species and drug combinations (Aytan-Aktug et al., 2020; Moradigaravand et al., 2018a; Drouin et al., 2019). The primary distinction among these studies lies in how bacterial genomes are transformed into features, which are subsequently input into ML algorithms (Aytan-Aktug et al., 2020). Unlike culture-based AST or nucleic acid amplification tests, which are frequently constrained in the scope of resistant phenotypes ascertainable through a single test, WGS-AST enables the simultaneous determination of antibiotic resistance phenotypes across the entirety of the genome. Furthermore, it facilitates the screening of phenotypes influenced by multiple loci with ease (Ahmad et al., 2023; Lüftinger et al., 2023; Humphries et al., 2023; Su, Satola & Read, 2019; VanOeffelen et al., 2021). However, no systematic review has been identified that has evaluated the ability of ML to predict antimicrobial resistance in CP using WGS. Therefore, the objective of this systematic review is to evaluate the efficacy of ML approaches in predicting AMR in CP, considering WGS-AST.

Materials and Methods

Protocol and registration

The systematic review followed a search methodology in line with PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines (Page et al., 2021). The review protocol was officially registered on PROSPERO.

Eligibility criteria

The systematic review was guided by a question formulated within the Population, Intervention, Comparison, and Outcomes (PICO) framework:

P: samples of patients with CP subjected to AST.

I: Machine learning approaches utilizing WGS-AST

C: Alternative prediction approaches.

O: Prediction of AMR.

This review included studies evaluating the efficacy of ML in predicting AMR in CP, utilizing WGS data and AST. Exclusion criteria comprised animal and in vitro studies, case series and case reports. Moreover, reviews, brief communications, conference proceedings, abstracts, and studies lacking essential information about ML s and predicted performance indicators were omitted.

Information sources

The search strategy involved reviewing various databases, including PubMed/MEDLINE, Web of Science, EMBASE, SCOPUS, and SCIELO, along with searching gray literature sources via Google Scholar. A comprehensive electronic database search was conducted from the inception of these databases until June 2024, without any language restrictions. Moreover, further records were identified by examining the reference lists and citations of all selected full-text papers for potential inclusion in this study.

Search strategy

The search included the following terms: “whole genome sequencing” AND “microbiome” AND “genomics” AND “genome” AND “antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” AND “antimicrobial susceptibility testing” AND “antimicrobial resistance” OR “antibiotic resistance” AND “microbial” OR “bacterial” AND “Escherichia coli” AND “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND “infection” AND “machine learning” OR “ machine learning algorithms “OR “deep learning” OR “prediction model” OR “risk assessment” OR “risk prediction”. These search methods employ database-specific syntax and operators to retrieve articles related to the provided queries. Adjustments can be required to adapt the single exploration functionality and syntax rules of each database. Table 1 displays the search strategies for each specified database using the provided terms.

Table 1:

Explorations managed in the selected databases.

Database	Search strategy
PubMed/MEDLINE	((“whole genome sequencing” AND “microbiome” AND “genomics” AND “genome”) AND (“antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” AND “antimicrobial susceptibility testing”) AND (”antimicrobial resistance” OR ”antibiotic resistance”) AND (”microbial” OR ”bacterial”) AND ”Escherichia coli ” AND “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND ”infection” AND (”machine learning” OR“machine learning algorithms” OR ”deep learning” OR ”prediction model” OR “risk assessment” OR “risk prediction”))
Scopus	TITLE-ABS-KEY((“whole genome sequencing” AND “microbiome” AND “genomics” AND “genome”) AND (“antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” AND “antimicrobial susceptibility testing”) (”antimicrobial resistance” OR ”antibiotic resistance”) AND (”microbial” OR ”bacterial”) AND ”Escherichia coli ” AND “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND ”infection” AND (”machine learning” OR “machine learning algorithms” OR ”deep learning” OR ”prediction model” OR “risk assessment” OR “risk prediction”))
Scielo	(“whole genome sequencing” AND “microbiome” AND” “genomics” AND “genome”) AND (“antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” AND “antimicrobial susceptibility testing”) AND (”Antimicrobial resistance” OR ”antibiotic resistance”) AND (”microbial” OR ”bacterial”) AND ”Escherichia coli ” AND “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND ”infection” AND (”machine learning” OR “machine learning algorithms” OR ”deep learning” OR ”prediction model” OR “risk assessment” OR “risk prediction”)
Embase	(“whole genome sequencing” AND “microbiome” AND“genomics” AND “genome”) AND (“antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” “antimicrobial susceptibility testing”) AND (“antimicrobial resistance’ OR ’antibiotic resistance’) AND (’microbial’ OR ’bacterial’) AND ’Escherichia coli’ AND ’Staphylococcus aureus’ AND ’Klebsiella pneumoniae’ AND ’Acinetobacter baumannii’ AND ’Pseudomonas aeruginosa’ AND ’infection’ AND (’machine learning’ OR “machine learning algorithms” OR ’deep learning’ OR ’prediction model’ OR “risk assessment” OR “risk prediction”)
Web of Science	TS=(“whole genome sequencing” AND “microbiome” AND ““genomics” AND “genome”) AND (“antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration” “antimicrobial susceptibility testing”) AND TS=(”Antimicrobial resistance” OR ”antibiotic resistance”) AND TS=(”microbial” OR ”bacterial”) AND TS=”Escherichia coli ” AND “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND TS=”infection” AND TS=(”machine learning” OR “machine learning algorithms” OR ”deep learning” OR ”prediction model” OR “risk assessment” OR “risk prediction”)
Google Scholar	“whole genome sequencing” AND “microbiome” AND “genomics” AND “genome”AND “antibiotic resistance genes” AND “antimicrobial resistance prediction “AND “disk diffusion antimicrobial tests” AND “agar dilution” AND “minimal inhibitory concentration”AND “antimicrobial susceptibility testing” AND ”Antimicrobial resistance” OR ”antibiotic resistance” AND ”microbial” OR ”bacterial” AND ”Escherichia coli” “Staphylococcus aureus” AND “Klebsiella pneumoniae” AND “Acinetobacter baumannii” AND “Pseudomonas aeruginosa” AND ”infection” AND ”machine learning” OR “machine learning algorithms” OR ”deep learning” OR ”prediction model” OR “risk assessment” OR “risk prediction”

DOI: 10.7717/peerj.18213/table-1

Study selection

Two investigators (CMA and DGA) individually assessed the eligibility of titles and abstracts, followed by a comprehensive review of full-text studies. Full-text evaluation was conducted independently to ascertain eligibility. When discrepancies arose, they were initially discussed between the two investigators to reach a consensus. If disagreements persisted after discussion, a third scholar (PKY) was consulted to provide an independent evaluation and make the final decision. Interobserver concordance was evaluated by means of the Kappa test for statistical significance, with a threshold of >90 indicating consistency.

Data collection

Two researchers (CMA and DGA) autonomously extracted information applying tailored data extraction strategies. A comparative study was managed to ensure consistency in the acquired information. The resistance report, variables utilized, machine learning approach, performance measures, and WGS data for model construction and validation were gathered from the revised articles. Furthermore, an orderly recording of points including authors, publishing year and country was carried out.

Assessment of bias risk and study quality in individual studies

The PROBAST instrument, which assesses both the risk of bias and the applicability of prediction model research for systematic reviews, was employed to evaluate bias (Moons et al., 2019). A total of 20 signaling items were examined across four domains: participants, predictors, results, and analysis. Moreover, the first three domains were evaluated for each involved investigation. The risk of bias was categorized as “high risk” if at least one question was answered “no” or “probably no” without suitable justification. A field was contemplated to have an “unclear risk” if indispensable report for some signaling item was absent, but there were no items that would classify the domain as high risk.

Summary measurements

Data from the included studies were collected using descriptive statistics such as mean differences, standard deviation values, and ranges, with a focus on continuous outcomes. If the papers demonstrated significant homogeneity, the possibility of managing a meta-analysis was assessed.

Results

Study selection

After searching as indicated, 773 studies were recognized in electronic databases. After subtracting duplicates and employing suitability conditions, 68 documents experienced a thorough full-text review. Omissions through this assessment were primarily due to the absence of WGS and AST of CP, or to inadequate data in the model validation procedure. Following the last phase of the suitability valuation, this study finally incorporated 26 articles. Figure 1 illustrates a detailed depiction of the examination flowchart.

Figure 1: Prisma flowchart.

Download full-size image

DOI: 10.7717/peerj.18213/fig-1

Features of the investigations

Table 2 summarizes the main characteristics of the 26 studies included in this review (Ahmad et al., 2023; Humphries et al., 2023; VanOeffelen et al., 2021; Aytan-Aktug et al., 2020; Noman et al., 2023; Yang & Wu, 2022; Stracy et al., 2022; Pearcy et al., 2021; Benkwitz-Bedford et al., 2021; Stanton et al., 2022; Ren et al., 2022; Wang et al., 2021; Sunuwar & Azad, 2021; Lüftinger et al., 2021; Májek et al., 2021; Khaledi et al., 2020; Hyun et al., 2020; Pataki et al., 2020; Macesic et al., 2020; Kim et al., 2020; Coolen et al., 2019; Nguyen et al., 2018; Moradigaravand et al., 2018b; Her & Wu, 2018; Pesesky et al., 2016; Davis et al., 2016). The exploration includes studies available between 2016 (Pesesky et al., 2016; Davis et al., 2016) and 2023 (Ahmad et al., 2023; Humphries et al., 2023; Noman et al., 2023). The investigations examine data from 104,141 microbial samples. Most of these studies were conducted in the United States and Europe. This table also indicates the antibiotics that were examined. The AMR of CP has been widely investigated utilizing various antibiotics. Among them, Ciprofloxacin, Ceftazidime, and Gentamicin were the most frequently utilized. These antibiotics were frequently assessed to comprehend the resistance patterns and trends in CP infections. E. coli, S. aureus and K. pneumoniae were the microorganisms most subjected to testing using WGS and AST methods. All the included studies specified resistance patterns of CP.

Table 2:

Summary of the main characteristics of the studies.

Authors and country	Number of microbial strains	Antibiotics studied	Critical and high-priority pathogens studied	Machine learning model	Assessment of performance
Noman et al., 2023, China	1,200	Ampicillin Amoxicillin Meropenem Cefepime Fosfomycin Ceftazidime Chloramphenicol Erythromycin Tetracycline Gentamycin Butirosin Ciprofoxacin	Pseudomonas aeruginosa	RF BioWeka	Sensitivity Specificity Accuracy Precision bACC
Humphries et al., 2023, USA	100	Cefepime	Escherichia coli	NGD	Accuracy AUC
Ahmad et al., 2023, Norway	21	Ampicillin Amikacin Ceftazidime Ciprofloxacin Gentamicin Imipenem	Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, and Acinetobacter baumannii	DCNN	Sensitivity Specificity Accuracy
Yang & Wu, 2022, Taiwan	9,548	Gentamycin Ciprofoxacin Imipenem Amikacin Ceftazidime Trimethoprim/ sulfamethoxazole Tobramycin Tetracycline Ampicillin/sulbactam Levofloxacin Metropenem	Acinetobacter baumannii Escherichia coli Klebsiella pneumoniae Staphylococcus aureus	XGBoost SVM RF DT	Precision Recall F1-score AUC
Stracy et al., 2022, Israel	1,113	Trimethoprim/sulfa Ciprofloxacin, Ofloxacin, Amoxicillin/Cefuroxime axetil Cephalexin, Nitrofurantoin Fosfomycin	Escherichia coli	LR	AUC
Pearcy et al., 2021, UK	3,616	Ampicillin Meropenem Aztreonam Cefoxitin Cefepime Cefuroxime Ciprofloxacin Levofloxacin Aminoglycosides Trimethoprim Tetracycline	Escherichia coli	GBDT	Accuracy AUCROC Precision Recall
Benkwitz-Bedford et al., 2021, UK	1,407	Chloramphenicol Ciprofloxacin Ceftriaxone Kanamycin Tetracycline Trimethoprim	Escherichia coli	LR GBDT NN	k-fold cross- validation
Stanton et al., 2022, USA	1,019	Carbapenems	Pseudomonas aeruginosa	AdaBoost	AUC k-fold cross- validation
Ren et al., 2022, Germany	1,509	Ciprofloxacin Cefotaxime Ceftazidime Gentamicin	Escherichia coli	LR RF SVM CNN	Recall Precision AUC k-fold cross- validation
Wang et al., 2021, China	673	Erythromycin Cefoxitin Oxacillin Clindamycin Chloramphenicol Ciprofloxacin Gentamicin Penicillin Trimethoprim/ Sulfamethoxazole Tetracycline	Staphylococcus aureus	LR SVM RBF	Sensitivity Specificity Accuracy AUC k-fold cross- validation
Van Oeffelen et al., 2021, USA	67,817	128 antibiotics	Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa	AdaBoost RF XGBoost	Accuracy AUC k-fold cross- validation
Sunuwar & Azad, 2021, USA	724	Doripenem Ertapenem Imipenem Meropenem	Escherichia coli, Klebsiella pneumoniae, and Pseudomonas aeruginosa	LR gNB SVM DT RF KNN LDA mNB AdaBoost GBDT ETC BG	Recall Precision AUC k-fold cross- validation
Lüftinger et al., 2021, Austria	8,704	Ampicillin Amikacin Ceftazidime Ciprofloxacin Gentamicin Imipenem Fluoroquinolones Piperacillin Tobramycin Ertapenem Imipenem Meropenem	Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa	XGBoost LR SCM	Sensitivity Specificity Accuracy k-fold cross- validation
Májek et al., 2021, Austria	19,521	30 antibiotics	Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa	XGBoost	Accuracy
Khaledi et al., 2020, Germany	414	Tobramycin Ceftazidime Ciprofloxacin Meropenem	Pseudomonas aeruginosa	SVM RF LR	Sensitivity k-fold cross- validation
Hyun et al., 2020, USA	2,332	14 antimicrobials	Staphylococcus aureusEscherichia coliPseudomonas aeruginosa	SVM	Accuracy Precision Recall AUC k-fold cross- validation
Pataki et al., 2020, Hungary	704	Ciprofloxacin	Escherichia coli	LR RF	Accuracy AUC k-fold cross- validation
Macesic et al., 2020, USA	386	Polymyxin	Klebsiella pneumoniae	LR RF SVC GBDT	Accuracy Precision Recall AUC k-fold cross- validation F1-score
Kim et al., 2020, USA	3,393	29 antibiotics	Escherichia coli, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa	XGBoost SVM 3-layer NN AdaBoost	Accuracy AUC k-fold cross- validation F1-score
Aytan-Aktug et al., 2020, Denmark	2,930	Ciprofloxacin Rifampin Streptomycin	Escherichia coli, and Staphylococcus aureus	RF NN	AUC k-fold cross- validation
Coolen et al., 2019, Netherlands	84	Cefotaxime Cefoxitin Ceftazidime	Escherichia coli	DT	Accuracy k-fold cross- validation
Nguyen et al., 2018, USA	1,668	20 antibiotics	Klebsiella pneumoniae	XGBoost	Accuracy k-fold cross- validation
Moradigaravand et al., 2018b, UK	1,936	11 antibiotics	Escherichia coli	RF GBDT NN LR	Accuracy k-fold cross- validation F1-score
Her & Wu, 2018, Taiwan	59	38 antibiotics	Escherichia coli	SVM NB RF AdaBoost	AUC k-fold cross- validation
Pesesky et al., 2016, USA	78	12 antibiotics	Escherichia coli and Klebsiella pneumoniae	LR RB	AUC k-fold cross- validation
Davis et al., 2016, USA	848	Carbapenem Methicillin	Staphylococcus aureus and Acinetobacter baumannii	AdaBoost	Accuracy AUC k-fold cross- validation F1-score

DOI: 10.7717/peerj.18213/table-2

Notes:

AdaBoost: Adaptive Boosting Decision Trees
AUC: Area Under the Receiver Operating Characteristic Curve
BG: Bagging Classifier
CNN: Convolutional Neural Network
NN: Neural Network
DCNN: Deep convolutional neural networks
DT: Decision Trees
ENLR: Elastic Net Regularized Logistic Regression
ETC: ExtraTrees Classifier
gNB: Gaussian Naive Bayes
GBDT: Gradient-Boosted Decision Trees
KNN: K-Nearest Neighbors
LDA: Linear Discriminant Analysis
LR: Logistic Regression
mNB: Multinomial Naïve Bayes
MCC: Matthew Correlation Coefficient
NGD: Next Gen Diagnostic
RF: Random Forest
RBF: Radial Basis Function
RB: Rules-Based Algorithms
SCM: Set Covering Machine;
SG: Stacked Generalization
SVM: Support Vector Machine
XGBoost: eXtreme Gradient Boosting
bACC: average of sensitivity and specificity

XGBoost, Random Forest (RF), logistic regression (LR), and support vector machine (SVM) were the ML models most frequently utilized in the studies included.

The most utilized metric for performance evaluation was the area under the receiver operating characteristic curve (AUC) and accuracy. Furthermore, the most common validation approach applied was k-fold cross.

Most studies suggest that integrating WGS and AST data into ML models is crucial for enhancing predictive performance, improving antibiotic stewardship, and offering valuable clinical decision support. Key aspects of each study are highlighted below.

Noman et al. (2023) demonstrate the effectiveness of ML-based feature selection using BioWeka and RF in predicting antimicrobial drug resistance in P. aeruginosa with high accuracy. The model achieved a AUC area of 0.91 and a mean accuracy of over 97% across 12 different antibiotic families. This approach enables early identification of patients at high risk of antibiotic resistance, allowing for informed decisions on empiric therapy and potentially reducing the spread of antibiotic-resistant infections. The model’s accuracy in detecting antibiotic resistance could have significant benefits for individuals, healthcare systems, and society, including improved patient outcomes, optimized antibiotic treatment, and enhanced infection prevention strategies. Preliminary data also demonstrate ML’s performance for clinically important antimicrobial-species pairs, encouraging further development of sequence-based susceptibility prediction and its validation for clinical practice (Humphries et al., 2023). Additionally, a combined workflow with quantitative polymerase chain reaction (QPM) and WGS complemented with deep learning data analyses could be transformative for detecting pathogens, characterizing AMR profiles, and providing valuable clinical decision support (Ahmad et al., 2023). ML personalized antibiotic recommendations based on patient history offer a means to reduce the emergence and spread of resistant pathogens (Stracy et al., 2022). Mutations associated with carbapenem resistance in P. aeruginosa are detected using an ML model incorporating genetic variations (Stanton et al., 2022). ML models outperform other conventional models, with the ability to identify mutations associated with AMR for each antibiotic (Ren et al., 2022). Well-curated AST datasets are essential for building high-quality ML models and advancing AI in biological sciences (VanOeffelen et al., 2021). A bioinformatics framework utilizing WGS-AMR data predicts resistance phenotypes and ranks AMR genes by importance (Sunuwar & Azad, 2021). Best practice techniques for AMR prediction from WGS data include genome distance-aware cross-validation and stacked generalization (Yang & Wu, 2022).

Predicting the functional impact of mutations using PROVEAN (Protein Variation Effect Analyzer) improves the predictive performance of AMR models for P. aeruginosa and E. coli (Májek et al., 2021). ML accurately predicts phenotypic resistance in K. pneumoniae and identifies genomic features determining susceptibility (Macesic et al., 2020). Variant detection methods and prediction models offer valuable tools for AMR research, achieving high accuracies through nested cross-validation (Kim et al., 2020). Species-independent models predict multi-AMR profiles for multiple species without losing robustness (Aytan-Aktug et al., 2020). WGS and ML algorithms differentiate ampC genotypes in E. coli based on phenotypic susceptibility testing (Coolen et al., 2019). ML algorithms predict antibiotic resistance with the best accuracy for AMR genes within the accessory part of the pan-genome in E. coli (Her & Wu, 2018). Rules-based and ML algorithms achieve high agreement with phenotypic diagnostics for predicting resistance, with genotype-based testing showing great promise as a diagnostic tool (Pesesky et al., 2016). AdaBoost machine learning classifiers accurately identify carbapenem resistance in A. baumannii and methicillin resistance in S. aureus (Davis et al., 2016). Pearcy et al. (2021) developed a computational pipeline combining ML and genome-scale metabolic models to understand the complex relationships between genetic determinants of resistance and metabolism in E. coli. The approach identified 225 known AMR-conferring genes and predicted 20 genetic determinants essential for growth and 17 linked to auxotrophic behavior. The study revealed clusters of AMR-conferring genes affecting similar metabolic processes, suggesting that adaptations in cell wall, energy, iron, and nucleotide metabolism are associated with AMR. Khaledi et al. (2020) investigated the use of genomic and transcriptomic data to predict antimicrobial resistance in P. aeruginosa. By analyzing the genomes and transcriptomes of 414 drug-resistant clinical isolates, they developed ML models that accurately predicted resistance to four commonly used antibiotics. The models achieved high sensitivity and predictive values (0.8−0.9 or >0.9) using gene presence/absence, sequence variation, and expression profiles. The study demonstrates the potential for a molecular resistance profiling tool to rapidly and reliably predict antimicrobial susceptibility, enabling earlier and more informed treatment decisions. Hyun et al. (2020) developed a ML workflow using pan-genomes and random subspace ensembles (RSEs) to detect AMR associations. This approach was applied to 288 S. aureus, 456 P. aeruginosa, and 1,588 E. coli genomes. The study found that RSEs outperformed traditional statistical tests and previous ensemble approaches, identifying 45 known AMR-conferring genes and alleles, as well as 25 candidate associations. The results confirmed existing knowledge of fluoroquinolone resistance mechanisms and suggested a simple mutational landscape for FQ resistance. This approach has the potential to predict AMR determinants in a wider range of microbial pathogens as larger datasets become available. Benkwitz-Bedford et al. (2021) used ML to predict the growth of 1.407 genetically diverse E. coli strains under exposure to subinhibitory concentrations of six classes of antimicrobials. The study found that whole-genome information was superior to known AMR genes in predicting growth yields and doubling times, with moderate correlations (0.63 and 0.59, respectively). The results identified genes and SNPs determining growth and recapitulated known AMR determinants. While the approach showed promise, the remaining missing heritability poses a challenge for achieving clinical-level accuracy and precision. The study highlights the potential of predictive modeling for understanding AMR and identifying genetic determinants of growth under antimicrobial exposure. These findings underscore the significance of ML in advancing our understanding of AMR and improving clinical decision-making to combat antibiotic resistance.

Table 3 displays the AUC values for ML prediction of AMR using WGS and AST in CP. The results show that RF, XGBoost, and LR emerged as the top-performing models, with mean AUC values of 0.89, 0.87, and 0.87, respectively. RF showed superior performance with AUC values ranging from 0.66 to 0.97, while XGBoost and LR showed similar performance with AUC values ranging from 0.83 to 0.91 and 0.76 to 0.96, respectively. The other models showed varying levels of performance, with some achieving high AUC values, such as NN (0.97) and SVM (0.96), while others showed lower performance, such as DT (0.60) and GBDT (0.75).

Table 3:

Comparative analysis of AUC values for various machine learning ML models.

Study	DT	GBDT	RF	Bio Weka	XGBoost	AdaBoost	NN	SVM	NGD	DCNN	RBF	RB	LR
Noman et al. (2023)			0.96	0.98
Humphries et al. (2023)									0.97
Ahmad et al. (2023)										0.95
Yang & Wu (2022)	0.85		0.96		0.97			0.95
Stracy et al. (2022)													0.76
Pearcy et al. (2021)		0.98
Benkwitz-Bedford et al. (2021)		0.90					0.85						0.75
Stanton et al. (2022)						0.60
Ren et al. (2022)			0.90					0.77		0.80			0.81
Wang et al. (2021)								0.96			0.96		0.96
Van Oeffelen et al. (2021)					0.92
Sunuwar & Azad (2021)	0.97	0.81	0.93			0.82		0.84					0.81
Lüftinger et al. (2021)					0.84								0.84
Májek et al. (2021)					0.86
Khaledi et al. (2020)			0.67					0.83					0.84
Hyun et al. (2020)								0.79-1
Pataki et al. (2020)			0.80										0.79
Macesic et al. (2020)		0.89	0.90					0.93					0.90
Kim et al. (2020)					0.91
Aytan-Aktug et al. (2020)			0.97				0.92
Coolen et al. (2019)	0.88
Nguyen et al. (2018)			0.92
Moradigaravand et al. (2018b)		0.91	0.84				0.82						0.78
Her & Wu (2018)			0.66			0.77		0.77
Pesesky et al. (2016)												0.89	0.91
Davis et al. (2016)						0.94

DOI: 10.7717/peerj.18213/table-3

Notes:

DT: decision tree
GBDT: gradient-boosted decision trees
RB: rules-based algorithm
RBF: linear radial basis function
RF: Random Forest
XGBoost: eXtreme Gradient Boosting
NN: Neural network
AdaBoost: Adaptive Boosting
WEKA: Data Mining Software in Java Workbench
LR: logistic regression
SVM: Support Vector Machinne

Notably, the predictors selected by these algorithms largely aligned with those identified by LR. Importantly, all ML models accurately predicted resistance patterns in CP across multiple antibiotics using data from WGS and AST.

Assessment of bias risk

As per the PROBAST instrument, studies focused on model development and validation exhibit a heightened risk of bias when participant data is sourced from existing databases such as routine care registries. If an evaluation is rated high for at least one domain, it should be regarded as having a “high risk of bias” or “high concern” concerning pertinence. Consequently, most reports incorporated in this systematic review were evaluated to possess a high risk of bias due to their inherent characteristics (Table 4).

Table 4:

Evaluation of risk bias (Moons et al., 2019).

Study	Risk of bias				Applicability			Overall
	Participants	Predictors	Outcome	Analysis	Participant	Predictor	Outcome	Risk of bias	Applicability
Noman et al. (2023)	–	+	+	+	+	+	+	-	+
Humphries et al. (2023)	–	+	+	+	+	+	+	-	+
Ahmad et al. (2023)	–	+	+	+	+	+	+	-	+
Yang & Wu (2022)	–	+	+	+	+	+	+	-	+
Stracy et al. (2022)	–	+	+	+	+	+	+	-	+
Pearcy et al. (2021)	–	+	+	+	+	+	+	-	+
Benkwitz-Bedford et al. (2021)	–	+	+	+	+	+	+	-	+
Stanton et al. (2022)	–	+	+	-	+	+	+	-	+
Ren et al. (2022)	–	+	+	+	+	+	+	-	+
Wang et al. (2021)	–	+	+	+	+	+	+	-	+
Van Oeffelen et al. (2021)	–	+	+	+	+	+	+	-	+
Sunuwar & Azad (2021)	–	+	+	+	+	+	+	-	+
Lüftinger et al. (2021)	–	+	+	+	+	+	+	-	+
Májek et al. (2021)	–	+	+	+	+	+	+	-	+
Khaledi et al. (2020)	–	+	+	+	+	+	+	-	+
Hyun et al. (2020)	–	+	+	+	+	+	+	-	+
Pataki et al. (2020)	–	-	-	?	+	+	+	-	+
Macesic et al. (2020)	–	+	+	+	+	?	+	-	?
Kim et al. (2020)	–	+	+	+	+	+	+	-	+
Aytan-Aktug et al. (2020)	–	+	+	+	+	+	+	-	+
Coolen et al. (2019)	–	+	+	+	+	+	+	-	+
Nguyen et al. (2018)	–	+	+	+	+	+	+	-	+
Moradigaravand et al. (2018b)	–	+	+	+	+	+	+	-	+
Her & Wu (2018)	–	+	+	-	+	+	+	-	+
Pesesky et al. (2016)	–	+	?	?	+	+	+	?	+
Davis et al. (2016)	–	+	+	+	+	+	+	-	+

DOI: 10.7717/peerj.18213/table-4

Notes:

+: low risk
=: high risk
?: unclear risk

Discussion

This study was conducted to evaluate ML predictions for AMR in CP utilizing WGS and AST was performed. Although LR was usually used for prediction, RF and XGBoost were also commonly utilized. Notably, RF demonstrated the highest AUC values compared to LR. Furthermore, other algorithms such as SVM, AdaBoost, and Neural Networks were utilized. Importantly, all ML models accurately predicted resistance patterns in CP across multiple antibiotics using WGS and AST.

To the best of our knowledge, this is the inaugural systematic review to evaluate the efficacy of ML models in predicting AMR utilizing WGS and AST specifically for critical and high-priority pathogens. This inquiry builds upon a previous systematic study proposing ML as a promising tool for AMR prediction (Tang et al., 2022). Alarmingly, nearly half of the research included in that publication did not delineate resistance patterns, whereas all studies reviewed herein did so. The integration of WGS and AST data into our investigation is imperative for enhancing the robustness and practicality of ML models in AMR prediction. By elucidating resistance patterns, our review elucidates the efficacy of these models in guiding antimicrobial therapy.

WGS data provide an alternative perspective on AMR, enabling researchers to assess the genetic pathways that confer AMR in each strain (Ahmad et al., 2023; Lüftinger et al., 2023; Humphries et al., 2023; Noman et al., 2023; Yang & Wu, 2022; Stracy et al., 2022; Pearcy et al., 2021; Benkwitz-Bedford et al., 2021; Khaledi et al., 2020; Hyun et al., 2020; Pataki et al., 2020; Macesic et al., 2020; Kim et al., 2020). Numerous publicly accessible resources have been developed to assist in identifying AMR indicators by detecting the presence of genes and single nucleotide polymorphisms that confer resistance (Su, Satola & Read, 2019; VanOeffelen et al., 2021). The combination of AST data with WGS also has the potential to unveil genomic regions directly involved in resistance, altered due to epistasis, or linked to the occurrence of AMR (VanOeffelen et al., 2021; Hendriksen et al., 2019). Several resources, such as the National Center for Biotechnology Information, European Molecular Biology Laboratory- European Bioinformatics Institute, Relational Sequencing TB Data Platform, AR Isolate Bank, and Pathogenwatch, offer genome datasets matched with AST data for further analyses like comparative genomics and modeling (VanOeffelen et al., 2021; Sayers et al., 2020; Matamoros et al., 2020).

While WGS provides a valuable genetic blueprint that can predict AMR, integrating AST data enhances the ability to confirm phenotypic resistance patterns. This approach addresses the inherent limitation that genomic data alone cannot fully account for all phenotypic expressions of resistance. The combination of WGS and AST data not only supports the prediction of resistance phenotypes but also helps to uncover genetic interactions, such as epistasis, that may influence AMR (VanOeffelen et al., 2021; Hendriksen et al., 2019). By employing a range of ML models, including ensemble methods like RF and XGBoost, we can better capture the complexity of AMR prediction, making these models more applicable to clinical practice.

In clinical settings, the implementation of such integrated approaches can improve the accuracy of AMR predictions, thereby guiding more effective antimicrobial therapy. We recommend that future studies continue to explore the synergistic use of WGS and AST data, alongside the development of more sophisticated ML models, to further refine the predictive power and clinical utility of these methods.

Several studies have underscored the importance of providing information about AMR testing. Without AST information, the AUC values ranged from 0.73 to 0.79. Nonetheless, incorporating AST led to even higher AUC scores, which ranged from 0.80 to 0.88 (Lewin-Epstein et al., 2021). Optimization replications indicate that, notwithstanding diffident AUC values, antibiotic selection guided by personalized antibiograms can equal or surpass physician achievement. Furthermore, such selection yielded coverage rates akin to those observed in real-world scenarios, while requiring fewer broad-spectrum antibiotics (Corbin et al., 2022). This underscores a persistent and critical challenge in antibiotic stewardship.

Likewise, it has been verified that the quality of initial data and the precision of metagenomic binning are crucial for the effectiveness of subsequent applications like genomic AST. A workflow designed for native samples with low bacterial complexity and adequate on-target sequencing depth demonstrates comparable performance to genomic AST on isolate sequencing data (Lüftinger et al., 2021).

The AUC serves as a widely adopted standard measure for assessing model functioning. This measured was identified as the main success indicator in both our study and a previous review on AMR. Nevertheless, the earlier evaluation did not encompass AST in all the scrutinized publications, nor did it evaluate WGS (Tang et al., 2022). Notably, there is a disparity in the range of AUC values between the two studies for LR outcomes (0.76−0.96 in our assessment versus 0.50−0.83), as well as for other ML results (0.48−0.92 versus 0.83−0.91 for XGBoost and 0.66−0.97 for RF in our study). The differences in selection criteria between the two assessments present challenges in comparing the outcomes directly. Nevertheless, it is conceivable that the incorporation of WGS-AST impacts the outcomes, and that the models react differently based on the input factors. In this framework, research has demonstrated that while comparing results across different settings has its limitations, some models established previously (Mintz, Chowers & Obolski, 2023) exhibit superior performance compared to other studies (Yelin et al., 2019; Feretzakis et al., 2020). Curiously, these ML models demonstrated good performance on a diverse dataset, which included various microorganisms, test informants, and clinical divisions (Mintz, Chowers & Obolski, 2023). A previous study (Feretzakis et al., 2020) predicted AMR using data from a specific medical unit, based on the Gram stain result of the sample, achieving an AUC of 0.72. In contrast, another study (Yelin et al., 2019) focused on predicting AMR exclusively in cases using urine samples and limited the analysis to three microorganisms, achieving an AUC of 0.83.

Typically, predictors are selected by means of both unadjusted and multivariate LR models. Here, usual input risk features contain AMR patterns, WGS, colonization, ART, and past AMR circumstances. These characteristics are narrowly associated with AMR and can be utilized as predictors in various ML models and risk score assessments (Goodman et al., 2016). However, it is challenging to determine if including additional variables, such as underlying disorders, improves prediction exactitude. Besides, well-known issues, such as the practice of proton pump inhibitors (PPIs) (Shang, Lin & Goetz, 2000), can be ignored in some studies. Consequently, further prospective research is needed to better understand the impact of PPI usage.

Another method for validating final predictors is to use feature selection processes (Tang et al., 2022; Mintz, Chowers & Obolski, 2023; Yelin et al., 2019; Feretzakis et al., 2020). While predictors identified by these algorithms align with those proposed by LR models or previous data, others, such as admission times, have ambiguous relationships with AMR. Domain expertise and a structured approach are considered essential for sorting through the substantial quantities of data from health organizations (Tang et al., 2022).

The current study’s findings suggest that a ML forecast based on WGS-AST could aid in guiding antibiotic recommendations for confirmed carbapenemase-producing CP infections. A previous systematic review (Tang et al., 2022) reported similar results. However, other studies have compared the efficacy of ML systems to risk scores, with inconsistent outcomes (Moran et al., 2020; Lee et al., 2021). Indeed, the findings in this field vary significantly. One comprehensive review, which aimed to develop diagnostic or prognostic clinical prediction models for binary outcomes using clinical data, found no evidence that ML outperformed LR, contradicting the results of two other systematic reviews. One review (Beunza et al., 2019) indicated that ML algorithms can enhance the diagnostic and prognostic capabilities of traditional regression techniques, while another (Sufriyana et al., 2020) recommended reanalyzing existing LR models for various outcomes and comparing them to algorithms adhering to established standards. Although risk scores can provide valuable bedside assistance, it is assumed that health organisms integrated with ML may address this concern by leveraging considerable volumes of information (Tang et al., 2022). The primary advantage of ML lies in its continuous learning development, leading to superior model exactitude and a broad range of uses in healthcare information. Dissimilar to conventional statistical approaches, ML does not rely on specific assumptions, which are frequently overlooked or critically examined in clinical information (Rajula et al., 2020). Consequently, the choice of algorithms would be directed by the investigation topic and the purpose framework.

Partial information is unavoidable in certain studies, leading to statistical difficulty and bias in ML projections. Another challenge is data disparity in the AMR prediction model (Tang et al., 2022). This discrepancy adversely affects calculation functioning, as classifiers incline to favor the majority class to diminish global inaccuracy proportions (Japkowicz, 2000). To alleviate this subject, methods such as resampling, correcting hyperparameters, and meticulous method choice may be used (Tang et al., 2022). Upcoming investigators must cooperate with distinct groups to perform high-grade models.

While classification-based ML models have been pivotal in predicting AMR phenotypes by categorizing pathogens as resistant or susceptible, it is essential to consider the regression-based approaches that directly predict the minimum inhibitory concentration (MIC) (Yang, Su & Wu, 2023). MIC is a critical quantitative measure that reflects the lowest concentration of an antibiotic required to inhibit bacterial growth. Classification models typically rely on established MIC breakpoints to determine resistance or susceptibility, as defined by standards from organizations such as the Clinical and Laboratory Standards Institute (CLSI) (Noman et al., 2023; Benkwitz-Bedford et al., 2021). However, these breakpoints are periodically reviewed and updated, which could affect the consistency of classification-based predictions. In contrast, regression models that predict MIC values provide a more detailed and adaptable understanding of the bacterial response to antibiotics, allowing for more precise phenotypic annotations (Yang, Su & Wu, 2023).

Relevant studies included in this systematic review underscore the importance of regression-based machine learning models in the accurate prediction of MIC values (Noman et al., 2023; Benkwitz-Bedford et al., 2021; Khaledi et al., 2020; Hyun et al., 2020; Pataki et al., 2020; Macesic et al., 2020; Nguyen et al., 2018). Nguyen et al. (2018) utilized WGSD to predict MICs across various bacterial species, demonstrating the potential of these models to enhance the understanding of resistance mechanisms at a quantitative level. Similarly, Pataki et al. (2020) employed ML models to predict MIC values, which enabled a more nuanced interpretation of antimicrobial resistance that goes beyond binary classification. Yang, Su & Wu (2023) further expanded on this approach by applying a pan-genome-based feature selection method to improve the accuracy of MIC predictions in Salmonella enterica. Their work highlighted that the selected genomic features, including novel genes not previously associated with AMR, contributed significantly to the accurate prediction of MIC. This suggests that regression-based models can uncover new genetic determinants of resistance, providing insights that are not easily captured by classification methods alone. Incorporating MIC prediction into AMR research thus offers a robust complement to classification-based approaches, enhancing the granularity and applicability of machine learning in clinical microbiology.

There is no well-established tool for evaluating bias risk in ML prediction studies. One study (Delpino et al., 2022) utilized the TRIPOD statement to characterize study quality, while other studies (Fleuren et al., 2020; Christodoulou et al., 2019) employed the QUADAS-2 criteria. The TRIPOD statement serves more as a checklist than a bias assessment tool, whereas the QUADAS-2 criteria are widely used to evaluate the quality of diagnostic accuracy studies (Whiting et al., 2011). PROBAST (Moons et al., 2019) has also been used to evaluate ML in predicting AMR, as demonstrated in Tang et al. (2022).

The present review is subject to limitations. The studies under review exhibited considerable variability stemming from variations in outcomes, predictors, ML, and hyperparameters, among other factors. Most of the studies included in the review were identified as having a substantial risk of bias, precluding the possibility of conducting a meta-analysis. Two systematic reviews focusing on ML-based prediction models noted notable disparities among the studies they evaluated (Tang et al., 2022; Fleuren et al., 2020). One of these reviews observed a heterogeneity exceeding 97% (Tang et al., 2022). Another study discovered that there was no discernible discrepancy in AUC between ML and LR across 145 assessments exhibiting a low risk of bias (0.00, 95% CI −0.18 to 0.18). Nevertheless, in 137 instances characterized by a high risk of bias, ML exhibited a substantially superior AUC of 0.34 (0.20−0.47) (Christodoulou et al., 2019). Cochrane advises exercising caution when interpreting data with an I² value exceeding 50%, as it signifies considerable heterogeneity (Higgins et al., 2003; Schroll, Moustgaard & Gøtzsche, 2011).

While our study focused on a subset of critical and high-priority pathogens identified by the WHO, including E. coli, S. aureus, K. pneumoniae, A. baumannii, and P. aeruginosa, we acknowledge that other important microorganisms were not included. Notably, Salmonella spp., as well as other WHO-designated high-priority pathogens such as Enterococcus faecium, Helicobacter pylori, and Campylobacter spp., were not part of our analysis. A systematic review that encompasses these additional microorganisms would provide a more comprehensive understanding of the global burden of antimicrobial resistance. This would be particularly important for informing public health policies and interventions aimed at reducing the transmission of AMR through various routes, including food and water. Furthermore, such a review would help identify knowledge gaps and research priorities for addressing AMR in a broader range of microorganisms. Future studies should consider including these microorganisms to provide a more complete picture of the global AMR landscape.

Conclusions

By integrating whole genome sequencing and antimicrobial susceptibility testing data in critical high priority pathogens, machine learning demonstrates significant potential for predicting antimicrobial resistance. Machine learning models, particularly Random Forest, XGBoost, and logistic regression, offer valuable clinical decision support by accurately predicting antimicrobial resistance in critical pathogens. This can assist healthcare providers in making informed treatment decisions, optimizing antibiotic use, and improving patient outcomes. Standardized guidelines are imperative to uphold consistency in forthcoming studies.

Supplemental Information

PRISMA checklist

DOI: 10.7717/peerj.18213/supp-1

Download

Rationale

DOI: 10.7717/peerj.18213/supp-2

Download

[1] Ahmad A, Hettiarachchi R, Khezri A, Singh Ahluwalia B, Wadduwage DN, Ahmad R. 2023. Highly sensitive quantitative phase microscopy and deep learning aided with whole genome sequencing for rapid detection of infection and antimicrobial resistance. Frontiers in Microbiology 14:1154620

[2] Anahtar MN, Yang JH, Kanjilal S. 2021. Applications of machine learning to the problem of antimicrobial resistance: an emerging model for translational research. Journal of Clinical Microbiology 59:e0126020

[3] Antimicrobial Resistance Collaborators. 2022. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399(10325):629-655

[4] Aytan-Aktug D, Clausen PTLC, Bortolaia V, Aarestrup FM, Lund O. 2020. Prediction of acquired antimicrobial resistance for multiple bacterial species using neural networks. MSystems 5(1):e00774–19

[5] Benkwitz-Bedford S, Palm M, Demirtas TY, Mustonen V, Farewell A, Warringer J, Parts L, Moradigaravand D. 2021. Machine learning prediction of resistance to subinhibitory antimicrobial concentrations from Escherichia coli genomes. MSystems 6(4):e0034621

[6] Beunza JJ, Puertas E, García-Ovejero E, Villalba G, Condes E, Koleva G, Hurtado C, Landecho MF. 2019. Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease) Journal of Biomedical Informatics 97:103257

[7] Bortolaia V, Kaas RS, Ruppe E, Roberts MC, Schwarz S, Cattoir V, Philippon A, Allesoe RL, Rebelo AR, Florensa AF, Fagelhauer L, Chakraborty T, Neumann B, Werner G, Bender JK, Stingl K, Nguyen M, Coppens J, Xavier BB, Malhotra-Kumar S, Westh H, Pinholt M, Anjum MF, Duggett NA, Kempf I, Nykäsenoja S, Olkkola S, Wieczorek K, Amaro A, Clemente L, Mossong J, Losch S, Ragimbeau C, Lund O, Aarestrup FM. 2020. ResFinder 4.0 for predictions of phenotypes from genotypes. Journal of Antimicrobial Chemotherapy 75(12):3491-3500

[8] Butler MS, Paterson DL. 2020. Antibiotics in the clinical pipeline in 2019. The Journal of Antibiotics 73:329-364

[9] Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. 2019. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Journal of Clinical Epidemiology 110:12-22

[10] Coolen JPM, Den Drijver EPM, Kluytmans JAJW, Verweij JJ, Lamberts BA, Soer JACJ, Verhulst C, Wertheim HFL, Kolwijck E. 2019. Development of an algorithm to discriminate between plasmid- and chromosomal-mediated AmpC β-lactamase production in Escherichia coli by elaborate phenotypic and genotypic characterization. Journal of Antimicrobial Chemotherapy 74(12):3481-3488

[11] Corbin CK, Sung L, Chattopadhyay A, Noshad M, Chang A, Deresinksi S, Baiocchi M, Chen JH. 2022. Personalized antibiograms for machine learning driven antibiotic selection. Communications Medicine 2:38

[12] Davis JJ, Boisvert S, Brettin T, Kenyon RW, Mao C, Olson R, Overbeek R, Santerre J, Shukla M, Wattam AR, Will R, Xia F, Stevens R. 2016. Antimicrobial resistance prediction in PATRIC and RAST. Scientific Reports 6:27930

[13] Delpino FM, Costa ÂK, Farias SR, Chiavegatto Filho ADP, Arcêncio RA, Nunes BP. 2022. Machine learning for predicting chronic diseases: a systematic review. Public Health 205:14-25

[14] Drouin A, Letarte G, Raymond F, Marchand M, Corbeil J, Laviolette F. 2019. Interpretable genotype-to-phenotype classifiers with performance guarantees. Scientific Reports 9:4071

[15] Feretzakis G, Loupelis E, Sakagianni A, Kalles D, Martsoukou M, Lada M, Skarmoutsou N, Christopoulos C, Valakis K, Velentza A, Petropoulou S, Michelidou S, Alexiou K. 2020. Using machine learning techniques to aid empirical antibiotic therapy decisions in the intensive care unit of a general hospital in Greece. Antibiotics 9(2):50

[16] Fleuren LM, Klausch TLT, Zwager CL, Schoonmade LJ, Guo T, Roggeveen LF, Swart EL, Girbes ARJ, Thoral P, Ercole A, Hoogendoorn M, Elbers PWG. 2020. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Medicine 46(3):383-400

[17] Goodman KE, Lessler J, Cosgrove SE, Harris AD, Lautenbach E, Han JH, Milstone AM, Massey CJ, Tamma PD, Antibacterial Resistance Leadership Group. 2016. A clinical decision tree to predict whether a bacteremic patient is infected with an extended-spectrum β-lactamase-producing organism. Clinical Infectious Diseases 63(7):896-903

[18] Hendriksen RS, Bortolaia V, Tate H, Tyson GH, Aarestrup FM, McDermott PF. 2019. Using genomics to track global antimicrobial resistance. Frontiers in Public Health 7:242

[19] Her HL, Wu YW. 2018. A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains. Bioinformatics 34:i89-i95

[20] Higgins JP, Thompson SG, Deeks JJ, Altman DG. 2003. Measuring inconsistency in meta-analyses. BMJ 327(7414):557-560

[21] Humphries RM, Bragin E, Parkhill J, Morales G, Schmitz JE, Rhodes PA. 2023. Machine-learning model for prediction of cefepime susceptibility in Escherichia coli from whole-genome sequencing data. Journal of Clinical Microbiology 61:e0143122

[22] Hyun JC, Kavvas ES, Monk JM, Palsson BO. 2020. Machine learning with random subspace ensembles identifies antimicrobial resistance determinants from pan-genomes of three pathogens. PLOS Computational Biology 16:e1007608

[23] Japkowicz N. 2000. Learning from imbalanced data sets: a comparison of various strategies. AAAI Technical Report WS-00-05. Washington, D.C.: AAAI

[24] Khaledi A, Weimann A, Schniederjans M, Asgari E, Kuo TH, Oliver A, Cabot G, Kola A, Gastmeier P, Hogardt M, Jonas D, Mofrad MR, Bremges A, McHardy AC, Häussler S. 2020. Predicting antimicrobial resistance in Pseudomonas aeruginosa with machine learning-enabled molecular diagnostics. EMBO Molecular Medicine 12(3):e10264

[25] Kim J, Greenberg DE, Pifer R, Jiang S, Xiao G, Shelburne SA, Koh A, Xie Y, Zhan X. 2020. VAMPr: variant mapping and prediction of antibiotic resistance via explainable features and machine learning. PLOS Computational Biology 16(1):e1007511

[26] Lee ALH, To CCK, Lee ALS, Chan RCK, Wong JSH, Wong CW, Chow VCY, Lai RWM. 2021. Deep learning model for prediction of extended-spectrum beta-lactamase (ESBL) production in community-onset Enterobacteriaceae bacteraemia from a high ESBL prevalence multi-centre cohort. European Journal of Clinical Microbiology and Infectious Diseases 40(5):1049-1061

[27] Lewin-Epstein O, Baruch S, Hadany L, Stein GY, Obolski U. 2021. Predicting antibiotic resistance in hospitalized patients by applying machine learning to electronic medical records. Clinical Infectious Diseases 72:e848-e855

[28] Lüftinger L, Májek P, Beisken S, Rattei T, Posch AE. 2021. Learning from limited data: towards best practice techniques for antimicrobial resistance prediction from whole genome sequencing data. Frontiers in Cellular and Infection Microbiology 11:610348

[29] Lüftinger L, Májek P, Rattei T, Beisken S. 2023. Metagenomic antimicrobial susceptibility testing from simulated native patient samples. Antibiotics 12:366

[30] Macesic N, Bear Don’t Walk OJ IV, Pe’er I, Tatonetti NP, Peleg AY, Uhlemann AC. 2020. Predicting phenotypic polymyxin resistance in Klebsiella pneumoniae through machine learning analysis of genomic data. MSystems 5:e00656–19

[31] Májek P, Lüftinger L, Beisken S, Rattei T, Materna A. 2021. Genome-wide mutation scoring for machine-learning-based antimicrobial resistance prediction. International Journal of Molecular Sciences 22:13049

[32] Matamoros S, Hendriksen RS, Pataki BÁ, Pakseresht N, Rossello M, Silvester N, Amid C, Aarestrup FM, Koopmans M, Cochrane G, Csabai I, Lund O, Schultsz C. 2020. The compare Ml-Amr group, accelerating surveillance and research of antimicrobial resistance—an online repository for sharing of antimicrobial susceptibility data associated with whole-genome sequences. Microbial Genomics 6(5):e000342

[33] Milani RV, Wilt JK, Entwisle J, Hand J, Cazabon P, Bohan JG. 2019. Reducing inappropriate outpatient antibiotic prescribing: normative comparison using unblinded provider reports. BMJ Open Quality 8:e000351

[34] Mintz I, Chowers M, Obolski U. 2023. Prediction of ciprofloxacin resistance in hospitalized patients using machine learning. Communications Medicine 3(1):43

[35] Moons KGM, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS, Reitsma JB, Kleijnen J, Mallett S. 2019. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: explanation and elaboration. Annals of Internal Medicine 170(1):W1-W33

[36] Moradigaravand D, Palm M, Farewell A, Mustonen V, Warringer J, Parts L. 2018a. Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data. PLOS Computational Biology 14:e1006258

[37] Moradigaravand D, Palm M, Farewell A, Mustonen V, Warringer J, Parts L. 2018b. Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data. PLOS Computational Biology 14:e1006258

[38] Moran E, Robinson E, Green C, Keeling M, Collyer B. 2020. Towards personalized guidelines: using machine-learning algorithms to guide antimicrobial selection. Journal of Antimicrobial Chemotherapy 75:2677-2680

[39] Nguyen M, Brettin T, Long SW, Musser JM, Olsen RJ, Olson R, Shukla M, Stevens RL, Xia F, Yoo H, Davis JJ. 2018. Developing an in silico minimum inhibitory concentration panel test for Klebsiella pneumoniae. Scientific Reports 8(1):421

[40] Noman SM, Zeeshan M, Arshad J, Deressa Amentie M, Shafiq M, Yuan Y, Zeng M, Li X, Xie Q, Jiao X. 2023. Machine learning techniques for antimicrobial resistance prediction of Pseudomonas Aeruginosafrom whole genome sequence data. Computational Intelligence and Neuroscience 2023:5236168

[41] Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, Chou R, Glanville J, Grimshaw JM, Hróbjartsson A, Lalu MM, Li T, Loder EW, Mayo-Wilson E, McDonald S, McGuinness LA, Stewart LA, Thomas J, Tricco AC, Welch VA, Whiting P, Moher D. 2021. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. International Journal of Surgery 88:105906

[42] Pataki BÁ, Matamoros S, Van der Putten BCL, Remondini D, Giampieri E, Aytan-Aktug D, Hendriksen RS, Lund O, Csabai I, Schultsz C, SPS COMPARE ML-AMR group. 2020. Understanding and predicting ciprofloxacin minimum inhibitory concentration in Escherichia coli with machine learning. Scientific Reports 10(1):15026

[43] Pearcy N, Hu Y, Baker M, Maciel-Guerra A, Xue N, Wang W, Kaler J, Peng Z, Li F, Dottorini T. 2021. Genome-scale metabolic models and machine learning reveal genetic determinants of antibiotic resistance in Escherichia coli and unravel the underlying metabolic adaptation mechanisms. MSystems 6(4):e0091320

[44] Pesesky MW, Hussain T, Wallace M, Patel S, Andleeb S, Burnham CD, Dantas G. 2016. Evaluation of machine learning and rules-based approaches for predicting antimicrobial resistance profiles in gram-negative bacilli from whole genome sequence data. Frontiers in Microbiology 7:1887

[45] Pormohammad A, Nasiri MJ, Azimi T. 2019. Prevalence of antibiotic resistance in Escherichia coli strains simultaneously isolated from humans, animals, food, and the environment: a systematic review and meta-analysis. Infection and Drug Resistance 12:1181-1197

[46] Rajula HSR, Verlato G, Manchia M, Antonucci N, Fanos V. 2020. Comparison of conventional statistical methods with machine learning in medicine: diagnosis, drug development, and treatment. Medicina 56:455

[47] Ren Y, Chakraborty T, Doijad S, Falgenhauer L, Falgenhauer J, Goesmann A, Hauschild AC, Schwengers O, Heider D. 2022. Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning. Bioinformatics 38(2):325-334

[48] Ruiz-Blanco YB, Agüero-Chapin G, Romero-Molina S, Antunes A, Olari LR, Spellerberg B, Münch J, Sanchez-Garcia E. 2022. ABP-Finder: a tool to identify antibacterial peptides and the gram-staining type of targeted bacteria. Antibiotics 11(12):1708

[49] Sayers EW, Beck J, Brister JR, Bolton EE, Canese K, Comeau DC, Funk K, Ketter A, Kim S, Kimchi A, Kitts PA, Kuznetsov A, Lathrop S, Lu Z, McGarvey K, Madden TL, Murphy TD, O’Leary N, Phan L, Schneider VA, Thibaud-Nissen F, Trawick BW, Pruitt KD, Ostell J. 2020. Database resources of the national center for biotechnology information. Nucleic Acids Research 48(D1):D9-D16

[50] Schroll JB, Moustgaard R, Gøtzsche PC. 2011. Dealing with substantial heterogeneity in Cochrane reviews. Cross-sectional study. BMC Medical Research Methodology 11:22

[51] Shang JS, Lin YS, Goetz AM. 2000. Diagnosis of MRSA with neural networks andlogistic regression approach. Health Care Management Science 3:287-297

[52] Stanton RA, Campbell D, McAllister GA, Breaker E, Adamczyk M, Daniels JB, Lutgring JD, Karlsson M, Schutz K, Jacob JT, Wilson LE, Vaeth E, Li L, Lynfield R, Snippes Vagnone PM, Phipps EC, Hancock EB, Dumyati G, Tsay R, Cassidy PM, Mounsey J, Grass JE, Bulens SN, Walters MS, Halpin AL. 2022. Whole-genome sequencing reveals diversity of carbapenem-resistant Pseudomonas aeruginosa collected through CDC’s emerging infections program, United States, 2016–2018. Antimicrobial Agents and Chemotherapy 66(9):e0049622

[53] Stracy M, Snitser O, Yelin I, Amer Y, Parizade M, Katz R, Rimler G, Wolf T, Herzel E, Koren G, Kuint J, Foxman B, Chodick G, Shalev V, Kishony R. 2022. Minimizing treatment-induced emergence of antibiotic resistance in bacterial infections. Science 375(6583):889-894

[54] Su M, Satola SW, Read TD. 2019. Genome-based prediction of bacterial antibiotic resistance. Journal of Clinical Microbiology 57(3):e01405–18

[55] Sufriyana H, Husnayain A, Chen YL, Kuo CY, Singh O, Yeh TY, Wu YW, Su EC. 2020. Comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: systematic review and meta-analysis. JMIR Medical Informatics 8(11):e16503

[56] Sunuwar J, Azad RK. 2021. A machine learning framework to predict antibiotic resistance traits and yet unknown genes underlying resistance to specific antibiotics in bacterial strains. Briefings in Bioinformatics 22:bbab179

[57] Tang R, Luo R, Tang S, Song H, Chen X. 2022. Machine learning in predicting antimicrobial resistance: a systematic review and meta-analysis. International Journal of Antimicrobial Agents 60(5–6):106684

[58] Taxt AM, Avershina E, Frye SA, Naseer U, Ahmad R. 2020. Rapid identification of pathogens, antibiotic resistance genes and plasmids in blood cultures by nanopore sequencing. Scientific Reports 10:7622

[59] Van Oeffelen M, Nguyen M, Aytan-Aktug D, Brettin T, Dietrich EM, Kenyon RW, Machi D, Mao C, Olson R, Pusch GD, Shukla M, Stevens R, Vonstein V, Warren AS, Wattam AR, Yoo H, Davis JJ. 2021. A genomic data resource for predicting antimicrobial resistance from laboratory-derived antimicrobial susceptibility phenotypes. Briefings in Bioinformatics 22(6):bbab313

[60] Wang W, Baker M, Hu Y, Xu J, Yang D, Maciel-Guerra A, Xue N, Li H, Yan S, Li M, Bai Y, Dong Y, Peng Z, Ma J, Li F, Dottorini T. 2021. Whole-genome sequencing and machine learning analysis of staphylococcus aureus from multiple heterogeneous sources in China reveals common genetic traits of antimicrobial resistance. MSystems 6(3):e0118520