Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation

Ali, Tabish; Ahmed, Sarfaraz; Aslam, Muhammad

doi:10.3390/antibiotics12030523

Open AccessReview

Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation

by

Tabish Ali

^1,†

,

Sarfaraz Ahmed

² and

Muhammad Aslam

^3,*,†

¹

Department of Civil & Environmental Engineering, Hanyang University, Seoul 04763, Republic of Korea

²

Department of Electronics & Computer Engineering, Hanyang University, Seoul 04763, Republic of Korea

³

Department of Artificial Intelligence, Sejong University, Seoul 05006, Republic of Korea

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Antibiotics 2023, 12(3), 523; https://doi.org/10.3390/antibiotics12030523

Submission received: 7 February 2023 / Revised: 1 March 2023 / Accepted: 3 March 2023 / Published: 6 March 2023

(This article belongs to the Special Issue Machine Learning for Antimicrobial Resistance Prediction)

Download

Browse Figures

Versions Notes

Abstract

Antimicrobial resistance (AMR) is emerging as a potential threat to many lives worldwide. It is very important to understand and apply effective strategies to counter the impact of AMR and its mutation from a medical treatment point of view. The intersection of artificial intelligence (AI), especially deep learning/machine learning, has led to a new direction in antimicrobial identification. Furthermore, presently, the availability of huge amounts of data from multiple sources has made it more effective to use these artificial intelligence techniques to identify interesting insights into AMR genes such as new genes, mutations, drug identification, conditions favorable to spread, and so on. Therefore, this paper presents a review of state-of-the-art challenges and opportunities. These include interesting input features posing challenges in use, state-of-the-art deep-learning/machine-learning models for robustness and high accuracy, challenges, and prospects to apply these techniques for practical purposes. The paper concludes with the encouragement to apply AI to the AMR sector with the intention of practical diagnosis and treatment, since presently most studies are at early stages with minimal application in the practice of diagnosis and treatment of disease.

Keywords:

antimicrobial resistance genes; artificial intelligence; deep learning; machine learning; challenges and opportunities

1. Introduction

The emergence and spread of antimicrobial resistance (AMR) is a major public health challenge presently faced by the world. AMR originates when micro-organisms, such as bacteria, viruses, fungi, and parasites, become resistant to the drugs that were once effective in treating infections caused by these micro-organisms. Globally, 1.27 million deaths have occurred each year due to AMR [1]. It is also important to note that the challenge of antimicrobial resistance is limited not only to bacteria but also to viruses. Recently, emerging and re-emerging viruses such as the COVID-19 pandemic, caused by SARS-CoV-2, have shown the importance of continuous research and development in developing new antiviral agents and vaccines to fight against such events. These events have forced the authorities to take necessary actions to tackle these challenges and propose appropriate diagnostic methods [2,3,4,5]. These approaches will make it possible to take the right steps to minimize the threat of AMR, using precautions and appropriate antibiotics.

However, antibiotic resistant genes (ARG) identification poses challenges as it requires high accuracy for practical treatment and robustness to quickly identify the problem [6]. Currently, characterization and diagnostic techniques used in laboratories do not provide enough information to effectively perform surveillance [6]. Furthermore, tests generate inconsistent results depending on environment and laboratory setup [7,8]. Data obtained from high sequencing such as whole-genome sequence, along with laboratory techniques, can be used to investigate the genetic variants of widespread AMR [6].

Recently, with the advancement in technology, computing power and data storage of computers have been immensely increased, allowing tremendous amounts of data to be processed and analyzed in a little amount of time. Therefore, the applications of artificial intelligence (AI) techniques, especially machine-learning (ML) and deep-learning (DL) techniques, are being used across different fields [9,10,11]. The application of AI is also increasing in the medical sector [12,13,14]. The metaverse is being developed for intelligent healthcare [13]. The authors in [12] listed Food and Drug Administration (FDA)-approved AI/ML-enabled devices across different medical fields. Significant increase in the approval of such AI-enabled devices is observed from 2018 onwards which constitutes about 85% of all approved devices. [12]. In total, around 531 AI/ML-enabled devices have been approved, and the majority of the approved devices are related to radiology. Among the others listed, five AI/ML-enabled approved devices are related to microbiology and four are related to pathology [12].

The ML/DL models extract interesting underlying relationships and patterns between features and prospective outcomes. There are three types of deep-learning models [9,10,11]: Supervised learning models are trained using input features with a corresponding target output to approximate and find underlying non-linear relationship, and these types of models are useful in regression and prediction. Unsupervised learning is trained only on input features to make clusters or groups among the input features. Reinforcement learning is trained based on rewards and penalties, mostly suitable for control and operations.

There are different ways in which machine learning has been applied to AMR. For example, AMR is being studied by sequence-based application of AI in [15,16,17]. AI has been applied to design new antibiotics, and generates a synergy of a combination of drugs [18,19]. The machine-learning algorithms analyze patterns in data on antimicrobial use and resistance to predict which micro-organisms are likely to develop resistance to certain drugs. This can help healthcare providers and policymakers make informed decisions about which drugs to use and how to use them [6]. Machine-learning models are used for surveillance of antimicrobial resistance [6]. These models analyze large amounts of data on antimicrobial use and resistance to identify emerging resistance patterns and potential hotspots of AMR. This helps public health authorities respond to outbreaks of resistant infections more quickly and effectively. For antimicrobial stewardship, machine learning is also used to optimize antimicrobial use in healthcare settings, for example by identifying the optimal combination of drugs to use for a particular infection or by predicting which patients are at risk of developing a resistant infection [20]. This can help reduce the overall burden of AMR.

Having mentioned these applications, the ML/DL models also encounter certain challenges during application in the field of antimicrobials [20,21]. An example of a major challenge is the availability of high-quality data on antimicrobials with balanced labels of susceptibility and resistance as well as those of intermediate category labels that overlap between susceptible and resistance [22]. Furthermore, there is the need to validate the results of machine-learning models, as the accuracy of these models can vary depending on the data and a particular algorithm applied. Additionally, the accuracies vary depending on experimental environmental and geographical locations, from where samples have been taken [21]. Improving accuracies of the models is another concern, as models with low accuracy cannot be applied for practical diagnostics/surveillance [21]. Furthermore, most of the research is limited to laboratory and experiments, and it is essential to apply these concepts to real-world problems [6,21]. Therefore, necessary research is needed to address these concerns, bridge gaps, and open new doors in the field of antibiotics.

This review article is organized as follows. Section 2 consists of details about ML/DL techniques for AMR. Section 3 will explain challenges, opportunities, and prospects. Finally, a conclusion is given in Section 4.

2. Artificial Intelligence (DL/ML) for Antimicrobial Resistance

Traditionally, to understand the mechanism of AMR, antimicrobial susceptibility testing (AST) is carried out based on phenotypic testing [22]. Phenotypes include information about the physical characteristics of micro-organisms, such as its shape, size and color. However, it takes considerable time to carry out this type of testing [23]. For instance, this testing takes 2 days for some bacterial pathogens, and a few weeks for slow-growing microbials [24,25]. Another type of data to study AMR is Genome sequences. Genome sequences are easier to extract owing to the reduction in costs and improved technology [26,27]. Further studies have also used environmental data such as temperature, humidity etc. to predict the occurrence of AMR [28].

AMR occurrences are predicted by different methods using these genotype data. DL/ML models are state-of-the-art tools that predict and interpret AMR [28]. These models map input features to the target labels in non-linear relationships [29]. The objective is to do regression or classification, or in some cases interpretation of the outcomes [28,30]. These models have shown good accuracy for antimicrobial susceptibility testing, if provided with enough data. Figure 1 shows the complete overview of the applications of these techniques in antimicrobial research. The following subsections give detail regarding the overall methodology of applying ML/DL for AMR prediction.

2.1. Overall Mechanism of ML/DL Models for the Prediction/Detection of AMR

Generally, prediction and classification problems are supervised learning problems, in which models are trained with the given input features to approximate a given target, also referred to as a “label”. The first step is data collection and data pre-processing. Data mostly consists of whole-genome sequences (WGS), and single-nucleotide polymorphisms (SNPs) with corresponding phenotypes [31,32]. For instance, in [31], WGSs used different isolates of E. coli strains from animals and human clinal samples. These data were both privately collected and available online as a public dataset. The aim of the paper was to study antibiotics, i.e., CIP (ciprofloxacin), CTX (cefotaxime), CTZ (ceftazidime), and GEN (gentamicin). The data consists of resistant and susceptible isolates. Features can also be generated by dividing sequences into length k, famously called k-mers [32], because it might be difficult to use complete genome strains, and also using small-length k-mers can help in identifying interesting insights of small sequences responsible for resistance [22,33,34,35,36,37].

The next important step is pre-processing and feature extraction. This consists of extracting reference alleles, variant alleles, and their position [31], after which the final SNP matrix can be built [31]. The SNPs can be encoded into chaos game representation (CGR) (A, G, C, T and N), label encoding, and one-shot encoding to train the machine-learning models [31]. For instance, to do label encoding, A, G, C, T and N in the SNP matrix can be converted into 1, 2, 3, 4, and 0 [31]. K-mers are also assigned labels with respective phenotypes and perform the encoding [32]. Data pre-processing and encoding and feature extraction all can easily be implemented using different Python packages [29]. Different machine-learning and statistical tools can also be used to generate important features. For example, a convolutional neural network (CNN) [38] with machine-learning models has been used to generate interesting features to predict AMRs.

As well as data management, different machine-learning models have been used in the literature for the prediction/classification of AMRs [31], e.g., logistic regression (LR), support vector machine (SVM), random forest (RF), and CNN. Similarly, the authors in [30,39] used a deep-learning model, which consists of layers of artificial neurons mimicking the human brain [39]. LR, RF and SVMs are implemented by the scikit-learn Python library, whereas CNN and other deep-learning architectures can be applied with TensorFlow and Python [29]. The basic idea of all these models is to generate a mathematical relationship between input features and target labels based on available data. Therefore, the selection of relevant data is very important. After training the models several times with the training data, they can draw a mapping and learn an underlying non-learning relationship [40].

Once the models are trained, they are tested against unseen data, also called test data, to validate their performance before being applied in a practical purpose. Different evaluation metrics, such as root mean square error (RMS), mean absolute error (MSE), accuracy, precision, recall and confusion matrix etc., can be used to evaluate the models [40,41]. Once accuracies are satisfied, then it can be applied for practical purposes. The following sections give details of each step.

2.2. Data Analysis and Data Management

DL/ML models require a huge amount of genotype data for AMR prediction. The genotype data usually have output labeled phenotypes. Shotgun DNA sequences from isolates are used usually as the input sequence; also, metagenomic DNA sequences can be used [39]. Furthermore, single-nucleotide variants (SNV) can also be applied as an important input feature. Antibiotic treatment-induced transcriptional responses are also used [42,43]. Varying breakpoints may be encountered when using the phenotypic data to train machine-learning models [44], e.g., different labs test drugs depending on local prescriptions. The MIC may be different from different labs depending on one- to two-fold dilutions from one laboratory to another, contributing to noise in the phenotypic output on which the model was trained [45,46]. The standard output labels are categorized into susceptible (S) and resistant (R) with very few data included in another category, intermediate (I). This implies that DL/ML models will have good accuracies when trained with data with clear MIC distributions. On the other hand, drugs with considerable mixing/overlapping between R and S can produce low accuracy. In such cases, I might be on the verge of the boundary [45,47,48]. ML models produced more than 95% accuracy on ciprofloxacin, irrespective of clinical breakpoint. The same model, when trained for azithromycin, produced lower accuracies of between 78 and 88%. The improvement of accuracies in such cases is one of the major concerns [16] because with lower accuracies it is highly risky to implement in diagnostics. The use of appropriate data features/labels, suitable robust models and optimizing the training parameters will help to achieve higher accuracy [16].

Obtaining data is a costly as well as a time-consuming process. Therefore, another important concern is to decide the quantity of data required in training. Although there is no specific rule, it depends on the quality of data and model robustness [49]. For instance, training a model to identify methicillin resistance in S. aureus needs examples of less than 100 to achieve accuracy of around 99% [49]. On the other hand, Pseudomona aeruginosa with a more variable genome might need thousands of examples to identify long resistances [50]. Another challenging concern in terms of data is to obtain evenly distributed susceptible and resistant categories, which includes a considerable range of MICs [51]. Training models on a balanced dataset should have higher sensitivity and lower specificity, and vice versa for models trained with skewed data [43] that contain higher representation of one type of data. To achieve generalization, the diversity of isolates must also need to be understood, i.e., the mechanism of resistance that varies regionally, and avoiding training on phylogenetic confounders [45,51]. Although various techniques have been proposed recently to control the population structure, deeper research is still needed. To consider all the major M. tuberculosis lineages—more than 10,000 M. tuberculosis isolates collected by the CRyPTIC Consortium worldwide—this will obtain enough samples of pyrazinamide-resistant isolates [45,52].

2.3. Prediction Strategies

Commonly, DL/ML models are trained on constant-length continuous or binary vectors. The raw input must be transformed into useful input features, which is called the feature-extraction process. The features are obtained from genome shotgun sequences by dividing sequences into sub-sequences of length k, famously called k-mers [22,33,34,35,36,37], then by marking the frequency of presence or absence of each k-mer. Then, k-mers within a given sample can be counted and transformed into vectors, making 4^k possibilities. Typically, the length of a k-mer ranges between 13 and 31 nucleotides. Longer k-mers are more specific but are error-prone in sequencing and require more training data [22,34,35,36,53]. Some other techniques of feature mapping including mapping antimicrobial resistance genes, or pangenomes. By doing so, variations in novel genes and sequences can be obtained, and can capture features depending on the absence or presence of genes and/or SNVs [43,54,55,56,57]. Furthermore, meteorological/environmental data have been used to predict the percentage of the occurrence of different AMR environments such as water [28]. Such cases are useful in terms of informing under what season or condition there is more probability of infection spread [28].

2.4. ML/DL Models

The basic idea of the DL/ML model is to build a model using a huge amount of data to capture the underlying non-linear relationship between the input features and outcomes, which would be difficult otherwise [29,40,58]. All DL and ML models are trained first on the training dataset. Once trained, these trained models are ready to be tested with unseen data. The first step is to preprocess the data and extract important and relevant input features. Then, data must be split into training, test, and validation data. First, the model is trained by feeding the training dataset features. The training process will optimize and obtain optimal parameters. During training, certain parts of training is used to validate and improve the optimization. Cross-validation is also used to make the model robust [40]. A deep-learning model consists of different hyperparameters. For a given problem, an optimal combination of these hyperparameters produce optimal results. Therefore, different techniques of hyperparameter optimization, such as Bayesian optimization, should be used to obtain the optimal combination hyperparameters [29,40]. By optimizing the models and selecting appropriate parameters, accuracy can be much improved.

Different models are suited to different types of datasets, and accordingly high accuracy can be achieved by selecting appropriate models depending on the nature of the problem or objective. For instance, simple neural networks and recurrent neural networks (RNN) [29,40] are suitable for regression problems, while convolutional neural networks (CNN) [38] are suitable for classification problems. In the case of AMR, machine-learning and deep-learning models are both used for classification as well as regression. Similarly, decision-tree methods are good at classification, and these models are suitable for tracing back the performance [28].

Therefore, one of the most important parts of applying DL/ML is to select the most appropriate model depending on the application and type of input features. Complex models usually have high variance, while relatively simpler models have higher bias [59]. Simpler models are easy to interpret, but these models might show low accuracy when it comes to complex features. Therefore, it is important to select models carefully, considering their appropriateness in terms of identification and interpretation. For instance, the authors in [38,60,61,62,63,64,65,66] used deep-learning and machine-learning models to identify different antibiotics. The authors in [38] used traditional machine learning and CNN to rapidly predict tuberculosis drug resistance accurately from genome sequences. For instance, in [60], mutations relevant to antimicrobial resistance in Mycobacterium tuberculosis are highlighted by a convolutional neural network. Interesting antibiotics were discovered by the authors in [61] using machine-learning models. In [62], deep learning is used to identify antimicrobial peptides from the human gut microbiome. The authors in [63] identified the mechanism action of antibiotics using interpretable machine-learning models. The authors in [64] used a machine-learning pipeline, mining the entire space of the peptide sequence to identify potential antimicrobial peptides. The authors in [65] used deep-transfer learning to obtain the robust prediction of antimicrobial resistance for novel antibiotics.

It is also very important that these models should be easily and powerfully interpretable when it comes to the application in the health sector or diagnostics [67,68,69,70]. Interpretable models should be able to evaluate individual input features, be traced back and forth, and make it possible to analyze and use the interactions of multiple features that are impacting on the target [28]. Decision tree-based models are classifiers that work in a hierarchy of internal nodes to evaluate the features based on variance. These models apply an explicit decision criterion until the final stage is achieved by classifying into a particular group [67,71]. Therefore, each node of these models is traceable. Therefore, it is possible to understand the features or interaction of features responsible for a particular decision. Gradient-boosting models—ensemble methods from decision tree—have been successfully used in AMR prediction [28,72,73]. Table 1 shows different DL/ML models used in AMR identification and applications. The table classifies different techniques based on their merits and demerits when applied to AMR problems.

2.5. Model Evaluation

Once the models are trained, they should be tested on different evaluation criteria. Different evaluation techniques indicate the strength of a model from different perspectives [41]. Usually, classification problems can be evaluated using a confusion matrix, more commonly when there are two class problems [41]. A confusion matrix evaluates the model based on the actual positive and negative cases against the predicted outcomes, which are true positives (TP), true negatives (TN) and false positives (FP) and false negatives (FN). TP stands for the actual positive that is predicted to be positive, and FP means actual negative cases that were predicted positives. Similarly, TN and FN can be explained. Figure 2 is an illustration of a confusion matrix of binary classification. Other evaluation criteria for classification are accuracy, recall and sensitivity [41]. In the case of regression problems, models are also evaluated based on accuracies and loss defined by root mean square error and correlation function, i.e., R2 score etc. [40] as given by equations in Table 2. This table gives the formula of different evaluation techniques.

2.6. Robustness of Different ML/DL Models in AMR Prediction

Table 3 shows the statical comparison of different models in terms of accuracy and robustness. For instance, the authors in [31] predicted resistance against antibiotics such as CIP, CTX, CTZ and GEN. The input data were obtained from E. coli strains which included around 1000 isotopes. SNPs were used as inputs. The SNPs were encoded in three different ways: label encoding, one-shot encoding, and CGR encoding. Four different models were trained, which includes CNN, LR, RF and SVM. Each model is separately trained for each type of encoding. On average, random forest regression predicted best for label encoding and CGR encoding with accuracies of 0.832 and 0.835, respectively. On the other hand, for one-shot encoding, CNN produced the best accuracy of 0.855. However, this research only considered SNPs, without considering longer genome sequences or k-mers. Therefore, it is difficult to find insights for the activities of components of the genome against these antibiotics. The authors in [66] also used the same data, but the objective was to identify new genes based on a deep CNN model. The idea was to train the base model against CIP. In addition, they used the trained weights of the CNN layers to predict the CTX, CTZ, and GEN with respective transferred models. However, the transferred model produced very low accuracies of around 40%. Therefore, a lot of improvement in these models is required.

The authors in [32] predicted AMR of Actinobacillus pleuropneumonia from WGS against five antibiotics (Tetracycline, Ampicillin, Sulfisoxazole, Trimethoprim, and Enrofloxacin). There were 96 isolated strains from A. pleuropneumonia. K-mers of the strain and the reference genes of the specific antimicrobials were used as input features. Two models were used, namely referenced-based SVM and reference-free Set Covering Machine (SCM). The accuracies of both the models were shown to be around 100%. This result indicates that the data were easily distinguishable between susceptible and resistance and did not include overlapping or intermediate cases. Therefore, such methods must be further investigated on different types of balanced data from different geographical locations. Otherwise, it would be difficult to use for practical purposes.

A strong deep-learning model, DeepARG [39] is compared with hierarchical multitask (HMD)-ARG by the authors in [30], to predict and classify ARGs. Input amino acids were encoded using one-shot encoding. The aim of the paper was to classify ARGs from non-ARGs, ARG antibiotic class classification, antibiotic mechanism classification, antibiotic mobility classification and beta-lactamese Amble classification. In ARG/non-ARG classification, the accuracy of DeepARG was 0.965 whereas that of HMD-ARG was 0.948. HMD-ARG was applied to classify the mechanism of ARG antibiotics with 0.936 accuracy. Furthermore, HMD-ARG was able to classify ARG antibiotic mobility with 0.909 accuracy. It also classified beta-lactamase Amble with 0.995 accuracy. However, in HMD-ARG, the inputs are assembled sequences, and its application scenarios may be limited, so cannot work on short reads unless heavy computational pre-processing is done. DeepARG [39] was trained on 30 categories of ARGs with the intention of predicting among these categories only; therefore, any unknown category/genes might not be predicted accurately. The objective of some other models and their limitations are shown in Table 3 as well.

3. ML/DL for AMR Prediction: Challenges and towards Practical Implementation

3.1. Challenges

Although artificial intelligence (AI) techniques can do wonders in finding AMR, paving new paths to rapid diagnostics, more accurate treatment, and cure, these opportunities come with challenges [6,20]. The mechanism of antibiotics and drugs is not completely understood, particularly in the case of newly arising diseases [6]. Furthermore, with mutations and other changes, it makes it more difficult to understand these mechanisms [77]. Additionally, the resistance behavior changes from cell level to micro-organism community level. For instance, in response to some cellular stressors [78] or antibiotics, a sub-population of bacteria might persist, and the capacity of a genome for resistance might quickly be augmented via HGT during biofilm formation [79]. These types of challenges are difficult to tackle, even if genome sequences are available. Therefore, AI models might struggle to learn the underlying mechanism with this type of evolution in resistance. However, deep-learning models, if provided with enough data and designed in a right way, would provide interesting findings [30,39,66].

Another challenge is that currently most AI models treat genes or sequences of genes separately, i.e., univariate. Although these models are accurate in prediction, the phenotypes are sometimes an outcome of a combination of genes or a combination of input features, producing a combined impact non-linearly [80]. For example, sometimes a combination of metals and antibiotic resistance genes combined produces certain AMR [80]. The maintenance and spread of AMR is suggested to be increased by association [81]. Although metal resistance might not directly impact antibiotics, its combination with AMR genes has shown enhanced resilience to AMR. Since most current AI models use single independent features, it is difficult to capture these types of collaborative or associative impacts. Minimal studies have been carried out to investigate the combined impacts of features or genes [82]. Too many multivariate features make it difficult and challenging to design models that can analyze and understand the interactions of impactful multivariate features to produce a certain outcome.

Another major concern is that, until recently, the general categorization of AMR classification was a binary classification of susceptible or resistant. Although ML/DL models show good accuracies in diagnosing highly resistant or susceptible genes [83], they might produce low accuracies if the intermediate category is also included. Designing models to add another category of an intermediate phenotype will make the overall outcome more effective in practical implementation. However, considering the intermediate category might have certain challenges. There is no standardized clearly defined boundary in a widely applicable documentation of antimicrobials between susceptible, intermediate, and resistant cases [84]. Furthermore, the definition of susceptible and resistant also keeps changing. This inconsistency of definition is summarized in [85]. Furthermore, intermediate isolates are far rarer, which might make the training and testing dataset imbalanced, causing incorrect assumptions or outcomes by the model. Complexity might arise by making multi-class classification in terms of interpretation and accuracies.

Overall limitations and challenges are summarized in this paragraph. The availability of data is a major concern. There are limited data available on AMR, particularly for less common micro-organisms or for those that have been isolated from unusual environments [49]. This can make it difficult to train effective machine-learning models. Furthermore, the quality of data is another challenge. The quality of the data used to train machine-learning models can have a significant impact on performance. Poor-quality data, such as data that is noisy or contaminated, can lead to inaccurate results. Another challenge is the imbalance of data. In some cases, the data used to train machine-learning models may be imbalanced, with a disproportionate number of samples belonging to one class (e.g., resistant or susceptible) and not including the intermediate class [84]. This can make it difficult for the model to accurately classify the minority class. Identifying the most relevant features for use in machine-learning models is challenging too. It is important to select features that are predictive of AMR, but also to avoid including redundant or irrelevant features, as this can negatively impact model performance. Machine-learning models can be prone to overfitting, especially when they are highly complex [66]. This means that they may perform well on the training data, but poorly on unseen data. It is important to carefully tune the complexity of the model to achieve good generalization performance.

3.2. Towards Practical Application of AI in the Antimicrobial Sector

Currently, most of the research is restricted to laboratories and not yet implemented practically, although many research works are underway to make them practically applicable. Figure 3 summarizes prospective AI application in the AMR sector. Models are developed for hypothesis deduction on new AMR genes or mutation-variation mechanisms [86]. The outcomes of prediction from some models have been tried to be applied in diagnostics [51]. Integrating genomics to improve surveillance is becoming a hot topic of discussion [87]. Emerging AMR trends can be shown by monitoring known causal resistance genes, and transmission patterns can be revealed that can help in identifying and controlling outbreaks of resistant pathogens. Whole-genome sequence data are expanding, helping AI models to obtain high accuracies in surveillance [88,89,90]. AI models can learn highly impactful features, so that necessary steps can be taken beforehand.

Traditional practices of antimicrobial diagnostics are neither rapid nor intuitive. For instance, more than 24 h is required to perform the current susceptibility test, and it requires the expertise and care of a bioinformatician with minimal error to perform the whole-genome sequence of an antibiotic susceptibility test and huge amount of data is necessary for it to be processed [91]. Different studies have been carried out to minimize the time of diagnosis. For example, diagnosis time can be reduced to 3 h using a flowcytometry antimicrobial susceptibility test and ML [23]. An efficient way of genome data management using artificial intelligence is given in [92]. The establishment of optimal antibiotic use strategies in sepsis treatment using AI-based data-driven techniques is presented in [93]. Specifically, artificial intelligence can positively identify suitable actions, predict with high accuracy the mortality and the length of stay, improving the patient outcome. Commonly used phenotype test methods could have an accuracy of around 90% with Phoenix [94]. ML/DL models can improve these accuracies, if provided with appropriate data and training mechanisms. A revolutionary antimicrobial stewardship would be to design a tool that used personalized medicine based on quickly detecting a pathogen and its resistance profile from clinical samples. For example, in sepsis, mortality is adversely impacted by up to 20% by delayed effective antibiotic therapy [95]. Interests are also increasing in fast diagnostics for bloodstream infection. Such steps, in combination with antibiotic stewardship, would increase the outcome of patients [96]. Different techniques of artificial intelligence are applied in silico to predict new antibiotic molecules and to investigate synergy from drug combination [97]. Since 2014, around 14 new antibiotics have been developed and approved, and the application of artificial intelligence can speed up the pace of antibiotic discovery and production [98].

Artificial intelligence is also playing its part in ensuring clean water supply and good hygiene. Different ML models can predict the chances of the occurrence of antimicrobials in water [28]. Another work has been performed using AI to harness the water crisis [99]. The idea of this work is to ensure access to clean water and sanitation by reducing infectious diseases and the spread of AMR bacteria. Some major practical applications are water resource management, contamination detection, effluent quality improvement, and the monitoring of data.

For illustration, some practical examples are given in Figure 4 and Figure 5. Figure 4 illustrates a framework known as Bentham’s felicific calculus and its application to the decision to start antimicrobial treatment. It is an AI-based clinical decision support system (CDSSs) for antimicrobial optimization considering a moral framework known as Bentham’s felicific calculus. This framework helps to start treatment based on the ML outcome and moral framework [100,101]. Figure 5 shows antimicrobial susceptibility testing (AST), which is one of the most widely used methods for the diagnoses of AMR [102]. The conventional methods are not efficient, need huge datasets, and take more time compared to the AI-based techniques [51,103,104]. Studies have applied supervised machine learning to improve the AST method and reduced the conventional 24 h to a mere 3 h for the test [23,105]. Similarly, Lechowicz et al. developed an artificial neural network-based IR-spectrometer test to reduce AST time to just 30 min [24,25].

4. Conclusions

This article has reviewed state-of-the-art artificial intelligence in tackling antimicrobial-related challenges and opportunities. Artificial intelligence is doing wonders in different domains of humanity. Deep learning and machine learning are subfields of artificial intelligence, tackling challenges by using huge amounts of data. Presently, huge amounts of data related to antimicrobials can be obtained from different sources. Additionally, very powerful processing computers with the help of large storage devices can process these data in almost no time, giving interesting insights. This has led researchers in the antibiotic sector to use AI tools to solve challenges. Currently, a lot of work on the application of AI in antibiotics is under progress, which has opened new pathways. For instance, the amount of time taking in diagnostics is highly reduced from days to hours by applying AI. Furthermore, new AMR and mutations are being discovered with the help of AI. Antimicrobial quantity can be predicted in water resources, and so on. However, there are certain major challenges in the application of AI on AMR. For example, most applications consider only the output to be resistant or susceptible, without considering an intermediate category that overlaps the susceptible and resistant categories. This can produce a wrong diagnosis. Furthermore, generally univariate features are analyzed and related to antibiotic genes. However, it is known that multiple features are responsible for generating or identifying AMR. Therefore, multivariate/interactive models need to be designed. Results obtained by training imbalanced data are not dependable. Models are mostly trained on sequences of a particular geography that might not produce universal output. Data management is another big concern. The research on the application of AI into AMR is still underway, and more is yet to be discovered before consideration for clinical and healthcare application on an extensive level.

Author Contributions

Conceptualization, M.A. and T.A.; methodology, M.A. and T.A.; software, M.A. and T.A.; validation, S.A. and M.A.; formal analysis, S.A.; investigation, M.A., S.A. and T.A.; resources, M.A., S.A. and T.A.; data curation, T.A.; writing—original draft preparation, M.A. and T.A.; writing—review and editing, M.A., S.A. and T.A.; visualization, M.A.; supervision, M.A.; project administration, M.A. and S.A.; funding acquisition, M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank Sejong University, Seoul and Hanyang University, Seoul for providing the logistic support.

Conflicts of Interest

The authors declare no conflict of interest.

References

Murray, C.J.L.; Ikuta, K.S.; Sharara, F.; Swetschinski, L.; Robles Aguilar, G.; Gray, A.; Han, C.; Bisignano, C.; Rao, P.; Wool, E.; et al. Global burden of bacterial antimicrobial resistance in 2019: A systematic analysis. Lancet 2022, 399, 629–655. [Google Scholar] [CrossRef] [PubMed]
Subasi, A. Practical Machine Learning for Data Analysis Using Python; Elsevier: Amsterdam, The Netherlands, 2020. [Google Scholar] [CrossRef]
Alpaydin, E. Introduction to Machine Learning; MIT Press: Cambridge, MA, USA, 2020. [Google Scholar]
Yasmin, S.; Karim, A.-M.; Lee, S.-H.; Zahra, R. Temporal Variation of Meropenem Resistance in E. coli Isolated from Sewage Water in Islamabad, Pakistan. Antibiotics 2022, 11, 635. [Google Scholar] [CrossRef] [PubMed]
Schuler, M.S.; Rose, S. Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies. Am. J. Epidemiol. 2016, 185, 65–73. [Google Scholar] [CrossRef] [PubMed]
Rabaan, A.A.; Alhumaid, S.; Al Mutair, A.; Garout, M.; Abulhamayel, Y.; Halwani, M.A.; Alestad, J.H.; Al Bshabshe, A.; Sulaiman, T.; AlFonaisan, M.K.; et al. Application of Artificial Intelligence in Combating High Antimicrobial Resistance Rates. Antibiotics 2022, 11, 784. [Google Scholar] [CrossRef]
Díaz, I. Machine Learning in the Estimation of Causal Effects: Targeted Minimum Loss-Based Estimation and Double/Debiased Machine Learning. Biostatistics 2019, 21, 353–358. [Google Scholar] [CrossRef] [PubMed]
Sun, J.H.; Sun, F.; Yan, B.; Li, J.Y.; Xin, D.L. Data mining and systematic pharmacology to reveal the mechanisms of traditional Chinese medicine in Mycoplasma pneumoniae pneumonia treatment. Biomed. Pharmacother. 2020, 125, 109900. [Google Scholar] [CrossRef] [PubMed]
Mukhamediev, R.I.; Symagulov, A.; Kuchin, Y.; Yakunin, K.; Yelis, M. From Classical Machine Learning to Deep Neural Networks: A Simplified Scientometric Review. Appl. Sci. 2021, 11, 5541. [Google Scholar] [CrossRef]
Gupta, C.; Johri, I.; Srinivasan, K.; Hu, Y.-C.; Qaisar, S.M.; Huang, K.-Y. A Systematic Review on Machine Learning and Deep Learning Models for Electronic Information Security in Mobile Networks. Sensors 2022, 22, 2017. [Google Scholar] [CrossRef]
Mazlan, A.U.; Sahabudin, N.A.; Remli, M.A.; Ismail, N.S.N.; Mohamad, M.S.; Nies, H.W.; Abd Warif, N.B. A Review on Recent Progress in Machine Learning and Deep Learning Methods for Cancer Classification on Gene Expression Data. Processes 2021, 9, 1466. [Google Scholar] [CrossRef]
Joshi, G.; Jain, A.; Adhikari, S.; Garg, H.; Bhandari, M. FDA Approved Artificial Intelligence and Machine Learning (AI/Ml)-Enabled Medical Devices: An Updated 2022 Landscape. medRxiv 2022. [Google Scholar] [CrossRef]
Wang, G.; Badal, A.; Jia, X.; Maltz, J.S.; Mueller, K.; Myers, K.J.; Niu, C.; Vannier, M.; Yan, P.; Yu, Z.; et al. Development of metaverse for intelligent healthcare. Nat. Mach. Intell. 2022, 4, 922–929. [Google Scholar] [CrossRef]
Subbiah, V. The next generation of evidence-based medicine. Nat. Med. 2023, 29, 49–58. [Google Scholar] [CrossRef] [PubMed]
Boolchandani, M.; D’Souza, A.W.; Dantas, G. Sequencing-based methods and resources to study antimicrobial resistance. Nat. Rev. Genet. 2019, 20, 356–370. [Google Scholar] [CrossRef]
Macesic, N.; Polubriaginof, F.; Tatonetti, N.P. Machine Learning. Curr. Opin. Infect. Dis. 2017, 30, 511–517. [Google Scholar] [CrossRef]
Khaledi, A.; Schniederjans, M.; Pohl, S.; Rainer, R.; Bodenhofer, U.; Xia, B.; Klawonn, F.; Bruchmann, S.; Preusse, M.; Eckweiler, D.; et al. Transcriptome Profiling of Antimicrobial Resistance in Pseudomonas aeruginosa. Antimicrob. Agents Chemother. 2016, 60, 4722–4733. [Google Scholar] [CrossRef]
Nava Lara, R.; Aguilera-Mendoza, L.; Brizuela, C.; Peña, A.; Del Rio, G. Heterologous Machine Learning for the Identification of Antimicrobial Activity in Human-Targeted Drugs. Molecules 2019, 24, 1258. [Google Scholar] [CrossRef] [PubMed]
Weinstein, Z.B.; Bender, A.; Cokol, M. Prediction of synergistic drug combinations. Curr. Opin. Syst. Biol. 2017, 4, 24–28. [Google Scholar] [CrossRef]
Javaid, M.; Haleem, A.; Pratap Singh, R.; Suman, R.; Rab, S. Significance of machine learning in healthcare: Features, pillars and applications. Int. J. Intell. Netw. 2022, 3, 58–73. [Google Scholar] [CrossRef]
Wang, H.; Jia, C.; Li, H.; Yin, R.; Chen, J.; Li, Y.; Yue, M. Paving the way for precise diagnostics of antimicrobial resistant bacteria. Front. Mol. Biosci. 2022, 9, 976705. [Google Scholar] [CrossRef]
Davis, J.J.; Boisvert, S.; Brettin, T.; Kenyon, R.W.; Mao, C.; Olson, R.; Overbeek, R.; Santerre, J.; Shukla, M.; Wattam, A.R.; et al. Antimicrobial Resistance Prediction in PATRIC and RAST. Sci. Rep. 2016, 6, 27930. [Google Scholar] [CrossRef]
Inglis, T.J.J.; Paton, T.F.; Kopczyk, M.K.; Mulroney, K.T.; Carson, C.F. Same-day antimicrobial susceptibility test using acoustic-enhanced flow cytometry visualized with supervised machine learning. J. Med Microbiol. 2020, 69, 657–669. [Google Scholar] [CrossRef]
Lechowicz, L.; Urbaniak, M.; Adamus-Białek, W.; Kaca, W. The use of infrared spectroscopy and artificial neural networks for detection of uropathogenic Escherichia coli strains’ susceptibility to cephalothin. Acta Biochim. Pol. 1970, 60. [Google Scholar] [CrossRef]
Stuart, B. Infrared Spectroscopy. In Kirk-Othmer Encyclopedia of Chemical Technology; Wiley & Sons: New York, NY, USA, 2015; pp. 1–18. [Google Scholar]
Schürch, A.C.; van Schaik, W. Challenges and Opportunities for Whole-Genome Sequencing-Based Surveillance of Antibiotic Resistance. Ann. N. Y. Acad. Sci. 2017, 1388, 108–120. [Google Scholar] [CrossRef] [PubMed]
Cohen, K.A.; Manson, A.L.; Desjardins, C.A.; Abeel, T.; Earl, A.M. Deciphering Drug Resistance in Mycobacterium tuberculosis Using Whole-Genome Sequencing: Progress, Promise, and Challenges. Genome Med. 2019, 11, 45. [Google Scholar] [CrossRef]
Iftikhar, S.; Karim, A.M.; Karim, A.M.; Karim, M.A.; Aslam, M.; Rubab, F.; Malik, S.K.; Kwon, J.E.; Hussain, I.; Azhar, E.I.; et al. Prediction and interpretation of antibiotic-resistance genes occurrence at recreational beaches using machine learning models. J. Environ. Manag. 2023, 328, 116969. [Google Scholar] [CrossRef]
Aslam, M.; Lee, S.-J.; Khang, S.-H.; Hong, S. Two-Stage Attention Over LSTM with Bayesian Optimization for Day-Ahead Solar Power Forecasting. IEEE Access 2021, 9, 107387–107398. [Google Scholar] [CrossRef]
Li, Y.; Xu, Z.; Han, W.; Cao, H.; Umarov, R.; Yan, A.; Fan, M.; Chen, H.; Duarte, C.M.; Li, L.; et al. HMD-ARG: Hierarchical multi-task deep learning for annotating antibiotic resistance genes. Microbiome 2021, 9, 40. [Google Scholar] [CrossRef]
Ren, Y.; Chakraborty, T.; Doijad, S.; Falgenhauer, L.; Falgenhauer, J.; Goesmann, A.; Hauschild, A.-C.; Schwengers, O.; Heider, D. Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning. Bioinformatics 2021, 38, 325–334. [Google Scholar] [CrossRef] [PubMed]
Liu, Z.; Deng, D.; Lu, H.; Sun, J.; Lv, L.; Li, S.; Peng, G.; Ma, X.; Li, J.; Li, Z.; et al. Evaluation of Machine Learning Models for Predicting Antimicrobial Resistance of Actinobacillus pleuropneumoniae from Whole Genome Sequences. Front. Microbiol. 2020, 11, 48. [Google Scholar] [CrossRef] [PubMed]
Bradley, P.; Gordon, N.C.; Walker, T.M.; Dunn, L.; Heys, S.; Huang, B.; Earle, S.; Pankhurst, L.J.; Anson, L.; de Cesare, M.; et al. Rapid Antibiotic-Resistance Predictions from Genome Sequence Data for Staphylococcus aureus and Mycobacterium tuberculosis. Nat. Commun. 2015, 6, 10063. [Google Scholar] [CrossRef] [PubMed]
Ferreira, I.; Beisken, S.; Lueftinger, L.; Weinmaier, T.; Klein, M.; Bacher, J.; Patel, R.; von Haeseler, A.; Posch, A.E. Species Identification and Antibiotic Resistance Prediction by Analysis of Whole-Genome Sequence Data by Use of ARESdb: An Analysis of Isolates from the UNYVERO Lower Respiratory Tract Infection Trial. J. Clin. Microbiol. 2020, 58. [Google Scholar] [CrossRef]
Drouin, A.; Letarte, G.; Raymond, F.; Marchand, M.; Corbeil, J.; Laviolette, F. Interpretable Genotype-to-Phenotype Classifiers with Performance Guarantees. Sci. Rep. 2019, 9, 4071. [Google Scholar] [CrossRef]
Lees, J.A.; Vehkala, M.; Välimäki, N.; Harris, S.R.; Chewapreecha, C.; Croucher, N.J.; Marttinen, P.; Davies, M.R.; Steer, A.C.; Tong, S.Y.; et al. Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes. Nat. Commun. 2016, 7, 12797. [Google Scholar] [CrossRef] [PubMed]
Aun, E.; Brauer, A.; Kisand, V.; Tenson, T.; Remm, M. A K-Mer-Based method for the identification of phenotype-associated genomic biomarkers and predicting phenotypes of sequenced bacteria. PLoS Comput. Biol. 2018, 14, e1006434. [Google Scholar] [CrossRef]
Kuang, X.; Wang, F.; Hernandez, K.M.; Zhang, Z.; Grossman, R.L. Accurate and Rapid Prediction of Tuberculosis Drug Re-sistance from Genome Sequence Data Using Traditional Machine Learning Algorithms and CNN. Sci. Rep. 2022, 12, 2427. [Google Scholar] [CrossRef]
Arango-Argoty, G.A.; Garner, E.; Pruden, A.; Heath, L.S.; Vikesland, P.; Zhang, L. DeepARG: A deep learning approach for predicting antibiotic resistance genes from metagenomic data. Microbiome 2018, 6, 1–15. [Google Scholar] [CrossRef] [PubMed]
Aslam, M.; Kim, J.-S.; Jung, J. Multi-step ahead wind power forecasting based on dual-attention mechanism. Energy Rep. 2023, 9, 239–251. [Google Scholar] [CrossRef]
Hicks, S.A.; Strümke, I.; Thambawita, V.; Hammou, M.; Riegler, M.A.; Halvorsen, P.; Parasa, S. On evaluation metrics for medical applications of artificial intelligence. Sci. Rep. 2022, 12, 1–9. [Google Scholar] [CrossRef]
Bhattacharyya, R.P.; Bandyopadhyay, N.; Ma, P.; Son, S.S.; Liu, J.; He, L.L.; Wu, L.; Khafizov, R.; Boykin, R.; Cerqueira, G.C.; et al. Simultaneous Detection of Genotype and Phenotype Enables Rapid and Accurate Antibiotic Susceptibility Determination. Nat. Med. 2019, 25, 1858–1864. [Google Scholar] [CrossRef]
Van Camp, P.-J.; Haslam, D.B.; Porollo, A. Prediction of Antimicrobial Resistance in Gram-Negative Bacteria from Whole-Genome Sequencing Data. Front. Microbiol. 2020, 11, 1013. [Google Scholar] [CrossRef]
Kelly, C.J.; Karthikesalingam, A.; Suleyman, M.; Corrado, G.; King, D. Key challenges for delivering clinical impact with artifcial intelligence. BMC Med. 2019, 17, 195. [Google Scholar] [CrossRef]
Hicks, A.L.; Wheeler, N.; Sánchez-Busó, L.; Rakeman, J.L.; Harris, S.R.; Grad, Y.H. Evaluation of Parameters Affecting Performance and Reliability of Machine Learning-Based Antibiotic Susceptibility Testing from Whole Genome Sequencing Data. PLoS Comput. Biol. 2019, 15, e1007349. [Google Scholar] [CrossRef] [PubMed]
Mouton, J.W.; Meletiadis, J.; Voss, A.; Turnidge, J. Variation of Mic Measurements: The Contribution of Strain and Laboratory Variability to Measurement Precision. J. Antimicrob. Chemother. 2018, 73, 2374–2379. [Google Scholar] [CrossRef] [PubMed]
Davies, T.J.; Stoesser, N.; Sheppard, A.E.; Abuoun, M.; Fowler, P.; Swann, J.; Quan, T.P.; Griffiths, D.; Vaughan, A.; Morgan, M.; et al. Reconciling the Potentially Irreconcilable? Genotypic and Phenotypic Amoxicillin-Clavulanate Resistance in Escherichia coli. Antimicrob. Agents Chemother. 2020, 64, e02026-19. [Google Scholar] [CrossRef] [PubMed]
Khaledi, A.; Weimann, A.; Schniederjans, M.; Asgari, E.; Kuo, T.-H.; Oliver, A.; Cabot, G.; Kola, A.; Gastmeier, P.; Hogardt, M.; et al. Fighting Antimicrobial Resistance in Pseudomonas aeruginosa with Machine Learning-Enabled Molecular Diagnostics. EMBO Mol. Med. 2019, 12, e10264. [Google Scholar] [CrossRef]
Anahtar, M.N.; Yang, J.H.; Kanjilal, S. Applications of Machine Learning to the Problem of Antimicrobial Resistance: An Emerging Model for Translational Research. J. Clin. Microbiol. 2021, 59, 7. [Google Scholar] [CrossRef]
Freschi, L.; Jeukens, J.; Kukavica-Ibrulj, I.; Boyle, B.; Dupont, M.-J.; Laroche, J.; Larose, S.; Maaroufi, H.; Fothergill, J.L.; Moore, M.; et al. Clinical Utilization of Genomics Data Produced by the International Pseudomonas aeruginosa Consortium. Front. Microbiol. 2015, 6, 1036. [Google Scholar] [CrossRef]
Nguyen, M.; Long, S.W.; McDermott, P.F.; Olsen, R.J.; Olson, R.; Stevens, R.L.; Tyson, G.H.; Zhao, S.; Davis, J.J. Using Machine Learning to Predict Antimicrobial MICs and Associated Genomic Features for Nontyphoidal Salmonella. J. Clin. Microbiol. 2019, 57, e01260-18. [Google Scholar] [CrossRef]
Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing. N. Engl. J. Med. 2018, 379, 1403–1415. [CrossRef]
Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef]
Moradigaravand, D.; Palm, M.; Farewell, A.; Mustonen, V.; Warringer, J.; Parts, L. Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data. PLoS Comput. Biol. 2018, 14, e1006258. [Google Scholar] [CrossRef]
Her, H.-L.; Wu, Y.-W. A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains. Bioinformatics 2018, 34, i89–i95. [Google Scholar] [CrossRef]
Kim, J.; Greenberg, D.E.; Pifer, R.; Jiang, S.; Xiao, G.; Shelburne, S.A.; Koh, A.; Xie, Y.; Zhan, X. VAMPr: VAriant Mapping and Prediction of antibiotic resistance via explainable features and machine learning. PLoS Comput. Biol. 2020, 16, e1007511. [Google Scholar] [CrossRef] [PubMed]
Lees, J.A.; Mai, T.T.; Galardini, M.; Wheeler, N.E.; Horsfield, S.T.; Parkhill, J.; Corander, J. Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions. mBio 2020, 11, 4. [Google Scholar] [CrossRef] [PubMed]
Aslam, M.; Lee, J.-M.; Kim, H.-S.; Lee, S.-J.; Hong, S. Deep Learning Models for Long-Term Solar Radiation Forecasting Considering Microgrid Installation: A Comparative Study. Energies 2019, 13, 147. [Google Scholar] [CrossRef]
Žegklitz, J.; Pošík, P. Model Selection and Overfitting in Genetic Programming. In Proceedings of the Companion Publication of the 2015 Annual Conference on Genetic and Evolutionary Computation, Madrid, Spain, 11–15 July 2015. [Google Scholar]
Green, A.G.; Yoon, C.H.; Chen, M.L.; Ektefaie, Y.; Fina, M.; Freschi, L.; Gröschel, M.I.; Kohane, I.; Beam, A.; Farhat, M. A Convolutional Neural Network Highlights Mutations Relevant to Antimicrobial Resistance in Mycobacterium tuberculosis. Nat. Commun. 2022, 13, 3817. [Google Scholar] [CrossRef]
de la Fuente-Nunez, C. Antibiotic Discovery with Machine Learning. Nat. Biotechnol. 2022, 40, 833–834. [Google Scholar] [CrossRef]
Ma, Y.; Guo, Z.; Xia, B.; Zhang, Y.; Liu, X.; Yu, Y.; Na Tang, N.; Tong, X.; Wang, M.; Ye, X.; et al. Identification of antimicrobial peptides from the human gut microbiome using deep learning. Nat. Biotechnol. 2022, 40, 921–931. [Google Scholar] [CrossRef]
Mongia, M.; Guler, M.; Mohimani, H. An Interpretable Machine Learning Approach to Identify Mechanism of Action of An-tibiotics. Sci. Rep. 2022, 12, 10342. [Google Scholar] [CrossRef]
Huang, J.; Xu, Y.; Xue, Y.; Huang, Y.; Li, X.; Chen, X.; Xu, Y.; Zhang, D.; Zhang, P.; Zhao, J.; et al. Identification of Potent An-timicrobial Peptides via a Machine-Learning Pipeline That Mines the Entire Space of Peptide Sequences. Nat. Biomed. Eng. 2023. [Google Scholar] [CrossRef]
Marchant, J. Powerful Antibiotics Discovered Using AI. Available online: https://www.nature.com/articles/d41586-020-00018-3 (accessed on 7 February 2023).
Ren, Y.; Chakraborty, T.; Doijad, S.; Falgenhauer, L.; Falgenhauer, J.; Goesmann, A.; Schwengers, O.; Heider, D. Deep Transfer Learning Enables Robust Prediction of Antimicrobial Resistance for Novel Antibiotics. Antibiotics 2022, 11, 1611. [Google Scholar] [CrossRef]
Molnar, C.; Casalicchio, G.; Bischl, B. Interpretable Machine Learning—A Brief History, State-of-the-Art and Challenges. In ECML PKDD 2020 Workshops, Proceedings of the Workshops of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2020): SoGood 2020, PDFL 2020, MLCS 2020, NFMCP 2020, DINA 2020, EDML 2020, XKDD 2020 and INRA 2020, Ghent, Belgium, 14–18 September 2020; Springer: Cham, Switzerland, 2020; pp. 417–431. [Google Scholar]
Murdoch, W.J.; Singh, C.; Kumbier, K.; Abbasi-Asl, R.; Yu, B. Definitions, Methods, and Applications in Interpretable Machine Learning. Proc. Natl. Acad. Sci. USA 2019, 116, 22071–22080. [Google Scholar] [CrossRef]
Loyola-Gonzalez, O. Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses from a Practical Point of View. IEEE Access 2019, 7, 154096–154113. [Google Scholar] [CrossRef]
Azodi, C.B.; Tang, J.; Shiu, S.-H. Opening the Black Box: Interpretable Machine Learning for Geneticists. Trends Genet. 2020, 36, 442–455. [Google Scholar] [CrossRef] [PubMed]
Lv, J.; Deng, S.; Zhang, L. A review of artificial intelligence applications for antimicrobial resistance. Biosaf. Health 2020, 3, 22–31. [Google Scholar] [CrossRef]
Yasir, M.; Karim, A.M.; Malik, S.K.; Bajaffer, A.A.; Azhar, E.I. Application of Decision-Tree-Based Machine Learning Algorithms for Prediction of Antimicrobial Resistance. Antibiotics 2022, 11, 1593. [Google Scholar] [CrossRef]
Yasir, M.; Karim, A.M.; Malik, S.K.; Bajaffer, A.A.; Azhar, E.I. Prediction of Antimicrobial Minimal Inhibitory Concentrations for Neisseria Gonorrhoeae Using Machine Learning Models. Saudi J. Biol. Sci. 2022, 29, 3687–3693. [Google Scholar] [CrossRef] [PubMed]
Aldeyab, M.A.; Bond, S.E.; Conway, B.R.; Lee-Milner, J.; Sarma, J.B.; Lattyak, W.J. Identifying Antibiotic Use Targets for the Management of Antibiotic Resistance Using an Extended-Spectrum β-Lactamase-Producing Escherichia coli Case: A Threshold Logistic Modeling Approach. Antibiotics 2022, 11, 1116. [Google Scholar] [CrossRef] [PubMed]
Rishishwar, L.; Petit, R.A.; Kraft, C.S.; Jordan, I.K. Genome Sequence-Based Discriminator for Vancomycin-Intermediate Staphylococcus aureus. J. Bacteriol. 2013, 196, 940–948. [Google Scholar] [CrossRef]
Drouin, A.; Giguère, S.; Déraspe, M.; Marchand, M.; Tyers, M.; Loo, V.G.; Bourgault, A.-M.; Laviolette, F.; Corbeil, J. Predic-tive Computational Phenotyping and Biomarker Discovery Using Reference-Free Genome Comparisons. BMC Genom. 2016, 17, 754. [Google Scholar] [CrossRef]
Marciano, D.C.; Wang, C.; Hsu, T.-K.; Bourquard, T.; Atri, B.; Nehring, R.B.; Abel, N.S.; Bowling, E.A.; Chen, T.J.; Lurie, P.D.; et al. Evolutionary action of mutations reveals antimicrobial resistance genes in Escherichia coli. Nat. Commun. 2022, 13, 3189. [Google Scholar] [CrossRef] [PubMed]
Lewis, K. Persister Cells, Dormancy and Infectious Disease. Nat. Rev. Microbiol. 2006, 5, 48–56. [Google Scholar] [CrossRef] [PubMed]
Olsen, I. Biofilm-Specific Antibiotic Tolerance and Resistance. Eur. J. Clin. Microbiol. Infect. Dis. 2015, 34, 877–886. [Google Scholar] [CrossRef]
Heydari, A.; Kim, N.D.; Horswell, J.; Gielen, G.; Siggins, A.; Taylor, M.; Bromhead, C.; Palmer, B.R. Co-Selection of Heavy Metal and Antibiotic Resistance in Soil Bacteria from Agricultural Soils in New Zealand. Sustainability 2022, 14, 1790. [Google Scholar] [CrossRef]
Li, L.-G.; Xia, Y.; Zhang, T. Co-Occurrence of Antibiotic and Metal Resistance Genes Revealed in Complete Genome Collection. ISME J. 2016, 11, 651–662. [Google Scholar] [CrossRef]
Xu, E.L.; Qian, X.; Yu, Q.; Zhang, H.; Cui, S. Feature Selection with Interactions in Logistic Regression Models Using Multivariate Synergies for a GWAS Application. BMC Genom. 2018, 19, 170. [Google Scholar] [CrossRef]
Jorgensen, J.H.; Turnidge, J.D. Susceptibility Test Methods: Dilution and Disk Diffusion Methods. In Manual of Clinical Microbiology, 11th ed.; John Wiley & Sons, Inc.: Hoboken, NY, USA, 2015; pp. 1253–1273. [Google Scholar] [CrossRef]
Wang, S.; Zhao, C.; Yin, Y.; Chen, F.; Chen, H.; Wang, H. A Practical Approach for Predicting Antimicrobial Phenotype Re-sistance in Staphylococcus aureus through Machine Learning Analysis of Genome Data. Front. Microbiol. 2022, 13, 605. [Google Scholar]
Cusack, T.; Ashley, E.; Ling, C.; Rattanavong, S.; Roberts, T.; Turner, P.; Wangrangsimakul, T.; Dance, D. Impact of CLSI and EUCAST breakpoint discrepancies on reporting of antimicrobial susceptibility and AMR surveillance. Clin. Microbiol. Infect. 2019, 25, 910–911. [Google Scholar] [CrossRef]
Kavvas, E.S.; Yang, L.; Monk, J.M.; Heckmann, D.; Palsson, B.O. A biochemically-interpretable machine learning classifier for microbial GWAS. Nat. Commun. 2020, 11, 2580. [Google Scholar] [CrossRef]
Aslam, B.; Khurshid, M.; Arshad, M.I.; Muzammil, S.; Rasool, M.; Yasmeen, N.; Shah, T.; Chaudhry, T.H.; Rasool, M.H.; Shahid, A.; et al. Antibiotic Resistance: One Health One World Outlook. Front. Cell. Infect. Microbiol. 2021, 11, 1153. [Google Scholar] [CrossRef]
Deng, X.; den Bakker, H.C.; Hendriksen, R.S. Genomic Epidemiology: Whole-Genome-Sequencing–Powered Surveillance and Outbreak Investigation of Foodborne Bacterial Pathogens. Annu. Rev. Food Sci. Technol. 2016, 7, 353–374. [Google Scholar] [CrossRef] [PubMed]
Ashton, P.M.; Nair, S.; Peters, T.M.; Bale, J.A.; Powell, D.G.; Painset, A.; Tewolde, R.; Schaefer, U.; Jenkins, C.; Dallman, T.J.; et al. Identification of Salmonella for public health surveillance using whole genome sequencing. PeerJ 2016, 4, e1752. [Google Scholar] [CrossRef] [PubMed]
Argimón, S.; Masim, M.A.L.; Gayeta, J.M.; Lagrada, M.L.; Macaranas, P.K.V.; Cohen, V.; Limas, M.T.; Espiritu, H.O.; Palarca, J.C.; Chilam, J.; et al. Integrating whole-genome sequencing within the National Antimicrobial Resistance Surveillance Program in the Philippines. Nat. Commun. 2020, 11, 2719. [Google Scholar] [CrossRef]
Ma, L.; Yi, J.; Wisuthiphaet, N.; Earles, M.; Nitin, N. Accelerating the Detection of Bacteria in Food Using Artificial Intelligence and Optical Imaging. Appl. Environ. Microbiol. 2023, 89, e01828-22. [Google Scholar] [CrossRef]
Wattam, A.R.; Abraham, D.; Dalay, O.; Disz, T.L.; Driscoll, T.; Gabbard, J.L.; Gillespie, J.J.; Gough, R.; Hix, D.; Kenyon, R.; et al. PATRIC, the bacterial bioinformatics database and analysis resource. Nucleic Acids Res. 2013, 42, D581–D591. [Google Scholar] [CrossRef]
Tsoukalas, A.; Albertson, T.; Tagkopoulos, I. From Data to Optimal Decision Making: A Data-Driven, Probabilistic Machine Learning Approach to Decision Support for Patients with Sepsis. JMIR Med. Inform. 2015, 3, e3445. [Google Scholar] [CrossRef]
Doern, C.D.; Park, J.Y.; Gallegos, M.; Alspaugh, D.; Burnham, C.-A.D. Investigation of Linezolid Resistance in Staphylococci and Enterococci. J. Clin. Microbiol. 2016, 54, 1289–1294. [Google Scholar] [CrossRef] [PubMed]
Zasowski, E.J.; Bassetti, M.; Blasi, F.; Goossens, H.; Rello, J.; Sotgiu, G.; Tavoschi, L.; Arber, M.R.; McCool, R.; Patterson, J.V.; et al. A Systematic Review of the Effect of Delayed Appropriate Antibiotic Treatment on the Outcomes of Patients with Severe Bacterial Infections. Chest 2020, 158, 929–938. [Google Scholar] [CrossRef]
Timbrook, T.T.; Morton, J.B.; McConeghy, K.W.; Caffrey, A.R.; Mylonakis, E.; LaPlante, K.L. The Effect of Molecular Rap-id Diagnostic Testing on Clinical Outcomes in Bloodstream Infections: A Systematic Review and Meta-Analysis. Clin. Infect. Dis. 2016, 64, 15–23. [Google Scholar] [CrossRef]
Melo, M.C.R.; Maasch, J.R.M.A.; de la Fuente-Nunez, C. Accelerating antibiotic discovery through artificial intelligence. Commun. Biol. 2021, 4, 1050. [Google Scholar] [CrossRef]
Miethke, M.; Pieroni, M.; Weber, T.; Brönstrup, M.; Hammann, P.; Halby, L.; Arimondo, P.B.; Glaser, P.; Aigle, B.; Bode, H.B.; et al. Towards the sustainable discovery and development of new antibiotics. Nat. Rev. Chem. 2021, 5, 726–749. [Google Scholar] [CrossRef]
Goralski, M.A.; Tan, T.K. Artificial Intelligence and Sustainable Development. Int. J. Manag. Educ. 2020, 18, 100330. [Google Scholar] [CrossRef]
Post, B.; Badea, C.; Faisal, A.; Brett, S.J. Breaking bad news in the era of artificial intelligence and algorithmic medicine: An exploration of disclosure and its ethical justification using the hedonic calculus. AI Ethic 2022. [Google Scholar] [CrossRef] [PubMed]
Bolton, W.J.; Badea, C.; Georgiou, P.; Holmes, A.; Rawson, T.M. Developing moral AI to support decision-making about antimicrobial use. Nat. Mach. Intell. 2022, 4, 912–915. [Google Scholar] [CrossRef]
Isenberg, H.D. Clinical Microbiology: Past, Present, and Future. J. Clin. Microbiol. 2003, 41, 917–918. [Google Scholar] [CrossRef] [PubMed]
Horne, D.J.; Pinto, L.M.; Arentz, M.; Lin, S.-Y.G.; Desmond, E.; Flores, L.L.; Steingart, K.R.; Minion, J. Diagnostic Accuracy and Reproducibility of Who-Endorsed Phenotypic Drug Susceptibility Testing Methods for First-Line and Second-Line Antituberculosis Drugs. J. Clin. Microbiol. 2013, 51, 393–401. [Google Scholar] [CrossRef]
Lunetta, K.L.; Hayward, L.B.; Segal, J.; Van Eerdewegh, P. Screening large-scale association study data: Exploiting interactions using random forests. BMC Genet. 2004, 5, 32. [Google Scholar] [CrossRef]
Mulroney, K.T.; Hall, J.M.; Huang, X.; Turnbull, E.; Bzdyl, N.M.; Chakera, A.; Naseer, U.; Corea, E.M.; Ellington, M.J.; Hopkins, K.L.; et al. Rapid Susceptibility Profiling of Carbapenem-Resistant Klebsiella Pneumoniae. Sci. Rep. 2017, 7, 1903. [Google Scholar] [CrossRef]

Figure 1. Overall process of applying machine-learning/deep-learning models in AMR identification.

Figure 2. Confusion matrix for binary output classification problem.

Figure 3. AI can be applied on antimicrobials to obtain different objectives such as clinical care, drug development, surveillance, identification of new AMR etc.

Figure 4. Combination of moral and AI-based frameworks for CDSS.

Figure 5. Efficiency of AST methods based on AI and conventional techniques.

Table 1. Different ML/DL models and their merits and demerits in AMR problem applications.

Technique	Algorithm	Advantages	Disadvantages
Neural Networks (simple Neural networks, RNN, CNN etc.) [38,60,61,62,63,64,65,66]	These models mimic the human brain and learn by optimizing weights until the final objective is achieved. The better the data, the better is performance Can perform on multi-dimensional data	✓ Can solve complex problems ✓ Feature interaction ✓ Can evaluate features ✓ Capable of multivariate features	✕ Increased model complexity with increase in layers and nodes ✕ The model is not traceable
Decision Tree [28,72,73]	Predict based on target. Leaf nodes equal class label, nodes in the model equals to attributes	✓ Can evaluate features ✓ Model is traceable ✓ Feature interaction	✕ Model might suffer with increasing feature complexity/multivariate
Logistic Regression [74]	Logistic curve that associates to each input features	✓ Feature evaluation	✕ Non-traceable ✕ Interaction of features is not possible

Table 2. Evaluation metrics used in different ML/DL problems.

Regression/Prediction
Evaluation Matrix	Formula
Root Mean Square Error	$\sqrt{\frac{1}{N} . \sum_{n = 1}^{N} {(y_{n}^{'} - y_{n})}^{2}}$
R2 score	$1 - \frac{\sum_{n = 1}^{N} {({y^{'}}_{n} - y_{n})}^{2}}{\sum_{n = 1}^{N} {(y_{n} - \bar{y})}^{2}}$
Classification/Prediction
Accuracy	$\frac{T P + T N}{T P + F P + T N + F N}$
Recall/Sensitivity	$\frac{T P}{T P + F P}$

N is total samples, y_{n}^{'} is predicted values, y_{n} is the actual values and \bar{y} is the same value .

Table 3. Comparison of different ML/DL models for AMR prediction.

Objective	Features	Models	Model for Comparison	Performance	Remarks
Predict AMR (such as CIP, CTX, CTZ and GEN.) [31]	SNPs are being encoded	CNN, RF	LR and SVM	With label encoding RF showed 0.83, with hot encoding, CNN showed 0.855, and with CGR encoding RF showed 0.835	Only SNP data used called based on a single reference genome
Evaluate Machine-Learning models to Predict AMR [32]	k-mers of the strains from WGS	Referenced SVM, and Reference-less SCM		Both models produced around 1.00 precision	Very high precision indicates data are not well balanced
Deep-Transfer Learning to predict Novel AMRs [66]	k-mers and SNPs being encoded	Deep CNN-based transfer leaning		Basic model produced 0.83, transferred models for novel resistance produced less than 0.41	Transferred models producing less precision
Annotating antibiotic resistance genes [30,39]	Genome represented by k-mers	HMD-ARG	Deep-ARG	ARG/non-ARG classification accuracy of 0.948 and antibiotic mobility 0.909	Inputs are assembled sequences, its application scenarios may be limited, and cannot work on short reads unless heavy computational pre-processing are done
Predict vancomycin intermediate susceptible S. aureus phenotype [75]	Resistance genes identified in past	LR	Multylayer perceptron, SVM and RF	Correctly classified 21 out of 25	Model is being built using only 25 genomes
Predict carbapenem resistance in A. baumanii, methicillin resistance in S. aureus, and beta-lactam and co-trimoxazole resistance in S. pneumoniae [76]	Bacterial genome represented by k-mers	AdaBoost		A A. baumanii, S. aureus, S. pneumoniae: 88–99%. M. tuberculosis: 71–88%.	No comparison algorithms used. Approach now implemented as classification tool on Pathosystems Resource Integration Center website

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ali, T.; Ahmed, S.; Aslam, M. Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation. Antibiotics 2023, 12, 523. https://doi.org/10.3390/antibiotics12030523

AMA Style

Ali T, Ahmed S, Aslam M. Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation. Antibiotics. 2023; 12(3):523. https://doi.org/10.3390/antibiotics12030523

Chicago/Turabian Style

Ali, Tabish, Sarfaraz Ahmed, and Muhammad Aslam. 2023. "Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation" Antibiotics 12, no. 3: 523. https://doi.org/10.3390/antibiotics12030523

APA Style

Ali, T., Ahmed, S., & Aslam, M. (2023). Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation. Antibiotics, 12(3), 523. https://doi.org/10.3390/antibiotics12030523

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Intelligence for Antimicrobial Resistance Prediction: Challenges and Opportunities towards Practical Implementation

Abstract

1. Introduction

2. Artificial Intelligence (DL/ML) for Antimicrobial Resistance

2.1. Overall Mechanism of ML/DL Models for the Prediction/Detection of AMR

2.2. Data Analysis and Data Management

2.3. Prediction Strategies

2.4. ML/DL Models

2.5. Model Evaluation

2.6. Robustness of Different ML/DL Models in AMR Prediction

3. ML/DL for AMR Prediction: Challenges and towards Practical Implementation

3.1. Challenges

3.2. Towards Practical Application of AI in the Antimicrobial Sector

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI