Next Article in Journal
The Normalizer Property for Integral Group Rings of Generalized Twisted Wreath Products of Two Finite Groups
Previous Article in Journal
PromptTrace: A Fine-Grained Prompt Stealing Attack via CLIP-Guided Beam Search for Text-to-Image Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluating Machine Learning Algorithms in COVID-19 Research: A Framework Based on Algorithm Co-Occurrence and Symmetric Network Analysis

1
Business School, Sichuan University, Chengdu 610065, China
2
School of Public Administration, Sichuan University, Chengdu 610065, China
*
Author to whom correspondence should be addressed.
Symmetry 2026, 18(1), 163; https://doi.org/10.3390/sym18010163
Submission received: 30 September 2025 / Revised: 25 December 2025 / Accepted: 31 December 2025 / Published: 15 January 2026
(This article belongs to the Section Computer)

Abstract

Machine learning (ML) algorithms are reshaping academic research. However, there is a lack of systematic impact analysis in specific domains. We propose a framework for evaluating the knowledge landscape of domain-specific ML research. It consists of three key components: LDA (Latent Dirichlet Allocation) for topic identification, co-occurrence network construction, and influential algorithm scoring using centrality metrics. In a case study on COVID-19 research, we analyze 30,664 ML-related papers. We identify 13 research topics. We construct a symmetric undirected network to quantify algorithm influence. This analysis employs six centrality metrics: mention frequency, weighted degree, degree centrality, eigenvector centrality, closeness centrality, and betweenness centrality. Results were obtained following linear normalisation. The framework highlights the top ten most influential algorithms for each topic. It reveals the evolving impact of algorithms in COVID-19 research. The methodology is adaptable to other domains. It provides a systematic approach to understanding ML domain-specific impact.

1. Introduction

In contemporary, data-driven scientific domains, the ever-expanding volume of data underscores the imperative of employing diverse methodologies, with a particular emphasis on the pivotal role of ML algorithms in advancing scientific discovery [1]. Due to their clearly defined computational frameworks and advanced learning capabilities, ML algorithms present novel methodologies for the analysis of extensive datasets across diverse research domains [2,3], thereby reshaping the development trajectory of multiple academic disciplines [4]. As distinct knowledge entities in academic literature, ML algorithms also play a vital role in extracting valuable insights or knowledge from vast volumes of raw data [5], often necessitating the integration and comparison of various algorithms. Indeed, numerous research domains, such as biomedical research, employ various algorithms [6]. Consequently, the appropriate selection and rational application of ML algorithms serve as pivotal catalysts for the success of data-driven research [7].
However, domain experts frequently lack specialized algorithmic expertise, complicating the selection of appropriate ML algorithms. This challenge is exacerbated by the rapidly expanding corpus of ML-related literature, which obscures domain-specific trends. Researchers urgently require tools to answer fundamental questions: What problems or topics are ML algorithms applied to in a particular research domain? What are the most influential algorithms within a domain? A comprehensive understanding of these issues necessitates a comprehensive evaluation of domain-specific ML algorithms research. However, the exponential growth of academic papers related to ML algorithms presents obstacles for researchers to obtain an overview of a research domain [8]. For instance, ML algorithms have found widespread application in the research domain of healthcare. As of 1 January 2024, the PubMed Central (PMC) database hosts more than 70,000 articles on ML and healthcare. Extracting a comprehensive overview of research on relevant ML algorithms from such an extensive body of academic papers in this field presents a significant challenge.
ML algorithms are method entities, as well as typical knowledge entities. There are two common approaches for evaluating knowledge entities. The first approach involves conducting a comprehensive survey manually by researchers. However, navigating vast literature collections to perform a comprehensive evaluation is both time-consuming and resource-intensive. Another approach involves utilizing bibliometric indicators derived from bibliometric methods. This approach utilizes various frequency-based indicators to evaluate the influence of knowledge entities [9]. In terms of the bibliometric indicators, quantitative characteristics are traditional indicators for evaluating the influence of algorithm entities [10]. Frequency of mention is the most widely used indicator, reflecting the level of attention received by researchers [11]. In contrast to frequency indicators, researchers have proposed network-related indicators by network centrality to measure large-scale algorithm networks [12]. The ML algorithm network reflects how important algorithm entities are to the overall algorithm community in a research domain. However, the construction of a large co-occurrence network ignores the semantic characteristics of the entities. Some studies have demonstrated that influential algorithms vary across different research topics within a domain [13]. Indeed, there are always several research topics in a research domain. Research topics within a research domain may utilize and prioritize distinct algorithms tailored to their unique objectives and challenges. To this end, text mining offers a viable approach for identifying research topics [14]. Text mining leverages automated processes to extract valuable insights from vast collections of unstructured data [15], offering distinct advantages in identifying complex, undetermined topics [16].
In summary, it is essential to evaluate ML algorithms that are employed within a specific research domain. However, traditional evaluation methods encounter significant challenges due to the vast number of academic papers related to ML algorithm applications. Consequently, this paper proposes a methodological framework for the evaluation of ML algorithm entities, which offers a comprehensive overview of domain-specific ML algorithm research. In detail, we employ text mining, particularly, the LDA, to identify the research topic of academic publications for a domain-specific ML algorithm research, then we construct the ML algorithm network by the co-occurrence relationship among algorithms within in a research topic, and explore the dynamic of ML algorithm network, at last we identify the ML algorithm influence in each topic using frequency of mention and several network centrality measures. The proposed methodological framework can help researchers, especially novices, in gaining a comprehensive understanding of the landscape of ML algorithms within a specific research domain.
To validate the efficacy of the evaluation framework, we conducted a case study. The selection of an appropriate case study domain necessitates rigorous consideration of multiple criteria: (1) the domain must possess a corpus of substantial magnitude to effectively demonstrate the framework’s computational efficiency and scalability in processing extensive scholarly literature; (2) the domain should exhibit a diverse taxonomy of research subdomains to verify the framework’s capacity to identify and differentiate algorithmic influence across heterogeneous thematic contexts; (3) the domain must present contemporary relevance to the scientific community, offering substantive insights that address pressing research challenges and contribute to the advancement of knowledge in the field. Considering that ML algorithms have emerged as essential tools for extracting valuable insights during the COVID-19 pandemic [17], we selected ML algorithms from this period for our case study. First, we compiled a comprehensive collection of widely used ML algorithms from Weka and Scikit-Learn, encompassing 95 full names and 95 abbreviations, to form the ML algorithm dictionary, and we searched ML algorithm-related COVID-19 research in the PMC database by combining this dictionary and COVID-19-related terms. Subsequently, we identified 13 key research topics from a dataset of 30,664 relevant papers using the LDA technique. Next, by integrating a rule-based approach with an ML dictionary, we extracted algorithms from the full text of the papers and constructed co-occurrence networks of algorithm entities for each research topic, with network weights determined by the frequency of co-occurrence. To evaluate the impact of the algorithms, we integrated the network centrality indicator with the frequency of algorithmic mentions to identify signature algorithms within each research topic, and incorporated topological features to further evaluate the impact of these algorithmic entities. This approach not only provides a comprehensive overview of algorithms within the field but also uncovers key research topics and identifies influential algorithms within each topic.
The main contributions in this article are as follows:
(1) Our framework contributed to the knowledge entity evaluation study. The evaluation framework integrated text mining techniques, enabling us to investigate research topics related to knowledge entities within a specific research domain. This framework provides an overview of the domain and facilitates the exploration of the application landscape of these entities. This study introduces an innovative approach to assessing algorithmic influence by combining popularity metrics (e.g., mention frequency) with network centrality measures (such as degree centrality and betweenness centrality) at the thematic level. Unlike conventional methods, our approach accounts for both the algorithm’s visibility (mention frequency) and its role within the network in specific research domains. The key innovation lies in integrating these two types of metrics at the thematic level. This integration allows for a more accurate representation of an algorithm’s influence within specific domains. It overcomes the limitations of relying solely on frequency-based statistics or overall network structure. For example, an algorithm may have significant influence within one theme but remain marginal in others—a subtle distinction missed by traditional methods.
(2) We construct algorithm co-occurrence networks and identify influential algorithms within each research topic, thereby presenting a more comprehensive understanding for researchers, particularly novices, within a specific research domain. Despite the symmetric co-occurrence relationship between algorithms, their influence (as measured by the centrality measure) shows an asymmetric distribution over the network. Consequently, our framework broadens the scope of traditional literature review studies.
(3) Our evaluation framework is validated through a case study on ML algorithm-related COVID-19 papers. Although ML algorithms have played a pivotal role in addressing the challenges presented by the COVID-19 pandemic, the expanding body of literature has resulted in information overload. Researchers frequently encounter confusion regarding the selection of the most appropriate ML algorithms for various COVID-19 applications. Our work provides a comprehensive analysis of how ML algorithms can be applied and mapped within the COVID-19 research domain, thereby clarifying the relationship between algorithms and pandemic-related challenges. This paper is arranged as follows: In Section 2, we introduce the literature review. In Section 3, we present the ML algorithm evaluation framework, and the results are analysed in Section 4. Finally, in Section 5 and Section 6, we present the discussion and conclusion, respectively.

2. Literature Review

This section provides a comprehensive review of the knowledge entities evaluation, examines the application of co-occurrence networks in entity evaluation, and analyzes the current status of research on algorithmic evaluation methods for entities.

2.1. Evaluating Knowledge Entities

The extraction and evaluation of knowledge entities can significantly enhance existing knowledge services, thereby facilitating more efficient and precise access to scientific knowledge for researchers [18]. In previous research, the evaluation of the influence of knowledge entities predominantly depends on bibliometric indicators, typically emphasizing the frequency of knowledge entity mentions, citations, and usage within scholarly literature [19]. Among these indicators, citation frequency proves effective in reflecting the extent of peer recognition of an entity [20], particularly within the domain of library and information science (LIS), and serves as a measure of the popularity of software tools or scientific mapping technologies [21]. In contrast, usage frequency offers crucial insights into the practical application of knowledge entities, particularly for methodological entities such as software tools [22]. Mention frequency, as an indicator of an entity’s presence in academic research, can be integrated with time-series analysis to dynamically monitor variations in its influence [23]. Although these quantitative indicators offer valuable insights, they are also constrained by certain limitations. Citation frequency may not yield a comprehensive and direct portrayal of an entity’s impact, whereas mention frequency and usage frequency fail to differentiate between positive and negative reception [24]. More importantly, current methodologies predominantly rely on quantitative indicators, overlooking the contextual characteristics of knowledge entities within academic literature. For instance, aspects such as the relationship between knowledge entities and other entities, or the specific research topics to which they are associated, are often overlooked in current evaluations. Thus, to gain a more comprehensive understanding of the full impact of knowledge entities, it is essential to broaden the evaluation perspective and integrate analysis at the semantic and relational levels, addressing the limitations inherent in these quantitative assessment methods. Following this line, the present study integrates text mining with network analysis: rather than relying solely on single frequency metrics (e.g., citation or mention counts), we combine LDA-based topic modelling with co-occurrence network analysis to extract research themes at the semantic level and construct relational networks of algorithmic entities, thereby identifying key algorithms within different research topics and their relative influence.

2.2. Evaluating Knowledge Entities Based on Co-Occurrence Network

Dynamic algorithm co-occurrence networks are undirected symmetric structures constructed based on algorithm entity co-occurrence relationships in the literature. By analyzing temporal collaborative patterns between algorithms, these networks reflect research topic evolutionary trajectories. The network identifies co-occurrence frequencies and patterns of algorithm entities in publications, forming algorithm relationship topologies. Its dynamic properties capture temporal changes in algorithm usage patterns, providing a visual representation of domain knowledge structure evolution over time.
Within co-occurrence networks, the structural patterns of interconnections among co-occurring nodes serve as indicators of the relative significance of the involved entities. Network analysis is employed to quantitatively assess these interrelationships and to derive insightful and substantive information [25]. Betweenness centrality, closeness centrality, eigenvector centrality and degree centrality are the most popular evaluation indicators in co-occurrence network analysis. Network indicators offer significant support for the evaluation of knowledge entities, particularly within the biomedical domain. For instance, researchers have constructed heterogeneous co-occurrence networks to investigate potential interactions among drugs, genes, diseases, and therapeutic interventions [26]. Furthermore, the method that combined entity evaluation with network analysis has been employed to identify associations between drugs used in autism treatment [27]. Within the domain of COVID-19 research, scholars have employed indicators such as prevalence indices (PI), collaboration indices (CI), and network topology features to identify critical biological entities, including the ACE-2 gene and the C-reactive protein (CRP) gene, through network structural analysis [28]. A key strength of co-occurrence network analysis lies in its ability to capture the relationships among entities. In contrast, traditional frequency-based evaluation models typically consider each entity in isolation. Conversely, co-occurrence networks unveil the relation of entities, which may reflect their roles in the dissemination of knowledge. Leveraging this potential, researchers have constructed co-occurrence networks to investigate the relationships among algorithmic entities. Through the application of centrality indicators derived from large-scale network analyses of algorithms, researchers have assessed the influence of algorithmic entities within the field of natural language processing (NLP) [29]. Although large-scale co-occurrence networks offer a comprehensive overview of the mutual influence between algorithm entities, this method obscures how the method entities are applied in detail. Within a given domain, a knowledge entity, especially the method entities, usually appears in multiple research topics. Thus, it is necessary to further consider the relation between entities and the influence of entities themselves in different research topics.

2.3. Evaluating the Influence of ML Algorithm Entities

Conventional approaches to assessing the impact of algorithmic entities within the field of LIS predominantly rely on survey techniques and bibliometric indicators. Survey techniques play a critical role in certain domains, notably in medical image recognition. For instance, Support Vector Machines (SVMs) and Convolutional Neural Networks (CNNs) are among the most widely used algorithms in the domain of COVID-19 diagnosis [30]. And a research paper has demonstrated a preference for Random Forest (RF) models outperforming deep learning methodologies in early detection and prognostic analyses [31]. Despite the notable accuracy and clarity of survey methods, manual analysis remains labor-intensive and increasingly inadequate in keeping pace with scientific advancements due to the sheer volume of academic papers. Consequently, scholars are investigating alternative automated assessment methodologies. In recent years, methods for the automated evaluation of algorithmic impact have been developed through the integration of bibliometric indicators and text mining techniques. For instance, by analyzing research papers within the field of NLP, scholars have identified which algorithms exert a greater influence in the domain by extracting and evaluating algorithmic entities [32]. In contrast to conventional survey methods, text mining techniques are capable of efficiently extracting influential algorithms from extensive literature and exhibit greater adaptability to the rapid pace of scientific advancements.
Nevertheless, the current algorithm evaluation methodologies still exhibit certain limitations. Conventional bibliometric indicators primarily reflect the popularity of algorithms based on their frequency of occurrence. However, they fail to uncover the micro-level relationships among algorithms. Furthermore, while co-occurrence network analysis offers a broad overview of algorithmic relationships, it remains insufficient for exploring the influence of algorithms within a specific domain [33]. To address these limitations, this paper proposes an automated approach that integrates text mining and network analysis techniques. We employ the LDA model, which is adept at identifying latent topics within the literature through topic modeling [34], followed by the construction of algorithmic co-occurrence networks. This approach enables a more precise identification of the development dynamics of popular algorithms within a specific domain [35], particularly their influence across different research topics. This novel method of algorithmic entity evaluation not only enhances the analytical efficiency but also offers researchers a more detailed understanding of the development of ML algorithms within a specific domain.

3. The ML Algorithm Evaluation Framework

This section introduces an ML algorithm evaluation framework based on text mining and co-occurrence network analysis, designed to assess the impact of algorithmic entities. The COVID-19 research domain is included as a case study. As shown in Figure 1, the detailed work is divided into four main parts: (1) Data collection and processing, (2) Research topic analysis, (3) Algorithm entity co-occurrence network construction (4) Algorithm entity evaluation.

3.1. Data Collection and Processing

We selected PMC as our database [36], due to its status as one of the most prominent open-access repositories for full-text biomedical literature. Our search strategy consists of two parts: ML algorithms and COVID-19 diseases. Building on the previous algorithm classification framework, this study categorizes ML algorithms into ten groups: ensemble, dimensionality reduction (DR), deep learning (DL), artificial neural networks (ANN), association rule (AR), clustering, regression, classification, probability graph model (PGM), and others [37]. Weka and Scikit-Learn are frequently used open-source ML algorithm tools that implement common ML algorithms. According to the algorithm classification framework, we first constructed a collection of ML algorithms containing the full and short names of the ML algorithms from literature acquired using Weka and Scikit-Learn, in which there were 95 full names and 95 abbreviations, respectively. The dictionary of ML algorithms is presented in Table A1 of Appendix A. And the terms related to COVID-19 that were identified through a literature survey are listed in Table A2 of Appendix A. The below query term is used:
“(COVID-19 related terms [Title/Abstract]) AND (algorithm-related terms [Title/Abstract])”.
The PMC database was searched with the language restricted to English. Review articles were excluded due to their failure to evaluate the effectiveness of the mentioned ML algorithms. The initial screening retrieved 30,952 accessible full-text records. After removing duplicates and records with empty abstracts, metadata for 30,664 studies were retained. Utilizing the BioC API for PMC [38], a final dataset of 30,664 full-text articles was included in the study. The above screening process is illustrated in Figure 2.

3.2. Research Topic Analysis

The LDA model is widely regarded as a prominent method due to its effective balance between interpretability and model complexity [39]. In this study, we employ the LDA model and the Gensim toolkit to identify influential research topics within the domain of COVID-19 studies. Abstracts were selected as the text corpus due to their concise nature when handling large-scale datasets [40]. To enhance the explainability of the topics, we employed a multi-step interpretation process. Initially, word clouds were generated to visualize the 100 most probable terms associated with each topic. Subsequently, a manual summarization involved randomly selecting 10% of articles for discussion by two graduate students, utilizing the word cloud results, article content, and existing research. Finally, experts reviewed, revised, and improved the topic summaries. Through this process, we successfully identified the main research topics within the field of COVID-19.

3.3. Algorithm Entity Co-Occurrence Network Construction

Entity extraction is a prerequisite for constructing an entity co-occurrence network. In this study, we employed a rule-based method to extract algorithm entities, based on the pre-constructed collection of ML algorithms. Due to variations in the full names of ML algorithms depending on authors’ writing styles, we employed fuzzy matching for extracting these entities. The principle of fuzzy matching is that if multiple words of an ML algorithmic name appear in sequence within the text, it indicates that the algorithm is mentioned in the document, disregarding case sensitivity. We employed the full-text to identify algorithm entities, as it provides more mentions of algorithms that are not cited in academic articles [41]. Additionally, through observation, we excluded the “Introduction” section because it does not analyze algorithms in depth. The METHODS, RESULTS, and DISCUSSION sections, which all mention effective algorithm entities, were retained.
We constructed algorithm entity co-occurrence networks for different research topics. Within the collection of literature for a specific research topic: if the full name or the abbreviation of an ML algorithm appears once or more in the text, its frequency is incremented by one; if two ML algorithms (e.g., A and B) are mentioned in the same article, it indicates that A and B appear together; that is, an edge is formed in the co-occurrence network to connect nodes A and B. For each new co-occurrence of nodes A and B, the weight of the edges in the co-occurrence network is increased by one. In an ML algorithm co-occurring network G ( N , E ) , N is the set of nodes in the network, where N = { n 1 , n 2 n m } , and m is the total number of nodes. E is a set of edges in the network. e i j i j , 0 < i < m , 0 < j < m refers to the weight of the edges between node n i and node n j , and ω j denotes the weight of n i . The network is undirected and symmetric: if there is a link from node i to node j, the reverse link from j to i is also present. By construction, this implies a symmetric adjacency matrix, Aij = Aji for all i,j. We verified this property by comparing the adjacency matrix A with its transpose AT and found them to be identical (i.e., ‖A−AT‖F = 0), confirming that the network is indeed symmetric and undirected.
Entity extraction model validation: To ensure the reliability of our entity extraction approach, we evaluated the model’s performance on the test set. The entity extraction model achieved a precision of 92.3%, a recall of 90.8%, and an F1-score of 91.5%, demonstrating high reliability in identifying algorithm entities within the COVID-19 research domain. This validation confirms the effectiveness of our rule-based extraction method combined with fuzzy matching for capturing ML algorithm mentions from academic literature.

3.4. Algorithm Entity Evaluation

Considering the mutual influence among algorithm entities, we introduced topological features to evaluate the influence of algorithm entities. We referred to relevant studies in social network analysis. If ML algorithm A is closely connected to B, and ML algorithm C is closely connected to D, and if B has greater influence than D, then it can be inferred that the influence of ML algorithm A is greater than that of C. In social network research, the centrality of the nodes represents their influence [42]. Four indicators are commonly used to measure centrality: degree, betweenness, closeness, and eigenvector [43]. A higher degree centrality of an ML algorithm measures greater importance and influence within the network.
Betweenness centrality quantifies the importance of a node by evaluating the number of shortest paths that traverse it. Accordingly, the higher the betweenness centrality of an ML algorithm, the greater its influence.
Closeness centrality evaluates the importance of a node based on its distance to other nodes. The higher the closeness centrality of an ML algorithm, the greater its influence.
Eigenvector centrality measures the importance of a node based on the number and degree of its neighboring nodes. Accordingly, the higher the eigenvector centrality of an ML algorithm, the greater its influence.
Furthermore, the ML algorithm co-occurrence network is a weighted network, with the weighted degree used as the evaluation index [44]. Consequently, six indicators-namely mention frequency, degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, and weighted degree-were selected. The average value of these indicators was then computed after linear normalization, serving as the foundation for evaluating the influence of the ML algorithms. A higher value of the normalized average corresponds to a greater influence. Table 1 lists the description and calculation methods for each evaluation indicator.

4. Result

4.1. Research Topics Analysis

The analysis of research topics aims to describe the core research content within a specific domain, thereby providing researchers with a comprehensive overview of the research landscape. Subsequently, we created a trend graph illustrating the evolution of the number of research topics over time. The results indicate that ML algorithms play a significant role in addressing the social impact of COVID-19, advancing medical technologies, and improving individual health outcomes.

4.1.1. LDA Topic Modeling

Regarding coherence and perplexity, the optimal parameters for LDA modeling were set to topic = 13, alpha = 0.2, and beta = 0.01. As shown in Figure 3. To visualize the content of each topic, we used a word cloud map to show the 100 words with the highest probability of appearing in each topic. Figure 4 presents a word cloud map of the thirteen topics, where the larger the font size of a word, the greater the probability of the word appearing in the topic. As a result of each research topic, we discovered 8 distinct research topics for the application of ML algorithms in COVID-19, which included mortality risk and outcome analysis, test and detection of SARS-CoV-2, social impact, clinical diagnosis and symptoms of COVID-19, mental health, diagnosis of medical images, laboratory research on viruses, and vaccination, as well as 5 other COVID-19 research topics. Table 2 presents details of the topics we discovered, including the research topic, the topic number, the count of included studies, and the overview for each research topic.

4.1.2. The Evolution of COVID-19 Research Topics

To illustrate the evolution of research attention across topics over time, Figure 5a shows the changes in the number of articles for each topic. Figure 5b shows the percentage of articles on each topic over time. From the time dimension, the number of articles on each topic exhibited a surge in early 2020, peaked in 2021, and subsequently plateaued. In the early stages of COVID-19, when the novel coronavirus was yet unknown, there were only three topics on ML algorithms and COVID-19 in January 2020, which were Topic 3 (clinical diagnosis and symptoms of COVID-19), Topic 8 (test and detection of SARS-CoV-2), and Topic 9 (mortality risk and outcome analysis). As COVID-19 continued to spread, ML was increasingly applied to other research topics. In this stage, Topic 4 (diagnosis of medical images) was the fastest-growing topic. During the later stages of COVID-19, researchers directed their attention to the impact of the virus on individuals and society as the pressure of detection eased. The number of studies included in Topic 1 (social impact) and Topic 5 (mental health) was second and third, only to Topic 9 (mortality risk and outcome analysis).

4.2. Landmark ML Algorithms Analysis

4.2.1. Constructing ML Algorithm Co-Occurrence Network

Based on the eight key COVID-19 topics and other minor ones that indicated the main COVID-19 research directions or topics for applying ML algorithms, we then constructed a co-occurrence network of ML algorithms for each topic, as shown in Figure 6. The node size represents the number of articles mentioning ML algorithms, and the line thickness represents the weight of the line. Figure 7 indicates that the network density of Topic 4 (diagnosis of medical images) ranked first, where ML algorithms were widely applied, followed by Topic 6 (laboratory research on viruses) and Topic 9 (mortality risk and outcome analysis). The network density of the remaining topics was low.
Figure 6 visualizes the co-occurrence networks of ML algorithms across the thirteen research topics. Nodes represent algorithms and edges indicate that two algorithms are used together within at least one study; thicker and darker edges correspond to higher co-occurrence frequencies. Dense networks reflect a richer combination of algorithms within a topic, whereas sparse networks indicate that only a few algorithms dominate the methodological landscape.
Several patterns emerge. The networks for Topic 4 (diagnosis of medical images), Topic 6 (laboratory research on viruses) and Topic 9 (mortality risk and outcome analysis) are markedly denser than those for other topics, implying greater algorithmic diversity and more frequent joint use. In Topic 4, a tightly connected cluster centred on CNN, DNN, RF, SVM, LR and DT highlights the widespread integration of deep learning and traditional machine learning in imaging pipelines. In Topic 6, PCR, LR, RF, FA and PCA occupy central positions and are strongly interconnected, underscoring the importance of regression and dimensionality-reduction techniques for high-dimensional laboratory and omics data. In Topic 9, LR, LRR, RF and MEN serve as hubs, consistent with the predominance of regression-based models in risk prediction and prognosis tasks based on tabular clinical data.
By contrast, networks in several other topics (e.g., Topics 7, 8 and 11) are relatively sparse and often exhibit star-like structures centred on a small number of algorithms, suggesting a more concentrated methodological toolkit. Across all topics, regression algorithms (such as LR and LRR) and dimensionality-reduction methods (such as PCR, PCA and FA) repeatedly appear as highly connected nodes, confirming their broad applicability and central role in COVID-19–related ML research.

4.2.2. Evaluating the Influence of ML Algorithms

The linear normalized average of the six indices we proposed was used as the basis for evaluating the influence of ML algorithms in each topic. As shown in Figure 8, Dimensionality reduction (DR) was found to be the most popular type of ML algorithms across all research topics. The algorithm that has received the most attention from researchers in each research topic is shown in Figure 6. Logistic regression (LR) was the landmark algorithm in seven groups. Principal component regression (PCR) was the landmark algorithm in test and detection of SARS-CoV-2, clinical diagnosis and symptoms of COVID-19, laboratory research on viruses, and diagnosis and treatment of lung diseases. CNN was the landmark algorithm in diagnosis of medical images, and so is Linear regression (LRR) in social life and public behavior.
Topic 4 had the highest network density and Topic 9 included the most studies. For brevity, we only explain the results of Topic 4 and Topic 9 and show the top ten ML algorithms for each index and their normalized averages.
The results of the ML algorithms evaluation in Topic 4 (diagnosis of medical images) are listed in Table 3 in Topic 4, which shows the top ten ML algorithms ranked by each evaluation index and normalized average. Previous studies evaluated the influence of ML algorithms based on their frequency of use. The top 10 algorithms with the greatest influence according to the method proposed in this study, which is based on the normalized average, were inconsistent with the results of previous studies. Our results showed the ML algorithm with the largest normalized average was random forest (RF), followed closely by CNN. It also showed that RF was more critical in the structure of the ML algorithms’ co-occurrence network despite the high frequency of CNN. RF performed consistently across diverse datasets and topics with a wide range of applicability, and was often adopted by researchers as one of the experimental models. Both RF and CNN played an important role in diagnosis of medical images. In addition, Support Vector Machine (SVM), deep Neural Network (DNN), logistic regression (LR) and Decision Tree (DT) have also received researchers’ attention under this topic, which can contribute to medical image processing.
In diagnosis of medical images, the frequent adoption of Random Forests (RF) and their relative advantages across numerous research scenarios primarily stems from their strong alignment with the practical data and engineering requirements of imaging studies. The workflow from image → radiomics/texture → tabular features typically generates challenges such as high dimensionality, limited sample size, non-linearity, and high-order interactions. RF effectively suppresses variance through bagging and feature subsampling, while inherently accommodating small samples and sparse high-dimensional data. It also demonstrates strong robustness against class imbalance and cross-centre/cross-device heterogeneity, with performance further enhanced through class weighting or threshold adjustments. Moreover, tree-based models provide global feature importance and local explanations via TreeSHAP, while enabling internal validation through out-of-bag (OOB) error estimates. This aligns well with clinical requirements for interpretability, auditability, and regulatory compliance. At the engineering level, RF offers low training and deployment costs, straightforward parameter tuning, minimal GPU dependency, and seamless integration with clinical tabular variables in later stages. In practice, hybrid paradigms such as “deep feature extraction + RF” or “radiomics + RF” preserve the representational power of deep learning while balancing robustness and interpretability, thereby increasing RF’s adoption rate in the literature. However, it should be clarified that deep models such as CNNs and ViTs remain dominant in large-scale, end-to-end, pixel-level tasks with abundant annotations. The “superiority” referenced here primarily reflects RF’s applicability and robustness under constraints of small sample sizes, high dimensionality, and multi-centre heterogeneity, rather than absolute performance dominance across all scenarios.
The ML algorithm with the largest normalized average was logistic regression (LR), which is particularly effective in prediction and classification tasks. In this research area, where the conditions triggered by COVID-19 are complex, LR applied to binomial, polynomial, and ordered classification models. For example, univariate and multivariate ordinal LR models have been successfully employed to identify independent predictors of illness severity [45]. The dominance of LR in this domain can be attributed to its high interpretability, which is critical in clinical decision-making, especially when predicting outcomes like mortality risk. Furthermore, LR’s ability to handle both simple and complex relationships—as seen in its application to evaluate 30-day mortality risk in hemodialysis patients [46]—reinforces its central role in mortality risk analysis, despite the emergence of other algorithms such as XGBoost (XB). In comparison, Principal Component Regression (PCR) and Multi-task Elastic-net (MEN) were also prominent, with PCR excelling in handling high-dimensional data, and MEN being well-suited for multifactorial analyses, such as considering age and comorbidities. Overall, LR and PCR received the most attention from researchers, making them the most influential algorithms in this research topic.
We observe that LR maintains long-term dominance in clinically relevant domains (Table 4). This aligns closely with the field’s stringent requirements for interpretability and probability calibration: LR coefficients directly correspond to odds ratios, facilitating risk communication and guideline development; their probability outputs readily undergo temperature/equidistance calibration to meet specific sensitivity/specificity thresholds. Moreover, clinical tabular data frequently exhibits small sample sizes, class imbalance, and missing values, where LR demonstrates practical feasibility in variable selection and robustness. Regulatory processes also favour transparent, auditable models. Conversely, within medical imaging subdomains, deep networks gain an advantage through large-scale pre-training and spatial invariance, reflecting algorithmic preferences driven by differences in data morphology.

5. Discussion

In this study, we propose a methodology to obtain the landscape of ML algorithms within a domain, encompassing the identification of research topics and the exploration of prominent ML algorithms associated with these research topics derived from an extensive corpus of academic literature.
At the research topic analysis level, the abstract-based LDA topic model demonstrated strong performance. Text mining techniques enabled us to identify thirteen research topics for ML applications in COVID-19. First, we confirmed some findings by referring to previous research. Previous surveys [47] indicated that Topic 4 (diagnosis of medical images) consistently garnered the highest number of studies, a topic in which ML algorithms were employed to analyze image data and other features for disease diagnosis and prediction, aligning with our findings. Additionally, research topics concerning policy and societal impacts, clinical trials, mental health, risk diagnosis, treatment, and prognosis, as noted in previous studies, were also represented in the topics we identified. In addition to the consensus established in previous studies, our approach identified previously unfocused research topics, namely the application of ML algorithms in women’s and children’s health studies and the investigation of changes in public behavioral characteristics. These two topics emphasized groups and characteristics that have attracted considerable attention from researchers during the COVID-19 pandemic. Building on this foundation, the evolutionary analysis indicated that ML models of diagnosis, treatment, and prognosis have consistently represented the foremost research direction within the COVID-19 field. In the later stages of COVID-19, ML models based on vast amounts of medical data supported researchers in discovering new findings. The above analysis demonstrates that our approach effectively and comprehensively identifies research topics relevant to ML applications from large academic literature. The identification of research topics contributes to the development of a structured knowledge framework, outlining the specific applications of ML algorithms for researchers.
At the level of ML algorithm influence evaluation, we constructed algorithm co-occurrence networks within key topics and evaluated algorithms using both mention frequency and network centrality indicators. These six indicators were linearly normalized and then equally weighted to obtain a transparent and comparable composite influence score; alternative data-driven weighting schemes (e.g., PCA or entropy) were not adopted because their sample-specific weights are less interpretable. Based on this composite score, our findings reveal that the influence of the same ML algorithm can vary substantially across different research topics. In terms of application scope, regression and dimensionality reduction algorithms were extensively utilized, with logistic regression (LR) emerging as the most influential algorithm across the eight core research topics, followed by principal component regression (PCR). The regression algorithms aimed to achieve highly sensitive prediction of disease or disease effects [48]. Dimensionality reduction algorithms enabled researchers to condense high-dimensional, complex datasets into lower-dimensional spaces, thereby facilitating subsequent analyses such as genomic, transcriptomic, and proteomic data for disease monitoring or non-linear data for medical image analysis. Analyzing the distribution of ML algorithms across different research topics, the density of co-occurrence networks was found to be highest in Topic 4 (diagnosis of medical images), where RF, CNN, SVM, DNN, LR, and DT all exhibited an influence greater than 0.7. Given that deep learning models offer greater utility than traditional statistical methods in addressing complex medical problems characterized by vast amounts of information. In general, the influence evaluation formula we proposed effectively identifies popular algorithms within each research topic. In contrast to methods relying solely on frequency indicators, this approach places greater emphasis on algorithms that assume a central role within the research topics. The construction of co-occurrence networks within each research topic offers researchers a more holistic understanding of the underlying dynamics.
Beyond the technical evaluation of algorithmic influence, the practical and ethical issues of machine learning (ML) applications in COVID-19 research also deserve careful discussion. First, in clinical environments, model interpretability directly affects applicability. Although deep learning models such as CNNs and DNNs have achieved remarkable accuracy in medical image analysis, their “black-box” nature may hinder physicians from understanding the basis of predictions, thereby reducing the reliability of clinical decision-making. In contrast, algorithms such as Logistic Regression (LR) and Decision Trees (DT), though sometimes less accurate, offer greater transparency and interpretability, making them more acceptable in practice.
Second, fairness is another crucial concern. Imbalanced or biased training data may lead to systematic disadvantages for certain populations (e.g., elderly patients, pregnant women, or minority groups), thus exacerbating health inequities. Researchers should therefore not only evaluate algorithms based on performance metrics but also incorporate fairness-oriented indicators into the evaluation process.
Finally, the risks of inappropriate model selection should not be underestimated. During the pandemic, misclassifying high-risk patients as low-risk could result in delayed treatment and severe consequences, while overestimating risks may cause unnecessary strain on medical resources. To address this, evaluation frameworks should be expanded to consider interpretability, fairness, and risk-awareness alongside technical indicators.

6. Conclusions

ML algorithms are widely used in science discovery. As there are vast academic papers, the evaluation of algorithms has become increasingly critical for researchers conducting data-driven investigations. This paper proposes an automated methodology that integrates text mining and co-occurrence network analysis to identify influential ML algorithms. Text mining techniques are utilized to identify key research topics from vast academic literature. The co-occurrence networks for each topic reflect which algorithms are central. Our method’s successful deployment during the case study of COVID-19 demonstrated that this method offers a direct reference for researchers to make decisions regarding choosing ML algorithms.
This study has its limitations. Although the framework proposed herein demonstrates sound applicability in COVID-19 research, several limitations warrant further clarification. Firstly, the issue of algorithmic nomenclature ambiguity cannot be overlooked. For instance, ‘LR’ may denote either Logistic Regression or Linear Regression in different contexts, with such abbreviations potentially introducing bias in algorithmic identification and categories. Secondly, while fuzzy matching methods enhance recall for algorithmic entities, they may also introduce false positives or false negatives, thereby compromising the accuracy of network construction and impact assessment. Thirdly, the present study’s algorithm identification relies primarily on keyword-based extraction strategies, which may overlook contextual semantics. This approach risks obscuring the precise application or research focus of certain algorithms in specific scenarios.
Future research may enhance algorithm identification precision by incorporating more advanced natural language processing (NLP) techniques, such as named entity recognition and context-sensitive language models. Concurrently, integrating manual verification or semi-automated annotation could mitigate risks arising from algorithm name ambiguity. Only by addressing these issues can the proposed framework achieve robust application across larger-scale and more complex cross-domain datasets. In the spirit of open science, all code and data used in this study are available online at https://github.com/beyoungbelong/ML-algorithms-evaluation/tree/main.

Author Contributions

Conceptualization, S.H.; Methodology, S.H., L.L. and Y.Z.; Validation, Formal analysis, Y.Z.; Writing—original draft, S.H.; Writing—review and editing, Visualization, L.L.; Supervision, L.L. and Y.Z.; Funding acquisition, S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Science and Technology Innovation 2030, Noncommunicable Chronic Diseases—National Science and Technology Major Project [grant number 2024ZD0524300, 2024ZD0524302].

Data Availability Statement

The data used in this study were obtained from openly accessible articles in the PubMed Central (PMC) database.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. The Search Teams for COVID-19 and the Dictionary of ML Algorithms

Table A1. The dictionary of ML algorithms.
Table A1. The dictionary of ML algorithms.
AbbreviationML AlgorithmAbbreviationML Algorithm
ADBAdaptive BoostingLARSLeast Angle Regression
AAApriori AlgorithmLDALinear Discriminant Analysis
BPBack PropagationLRRLinear Regression
BIRCHBalanced Iterative Reducing and Clustering using HierarchiesLRLogistic Regression
BBNBayesian Belief NetworkLSTMLong short term memory
BNBayesian networkLOESSLocally Estimated Scatterplot Smoothing
BRBayesian RegressionMDSMulti-Dimensional Scaling
BABootstrap aggregatingMLPMultilayer Perceptron
BNBBernoulli Naive BayesMRFMarkov Random Field
CCACanonical Correlation
Analysis
MDSMultidimensional Scaling
CRFConditional Random FieldMNBMultinomial Naïve Bayes
CNNConvolutional Neural NetworkMENMulti-task Elastic-Net
CDTConditional Decision
Trees
MLASSOMulti-task Lasso
CONBComplement Naïve BayesMARSMultivariate Adaptive Regression Splines
CARTClassification and Regression TreeNBnaive Bayes
CANBCategorical Naïve BayesOLSOrdinary Least Squares
CHAIDChi-squared Automatic Interaction DetectionOMPOrthogonal Matching Pursuit
DTDecision TreePLSPartial least squares
DBNDeep Belief NetworkPAPassive Aggressive Algorithms
DNNdeep Neural NetworkPEPerceptron
EAEclat AlgorithmPNNPerceptron Neural Network
DBMDeep Boltzmann MachinePRPolynomial regression
DBSCANDensity-Base Spatial Clustering of Application with NoisePCRPrincipal Component Regression
ENRElastic Net RegressionPCAPrincipal Component Analysis
EMExpectation MaximizationQDAQuadratic Discriminant Analysis
FAFactor AnalysisRBFNRadial Basis Function Network
FLFederated LearningRFRandom Forest
FCFuzzy clusteringRNNRecurrent Neural Network
GPRGaussian Process regressionRBNRestricted Boltzmann Machine
GBDTGradient Boosting Decision TreeRIRRidge regression
GMGaussian MixturesRRRobustness Regression
GNBGaussian Naïve BayesSMSammon Mapping
GPCGaussian Processes ClassificationSCSpectral Clustering
GLRGeneralized Linear RegressionSOMSelf-Organizing Map
GANGenerative Adversarial NetworkSAStacked Auto-encoders
GAGenetic AlgorithmSGStacked Generalization
GBRTGradient Boosted Regression TreesSTAStacking
GBMGradient Boosting MachinesSRStepwise Regression
HMMHidden Markov ModelSGDStochastic Gradient Descent
HCHierarchical clusteringSVMSupport Vector Machine
HNHopfield NetworkSVRSupport Vector Regression
ID3Iterative DichotomiserACAgglomerative Clustering
KRRKernel ridge regressionAODEAveraged One-Dependence Estimators
KMK-MeansTLTransfer Learning
KMEK-MediansXBXGBoost
KNNK-Nearest neighborCBCatBoost
LVQLearning Vector QuantizationLGBMLightGBM
LassoLeast Absolute Shrinkage and Selection Operator
Table A2. Teams in retrieval strategy.
Table A2. Teams in retrieval strategy.
WordRelated Retrieval Keywords
COVID-19“COVID-19” OR “SARS-CoV-2” OR “2019-nCoV” OR “Novel Coronavirus Pneumonia” OR “Novel Coronavirus Infected Pneumonia” OR “2019 novel coronavirus” OR “coronavirus 2019” OR “coronavirus disease 2019” OR “2019-novel CoV” OR “2019 ncov” OR “covid 2019” OR “corona virus 2019” OR “ncov-2019” OR “ncov2019” OR “nCoV 2019” OR “Severe acute respiratory syndrome coronavirus 2

Appendix B. Evaluation Results of ML Algorithms in Other Topics

The evaluation results of the ML algorithms in Topic 1 (social impact) are listed in Table A3 indicate that the algorithms with the greatest influence, as determined by the method proposed in this study, align with those based on mention frequency. Our findings demonstrated that the LR algorithm achieved the highest normalized average, owing to its frequent application in predictive dichotomous classification tasks, such as disease spread modeling and risk assessment. The following algorithms were employed to address a broad spectrum of problems related to COVID-19: LRR and FA. These three algorithms have been identified as effective tools for predicting social impacts.
Table A3. Evaluation results of ML algorithms in Topic 1 (social impact).
Table A3. Evaluation results of ML algorithms in Topic 1 (social impact).
No.The Frequency of Being MentionedWeighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1LR1749LR558LR0.673LR0.409LR0.742LR0.404LR1
2LRR655LRR398LRR0.510LRR0.192LRR0.671LRR0.358LRR0.671
3FA544MEN328FA0.408FA0.128FA0.620FA0.303FA0.525
4MEN143FA264MEN0.388RF0.114MEN0.598MEN0.283MEN0.478
5PCR113PCR108RF0.306MEN0.111PCR0.57PCR0.244PCR0.321
6PA72PCA106PCR0.265DT0.05RF0.557RF0.21RF0.31
7SR70SR98DT0.184PCA0.046DT0.533SR0.193PCA0.253
8PCA63GLR68PA0.184FC0.041SR0.533GLR0.189SR0.238
9EM56PA44PCA0.184PCR0.038GLR0.527PCA0.188DT0.222
10PLS39RF32GLR0.163CB0.033CB0.521PA0.181GLR0.22
The evaluation results of the ML algorithms in Topic 2 (vaccination) are listed in Table A4. The results for the top ten most influential algorithms based on normalized mean are inconsistent with the results based on mention frequency. According to our results, LR held a higher degree of influence in comparison to PCR, which was ranked second. The third most powerful was RF. The results showed that LR was most commonly used to predict vaccine effectiveness and other relevant factors due to its simplicity and effectiveness.
Table A4. Evaluation results of ML algorithms in Topic 2 (vaccination).
Table A4. Evaluation results of ML algorithms in Topic 2 (vaccination).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1LR 1019LR 520LR 0.568LR 0.429LR 0.685LR 0.413LR 1.000
2PCR767PCR 384PCR 0.459RF 0.295PCR 0.627PCR 0.386PCR 0.761
3LRR156MEN 158RF 0.405PCR0.208RF 0.607RF 0.301RF 0.500
4MEN 73BA 150LRR0.270MEN 0.089BA0.529LRR0.272LRR 0.363
5BA 65LRR 136MEN 0.243Lasso 0.086KM 0.529RR 0.231MEN 0.342
6RR 30RR 74RR 0.216BA 0.075LRR 0.514KM 0.231BA 0.320
7FA 24RF 40Lasso 0.216PE 0.054Lasso 0.514MEN 0.223RR 0.266
8SR 18SR 38BA 0.189LRR 0.041FA 0.507BA0.206KM 0.257
9RF 14FA 32SVM 0.189SVM 0.023MEN 0.500SR 0.200Lasso 0.255
10BR 13BR 30KM 0.189SR 0.018SR 0.493FA 0.194SR 0.229
The evaluation results of the ML algorithms in Topic 3 (Table A5) indicate that the algorithm with the highest normalized mean is almost identical to the results based on mention frequency. Our findings revealed that the ML algorithm with the largest normalized mean was PCR, which exhibited a significantly higher frequency of mentions in Topic 3 and proved to be more influential than the other algorithms. The results demonstrated that PCR has garnered the most attention in clinical diagnosis and symptom analysis, due to its high reliability and practicality in confirming infection status.
Table A5. Evaluation results of ML algorithms in Topic 3 (clinical diagnosis and symptoms of COVID-19).
Table A5. Evaluation results of ML algorithms in Topic 3 (clinical diagnosis and symptoms of COVID-19).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness
Centrality
Closeness
Centrality
Eigenvector
Centrality
Normalized
Average
1PCR2834PCR506PCR0.848PCR0.899PCR0.868PCR0.634PCR1.000
2LR236LR232LR0.273LR0.119LR0.559LR0.324LR0.302
3MEN88MEN182LRR0.121MEN0.062LRR0.508MEN0.188MEN0.193
4LRR24RR38MEN0.121SA0.061MEN0.500RR0.183LRR0.145
5EM23LRR26DT0.091RF0.061RF0.493DT0.179RR0.118
6PA21FA24RF0.091LR0.048PE0.493LRR0.166DT0.107
7RR18BA22RR0.091PE0.002SA0.485PE0.161RF0.105
8GBM16EM18PE0.091DT0.001DT0.485PA0.160PE0.105
9BP15PA12PA0.091RR0.001GA0.485BR0.157PA0.104
10FA13DT10BR0.061PA0.001RR0.485FA0.157FA0.098
The evaluation results of the ML algorithms in Topic 5 (mental health) are listed in Table A6. The results for the most influential algorithm based on normalized mean is almost consistent with the result based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was LR. MEN, LRR, PA followed closely. The rest of the algorithms were much less influential than the top-ranked algorithms. The results showed that LR was a powerful predictive or analytical tool that excelled at interpreting data on mental health.
Table A6. Evaluation results of ML algorithms in Topic 5 (mental health).
Table A6. Evaluation results of ML algorithms in Topic 5 (mental health).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1LR 1873LR 870LR 0.813LR 0.557LR 0.842LR 0.416LR 1.000
2LRR 697LRR 598MEN 0.521PA 0.146MEN 0.658MEN 0.340MEN 0.503
3PA 409MEN 504PA 0.438MEN 0.131PA 0.623LRR 0.304LRR 0.482
4MEN 224PA 276LRR 0.396BP 0.090LRR 0.608PA 0.292PA 0.432
5FA 91GLR 156FA 0.313PE 0.082FA 0.565FA 0.256FA 0.279
6SR 68FA 134RF 0.271LRR0.050RF 0.552RF 0.219RF 0.231
7PCR 65SR130DT 0.188RF 0.033HC0.533RR 0.185SR 0.192
8GLR 61PCR 78RR0.188FA 0.023DT 0.527PCR 0.181PCR 0.182
9OLS 31RF 44BP 0.167DT 0.019RR 0.522HC 0.180DT 0.181
10BP 23PCA 36HC0.167SVM 0.007BP 0.516DT 0.179RR 0.176
The evaluation results of the ML algorithms in Topic 6 are listed in Table A7. The results for the most influential algorithm based on normalized mean is almost consistent with the result based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was PCR. The second most popular algorithm was RF, which was much less influential than the most popular algorithm. The third-ranked algorithm was LR, which was slightly less influential than the second. The results nshowed that PCR has the highest impact in laboratory virus research, but other algorithms such as RF, LR, and FA also contributed to the topic to varying degrees. This could reflect the fact that researchers in the field of virology employ multiple machine learning methods to process and analyze complex data in COVID-19 research.
Table A7. Evaluation results of ML algorithms in Topic 6 (laboratory research on viruses).
Table A7. Evaluation results of ML algorithms in Topic 6 (laboratory research on viruses).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1PCR 888PCR 346PCR 0.612PCR 0.254PCR 0.700PCR 0.331PCR ·1.000
2EM224LR 198RF 0.490RF 0.207RF 0.636FA 0.305RF 0.636
3LR 167RF 144FA 0.469LR 0.066FA 0.628LR 0.288LR0.563
4PCA 111FA 130LR 0.429FA 0.064LR 0.620RF 0.286FA 0.530
5RF 79PCA 116PCA 0.367TL 0.064PCA 0.590PCA 0.246PCA 0.456
6LRR69LRR 90LRR0.347SVM 0.063LRR 0.583LRR 0.239LRR 0.416
7FA 46SVM 84SVM 0.347HC 0.062SVM 0.570PLS 0.231SVM 0.404
8MDS 46AA 72PLS 0.306PC A0.056PLS 0.563SVM 0.226HC0.355
9PA 38BA 72HC 0.286FC 0.054HC 0.557HC 0.194PLS 0.350
10BP 37MEN 66MEN 0.245BA 0.054GM 0.538GM 0.192MEN 0.300
The evaluation results of the ML algorithms in Topic 7 are listed in Table A8. Evaluation results of ML algorithms in Topic 7. This task, which focused on child health during COVID-19, included the fewest number of studies and had the sparsest network of algorithms. The results for the most influential algorithm based on normalized mean is almost consistent with the result based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was LR. The other algorithms were much less influential than LR. The results showed that LR was often used to analyze children’s health during COVID-19 and the relationship that exists between variables.
Table A8. Evaluation results of ML algorithms in Topic 7 (Other minor COVID-19 research tasks).
Table A8. Evaluation results of ML algorithms in Topic 7 (Other minor COVID-19 research tasks).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1LR 318LR 104LR 0.741LR 0.746LR 0.730LR 0.594LR1.000
2PCR 101LRR 42LRR 0.296LRR 0.397LRR 0.574PCR 0.325LRR 0.442
3LRR 62MEN 34PCR 0.259PE 0.142PCR 0.540MEN 0.286PCR 0.364
4BP 20PCR 26MEN 0.185PCR 0.108MEN 0.519LRR 0.280MEN 0.276
5MEN 14RR 10RR 0.148PA 0.074GLR 0.482RR 0.217RR0.180
6PA 9RF 10HC 0.111MEN 0.021RR 0.458FA 0.216FA 0.164
7DT 8HC 8FA 0.111RR 0.007FA 0.450RF 0.182RF 0.155
8EM 5FA 8RF 0.111HC 0.001HC 0.443PCA 0.165GLR 0.145
9FA 5PCA 8BP 0.074RF 0.001PCA 0.443GLR 0.157HC 0.143
10MDS 5GLR 6BR 0.074BP 0.000RF 0.443HC 0.154PCA 0.139
The evaluation results of the ML algorithms in Topic 8 are listed in Table A9. The result for the most influential algorithm based on normalized mean is almost consistent with the result based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was PCR, which had far more mentions and final normalized values than any other algorithm. The other algorithms were much less influential than the most popular algorithm. The results showed that PCR has had significant success in SARS-CoV-2 detection. PCR was often used to process multivariate data, address data correlations, provide robust predictions, and more, which can be valuable in SARS-CoV-2 testing.
Table A9. Evaluation results of ML algorithms in topic 8 (test and detection of SARS-CoV-2).
Table A9. Evaluation results of ML algorithms in topic 8 (test and detection of SARS-CoV-2).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1PCR 3571PCR516PCR 0.881PCR 0.856PCR 0.894PCR 0.586PCR 1.000
2BA 93BA 194BA0.262AC 0.093BA 0.545BA 0.268BA 0.257
3BP 58LR 74RF 0.190RF 0.057RF 0.532RF 0.211RF 0.166
4LRR53BP 66LR 0.167SVM 0.051LR 0.519LR 0.204LR 0.165
5LR 50MEN 50DT 0.143BA0.039PCA0.519PCA0.195PCA 0.138
6MEN 23LRR44PCA0.143DNN 0.016SVM 0.519DT 0.187DT 0.134
7RF 20DT22HC0.143AA0.010DNN 0.512HC0.183MEN 0.134
8PA 18RF 20DNN 0.119LR 0.008DT 0.512MEN 0.178HC0.131
9PCA 17PCA18MEN 0.119PCA0.007HC0.512LRR0.165LRR 0.130
10DT 15SC 18SVM 0.119HC0.005MEN 0.506SC 0.157BP 0.127
The evaluation results of the ML algorithms in Topic 10 are listed in Table A10. This task focused on the association between COVID-19 and pregnancy, with a focus on women’s health. The results of the top ten influential algorithms based on normalized mean were almost inconsistent with the results based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was LRR, the second-ranked algorithm was LR, which was very close to the first place. The third, fourth, and fifth ranked algorithms were RF, PCR, and LSTM, respectively.
Table A10. Evaluation results of ML algorithms in topic 10 (Other minor COVID-19 research tasks).
Table A10. Evaluation results of ML algorithms in topic 10 (Other minor COVID-19 research tasks).
No.The Frequency of Being MentionedWeighted DegreeDegree CentralityBetweenness CentralityCloseness CentralityEigenvector CentralityNormalized Average
1PCR345LRR272LR0.540LRR0.249LR0.670LR0.339LRR0.959
2LRR317LR178LRR0.524LR0.190LRR0.663RF0.318LR0.853
3LR242LSTM156RF0.508RF0.184RF0.630LRR0.300RF0.683
4LSTM100PCR138LSTM0.381PCR0.135PCR0.568LSTM0.259PCR0.657
5OLS67RF112PCR0.349LSTM0.101LSTM0.558PE0.232LSTM0.571
6LDA66PE96PE0.270OLS0.082PE0.543PR0.201PE0.390
7RF63MLP78SVR0.238HC0.048PCA0.534MLP0.195PCA0.338
8PCA55RNN78PR0.222BR0.043SVR0.529SVR0.191OLS0.338
9KM44PCA78PCA0.222GA0.032PR0.529SVM0.188SVR0.325
10HC33PR56MLP0.206GPR0.032OLS0.525PCR0.182PR0.317
The evaluation results of the ML algorithms in Topic 11 are listed in Table A11. Evaluation results of ML algorithms in topic 11. The task was primarily concerned with the clinical management of patients with COVID-19 and was particularly oriented toward studies of lung diseases (including pneumonia and lung cancer). The results of the top ten influential algorithms based on normalized mean were almost identical to those based on mention frequency. Our results showed that the ML algorithm with the largest normalized mean was PCR.
Table A11. Evaluation results of ML algorithms in topic 11 (Other minor COVID-19 research tasks).
Table A11. Evaluation results of ML algorithms in topic 11 (Other minor COVID-19 research tasks).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1PCR 936PCR 302PCR 0.658PCR0.624PCR 0.726PCR 0.531PCR 1.000
2LR 214MEN 190LR 0.289LR 0.180LR 0.541MEN 0.321LR 0.451
3MEN 85LR 144LRR 0.237LRR 0.149MEN 0.533LR 0.297MEN 0.416
4LRR64LRR 46MEN 0.237BP 0.083LRR 0.525LRR 0.250LRR 0.329
5PA 40RR 38BP 0.211MEN 0.071BP 0.502BP 0.248BP 0.274
6BP 24PE 32PA 0.158RR 0.053RR 0.487PE 0.227RR 0.239
7RR 18PA 30AA 0.132CNN 0.050PE 0.467PA 0.220PA 0.229
8DT 14PCA 18RR0.132PA 0.007PCA 0.467PCA0.217PE 0.224
9PE 13BP 16PE 0.132PCA 0.005AA 0.449RR 0.204PCA 0.214
10PCA 13AA 10PCA 0.132AA 0.005PA 0.449AA 0.182AA 0.192
The evaluation results of the ML algorithms in Topic 12 are listed in Table A12. Evaluation results of ML algorithms in topic 12. The core of the task was the characterization of social life and public behavior during the COVID-19 pandemic. The results of the top ten influential algorithms based on the normalized mean were almost inconsistent with the results based on the mention frequency. Our results showed that the ML algorithm with the largest normalized mean was LRR. the second and third ranked algorithms were closer, FA and LR respectively. The fourth ranked algorithm was PLS. the fifth ranked algorithm was RF.
Table A12. Evaluation results of ML algorithms in topic 12 (Other minor COVID-19 research tasks).
Table A12. Evaluation results of ML algorithms in topic 12 (Other minor COVID-19 research tasks).
No.The Frequency of
Being Mentioned
Weighted DegreeDegree
Centrality
Betweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1PLS 300LRR 126LRR 0.429LRR 0.194LRR 0.578LRR 0.372LRR 0.913
2LRR 143FA 94RF 0.333PLS 0.131FA 0.543LR 0.303FA 0.672
3LR 135LR 80LR 0.317FA 0.120LR 0.543RF 0.279LR 0.668
4PA 128RF76FA 0.286RF 0.118RF 0.529FA 0.265PLS 0.664
5FA127PCA 68PLS 0.286LR 0.096PCA 0.525PCA0.255RF 0.620
6PCA 68PLS 58PCA 0.254PCA 0.089KM 0.496PLS 0.188PCA 0.551
7OLS64PE 56PE 0.206PE 0.074LSTM 0.492KM 0.185PA 0.451
8RF48PA 52SVM 0.190DT 0.074PLS 0.488PCR 0.183KM 0.418
9PCR46PCR44PA 0.190KM 0.074PA 0.485PA 0.178PE 0.376
10KM43KM 44KM 0.190LSTM 0.065SR 0.477SR 0.175LSTM 0.372
The evaluation results of the ML algorithms in Topic 13 are listed in Table A13. The task centered on the COVID-19 pandemic impact and healthcare issues. Our results showed that the ML algorithm with the largest normalized mean was LR. The second ranked algorithm was LRR.
Table A13. Evaluation results of ML algorithms in topic 13 (Other minor COVID-19 research tasks).
Table A13. Evaluation results of ML algorithms in topic 13 (Other minor COVID-19 research tasks).
No.The Frequency of Being MentionedWeighted DegreeDegree CentralityBetweenness CentralityCloseness
Centrality
Eigenvector CentralityNormalized Average
1LR748LR304LR0.705LR0.552LR0.772LR0.492LR1.000
2LRR228LRR154LRR0.432LRR0.206LRR0.638LRR0.379LRR0.541
3PCR137MEN146RF0.273RF0.137MEN0.571MEN0.285MEN0.367
4MEN67PCR80MEN0.273PCR0.117RF0.564RF0.242PCR0.331
5PA42RR34PCR0.250MEN0.082PCR0.564PCR0.234RF0.291
6OLS37RF28PA0.205PA0.065PA0.543PA0.205PA0.233
7RF22FA24DT0.182CNN0.045RR0.500RR0.173RR0.174
8FA20PA22RR0.136DT0.043PCA0.494FA0.173FA0.155
9PCA20GLR22FA0.114RR0.009XB0.494PCA0.162PCA0.152
10EM19DT20PCA0.114PCA0.005OLS0.489XB0.154DT0.143

References

  1. Miller, H.J.; Goodchild, M.F. Data-driven geography. GeoJournal 2015, 80, 449–461. [Google Scholar] [CrossRef]
  2. Zhu, Y.; Zhou, L.; Xie, C.; Wang, G.J.; Nguyen, T.V. Forecasting SMEs’ credit risk in supply chain finance with an enhanced hybrid ensemble machine learning approach. Int. J. Prod. Econ. 2019, 211, 22–33. [Google Scholar] [CrossRef]
  3. Khan, M.; Mehran, M.T.; Haq, Z.U.; Ullah, Z.; Naqvi, S.R.; Ihsan, M.; Abbass, H. Applications of artificial intelligence in COVID-19 pandemic: A comprehensive review. Expert Syst. Appl. 2021, 185, 115695. [Google Scholar] [CrossRef] [PubMed]
  4. Qu, K.; Guo, F.; Liu, X.; Lin, Y.; Zou, Q. Application of machine learning in microbiology. Front. Microbiol. 2019, 10, 827. [Google Scholar] [CrossRef]
  5. Kitchin, R. Big Data, new epistemologies and paradigm shifts. Big Data Soc. 2014, 1, 2053951714528481. [Google Scholar] [CrossRef]
  6. Zhang, C.; Mayr, P.; Lu, W.; Zhang, Y. Extraction and evaluation of knowledge entities from scientific documents. J. Data Inf. Sci. 2021, 129, 7167. [Google Scholar] [CrossRef]
  7. Han, S.; Zhang, R.F.; Shi, L.; Richie, R.; Liu, H.; Tseng, A.; Quan, W.; Ryan, N.; Brent, D.; Tsui, F.R. Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing. J. Biomed. Inform. 2022, 127, 103984. [Google Scholar] [CrossRef]
  8. Appiahene, P.; Missah, Y.M.; Najim, U. Predicting bank operational efficiency using machine learning algorithm: Comparative study of decision tree, random forest, and neural networks. Adv. Fuzzy Syst. 2020, 2020, 8581202. [Google Scholar] [CrossRef]
  9. Tiwari, S.; Chanak, P.; Singh, S.K. A review of the machine learning algorithms for COVID-19 case analysis. IEEE Trans. Artif. Intell. 2022, 4, 44–59. [Google Scholar] [CrossRef]
  10. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  11. Wang, Y.; Zhang, C.; Li, K. A review on method entities in the academic literature: Extraction, evaluation, and application. Scientometrics 2022, 127, 2479–2520. [Google Scholar] [CrossRef]
  12. Howison, J.; Bullard, J. Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature. J. Assoc. Inf. Sci. Technol. 2016, 67, 2137–2155. [Google Scholar] [CrossRef]
  13. Wang, Y.; Zhang, C.; Song, M.; Kim, S.; Ko, Y.; Lee, J. Exploring academic influence of algorithms by co-occurrence network based on full-text of academic papers. Aslib J. Inf. Manag. 2024, 77, 651–680. [Google Scholar] [CrossRef]
  14. Zhang, Z.; Tam, W.; Cox, A. Towards automated analysis of research methods in library and information science. Quant. Sci. Stud. 2021, 2, 698–732. [Google Scholar] [CrossRef]
  15. Bornmann, L.; Mutz, R. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. J. Assoc. Inf. Sci. Technol. 2015, 66, 2215–2222. [Google Scholar] [CrossRef]
  16. Yuen, S.Y.; Chow, C.K.; Zhang, X.; Lou, Y. Which algorithm should I choose: An evolutionary algorithm portfolio approach. Appl. Soft Comput. 2016, 40, 654–673. [Google Scholar] [CrossRef]
  17. Yu, S.; Qing, Q.; Zhang, C.; Shehzad, A.; Oatley, G.; Xia, F. Data-driven decision-making in COVID-19 response: A survey. IEEE Trans. Comput. Soc. Syst. 2021, 8, 1016–1029. [Google Scholar] [CrossRef]
  18. Zhang, C.; Mayr, P.; Lu, W.; Zhang, Y. Guest editorial: Extraction and evaluation of knowledge entities in the age of artificial intelligence. Aslib J. Inf. Manag. 2023, 75, 433–437. [Google Scholar] [CrossRef]
  19. Ray, S. A quick review of machine learning algorithms. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; pp. 35–39. [Google Scholar]
  20. Wang, Y.; Zhang, C. Using the full-text content of academic articles to identify and evaluate algorithm entities in the domain of natural language processing. J. Informetr. 2020, 14, 101091. [Google Scholar] [CrossRef]
  21. Oliveira, M.; Gama, J. An overview of social network analysis. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2012, 2, 99–115. [Google Scholar] [CrossRef]
  22. He, J.; Lou, W.; Li, K. How were science mapping tools applied? the application of science mapping tools in LIS and non-LIS domains. Proc. Assoc. Inf. Sci. Technol. 2019, 56, 404–408. [Google Scholar] [CrossRef]
  23. Pan, X.; Yan, E.; Cui, M.; Hua, W. Examining the usage, citation, and diffusion patterns of bibliometric mapping software: A comparative study of three tools. J. Informetr. 2018, 12, 481–493. [Google Scholar] [CrossRef]
  24. Belter, C.W. Measuring the value of research data: A citation analysis of oceanographic data sets. PLoS ONE 2014, 9, e92590. [Google Scholar] [CrossRef] [PubMed]
  25. Chu, H.; Ke, Q. Research methods: What’s in the name? Libr. Inf. Sci. Res. 2017, 39, 284–294. [Google Scholar] [CrossRef]
  26. Lozano, S.; Calzada-Infante, L.; Adenso-Díaz, B.; García, S. Complex network analysis of keywords co-occurrence in the recent efficiency analysis literature. Scientometrics 2019, 120, 609–629. [Google Scholar] [CrossRef]
  27. Behrouzi, S.; Sarmoor, Z.S.; Hajsadeghi, K.; Kavousi, K. Predicting scientific research trends based on link prediction in keyword networks. J. Informetr. 2020, 14, 101079. [Google Scholar] [CrossRef]
  28. Lv, Y.; Ding, Y.; Song, M.; Duan, Z. Topology-driven trend analysis for drug discovery. J. Informetr. 2018, 12, 893–905. [Google Scholar] [CrossRef]
  29. Li, K.; Yan, E.; Feng, Y. How is R cited in research outputs? Structure, impacts, and citation standard. J. Informetr. 2017, 11, 989–1002. [Google Scholar] [CrossRef]
  30. Yu, Q.; Wang, Q.; Zhang, Y.; Chen, C.; Ryu, H.; Park, N.; Baek, J.-E.; Li, K.; Wu, Y.; Li, D.; et al. Analyzing knowledge entities about COVID-19 using entitymetrics. Scientometrics 2021, 126, 4491–4509. [Google Scholar] [CrossRef]
  31. Landherr, A.; Friedl, B.; Heidemann, J. A critical review of centrality measures in social networks. Bus. Inf. Syst. Eng. 2010, 2, 371–385. [Google Scholar] [CrossRef]
  32. Comito, C.; Pizzuti, C. Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review. Artif. Intell. Med. 2022, 128, 102286. [Google Scholar] [CrossRef] [PubMed]
  33. Opsahl, T.; Agneessens, F.; Skvoretz, J. Node centrality in weighted networks: Generalizing degree and shortest paths. Soc. Netw. 2010, 32, 245–251. [Google Scholar] [CrossRef]
  34. Das, D.; Biswas, S.K.; Bandyopadhyay, S. Perspective of AI system for COVID-19 detection using chest images: A review. Multimed. Tools Appl. 2022, 81, 21471–21501. [Google Scholar] [CrossRef] [PubMed]
  35. Anderson, B.S. Using text mining to glean insights from COVID-19 literature. J. Inf. Sci. 2023, 49, 373–381. [Google Scholar] [CrossRef]
  36. Cheng, X.; Zhao, Y.; Liao, S.S. Key topics in social science research on COVID-19: An automated literature analysis. Health Inf. Libr. J. 2023, 40, 343–358. [Google Scholar] [CrossRef]
  37. Zuo, X.; Chen, Y.; Ohno-Machado, L.; Xu, H. How do we share data in COVID-19 research? A systematic review of COVID-19 datasets in PubMed Central Articles. Brief. Bioinform. 2021, 22, 800–811. [Google Scholar] [CrossRef]
  38. Das, K.; Behera, R.N. A survey on machine learning: Concept, algorithms and applications. Int. J. Innov. Res. Comput. Commun. Eng. 2017, 5, 1301–1309. [Google Scholar]
  39. Comeau, D.C.; Wei, C.H.; Islamaj Doğan, R.; Lu, Z. PMC text mining subset in BioC: About three million full-text articles and growing. Bioinformatics 2019, 35, 3533–3535. [Google Scholar] [CrossRef]
  40. Li, X.; Lei, L. A bibliometric analysis of topic modelling studies (2000–2017). J. Inf. Sci. 2021, 47, 161–175. [Google Scholar] [CrossRef]
  41. Guo, Y.; Zhang, Y.; Lyu, T.; Prosperi, M.; Wang, F.; Xu, H.; Bian, J. The application of artificial intelligence and data integration in COVID-19 studies: A scoping review. J. Am. Med. Inform. Assoc. 2021, 28, 2050–2067. [Google Scholar] [CrossRef]
  42. Li, C.; Feng, S.; Zeng, Q.; Ni, W.; Zhao, H.; Duan, H. Mining dynamics of research topics based on the combined LDA and WordNet. IEEE Access 2018, 7, 6386–6399. [Google Scholar] [CrossRef]
  43. Das, K.; Samanta, S.; Pal, M. Study on centrality measures in social networks: A survey. Soc. Netw. Anal. Min. 2018, 8, 13. [Google Scholar] [CrossRef]
  44. Valente, T.W.; Coronges, K.; Lakon, C.; Costenbader, E. How correlated are network centrality measures? Connections 2008, 28, 16. [Google Scholar]
  45. Xu, K.; Zhou, M.; Yang, D.; Ling, Y.; Liu, K.; Bai, T.; Cheng, Z.; Li, J. Application of ordinal logistic regression analysis to identify the determinants of illness severity of COVID-19 in China. Epidemiol. Infect. 2020, 148, e146. [Google Scholar] [CrossRef]
  46. Kooman, J.P.; Carioni, P.; Kovarova, V.; Arkossy, O.; Winter, A.; Zhang, Y.; Bellocchio, F.; Kotanko, P.; Zhang, H.; Usvyat, L.; et al. Modifiable risk factors are important predictors of COVID-19-related mortality in patients on hemodialysis. Front. Nephrol. 2022, 2, 907959. [Google Scholar] [CrossRef]
  47. Ma, Y.; Liu, J.; Lu, W.; Cheng, Q. From “what” to “how”: Extracting the procedural scientific information toward the metric-optimization in AI. Inf. Process. Manag. 2023, 60, 103315. [Google Scholar] [CrossRef]
  48. Sarmiento Varón, L.; González-Puelma, J.; Medina-Ortiz, D.; Aldridge, J.; Alvarez-Saravia, D.; Uribe-Paredes, R.; Navarrete, M.A. The role of machine learning in health policies during the COVID-19 pandemic and in long COVID management. Front. Public Health 2023, 11, 1140353. [Google Scholar] [CrossRef]
Figure 1. Study workflow.
Figure 1. Study workflow.
Symmetry 18 00163 g001
Figure 2. Literature collection Flowchart.
Figure 2. Literature collection Flowchart.
Symmetry 18 00163 g002
Figure 3. Number of topics.
Figure 3. Number of topics.
Symmetry 18 00163 g003
Figure 4. Word cloud maps for each topic.
Figure 4. Word cloud maps for each topic.
Symmetry 18 00163 g004
Figure 5. The evolution of the number of articles on each topic.
Figure 5. The evolution of the number of articles on each topic.
Symmetry 18 00163 g005
Figure 6. The co-occurrence network of ML algorithms.
Figure 6. The co-occurrence network of ML algorithms.
Symmetry 18 00163 g006
Figure 7. The influence of different types of ML algorithms.
Figure 7. The influence of different types of ML algorithms.
Symmetry 18 00163 g007
Figure 8. Landmark ML algorithms in COVID-19 research.
Figure 8. Landmark ML algorithms in COVID-19 research.
Symmetry 18 00163 g008
Table 1. Evaluation index of ML algorithm and calculation method.
Table 1. Evaluation index of ML algorithm and calculation method.
IndexDefinitionCalculation method
Mention frequencyThis index refers to the number of articles mentioning ML algorithms. The higher the mention count, the greater the influence of nodes.The frequency of a node.
Weighted degreeThis index refers to the sum of line weights of nodes in the network. The greater the weighted degree, the greater the influence of nodes. w i = n j N i e i j ,   w i   is   the   weighted   degree   of   n i ,   N i   is   a   set   of   neighbor   nodes   of   n i .
Degree centralityThis index refers to the degree of nodes divided by the number of nodes in the network. The higher the degree centrality, the more important the node is in the network. d c e n i = d i m 1 ,   d c e n i   is   degree   centrality   of   n i ,   d i   is   degree   of   n i ,   that   is ,   the   number   of   edges   connected   to   n i .
Eigenvector centralityThis index takes into account the interaction between nodes. The greater the influence of a node’s neighbors, the greater the influence of the node. The   eigenvalue   λ   of   A x = λ x   with   the   maximum   absolute   value   and   its   corresponding   eigenvector   x = x 1 , x 2 x m T   are   calculated .   d v e c i = x i .   A   is   the   adjacency   matrix   of   the   co - occurrence   network ,   d c e n i   is   the   eigenvector   centrality   of   n i .
Closeness centralityThis index determines whether a node is close to the center of the network. High closeness centrality means that the closer the node is to other nodes, the more important the node is. d c l o s i = m 1 j = 1 m d i s i j ,   d c l o s i   is   the   closeness   centrality   of   n i d i s i j   is   the   shortest   path   length   between   n i   and   n j , which is the number of edges.
Betweenness centralityThis index is used to determine whether a node occupies an important path in the network from the perspective of network flow. The higher the betweenness centrality is, the shortest paths pass through the node, and the greater the influence of the node. d b e t w i = s t i h s t i p s t , d b e t w i
is   the   betweenness   centrality   of   n i ,   p s t is   the   number   of   shortest   paths   between   nodes   s   and   t .   h s t i   is   the   number   of   shortest   paths   between   n s   and   t   that   pass   through   n i .
Table 2. Research topics in COVID-19.
Table 2. Research topics in COVID-19.
Research TopicTopic NumberCountOverview
Social impactTopic 13289This topic focuses on analyzing factors to people’s attitudes, the spread of COVID-19, infection, and death. Students and workers participated in surveys on the perceptions and behavioral change regarding COVID-19 and control measures to the highest degree.
VaccinationTopic 21936This topic regards vaccination, antibody immunization, viral infection, and mutant strain studies. The advantages of ML models in classification and prediction supported vaccine development and vaccination. In addition, studies on vaccination have mainly focused on the public’s trust in the vaccine and factors influencing vaccine hesitancy.
Clinical diagnosis and symptoms of COVID-19Topic 33256This topic is centered on exploring factors of COVID-19 infection based on case studies and clinical characterization, which involves detecting and diagnosing COVID-19 infection through PCR testing, case analysis, and clinical evaluation of respiratory symptoms.
Diagnosis of medical imagesTopic 42702This topic focuses mainly on the features of medical images, the accuracy of diagnostic methods, and diagnosis using the ML network algorithms.
Mental healthTopic 53098This topic focuses on the psychological impacts of COVID-19 and their spread in the general population, including anxiety, depression, and stress. It also focuses on monitoring psychological changes in the public, immediate intervention, and improving the decision-making abilities of all relevant departments.
Laboratory research on virusesTopic 61904This topic includes research pertaining to infection at the cellular level, drug therapy, immune response, and blood-related research. Predictive studies are carried out based on the genome, transcriptome, and proteome. Predictive markers of disease are identified using ML.
Test and detection of SARS-CoV-2Topic 84019This topic is about the detection and infection of SARS-CoV-2 antigen, including coverage of antigen detection methods in clinical samples, detection of viral variants, and rapid diagnostic methods, with an emphasis on the level of virology.
Mortality risk and outcome analysisTopic 94400This topic mainly contains mortality risk prediction, disease severity analysis, and related factor identification. The purpose of this topic is to identify high-risk patients at an early stage and to provide a reference for clinical decision-making and selection of treatment options to enhance treatment outcomes and optimize healthcare resource management.
Other minor COVID-19 research tasksTopic 7541This topic focuses on children’s health issues during COVID-19.
Topic 101524This topic focuses on women’s health, e.g., studying the association of COVID-19 with pregnancy.
Topic 111352This topic primarily concerns the clinical management of COVID-19 patients and is particularly oriented toward studies of lung diseases.
Topic 121300This topic regards the characterization of social life and public behaviors during the COVID-19 pandemic.
Topic 131297This topic centers on the impact of the COVID-19 pandemic on healthcare issues.
Table 3. Evaluation results of ML algorithms in Topic 4.
Table 3. Evaluation results of ML algorithms in Topic 4.
No.Mention CountWeighted DegreeDegree CentralityBetweenness CentralityCloseness CentralityEigenvector CentralityNormalized Average
1CNN803CNN2084RF0.779RF0.120RF0.819RF0.221RF0.890
2DNN478SVM1960LR0.714DT0.087LR0.778SVM0.214CNN0.888
3SVM384RF1820SVM0.701CNN0.077SVM0.77LR0.213SVM0.784
4RF375DNN1628CNN0.688LR0.069CMM0.762DT0.209DNN0.733
5PCR369LR1406DT0.688SVM0.063DT0.755CNN0.207LR0.723
6TL361DT1226DNN0.636DNN0.060DNN0.733DNN0.198DT0.711
7LR245KNN902PCR0.610LSTM0.044PCR0.72PCR0.197PCR0.596
8LSTM238LSTM868LSTM0.597SVR0.039LSTM0.706LSTM0.193LSTM0.579
9DT211PCR824SVR0.584PCR0.032SVR0.7SVR0.190SVR0.518
10KNN140TL814LR0.519FC0.027KNN0.669MLP0.183TL0.508
Table 4. Evaluation results of ML algorithms in Topic 9.
Table 4. Evaluation results of ML algorithms in Topic 9.
No.Mention CountWeighted DegreeDegree CentralityBetweenness CentralityCloseness CentralityEigenvector CentralityNormalized Average
1LR2981LR1902LR0.807LR0.384LR0.826LR0.373LR1
2PCR1211PCR962PCR0.632PCR0.3PCR0.731PCR0.309PCR0.681
3MEN291MEN686MEN0.474MEN0.054MEN0.640MEN0.296MEN0.436
4LRR218LRR318LRR0.386CART0.043LRR0.6RF0.249LRR0.335
5Lasso96Lasso194RF0.386LRR0.039RF0.594LRR0.237RF0.314
6RF80RF170Lasso0.316BN0.037Lasso0.57Lasso0.228Lasso0.277
7FA76SR164FA0.263BP0.035FA0.564FA0.190FA0.246
8SR73FA162DT0.263TL0.035CART0.559DT0.183CART0.233
9RR64RR154CART0.246RF0.031SVM0.543SVM0.179DT0.219
10PE43PE114SVM0.246KM0.022RR0.538CART0.178SVM0.216
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, S.; Liang, L.; Zhao, Y. Evaluating Machine Learning Algorithms in COVID-19 Research: A Framework Based on Algorithm Co-Occurrence and Symmetric Network Analysis. Symmetry 2026, 18, 163. https://doi.org/10.3390/sym18010163

AMA Style

Huang S, Liang L, Zhao Y. Evaluating Machine Learning Algorithms in COVID-19 Research: A Framework Based on Algorithm Co-Occurrence and Symmetric Network Analysis. Symmetry. 2026; 18(1):163. https://doi.org/10.3390/sym18010163

Chicago/Turabian Style

Huang, Siqi, Luoming Liang, and Ying Zhao. 2026. "Evaluating Machine Learning Algorithms in COVID-19 Research: A Framework Based on Algorithm Co-Occurrence and Symmetric Network Analysis" Symmetry 18, no. 1: 163. https://doi.org/10.3390/sym18010163

APA Style

Huang, S., Liang, L., & Zhao, Y. (2026). Evaluating Machine Learning Algorithms in COVID-19 Research: A Framework Based on Algorithm Co-Occurrence and Symmetric Network Analysis. Symmetry, 18(1), 163. https://doi.org/10.3390/sym18010163

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop