Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings

Galli, Carlo; Cusano, Claudio; Meleti, Marco; Donos, Nikolaos; Calciolari, Elena

doi:10.3390/metrics1010002

Open AccessArticle

Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings

by

Carlo Galli

^1,*

,

Claudio Cusano

²

,

Marco Meleti

³,

Nikolaos Donos

⁴

and

Elena Calciolari

^3,4

¹

Histology and Embryology Laboratory, Department of Medicine and Surgery, University of Parma, Via Volturno 39, 43126 Parma, Italy

²

Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Via Ferrata 1, 27100 Pavia, Italy

³

Department of Medicine and Surgery, Dental School, University of Parma, 43126 Parma, Italy

⁴

Centre for Oral Clinical Research, Institute of Dentistry, Faculty of Medicine and Dentistry, Queen Mary University of London, London E1 2AD, UK

^*

Author to whom correspondence should be addressed.

Metrics 2024, 1(1), 2; https://doi.org/10.3390/metrics1010002

Submission received: 18 July 2024 / Revised: 9 September 2024 / Accepted: 19 September 2024 / Published: 8 October 2024

Download

Browse Figures

Versions Notes

Abstract

:

Systematic reviews are a powerful tool to summarize the existing evidence in medical literature. However, identifying relevant articles is difficult, and this typically involves structured searches with keyword-based strategies, followed by the painstaking manual selection of relevant evidence. A.I. may help investigators, for example, through topic modeling, i.e., algorithms that can understand the content of a text. We applied BERTopic, a transformer-based topic-modeling algorithm, to two datasets consisting of 6137 and 5309 articles, respectively, used in recently published systematic reviews on peri-implantitis and bone regeneration. We extracted the title of each article, encoded it into embeddings, and input it into BERTopic, which then rapidly identified 14 and 22 topic clusters, respectively, and it automatically created labels describing the content of these groups based on their semantics. For both datasets, BERTopic uncovered a variable number of articles unrelated to the query, which accounted for up to 30% of the dataset—achieving a sensitivity of up to 0.79 and a specificity of at least 0.99. These articles could have been discarded from the screening, reducing the workload of investigators. Our results suggest that adding a topic-modeling step to the screening process could potentially save working hours for researchers involved in systematic reviews of the literature.

Keywords:

embedding; systematic reviews; topic modeling

1. Introduction

The advancement of technology and the internet have ushered in a new era of information sharing that is significantly transforming how research is conducted at every stage [1]. Scholars and researchers can now effortlessly publish a vast array of data, reviews, and opinions across globally accessible platforms through new publishing models that simplify and accelerate the sharing of data and knowledge. However, publishing not only serves to disseminate knowledge for furthering research but also involves complex dynamics related to career progression and securing grant funding [2,3]. As a result, an overwhelming number of publications covering diverse topics are published every day, making it challenging to efficiently identify relevant papers amidst a sea of sometimes only tangentially related literature [4]. The challenge becomes particularly critical when conducting systematic reviews.

Systematic reviews are rigorous and comprehensive examinations of the existing literature to answer specific research questions [5] that rely on experimental—and sometimes observational—studies, with the purpose of gathering all the existing evidence on a given medical condition and, usually, its therapy. To conduct a successful systematic review, researchers must consider a wide range of sources and databases to ensure comprehensive coverage of the relevant literature [5,6] and to make sure to identify all the pertinent data, minimizing the risk of missing relevant evidence. For this purpose, researchers usually rely on established library repositories to search for scientific articles [7]. These databases, such as Medline, are commonly searched using specific keywords that may appear in the article, e.g., in its title or abstract. Though this approach is fast and robust, it may fail to capture relevant articles if they use different wordings or synonyms [8]. It has long been recognized that multiple searches with complex syntax, which requires advanced query construction skills, are often needed to narrow down the search space [9]. Moreover, as the lexicon may be ambiguous, this approach usually yields large numbers of publications that do not match the exact focus of the query—or are even off topic—and forces researchers to manually sift through the query results to handpick the papers they need [10]. Considerable research has previously focused on developing and testing search filters designed to refine results by targeting specific criteria [11,12,13,14,15].

Automation has great potential to improve literature searches and expedite systematic reviews [16]. Recent advancements in natural language processing (NLP) and machine learning have demonstrated the possibility to automate or assist several tasks within the systematic review process [17,18,19]. Innovations in this area have led to the development of software like Abstractr, ASReviews, EPPI-reviewer, and RobotSearch, which utilizes convolutional neural network architectures to identify RCTs [20,21]. Large language models are promising tools to automate some aspects of systematic reviews by enhancing literature retrieval through semantic understanding and the contextual analysis of search terms [22,23]. An essential component for achieving semantic understanding is embeddings—numerical representations that encode word or even sentence meaning—which are key in NLP for capturing complex relationships between words and sentences using special architectures known as transformers [24]. Embeddings based on transformer architectures have proven very effective in several NLP tasks, and many efforts have been devoted to developing better embeddings for specific tasks, also in the biomedical field, including topic modeling [25].

Topic modeling involves identifying the main theme of unlabeled documents [26]. This approach can be valuable in understanding complex scientific literature corpora by automatically organizing and categorizing large sets of publications based on their topics [27]. One of the latest examples among the topic-modeling algorithms is BERTopic—an advanced algorithm developed by Grootendorst in 2022—which harnesses the power of the embeddings obtained from BERT, a well-known transformer architecture [28]. BERTopic can segment a dataset of text documents—an operation commonly known as clustering—by their semantic content and extract a series of representative keywords, which BERTopic then concatenates to create a topic label for each cluster [29]. Compared to previous topic-modeling algorithms, transformer-based architectures such as BERTopic capture contextual meanings much more effectively, especially on smaller amounts of data, which, in turn, results in more coherent and interpretable topics [30]. These new methods are being increasingly applied to the analysis of scientific literature [31,32,33,34].

Based on the available data in the literature, we assumed that topic modeling could also be used as an intermediate step within the literature screening process to filter out undesired papers that may have been retrieved with a conventional keyword-based search so to clean the dataset before investigators manually screen it. To this purpose, algorithms providing a contextual interpretation of words and sentences appeared as the only viable option to improve the results of keyword-based searches, because they might be able to correctly interpret the meaning of ambiguous expressions depending on their context of use. BERTopic, therefore, appeared as a valuable choice to pursue this goal.

The main aim of the present paper is thus to explore how BERTopic can be applied effectively on datasets of the scientific literature and identify relevant papers for systematic reviews. To do that, we used two datasets that were recently used for two published systematic reviews in the dental field. We analyzed them separately with BERTopic, identified topic clusters within these two corpora, and investigated whether individual non-relevant topic clusters could have been discarded from the screening without negatively affecting the outcome of the review.

The approach we propose might prove helpful in filtering out non relevant articles from further assessment, thereby enhancing the speed and efficiency of the search process. While topic modeling has been applied in various fields, to the best of our knowledge, our approach is unique in leveraging BERTopic as an intermediate step to segment scientific datasets for systematic reviews in the biomedical (dental) field and filter out irrelevant papers retrieved by conventional keyword-based searches.

2. Materials and Methods

2.1. Datasets

Our analysis focused on two separate datasets that had been previously used to identify articles of interest (henceforth, “target articles”) for two published systematic reviews in the dental field [35,36]. The two datasets contained a list of scientific articles on peri-implantitis and bone regeneration, respectively. The authors of the systematic reviews had created these datasets through specific searches conducted across literature databases, and comprised bibliographic information about the articles, including authors, title, abstract, journal with publication date, plus keywords.

The first dataset included 6137 articles on the treatment of peri-implantitis and was generated through a state of the art and well-detailed keyword-based search across several databases, including Medline and Embase [36]. In that systematic review, the investigators eventually identified 24 target articles that answered the following focused questions (FQ):

FQ1: In patients with peri-implantitis, what is the efficacy of different bone reconstructive therapies compared to access flap surgery (AFS) in terms of pocket reduction and change in bleeding and suppuration on probing (BOP and SOP), at a minimum of 12 months of follow-up?

FQ2: In patients with peri-implantitis, what is the long-term (≥12 months) performance of reconstructive therapies in terms of pocket reduction and change in BOP/SOP?

The second database had been used as a basis for another published systematic review by Calciolari et al. on bone augmentation techniques [35]. The dataset used for this work comprised 5309 articles obtained through a systematic literature search to address the following FQs:

FQ1: In patients receiving GBR simultaneous to implant placement, what is the impact of biomaterials (membranes, grafts, bioactive factors) on the stability of peri-implant bone levels as assessed through 2D or 3D radiographs in RCTs/CCTs with ≥ 12 months of follow-up?

FQ2: In patients receiving GBR simultaneous to implant placement, what is the impact of biomaterials (membranes, grafts, bioactive factors) on bone defect dimension (width and/or height) changes as evaluated at re-assessment procedures performed at ≥4 months post GBR in RCTs/CCTs?

We manually screened these two datasets, searching for off-topic articles (henceforth labelled OffTA). We adopted a broad definition of OffTAs as those articles that did not focus on dentistry, e.g., penile prostheses or breast implants. Pre-clinical studies were not considered OffTAs unless they were investigating areas that were clearly related to fields other than dentistry. So, for instance, a report on cellular behavior in an in vitro setting would not necessarily be considered an OffTA, but a pre-clinical investigation on a fracture model in rodent would. All the papers that investigated areas of dentistry (or applicable to dentistry) were considered on-topic articles (OnTAs) for both datasets. It may be argued that when it comes to peri-implantitis, orthopedic implants may be closer to dental implants than, e.g., orthognatic surgery (which is related to dentistry), or that orthopedic research articles may be more relevant to bone regeneration of the alveolar ridges than many dental-related research areas, but as the goal of our investigation was to filter out papers from the dataset to make systematic reviews faster, we worked under the assumption that it would be safer to discard articles that focused on different clinical areas than dentistry, and that would not increase the risk of losing important pieces of evidence for both datasets.

Upon inspection, the first dataset was composed of 3810 OnTAs (i.e., dentistry-related) and 2327 OffTAs (38%), while the second dataset appeared to include only 814 OffTAs or 15% of the total.

2.2. Purpose of the Study

The purpose of the study was to investigate whether running BERTopic, a topic-modeling algorithm, on these two datasets to segment them into topic clusters could make subsequent screening faster, by identifying groups of articles constituted only (or prevalently) of OffTAs—henceforth designated as off-topic groups or OffTGs—that could be safely discarded to narrow down the corpus.

2.3. Data Analysis

The data were analyzed using Google Colab Pro notebook powered by Python 3.10.12 [37] and running on T4 GPUs [38], which provide the acceleration required to handle embeddings efficiently.

The analysis of the publications was conducted on their titles, based on the assumption that titles are a summary of the content of a paper and are thus representative of their topic [39,40]. The datasets did not need to undergo any preprocessing other than removing entries when titles were missing. Unlike previous publications [41], we did not deem it necessary to lowercase the titles, nor to remove stopwords, to rely on BERTopic’s capability to produce contextual embeddings. Unlike bag-of-words approaches, there is a consensus that stopwords may actually improve sentence encoding by providing further context using transformer architectures [42].

Embeddings in NLP are dense vectors that represent the semantics of words in a multidimensional space [43]. Unlike older algorithms [44], bidirectional encoder representations from transformers (BERT) understands the context and creates unique word embeddings based on their usage in different contexts [24,45]. BERTopic operates through several stages, including transformer embedding models, dimensionality reduction, clustering, and cluster tagging using cTF-IDF [28], which are summarized in Figure 1.

The first step is the creation of the embeddings. To carry that out, we chose the Huggingface’s ‘all-mpnet-base-v2’ model.

The embeddings from this model are too large for efficient clustering and must therefore be reduced. As several dimensionality reduction algorithms are available, we used uniform manifold approximation and projection (UMAP), which has been shown to be very effective in preserving the topological structure of data [46]. We empirically decided to reduce all-mpnet-base-v2’s 768-dimension embeddings to 5-dimension embeddings for subsequent processing and to 2-dimensions for visualization. The reduced embeddings were then clustered with hierarchical density-based spatial clustering of applications with noise (HDBSCAN) [47], and cTf-Idf was applied to extract topic keywords in each cluster. Unlike Tf-Idf [48], cTf-Idf adjusts the weight based on the term frequency within a cluster of documents rather than within an individual document [49].

More specifically, we decided to use the following set-up:

−: UMAP metric: cosine distance (default setting);
−: size of the neighborhood: 15;
−: number of components: 5;
−: HDBSCAN clustering metric: Euclidean (default setting);
−: minimum cluster size: 50.

The number of components was determined empirically as a compromise between preserving the richness of information of the original embedding and making it easier for HDBSCAN to cluster them. The settings for the size of the neighborhood and the minimum cluster size were determined empirically through a grid search, as described in Figure 2. To enhance processing speed, we used the cuML GPU-based implementation of UMAP and HDBSCAN [50]. In addition to BERTopic’s default representation model, we adopted KeyBERT, a more recent algorithm, to improve keyword extraction [51].

BERTopic output is not always straightforward. The way BERTopic labels a topic is by taking the main 4 keywords that describe it and joining them together. This default labeling is efficient but its result is admittedly often obscure. Fortunately, BERTopic allows the integration of other algorithms to create better representations of topics, including large language models (LLM). An LLM is an A.I. algorithm that can generate human-like responses [52] and can create more comprehensive and representative descriptions of topics, i.e., better labels. We decided to use the freely available OpenHermes-2.5-Mistral large language model [53]. LLMs are typically—as their name suggests—very large and require vast computer resources, thus reduced versions have been elaborated [54] that are commonly referred to as quantized LLMs [55]. We opted for the OpenHermes-2.5-Mistral-7B-GGUF/openhermes-2.5-mistral-7b.Q4_K_M.gguf quantization, available for download on Huggingface.com (accessed on 3 March 2024).

LLMs need a prompt from the users [56] to generate a response, and we set the following prompt:

““Q:

I have a topic that contains the following documents:

[DOCUMENTS]

The topic is described by the following keywords: ‘[KEYWORDS]’.

Based on the above information, can you give a short label of the topic of at most 5 words?

A:

“““

So, to briefly summarize it, BERTopic worked as we described by clustering the embeddings and a series of keywords that describe these clusters. The LLM then took the keywords of each topic and generated a sentence that described and captured the essence of these keywords. So eventually, every cluster of documents had a little sentence label as a descriptor, which was much more readable and immediate to human users.

We used BERTopic’s inbuilt functions and the matplotlib [57] and seaborn libraries [58] for data visualization. The Datamapplot library [59] was used for effective cluster visualization.

To measure the performance of the algorithm, we used specificity, sensitivity, and F1 scores. Sensitivity measures the proportion of actual positives that are correctly identified by a test and is usually calculated as

S e n s i t i v i t y = \frac{T r u e P o s i t i v e s (T P)}{T r u e P o s i t i v e s (T P) + F a l s e N e g a t i v e s (F N)}

Specificity measures the proportion of actual negatives that are correctly identified and is calculated as follows:

S p e c i f i c i t y = \frac{T r u e N e g a t i v e s (T N)}{T r u e N e g a t i v e s (T N) + F a l s e P o s i t i v e s (F P)}

The F1 score is the harmonic mean of precision and sensitivity, also commonly referred to as recall, in this context. If precision is calculated as

P r e c i s i o n = \frac{T r u e P o s i t i v e s (T P)}{T r u e P o s i t i v e s (T P) + F a l s e P o s i t i v e s (F P)},

then F1 score is calculated as

F 1 s c o r e = 2 \times \frac{P r e c i s i o n \times R e c a l l}{P r e c i s i o n + R e c a l l}

To this purpose, we considered OffTAs as positive and OnTAs as negative. OffTAs clustered within a OffTG were considered true positive, while OnTAs clustered in an OffTG were considered false positives.

3. Results and Discussion

3.1. Peri-Implantitis Dataset Analysis

Running BERTopic on a Colab notebook, after loading the necessary libraries and packages, required a few seconds for such a small dataset. BERTopic successfully identified several topics based on the dataset titles, but the exact number of topics varied depending on algorithm parameters. The main steps that users can control in BERTopic are summarized in Figure 1:

(1): Creating the embeddings,
(2): Reducing the embeddings’ dimensions.
(3): Clustering the embeddings,
(4): Labelling the embeddings.

To create sentence embeddings from the titles, we decided to use the all-mpnet-base-v2 model, which has been pre-trained on a large dataset and has proven to be effective also on academic titles [60]. While slower than smaller models, this did not significantly impact computation time with a dataset of just over 6000 titles.

To improve clustering efficiency, we reduced the dimensions of our initial embeddings using UMAP. This algorithm allows for a thorough customization of its parameters, including the granularity of the topological structure that it aims to preserve during dimensionality reduction through the “number of neighbors” parameter. A major advantage of HDBSCAN is that it does not require pre-determining the number of clusters (e.g., unlike K-means algorithms), although, as a result, it tends to cluster unclear documents in a null group, which is identified by the −1 label. HDBSCAN can be customized through several parameters too, including the minimum acceptable size for a cluster. The last step consists of finding a label for each cluster that corresponds to its topic. For that purpose, we have chosen two representation models: a large language model, to create human-like labels, and KeyBERT, to generate keywords that are characteristic to the topic. Figure 2 shows that by reducing the minimum size of the clusters, BERTopic was able to find more topics in our dataset. This is expected, as niche topics consisting of only a few articles may be overlooked if the threshold is too high. At the same time, changing the sensitivity of UMAP through its number of neighbors parameter altered the slope of the curve. The choice of the best settings is a subjective decision, which depends on the purpose of the investigators; in our case, we wanted to have enough granularity to isolate unrelated topics while keeping the number of topics small enough to be easily manageable and several orders of magnitude smaller than the dataset itself to make it a convenient step in the workflow. Therefore, we empirically determined the optimal number of neighbors to be 15 and the minimum cluster size to be 50. These settings generated 14 topics (Table 1). Table 1 lists the topics identified by the algorithm, from the biggest one to the smallest one. The topic list includes the ‘−1’ unclassified documents cluster, where all the unclassified papers are supposed be allocated.

Interestingly, the algorithm agreed that the non-classified (−1 cluster) articles were still mostly centered on implants (and implant infections) and assigned them the label “Treating Implant Infections”. This null topic cluster was quite small (n = 357). As expected, BERTopic identified several topics related to peri-implantitis (e.g., topic #0 which included the majority of the papers (n = 3733) or topic #6) but also more broadly related to implant dentistry (e.g., topics #5, #7, and #12).

However, this dataset also included at least 8 completely unrelated OffTGs, some of them quite conspicuous in size, such as the following:

#1 Valves and Stents in Coronary Arteries (n = 587), e.g., “Carotid-subclavian bypass grafting with polytetrafluoroethylene grafts for symptomatic subclavian artery stenosis or occlusion: a 20-year experience” [61];

#2 Intraocular Lens Inflammation (n = 232), e.g., “Double-masked, placebo-controlled evaluation of loteprednol etabonate 0.5 for postoperative inflammation” [62];

#5 Parkinson’s Disease and Deep Brain Stimulation (n = 174), e.g., “Three-dimensional space fluid-attenuated inversion recovery at t to improve subthalamic nucleus lead placement for deep brain stimulation in Parkinson’s disease: from preclinical to clinical studies” [63];

#16 Cochlear Implantation (n = 96), e.g., “Online support group users’ perceptions and experiences of bone-anchored hearing aids (bahas): a qualitative study” [64].

A closer examination of the topic list suggests that more topics could be considered unrelated to the query, albeit dental-related. Some of them are small niche topics, such as #12 Zirconia Implants and Abutments (n = 61), some are larger, such as topic #5 Sinus Floor Elevation (n = 144); although it is thematically related to implants, nothing in its keyword descriptors mentions peri-implant disease:

[‘sinus floor’, ‘sinus elevation’, ‘osteotome sinus’, ‘sinus augmentation’, ‘sinus surgery’, ‘maxillary sinus’, ‘sinus implants’, ‘sinus lift’, ‘transcrestal sinus’, ‘eluting sinus’]

Similarly to identifying OffTAs, the decision to discard OffTGs based on their descriptors is subjective, and there is a degree of risk in discarding dentistry- or implant-related topics. The domain knowledge of the investigator is likely the main factor affecting where to place the threshold, i.e., to decide which topics to retain and which ones to discard. The algorithm we employed makes no assumption on the relevance of the identified topics but merely segments the dataset into groups and labels them. It is up to the investigators to decide whether a topic is relevant to the theme of the query. We adopted a cautious approach by retaining all dental-related topics, and we thus decided to discard only those topics that were unrelated to dentistry as a whole to avoid risking losing relevant articles.

To better understand how the titles of the dataset were semantically distributed, we reduced the embedding dimensionality that we used for topic modeling from 5 down to 2 dimensions, so that each title could be represented as a data point in a scatter plot (Figure 3). Closer points correspond to articles whose titles have closer meaning and, therefore. topic, while farther points represent articles that belong to less closely related topics, including frank OffTAs. As can be easily noticed, the conceptual space for this dataset of scientific publications is not homogeneous, but there are several areas of higher density, which constitute the individual topics, and some topics appear isolated. Unsurprisingly, OffTAs tend to be distributed peripherally, as a satellite constellation of articles with looser association to the core of the dataset, which appears in the middle of the plot and includes most of the dental-related articles (Figure 3).

Even when considering only the topics highlighted in Table 1 that are grossly unrelated to the purpose of the systematic review (i.e., unrelated to dentistry in general), they alone account for about 30% of the papers in the dataset (or 1856 papers out of the 6137 titles that had to be manually screened at the time the systematic review was performed).

As this dataset has already been published, we also already knew which articles had been selected for the review. As Table 2 shows, the target articles that corresponded to the query were not contained in any of the OffTGs. OffTGs (and the OffTAs contained in them) removal during screening would thus not have negatively affected the review.

When examining the allocation of the target articles (Figure 4), it is apparent that the majority were allocated to the #0 group (n = 23), followed by group #6 Photodynamic Therapy for Peri-implantitis Treatment (n = 1).

This supports the idea that BERTopic working on all-mpnet-base-v2 embeddings is robust enough to discriminate not only OffTGs from dental-related topics but, within dental topics, what the peri-implantitis topics are.

We then further analyzed the dataset and found that six OnTAs were misclassified into OffTGs, specifically in topic #8 Porous Polyethylene Orbital Reconstruction; the first misclassified paper was about orthognatic surgery and so not directly related to implants:

“Long-term evaluation of the use of coralline hydroxyapatite in orthognathic surgery” [89]

The remaining five papers focused on craniofacial surgery and thus understandably closer to orbital surgery than to dental-related topics:

“Timing of cranial reconstruction after cranioplasty infections: are we ready for a re-thinking? A comparative analysis of delayed versus immediate cranioplasty after debridement in a series of 48 patients” [90]
“HTR^® polymer facial implants: A five-year clinical experience” [91]
“Gore-Tex chin implants: a review of 324 cases” [92]
“The application of alloplastic materials for augmentation in cosmetic facial surgery” [93]
“Japanese National Questionnaire Survey in 2018 on Complications Related to Cranial Implants in Neurosurgery” [94]

In this case, it can be argued that BERTopic did not significantly misclassify these papers and discarding them would not have jeopardized the systematic review. Additionally, we identified 469 OffTAs (i.e., unrelated to dentistry) that had not been clustered in the OffTGs but had been allocated to dental topics. This classification thus yields a specificity = 0.99, a sensitivity = 0.79, and F1 score = 0.88.

3.2. Bone Augmentation Dataset Analysis

We again applied BERTopic on the titles of the articles of the second dataset and tuned the clustering parameters as previously described to obtain an approachable number of topics. As shown in Figure 5, in general, decreasing the minimum size of the acceptable clusters in the HDBSCAN algorithm again increased the number of topics identified by BERTopic.

At the same time, adjusting the UMAP parameters allows BERTopic to better recognize local features in the distribution of the data points. We maintained the same parameters as for the first dataset, which yielded 22 topics. Table 3 lists the topics identified in this dataset.

The topics were visually inspected, and based on their LLM description and their keyword descriptors, we assessed that most topics were, as expected, related to bone regeneration (e.g., topic #3 Alveolar Ridge Augmentation Techniques), ridge preservation (e.g., topic #5 Ridge Preservation Bone Allograft), or otherwise implant-related (e.g., topic #9 Implants in Fresh Extraction Sockets).

However, a careful inspection of the dataset also revealed five OffTGs that appeared completely unrelated to the dental field and that altogether totaled 610 OffTAs (Table 3, bold). These topics included the following:

#2 Hip Arthroplasty (n = 296), e.g., “Timing of tibial tubercle osteotomy in two-stage revision of infected total knee arthroplasty does not affect union and reinfection rate. A systematic review” [95]

#12 Cervical and Lumbar Fusion Studies (n = 93), e.g., “A Long-Term Follow-up, Multicenter, Comparative Study of the Radiologic, and Clinical Results between a CaO-SiO2-P2O5-B2O3Bioactive Glass Ceramics (BGS-7) Intervertebral Spacer and Titanium Cage in 1-Level Posterior Lumbar Interbody Fusion” [96]

#10 Ventricular Assist Devices (n = 117), e.g., “Heart transplantation of patients with ventricular assist devices: impact of normothermic ex-vivo preservation using organ care system compared with cold storage” [97]

The dataset also contained one topic that, albeit related to dentistry, was not centered on implant dentistry or bone regeneration, i.e., topic #15 Periodontal defect treatment, and that could thus be potentially discarded but was retained following a conservative attitude, as explained above.

When we plotted the dimensionally reduced embeddings for these five selected OffTGs, these were again mostly located at the periphery of the scatter plot (Figure 6), at some semantic distance from the bulk of the data points. Overall, the bone augmentation dataset contained a smaller proportion of OffTAs compared to the peri-implantitis dataset, because our analysis identified only 11% of papers that could be safely excluded from further consideration (632 OffTAs out of 5309 articles).

We then assessed the allocation of the 36 target articles that were identified by the manual screening in the systematic review. No target article had been allocated to any of the OffTGs, indicating that their removal before manual screening would not have impacted the review outcome (Table 4).

Interestingly, most target papers belonged to either group #16 Collagen Membranes for Guided Bone Regeneration (16 papers out of 36 target articles), group #6 Bone Grafting for Dental Implants (8 target articles), or group -1 (9 target articles) (Figure 7). This raises an interesting point for consideration as out of 5309 articles, 45% of the target articles were found in topic #16, which contained only 71 papers, and all the 36 articles were contained in six topic groups, which contained 2695 articles (noticeably, topic -1 alone contained 1515 papers).

We then manually screened the corpus again and found that no (dental) OnTA had been clustered in the OffTGs, yielding a classification specificity = 1. Upon inspection, we also found 204 OffTAs (i.e., unrelated to dentistry) that were not clustered in the OffTGs. The recall (or sensitivity) for this task is thus 0.74, and the F1 score for this classification task is 0.85, in line with the results for the first dataset. This indicates that additional papers could have been filtered out from the dataset, but it also confirms that no real dental paper (i.e., a potential target article for a systematic review) was inadvertently lost due to misclassification.

The proposed workflow for article screening is summarized in Figure 8.

It could be argued that alternative, faster, and less resource-intensive topic-modeling protocols could be implemented for the same purpose. We ran both latent Dirichlet allocation (LDA) and latent semantic analysis (LSA) on the titles of both datasets, setting the number of topics to 14 and 22, respectively, to match the topics identified by BERTopic. The performance of these algorithms can be found in Supplementary Table S3. To run both LDA and LSA, the titles had to be pre-processed, including stopword removal. While these approaches use bag-of-words mechanics, making them potentially less capable of capturing semantic nuances, their specificity was overall acceptable, i.e., they tended not to discard on-topic articles, making them potentially safe for use in this context. Their sensitivity was, however, very low when compared to BERTopic, which means that they failed to identify many off-topic articles, making them thus less effective in filtering non-relevant articles, as we propose.

Overall, high performing topic-modeling algorithms, such as BERTopic, create the opportunity to segment whole datasets of articles retrieved from online databases and literature repositories into clusters labelled by their topic, and this is turning out very useful to quickly understand the topic landscape of whole science fields [31,32,34].

Some of the topics BERTopic identified in our datasets are clearly unrelated to the matter at hand and can potentially and, based on our data, safely be removed and excluded from further screening, saving a variable amount of time to investigators. The first dataset, on peri-implantitis, was more heterogeneous and eight OffTGs were identified, which contained about 30% of the total number of papers in the dataset. It can be assumed that their exclusion would have significantly impacted the total manual screening time. The second dataset, on bone regeneration, was cleaner, and the five clearly unrelated OffTGs that BERTopic identified were smaller. It can be therefore assumed that cleaner, more focused datasets will benefit from this semantic filtering less than broader and more heterogeneous datasets. At the same time, it can also be hypothesized that the availability of these protocols of semantic filtering could affect the way keyword-based searches are conducted, relaxing the need for more stringent queries and allowing for broader, more heterogeneous datasets that can be filtered using semantic-based algorithms before manual screening. This stands in contrast to the previous approaches to literature searches, which have mostly relied on complex search strategies to minimize the retrieval of heterogeneous data [134,135,136].

We used the all-mpnet-base-v2 model for our work, which is a general model trained on a very wide corpus of texts. BERTopic, however, is independent of the embedding model, and as new models become available, these can be used to improve the effectiveness of the approach. Specific models, trained on corpora targeting specific science areas, could also be used, if the investigators deem it necessary, to better capture the possible meaning nuances of certain science niches.

One could argue that abstracts could provide more information on topic and context that just titles, and topic modeling should therefore be rather conducted on abstracts or a title + abstract combination. Of course, abstracts arguably provide more details on the content of a manuscript and work as well as or even better than titles to capture the theme of a report. Our choice of using titles for the topic-modeling analysis was rather based on the performance of the proposed algorithm. Abstracts are considerably longer than titles, and processing them takes much longer than analyzing titles, even if using hardware acceleration. One of the purposes of our work was to prove the feasibility of semantic title screening as an acceptable compromise to filter out off-topic articles prior to screening. Our approach, which relies on titles alone, could be easily scaled up to much bigger datasets, paving the way to changes in the paradigms of how literature searches are managed as a whole.

In fact, our brief analysis also suggests that some topics might even be positively selected to conduct a more restricted and focused screening, with some caveats. In our situation, it was easy to retrospectively identify the topics where the target articles were contained but implementing that with a new dataset would not be as straightforward, because many topics would be about closely related areas, and excluding them would be risky. As an example of this, considering a hypothetical adoption of BERTopic in the pre-screening phase of the bone regeneration dataset for a systematic review, topic #16 Collagen Membranes for Guided Bone Regeneration could have been an easy pick for further assessment, but at this stage, there would have been no rationale to safely exclude, e.g., the quite large (n = 692) #1 Sinus Floor Elevation group by just looking at its LLM descriptor or its keywords.

One final aspect to consider is the knowledge and language barrier that this procedure still poses to many life science investigators. At the present moment, applying this and similar algorithms using command line interfaces still requires a degree of coding literacy that may discourage many investigators in biology and medicine, although unjustifiably. We encourage investigators to become familiar with coding interfaces such as Google Colab or Jupyter notebooks, as they often provide an affordable alternative to proprietary software and offer a quick way to deploy, customize, and maximize the potential of new algorithms. Given the rapid advancements in machine learning and artificial intelligence, relying solely on point-and-click user interfaces may be a risky choice. Investing time in learning basic Python syntax may be a more sustainable approach to keep up with the current technological revolution. To assist those interested in trying out the algorithm used in this study, we have made a simplified version of code available as a ipynb notebook in the Supplementary Materials.

4. Conclusions

Our analysis focused on two separate datasets, and used BERTopic, a popular topic-modeling algorithm, to identify sets of articles to discard from datasets of the biomedical literature prior to manual screening to expedite evidence identification in systematic reviews. Taken together, our data show that encoding article titles using the all-mpnet-base-v2 model, followed by semantic clustering with BERTopic, is an inexpensive and quick way to categorize articles into topics and have an effective overview of the dataset’s semantic structure. This procedure has sufficient granularity to identify article groups that can be safely removed from the dataset before it is processed by the investigators during the reliable but slow and labor-intensive process of manual inspection. The number of topics unrelated to the query varies according to the query itself, the keywords used to conduct it, and the database used to create the dataset, but in one of the two datasets we used, we were able to filter out more than 1800 articles, or 30% of the dataset. Furthermore, we also observed that the target articles that had been actually identified for the systematic reviews tended to be found in few clusters, This suggests that semantic search can help investigators not only identify unrelated articles for exclusion but also focus manual inspection on a smaller, relevant subset of the dataset, with faster processing.

The key highlights of our study can be listed as follows:

We applied BERTopic to two datasets of biomedical literature (dentistry).
BERTopic identified a significant number of papers unrelated to the query.
Off-topic papers constituted up to 30% of the initial dataset.
BERTopic effectively filtered out off-topic papers from datasets, saving review time.
The excluded topic clusters contained no relevant manuscripts, ensuring safe exclusion.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/metrics1010002/s1, Table S1: Full topic list peri-implantitis dataset; Table S2: Full topic list bone regeneration dataset; Table S3: Performance comparison of topic-modeling algorithms.

Author Contributions

Conceptualization, C.G. and E.C.; methodology, C.C.; software, C.C. and C.G.; formal analysis, C.C. and C.G.; resources, E.C. and N.D.; data curation, M.M.; writing—original draft preparation, C.G. and M.M.; writing—review and editing, N.D. and E.C.; supervision, E.C.; All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Reips, U.-D.; Barak, A. How Internet-Mediated Research Changes Science; University Press: Zurich, Switzerland, 2008; ISBN 0521694647. [Google Scholar]
Hyland, K. Academic Publishing: Issues and Challenges in the Construction of Knowledge; Oxford University Press: Oxford, UK, 2016. [Google Scholar]
Lee, I. Publish or Perish: The Myth and Reality of Academic Publishing. Lang. Teach. 2014, 47, 250–261. [Google Scholar] [CrossRef]
Landhuis, E. Scientific Literature: Information Overload. Nature 2016, 535, 457–458. [Google Scholar] [CrossRef] [PubMed]
Dickersin, K.; Scherer, R.; Lefebvre, C. Systematic Reviews: Identifying Relevant Studies for Systematic Reviews. BMJ 1994, 309, 1286–1291. [Google Scholar] [CrossRef] [PubMed]
Bramer, W.M.; Rethlefsen, M.L.; Kleijnen, J.; Franco, O.H. Optimal Database Combinations for Literature Searches in Systematic Reviews: A Prospective Exploratory Study. Syst. Rev. 2017, 6, 245. [Google Scholar] [CrossRef] [PubMed]
Lu, Z. PubMed and Beyond: A Survey of Web Tools for Searching Biomedical Literature. Database 2011, 2011, baq036. [Google Scholar] [CrossRef] [PubMed]
Gusenbauer, M.; Haddaway, N.R. Which Academic Search Systems Are Suitable for Systematic Reviews or Meta-Analyses? Evaluating Retrieval Qualities of Google Scholar, PubMed, and 26 Other Resources. Res. Synth. Methods 2020, 11, 181–217. [Google Scholar] [CrossRef] [PubMed]
Lee, J.-C.; Lee, B.J.; Park, C.; Song, H.; Ock, C.-Y.; Sung, H.; Woo, S.; Youn, Y.; Jung, K.; Jung, J.H.; et al. Efficacy Improvement in Searching MEDLINE Database Using a Novel PubMed Visual Analytic System: EEEvis. PLoS ONE 2023, 18, e0281422. [Google Scholar] [CrossRef]
Grivell, L. Mining the Bibliome: Searching for a Needle in a Haystack? EMBO Rep. 2002, 3, 200–203. [Google Scholar] [CrossRef]
Rogers, M.; Bethel, A.; Boddy, K. Development and Testing of a Medline Search Filter for Identifying Patient and Public Involvement in Health Research. Health Info. Libr. J. 2017, 34, 125–133. [Google Scholar] [CrossRef]
Salvador-Oliván, J.A.; Marco-Cuenca, G.; Arquero-Avilés, R. Development of an Efficient Search Filter to Retrieve Systematic Reviews from PubMed. J. Med. Libr. Assoc. JMLA 2021, 109, 561. [Google Scholar] [CrossRef] [PubMed]
Damarell, R.A.; May, N.; Hammond, S.; Sladek, R.M.; Tieman, J.J. Topic Search Filters: A Systematic Scoping Review. Health Info Libr. J. 2019, 36, 4–40. [Google Scholar] [CrossRef] [PubMed]
Wagner, M.; Rosumeck, S.; Küffmeier, C.; Döring, K.; Euler, U. A Validation Study Revealed Differences in Design and Performance of MEDLINE Search Filters for Qualitative Research. J. Clin. Epidemiol. 2020, 120, 17–24. [Google Scholar] [CrossRef] [PubMed]
Massonnaud, C.; Lelong, R.; Kerdelhué, G.; Lejeune, E.; Grosjean, J.; Griffon, N.; Darmoni, S.J. Performance Evaluation of Three Semantic Expansions to Query PubMed. Health Inf. Libr. J. 2021, 38, 113–124. [Google Scholar] [CrossRef]
Jin, Q.; Leaman, R.; Lu, Z. PubMed and beyond: Biomedical Literature Search in the Age of Artificial Intelligence. EBioMedicine 2024, 100, 104988. [Google Scholar] [CrossRef]
van Dijk, S.H.B.; Brusse-Keizer, M.G.J.; Bucsán, C.C.; van der Palen, J.; Doggen, C.J.M.; Lenferink, A. Artificial Intelligence in Systematic Reviews: Promising When Appropriately Used. BMJ Open 2023, 13, e072254. [Google Scholar] [CrossRef] [PubMed]
Fabiano, N.; Gupta, A.; Bhambra, N.; Luu, B.; Wong, S.; Maaz, M.; Fiedorowicz, J.G.; Smith, A.L.; Solmi, M. How to Optimize the Systematic Review Process Using AI Tools. JCPP Adv. 2024, 4, e12234. [Google Scholar] [CrossRef] [PubMed]
Atkinson, C.F. Cheap, Quick, and Rigorous: Artificial Intelligence and the Systematic Literature Review. Soc. Sci. Comput. Rev. 2023, 42, 376–393. [Google Scholar] [CrossRef]
van de Schoot, R.; de Bruin, J.; Schram, R.; Zahedi, P.; de Boer, J.; Weijdema, F.; Kramer, B.; Huijts, M.; Hoogerwerf, M.; Ferdinands, G. ASReview: Open Source Software for Efficient and Transparent Active Learning for Systematic Reviews. arXiv 2020, arXiv:2006.12166. [Google Scholar]
Khalil, H.; Ameen, D.; Zarnegar, A. Tools to Support the Automation of Systematic Reviews: A Scoping Review. J. Clin. Epidemiol. 2022, 144, 22–42. [Google Scholar] [CrossRef]
Nentidis, A.; Krithara, A.; Paliouras, G.; Gasco, L.; Krallinger, M. BioASQ at CLEF2022: The Tenth Edition of the Large-Scale Biomedical Semantic Indexing and Question Answering Challenge. In European Conference on Information Retrieval; Springer: Cham, Switzerland, 2022; pp. 429–435. [Google Scholar]
Esteva, A.; Kale, A.; Paulus, R.; Hashimoto, K.; Yin, W.; Radev, D.; Socher, R. COVID-19 Information Retrieval with Deep-Learning Based Semantic Search, Question Answering, and Abstractive Summarization. NPJ Digit. Med. 2021, 4, 68. [Google Scholar] [CrossRef]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. Adv. Neural Inf. Process. Syst. 2017, 30, 6000–6010. [Google Scholar]
Noh, J.; Kavuluru, R. Improved Biomedical Word Embeddings in the Transformer Era. J. Biomed. Inform. 2021, 120, 103867. [Google Scholar] [CrossRef] [PubMed]
Wu, X.; Nguyen, T.; Luu, A.T. A Survey on Neural Topic Models: Methods, Applications, and Challenges. Artif. Intell. Rev. 2024, 57, 18. [Google Scholar] [CrossRef]
Likhareva, D.; Sankaran, H.; Thiyagarajan, S. Empowering Interdisciplinary Research with BERT-Based Models: An Approach Through SciBERT-CNN with Topic Modeling. arXiv 2024, arXiv:2404.13078. [Google Scholar]
Grootendorst, M. BERTopic: Neural Topic Modeling with a Class-Based TF-IDF Procedure. arXiv 2022, arXiv:2203.05794. [Google Scholar]
Lee, Y.-G.; Kim, S. A Comparative Study on Topic Modeling of LDA, Top2Vec, and BERTopic Models Using LIS Journals in WoS. J. Korean Soc. Libr. Inf. Sci. 2024, 58, 5–30. [Google Scholar]
Arora, S.; May, A.; Zhang, J.; Ré, C. Contextual Embeddings: When Are They Worth It? arXiv 2020, arXiv:2005.09117. [Google Scholar]
Raman, R.; Pattnaik, D.; Hughes, L.; Nedungadi, P. Unveiling the Dynamics of AI Applications: A Review of Reviews Using Scientometrics and BERTopic Modeling. J. Innov. Knowl. 2024, 9, 100517. [Google Scholar] [CrossRef]
Karabacak, M.; Margetis, K. Natural Language Processing Reveals Research Trends and Topics in The Spine Journal over Two Decades: A Topic Modeling Study. Spine J. 2024, 24, 397–405. [Google Scholar] [CrossRef] [PubMed]
Samsir, S.; Saragih, R.S.; Subagio, S.; Aditiya, R.; Watrianthos, R. BERTopic Modeling of Natural Language Processing Abstracts: Thematic Structure and Trajectory. J. Media Inform. Budidarma 2023, 7, 1514. [Google Scholar] [CrossRef]
Karabacak, M.; Jagtiani, P.; Carrasquilla, A.; Jain, A.; Germano, I.M.; Margetis, K. Simplifying Synthesis of the Expanding Glioblastoma Literature: A Topic Modeling Approach. J. Neurooncol. 2024, 169, 601–611. [Google Scholar] [CrossRef] [PubMed]
Calciolari, E.; Corbella, S.; Gkranias, N.; Viganó, M.; Sculean, A.; Donos, N. Efficacy of Biomaterials for Lateral Bone Augmentation Performed with Guided Bone Regeneration. A Network Meta-analysis. Periodontology 2000 2023, 93, 77–106. [Google Scholar] [CrossRef] [PubMed]
Donos, N.; Calciolari, E.; Ghuman, M.; Baccini, M.; Sousa, V.; Nibali, L. The Efficacy of Bone Reconstructive Therapies in the Management of Peri-Implantitis. A Systematic Review and Meta-Analysis. J. Clin. Periodontol. 2023, 50, 285–316. [Google Scholar] [CrossRef] [PubMed]
Bassi, S. A Primer on Python for Life Science Researchers. PLoS Comput. Biol. 2007, 3, e199. [Google Scholar] [CrossRef] [PubMed]
Jia, Z.; Maggioni, M.; Smith, J.; Scarpazza, D.P. Dissecting the NVidia Turing T4 GPU via Microbenchmarking. arXiv 2019, arXiv:1903.07486. [Google Scholar]
Cook, D.A.; Beckman, T.J.; Bordage, G. A Systematic Review of Titles and Abstracts of Experimental Studies in Medical Education: Many Informative Elements Missing. Med. Educ. 2007, 41, 1074–1081. [Google Scholar] [CrossRef] [PubMed]
Hartley, J. Planning That Title: Practices and Preferences for Titles with Colons in Academic Articles. Libr. Inf. Sci. Res. 2007, 29, 553–568. [Google Scholar] [CrossRef]
Guizzardi, S.; Colangelo, M.T.; Mirandola, P.; Galli, C. Modeling New Trends in Bone Regeneration, Using the BERTopic Approach. Regen. Med. 2023, 18, 719–734. [Google Scholar] [CrossRef] [PubMed]
Saif, H.; Fernandez, M.; He, Y.; Alani, H. On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter. In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland, 26–31 May 2014. [Google Scholar]
Gutiérrez, L.; Keith, B. A Systematic Literature Review on Word Embeddings. In Trends and Applications in Software Engineering: Proceedings of the 7th International Conference on Software Process Improvement (CIMPS 2018) 7; Springer: Cham, Switzerland, 2019; pp. 132–141. [Google Scholar]
Wang, S.; Zhou, W.; Jiang, C. A Survey of Word Embeddings Based on Deep Learning. Computing 2020, 102, 717–740. [Google Scholar] [CrossRef]
Liu, Q.; Kusner, M.J.; Blunsom, P. A Survey on Contextual Embeddings. arXiv 2020, arXiv:2003.07278. [Google Scholar]
McInnes, L.; Healy, J.; Melville, J. Umap: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv 2018, arXiv:1802.03426. [Google Scholar]
McInnes, L.; Healy, J.; Astels, S. Hdbscan: Hierarchical Density Based Clustering. J. Open Source Softw. 2017, 2, 205. [Google Scholar] [CrossRef]
Qaiser, S.; Ali, R. Text Mining: Use of TF-IDF to Examine the Relevance of Words to Documents. Int. J. Comput. Appl. 2018, 181, 25–29. [Google Scholar] [CrossRef]
Xu, D.D.; Wu, S.B. An Improved TFIDF Algorithm in Text Classification. Appl. Mech. Mater. 2014, 651, 2258–2261. [Google Scholar] [CrossRef]
Raschka, S.; Patterson, J.; Nolet, C. Machine Learning in Python: Main Developments and Technology Trends in Data Science, Machine Learning, and Artificial Intelligence. Information 2020, 11, 193. [Google Scholar] [CrossRef]
Issa, B.; Jasser, M.B.; Chua, H.N.; Hamzah, M. A Comparative Study on Embedding Models for Keyword Extraction Using KeyBERT Method. In Proceedings of the 2023 IEEE 13th International Conference on System Engineering and Technology (ICSET), Shah Alam, Malaysia, 2 October 2023; pp. 40–45. [Google Scholar]
Thirunavukarasu, A.J.; Ting, D.S.J.; Elangovan, K.; Gutierrez, L.; Tan, T.F.; Ting, D.S.W. Large Language Models in Medicine. Nat. Med. 2023, 29, 1930–1940. [Google Scholar] [CrossRef]
Teknium Teknium/OpenHermes-2.5-Mistral-7B. Available online: https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B (accessed on 10 February 2024).
Kaddour, J.; Harris, J.; Mozes, M.; Bradley, H.; Raileanu, R.; McHardy, R. Challenges and Applications of Large Language Models. arXiv 2023, arXiv:2307.10169. [Google Scholar]
Park, S.; Choi, J.; Lee, S.; Kang, U. A Comprehensive Survey of Compression Algorithms for Language Models. arXiv 2024, arXiv:2401.15347. [Google Scholar]
Meskó, B. Prompt Engineering as an Important Emerging Skill for Medical Professionals: Tutorial. J. Med. Internet Res. 2023, 25, e50638. [Google Scholar] [CrossRef]
Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
Waskom, M. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
McInnes, L. DataMapPlot. Available online: https://github.com/TutteInstitute/datamapplot (accessed on 10 March 2024).
Galli, C.; Donos, N.; Calciolari, E. Performance of 4 Pre-Trained Sentence Transformer Models in the Semantic Query of a Systematic Review Dataset on Peri-Implantitis. Information 2024, 15, 68. [Google Scholar] [CrossRef]
AbuRahma, A.F.; Robinson, P.A.; Jennings, T.G. Carotid-Subclavian Bypass Grafting with Polytetrafluoroethylene Grafts for Symptomatic Subclavian Artery Stenosis or Occlusion: A 20-Year Experience. J. Vasc. Surg. 2000, 32, 411–419. [Google Scholar] [CrossRef]
Stewart, R.; Horwitz, B.; Howes, J.; Novack, G.D.; Hart, K. Double-Masked, Placebo-Controlled Evaluation of Loteprednol Etabonate 0.5 for Postoperative Inflammation. J. Cataract. Refract. Surg. 1998, 24, 1480–1489. [Google Scholar] [CrossRef]
Senova, S.; Hosomi, K.; Gurruchaga, J.-M.; Gouello, G.; Ouerchefani, N.; Beaugendre, Y.; Lepetit, H.; Lefaucheur, J.-P.; Badin, R.A.; Dauguet, J. Three-Dimensional SPACE Fluid-Attenuated Inversion Recovery at 3 T to Improve Subthalamic Nucleus Lead Placement for Deep Brain Stimulation in Parkinson’s Disease: From Preclinical to Clinical Studies. J. Neurosurg. 2016, 125, 472–480. [Google Scholar] [CrossRef] [PubMed]
Almugathwi, M.; Wearden, A.; Green, K.; Hill-Feltham, P.; Powell, R. Online Support Group Users’ Perceptions and Experiences of Bone-Anchored Hearing Aids (BAHAs): A Qualitative Study. Int. J. Audiol. 2020, 59, 850–858. [Google Scholar] [CrossRef] [PubMed]
Andersen, H.; Aass, A.M.; Wohlfahrt, J.C. Porous Titanium Granules in the Treatment of Peri-Implant Osseous Defects—A 7-Year Follow-up Study. Int. J. Implant. Dent. 2017, 3, 50. [Google Scholar] [CrossRef] [PubMed]
Jepsen, K.; Jepsen, S.; Laine, M.L.; Anssari Moin, D.; Pilloni, A.; Zeza, B.; Sanz, M.; Ortiz-Vigon, A.; Roos-Jansåker, A.M.; Renvert, S. Reconstruction of Peri-Implant Osseous Defects: A Multicenter Randomized Trial. J. Dent. Res. 2016, 95, 58–66. [Google Scholar] [CrossRef] [PubMed]
Wohlfahrt, J.C.; Lyngstadaas, S.P.; Rønold, H.J.; Saxegaard, E.; Ellingsen, J.E.; Karlsson, S.; Aass, A.M. Porous Titanium Granules in the Surgical Treatment of Peri-Implant Osseous Defects: A Randomized Clinical Trial. Int. J. Oral. Maxillofac. Implant. 2012, 27, 401. [Google Scholar]
Emanuel, N.; Machtei, E.E.; Reichart, M.; Shapira, L. D-PLEX500: A Local Biodegradable Prolonged Release Doxycycline-Formulated Bone Graft for the Treatment for Peri-Implantitis. A Randomized Controlled Clinical Study. Quintessence Int. 2020, 51, 546–553. [Google Scholar] [PubMed]
Renvert, S.; Giovannoli, J.; Roos-Jansåker, A.; Rinke, S. Surgical Treatment of Peri-implantitis with or without a Deproteinized Bovine Bone Mineral and a Native Bilayer Collagen Membrane: A Randomized Clinical Trial. J. Clin. Periodontol. 2021, 48, 1312–1321. [Google Scholar] [CrossRef] [PubMed]
Isehed, C.; Holmlund, A.; Renvert, S.; Svenson, B.; Johansson, I.; Lundberg, P. Effectiveness of Enamel Matrix Derivative on the Clinical and Microbiological Outcomes Following Surgical Regenerative Treatment of Peri-implantitis. A Randomized Controlled Trial. J. Clin. Periodontol. 2016, 43, 863–873. [Google Scholar] [CrossRef] [PubMed]
Isehed, C.; Svenson, B.; Lundberg, P.; Holmlund, A. Surgical Treatment of Peri-implantitis Using Enamel Matrix Derivative, an RCT: 3-and 5-year Follow-up. J. Clin. Periodontol. 2018, 45, 744–753. [Google Scholar] [CrossRef] [PubMed]
Renvert, S.; Roos-Jansåker, A.; Persson, G.R. Surgical Treatment of Peri-implantitis Lesions with or without the Use of a Bone Substitute—A Randomized Clinical Trial. J. Clin. Periodontol. 2018, 45, 1266–1274. [Google Scholar] [CrossRef]
Nct Peri-Implantitis—Reconstructive Surgical Therapy. Available online: https://clinicaltrials.gov/show/NCT03077061 2017 (accessed on 10 April 2022).
Froum, S.J.; Froum, S.H.; Rosen, P.S. A Regenerative Approach to the Successful Treatment of Peri-Implantitis: A Consecutive Series of 170 Implants in 100 Patients with 2-to 10-Year Follow-Up. Int. J. Periodontics Restor. Dent. 2015, 35, 857. [Google Scholar] [CrossRef] [PubMed]
Gonzalez Regueiro, I.; Martinez Rodriguez, N.; Barona Dorado, C.; Sanz-Sánchez, I.; Montero, E.; Ata-Ali, J.; Duarte, F.; Martínez-González, J.M. Surgical Approach Combining Implantoplasty and Reconstructive Therapy with Locally Delivered Antibiotic in the Treatment of Peri-implantitis: A Prospective Clinical Case Series. Clin. Implant. Dent. Relat. Res. 2021, 23, 864–873. [Google Scholar] [CrossRef] [PubMed]
Isler, S.C.; Soysal, F.; Ceyhanlı, T.; Bakırarar, B.; Unsal, B. Regenerative Surgical Treatment of Peri-implantitis Using Either a Collagen Membrane or Concentrated Growth Factor: A 12-month Randomized Clinical Trial. Clin. Implant. Dent. Relat. Res. 2018, 20, 703–712. [Google Scholar] [CrossRef] [PubMed]
La Monaca, G.; Pranno, N.; Annibali, S.; Cristalli, M.P.; Polimeni, A. Clinical and Radiographic Outcomes of a Surgical Reconstructive Approach in the Treatment of Peri-implantitis Lesions: A 5-year Prospective Case Series. Clin. Oral. Implant. Res. 2018, 29, 1025–1037. [Google Scholar] [CrossRef]
Mercado, F.; Hamlet, S.; Ivanovski, S. Regenerative Surgical Therapy for Peri-implantitis Using Deproteinized Bovine Bone Mineral with 10% Collagen, Enamel Matrix Derivative and Doxycycline—A Prospective 3-year Cohort Study. Clin. Oral. Implant. Res. 2018, 29, 583–591. [Google Scholar] [CrossRef] [PubMed]
Polymeri, A.; Anssari-Moin, D.; van der Horst, J.; Wismeijer, D.; Laine, M.L.; Loos, B.G. Surgical Treatment of Peri-implantitis Defects with Two Different Xenograft Granules: A Randomized Clinical Pilot Study. Clin. Oral. Implant. Res. 2020, 31, 1047–1060. [Google Scholar] [CrossRef]
Roccuzzo, M.; Gaudioso, L.; Lungo, M.; Dalmasso, P. Surgical Therapy of Single Peri-implantitis Intrabony Defects, by Means of Deproteinized Bovine Bone Mineral with 10% Collagen. J. Clin. Periodontol. 2016, 43, 311–318. [Google Scholar] [CrossRef] [PubMed]
Roccuzzo, M.; Mirra, D.; Pittoni, D.; Ramieri, G.; Roccuzzo, A. Reconstructive Treatment of Peri-implantitis Infrabony Defects of Various Configurations: 5-year Survival and Success. Clin. Oral. Implant. Res. 2021, 32, 1209–1217. [Google Scholar] [CrossRef]
Isrctn Reconstructive Surgical Therapy of Peri-Implantitis Bone Defects. Available online: https://trialsearch.who.int/Tri-al2.aspx?TrialID=ISRCTN67095066 2019 (accessed on 10 April 2022).
Aghazadeh, A.; Rutger Persson, G.; Renvert, S. A Single-centre Randomized Controlled Clinical Trial on the Adjunct Treatment of Intra-bony Defects with Autogenous Bone or a Xenograft: Results after 12 Months. J. Clin. Periodontol. 2012, 39, 666–673. [Google Scholar] [CrossRef] [PubMed]
Aghazadeh, A.; Persson, R.G.; Renvert, S. Impact of Bone Defect Morphology on the Outcome of Reconstructive Treatment of Peri-Implantitis. Int. J. Implant. Dent. 2020, 6, 1–10. [Google Scholar] [CrossRef] [PubMed]
Nct Evaluation of Photodynamic Therapy in Treatment of Peri-Implantitis. Available online: https://clinicaltrials.gov/show/NCT05187663 2022 (accessed on 10 April 2022).
Roos-Jansåker, A.; Renvert, H.; Lindahl, C.; Renvert, S. Submerged Healing Following Surgical Treatment of Peri-implantitis: A Case Series. J. Clin. Periodontol. 2007, 34, 723–727. [Google Scholar] [CrossRef] [PubMed]
Roos-Jansåker, A.; Lindahl, C.; Persson, G.R.; Renvert, S. Long-term Stability of Surgical Bone Regenerative Procedures of Peri-implantitis Lesions in a Prospective Case–Control Study over 3 Years. J. Clin. Periodontol. 2011, 38, 590–597. [Google Scholar] [CrossRef] [PubMed]
Roos-Jansåker, A.; Persson, G.R.; Lindahl, C.; Renvert, S. Surgical Treatment of Peri-implantitis Using a Bone Substitute with or without a Resorbable Membrane: A 5-year Follow-up. J. Clin. Periodontol. 2014, 41, 1108–1114. [Google Scholar] [CrossRef]
Cottrell, D.A.; Wolford, L.M. Long-Term Evaluation of the Use of Coralline Hydroxyapatite in Orthognathic Surgery. J. Oral Maxillofac. Surg. 1998, 56, 935–941. [Google Scholar] [CrossRef]
Di Rienzo, A.; Colasanti, R.; Gladi, M.; Dobran, M.; Della Costanza, M.; Capece, M.; Veccia, S.; Iacoangeli, M. Timing of Cranial Reconstruction after Cranioplasty Infections: Are We Ready for a Re-Thinking? A Comparative Analysis of Delayed versus Immediate Cranioplasty after Debridement in a Series of 48 Patients. Neurosurg. Rev. 2021, 44, 1523–1532. [Google Scholar] [CrossRef]
Eppley, B.L.; Sadove, A.M.; Holmstrom, H.; Kahnberg, K.-E. HTR^® Polymer Facial Implants: A Five-Year Clinical Experience. Aesthetic Plast. Surg. 1995, 19, 445–450. [Google Scholar] [CrossRef]
Godin, M.; Costa, L.; Romo, T.; Truswell, W.; Wang, T.; Williams, E. Gore-Tex Chin Implants: A Review of 324 Cases. Arch. Facial Plast. Surg. 2003, 5, 224–227. [Google Scholar] [CrossRef] [PubMed]
Jansma, J.; Schepers, R.H.; Vissink, A. The Application of Alloplastic Materials for Augmentation in Cosmetic Facial Surgery. Ned. Tijdschr. Tandheelkd. 2014, 121, 565–570. [Google Scholar] [CrossRef] [PubMed]
Yasuhara, T.; Murai, S.; Mikuni, N.; Miyamoto, S.; Date, I. Japanese National Questionnaire Survey in 2018 on Complications Related to Cranial Implants in Neurosurgery. Neurol. Med. Chir. 2020, 60, 337–350. [Google Scholar] [CrossRef] [PubMed]
Kitridis, D.; Givissis, P.; Chalidis, B. Timing of Tibial Tubercle Osteotomy in Two-Stage Revision of Infected Total Knee Arthroplasty Does Not Affect Union and Reinfection Rate. A Systematic Review. Knee 2020, 27, 1787–1794. [Google Scholar] [CrossRef] [PubMed]
Lee, J.H.; Kim, S.K.; Kang, S.S.; Han, S.J.; Lee, C.-K.; Chang, B.-S. A Long-Term Follow-up, Multicenter, Comparative Study of the Radiologic, and Clinical Results between a CaO-SiO₂-P₂O₅-B₂O₃ Bioactive Glass Ceramics (BGS-7) Intervertebral Spacer and Titanium Cage in 1-Level Posterior Lumbar Interbody Fusion. Clin. Spine Surg. 2020, 33, E322–E329. [Google Scholar] [CrossRef]
Kaliyev, R.; Lesbekov, T.; Bekbossynov, S.; Nurmykhametova, Z.; Bekbossynova, M.; Novikova, S.; Medressova, A.; Smagulov, N.; Faizov, L.; Samalavicius, R. Heart Transplantation of Patients with Ventricular Assist Devices: Impact of Normothermic Ex-Vivo Preservation Using Organ Care System Compared with Cold Storage. J. Cardiothorac. Surg. 2020, 15, 323. [Google Scholar] [CrossRef]
Naenni, N.; Stucki, L.; Hüsler, J.; Schneider, D.; Hämmerle, C.H.F.; Jung, R.E.; Thoma, D.S. Implants Sites with Concomitant Bone Regeneration Using a Resorbable or Non-resorbable Membrane Result in Stable Marginal Bone Levels and Similar Profilometric Outcomes over 5 Years. Clin. Oral. Implants Res. 2021, 32, 893–904. [Google Scholar] [CrossRef]
Basler, T.; Naenni, N.; Schneider, D.; Hämmerle, C.H.F.; Jung, R.E.; Thoma, D.S. Randomized Controlled Clinical Study Assessing Two Membranes for Guided Bone Regeneration of Peri-implant Bone Defects: 3-year Results. Clin. Oral. Implant. Res. 2018, 29, 499–507. [Google Scholar] [CrossRef] [PubMed]
Mau, J.L.; Grodin, E.; Lin, J.; Chen, M.C.; Ho, C.; Cochran, D. A Comparative, Randomized, Prospective, Two-center Clinical Study to Evaluate the Clinical and Esthetic Outcomes of Two Different Bone Grafting Techniques in Early Implant Placement. J. Periodontol. 2019, 90, 247–255. [Google Scholar] [CrossRef] [PubMed]
Annen, B.M.; Ramel, C.F.; Hammerle, C.H.F.; Jung, R.E. Use of a New Cross-Linked Collagen Membrane for the Treatment of Peri-Implant Dehiscence Defects: A Randomised Controlled Double-Blinded Clinical Trial. Eur. J. Oral. Implantol. 2011, 4, 87. [Google Scholar] [PubMed]
Naenni, N.; Schneider, D.; Jung, R.E.; Hüsler, J.; Hämmerle, C.H.F.; Thoma, D.S. Randomized Clinical Study Assessing Two Membranes for Guided Bone Regeneration of Peri-implant Bone Defects: Clinical and Histological Outcomes at 6 Months. Clin. Oral. Implant. Res. 2017, 28, 1309–1317. [Google Scholar] [CrossRef] [PubMed]
Lee, J.-H.; Lee, J.-S.; Baek, W.-S.; Lim, H.-C.; Cha, J.-K.; Choi, S.-H.; Jung, U.-W. Assessment of Dehydrothermally Cross-linked Collagen Membrane for Guided Bone Regeneration around Peri-Implant Dehiscence Defects: A Randomized Single-Blinded Clinical Trial. J. Periodontal Implant. Sci. 2015, 45, 229. [Google Scholar] [CrossRef] [PubMed]
Lee, J.-H.; Park, S.-H.; Kim, D.-H.; Jung, U.-W. Assessment of Clinical and Radiographic Outcomes of Guided Bone Regeneration with Dehydrothermally Cross-Linked Collagen Membrane around Peri-Implant Dehiscence Defects: Results from a 3-Year Randomized Clinical Trial. Oral Biol. Res. 2019, 43, 8–16. [Google Scholar] [CrossRef]
Becker, J.; Al-Nawas, B.; Klein, M.O.; Schliephake, H.; Terheyden, H.; Schwarz, F. Use of a New Cross-linked Collagen Membrane for the Treatment of Dehiscence-type Defects at Titanium Implants: A Prospective, Randomized-controlled Double-blinded Clinical Multicenter Study. Clin. Oral. Implant. Res. 2009, 20, 742–749. [Google Scholar] [CrossRef] [PubMed]
Schwarz, F.; Schmucker, A.; Becker, J. Long-term Outcomes of Simultaneous Guided Bone Regeneration Using Native and Cross-linked Collagen Membranes after 8 Years. Clin. Oral. Implant. Res. 2017, 28, 779–784. [Google Scholar] [CrossRef] [PubMed]
Schwarz, F.; Hegewald, A.; Sahm, N.; Becker, J. Long-term Follow-up of Simultaneous Guided Bone Regeneration Using Native and Cross-linked Collagen Membranes over 6 Years. Clin. Oral. Implant. Res. 2014, 25, 1010–1015. [Google Scholar] [CrossRef] [PubMed]
Schwarz, F.; Sahm, N.; Becker, J. Impact of the Outcome of Guided Bone Regeneration in Dehiscence-type Defects on the Long-term Stability of Peri-implant Health: Clinical Observations at 4 Years. Clin. Oral. Implant. Res. 2012, 23, 191–196. [Google Scholar] [CrossRef] [PubMed]
Benic, G.I.; Eisner, B.M.; Jung, R.E.; Basler, T.; Schneider, D.; Hämmerle, C.H.F. Hard Tissue Changes after Guided Bone Regeneration of Peri-implant Defects Comparing Block versus Particulate Bone Substitutes: 6-month Results of a Randomized Controlled Clinical Trial. Clin. Oral. Implant. Res. 2019, 30, 1016–1026. [Google Scholar] [CrossRef]
Carpio, L.; Loza, J.; Lynch, S.; Genco, R. Guided Bone Regeneration around Endosseous Implants with Anorganic Bovine Bone Mineral. A Randomized Controlled Trial Comparing Bioabsorbable versus Non-resorbable Barriers. J. Periodontol. 2000, 71, 1743–1749. [Google Scholar] [CrossRef]
Deesricharoenkiat, N.; Jansisyanont, P.; Chuenchompoonut, V.; Mattheos, N.; Thunyakitpisal, P. The Effect of Acemannan in Implant Placement with Simultaneous Guided Bone Regeneration in the Aesthetic Zone: A Randomized Controlled Trial. Int. J. Oral. Maxillofac. Surg. 2022, 51, 535–544. [Google Scholar] [CrossRef]
Jung, R.E.; Glauser, R.; Schärer, P.; Hämmerle, C.H.F.; Sailer, H.F.; Weber, F.E. Effect of RhBMP-2 on Guided Bone Regeneration in Humans: A Randomized, Controlled Clinical and Histomorphometric Study. Clin. Oral. Implant. Res. 2003, 14, 556–568. [Google Scholar] [CrossRef] [PubMed]
Jung, R.E.; Windisch, S.I.; Eggenschwiler, A.M.; Thoma, D.S.; Weber, F.E.; Hämmerle, C.H.F. A Randomized-controlled Clinical Trial Evaluating Clinical and Radiological Outcomes after 3 and 5 Years of Dental Implants Placed in Bone Regenerated by Means of GBR Techniques with or without the Addition of BMP-2. Clin. Oral. Implant. Res. 2009, 20, 660–666. [Google Scholar] [CrossRef] [PubMed]
Jung, R.E.; Kovacs, M.N.; Thoma, D.S.; Hämmerle, C.H.F. Informative Title: Guided Bone Regeneration with and without RhBMP-2: 17-year Results of a Randomized Controlled Clinical Trial. Clin. Oral. Implant. Res. 2022, 33, 302–312. [Google Scholar] [CrossRef] [PubMed]
Jung, R.E.; Hälg, G.A.; Thoma, D.S.; Hämmerle, C.H.F. A Randomized, Controlled Clinical Trial to Evaluate a New Membrane for Guided Bone Regeneration around Dental Implants. Clin. Oral. Implant. Res. 2009, 20, 162–168. [Google Scholar] [CrossRef]
Ramel, C.F.; Wismeijer, D.A.; F Hämmerle, C.H.; Jung, R.E. A Randomized, Controlled Clinical Evaluation of a Synthetic Gel Membrane for Guided Bone Regeneration around Dental Implants: Clinical and Radiologic 1-and 3-Year Results. Int. J. Oral Maxillofac. Implant. 2012, 27, 435. [Google Scholar]
Jung, R.E.; Benic, G.I.; Scherrer, D.; Hämmerle, C.H.F. Cone Beam Computed Tomography Evaluation of Regenerated Buccal Bone 5 Years after Simultaneous Implant Placement and Guided Bone Regeneration Procedures–a Randomized, Controlled Clinical Trial. Clin. Oral. Implant. Res. 2015, 26, 28–34. [Google Scholar] [CrossRef] [PubMed]
Jung, R.E.; Mihatovic, I.; Cordaro, L.; Windisch, P.; Friedmann, A.; Blanco Carrion, J.; Sanz Sanchez, I.; Hallman, M.; Quirynen, M.; Hammerle, C.H.F. Comparison of a Polyethylene Glycol Membrane and a Collagen Membrane for the Treatment of Bone Dehiscence Defects at Bone Level Implants—A Prospective, Randomized, Controlled, Multicenter Clinical Trial. Clin. Oral. Implant. Res. 2020, 31, 1105–1115. [Google Scholar] [CrossRef] [PubMed]
Benic, G.I.; Bienz, S.P.; Song, Y.W.; Cha, J.; Hämmerle, C.H.F.; Jung, U.; Jung, R.E. Randomized Controlled Clinical Trial Comparing Guided Bone Regeneration of Peri-implant Defects with Soft-type Block versus Particulate Bone Substitutes: Six-month Results of Hard-tissue Changes. J. Clin. Periodontol. 2022, 49, 480–495. [Google Scholar] [CrossRef] [PubMed]
Lee, D.-W.; Kim, K.-T.; Joo, Y.-S.; Yoo, M.-K.; Yu, J.-A.; Ryu, J.-J. The Role of Two Different Collagen Membranes for Dehiscence Defect around Implants in Humans. J. Oral Implantol. 2015, 41, 445–448. [Google Scholar] [CrossRef]
Mattout, P.; Nowzari, H.; Mattout, C. Clinical Evaluation of Guided Bone Regeneration at Exposed Parts of Brånemark Dental Implants with and without Bone Allograft. Clin. Oral. Implant. Res. 1995, 6, 189–195. [Google Scholar] [CrossRef] [PubMed]
Merli, M.; Moscatelli, M.; Mariotti, G.; Pagliaro, U.; Raffaelli, E.; Nieri, M. Comparing Membranes and Bone Substitutes in a One-Stage Procedure for Horizontal Bone Augmentation. A Double-Blind Randomised Controlled Trial. Eur. J. Oral. Implantol. 2015, 8, 271. [Google Scholar] [PubMed]
Merli, M.; Moscatelli, M.; Mariotti, G.; Pagliaro, U.; Raffaelli, E.; Nieri, M. Comparing Membranes and Bone Substitutes in a One-Stage Procedure for Horizontal Bone Augmentation. Three-Year Post-Loading Results of a Double-Blind Randomised Controlled Trial. Eur. J. Oral. Implantol. 2018, 11, 441. [Google Scholar]
Park, S.; Lee, K.; Oh, T.; Misch, C.E.; Shotwell, J.; Wang, H. Effect of Absorbable Membranes on Sandwich Bone Augmentation. Clin. Oral. Implant. Res. 2008, 19, 32–41. [Google Scholar] [CrossRef] [PubMed]
Schneider, D.; Weber, F.E.; Grunder, U.; Andreoni, C.; Burkhardt, R.; Jung, R.E. A Randomized Controlled Clinical Multicenter Trial Comparing the Clinical and Histological Performance of a New, Modified Polylactide-co-glycolide Acid Membrane to an Expanded Polytetrafluorethylene Membrane in Guided Bone Regeneration Procedures. Clin. Oral. Implant. Res. 2014, 25, 150–158. [Google Scholar] [CrossRef]
Temmerman, A.; Cortellini, S.; Van Dessel, J.; De Greef, A.; Jacobs, R.; Dhondt, R.; Teughels, W.; Quirynen, M. Bovine-derived Xenograft in Combination with Autogenous Bone Chips versus Xenograft Alone for the Augmentation of Bony Dehiscences around Oral Implants: A Randomized, Controlled, Split-mouth Clinical Trial. J. Clin. Periodontol. 2020, 47, 110–119. [Google Scholar] [CrossRef] [PubMed]
Simion, M.; Misitano, U.; Gionso, L.; Salvato, A. Treatment of Dehiscences and Fenestrations around Dental Implants Using Resorbable and Nonresorbable Membranes Associated with Bone Autografts: A Comparative Clinical Study. Int. J. Oral Maxillofac. Implant. 1997, 12, 1. [Google Scholar]
Urban, I.A.; Wessing, B.; Alández, N.; Meloni, S.; González-Martin, O.; Polizzi, G.; Sanz-Sanchez, I.; Montero, E.; Zechner, W. A Multicenter Randomized Controlled Trial Using a Novel Collagen Membrane for Guided Bone Regeneration at Dehisced Single Implant Sites: Outcome at Prosthetic Delivery and at 1-year Follow-up. Clin. Oral. Implant. Res. 2019, 30, 487–497. [Google Scholar] [CrossRef]
Wessing, B.; Urban, I.; Montero, E.; Zechner, W.; Hof, M.; Alandez Chamorro, J.; Alandez Martin, N.; Polizzi, G.; Meloni, S.; Sanz, M. A Multicenter Randomized Controlled Clinical Trial Using a New Resorbable Non-cross-linked Collagen Membrane for Guided Bone Regeneration at Dehisced Single Implant Sites: Interim Results of a Bone Augmentation Procedure. Clin. Oral. Implant. Res. 2017, 28, e218–e226. [Google Scholar] [CrossRef] [PubMed]
Van Assche, N.; Michels, S.; Naert, I.; Quirynen, M. Randomized Controlled Trial to Compare Two Bone Substitutes in the Treatment of Bony Dehiscences. Clin. Implant. Dent. Relat. Res. 2013, 15, 558–568. [Google Scholar] [CrossRef]
Veis, A.A.; Tsirlis, A.T.; Parisis, N.A. Effect of Autogenous Harvest Site Location on the Outcome of Ridge Augmentation for Implant Dehiscences. Int. J. Periodontics Restor. Dent. 2004, 24, 154. [Google Scholar]
Wen, S.-C.; Fu, J.-H.; Wang, H.-L. Effect of Deproteinized Bovine Bone Mineral at Implant Dehiscence Defects Grafted by the Sandwich Bone Augmentation Technique. Int. J. Periodontics Restor. Dent. 2018, 38, 79–85. [Google Scholar] [CrossRef] [PubMed]
Tsai, Y.; Tsao, J.; Wang, C.; Grodin, E.; Lin, J.; Chen, C.; Ho, C.; Cochran, D.; Mau, J.L.P. Stability of Contour Augmentation of Implant-supported Single Crowns in the Esthetic Zone: One-year Cone-beam Computed Tomography Results of a Comparative, Randomized, Prospective, Two-center Clinical Study Using Two Different Bone Grafting Techniques in Early Implant Placement. J. Periodontol. 2022, 93, 1661–1670. [Google Scholar] [CrossRef] [PubMed]
Wilczynski, N.L.; Haynes, R.B.; Team Hedges. Optimal Search Strategies for Identifying Mental Health Content in MEDLINE: An Analytic Survey. Ann. Gen. Psychiatry 2006, 5, 4. [Google Scholar] [CrossRef] [PubMed]
Zhang, L.; Ajiferuke, I.; Sampson, M. Optimizing Search Strategies to Identify Randomized Controlled Trials in MEDLINE. BMC Med. Res. Methodol. 2006, 6, 23. [Google Scholar] [CrossRef]
Heintz, M.; Hval, G.; Tornes, R.A.; Byelyey, N.; Hafstad, E.; Næss, G.E.; Bakkeli, M. Optimizing the Literature Search: Coverage of Included References in Systematic Reviews in Medline and Embase. J. Med. Libr. Assoc. 2023, 111, 599–605. [Google Scholar] [CrossRef]

Figure 1. Diagram illustrating the workflow used in the present work to model the topics in our datasets. Our initial dataset was in tabular form; titles were converted into embeddings, which were then reduced by UMAP. Reduced embeddings were clustered by HDBSCAN based on their similarity, and keyword descriptors were generated for every cluster by cTF-IDF. A large language model (LL) was then used to create convenient labels for the topic, converting the keywords into a sentence.

Figure 2. Line plot showing the relation between the minimum cluster size setting for HDBSCAN and the number of topics identified by BERTopic in the peri-implantitis dataset, based on the number of neighbors setting in the UMAP dimension reduction algorithm. Red line: n_neighbors = 10; Blue line: n_neighbors = 15; Orange line: n_neighbors = 50; Green line: n_neighbors = 100.

Figure 3. Scatterplot of the semantic distribution of a dataset of titles of scientific articles selected from different biomedical databases using a keyword-based search for peri-implantitis. Titles are not homogeneously distributed but rather form clusters that tend to correspond to topics. Every topic is marked by a different color.

Figure 4. Barchart representing the allocation of the target articles in the peri-implantitis dataset by BERTopic.

Figure 5. Lineplot showing the relation between the minimum cluster size setting for HDBSCAN and the number of topics identified by BERTopic in the bone regeneration dataset, based on the number of neighbors setting in the UMAP dimension reduction algorithm. Red line: n_neighbors = 10; Blue line: n_neighbors = 15; Orange line: n_neighbors = 50; Green line: n_neighbors = 100.

Figure 6. Scatterplot of the semantic distribution of a dataset of titles of scientific articles selected from different biomedical databases using a keyword-based search for bone augmentation. Every topic is marked by a different color.

Figure 7. Barchart representing the allocation of the target articles in the bone regeneration dataset by BERTopic.

Figure 8. Diagram illustrating the workflow proposed in the present work to improve the efficiency of literature searches.

Table 1. The list of topics identified by BERTopic, in order of size, for the peri-implantitis dataset. Non-dental topics are highlighted in bold. The full list can be found as Supplementary Material (Table S1).

Topic	Count	LLM
−1	357	Treating Implant Infections
0	3733	Peri-Implant Bone Study
1	587	Valves and Stents in Coronary Arteries
2	232	Intraocular Lens Inflammation
3	192	Breast Reconstruction and Implants
4	174	Parkinson’s Disease and Deep Brain Stimulation
5	144	Sinus Floor Elevation
6	137	Photodynamic Therapy for Peri-implantitis Treatment
7	132	Implant-retained Mandibular Overdentures
8	127	Orbital Reconstruction Implants
9	96	Cochlear Implantation
10	89	Cervical Fusion and Disc Disease
11	76	Serous Borderline Ovary Tumors.
12	61	Zirconia Implants and Abutments

Table 2. The list of the target articles identified in Donos et al. systematic review on peri-implantitis [36]. No article was clustered in any of the unrelated topic groups.

Authors	Topic	Reference
Andersen, Heidi, Aass, Anne Merete and Wohlfahrt, Johan Caspar	#0 Peri-Implant Bone Study	[65]
Jepsen, K., Jepsen, S., Laine, M. L., Anssari Moin, D., Pilloni, A., Zeza, B., Sanz, M., Ortiz-Vigon, A., Roos-Jansaker, A. M. and Renvert, S.	#0 Peri-Implant Bone Study	[66]
Wohlfahrt, Johan Caspar, Lyngstadaas, Stale Petter, Ronold, Hans Jacob, Saxegaard, Erik, Ellingsen, Jan Eirik, Karlsson, Stig and Aass, Anne Merete	#0 Peri-Implant Bone Study	[67]
Emanuel, Noam, Machtei, Eli E., Reichart, Malka and Shapira, Lior	#0 Peri-Implant Bone Study	[68]
Renvert, Stefan, Giovannoli, Jean-Louis, Roos-Jansaker, Ann-Marie and Rinke, Sven	#0 Peri-Implant Bone Study	[69]
Isehed, C., Holmlund, A., Renvert, S., Svenson, B., Johansson, I. and Lundberg, P.	#0 Peri-Implant Bone Study	[70]
Isehed, C., Svenson, B., Lundberg, P. and Holmlund, A.	#0 Peri-Implant Bone Study	[71]
Renvert, Stefan, Roos-Jansaker, Ann-Marie and Persson, Gosta Rutger	#0 Peri-Implant Bone Study	[72]
Nct	#0 Peri-Implant Bone Study	[73]
Froum, Stuart J., Froum, Scott H. and Rosen, Paul S.	#0 Peri-Implant Bone Study	[74]
Gonzalez Regueiro, Iria, Martinez Rodriguez, Natalia, Barona Dorado, Cristina, Sanz-Sanchez, Ignacio, Montero, Eduardo, Ata-Ali, Javier, Duarte, Fernando and Martinez-Gonzalez, Jose Maria	#0 Peri-Implant Bone Study	[75]
Isler, S.C., Soysal, F., Ceyhanli, T., Bakirarar, B. and Unsal, B.	#0 Peri-Implant Bone Study	[76]
La Monaca, Gerardo, Pranno, Nicola, Annibali, Susanna, Cristalli, Maria Paola and Polimeni, Antonella	#0 Peri-Implant Bone Study	[77]
Mercado, Faustino, Hamlet, Stephen and Ivanovski, Saso	#0 Peri-Implant Bone Study	[78]
Polymeri, Angeliki, Anssari-Moin, David, van der Horst, Joyce, Wismeijer, Daniel, Laine, Marja L. and Loos, Bruno G.	#0 Peri-Implant Bone Study	[79]
Roccuzzo, Mario, Gaudioso, Luigi, Lungo, Marco and Dalmasso, Paola	#0 Peri-Implant Bone Study	[80]
Roccuzzo, Mario, Mirra, Davide, Pittoni, Dario, Ramieri, Guglielmo and Roccuzzo, Andrea	#0 Peri-Implant Bone Study	[81]
Isrctn	#0 Peri-Implant Bone Study	[82]
Aghazadeh, A., Rutger Persson, G. and Renvert, S.	#0 Peri-Implant Bone Study	[83]
Aghazadeh, A., Persson, R.G. and Renvert, S.	#0 Peri-Implant Bone Study	[84]
Nct	#6 Photodynamic Therapy for Peri-implantitis Treatment	[85]
Roos-Jansaker, Ann-Marie, Renvert, Helena, Lindahl, Christel and Renvert, Stefan	#0 Peri-Implant Bone Study	[86]
Roos-Jansaker, Ann-Marie, Lindahl, Christel, Persson, G. Rutger and Renvert, Stefan	#0 Peri-Implant Bone Study	[87]
Roos-Jansaker, Ann-Marie, Persson, Gosta Rutger, Lindahl, Christel and Renvert, Stefan	#0 Peri-Implant Bone Study	[88]

Table 3. The list of topics identified by BERTopic, in order of size, for the bone augmentation dataset. Non-dental topics are highlighted in bold. The full data are found in Supplementary Table S2.

Topic	Count	LLM
−1	1515	Dental Implant Studies
0	727	Dental Implants in Edentulous Patients
1	692	Sinus Floor Elevation
2	296	Hip Arthroplasty
3	265	Alveolar Ridge Augmentation Techniques
4	225	Titanium Implants Surface Acid-Etching Osse
5	167	Ridge Preservation Bone Allograft
6	147	Bone Grafting for Dental Implants
7	146	Soft Tissue Augmentation for Dental Implants
8	140	Peri-Implantitis Treatment
9	138	Implants in Fresh Extraction Sockets
10	117	Ventricular Assist Devices
11	102	Platelet-Rich Fibrin Effects on Dental Implants
12	93	Cervical and Lumbar Fusion Studies
13	89	Bone Regeneration Recombinant Human BMP
14	81	Hydroxyapatite-coated Dental Implant Studies
15	73	Periodontal defect treatment
16	71	Collagen Membranes for Guided Bone Regeneration
17	64	Implant-retained Mandibular Overdentures
18	57	Antibiotic Prophylaxis for Dental Implants
19	54	Bone Anchored Hearing Implant
20	50	Orbital and Retinal Implants

Table 4. The list of the target articles identified in Donos et al. systematic review on bone regeneration [35]. No article was clustered in any of the OffTGs.

Authors	Topic	Reference
Naenni, N., Stucki, L., Hüsler, J., Schneider, D., Hämmerle, C.H., Jung, R.E. and Thoma, D.S.	#16 Collagen Membranes for Guided Bone Regeneration	[98]
Basler, T., Naenni, N., Schneider, D., Hämmerle, C.H., Jung, R.E. and Thoma, D.S.	#16 Collagen Membranes for Guided Bone Regeneration	[99]
Mau, J.L., Grodin, E., Lin, J.J., Chen, M.C.J., Ho, C.H. and Cochran, D.,	#6 Bone Grafting for Dental Implants	[100]
Annen, B.M., Ramel, C.F., Hammerle, C.H.F. and Jung, R.E.	#16 Collagen Membranes for Guided Bone Regeneration	[101]
Naenni, N., Schneider, D., Jung, R.E., Hüsler, J., Hämmerle, C.H. and Thoma, D.S.	#16 Collagen Membranes for Guided Bone Regeneration	[102]
Lee, J.H., Lee, J.S., Baek, W.S., Lim, H.C., Cha, J.K., Choi, S.H. and Jung, U.W.	#16 Collagen Membranes for Guided Bone Regeneration	[103]
Lee, J.H., Park, S.H., Kim, D.H. and Jung, U.W.	#16 Collagen Membranes for Guided Bone Regeneration	[104]
Becker, J., Al-Nawas, B., Klein, M.O., Schliephake, H., Terheyden, H. and Schwarz, F.	-1 Dental Implants Studies	[105]
Schwarz, F., Schmucker, A. and Becker, J.	#16 Collagen Membranes for Guided Bone Regeneration	[106]
Schwarz, F., Hegewald, A., Sahm, N. and Becker, J.	#16 Collagen Membranes for Guided Bone Regeneration	[107]
Schwarz, F., Sahm, N. and Becker, J.	#6 Bone Grafting for Dental Implants	[108]
Benic, G.I., Eisner, B.M., Jung, R.E., Basler, T., Schneider, D. and Hämmerle, C.H.	#6 Bone Grafting for Dental Implants	[109]
Carpio, L., Loza, J., Lynch, S. and Genco, R.	#6 Bone Grafting for Dental Implants	[110]
Deesricharoenkiat, N., Jasilynn, P., Chuenchompoonut, V., Mattheos, N. and Thunyakitpisal, P.	#6 Bone Grafting for Dental Implants	[111]
Jung, R.E., Glauser, R., Schärer, P., Hämmerle, C.H., Sailer, H.F. and Weber, F.E.	-1 Dental Implant Studies	[112]
Jung, R.E., Windisch, S.I., Eggenschwiler, A.M., Thoma, D.S., Weber, F.E. and Hämmerle, C.H.	#6 Bone Grafting for Dental Implants	[113]
Jung, R.E., Kovacs, M.N., Thoma, D.S. and Hämmerle, C.H.	#13 Bone Regeneration Recombinant Human BMP	[114]
Jung, R.E., Hälg, G.A., Thoma, D.S. and Hämmerle, C.H.	#16 Collagen Membranes for Guided Bone Regeneration	[115]
Ramel, C.F., Wismeijer, D.A., F Hämmerle, C.H. and Jung, R.E.	#16 Collagen Membranes for Guided Bone Regeneration	[116]
Jung, R.E., Benic, G.I., Scherrer, D. and Hämmerle, C.H.	#7 Soft Tissue Augmentation for Dental Implants.	[117]
Jung, R.E., Mihatovic, I., Cordaro, L., Windisch, P., Friedmann, A., Blanco Carrion, J., Sanz Sanchez, I., Hallman, M., Quirynen, M. and Hammerle, C.H.	#16 Collagen Membranes for Guided Bone Regeneration	[118]
Benic, G.I., Bienz, S.P., Song, Y.W., Cha, J.K., Hämmerle, C.H., Jung, U.W. and Jung, R.E.	-1 Dental Implant Studies	[119]
Lee, D.W., Kim, K.T., Joo, Y.S., Yoo, M.K., Yu, J.A. and Ryu, J.J.	-1 Dental Implant Studies	[120]
Mattout, P., Nowzari, H. and Mattout, C.	#6 Bone Grafting for Dental Implants	[121]
Merli, M., Moscatelli, M., Mariotti, G., Pagliaro, U., Raffaelli, E. and Nieri, M.	-1 Dental Implant Studies	[122]
Merli, M., Moscatelli, M., Mariotti, G., Pagliaro, U., Raffaelli, E. and Nieri, M.	-1 Dental Implant Studies	[123]
Park, S.H., Lee, K.W., Oh, T.J., Misch, C.E., Shotwell, J. and Wang, H.L.	#16 Collagen Membranes for Guided Bone Regeneration	[124]
Schneider, D., Weber, F.E., Grunder, U., Andreoni, C., Burkhardt, R. and Jung, R.E.	#16 Collagen Membranes for Guided Bone Regeneration	[125]
Temmerman, A., Cortellini, S., Van Dessel, J., De Greef, A., Jacobs, R., Dhondt, R., Teughels, W. and Quirynen, M.	-1 Dental Implant Studies	[126]
Simion, M., Misitano, U., Gionso, L. and Salvato, A.	#16 Collagen Membranes for Guided Bone Regeneration	[127]
Urban, I.A., Wessing, B., Alández, N., Meloni, S., González-Martin, O., Polizzi, G., Sanz-Sanchez, I., Montero, E. and Zechner, W.	#16 Collagen Membranes for Guided Bone Regeneration	[128]
Wessing, B., Urban, I., Montero, E., Zechner, W., Hof, M., Alandez Chamorro, J., Alandez Martin, N., Polizzi, G., Meloni, S. and Sanz, M.	#16 Collagen Membranes for Guided Bone Regeneration	[129]
Van Assche, N., Michels, S., Naert, I. and Quirynen, M.	#6 Bone Grafting for Dental Implants	[130]
Veis, A.A., Tsirlis, A.T. and Parisis, N.A.	-1 Dental Implant Studies	[131]
Wen, S.C., Fu, J.H. and Wang, H.L.	-1 Dental Implant Studies	[132]
Tsai, Y.L., Tsao, J.P., Wang, C.L., Grodin, E., Lin, J.J., Chen, C.J., Ho, C.H., Cochran, D. and Mau, J.L.P.	#0 Dental Implants in Edentulous Patients	[133]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics 2024, 1, 2. https://doi.org/10.3390/metrics1010002

AMA Style

Galli C, Cusano C, Meleti M, Donos N, Calciolari E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics. 2024; 1(1):2. https://doi.org/10.3390/metrics1010002

Chicago/Turabian Style

Galli, Carlo, Claudio Cusano, Marco Meleti, Nikolaos Donos, and Elena Calciolari. 2024. "Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings" Metrics 1, no. 1: 2. https://doi.org/10.3390/metrics1010002

APA Style

Galli, C., Cusano, C., Meleti, M., Donos, N., & Calciolari, E. (2024). Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics, 1(1), 2. https://doi.org/10.3390/metrics1010002

Article Menu

Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings

Abstract

1. Introduction

2. Materials and Methods

2.1. Datasets

2.2. Purpose of the Study

2.3. Data Analysis

3. Results and Discussion

3.1. Peri-Implantitis Dataset Analysis

3.2. Bone Augmentation Dataset Analysis

4. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI