Next Article in Journal
Untangling the Complexity of Two-Component Signal Transduction in Bacteria
Previous Article in Journal
Precipitation Alleviates Adverse Effects of Nitrogen and Phosphorus Enrichment on Soil Microbial Co-Occurrence Network Complexity and Stability in Karst Shrubland
Previous Article in Special Issue
Multilocus Sequence Typing and Antimicrobial Susceptibility of Listeria monocytogenes Isolated from Foods Surveyed in Kosovo
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

GWAS Study Applied to Phenotypically Slow Growth Strains of Listeria monocytogenes Workflow and Outcome

by
Maria Elisabetta De Angelis
1,2,
Giovanna Alessia Robbe
1,*,
Cesare Cammà
1,
Massimo Ancora
1,
Serena Bosica
1,
Mattia Ferrara
1,
Marina Torresi
1,
Frank M. Aarestrup
3,
Alexander Gmeiner
3,
Narong Nuanmuang
3,
Patrick Murigu Kamau Njage
3,
Chiara Bravaccini
4,
Viviana Belardo
4,
Chiara Di Iorio
5,
Silvia Di Zacomo
5,
Paolo Fazii
5,
Francesco Pomilio
1 and
Pimlapas Leekitcharoenphon
3
1
Istituto Zooprofilattico Sperimentale dell’Abruzzo e del Molise “G. Caporale”, 64100 Teramo, Italy
2
Department of Bioscience and Technology for Food, Agriculture and Environment, University of Teramo, Via Balzarini 1, 64100 Teramo, Italy
3
Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
4
UOC Food Microbiology Unit, Istituto Zooprofilattico Sperimentale del Lazio e della Toscana “M. Aleandri”, 00178 Rome, Italy
5
UOC “Microbiologia e Virologia Clinica”, Presidio Ospedaliero “Spirito Santo”, Via Fonte Romana 8, 65123 Pescara, Italy
*
Author to whom correspondence should be addressed.
Microorganisms 2025, 13(9), 2011; https://doi.org/10.3390/microorganisms13092011
Submission received: 2 July 2025 / Revised: 6 August 2025 / Accepted: 10 August 2025 / Published: 28 August 2025
(This article belongs to the Special Issue An Update on Listeria monocytogenes, Third Edition)

Abstract

Listeria monocytogenes (Lm) is a serious public health foodborne pathogen cause of listeriosis, usually in elderly, pregnant and immunocompromised people, linked to consumption of contaminated food, especially ready-to-eat (RTE) products. Different protocols can be used to detect Lm, and ISO11290-1:2017 is the reference method in Europe. Through molecular techniques such as whole genome sequencing (WGS) it is possible to discriminate between Lm strains, which are unequally distributed between clinical cases, food or food related environments, probably also due to enrichment step bias towards some Lm serogroup (IIa) compared to IVb. In the present work a set of Lm strains, detected in clinical cases and food, was investigated to define Lm strains growth ability after incubation in Half Fraser broth, and Genome Wide Association Studies (GWAS) applied to correlate the growth phenotype traits to presence of relevant genes. GWAS enabled the identification of a more relevant cassette of genes associated to a holin region of bacteriophage A118 and the determination of the distribution of relevant genes, highlighted from GWAS analysis within a population of Lm IVb and IIa.

1. Introduction

Listeriosis, caused by the etiological agent Listeria monocytogenes (Lm), is an important zoonotic disease. Listeria monocytogenes is a Gram-positive, facultative intracellular pathogen and the causative agent of listeriosis, a severe foodborne infection with high hospitalization and mortality rates, particularly among immunocompromised individuals, the elderly, pregnant women and neonates. The disease causes gastroenteritis, abortion and neurological syndromes and can lead to death in people with an compromised immune system [1]. It is a ubiquitous and heterogenous species, and many studies tried to find a correlation between Lm strains and their distribution in the environment. Indeed, Lm can be present in farms, the food industry and in a wide range of animals, including humans, who can become infected by ingestion of contaminated food. Generally, Lm is reported in different food sources, but the most relevant one is Ready-to-eat (RTE) foods.
In recent years, several listeriosis outbreaks have occurred in Europe, underscoring the pathogen’s public health significance. For instance, a large outbreak in Denmark between March and April 2024 was linked to contaminated RTE fish products, resulting in over 20 confirmed cases and five deaths. Similarly, a multi-country outbreak in 2022 was associated with contaminated smoked fish products, affecting multiple European countries. These events highlight the urgent need for improved surveillance, risk assessment and control measures for L. monocytogenes in the food chain [2].
Control measures for Lm are important in food industry in order to reduce the risk of transmission of listeriosis. In particular, cleaning and disinfection (C&D) standard operating procedures should be in place, as well as regular testing of food products and the environment, in particular aiming to establish effectiveness of sanitation procedures of the food industry working environment. Serogroup IVb of Lm is predominantly associated with clinical cases, while IIa is more frequently detected in food and food industry-related environment. Also, the distribution of clonal complexes (CCs) is uneven: for instance, CC1, CC2, CC4 or CC6 are often isolated in clinical cases rather than CC9 and CC121, usually detected in the food and food industry [3]. Detecting different strains of Lm is more challenging when multiple strains contaminate one sample simultaneously.
Different protocols are available to detect Lm in food and environmental samples, but in the European Union ISO 11290-1:2017-1 [4] is the reference method for Lm and Listeria spp. detection, and ISO 11290-2:2017-2 [5] is the reference method for enumeration. In agreement with that, the two-step enrichment broths used for the detection of Lm strains are the Half Fraser broth for the first enrichment and after 25 ± 2 h the Fraser broth in the second enrichment.
Given this, a potential bias was investigated towards strains of Lm usually adapted to food and the environment, rather than strains more likely to be involved in clinical cases, hence less represented in food and in the food producing environmental samples. The evaluation of growth ability of different serogroups of Lm strains in broths used for the enrichment step revealed that some strains were able to outgrow others when together in a mixed culture [6].
Furthermore, strain heterogeneity of Lm can influence its detection in food, clinical and environmental samples [7]. Additionally, other factors influencing Lm detection can be the stress state of the cells, for examples viable but not culturable cells (VBNC) [8].
After all, detection of Lm can be challenging, especially in heavily contaminated food, but pure culture of the strain is essential in case of listeriosis outbreaks to track the source of the infection and prevent the spread of the disease. This is more complicated considering the possibility of a sample contaminated with a mixture population of Lm belonging to different serogroups and different CCs. The complexity and heterogeneity of this pathogen at the genomic level is indeed an important challenge. The Whole Genome Sequencing (WGS) analytical method allows the collection of a wide number of data correlated to genomic sequences. It can be used for different aims, such as a deep discrimination of strains and clusters within the same group and determining if a strain belongs to the cluster cause of an outbreak or not [9]. Also, analysis of genomic sequences associated to specific phenotypes through genome wide association study (GWAS) can aid to investigate the genes involved in observation of a specific phenotype trait. Widely used, this investigation method was used previously in an Italian outbreak in the Marche region in 2015 to demonstrate the association between clones causing the disease and those causing the re-infection [10].
The aim of the present work was to study a set of Lm strains, mostly detected in clinical cases and one strain from food, to define growth ability after 24 and 26 h of incubation at 30 °C. Also, through further analysis, we investigated a workflow in order to correlate the growth phenotype traits of tested strains to the presence of relevant genes through GWAS and determine the distribution of relevant genes, highlighted from GWAS analysis within a population of Lm IVb and IIa.

2. Materials and Methods

2.1. Data Collection and Preparation

A total of 251 sequences of Lm [Table S1 in Supplementary Materials] were used in the present study.
In detail, strains were selected between those available from clinical cases and associated food for a total of 41 isolates (one food and 40 clinical), collected between 2015 and 2020, were tested at Istituto Zooprofilattico Sperimentale dell’Abruzzo e del Molise (IZSAM) in order to assess the growth potential of Lm after 24 and 26 h of incubation. Their sequences, together with other 127 sequences from strains linked to food producing environments, previously reported in Guidi et al. [11], were downloaded from National Reference Centre for whole genome sequencing of microbial pathogens: database and bioinformatic analysis (GENPAT), for a total of 168 sequences.
A total of 83 sequences were downloaded from the National Center for Biotechnology Information (NCBI) selected from bibliography using the parameters: growth of Lm in half Fraser broth or Brain heart infusion broth (IZSAM) and stress response to cold storage or treatment (between +7 °C and −20 °C).
For details of sequences used in the present study refer to Table S1 in Supplementary Data.

2.2. Growth Potential Evaluation

2.2.1. Stocks Preparation

As above reported, 41 strains were tested for growth potential. In agreement with ISO 11290-2:2017-2 enumeration of Lm strains growth was done at 24 and 26 h. Strains stored at −80 °C in cryovials were streaked on ALOA agar plate (Liofilchem®, Roseto degli Abruzzi, Italy) and incubated at 37 °C for 24 h. After incubation, a single colony was streaked on Blood Agar (Liofilchem®, Italy) and incubated at 37 °C for 24 h to check purity of the microbank, and after incubation an isolated colony was used to contaminate 10 mL of Brain Heart Infusion Broth (BHI) (Liofilchem®, Italy) and incubated at 37 °C for 24 h. After 24 h, 100 μL of the contaminated broth were inoculated in 10 mL of BHI broth and incubated at 30 °C for 24 h. After 24 h of incubation, an optical density (OD600) measurement of the solution was performed and broth with OD600 > 0.125 was serially diluted four times and enumerated, and at the same time 1 mL stocks of the broth was dispensed in 1.5 mL or 2 mL Eppendorf tubes and then frozen at −20 °C to be used at a later time.

2.2.2. Inoculum Preparation and Contamination of Half Fraser Broth

A known concentration stock was used to obtain a final concentration of about 103 cfu/mL, which was used to contaminate the Half Fraser broth (1:10 ratio) (IZSAM), in agreement with ISO 11290-1:2017-1. Hence, 1 mL of the prepared inoculum was used to contaminate 9 mL of Half Fraser broth supplemented with ferric ammonium citrate as suggested by the producer, for a final expected contamination level of about 100 CFU/mL.

2.2.3. Enumeration and Growth Rate

Contaminated broth was enumerated at time of preparation, and, according to ISO 11290-2:2017-2 maximum and minimum incubation time, after 24 h and 26 h at 30 °C. Enumeration was determined using the excel software MICROINCERT MAXI rev. 6 (https://sites.google.com/site/incertezzamicro/informatica-per-il-laboratorio-2/microincert-maxi, accessed on 18 December 2023). The experiment was conducted in duplicate.

2.2.4. Graphics Analysis

R studio software (version 4.4.1) was used to elaborate the data graphics.
In particular, data analysis was performed using library(readxl), library(dplyr), library(tidyverse), library(writexl), library(tidyr), library(data.table); graphics using library(ggplot2); heatmap visualization using library(pheatmap).
Finally, ITOL gene presence/absence representation was done using table2itol.R—Rscript script for generating input files for iTOL.

2.3. DNA Extraction, Sequencing and Bioinformatic Analysis

DNA was extracted using a Maxwell® 16 tissue DNA purification kit (Promega Italia Srl, Milan, Italy) according to the manufacturer’s protocol for strains of Lm collected before 2018, and the DNA purity was checked by NanoDrop2000 (ThermoFisher Scientific, Waltham, MA, USA). DNA extraction was performed using QIAamp® DNA Mini Kit (Qiagen, Hilden, Germany) following the manufacturer’s protocol with minor modifications according to a previous study [12]. DNA quantity and quality were evaluated with a Qubit fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) and Eppendorf BioSpectrometer fluorescence (Eppendorf s.r.l., Milano, Italy) for strains isolated after 2018. Starting from 100–500 ng of input DNA, the Illumina DNA Prep kit (Illumina, San Diego, CA, USA) was used for library preparation according to the manufacturer’s protocols. WGS was performed on the NextSeq® 500 platform (Illumina®, San Diego, CA, USA) with the NextSeq 500/550 mid-output reagent cartridge v2 (300 cycles, standard 150 bp paired-end reads). For the WGS data analysis, an in-house pipeline [13] was used. The trimming step of raw reads was performed using Trimmomatic [14] and a quality control check of the reads using FastQC v.0.11.5 (https://github.com/s-andrews/FastQC). De novo assembly of paired-end reads was carried out using SPAdes v3.11 (https://github.com/ablab/spades) [15] with default parameters for the Illumina platform 2 × 150 chemistry. QUAST v.4.3 (https://github.com/ablab/quast) was used for checking the quality of the genome assemblies. Assembled genomes of the strains selected from reference public databases were downloaded from NCBI and assembled valued with QUAST v.4.3. All the genomes that met the quality parameters recommended by Timme et al. [16] were used for the subsequent analysis steps. Multilocus sequence typing (MLST), based on the Pasteur scheme, was used to characterize Lm strains and detect the sequence type (ST) and clonal complex (CC) querying the Pasteur Institute platform (https://bigsdb.pasteur.fr/listeria/ (accessed on 10 October 2022). [Project number of used sequences is reported in Table S1 in Supplementary Materials].
Core genome MLST (cgMLST) of Lm was calculated according to the Institut Pasteur’s scheme of 1748 target loci using the chewBBACA allele-calling algorithm [17]. Genomes with at least 1660 called loci (95% of the full scheme) were included in the analysis.

2.4. Genome Annotation and GWAS

Gene annotation was carried out using Prokka [18], with default parameters. GFF files produced by Prokka, including sequences and annotations, were used to determine the pangenome with Panaroo [19]. The genes were considered present or absent based on Panaroo prediction using a 98% identity cut-off after annotation correction and CD realignment.
GWAS analysis was performed using Scoary [20]. A trait presence absence file was created for Scoary analysis to determine more relevant genes involved in the phenotype observation. To reduce the genomic heterogeneity between strains of Lm and increase detection of association between genetic and phenotypic traits, despite the low number of tested strains, analysis was focused on Lm CC1 sequences. A score of 0 was applied, and the strains defined as slow if the strains did not reach an average growth of 8 CFUlog10/mL after 24 h incubation, while a score of 1 was applied to strains defined as normal if strain growth was over 8 CFUlog10/mL after 24 h incubation. Not applicable (NA) was used for strains selected from NCBI, for which growth value were unknown.

2.5. Phylogenetic Analysis

Phylogenetic and gene presence/absence representation was done using iTOL [21]. SNP analysis was performed through an IZSAM in-house pipeline using SNIPPY [22], and a variant call format (.vcf) file was generated for each strain, then the multiple .vcf files were merged into a single .vcf, using BCFTOOLS-merge command [23], and representation in newick format tree was elaborated after SNP analysis using CSI phylogeny [24].

2.6. Gene Alignment and Protein Similarity Visualization

Clinker was used to visualize genes evaluated as most relevant to phenotype observation by Scoary [25]. Protein similarity and gene similarity between strains was performed using tBLASTn and BLASTn, respectively. Heatmap was elaborated using R Studio software.

2.7. Protein Translation and Function

Genes scored as more relevant from GWAS analysis were investigated for protein translation and function using the UniProt BLAST online tool [26].

3. Results

The growth ability of 41 strains was measured after 24 and 26 h, and the results were evaluated. The inoculum preparation at contamination was about 1.87 CFUlog10/mL, with a maximum of 2.71 CFUlog10/mL and a minimum of 1.14 CFUlog10/mL. After 24 h of incubation at 30 °C, the median value was about 8.20 CFUlog10/mL with a maximum of 8.74 CFUlog10/mL and a minimum of 7.60 CFUlog10/mL. Among the strains tested, four CC1, one strain CC217 and one strain CC6 did not reach 8.0 CFUlog10/mL after 24 h incubation, but all the strains reached 8.0 CFUlog10/mL after 26 h incubation, Figure 1.
CC1 strains that did not reach 8.0 CFUlog10/mL after 24 h of incubation were considered as the slow growth phenotype. GWAS analysis was performed on all the tested CC1 strains (n = 35) and a set of CC1 strains selected from NCBI (n = 21). No phylogenetic clustering was observed between slow and normal growth strains after performing SNP analysis Figure 2.
Genome annotation and analysis found a total of 3844 genes and in 3 out of 4 slow growth strains, one relevant group of genes, namely “group_2881”, was highlighted by the GWAS analysis, Figure 3. Group_2881 was present in 15 strains out of 56 (37% of the strains). A gene cassette composed of a total of six genes, namely group_2881, lmo2279, lmo2280, lmo2783, group_3146 and lmo2283, was identified through gene visualization of nearby genes of the group highlighted by Scoary, Figure 4.
The cassette of genes was present in other strains with normal growth, therefore tBLASTn was used to obtain the percentage identity among genes starting from translated proteins, and heatmap was used to visualize the results of the identity between selected genes Figure 5.
A pattern in genes lmo2279 and lmo2280 was evident and it was possible to highlight a decreasing pattern between genes belonging to slow growth strains compared to normal growth strains. Considering a similarity identity under 50% as indicative of gene absence. A similarity greater than 90% in gene lmo2280 gene was observed in 23 out of 56 strains (41%), while it was absent in 29 strains out of 56 (51%). In contrast, lmo2279 was present in 21 strains out of 56 (37%) with a similarity identity over 90% and present with a similarity identity under 50% in 35 strains out of 56 (62%).
GWAS at SNP level allowed the identification of a total of 20 relevant mutations in genes uhp_T, lmo1512_LMRG_01347, lmo1653, lmo1798, lmo2221_LMHCC_0276, lmo2467, lmo0294 and lmo2270, and, noticeably, mutations in lmo2280 and lmo2279 genes were present in only two of the phenotypically slow growth Lm CC1, but not in other strains, Figure 6.
Finally, to evaluate the distribution in different strains of Lm of the cassette genes highlighted as more relevant thorough GWAS analysis (group_2881, lmo2279, lmo2280, lmo2783, group_3146 and lmo2283), more strains of Lm IVb and IIa were analyzed, and BLASTn at different percentages of identity (50%, 70% and 90%) was performed and visualized through a heatmap, Figure 7.
Among the highlighted genes, group_2881 was the most shared gene within the 251 analyzed strains, with a similarity over 90%, followed by gene group_2783. Lmo2279 and lmo2280 were absent in around the 70% of the analyzed strains, followed by gene group_3146 and the lmo2283 gene and, last, group_2783. The use of different thresholds for the analysis (50–70% and 90%) had no evident impact in case of group_2881, lmo2279 and lmo_2280, compared to the rest of the investigated genes.
Among the 251 strains analyzed, 68 strains belong to serogroup IIa, and 183 to serogroup IVb. lmo2279 was present in 13 strains IIa, with a similarity between 50 and 90% and only IVb strains with a similarity over 90%, at a 70% threshold, while lmo2279 was present in 2 strains IIa and 65 IVb with over 90% similarity, at 90% threshold, lmo2280 instead was absent in all IIa, while it was present only in 65 strains IVb (Table 1).
BLAST of highlighted genes from GWAS analysis, lmo2280 and lmo2279 matched as holins bacteriophage A118 proteins associated to a canonical holin pathway for lytic cycle [27].

4. Discussion

Lm is an important foodborne pathogen responsible of listeriosis. Its detection in contaminated food can be difficult, especially in complex food contaminated by multiple bacteria species. Furthermore, detection rate in food or the environment associated to the food industry of clonal complex (CC) usually associated to clinical cases, (serogroup IVb), like CC1, has been reported as underrepresented when compared to other CCs normally detected in food like CC121 and CC9 (serogroup IIa) [3]. In our study we investigated growth ability of Lm strains at 24 h and 26 h in a set of clinical cases and one strain isolated in food, and we applied the use of Genome Wide Association (GWAS) to investigate the presence of genes that may explain normal growth traits compared to strains showing a slow growth phenotype.
Among tested strains of the same serogroup of Lm (IVb), it was possible to notice a slower growth ability in four strains, which did not reach the average growth of 8 CFUlog10/mL within 24 h compared to other strains. This is supported by other studies, reporting differences in growth ability of strains belonging to same serogroup [6]. Moreover, a recent study highlighted differences in replication capacity among strains within the same serogroup and demonstrated that external factors, such as the cellular stress and physiological state can influence strain recovery and detection [8]. This being said, a limitation of the study was determined by the low number of isolates tested. Unfortunately, a larger set of clinical isolates were not available for deeper investigation. It would be recommended to collect more strains to improve reliability of the results.
Genome investigation through GWAS analysis can be an important tool to associate loci in the genome to a phenotype [18] and aid to unveil genes behind specific phenotypes. Therefore, further investigation was conducted to associate phenotypically slow strain traits to relevant genes. To reduce the genomic heterogeneity between strains of Lm belonging to different serogroups and to make easier the detection of the association between genetic variants and phenotypic traits, the analysis was focused on strains of Lm CC1, which is one of the CCs that is less represented in clinical cases compared to food detection. Due to the limited number of tested strains, additional CC1 strains were selected from the literature, but a challenge of the present study was the difficulty of retrieving similar experiments. Therefore, we based the approach on the assumption of a normal growth rate for strains selected from the literature for which experiment settings were not comparable to ours, and, in the Scoary input file, growth values were marked as ‘Not Applicable’ due to the lack of experimental data. It could be interesting to confirm this hypothesis by testing more strains and including those used for the analysis as well.
It is important to emphasize that additional tests and a larger set of strains are required, as different outcomes of the analysis may emerge from further experimental results. In fact, GWAS is a powerful tool for this purpose; however, the number of strains to examine in order to have statistically significant results is a parameter to take into consideration when interpreting the results [28].
Pangenome analysis detected 3488 genes; however, the study highlighted only one gene as significantly more relevant. This gene is itself part of a group of genes that might explain the phenotype. The group of genes, reported in slow growth strains, were shared with other normal growth strains. Moreover, those genes were absent in one of the four slow growth strains. Further investigation was performed on nearby genes from the highlighted one by Scoary to evaluate differences within this group of genes between slow and normal growth strains. A cassette of genes was found to be shared between slow and normal growth strains and the pattern of gene similarity in the cassette showed differences between them.
Within the cassette of genes highlighted in the GWAS analysis and associated to an holin region of bacteriophage A118, two genes were noticeably reported as more relevant, namely, the lmo2280 and lmo2279 genes. Furthermore, mutations were also highlighted by the SNP analysis in slow growth strains compared to normal growth strains, suggesting the relevance of the importance of these genes in association to the slow growth phenotype and the need for further investigation. Bacteriophage A118 is a temperate phage isolated in Lm, and holin proteins form unspecific lesions within the host cytoplasmatic membrane [29] and mediate the lysis of the bacteria [30]. Further analysis and more tests, especially with a larger set of sequences, is necessary to confirm and to properly evaluate the involvement of this phage and its role in Lm growth. Unfortunately, the low number of sequences used for the study makes impossible to statistically validate the outcome of the study; therefore, more tests are needed for confirmation.
It is well established that genes associated with Lm phages can play a role in enhancing disinfectant resistance [31] or inhibiting Lm growth, Consequently, their application in the food industry as alternative methods of disinfection is under evaluation [32]. Hence, even if not statistically significant at the moment, more tests are advised to confirm these findings.
Finally, tBLASTn was fundamental in calculating the identity percentage of the genes between slow and normal growth strains and the comparison of the cassette between strains with different growth ability, which highlighted a reduced gene similarity between the examined strains, as in Figure 4. The same pattern was consistent even when it was analyzed in comparison with strains of Lm IVb, other than CC1 from different sources. BLASTn highlighted absence of gene lmo2280 and lmo2279 in most of the strains collected from bibliography (about 70%). Hence, despite the presence of these genes, not only in slow growth strains but in normal growth strains as well, a relevant difference can be noted within the distribution of these two genes between strains of Lm. Strains used for the experiments were collected from clinical cases, while the genomic sequences used for bioinformatic analysis were collected within the food industry environment, in particular poultry [11]. A better representation of the different sources should be considered using a wider range of strains and additional growth assays should be performed should be performed, considering also the role played by phage, their gene distribution among strains of Lm and how they can influence the microorganism phenotype [32].
It is important to highlight the limitations of the present study. First, the limited number of tested strains are not large enough to statistically support our founding. Second, increasing the number of the sequences tested can change the outcome of the GWAS results, given that the collection of genomic sequences from a publicly available repository can increase the number of sequences analyzed. However, the experimental settings, at least in our case, were not comparable; therefore, much information was unknown, and analysis was based on general assumption. From this perspective, more strains should be tested to increase confidence in the results and correctly make inferences.

5. Conclusions

In conclusion, despite the small number of sequences tested two main conclusions can be drawn. First, there is notable variability in growth capacity among Lm strains belonging to the same serogroup. Second, GWAS analysis was able to highlight differences in a gene cassette (group_2881, lmo2279, lmo2280, lmo2783, group_3146 and lmo2283), although no unique gene was found. In addition, tBLASTn results highlighted absence of lmo2280 and lmo2279 between strains from CC1 and other CCs, and SNP analysis confirmed mutations in these genes in two of the four strains phenotypically associated with slow growth.
Noteworthily, relevant genes lmo2280 and lmo2279, from GWAS analysis, were associated with the phage insertion region, which could play a role within Lm phenotype expression. However, further studies are necessary to confirm this hypothesis and clarify the functional impact of these phage-associated genes on bacterial physiology. In conclusion, this work highlights a different approach for genomic analysis in datasets with limited sample sizes. Despite the limited number of sequences tested, more information and genome strains can be used for analysis when taken from the literature. Moreover, our results reinforce the utility of GWAS as a powerful tool for exploring genotype–phenotype associations, even under constrained experimental conditions. Nevertheless, these findings must be interpreted with caution and validated through additional experimental work. In our case, further study should be done to understand gene distribution within a Lm population of which growth ability is known, and confirm if phage proteins could influence, positively or negatively, the microorganism growth.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms13092011/s1, Table S1: CSV_file strains.

Author Contributions

Conceptualization, P.F. and P.L.; methodology, P.F. and P.L.; formal analysis, M.E.D.A.; investigation, M.E.D.A. and G.A.R.; resources, C.B., V.B., C.D.I., S.D.Z., and P.F.; data curation, M.E.D.A.; writing—original draft preparation, M.E.D.A. and G.A.R.; writing—review and editing, M.E.D.A., G.A.R., C.C., M.A., S.B., M.F., M.T., F.M.A., A.G., N.N., P.M.K.N., P.L., F.P., C.B., V.B., C.D.I., S.D.Z., and P.F.; visualization, M.E.D.A., P.L., and F.P.; supervision, P.L. and F.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The sequencing data that support the findings will be available in National Center for Biotechnology Information (NCBI) BioProject under accession PRJNA1310828 following an embargo from the date of publication to allow for commercialization of research findings.

Acknowledgments

I would like to thank my DTU supervisor for the support and insight she provided to me for my personal growth in this field. I would like to thank my supervisors at IZSAM which allowed me to have this experience and supported me in my research. Finally, I would like to thank all the team at Research Group for Genomic Epidemiology, (DTU Food) and at IZSAM for their support, teachings, and suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LmListeria monocytogenes
RTEReady-to-eat food
WGSWhole Genome Sequencing
GWASGenome Wide Association Studies
C&DCleaning and disinfection
CCsClonal complexes
GENPATNational Reference Centre for whole genome sequencing of microbial pathogens: database and bioinformatic analysis
NCBINational Center for Biotechnology Information
MLSTMultilocus sequence typing
STSequence type
cgMLSTCore genome MLST
NANot applicable

References

  1. Dhama, K.; Karthik, K.; Tiwari, R.; Shabbir, M.Z.; Barbuddhe, S.; Malik, S.V.S.; Singh, R.K. Listeriosis in Animals, Its Public Health Significance (Food-Borne Zoonosis) and Advances in Diagnosis and Control: A Comprehensive Review. Vet. Q. 2015, 35, 211–235. [Google Scholar] [CrossRef]
  2. European Food Safety Authority; European Centre for Disease Prevention and Control. Prolonged multi-country outbreak of Listeria monocytogenes ST173 linked to consumption of fish products. EFSA Support. Publ. 2024, 21, 8885E. [Google Scholar] [CrossRef]
  3. Maury, M.M.; Tsai, Y.-H.; Charlier, C.; Touchon, M.; Chenal-Francisque, V.; Leclercq, A.; Criscuolo, A.; Gaultier, C.; Roussel, S.; Brisabois, A.; et al. Uncovering Listeria monocytogenes Hypervirulence by Harnessing Its Biodiversity. Nat. Genet. 2016, 48, 308–313. [Google Scholar] [CrossRef] [PubMed]
  4. ISO 11290-1:2017; Microbiology of the Food Chain—Horizontal Method for the Detection and Enumeration of Listeria monocytogenes and of Listeria spp.—Part 1: Detection Method. International Organization for Standardization: Geneva, Switzerland, 2017.
  5. ISO 11290-2:2017; Microbiology of the Food Chain—Horizontal Method for the Detection and Enumeration of Listeria monocytogenes and of Listeria spp.—Part 2: Enumeration Method. International Organization for Standardization: Geneva, Switzerland, 2017.
  6. Zilelidou, E.A.; Skandamis, P.N. Growth, Detection and Virulence of Listeria monocytogenes in the Presence of Other Microorganisms: Microbial Interactions from Species to Strain Level. Int. J. Food Microbiol. 2018, 277, 10–25. [Google Scholar] [CrossRef] [PubMed]
  7. Disson, O.; Moura, A.; Lecuit, M. Making Sense of the Biodiversity and Virulence of Listeria monocytogenes. Trends Microbiol. 2021, 29, 811–822. [Google Scholar] [CrossRef]
  8. Bannenberg, J.W.; Abee, T.; Zwietering, M.H.; Den Besten, H.M.W. Variability in Lag Duration of Listeria monocytogenes Strains in Half Fraser Enrichment Broth after Stress Affects the Detection Efficacy Using the ISO 11290-1 Method. Int. J. Food Microbiol. 2021, 337, 108914. [Google Scholar] [CrossRef]
  9. Allende, A.; Alvarez-Ordóñez, A.; Bolton, D.; Bover-Cid, S.; Chemaly, M.; Davies, R.; De Cesare, A.; Herman, L.; Hilbert, F.; Koutsoumanis, K.; et al. Whole Genome Sequencing and Metagenomics for Outbreak Investigation, Source Attribution and Risk Assessment of Food-Borne Microorganisms. EFSA J. 2018, 16, e00058. [Google Scholar] [CrossRef]
  10. Chiaverini, A.; Guidi, F.; Torresi, M.; Acciari, V.A.; Centorotola, G.; Cornacchia, A.; Centorame, P.; Marfoglia, C.; Blasi, G.; Di Domenico, M.; et al. Phylogenetic Analysis and Genome-Wide Association Study Applied to an Italian Listeria monocytogenes Outbreak. Front. Microbiol. 2021, 12, 750065. [Google Scholar] [CrossRef]
  11. Guidi, F.; Centorotola, G.; Chiaverini, A.; Iannetti, L.; Schirone, M.; Visciano, P.; Cornacchia, A.; Scattolini, S.; Pomilio, F.; D’Alterio, N.; et al. The Slaughterhouse as Hotspot of CC1 and CC6 Listeria monocytogenes Strains with Hypervirulent Profiles in an Integrated Poultry Chain of Italy. Microorganisms 2023, 11, 1543. [Google Scholar] [CrossRef]
  12. Portmann, A.-C.; Fournier, C.; Gimonet, J.; Ngom-Bru, C.; Barretto, C.; Baert, L. A Validation Approach of an End-to-End Whole Genome Sequencing Workflow for Source Tracking of Listeria monocytogenes and Salmonella enterica. Front. Microbiol. 2018, 9, 446. [Google Scholar] [CrossRef]
  13. Cito, F.; Di Pasquale, A.; Cammà, C.; Cito, P. The Italian Information System for the Collection and Analysis of Complete Genome Sequence of Pathogens Isolated from Animal, Food and Environment. Int. J. Infect. Dis. 2018, 73, 296–297. [Google Scholar] [CrossRef]
  14. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  15. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef]
  16. Timme, R.E.; Lafon, P.C.; Balkey, M.; Adams, J.K.; Wagner, D.; Carleton, H.; Strain, E.; Hoffmann, M.; Sabol, A.; Rand, H. Gen-FS Coordinated Proficiency Test Data for Genomic Foodborne Pathogen Surveillance, 2017 and 2018 Exercises. Sci. Data 2020, 7, 402. [Google Scholar] [CrossRef] [PubMed]
  17. Silva, M.; Machado, M.P.; Silva, D.N.; Rossi, M.; Moran-Gilad, J.; Santos, S.; Ramirez, M.; Carriço, J.A. chewBBACA: A Complete Suite for Gene-by-Gene Schema Creation and Strain Identification. Microb. Genom. 2018, 4, e000166. [Google Scholar] [CrossRef]
  18. Seemann, T. Prokka: Rapid Prokaryotic Genome Annotation. Bioinformatics 2014, 30, 2068–2069. [Google Scholar] [CrossRef] [PubMed]
  19. Tonkin-Hill, G.; MacAlasdair, N.; Ruis, C.; Weimann, A.; Horesh, G.; Lees, J.A.; Gladstone, R.A.; Lo, S.; Beaudoin, C.; Floto, R.A.; et al. Producing Polished Prokaryotic Pangenomes with the Panaroo Pipeline. Genome Biol. 2020, 21, 180. [Google Scholar] [CrossRef] [PubMed]
  20. Brynildsrud, O.; Bohlin, J.; Scheffer, L.; Eldholm, V. Rapid Scoring of Genes in Microbial Pan-Genome-Wide Association Studies with Scoary. Genome Biol. 2016, 17, 238. [Google Scholar] [CrossRef]
  21. Letunic, I.; Bork, P. Interactive Tree Of Life (iTOL) v5: An Online Tool for Phylogenetic Tree Display and Annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef]
  22. Seemann, T. Snippy: Fast Bacterial Variant Calling from NGS Reads. GitHub. 2015. Available online: https://github.com/tseemann/snippy (accessed on 9 August 2025).
  23. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve Years of SAMtools and BCFtools. GigaScience 2021, 10, giab008. [Google Scholar] [CrossRef]
  24. Kaas, R.S.; Leekitcharoenphon, P.; Aarestrup, F.M.; Lund, O. Solving the Problem of Comparing Whole Bacterial Genomes across Different Sequencing Platforms. PLoS ONE 2014, 9, e104984. [Google Scholar] [CrossRef] [PubMed]
  25. Gilchrist, C.L.M.; Chooi, Y.-H. Clinker & Clustermap.Js: Automatic Generation of Gene Cluster Comparison Figures. Bioinformatics 2021, 37, 2473–2475. [Google Scholar] [CrossRef]
  26. BLAST® Command Line Applications User Manual [Internet]. National Center for Biotechnology Information (US): Bethesda, MD, USA, 2008. Available online: https://www.ncbi.nlm.nih.gov/books/NBK279690/ (accessed on 9 August 2025).
  27. Denes, T.; Vongkamjan, K.; Ackermann, H.-W.; Moreno Switt, A.I.; Wiedmann, M.; den Bakker, H.C. Comparative Genomic and Morphological Analyses of Listeria Phages Isolated from Farm Environments. Appl. Environ. Microbiol. 2014, 80, 4616–4625. [Google Scholar] [CrossRef] [PubMed]
  28. Read, T.D.; Massey, R.C. Characterizing the Genetic Basis of Bacterial Phenotypes Using Genome-Wide Association Studies: A New Direction for Bacteriology. Genome Med. 2014, 6, 109. [Google Scholar] [CrossRef] [PubMed]
  29. Loessner, M.J.; Inman, R.B.; Lauer, P.; Calendar, R. Complete Nucleotide Sequence, Molecular Analysis and Genome Structure of Bacteriophage A118 of Listeria monocytogenes: Implications for Phage Evolution. Mol. Microbiol. 2000, 35, 324–340. [Google Scholar] [CrossRef]
  30. Pasechnek, A.; Rabinovich, L.; Stadnyuk, O.; Azulay, G.; Mioduser, J.; Argov, T.; Borovok, I.; Sigal, N.; Herskovits, A.A. Active Lysogeny in Listeria monocytogenes Is a Bacteria-Phage Adaptive Response in the Mammalian Environment. Cell Rep. 2020, 32, 107956. [Google Scholar] [CrossRef]
  31. Karlsmose, A.K.; Ivanova, M.; Kragh, M.L.; Kjeldgaard, J.S.; Otani, S.; Svendsen, C.A.; Papić, B.; Zdovc, I.; Tasara, T.; Stephan, R.; et al. A Novel Metagenomic Approach Uncovers Phage Genes as Markers for Increased Disinfectant Tolerance in Mixed Listeria monocytogenes Communities. Infect. Genet. Evol. 2024, 119, 105582. [Google Scholar] [CrossRef]
  32. Kawacka, I.; Olejnik-Schmidt, A.; Schmidt, M.; Sip, A. Effectiveness of Phage-Based Inhibition of Listeria monocytogenes in Food Products and Food Processing Environments. Microorganisms 2020, 8, 1764. [Google Scholar] [CrossRef]
Figure 1. Results of Lm growth rate after inoculation of half Fraser broth (T0, first bar plot) and after 24 and 26 h of incubation, respectively t24 (second bar plot) and t26 (third bar plot). All strains tested (n = 41) are reported on the x-axis, and the y-axis shows the concentration of Lm at different times. Lines in red delimit a growth threshold of 8.0 CFUlog10/mL.
Figure 1. Results of Lm growth rate after inoculation of half Fraser broth (T0, first bar plot) and after 24 and 26 h of incubation, respectively t24 (second bar plot) and t26 (third bar plot). All strains tested (n = 41) are reported on the x-axis, and the y-axis shows the concentration of Lm at different times. Lines in red delimit a growth threshold of 8.0 CFUlog10/mL.
Microorganisms 13 02011 g001
Figure 2. SNP based phylogenetic tree of Lm CC1 used in the present study. Red labelled strains are phenotypically slow growth.
Figure 2. SNP based phylogenetic tree of Lm CC1 used in the present study. Red labelled strains are phenotypically slow growth.
Microorganisms 13 02011 g002
Figure 3. Visualization of strains showing presence of relevant gene highlighted by Scoary group_2881 (in light blue strip). In red were highlighted strains showing phenotypically slow growth compared to other strains consider to be normal in growth.
Figure 3. Visualization of strains showing presence of relevant gene highlighted by Scoary group_2881 (in light blue strip). In red were highlighted strains showing phenotypically slow growth compared to other strains consider to be normal in growth.
Microorganisms 13 02011 g003
Figure 4. Gene alignment visualization for the gene group_2881, highlighted as relevant in Scoary analysis and gene similarity. Top four strains are those reported as slow growth.
Figure 4. Gene alignment visualization for the gene group_2881, highlighted as relevant in Scoary analysis and gene similarity. Top four strains are those reported as slow growth.
Microorganisms 13 02011 g004
Figure 5. Heatmap of tBLASTn gene similarity. Top four strains are those reported as slow growth.
Figure 5. Heatmap of tBLASTn gene similarity. Top four strains are those reported as slow growth.
Microorganisms 13 02011 g005
Figure 6. SNP analysis visualization of relevant mutation between strains considered slow growth (in red) compared to other strains.
Figure 6. SNP analysis visualization of relevant mutation between strains considered slow growth (in red) compared to other strains.
Microorganisms 13 02011 g006
Figure 7. Heatmap of tBLASTn gene similarity of 251 strains. Top four strains are those reported as slow growth.
Figure 7. Heatmap of tBLASTn gene similarity of 251 strains. Top four strains are those reported as slow growth.
Microorganisms 13 02011 g007
Table 1. Percentage of genes present out of 251 strains analyzed based on similarity percentage and thresholds of 50%, 70% and 90% used for analysis in tBLASTn.
Table 1. Percentage of genes present out of 251 strains analyzed based on similarity percentage and thresholds of 50%, 70% and 90% used for analysis in tBLASTn.
GeneThreshold (%)n. AbsentAbsent %n. Similarity 50–90%Similarity 50–90 %n. Similarity > 90%Similarity > 90%
Group_288150.00%0NA0NA251100%
Group_288170.00%0NA0NA251100%
Group_288190.00%0NA0NA251100%
lmo227950.00%17369%135%6526%
lmo227970.00%17369%135%6526%
lmo227990.00%18473%006727%
lmo228050.00%17369%135%6526%
lmo228070.00%17369%135%6526%
lmo228090.00%18674%006526%
group_278350.00%7731%11345%6124%
group_278370.00%7731%11345%6124%
group_278390.00%19076%006124%
group_314650.00%14759%3414%7028%
group_314670.00%14759%3414%7028%
group_314690.00%18072%007128%
lmo228350.00%7731%16867%62%
lmo228370.00%13855%10743%62%
lmo228390.00%24096%00114%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

De Angelis, M.E.; Robbe, G.A.; Cammà, C.; Ancora, M.; Bosica, S.; Ferrara, M.; Torresi, M.; Aarestrup, F.M.; Gmeiner, A.; Nuanmuang, N.; et al. GWAS Study Applied to Phenotypically Slow Growth Strains of Listeria monocytogenes Workflow and Outcome. Microorganisms 2025, 13, 2011. https://doi.org/10.3390/microorganisms13092011

AMA Style

De Angelis ME, Robbe GA, Cammà C, Ancora M, Bosica S, Ferrara M, Torresi M, Aarestrup FM, Gmeiner A, Nuanmuang N, et al. GWAS Study Applied to Phenotypically Slow Growth Strains of Listeria monocytogenes Workflow and Outcome. Microorganisms. 2025; 13(9):2011. https://doi.org/10.3390/microorganisms13092011

Chicago/Turabian Style

De Angelis, Maria Elisabetta, Giovanna Alessia Robbe, Cesare Cammà, Massimo Ancora, Serena Bosica, Mattia Ferrara, Marina Torresi, Frank M. Aarestrup, Alexander Gmeiner, Narong Nuanmuang, and et al. 2025. "GWAS Study Applied to Phenotypically Slow Growth Strains of Listeria monocytogenes Workflow and Outcome" Microorganisms 13, no. 9: 2011. https://doi.org/10.3390/microorganisms13092011

APA Style

De Angelis, M. E., Robbe, G. A., Cammà, C., Ancora, M., Bosica, S., Ferrara, M., Torresi, M., Aarestrup, F. M., Gmeiner, A., Nuanmuang, N., Njage, P. M. K., Bravaccini, C., Belardo, V., Di Iorio, C., Di Zacomo, S., Fazii, P., Pomilio, F., & Leekitcharoenphon, P. (2025). GWAS Study Applied to Phenotypically Slow Growth Strains of Listeria monocytogenes Workflow and Outcome. Microorganisms, 13(9), 2011. https://doi.org/10.3390/microorganisms13092011

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop