Molecular Genomic Analyses of Enterococcus cecorum from Sepsis Outbreaks in Broilers

Extensive genomic analyses of Enterococcus cecorum isolates from sepsis outbreaks in broilers suggest a polyphyletic origin, likely arising from core genome mutations rather than gene acquisition. This species is a normal intestinal flora of avian species with particular isolates associated with osteomyelitis. More recently, this species has been associated with sepsis outbreaks affecting broilers during the first 3 weeks post-hatch. Understanding the genetic and management basis of this new phenotype is critical for developing strategies to mitigate this emerging problem. Phylogenomic analyses of 227 genomes suggest that sepsis isolates are polyphyletic and closely related to both commensal and osteomyelitis isolate genomes. Pangenome analyses detect no gene acquisitions that distinguish all the sepsis isolates. Core genome single nucleotide polymorphism analyses have identified a number of mutations, affecting the protein-coding sequences, that are enriched in sepsis isolates. The analysis of the protein substitutions supports the mutational origins of sepsis isolates.


Introduction
Enterococci are Gram-positive firmicutes comprising commensals and pathogens in a wide range of vertebrates and plants [1,2], and are prevalent in animal gastrointestinal tracts [1][2][3][4], nature [1,5], and even food processing systems [6].Some species are found in diverse vertebrates, such as Enterococcus faecalis, Enterococcus faecium, and Enterococcus avium, which have been isolated from both mammals and aves [1,2].In humans, these species have been associated with infections of the urinary tract, bloodstream, heart, and central nervous system [7,8].Some Enterococcal species appear to be restricted to particular host species or classes [2].Enterococcus cecorum is primarily associated with avian species and has been isolated from the intestines or cloaca of galliforms and neoornithes [1,2,9].Comparative genomics based on the Enterococcus core genome places E. cecorum most closely related to Enterococcus columbae (host birds) within a clade associated with plant, aquatic, and bird hosts [2].E. cecorum typically colonizes the intestines of chickens at around 3 weeks of age [10][11][12].Pathogenic strains have been associated with osteomyelitis affecting the flexible thoracic vertebrae (kinky back) and the proximal femoral (femoral head necrosis) or tibial (tibial head necrosis) growth plates.Borst et al. [13] identified a cluster of twelve genes that appear to distinguish osteomyelitis isolates from commensals, likely to be involved in virulence via capsular modifications.The evolution of pathogenesis and drug resistance in E. faecalis and E. faecium has been associated with mobile elements and the acquisition of capsular gene clusters [14,15].Phylogenomic analysis of more than 100 genomes of E. cecorum isolates from French broilers suggested that a specific lineage of pathogenic isolates is infecting broilers in Europe and the United States [16].The recent occurrence of sepsis outbreaks in broilers caused by E. cecorum is characterized by onset during the first few weeks post-hatch, prior to the normal colonization of the intestinal tract [17].However, there are older reports of sepsis in broilers caused by E. cecorum, with losses of 5-8%, characterized by pericarditis, bacteremia, and osteomyelitis [18].The mechanisms of the horizontal or vertical spread of E. cecorum in broiler flocks are not known, although particular facilities or flocks may experience repetitive outbreaks [17,18].Recent outbreaks may be more problematic with the removal of early administration of antibiotic growth promoters in flocks [17,19,20].
The questions we wished to address included whether specific mutations or gene acquisition are driving recent E. cecorum septicemia outbreaks in broilers, whether sepsis and osteomyelitis isolates are distinct lineages, and whether high-resolution bacterial genomics can inform responses to sepsis outbreaks.To answer these questions, we assembled genomes from recent isolates from corporate surveys during sepsis outbreaks.These assemblies and all publicly available genomes were then analyzed to determine the whole-genome relatedness of pathogenic and non-pathogenic isolates.The assemblies were compared to determine whether specific genes or gene clusters were essential or highly associated with disease traits (e.g., pathogen vs. commensal, or osteomyelitis vs. sepsis).Similarly, we analyzed whether specific mutations in the core genes are associated with disease traits.These findings are critical for guiding the development of new treatments, vaccines, and management strategies for mitigating sepsis outbreaks.

New Genome Assemblies
Cultures were grown in tryptic soy broth (Difco) +5% chicken serum (Gibco Ther-moFisher, Waltham, MA, USA) in 5% CO 2 , and archived in 40% glycerol at −80 • C. DNA was isolated from 20 mL of overnight stationary-phase broth cultures by treatment with lysozyme, followed by standard SDS-lysis, organic extraction, RNAse/protease digestion, and ethanol precipitation [21,22].The final purification was performed using NanoSep 100 k Omega spin cups according to the manufacturer's instructions (Pall Corporation, Port Washington, NY, USA).Purified total bacterial DNAs were submitted to SeqCenter (Pittsburgh, PA, USA) for 2 × 151 paired-end sequencing on an Illumina NextSeq 2000 (Illumina Corp., San Diego, CA, USA).Paired-end 2 × 251 sequence data for 22 additional isolates were obtained from the laboratory of Luke Borst (North Carolina State University).Sequence data were uploaded to BV-BRC [23] for processing through the Trim Galore pipeline and Unicycler genome assembly.
Whole-genome sequence reads and assemblies were submitted to NCBI under BioProject PRJNA1050746, which included BioSamples SAMN38750234 through SAMN38750267 and accessions JAXOGD000000000 through JAXOHK000000000.

Genome Assemblies from Sepsis Surveys
Enterococcus cecorum isolates from commercial operation samplings were obtained from poultry diagnostic laboratories representing samplings from 2020 to 2021 (Table 1).These included environmental samples as well as necropsy samples, including internal organs.We also included two bacterial chondronecrosis with osteomyelitis (BCO) isolates from our own sampling of commercial broiler farms [22] and our poultry research farm (UAPRF).The total DNA was purified from each isolate and submitted for Illumina 2 × 151 sequencing.The reads were trimmed and assembled as described in the Materials and Methods section.The assemblies (Table 1) ranged from 32 to 105 contigs with an average of 62 contigs.The total assembled base pairs ranged from 2.15 to 2.49 Mbp with an average of 2.27.The N50 values ranged from 49.7 to 209.1 kbp with an average of 108.9.
Table 1.Details of the Enterococcus cecorum strain source, date, location, and bird age.For each strain, the genome assembly in the number of assembled contigs, total assembly in base pairs, and NCBI BioSample are presented.For each DNA, the BLASTp results (average for percent identity × query coverage for protein genes and product genes 312 to 323) were used for the Borst virulence gene cluster.PopPUNK cluster shows the groupings based on phylogenomic comparisons.Not available: n/a.

Considerations Regarding Disease Trait Classifications
A major issue in any of these analyses is knowing the phenotypic traits of each isolate regarding virulence/pathogenicity.The BioSample data from NCBI may not clearly identify the host disease state or the actual anatomical source of the culture.For example, if an isolate was obtained from the bone marrow, should it be classified as a case of sepsis or BCO?If an isolate was obtained from hatchery egg waste from an infected breeder flock, the virulence potential of that isolate is unknown.Therefore, we made every effort to be strict in our interpretation (see legend to Figure 1) as to whether an isolate was from BCO (osteomyelitis or bone marrow) or from sepsis (peritoneum, heart, liver, and blood).For the purposes of our analyses, we reserved the sepsis isolates as those isolated from an internal organ or compartment, whereas the BCO isolates were from bone marrow or infected joints.We also defined a chicken disease group (145 genomes) as BCO isolates plus the strictly defined sepsis isolates, which excludes isolates from the air sac, egg waste, environmental swabs, intestines, cecum, and unspecified clinical samples.All genomes not in the chicken disease group were assigned to the non-phenotypic group.Strain assignments to the phenotypic groups are listed in Table S1.

Phylogenomic Analysis of Sepsis and Osteomyelitis Isolates
These 34 new assemblies along with 193 E. cecorum genomes that had been deposited in NCBI were used to generate phylogenomic trees to discern the relationships and possible origins of the BCO and sepsis isolates.Phylogenomic trees were generated for all 227 genomes using either kmer genome comparisons using PopPUNK (Figure 1) or shared core genome SNPs using ParSNP (Figure S1).Although the branch lengths were different for the two methods, the overall topology was similar, and clustering was very similar.Regardless of using a strict or relaxed definition of BCO or sepsis the phylogenomic analyses all support a polyphyletic origin for both the BCO and sepsis isolates.Red circles for BCO appear in all branches, and the green triangles/circles for chicken sepsis appear in multiple locations.There was one large cluster (A) of US sepsis isolates that was related to a BCO isolate we obtained from tibial pus from a broiler in 2016 (1415).Within this cluster were two vertebral osteomyelitis isolates (LB5924 and LB5925).Cluster B contained LB5800, a sepsis isolate from the heart, four closely related isolates from egg transfer residues (LB5857, LB 5916, and LB5876), an isolate from a case of air-saacculitis (LB5880), two distantly related isolates, a BCO femoral head necrosis isolate (1675), and an intestinal isolate (LB5923).Cluster C contained one sepsis isolate (1750) and two vertebral osteomyelitis isolates (LB5836 and LB5843).Cluster D contained no evidence of BCO or sepsis isolates as it consisted of isolates from cull eggs, or egg transfer residue, along with one isolate (LB5922) with an incomplete documentation of isolation source.The tree contained 100 genomes from isolates collected from 16 poultry farms in France between 2007 and 2017.Some of these genomes were from probable sepsis cases (red circles), with many representing pericarditis isolates [16].These isolates appeared throughout the tree, with clusters including BCO and probable sepsis virulence phenotypes.There were also six clinical isolates from humans (green plus nodes, Figures 1 and S1) that were distributed among the chicken isolates.
LB5800, a sepsis isolate from the heart, four closely related isolates from egg transfer residues (LB5857, LB 5916, and LB5876), an isolate from a case of air-saacculitis (LB5880), two distantly related isolates, a BCO femoral head necrosis isolate (1675), and an intestinal isolate (LB5923).Cluster C contained one sepsis isolate (1750) and two vertebral osteomyelitis isolates (LB5836 and LB5843).Cluster D contained no evidence of BCO or sepsis isolates as it consisted of isolates from cull eggs, or egg transfer residue, along with one isolate (LB5922) with an incomplete documentation of isolation source.The tree contained 100 genomes from isolates collected from 16 poultry farms in France between 2007 and 2017.Some of these genomes were from probable sepsis cases (red circles), with many representing pericarditis isolates [16].These isolates appeared throughout the tree, with clusters including BCO and probable sepsis virulence phenotypes.There were also six clinical isolates from humans (green plus nodes, Figures 1 and S1) that were distributed among the chicken isolates.Of note, the phylogenomic analyses also included cloacal isolates from Harris Hawk (ESM13C1 and ESM13AIC1), black vulture (E1062, ESM877AIC1), pig feces (WCA-380), and internal organs from ducks (D-104) and goose (G-29).The Harris Hawk isolates appeared to be the most distant from the other 225 isolates.The goose G-29 isolate was located within the branches leading to cluster D. The pig and one of the black vulture isolates cluster with a solitary USA sepsis isolate, 1752.The duck internal organ/sepsis isolate D-104, isolated in Poland, was placed in a cluster with French sepsis and BCO isolates.
We interpret all of the phylogenomic data as most consistent with pathogenic isolates arising from non-pathogenic isolates and that sepsis isolates are closely related to BCO isolates.Our conclusions contrast with those from previous analyses of smaller collections of genomes that suggested distinct lineages for commensal and pathogenic isolates of E. cecorum [13,16,37].

Is Gene Acquisition Central to Sepsis or Disease Traits?
If osteomyelitis and/or sepsis are polyphyletic then this could be driven by gene acquisition and/or by mutation.Based on a comparison of the genomes of three BCO isolates with those of three cecal isolates, Borst et al. [13] proposed that a cluster of 12 genes possibly related to capsular modification distinguished the BCO isolates from the cecal isolates.Using the SA1 assembly of Borst as our reference genome, the Prokka annotation places the 12 gene cluster as protein-encoded genes (PEG) 312 through 323.We used BLASTp to search the Prokka-predicted polypeptides from all 227 E. cecorum genomes for these 12 polypeptides.BLASTp results were tabulated by combining values for percent identity and query coverage (pid × qcov/100) and averaged across all genomes according to the phenotypic group: none, chicken disease, BCO, strict sepsis (summarized in Table 2 with complete results in Table S2).The results show that the cluster of capsular genes is conserved in approximately 70-80% of the isolates classified as chicken disease, and for those classified as BCO, the conservation may be higher in the strict sepsis group, but the number of those isolates is significantly lower (34 vs. 113).Additionally, the presence of these capsular genes in the isolates within the none group (no disease or disease association not known) is around 30 to 40% of the isolates.

Table 2.
BLASTp scores for protein-encoded gene products from the Borst gene cluster, based on strain categorization by phenotype rather than chicken disease (none), chicken disease (CD), bacterial chondronecrosis with osteomyelitis (BCO), or strict sepsis (SS).See the results for definitions of phenotype categorization.For each protein-encoded gene, the value is the average for that phenotype of the BLASTp scores for (percent identity × query coverage)/100.For each phenotype, the count is the number of strains in that phenotypic category.The average ± SEM was computed for all 12 polypeptides.Individual gene scores for each isolate are provided in Table S2.Unicycler assemblies of our 34 assemblies were surveyed for circular elements.This identified three potential plasmids in isolate 1754.We used BLASTn searches of all 227 genomes to determine how widespread these contigs are as a signal for horizontal genetic exchange.Contig 25 (6237 bp) from 1745 appears to be more than 75% conserved in 24 assemblies: 1415, 1754, An144, ARS48, ARS60, ARS65, CHD182EN, CHD270EN, CHD278EN, CHD341EN, CHD347EN, CHD83EN, CHD96EN, CIRMBP-1218, CIRMBP-1233, CIRMBP-1259, CIRMBP-1276, CIRMBP-1282, CIRMBP-1284, CIRMBP-1285, CIRMBP-1294, PS10, PS2, and SA2.Contig 27 (4525 bp) is more than 75% conserved in 10 assemblies: 1415, 1754, An144, ARS60, CHD83EN, CHD96EN, CIRMBP-1234, CIRMBP-1238, CIRMBP-1273, and PS2.Contig 31 (2418 bp) is more than 75% conserved in only the 1754 genome; the closest related contig is in An144 and only 71% is conserved.The isolates containing homologous contigs are scattered around the phylogenomic tree (Figure 1) but are sometimes found in highly similar genomes, indicative of both horizontal transfer and shared origins.Only some genomes containing contig 25 homologs contain contig 27 homologs (and vice versa).Analysis of contig 25 using Prokka annotation and BLASTp of NCBI NonRedundant indicates six predicted CDS that annotate as four hypothetical proteins, a replication RepB initiation protein, and a plasmid recombination protein.Similarly, for contig 27, there are four annotated CDS: a hypothetical protein, a plasmid recombination protein, a CopG transcriptional regulator, and a replication RepB protein.Contig 31 annotates as three CDS: a hypothetical protein, a transcriptional regulator, and a replication RebB protein.Therefore, mobile plasmids appear to exist in E. cecorum but do not seem to be widespread or to be drivers of the emergence of sepsis.However, this is only based on the identification of circular elements from the Unicycler assemblies in the 34 genomes we have generated.

Phentotype
Our Prokka-Roary analysis of 227 genomes suggests that the core + pan-genome of this species contains 14,728 genes with 878 genes in the strict-core genome (99% ≤ strains ≤ 100%), 335 genes in the soft-core genome (95% ≤ strains < 99%), 1689 genes in the shell genome (15% ≤ strains < 95%), and 11,826 cloud genes (strains < 15%).Prior analysis of 140 genomes suggested a pangenome of 8523 gene clusters with 1207 genes in the strict-core genome, 4664 genes in the accessory genome, and a unique genome of 2652 [16].Thus, our results appear to expand the pangenome by over 6000 genes with similar gene content for the sum of the strict-core and soft-core to the previous numbers for a strict-core genome.
Scoary analyses of the pan-genome with respect to phenotypic traits (Table S2) were employed to determine whether any particular loci or gene clusters had high specificity for either chicken disease isolates or for strict sepsis.We also analyzed the chicken disease trait, which compared BCO isolates, along with the sepsis isolates, to the isolates for which there is no known disease phenotype.Since many of the isolates for which there is no known disease phenotype may actually include pathogenic isolates, we were skeptical of any meaningful results for this particular comparison.The data for loci positively or negatively associated with Chicken Disease identified 195 genes with corrected p-values < 0.05 (Table S3).Of the 159 loci positively associated (Odds ratio ≥ 5) with chicken disease trait, 63 were annotated as something other than a hypothetical protein.Of the 20 loci negatively associated (Odds ratio ≤ 0.2), only 6 were annotated as something other than a hypothetical protein.For these 159 loci, we identified 12 clusters of two or more loci based on their position in the genomes of either of the three BCO isolates.Seven of the locus clusters were present and clustered in all three isolated genomes examined (Table S3).Functions associated with these clusters include glycosylation loci, including the capsular genes identified by Borst [13], carbohydrate/sugar transferase systems, probable transposable elements, and vitamin/biotin transferase systems.When we analyzed all the genomes for loci associated with the strict sepsis group, the data identified 101 loci with 97 positively associated and 4 negatively associated (Table S4) However, of these, loci 94 were apparently identified because they are primarily found in in the genomes present in cluster A in Figure 1.To reduce the bias of Cluster A, we reduced the representation from that cluster by excluding some of the Cluster A isolates to generate the trait group Sepsis Strict Reduced (Table S1).The Scoary output using Sepsis Strict Reduced indicated that there were no loci with Bonferonni corrected p-values < 0.05 (Table S5).Inspection of these data identifies 24 loci with sensitivity ≥ 50 (a measure of the frequency of presence in isolates with the trait), but only 10 of those loci had a specificity score ≥ 50 (a measure of how specific the gene is in isolates with the trait).Of those 10 loci with higher specificity, five were annotated as hypothetical proteins and another as transposase.The other four loci encode functions in vitamin B12 import, ascorbate uptake, hexulose conversion, and transcriptional regulation of glucosamine utilization.We interpret these Roary-Scoary results to indicate that there is no evidence of specific gene acquisitions that are shared by the majority of sepsis isolates.Therefore, either there are no specific gene acquisitions that can convert a BCO pathogen into a sepsis pathogen, or individual clades have specific independent gene sets that must be acquired for transition to a sepsis pathogen based on their particular pan-genome composition.
To assess clade-specific gene acquisition, we used Scoary to analyze gene presence/ absence for sepsis and BCO isolates in cluster 1 comprising French isolates (trait CIRMBP Cluster 1 in Table S1).We compared four sepsis isolates to 22 BCO isolates within this clade.The Scoary results (Table S6) identified only seven genes, with six annotated as hypothetical proteins and one as a transposase.None of these loci had significant corrected p-values for association but showed high specificity scores.Five of the loci were clustered in at least two of the sepsis isolate genomes and Phaster results showed that this region is likely a prophage (contig 1 from 109,451-157,474).Thus, our analyses failed to reveal compelling evidence for the gene acquisition driving the emergence of sepsis.

Are Core Genome Mutations Central to Sepsis or Disease Traits?
For the transition from BCO pathogen to sepsis pathogen, an alternative to gene acquisition through horizontal transfer is that the transition involves the mutation of resident genes.Evidence from sequence analysis of ancient Marek's Disease Virus strongly supports that this virus has become more pathogenic, not by gene acquisition or gene rearrangement but through point mutations [38].The emergence of sepsis through point mutations would also be consistent with the polyphyletic origins of sepsis isolates (Figure 1).Existing pathogens or commensals could experience a few key mutations that would allow them to survive in the blood and colonize organs.
We first used ParSNP to identify and determine the core genome SNPs that were differentially represented in BCO vs. sepsis isolates.The ParSNP analysis was limited to genomes known to be from cases of chicken disease (scored as 1 in chicken disease; Table S1).For this analysis, we employed the same reduced representation from Cluster A (excluding NA isolates as in Sepsis Strict Reduced; Table S1).We compared 17 sepsis isolates to 113 BCO isolates using the finished/complete genome BCO isolate SA1 as the reference.Since the reference genome was a BCO isolate, only SNPs over-represented in sepsis should represent mutations favoring sepsis.The core genome SNPs (n = 78,150) were filtered for those present in >94% of Sepsis Strict Reduced isolates (n = 967) and then for frequencies ≥0.30 in the sepsis isolates relative to the BCO isolates.This identified 34 SNPs: 32 bi-allelic and 2 tri-allelic (Table S7).These 34 potentially diagnostic SNPs occurred in only six genes.Eight SNPs were in the first or second codon base positions and most likely to be missense, whereas 26 were in the wobble position and were more likely to be silent.Interestingly, four of the genes (Smc, YaaA, hypothetical protein, and AldC) appeared to be clustered, as they represented annotated PEG 229, 230, 231, and 234.
We analyzed the same reduced representation of chicken disease genomes using Maast as an alternative to ParSNP because the two programs use different alignment algorithms to genotype SNPs and are known to produce different results [35].The reference genome was again the BCO isolate SA1.The maximum SNP genotype frequencies for Sepsis genomes from Maast were 0.54 (Table S8), which was considerably lower than the 1.00 SNP frequencies identified by ParSNP (Table S7).The Maast genotype data were filtered for SNPs > 50% in Sepsis (n = 252) and frequency difference (sepsis-BCO) ≥ 0.20, resulting in a total of 77 SNPs (74 biallelic and 3 triallelic), 7 intergenic, and 70 in coding sequences for 51 protein-encoding genes (Table S8).Fifteen of the genes that flanked or spanned the SNPs were annotated as hypothetical proteins, but none of these hypothetical genes was PEG 231 identified in the ParSNP analyses.Of the four clustered genes from the ParSNP data, only the Smc gene was also identified in the Maast analyses.Both the ParSNP and Maast identified SNPs in the PTS system fructose-specific EIIABC component.The potential clusters of SNPs identified in the Maast analyses included those affecting PEG 346 to 373, 570 to 577, 629 to 637, and 744 to 753.There was also the potential for clustering genes affected by seven SNPs affecting four genes from PEG 594 to 603.
We extracted and aligned the encoded polypeptides for the six genes from ParSNP and 12 additional genes from Maast to identify all the variant polypeptide positions (Table S9).The analyses included predicted polypeptides from all chicken disease isolates and included those from Cluster A that had been excluded from the initial ParSNP and Maast analyses.Each variant position was scored for frequency in the Sepsis Strict group of isolates relative to the BCO isolates.This identified 114 polypeptide variants where the Sepsis percentage was greater than 20 points higher than the percentage in BCO (green highlight in Table S9).The highest frequency protein variations were the R555Q and A590V substitutions in the PTS system fructose-specific EIIABC, which were found in 75% of sepsis isolates but only in 33 and 29%, respectively, of BCO isolates (Table S9).All 114 polypeptide variants were also scored for frequency in the Sepsis Strict Reduced (to reduce the influence of the higher number of sepsis isolates from Cluster A).Of the 114 protein variants, 15 were ≥50% in Sepsis Strict Reduced in seven genes (highlighted in yellow in Table S9).These 15 SNPs are summarized in Table 3 and are candidates for key mutations driving the adaptation from BCO to sepsis.The annotation of these seven genes and the literature support their possible roles in the virulence of E. cecorum.The hypothetical protein PEG231 was identified in the ParSNP analysis (Table S7).BLASTp analysis at https://www.uniport.org(accessed December 2023) identifies this as a serine aminopeptidase S33 domain-containing protein.Hip1 in Mycobacterium tuberculosis is a S33 serine aminopeptidase that inhibits macrophage and dendritic cell functions [39].PTS (phosphotransferase transport system) fructose-specific EIIABC has been found to regulate virulence expresssion and stress response in Lysteria monocytogenes [40], and biofilm and endocarditis in Enterococcus faecium [41].CBCL1 orthologs in Pseudomonas aeruginosa are involved in the production of quorum-sensing signals [42].BLASTp at uniprot.org identifies ElaA as a GNAT family N-acetyltransferase, which is a family of proteins involved in bacterial adaptation to diverse habitats [43].EttA regulates protein synthesis in energy-depleted cells and is critical for bacterial survival during the long-term stationary phase [44].RpoN encodes a sigma factor, σ 54 , for directing RNA polymerase to alternative promoters.Orthologs have been shown to regulate virulence determinants, including pili, flagella, and type III secretion systems (reviewed in [45]).YheH is known to function in the signaling pathway for sporulation in Bacillus subtilis [46].Therefore, the genes identified have to do with environmental response and gene regulation.Further, there are at least an additional 39 genes from the Maast analysis that we have yet to assess whether they have coding variations differentially present in sepsis vs. BCO isolate genomes (Table S8).There are also intergenic region SNPs that could affect the regulation of flanking genes.

Conclusions
Our analyses of all publicly available genomes of E. cecorum support a polyphyletic origin of pathogenicity in chickens as well as the recent outbreaks of early on-set sepsis in broiler flocks.This contrasts with previous works suggesting a common lineage in chicken pathogenesis [16].Our analysis of the pangenome of E. cecorum identified no gene or gene clusters essential for the switch from chicken commensal to chicken pathogen or for chicken sepsis isolates.The cluster of 12 capsule-related genes identified by Borst et al. is highly prevalent in osteomyelitis and sepsis isolates, but it is not essential for the pathogenesis nor diagnosis of pathogenic isolates (Table 2).Analysis of core genome SNPs identified missense mutations in a subset of genes associated with osteomyelitis, which are further enriched in sepsis isolates.These genes may be involved in the bacterial response to stress, stationary phase survival, and cell-mediated immunity of the host.However, further research is warranted to confirm and expand these initial findings.
Future work should focus on obtaining additional isolates from sepsis outbreaks to determine how much pathogen genomic diversity is present in each outbreak and to assess whether the same genotypes are present in successive outbreaks in a facility.Previous works have failed to identify vertical transmission of E. cecorum using standard culture methods [10,18].More sensitive DNA-based methods need to be applied to breeder hens, hatcheries, embryos, and broiler houses [47].The virulence testing of the isolates is also problematic.Some have relied on chicken embryo lethality assays for the evaluation of E.cecorum isolates [17,48,49], but the direct relevance to pathogenicity post-hatch has been a concern [50].Experimental infections of young chicks with isolates of E. cecorum have shown some promise [10] but have not been used to evaluate the relative virulence of different isolates.Some insects have been used in the virulence assays of human isolates of other Enterococcus species (reviewed in [51]).Alternative models could be used to discern whether sepsis isolates and BCO isolates have measurable differences in pathogenesis.Further, many of the genomes in national databases are poorly documented for specific clinical sources and host disease states; thus, there is a need for expanding the available genomes from well-documented cases.

Supplementary Materials:
The following supporting information can be downloaded at: https: //www.mdpi.com/article/10.3390/microorganisms12020250/s1, Figure S1: ParSNP phylogenomic tree of 227 Enterococcus cecorum isolates midpoint-rooted with E. cecorum SA1 as the reference genome; Table S1: Phenotypic group assignments for the 227 isolate genomes used in this work; Table S2: BLASTp results for the 227 genomes queried with the 12 encoded polypeptides (peg 312 to 323) from the capsular cluster identified by Borst; Table S3: Scoary output for gene presence-absence for phenotypic trait chicken disease.;Table S4: Scoary output for gene presence-absence for phenotypic trait Sepsis Strict; Table S5: Scoary output for gene presence-absence for phenotypic trait Sepsis Strict Reduced (SSR); Table S6.Scoary output for gene presence-absence for phenotypic trait CIRMBP Cluster 1; Table S7: Most differentially represented core genome SNPs distinguishing BCO and sepsis isolate genomes as determined using ParSNP; Table S8: Most differentially represented core genome

Figure 1 .
Figure 1.PopPUNK phylogenomic tree of 227 Enterococcus cecorum isolates.Strain names are the leaves.Clusters of USA isolates from the sepsis survey are bracketed with letters denoting clusters

Table 3 .
Summary of fifteen polypeptide variants most prevalent in Enterococcus cecorum sepsis isolates.Entries for each variant include the gene name, predicted function, amino acid positions (AA Pos) affected, the reference residues (Ref), substitution (Alt), and percentage of isolates in each of the three phenotypic trait groups.