Genomic Analysis and Characterization of Pseudotabrizicola formosa sp. nov., a Novel Aerobic Anoxygenic Phototrophic Bacterium, Isolated from Sayram Lake Water

Aerobic anoxygenic photosynthetic bacteria (AAPB) are a kind of heterotrophic prokaryote that can use bacteriochlorophyll (BChl) for photosynthesis without oxygen production and they are widely distributed in aquatic environments, including oceans, lakes, and rivers. A novel aerobic anoxygenic photosynthetic bacterium strain XJSPT was isolated during a study of water microbial diversity in Sayram Lake, Xinjiang Province, China. Strain XJSPT was found to grow optimally at 33 °C, pH 7.5 with 1.0% (w/v) NaCl, and to produce bacteriochlorophyll a and carotenoids. Phylogenetic analysis based on 16S rRNA gene sequence and concatenated alignment sequences of 120 ubiquitous single-copy proteins both supported that strain XJSPT belonged to the genus Pseudotabrizicola. Both average nucleotide identity (ANI) and DNA–DNA hybridization (DDH) values were below the species delineation threshold. The primary polar lipids were phosphatidylcholine, phosphatidylglycerol, phosphatidylethanolamine, one unknown lipid, and one unidentified phospholipid. Based on the results of polyphasic analyses performed in this study, strain XJSPT represents a new member of the genus Pseudotabrizicola, for which the name Pseudotabrizicola formosa sp. nov. is proposed. The type strain is XJSPT (= KCTC 52636T = MCCC 1H00184T = SDUM 107003T). Comparative genomic analysis showed that four species of the genus Pseudotabrizicola shared 2570 core genes and possessed a complete anoxygenic photosystem II.


Introduction
Aerobic anaerobic photosynthetic bacteria (AAPB) are widely distributed in aquatic environments (oceans, lakes, and rivers). As a heterotrophic group, they can use reduced organic matter as electron donor under aerobic conditions to carry out non-oxygen-producing photosynthesis, relying on unique bacteriochlorophyll (BChl) and light reaction center [1,2]. The ATP produced by photosynthesis replenishes the energy required for growth, not only reducing the consumption of organic carbon, but also increasing the amount of dissolved organic carbon (DOC) entering the cells, which is vital for biogeochemical cycles [3][4][5].
The genus Tabrizicola, belonging to the family Rhodobacteraceae in the class Alphaproteobacteria, was firstly proposed by Vahideh et al. in 2013 and consists of eight validly published species and three effectively described species at the time of writing [6]. In 2022, Ma et al. reclassified Tabrizicola sediminis, Tabrizicola alkalilacus, and Tabrizicola algicola into a novel genus, Pseudotabrizicola gen. nov., as Pseudotabrizicola sediminis comb. nov., Pseudotabrizicola alkalilacus comb. nov., and Pseudotabrizicola algicola comb. nov. according to the results of polyphasic investigations [7]. Existing members of the genus Pseudotabrizicola are Gram-strain-negative, catalase-and oxidase-positive, and have Q-10 as the main respiratory quinone. Moreover, P. sediminis KCTC 72015 T and P. algicola KCTC 72206 T were reported to belong to AAPB, a kind of heterotrophic bacteria which have a photosynthetic gene and can produce BChl a in aerobic condition but cannot grow photoautotrophically under anaerobic conditions [8][9][10].
During our research of bacterial diversity at Sayram Lake, a cream-colored bacterium designated XJSP T was isolated from a lake water sample using a dilution-plating procedure and conventional isolation techniques. Polyphasic taxonomic investigations, including phenotypic characterizations, chemotaxonomic properties, and phylogenetic analysis, showed that strain XJSP T was a novel aerobic anoxygenic phototrophic bacterium species affiliated to the genus Pseudotabrizicola.

Bacterial Isolation and Culture
Samples from various habitats were gathered for bacterial enrichment and isolation as a part of the study about bacterial resource diversity in our lab [11]. A water sample, collected from Sayram Lake, Xinjiang Province, China (44 • 30 30.41" N, 81 • 12 39.55" E), was diluted stepwise using sterile distilled water and each diluted sample was spread evenly on marine agar 2216 (MA; Becton Dickinson, Franklin Lakes, NJ, USA). The strain XJSP T was isolated from the coated medium, which was incubated at 25 • C for 10 days. Pure cultures were preserved for long-term in sterile 15% (v/v) glycerol supplemented with 1% (w/v) NaCl at -80 • C. The type of strain P. sediminis KCTC 72015 T was purchased as an experiment control strain from Korean Collection for Type Cultures center (KCTC).

S rRNA Gene Sequencing and Phylogenetic Analysis
The 16S rRNA genes of strain XJSP T were amplified using polymerase chain reaction (PCR) technology with two universal primers for bacteria (27F and 1492R) and a purified gene product was cloned using the method described previously to obtain almost complete 16S rRNA gene sequence [12]. The 16S rRNA gene similarities between strain XJSP T and closely related species were calculated using the NCBI BLAST service and EzBioCloud database. The 16S rRNA gene sequence of strain XJSP T and those of relevant strains were aligned by MUSCLE service [13] and phylogenetic trees were reconstructed with 1000 bootstrap replicates based on neighbor-joining (NJ), minimum-evolution (ME), and maximum-likelihood (ML) algorithms in MEGA X software [14,15]. The integrated method T92 + G + I was calculated as the best-fit substitution pattern for reconstructing the ML tree.

Whole-Genome Sequencing and Genome Annotation
Purified genomic DNA was obtained employing the SteadyPure bacterial genomic DNA extraction kit (Accurate Biotechnology Co., Ltd., Hunan Province, China) following the user guide. The draft genome of strain XJSP T was sequenced by Novogen (Tianjin, China) using Illumina Hiseq platform with the sequencing protocol of paired-end 150 bp fragment libraries and genome assembly was carried out with the Velvet software (v. 1.2.10) [16]. The genome sequences of related strains used in this paper were downloaded from the NCBI genomes repository. Gene prediction and annotation were carried out by Prodigal server [17] and the prokaryotic genome annotation pipeline (PGAP) implemented in NCBI [18].

Phenotypic Characteristics
Strain XJSP T was incubated on MA medium at 33 • C for the implementation of phenotypic characteristics investigations. After culturing for four days, the Gram staining reaction was checked with the Gram-stain kit produced by bioMérieux company, and cell morphology was observed by employing light microscopy (E600, Nikon, Tokyo, Japan) and scanning electron microscopy (model Nova NanoSEM450, FEI). The growth of strains at various pH ranges (pH 5.5-9.5, at intervals of 0.5) was tested in marine broth 2216 (MB; Becton Dickinson, Franklin Lakes, NJ, USA) with various pH values, and growth status was quantified using a microplate reader at 600 nm. The pH of mediums was adjusted using commercial additional buffers at a concentration of 20 mM: MES (pH 5.5 and 6.0), PIPES (pH 6.5 and 7.0), HEPES (pH 7.5 and 8.0), Tricine (pH 8.5), and CAPSO (pH 9.0 and 9.5). Temperature conditions for growth were tested at 0, 4, 10, 15, 20, 25, 28, 30, 33, 37, 40, 42, and 45 • C for approximately 7 days on MA medium (growth was recorded every 12 h). Salt tolerance was assayed using modified MA (prepared according to the MA formula, but without NaCl) with different NaCl concentrations (0, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10%, w/v).
Oxidase test was examined by employing the commercial bioMérieux oxidase test kit, and catalase activity was detected through bubbles production after adding 3% (v/v) H 2 O 2 to plate with fresh cultures. Hydrolysis tests of starch (0.2%, w/v), CM-cellulose (0.5%, w/v), alginate (2%, w/v), and Tweens (20, 40, 60, and 80, 1%, v/v) were determined based on the previous methods [28]. The commercial bioMérieux API 50CH and API ZYM reagent strips were used to test acid production and enzyme activities, respectively. Other biochemical analyses were performed applying the BIOLOG GEN III MicroPlates and API 50CH and all reagent strip tests were implemented following the user guide, except for adjusting the NaCl concentration to the optimum. The antimicrobial susceptibility test was investigated using the disc diffusion method under optimum conditions for a week [29].
Photoheterotrophic growth was tested under light exposure (2400 lx) and anaerobic conditions in the following liquid medium (per liter: 3 g sodium pyruvate, 1.2 g NH 4 Cl or 1g KNO 3 ), prepared with modified artificial seawater (per liter of distilled water: 3.3 g MgSO 4 , 2.3 g MgCl 2 , 1.2 g CaCl 2 , 0.7 g KCl, 10 g NaCl) at 33 • C for 14 days. Photoautotrophic growth was determined by anaerobically incubating strain XJSP T under light condition (2400 lx) with the following liquid medium (0.5 mM Na 2 S, 0.5 mM Na 2 S 2 O 3 and 0.1% (w/v) NaHCO 3 ), prepared with modified artificial seawater as described above [9]. Anaerobic conditions were achieved by boiling the liquid medium and adding sterilized liquid paraffin. Additionally, the presence of pigments was detected by in vitro spectrometric methods described by Biebl et al. [30]. Cells cultivated aerobically in MB medium for four days were collected, washed twice, and suspended in a mixture of acetone and methanol (7:2, v/v) to extract the pigments. Absorption spectra were measured using a spectrophotometer.

Chemotaxonomic Properties
Comparative analyses of chemotaxonomic property between Strain XJSP T and experiment control strain P. sediminis KCTC 72015 T were performed using cells harvested in MB medium at the late stage of exponential growth phase. Lipids were obtained in the mixture system of chloroform, methanol, and water (2.5:5:2, v/v/v), and separated and identified by two-dimensional silica gel thin layer chromatography (TLC) plate [31,32]. Extracted fatty acids were separated and analyzed based on the TSBA40 database of the Sherlock Microbial Identification System (MIDI) by an Agilent gas chromatograph (product model 6890N), as used previously [33]. Respiratory quinones obtained from lyophilized thallus were separated by TLC plates and identified applying HPLC technology [34].

S rRNA Gene Sequence and Phylogenetic Analysis
Almost complete 16S rRNA gene sequence of strain XJSP T (1425 bp) was obtained in this study. The 16S rRNA gene sequence similarity values between Strain XJSP T and members of the genus Pseudotabrizicola showed 97.7-99.5% (Table 1). The NJ tree inferred from 16S rRNA gene sequence exhibited strain XJSP T located in the cluster of Pseudotabrizicola species, which supported strain XJSP T belonged to the genus Pseudotabrizicola ( Figure 1). The topology of strain XJSP T and the genus Pseudotabrizicola was also obtained in the phylogenetic trees reconstructed with the ML and ME algorithm ( Figure 1).

Genome Properties and Phylogenetic Analysis
The draft genome (strain XJSP T ) of 3,702,758 bp in length was obtained after assembly with an average 300× coverage depth, producing 14 contigs, and the N50 value is 812,613 bp. All contigs were larger than 1595 bp, with the largest being 1,520,968 bp. The calculated G+C content was estimated to be 63.4 mol%. The 16S rRNA gene sequence of strain XJSP T detected from genome (1467 bp) covered that obtained by amplification (1425 bp). The PGAP results showed that a total of 3552 genes were predicted, including 52 RNA genes (3 rRNA genes, 3 ncRNA genes, and 46 tRNA genes) and 3470 potential proteincoding genes. Detailed comparison results of genome statistics of the Pseudotabrizicola are shown in Table 2. The genome similarity indices ANI and DDH values between strain XJSP T and Pseudotabrizicola species were 81.4-87.3% and 23.9-32.5% respectively, with both being lower than the values for species demarcation [35,36] (Table 1), which indicated that strain XJSP T was a novel member belonging to the genus Pseudotabrizicola. The IQ-TREE built on concatenated alignment sequences of 120 ubiquitous single-copy proteins in bacteria showed the evolutionary relationships of strain XJSP T and the genus Pseudotabrizicola ( Figure 2).

Pan-Genome Analysis of the Genus Pseudotabrizicola
Comparative genomic analysis of the genus Pseudotabrizicola was carried out to identify the consistency and difference of the members. As shown in Figure 3, 2570 core genes were shared by the four Pseudotabrizicola species, strain XJSP T , P. sediminis KCTC 72015 T , P. alkalilacus KCTC 62173 T , and P. algicola KCTC 72206 T , which accounted for more than half (59.8-74.2%) of each genome. KEGG annotation was performed for core, accessory, and unique genes to analyze their distribution in different metabolic pathways. The results showed that the core genes were more involved in the metabolisms of amino acid, energy and nucleotide, translation and replication, and repair. The proportion of accessory genes was higher than that of core genes and unique genes in carbohydrate metabolism, cofactors and vitamins metabolism and xenobiotics biodegradation. However, unique genes contributed more to drug resistance, lipid metabolism, membrane transport, and signal transduction (Supplementary Figure S1).

Metabolic Pathways and Secondary Metabolites Analyses
The results of metabolic pathways analyzed by KEGG's BlastKOALA service showed that most of carbohydrate metabolism pathways were intact, except for the incomplete glycolysis pathway (M00001) in strain XJSP T .
The four species of the genus Pseudotabrizicola, strain XJSP T , P. sediminis KCTC 72015 T , P. alkalilacus KCTC 62173 T , and P. algicola KCTC 72206 T , all possessed a complete anoxygenic photosystem II (M00597), namely the L and M subunits of photosynthetic reaction center. The pufML gene sequences of strain XJSP T are given in Supplementary Table S1. Moreover, strain XJSP T had complete phosphatidylcholine (PC) and phosphatidylethanolamine (PE) biosynthesis pathway (M00091 and M00093, respectively) and isoprenoid biosynthesis pathway (M00096 and M00364), which was consistent with P. sediminis KCTC 72015 T , P. alkalilacus KCTC 62173 T , and P. algicola KCTC 72206 T (Figure 4). The potential secondary metabolites synthesized by strain XJSP T were identified using antiSMASH. The results showed that the genome of strain XJSP T encoded eight identified gene clusters about the biosynthesis of secondary metabolites (Supplementary Table S2). One of the eight gene clusters, for terpene, showed 100% similarity to a known biosynthetic gene cluster-encoding carotenoid [37].