Next Article in Journal
Effect of Lanthanum Sorption on the Behavior of Rarely Crosslinked Acidic and Basic Polymer Hydrogels during Remote Interaction
Next Article in Special Issue
Life Cycle Assessment of Functionalized Bionanocompounds with Ice Nucleation Protein for Freezing Applications
Previous Article in Journal
Preparation and Properties Study of Wood-Based Cushioning Materials
Previous Article in Special Issue
Nanoparticles Based on Chondroitin Sulfate from Tuna Heads and Chitooligosaccharides for Enhanced Water Solubility and Sustained Release of Curcumin
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Exclusive Biosynthesis of Pullulan Using Taguchi’s Approach and Decision Tree Learning Algorithm by a Novel Endophytic Aureobasidium pullulans Strain

by
WesamEldin I. A. Saber
1,*,
Abdulaziz A. Al-Askar
2,* and
Khalid M. Ghoneem
3
1
Microbial Activity Unit, Microbiology Department, Soils, Water and Environment Research Institute, Agricultural Research Center, Giza 12619, Egypt
2
Botany and Microbiology Department, Faculty of Science, King Saud University, Riyadh 11451, Saudi Arabia
3
Seed Pathology Research Department, Plant Pathology Research Institute, Agricultural Research Center, Giza 12619, Egypt
*
Authors to whom correspondence should be addressed.
Polymers 2023, 15(6), 1419; https://doi.org/10.3390/polym15061419
Submission received: 5 February 2023 / Revised: 3 March 2023 / Accepted: 10 March 2023 / Published: 13 March 2023
(This article belongs to the Special Issue Renewable and Sustainable Polymers)

Abstract

:
Pullulan is a biodegradable, renewable, and environmentally friendly hydrogel biopolymer, with potential uses in food, medicine, and cosmetics. New endophytic Aureobasidium pullulans (accession number; OP924554) was used for the biosynthesis of pullulan. Innovatively, the fermentation process was optimized using both Taguchi’s approach and the decision tree learning algorithm for the determination of important variables for pullulan biosynthesis. The relative importance of the seven tested variables that were obtained by Taguchi and the decision tree model was accurate and followed each other’s, confirming the accuracy of the experimental design. The decision tree model was more economical by reducing the quantity of medium sucrose content by 33% without a negative reduction in the biosynthesis of pullulan. The optimum nutritional conditions (g/L) were sucrose (60 or 40), K2HPO4 (6.0), NaCl (1.5), MgSO4 (0.3), and yeast extract (1.0) at pH 5.5, and short incubation time (48 h), yielding 7.23% pullulan. The spectroscopic characterization (FT-IR and 1H-NMR spectroscopy) confirmed the structure of the obtained pullulan. This is the first report on using Taguchi and the decision tree for pullulan production by a new endophyte. Further research is encouraged for additional studies on using artificial intelligence to maximize fermentation conditions.

1. Introduction

The ubiquitous yeast-like fungus, Aureobasidium pullulans (de Bary) G. Arnaud (1918), is found in various conditions, especially in the phyllosphere, as an endophyte or epiphyte of a broad extent of plants, without producing disease symptoms. Biotechnologically, A. pullulans is regarded as a highly important fungus in the production of various biomolecules, e.g., siderophore, and enzymes [1,2,3,4]. The fungus is polymorphic and generally regarded as non-toxigenic and non-pathogenic [5]. The biosynthesis of pullulan (poly-α-1,6-maltotriose) by A. pullulans is the main spot [6,7].
The genus Aureobasidium is an ascomycetous yeast-like fungus that comprises 27 taxa (14 species and two varieties). Among these species, A. pullulans (de Bary) G. Arnaud is the most studied [6]. A. pullulans is a widely distributed black, yeast-like fungus that can naturally exist in various environments, including the phyllosphere [1,3]. It can be found as an epiphyte and/or endophyte of a vast array of plants (e.g., apple, cucumber, green beans, grape, stored barley grain cabbage, and coconut water) without developing diseases [1,2,4].
Pullulan is a biodegradable polymer derived from natural resources and can be considered a renewable resource and environmentally friendly material; it has potential uses in various industries, food packaging, medical devices, and cosmetics [6]. Pullulan is an unbranched extracellular polymer, which is neutral (like dextran, cellulose), water-soluble, and viscous polysaccharide. The alpha-glucan consists of units of glucose with arranged in a linear structure, i.e., three units of glucose molecules are linked by α-1,4 glycosidic bond to form maltotriose units, which are repeatedly polymerized by α-1,6 glycosidic bond on the terminal glucose, thus pullulan known as α-1,4-; α-1,6-glucan [8,9].
Pullulan helps cells withstand dehydration and predator, and also allows for an easier exchange of substances in and out of the cell [10]. Low concentrations of pullulan form high-viscosity solutions and can be made into fibers and films, making it ideal for coating food and pharmaceuticals. It is available commercially as a tasteless polymer, used in food and non-food industries as an additive, processing aid, thickener, stabilizer, emulsifier, and gelling agent. Additionally, pullulan is used in various applications, including edible films for breath fresheners, pharmaceuticals like retard tablets, capsules, and microcapsules, and industries such as cosmetics, textiles, and paper [6,7,8,10].
Several attempts are being done to manipulate pullulan biosynthesis through genetic engineering. For instance, overexpression of the gene encoding for the pullulan synthase enzyme has been reported to increase pullulan production [11]. However, genetic engineering methods can be time-consuming and expensive, and the stability of the modified strains can be a limitation. Optimization of the medium composition, as well as the different process parameters (e.g., temperature, pH, and agitation) for pullulan production by varying the level of each variable, can be challenging and have conflicting effects; this is due to the biosynthesis process needs an accurate balance of the fermentation conditions and very sensitive to the interactions among components [12,13,14].
Because of the drawbacks of genetic engineering, and alteration of the medium components (that may lead to an inconstant supply of pullulan due to uncontrolled factors), Taguchi’s experimental design can deal with such issues. Taguchi’s method is a robust statistical design that optimizes process performance and quality using fractional factorial design and multi-parameter optimization. It is a straightforward and organized approach for finding the optimal level of each factor while maximizing the response with a limited number of experimental runs on multiple variables [15,16]. The Taguchi method originated from expanded factorial and Latin squares (L) designs, known as orthogonal arrays, which are matrices where every pair of columns contains the same number of each possible pair of factors [16,17].
The goal of orthogonal arrays is to optimize the signal-to-noise (S/N) ratio of responses rather than the responses themselves, reducing process variability [18]. The S/N ratio calculates the response’s deviation from the target. It represents the mean value as signal and the standard deviation as noise. The objective is to minimize variability by amplifying the S/N ratio. This is what differentiates the Taguchi method from traditional statistical techniques. Taguchi categorizes the S/N ratio into three types, i.e., nominal is better, larger is better, and smaller is better, depending on the desired response [15,17]. The S/N ratio replaces the response in determining the optimal control factor levels and disregards variations caused by uncontrollable conditions [15]. To our knowledge, none of the previous attempts at pullulan biosynthesis used Taguchi’s approach.
A decision tree (DT) is a supervised machine learning and data mining algorithm represented as a tree-like structure. Each node represents a feature, branches represent decision rules, and leaves represent outcomes. The aim is to predict the target variable by learning decision rules from data features. The tree is constructed by starting at the root node and then recursively partitioning the data into subsets based on the values of the input features. The result is a tree where each leaf node represents a class label, and each internal node represents a test of one of the input features. Decision trees can also be used for other tasks, such as regression and clustering. They are commonly used in decision support systems and operations research. It is essential to have proper data collection and annotation to use a decision tree. The DT can also be used to predict the optimal process parameters for new conditions, such as changes in the composition of the inoculum or changes in the configuration of the substrate [19,20]. It is worth noting that the specific details of the implementation can vary depending on the specific application and dataset. Also, the optimization should be done with caution and validation, as the decision tree can be affected by overfitting and other issues. No previous attempts applied the DT approach in pullulan biosynthesis.
Herein, we report a novel incorporation of both Taguchi’s approach and the machine learning algorithm of DT to optimize the fermentation process parameters for maximization of pullulan biosynthesis using the new endophytic candidate, A. pullulans. Furthermore, A. pullulans were molecularly identified, and the spectroscopic characterization of the resulting pullulan was investigated.

2. Materials and Methods

2.1. Isolation of Endophytic Yeast-like Fungus

The endophytic yeast-like fungus A. pullulans were isolated from common bean seeds. Ten healthy seed samples were collected from various Saudi Arabia provinces (Riyadh, Al-Qaseem, Al-Ahsaa, and Ha’il), lie within 23°27′ N and 27°52′ N (latitudes), and 41°72′ E and 49°58′ E (longitudes). The collected common bean pods were stored at 4 °C after labeling. Isolation of the endophytic seed-borne yeast-like fungus from seed samples was carried out following the technique proposed by Rashad et al. [21]. First, the surface was sterilized with 70% ethanol for 1 min, then 2% sodium hypochlorite for 90 s, then with 100% ethanol for 30 s, and finally, rinsed with distilled water for 10 min before drying under laminar airflow, utilizing sterilized filter paper. Potato sucrose agar (PSA) medium was used to plate 10 seeds and incubated at 25 ± 2 °C in the dark. After 4 days, colonies around each seed were picked and purified by respreading on PSA plates. The single pure colonies were obtained, which were subcultured regularly on PSA slants.

2.2. Identification

2.2.1. Morphological Characteristics

For the morphological identification of A. pullulans colonies, daily observation of colony characteristics was done. Furthermore, a slide culture was made from the grown colony on PSA, which was smeared with methylene blue and observed by wet mounting using bright field microscopy. Identification of the Aureobasidium strain was done following the previously reported features [22,23,24].

2.2.2. Molecular Identification

A. pullulans was identified through molecular methods. DNA was separated and amplified using polymerase chain reaction (PCR) with DNeasy Tissue Kits (from QIAGEN, Hilden, Germany). DNA concentration was determined by comparing it with a lambda DNA standard on a 1% agarose gel. The product was diluted to 20 ng DNA/µL and stored at −20 °C.
The ITS region was amplified and sequenced. The PCR reaction mixture contained 1x buffer (Promega, Madison, WI, USA), 1.5 mM MgCl2, 1 U Taq DNA polymerase (GoTaq, Promega), 0.2 mM dNTPs, 30 picomoles of each primer of ITS5-F (5′-GGAAGTAAAAGTCGTAACAAGG-3′) and ITS4-R (5′-TCCTCCGCTTATTGATATGC-3′), 30 ng of genomic DNA and ultra-pure water. The total volume was made up to 50 µL. The thermal cycling was performed using the Perkin-Elmer/GeneAmp® PCR System 9700 (PE Applied Biosystems, Waltham, MA, USA) with 40 cycles of 30 s each (after an initial denaturation cycle of 5 min at 94 °C), annealing (45 °C for 30 s), and elongation (72 °C for 30 s), with a final extension of 7 min at 72 °C. The PCR products were purified using the Montage PCR Clean-up kit (Millipore, Burlington, MA, USA) to eliminate unincorporated primers and dNTPs. Nucleotide sequence analysis was performed using NCBI BLASTn online tool (http://ncbi.nlm.nih.gov/BLAST/, accessed on 1 January 2023) and aligned utilizing Align Sequences Nucleotide BLAST with the nr/nt database.
The taxa’s evolutionary relationships were compared utilizing the Neighbor-Joining procedure [25]. The percentage of replicate trees (1000 replicates) was compared [26]. The pairwise evolutionary distance was calculated using pairwise deletion. The ambiguous sites were excluded, and the evolutionary analyses were carried out [27].

2.3. Fermentation Medium and Cultural Conditions

The synthesis of pullulan was performed using the medium of Singh et al. [28] with modifications, which contained (g/L) sucrose (50), K2HPO4 (5.0), NaCl (1.0), MgSO4·7H2O (0.2), yeast extract (2.0), and pH 6.5. This medium was used for maintenance, inoculum preparation, and production of exopolysaccharide.
The inoculum of A. pullulans was prepared by scraping culture slant into a 250 mL Erlenmeyer flask containing 50 mL of the sterilized medium, then incubated at 30 °C, 200 rpm. After three days, this inoculum was utilized to inoculate 100 mL of the biosynthesis medium in a 500 mL Erlenmeyer flask at the rate of 10% (v/v).
Unless otherwise stated, the production conditions of pullulan were performed by incubating the Erlenmeyer flasks at 30 °C on a rotary shaker at 200 rpm for up to 7 days. After the fermentation time, the cells were removed by centrifugation at 2500× g for 15 min.

2.4. Determination of Pullulan

Pullulan content was separated and determined following the method of Singh et al. [28]. Briefly, the cells were separated from the culture by centrifugation (1400× g for 15 min). The pullulan in the supernatant was separated by adding two volumes of ice-chilled isopropyl alcohol whilst stirring. The mix was kept overnight at 4 °C and then centrifuged (2500× g for 15 min) to separate the polysaccharide, which was redissolved in deionized water, reprecipitated again with isopropyl alcohol, and washed with acetone, then deionized water, this process repeated thrice to ensure the purity of pullulan. The obtained pullulan was dried (80 °C to constant weight). The pullulan content (%) was expressed as the weight of dry pullulan produced in deciliter fermented broth.

2.5. Taguchi Experimental Design

Experiments of pullulan biosynthesis using A. pullulans were performed in 100 mL batch fermentation conditions. For optimization of the biosynthesis process, seven independent factors were assessed, representing the medium variables reported above (sucrose, K2HPO4, NaCl, MgSO4, yeast extract, pH, and incubation period). The Taguchi orthogonal array was employed to find the ideal conditions for pullulan biosynthesis. The design was structured with the minimum number of runs, i.e., 3^7. Thus 27 runs were generated for the design array by utilizing seven factors with three (high, middle, low) levels (L27). A generic signal-to-noise (S/N) ratio was employed to measure experimental variation, aiming to enhance pullulan biosynthesis quality through decreased mean squared deviation. The type of function applied (larger is better) that maximize pullulan biosynthesis the next Equation (1) was used:
S / N   r a t i o = 10 × l o g ( Σ ( 1 / Y 2 ) / n )
where S/N is the signal-to-noise, n is the number of observations, and Y is the observed data.
Taguchi’s design was experimentally performed and repeated in three different runs. ANOVA was estimated to identify the significant parameters. Both the S/N ratio and ANOVA analysis allow the forecast of the most favorable process parameter combinations. A confirmation experiment was carried out to confirm the obtained optimal conditions based on the parameter design.

2.6. Decision Tree Learning Algorithm

A parametric learning classification and regression tree was devoted to exploring the possible solutions for improving pullulan biosynthesis using the set of tested variables. The seven fermentation parameters were the input continuous predictor variables that were used to alleviate the noise resulting from the experimental procedures. The biosynthesis of pullulan was the target variable (response).
The decision tree learning algorithm with pullulan biosynthesis was created by a group of rules based on the continuous predictor in the data set using least squared errors as a node-splitting method (i.e., choosing the variable for the smallest mean squared error (MSE) and root mean squared error (RMSE) upon regression), and maximum R2 for selecting the optimal tree. To evaluate the prediction precision and avoid over-fitting, validation with a test set was performed at a random fraction of 0.3, where 70% of data was applied for training and the other 30% was used for testing (70/30% training/test sets) a total of 81 runs, representing three replications of the Taguchi experimental design with 27 each. The nodes continued to split until the terminal nodes could not be divided further. Deeper trees tend to have more accurate predictions, so the deepness of the DT was fixed when further changes did not decrease the mean squared error.

2.7. Validation of Taguchi and DT Models

The designed model was validated experimentally using three repetitions of different combinations of the seven factors created by the two models. The observed and predicted values were compared to assess the model’s accuracy in maximizing pullulan biosynthesis.

2.8. Spectroscopic Characterization

The isolated pullulan samples were scanned by Fourier-transform infrared spectroscopy (FT-IR) analysis with the potassium bromide pellet method [29,30]. Infrared spectra were documented on a Bruker FT-IR spectrometer (Bruker Corporation, Vienna, Austria), OPUS version 7.5, from 4000 to 350 cm−1 of scale with sensitivity > 20.0%, and the data were recorded as wavenumbers (ν, cm−1).
The sample was prepared for the proton nuclear magnetic resonance (1H-NMR) analysis by dissolving 15 mg of the obtained pullulan in 0.5 mL DMSO-d6, and drops of D2O solvent were added, in which tetramethylsilane (TMS) was applied as an internal standard [24]. The 1H-NMR spectrum was run on Bruker AVANCE III 400 MHz NMR Spectrometer (Bruker Corporation, Vienna, Austria). The chemical shifts δ were recorded in ppm downfield from TMS, utilizing the tolerating deuterated signals of the used solvents as an internal reference (DMSO-d6/D2O: δH = 2.516, and 4.184 ppm). 1H-NMR (DMSO-d6/D2O) (δ, ppm): 3.135 (s, 1H, CH), 3.302–3.323 (d, 1H, CH), 3.390 (d, 2H, CH2), 3.553–3.701 (d, 1H, CH), 3.767–3.797 (d.d., 2H, CH2), 4.694 (s, 1H, CH), 5.027–5.061 (d, 1H, CH).

2.9. Software and Statistical protocol

The data of the pullulan biosynthesis by A. pullulans are the mean of three trials with standard deviation (±SD) values. Taguchi orthogonal array, the supervised learning algorithm of the decision tree, and the mathematical analysis were performed using Minitab software (version 21, Minitab Inc., State College, PA, USA).

3. Results and Discussion

3.1. Isolation and Morphological Identification of A. pullulans

The current trial revealed the isolation of A. pullulans AKW from common bean seeds as an endophytic fungus and morphologically identified. On PSA, the fungus optimally grows at 30 °C, within a temperature range of 10–35 °C. The isolated fungus produced smooth, faintly pink, yeast-like colonies enclosed with a slimy mass of spores. As time passed, the colonies turned black because of chlamydospore production. The colonies were flat, smooth, moist, yeast-like, slimy, shiny, and leathery in appearance. The surface is cream at the beginning and becomes black with time, with a grayish fringe by aging. Examination of the cell morphology of the isolate (Figure 1) by bright field light microscopy showed two types of conidia are produced. The primary conidia are smooth, hyaline, ellipsoidal, one-celled, and variable in shape and size, whereas the secondary conidia are smaller. Conidiophores can be indistinguishable, occur in between cells, or arise from short lateral branches. Endoconidia is generated within an intercalary cell and moves to a neighboring empty cell. Hyphae are clear (hyaline), smooth, thin-walled, and have transverse septa. The microscopic examination also showed the typical A. pullulans polymorphology with hyphae, blastospores, and chlamydospores. Both colony features and microscopic morphology were in line with the A. pullulans descriptions by Barnett and Barry [22], Domsch et al. [23], and Hermanides-Nijhof [24].

3.2. Molecular Identification of A. pullulans

A. pullulans AKW was molecularly identified based on ITS. The method compares the sequence of the 18S rRNA gene after it has been amplified by PCR. From BLAST results, the endophytic A. pullulans AKW displayed high similarity with the previously similar strains on the GenBank. The phylogenetic tree (Figure 2) shows the fungus as A. pullulans AKW, which was in agreement with the morphological identification. The fungal strain was registered on GenBank with accession number OP924554. A. pullulans AKW belongs to Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes; Dothideomycetidae; Dothideales; Saccotheciaceae; and Aureobasidium.
Molecular identification, due to its high accuracy, is commonly used to quickly identify filamentous fungi in different taxonomic categories. This technique involves amplifying and comparing the sequence of the 18s rRNA gene through the ITS region using PCR. The uniform fragment size of the ITS in many fungal clusters makes nucleotide sequencing of ITS essential for detecting interspecific and sometimes intraspecific variation [31].
The well-known sequence, used for constructing phylogenies, is accurately labeled and demonstrates a strong connection to similar fungal strains in the GenBank. The ITS sequence is consistent across a broad range of fungal groups, making it valuable in detecting interspecific and sometimes intraspecific differences among organisms [32]. Additionally, the sequences of the non-functional ITS region are frequently highly diverse among fungal species, making it sufficient for identifying fungi at the species level [31,33]. Technically, the rDNA’s repeated nature makes the ITS regions easy to amplify from a small DNA sample [31]. Thus, ITS’s nucleotide sequencing is considered a fast and highly accurate identification approach compared to other markers. This approach can also be used for barcoding a wide range of fungal species [34,35].

3.3. Pullulan Biosynthesis Using Taguchi Approach

The pullulan biosynthesis process by A. pullulans was scaled up utilizing the Taguchi archetype by testing seven continuous variables that influence the biosorption process at three levels each (37). The Taguchi orthogonal array (L27) generated 27 trials. The actual levels of the seven tested factors, together with the orthogonal array of L27, are presented in Table 1.
The Taguchi method was utilized to find the best conditions to enhance the pullulan biosynthesis process. Its orthogonal array design allowed for the efficient development of optimal conditions using the lowest number of trials, thereby saving both time and effort [15,17]. The orthogonal arrays of the seven control factors and the corresponding pullulan biosynthesis laboratory data of the 27 trials are presented in Table 1. As it appeared, the experiment was repeated in three separate runs; then, the data were statistically analyzed to estimate the mean of pullulan biosynthesis and S/N ratio.
The calculated SD values were very small, indicating the accuracy of the repeated runs. Normally, Taguchi analysis often begins with the calculation of the S/N ratio to identify the control factor(s) that reduces variability, the S/N ratio was optimized based on the larger-is-better role, which is used to amplify the pullulan biosynthesis process, thus the level of control factor(s) that maximize pullulan biosynthesis and have no or small impact on the S/N ratio is determined. Accordingly, the highest pullulan biosynthesis and the S/N ratio (17.18) were achieved by trial L7 (7.23%).

3.3.1. Determination of Significant Factors

Firstly, the Taguchi model was utilized to calculate the fitted (predicted) values of pullulan biosynthesis. The fitted values were so adjacent to those of the actual ones; thus, the errors (residuals) were low, evidencing the model’s accuracy. Secondly, the overall design aptness was tested by measuring the determination coefficient (R2) and adjusted R2, being 0.9937 and 0.9863, respectively. Both are used for the model assessment. The present R2 and adjusted R2 values are close to one, being, concluding the accuracy of the Taguchi model.
It is already known that both kinds of R2 range from 0 to 1, and the closer to 1, the accuracy of the modeling capability [36]. If their values are ≥0.9, the model is adequately significant; in any case, R2 should not be <0.75 [37,38]. The R2 value assesses how much the response of pullulan biosynthesis varies based on changes in the amount of the tested factors. Adding more factors leads to a constant increase in R2, but this doesn’t account for the significance of the factors. The adjusted R2 solves this problem by considering the number of factors in the model. Unlike R2, adjusted R2 adjusts sensibly when adding significant factors only to the model, making it a better indicator of the model’s goodness of fit.
The ANOVA was performed on data from the Taguchi matrix to determine which of the seven medium variables has a significant influence on pullulan biosynthesis by A. pullulans (Table 2). Initially, the null hypothesis assumes that all terms have an equal effect and there is no relationship between the tested variables and the pullulan biosynthesis process.
The probability value was considered as a diagnosis tool for rating the significance. If the p-value < 0.05, the variable is considered significant in the pullulan biosynthesis process. On this basis, the seven investigated control factors had a significant effect on pullulan biosynthesis, except yeast extract, which recorded a p-value equal to 0.793 [38].

3.3.2. Assignment of the Optimum Factors’ Levels

Following the determination of significant factors, the next step was to assign the ideal value to each factor. To resolve this point, The S/N ratio was estimated for each factor level (Figure 3). The goal of optimizing process parameters was to improve the S/N ratio for better outcomes. The optimal level for each control factor that reduces noise variability and maximizes pullulan biosynthesis by A. pullulans was determined. The peak S/N ratio for each control factor was at the highest level of four factors, i.e., sucrose (60 g/L), K2HPO4 (6 g/L), NaCl (1.5 g/L), and MgSO4 (0.3 g/L), whereas at the lowest level of yeast extract (1 g/L), pH (5.5), and incubation time (48 h).
The Taguchi method is used to evaluate the mean output of each run in the inner set and determine the variability using the S/N ratio, which varies accordingly. The S/N ratio gauges the output’s variation in relation to the target value under varying noise conditions. Taguchi’s design is robust and can easily pinpoint the control factors (7 in this instance) that minimize variability in a process by reducing the impact of uncontrollable noise factors. Taguchi’s design philosophy involves inducing variability by manipulating noise factors during experimentation, making it easier to identify control factor settings that increase process resilience to noise factor variation. A higher S/N ratio signifies optimal control factor settings in reducing the impact of noise (non-controlled) factors [16,39,40].
The S/N ratio response of pullulan biosynthesis for each level of each control factor was calculated, then the delta (a measure of the range between the uppermost and lowermost average response values) for each factor was estimated. Accordingly, the rank was determined, and the highest delta value designates the relative importance of the factor on the S/N ratio (Table 3).
The results show that MgSO4 (delta 3.11, rank 1) has the major effect on the S/N ratio, followed by NaCl (delta 2.04, rank 2), sucrose (delta 1.69, rank 3), K2HPO4, (delta 1.59, rank 4), incubation time (delta 1.29, rank 5), initial pH (delta 1.17, rank 6), and finally yeast extract (delta 0.14, rank 7). The average mean response of the seven tested factors came in line with the S/N ratio, with an exception in the order or rank of both sucrose and K2HPO4. Such mathematical calculations concluded satisfactory accuracy of both the S/N ratio and mean analysis of the seven tested input variables.
Eventually, Taguchi orthogonal array was utilized to determine the optimal operating conditions of the parameters with the greatest impact on the target output. Further, the optimum level of each variable was also assigned. Before approving these results, the residual analysis is usually explored.

3.3.3. Residual Analysis

Residuals (errors) represent the discrepancy between the predicted output from a model and the actual measured output. Thus, residuals represent the portion of the validation data not explained by the model. Residuals were estimated to determine whether they are negatively affecting the Taguchi model and to verify whether the model meets the assumptions of the data analysis of pullulan biosynthesis. Four residual analyses were depicted (Figure 4).
The normality of residuals was assessed (Figure 4A) to verify the normal distribution of the residuals. As shown, the normal probability plot of the residuals follows a straight line without outlier points, confirming the non-normality. Residual vs. fit values (Figure 4B) show the random distribution of the residuals and further have constant variance i.e., distributed eventually on both sides of 0 with no identifiable arrangement. Another, depicting the histogram of residuals frequency (Figure 4C), revealed no skewness or outliers of the residuals. Finally, when residuals are displayed in time order, i.e., plotting the residuals vs. order of run (Figure 4D), the pattern verifies that the residuals are independent and randomly fall around the center line with no specific pattern or tendencies. All previous residual analyses supported the normality of the errors (residuals) and the appropriateness of the Taguchi design in modeling pullulan biosynthesis by A. pullulans.
Taguchi’s experimental design is a robust approach that forms the cornerstone of the Taguchi method, enabling accurate prediction of multi-variable processes with minimal experimental trials [16,39,40]. This work demonstrates the effectiveness of using the Taguchi method to identify the optimal combination of seven variables to maximize pullulan. Based on Taguchi data, the seven examined variables exhibited various degrees of significance and importance. Thus, the null hypothesis was disproven since the alternative hypothesis indicates the existence of significant variation caused by the parameters studied.

3.4. Decision Tree Learning Algorithm

The data from Taguchi experimental design (Table 1) were used for further investigation using the DT learning algorithm. The data were split 70/30% training/test sets. Upon performing the decision tree learning process, the fitted and residual values were estimated together with the terminal nodes.
The DT is a possible solution to a problem based on given conditions. It starts with a single box and ends up in numerous branches and roots, i.e., numerous solutions. It is a type of supervised learning algorithm that has target variables. To select solutions, DT creates classifications, which are applied to both categorical and continuous variables. Using a DT, the population or samples can be split into two or more homogeneous sets. These homogeneous sets are constructed based on the most significant differentiator on input variables [19,20].
Little or no information is available about the application of DT in the fermentation processes for the determination of important factors. However, herein DT was used to optimize the process parameters of the fermentation medium by analyzing the relationship between the process parameters and the pullulan yield. This was done by training the DT to predict which of the seven process parameters is optimal for fermentation conditions. The DT was trained using the obtained data collected from pullulan production. The decision tree was used to identify the most important process parameters and identify the optimal range of values for these parameters.

3.4.1. Decision Tree Selection

The R2 vis number of the terminal node was plotted (Figure 5) to find out the smallest regression tree that maximizes the R2 value. The R2 values for nine trees and nine terminal nodes were generated from the validation samples. The R2 values of the validation samples typically level off (plateau) and eventually start to decline as the tree grows larger. The optimal regression tree that maximizes the R2 value is 91.51 (for training) and 86.15% (for testing). The tree is optimal as the selection criterion was to create the smallest tree with an R2 value within a standard deviation of 1 of the maximum R2 value.
Because the chart shows that the R2 values are relatively increased (more than a standard deviation of 1) up to the end of the 9th node; therefore, there is no need to look at the performance of some of the smaller trees that are like the optimal tree. Typically, a tree with fewer terminal nodes (such as the current nine-terminal node tree) provides a clearer understanding of how each predictor variable influences the response values. Additionally, a smaller tree simplifies identifying a few key predictors for additional research.
The tree was constructed and investigated for the distinction of terminal nodes on the diagram (Figure 6). The tree diagram shows all 81 cases from the full data set. The target was to find the largest mean with the lowest standard deviations.
The first node used 53 cases and is split at two levels of MgSO4, i.e., ≤0.15 and >0.15 g/L. Node 2 has 17 cases where the mean for the node (4.40 g/L pullulan) is less than the overall mean (5.28 g/L pullulan), with a standard deviation of 1.01, which is >the overall standard deviation. Node 4 has 36 cases with a low standard deviation, 0.68, which is < the overall standard deviation because a split yields a purer node. The mean (5.70 g/L pullulan) for the node is more than the overall mean. This node split MgSO4 into two levels at <0.25 and >0.25 g/L. Node 7 used 17 cases and showed further improvement in the mean and standard deviation. This node was split based on incubation time into ≤84 and >84 h. Node 8 used 10 cases and also showed improvement and split into two terminal nodes (7 and 8) based on sucrose concentration. Terminal node 7 has the highest mean, with only four cases. Accordingly, terminal node 7 showed the highest mean value of pullulan (6.98 ± 0.29 g/L). The node rules were sucrose ≤ 45, MgSO4 > 0.25, and incubation time ≤ 84; these rules were achieved in run no. L7, which also acts as a terminal node in the decision tree. On the other side, terminal node 1 has the smallest mean (2.95 g/L), suggesting that the data in terminal node 1 are probably skewed.
The error statistics of the decision tree learning model were calculated. The accuracy of the resulting tree was indicated with low MSE, being 0.0850 and 0.1397, and RMSE being 0.2916 and 0.37380.3738, for training and testing processes, respectively.

3.4.2. The Relative Variable Importance

The relative variable importance is quantified as the increasing percentage relative to the leading predictor (variable). The leading predictor has a relative importance of 100%, thus, the other variables are standardized relative to the leading predictor; accordingly, the importance of each variable could be easily interpreted. The chart (Figure 7) of the relative variable importance was used to detect which predictor is the most important to the tree. The most important (leading) variable is determined as the one with the highest score of improvement (and thus considered 100% important), and the rest of the variables are ranked accordingly. Variables that are considered important are primary or surrogate splitters in a decision tree. The most vital variable always has 100% relative importance (MgSO4 in the current study). The non-important variable does not appear in the tree.
As MgSO4 was the most important predictor variable with a 100% contribution in pullulan biosynthesis, the other variables were compared to MgSO4 to determine their relative importance. Consequently, NaCl (82.0%), sucrose (79.6%), and K2HPO4 (78.2%) showed higher relative importance than incubation time (21.7%) and pH (11.9%); the yeast extract was the lowest variable.
Although the positive importance of the seven variables, the relative ranking informs about the number of variables to regulate or keep track of in pullulan production. Significant decreases in the relative importance values between variables can assist in determining which variables should be regulated or monitored. In this connection, four variables have the highest relative important values that are relatively close together before a drop to 21.7% in relative importance for the incubation time. It is worth mentioning that the relative variable importance is in harmony with that obtained by Taguchi’s experimental data, confirming the accuracy of the experimental design and the obtained data.

3.5. Validation of Taguchi and DT Models

The prediction level of each of the seven tested factors was determined by both models (Table 4). The laboratory confirmation experiments were conducted in triplicate to verify the optimal process conditions calculated from both design models. The obtained experimental data confirmed and verified the fitness of both models when compared with predicted values.
The maximum pullulan biosynthesis by A. pullulans at the speculative values of the optimal levels of the seven tested variables was 7.17%, with a S/N ratio of 17.64. Under laboratory experimental conditions, these estimated values were evaluated. The experimental value of pullulan biosynthesis was obtained at 7.23 ± 0.13%. Similarly, to check the forecast capacity of the DT model, the actual value was calculated to be 7.11 ± 0.07 compared with the predicted one. This value was separated at the seventh node of the DT.
The actual values showed accuracy and were very close and obey to the predicted value by both models. However, from the economic point of view, the performance of the DT model was better than the Taguchi model since it takes only 40 g/L of sucrose compared to 60 g/L in the case of Taguchi to yield a similar yield of pullulan.
The relative importance and optimum level for each of the investigated variables varied accordingly. However, each variable plays a distinct role in pullulan biosynthesis. The most important one is magnesium sulfate, which is essential for the growth and survival of fungi. It acts as a source of magnesium involved in many metabolic processes, including energy production and DNA synthesis. Its sulfur moiety provides sulfur, which is an important component of amino acids and coenzymes [41,42]. Magnesium sulfate is commonly used as a co-factor in pullulan production. It acts as a key ionic activator in the formation of pullulan polysaccharides by the enzyme pullulanase. Additionally, magnesium sulfate can help regulate the osmotic pressure and pH in the reaction mixture, leading to improved pullulan yield and quality [43].
Sodium chloride came next in terms of relative importance. The rates of both specific glucose consumption and, consequently, the specific pullulan production rate improved due to the presence of NaCl, causing greater pullulan yield. Moreover, NaCl stimulates glucosyltransferase and α-phosphoglucose mutase (involved in pullulan biosynthesis) and also stimulates α-amylase (the degraded pullulan), upregulates the transcriptional levels of fks, pgm1, and amy2 genes, enhances the driving force for ATP supply, and helps to maintain intracellular uridine diphosphate-glucose at a high level in the fungal cells [44,45]. However, NaCl is essential for fungal growth, as it helps regulate the osmotic pressure of the growth medium. Fungi require an appropriate balance of solutes, including sodium and chloride ions, to maintain cellular turgor and facilitate the proper uptake of nutrients. Sodium ions play a role in regulating fungal osmotic pressure, while chloride ions are involved in metabolic processes and in maintaining the stability of cell membranes. However, too much NaCl can be toxic to fungi and inhibit their growth, so it is important to regulate its concentration in the growth medium [46,47].
Sucrose was the third important factor. An earlier study reported that the minimum response of pullulan production happened at the lowest level of sucrose; as the sucrose concentration increased, a considerable increment in pullulan biosynthesis occurred, indicating the significant influence of sucrose concentration on pullulan production. However, a higher concentration of sucrose led to a decrease in pullulan production due to low water activity and the osmotic impact caused by elevated sucrose levels [28].
Dipotassium hydrogen phosphate is used as a food additive and has been approved as generally recognized as safe. Besides acting as a crucial nutrient of potassium and phosphorus source for fungal growth, cell division, and metabolic processes, it also acts as a pH buffer to regulate and stabilize the medium pH, helping to maintain a stable pH environment for optimal fungal growth [48].
Incubation time is a determinant and key factor in pullulan production, as it affects the growth and metabolic activity of the microorganisms and the synthesis of pullulan. Obviously, our endophytic A. pullulans strain showed a relatively short incubation period (48 h) compared with the previously stated time [28,49]. In addition, the optimal incubation time varies depending on the microorganism and the conditions of the reaction, but generally, longer incubation times lead to increased pullulan yields. However, excessively long incubation times can result in reduced pullulan yields due to substrate depletion and the accumulation of inhibitory substances. It is, therefore, important to carefully control the incubation time to obtain optimal pullulan yields [45].
The degree of pH plays a crucial role in pullulan production, affecting both the growth of the microorganisms and the production of pullulan. Optimal pH conditions can vary depending on the specific microorganisms used, but generally, a slightly acidic to neutral pH (6.0–7.0) is favored. This range of pH values provides the microorganisms with the necessary conditions to carry out their metabolic activities, leading to the synthesis and accumulation of pullulan. Additionally, controlling pH can also help regulate other factors, such as osmotic pressure and the availability of essential ions for the production of pullulan [45].
Although yeast extract was ranked last in terms of relative importance, it is a common ingredient in the production of pullulan, serving as a source of essential nutrients (amino acids, vitamins, and trace elements) for the pullulan-producing microorganisms, thus plays an important role in promoting the growth and metabolic activity of the fungus, ultimately leading to the synthesis and accumulation of pullulan [45].

3.6. The Structure Characterization of Pullulan

The structure of the purified pullulan isolated from A. pullulans was interpreted in this work through the spectroscopic analysis involving the usage of two distinctive spectroscopic performances (FT-IR and 1H-NMR spectra).

3.6.1. The FT-IR Spectroscopy

FT-IR spectroscopy is a type of infrared spectroscopy that uses Fourier-transform techniques to analyze the infrared light absorbed or emitted by a sample. It is used to identify chemical compounds and analyze the molecular structure of materials.
The FT-IR spectral analysis was implemented to investigate the structural elucidation of the produced pullulan skeleton. Generally, the spectral data that was recorded (Figure 8 and Table 5) well-established the pullulan structure. As shown, the FT-IR spectral data (KBr, ν cm−1) revealed a sharp absorption band at 3309 cm−1 attributed to the stretching vibration of bonded and non-bonded OH groups, which are ascribed to the alcoholic or aliphatic hydroxyl chains. The absorption band that was recorded at 2924 cm−1 is attributed to medium C-H stretching owing to the presence of the sp3 C-H bond. Additionally, the absorption band situated at 1642 cm−1 is ascribed to the weak stretching vibration of O-C-O groups, which verified the presence of aliphatic ether moiety in the structural skeleton. Nevertheless, this result is well-matched with that reported study, which specified the value of the characteristic stretching vibration of O-C-O groups at 1636 cm−1 [50]. The absorption band that appeared at 1417 cm−1 is attributed to the medium C-H bending of the alkane chain of the sp3 C-H bond. Also, a medium absorption band due to the C-O-H group appeared at 1360 cm−1, signifying the presence of aliphatic ether moiety. Other types of pullulan were correspondingly perceived in the spectra comprising the C-O-C stretching absorption band at 1148 cm−1 and the C-O stretching absorption bands at 1077 and 995 cm−1. The strong absorption band due to the C-O-C group that appeared at 1148 cm−1 is typically well-matched with several previous data of FT-IR spectra [50,51,52]. In particular, a weak band that appeared in the FT-IR chart at 850 cm−1 supported the α-configuration of α-D-glucopyranose fragments in the pullulan skeleton. Additionally, the basic linkage of glucopyranose units to construct the pullulan framework appeared in the recorded data at 706, 755, and 930 cm−1—these values estimated the linkage of the units was accomplished through α-(1,4) and α-(1,6)-D-glycosidic bonds [53].

3.6.2. The 1H-NMR Spectroscopy

The 1H-NMR is a type of nuclear magnetic resonance spectroscopy that specifically measures the magnetic properties of hydrogen nuclei (protons) in a sample. It is used to determine the molecular structure of organic compounds and to study the properties of molecules in solution. The 1H-NMR spectrum provides information about the number of hydrogen atoms, their chemical environment, and their relative orientation in the molecule.
The 1H-NMR spectrum was performed for pullulan to characterize the structural framework. The protons of hydroxyl groups are exchangeable with D2O owing to dissolving the pullulan sample in a mixture of DMSO-d6/D2O to improve the solubility. A one-dimensional 1H-NMR spectrum was applied for the tested sample, which verified signals for the total nine protons for each glucose unit, excluding the exchangeable hydroxyl protons with D2O. The data of the 1H-NMR spectrum (Figure 9 and Figure 10) demonstrated that the signal attributed to the CH proton at the C4 position appeared at δ = 3.135 ppm. The recorded signals appeared in the downfield region (aliphatic region) between δ = 3.135 to 5.027 ppm deducing the protons on carbon atoms adjacent to high electronegative atoms verified the distinctive chemical location of a carbohydrate core. Consequently, the total integrated protons from the 1H-NMR chart indicated the presence of carbohydrate polymer chains with three units of the sugar moiety. The signals at δ = 5.027–5.061 ppm supported the α-1,6 and α-1,4 glycosidic bonds at the C1 position of the sugar moiety, as well as the formation of (1→4) and (1→6) linkages. The data verified the presence of two singlet signals at δ = 3.135 and 4.694 ppm attributed to the protons attached to the C4, and C2 positions, respectively. Correspondingly, the CH and CH2 protons appeared at 3.302–3.323, 3.553–3.701, and 3.767–3.797 ppm as a doublet or doublet of doublet signals are attributed to the protons attached to the C3, C5, and C6 carbons. The methine proton at the C1 position appeared more de-shielded at 5.061–5.027 ppm owing to the impact of the two adjacent oxygen atoms (high electronegative atom) to the carbon atom at the C1 position.
In comparison to the studies [53,54,55] that previously reported on the chemical structure elucidation of pullulan by 1H-NMR spectroscopy, typical data were recorded herein, excluding the signals due to the hydroxyl groups exchanged with D2O. In addition, the comparison of our data with commercial pullulan spectra [29,49] confirmed the structure of the current pullulan.
Several investigations have been conducted on pullulan production by A. pullulans. Overall, the results of previous studies have varied depending on the conditions used. However, these studies have provided valuable insights into the optimization of pullulan production by A. pullulans and the potential use of alternative nutritional and fermentation conditions for pullulan production [56]. One earlier study reported that the optimal conditions for pullulan biosynthesis by A. pullulans were at 28 °C and an agitation rate of 200 rpm. Under these conditions, the yield of pullulan was 40.1 g/L [12]. In addition, the statistical optimization of the central composite design was applied for using molasses as a carbon source for pullulan production by A. pullulans, with a yield of 45 g/L [13]. A more recent study investigated the use of soybean meal hydrolysate as a nitrogen source and found that pullulan production by A. pullulans reached 59.8 g/L [14].
Our study, on the other hand, represents the state-of-the-art for the biosynthesis of pullulan by the endophytic strain that is capable of producing 72.3 g/L, representing about a 1.5-fold increase over most of the previous investigation. This may also be due to the novel application of Taguchi’s design and the DT learning algorithm.

4. Conclusions

In sum, our results provide a concise and insightful description of applying both the Taguchi and DT learning algorithm paradigms in the biosynthesis process of pullulan by the new endophytic candidate, A. pullulans AKW. Another important conclusion that can be drawn is the relatively short fermentation period compared with previous studies on a more simplified medium. However, further research is encouraged for additional studies on using artificial intelligence to optimize the fermentation conditions for the maximization of pullulan biosynthesis. Other studies are also encouraged to focus on its biosynthesis regulation under the current fermentation circumstances.

Author Contributions

W.I.A.S. suggested the topic and drew the conceptualization. W.I.A.S., A.A.A.-A. and K.M.G. shared equally in visualization, methodology, data curation, writing—original draft, validation, reviewing, and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Plan for Science, Technology, and Innovation (MAARIFAH), King Abdul-Aziz City for Science and Technology, Kingdom of Saudi Arabia, grant number 2-17-04-001-0043.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data were reported in the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Andrews, J.H.; Spear, R.N.; Nordheim, E.V. Population biology of Aureobasidium pullulans on apple leaf surfaces. Can. J. Microbiol. 2002, 48, 500–513. [Google Scholar] [CrossRef] [PubMed]
  2. Olstorpe, M.; Schnurer, J.; Passoth, V. Microbial changes during storage of moist crimped cereal barley grain under Swedish farm conditions. Anim. Feed Sci. Technol. 2010, 156, 37–46. [Google Scholar] [CrossRef]
  3. Grube, M.; Schmid, F.; Berg, G. Black fungi and associated bacterial communities in the phyllosphere of grapevine. Fungal Biol. 2011, 115, 978–986. [Google Scholar] [CrossRef]
  4. Maciel, N.O.P.; Pilo, F.B.; Freitas, L.F.D.; Gomes, F.C.O.; Johann, S.; Nardi, R.M.D.; Lachance, M.A.; Rosa, C.A. The diversity and antifungal susceptibility of the yeasts isolated from coconut water and reconstituted fruit juices in Brazil. Int. J. Food Microbiol. 2013, 160, 201–205. [Google Scholar] [CrossRef] [PubMed]
  5. Göksungur, Y.; Uҫan, A.; Güvenҫ, U. Production of pullulan from beet molasses and synthetic medium by Aureobasidium pullulans. Yurk. J. Biol. 2004, 28, 23–30. [Google Scholar]
  6. Gostinčar, C.; Ohm, R.A.; Kogej, T.; Sonjak, S.; Turk, M.; Zajc, J.; Zalar, P.; Grube, M.; Sun, H.; Han, J.; et al. Genome sequencing of four Aureobasidium pullulans varieties: Biotechnological potential, stress tolerance, and description of new species. BMC Genomics 2014, 15, 549. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Chi, Z.; Wang, F.; Chi, Z.; Yue, L.; Liu, G.; Zhang, T. Bioproducts from Aureobasidium pullulans, a biotechnologically important yeast. Appl. Microbiol. Biotechnol. 2009, 82, 793–804. [Google Scholar] [CrossRef]
  8. Seo, H.P.; Son, C.W.; Chung, C.H.; Jung, D.I.; Kim, S.K.; Gross, R.A.; Kaplan, D.L.; Lee, J.W. Production of high molecular weight pullulan by Aureobasidium pullulans HP-2001 with soybean pomace as a nitrogen source. Bioresour. Technol. 2004, 95, 293–299. [Google Scholar] [CrossRef]
  9. Shingel, K.I. Current knowledge on biosynthesis, biological activity and chemical modification of the exopolysaccharide, pullulan. Carbohydr. Res. 2004, 339, 447–460. [Google Scholar] [CrossRef]
  10. Rehm, B.H.A. Microbial Production of Biopolymers and Polymers Precursors; Caister Academic Press: Poole, UK, 2009; p. 230. [Google Scholar] [CrossRef] [Green Version]
  11. Wei, X.; Liu, G.L.; Jia, S.L.; Chi, Z.; Hu, Z.; Chi, Z.M. Pullulan biosynthesis and its regulation in Aureobasidium spp. Carbohydr. Polym. 2021, 251, 117076. [Google Scholar] [CrossRef]
  12. Ravella, S.R.; Quiñones, T.S.; Retter, A.; Heiermann, M.; Amon, T.; Hobbs, P.J. Extracellular polysaccharide (EPS) production by a novel strain of yeast-like fungus Aureobasidium pullulans. Carbohydr. Polym. 2010, 82, 728–732. [Google Scholar] [CrossRef]
  13. Srikanth, S.; Swathi, M.; Tejaswini, M.; Sharmila, G.; Muthukumaran, C.; Jaganathan, M.K.; Tamilarasan, K. Statistical optimization of molasses based exopolysaccharide and biomass production by Aureobasidium pullulans MTCC 2195. Biocatal. Agric. Biotechnol. 2014, 3, 7–12. [Google Scholar] [CrossRef]
  14. Yang, J.; Zhang, Y.; Zhao, S.; Zhou, Q.; Xin, X.; Chen, L. Statistical optimization of medium for pullulan production by Aureobasidium pullulans NCPS2016 using fructose and soybean meal hydrolysates. Molecules 2018, 23, 1334. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Asghar, A.; Abdul Raman, A.A.; Daud, W.M.A.W. A comparison of central composite design and Taguchi method for optimizing Fenton process. Sci. World J. 2014, 2014, 869120. [Google Scholar] [CrossRef] [Green Version]
  16. El-Moslamy, S.H.; Elkady, M.F.; Rezk, A.H.; Abdel-Fattah, Y.R. Applying Taguchi design and large-scale strategy for mycosynthesis of nano-silver from endophytic Trichoderma harzianum SYA. F4 and its application against phytopathogens. Sci. Rep. 2017, 7, 45297. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Moussa, Z.; Ghoniem, A.A.; Elsayed, A.; Alotaibi, A.S.; Alenzi, A.M.; Hamed, S.E.; Elattar, K.M.; Saber, W.I. Innovative binary sorption of Cobalt (II) and methylene blue by Sargassum latifolium using Taguchi and hybrid artificial neural network paradigms. Sci. Rep. 2022, 12, 18291. [Google Scholar] [CrossRef]
  18. Kosev, V.; Vasileva, V.; Kusvuran, A. Orthogonal regressions of pea (Pisum L.) varieties. Turkish J. Field Crop. 2018, 23, 159–166. [Google Scholar] [CrossRef] [Green Version]
  19. Goodman, K.E.; Lessler, J.; Cosgrove, S.E.; Harris, A.D.; Lautenbach, E.; Han, J.H.; Milstone, A.M.; Massey, C.J.; Tamma, P.D. A clinical decision tree to predict whether a bacteremic patient is infected with an extended-spectrum β-lactamase–producing organism. Clin. Infect. Dis. 2016, 63, 896–903. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Ashino, K.; Sugano, K.; Amagasa, T.; Ying, B.W. Predicting the decision making chemicals used for bacterial growth. Sci. Rep. 2019, 9, 7251. [Google Scholar] [CrossRef] [PubMed]
  21. Rashad, E.M.; Shaheen, D.M.; Al-Askar, A.A.; Ghoneem, K.M.; Arishi, A.A.; Hassan, E.S.A.; Saber, W.I.A. Seed endophytic Achromobacter sp. F23KW as a promising growth promoter and biocontrol of Rhizoctonia root rot of fenugreek. Molecules 2022, 27, 5546. [Google Scholar] [CrossRef]
  22. Barnett, H.L.; Hunter, B.B. Illustrated Genera of Imperfect Fungi, 4th ed.; APS Press: St. Paul, MN, USA, 1998; p. 218. [Google Scholar]
  23. Domsch, K.H.; Gams, W.; Anderson, T. Compendium of Soil Fungi; Academic Press: London, UK, 1993; ISBN 3980308383. [Google Scholar]
  24. Hermanides-Nijhof, E.J. Aureobasidium and allied genera. Stud. Mycol. 1977, 15, 141–166. [Google Scholar]
  25. Saitou, N.; Nei, M. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 1987, 4, 406–425. [Google Scholar] [CrossRef] [PubMed]
  26. Felsenstein, J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 1985, 39, 783–791. [Google Scholar] [CrossRef]
  27. Tamura, K.; Stecher, G.; Kumar, S. MEGA 11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef]
  28. Singh, R.S.; Singh, H.; Saini, G.K. Response surface optimization of the critical medium components for pullulan production by Aureobasidium pullulans FB-1. App. Biochem. Biotechnol. 2009, 152, 42–53. [Google Scholar] [CrossRef]
  29. Zhang, H.Z.; Gao, F.P.; Liu, L.R.; Li, X.M.; Zhou, Z.M.; Yang, X.D.; Zhang, Q.Q. Pullulan acetate nanoparticles prepared by solvent diffusion method for epirubicin chemotherapy. Colloids Surf. B Biointerfaces 2009, 71, 19–26. [Google Scholar] [CrossRef]
  30. Thirumavalavan, K.; Manikkandan, T.R.; Dhanasekar, R. Pullulan production from coconut by-products by Aureobasidium pullulans. Afr. J. Biochem. 2009, 8, 254–258. [Google Scholar]
  31. Al-Askar, A.A.; Saber, W.I.A.; Ghoneem, K.M.; Hafez, E.E.; Ibrahim, A.A. Crude citric acid of Trichoderma asperellum: Tomato growth promotor and suppressor of Fusarium oxysporum f. sp. lycopersici. Plants 2021, 10, 222. [Google Scholar] [CrossRef]
  32. Inglis, P.W.; Tigano, M.S. Identification and taxonomy of some entomopathogenic Paecilomyces spp. (Ascomycota) isolates using rDNA-ITS sequences. Genet. Mol. Biol. 2006, 29, 132–136. [Google Scholar] [CrossRef] [Green Version]
  33. Raja, H.A.; Miller, A.N.; Pearce, C.J.; Oberlies, N.H. Fungal identification using molecular tools: A primer for the natural products research community. J. Nat. Prod. 2017, 80, 756–770. [Google Scholar] [CrossRef]
  34. Schoch, C.L.; Seifert, K.A.; Huhndorf, S.; Robert, V.; Spouge, J.L.; Levesque, C.A.; Chen, W. Fungal Barcoding Consortium. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. USA 2012, 109, 6241–6246. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Al-Askar, A.A.; Rashad, E.M.; Ghoneem, K.M.; Mostafa, A.A.; Al-Otibi, F.O.; Saber, W.I.A. Discovering Penicillium polinicum with high-lytic capacity on Helianthus tuberosus tubers: Oil-based preservation for mold management. Plants 2021, 10, 413. [Google Scholar] [CrossRef]
  36. Saber, W.I.; El-Naggar, N.E.; El-Hersh, M.S.; El-Khateeb, A.Y. An innovative synergism between Aspergillus oryzae and Azotobacter chroococcum for bioconversion of cellulosic biomass into organic acids under restricted nutritional conditions using multi-response surface optimization. Biotechnology 2015, 14, 47–57. [Google Scholar] [CrossRef] [Green Version]
  37. El-Hersh, M.S.; Saber, W.I.; El-Fadaly, H.A. Amino acids associated with optimized alkaline protease production by Bacillus subtilis ATCC 11774 using statistical approach. Biotechnology 2014, 13, 252–262. [Google Scholar] [CrossRef] [Green Version]
  38. El-Naggar, N.E.A.; El-Khateeb, A.Y.; Ghoniem, A.A.; El-Hersh, M.S.; Saber, W.I. Innovative low-cost biosorption process of Cr6+ by Pseudomonas alcaliphila NEWG-2. Sci. Rep. 2020, 10, 14043. [Google Scholar] [CrossRef]
  39. Chiou, A.H.; Wu, W.F.; Chen, D.Y.; Hsu, C.Y. The use of the grey-Taguchi method for the optimization of a silicon nanowires array synthesized using electroless Ag-assisted etching. J. Nanopart. Res. 2013, 15, 1866. [Google Scholar] [CrossRef]
  40. Zainal, N.A.; Shukor, S.R.A.; Razak, K.A. Applying the Taguchi method to optimise the size of silica nanoparticles entrapped with rifampicin for a drug delivery system. J. Eng. Sci. 2015, 11, 9–16. [Google Scholar]
  41. Machado, M.A.; de Oliveira, R.B.; Oliveira, H. Magnesium and sulfate uptake by the ectomycorrhizal fungus Hebeloma cylindrosporum. Mycorrhiza 2010, 20, 327–334. [Google Scholar]
  42. Kavanagh, K. Fungi: Biology and Applications; John Wiley & Sons: Chichester, UK, 2011; p. 267. [Google Scholar]
  43. Singh, R.S.; Saini, G.K.; Kennedy, J.F. Pullulan: Microbial sources, production and applications. Carbohydr. Polym. 2008, 73, 515–531. [Google Scholar] [CrossRef]
  44. Wang, D.H.; Ni, T.F.; Ju, X.M.; Wei, G.Y. Sodium chloride improves pullulan production by Aureobasidium pullulans but reduces the molecular weight of pullulan. Appl. Microbiol. Biotechnol. 2018, 102, 8921–8930. [Google Scholar] [CrossRef] [PubMed]
  45. Pandey, A.K.; Sirohi, R.; Gaurand, V.K.; Pandey, A. Production and applications of pullulan. In Biomass, Biofuels, Biochemicals; Binod, P., Raveendran, S., Pandey, A., Eds.; Elsevier: Amsterdam, The Netherlands, 2021; pp. 165–221. [Google Scholar] [CrossRef]
  46. Griffin, D.H. Fungal Physiology, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1996; p. 480. [Google Scholar] [CrossRef]
  47. Walker, G.M.; White, N.A. Introduction to fungal physiology. In Fungi Biology and Applications; Wiley: Hoboken, NJ, USA, 2017; pp. 1–35. [Google Scholar]
  48. Arslan, U. Evaluation of antifungal activity of mono and dipotassium phosphates against phytopathogenic fungi. Fresenius Environ. Bull. 2015, 24, 810–816. Available online: http://www.psp-parlar.de (accessed on 11 March 2022).
  49. Singh, R.S.; Saini, G.K.; Kennedy, J.F. Downstream processing and characterization of pullulan from a novel colour variant strain of Aureobasidium pullulans FB-1. Carbohydr. Polym. 2009, 78, 89–94. [Google Scholar] [CrossRef]
  50. Saber-Samandari, S.; Gulcan, H.O.; Saber-Samandari, S.; Gazi, M. Efficient removal of anionic and cationic dyes from an aqueous solution using pullulan-graft-polyacrylamide porous hydrogel. Water Air Soil Pollut. 2014, 225, 2177. [Google Scholar] [CrossRef]
  51. Cheng, K.C.; Demirci, A.; Catchmark, J.M. Effects of plastic composite support and pH profiles on pullulan production in a biofilm reactor. Appl. Microbiol. Biotechnol. 2010, 86, 853–861. [Google Scholar] [CrossRef]
  52. Tao, X.; Xie, Y.; Zhang, Q.; Qiu, X.; Yuan, L.; Wen, Y.; Li, M.; Yang, X.; Tao, T.; Xie, M.; et al. Cholesterol-modified amino-pullulan nanoparticles as a drug carrier: Comparative study of cholesterol-modified carboxyethyl pullulan and pullulan nanoparticles. Nanomaterials 2016, 6, 165. [Google Scholar] [CrossRef]
  53. Trovatti, E.; Fernandes, S.C.M.; Rubatat, L.; Freire, C.S.R.; Silvestre, A.J.D.; Neto, C.P. Sustainable nanocomposite films based on bacterial cellulose and pullulan. Cellulose 2012, 19, 729–737. [Google Scholar] [CrossRef]
  54. McLntyre, D.D.; Calgary, H.J.V. Structural Studies of Pullulan by Nuclear Magnetic Resonance Spectroscopy. Starch/Stärke 1993, 45, 406–410. [Google Scholar] [CrossRef]
  55. Sugumaran, K.R.; Gowthami, E.; Swathi, B.; Elakkiya, S.; Srivastava, S.N.; Ravikumar, R.; Gowdhaman, D.; Ponnusami, V. Production of pullulan by Aureobasidium pullulans from Asian palm kernel: A novel substrate. Carbohydr. Polym. 2013, 92, 697–703. [Google Scholar] [CrossRef] [PubMed]
  56. Wani, S.M.; Mir, S.A.; Khanday, F.A.; Masoodi, F.A. Advances in pullulan production from agro-based wastes by Aureobasidium pullulans and its applications. Innov. Food Sci. Emerg. Technol. 2021, 74, 102846. [Google Scholar] [CrossRef]
Figure 1. Macro- and microphotographs of A. pullulans AKW: the colony on PSA after 7 days at 30 °C (A), microscopic images of melanized hyphae with melanized conidia at 400× (B,C), conidiogenesis at 1000× ((D,E), arrowed), hyaline and melanized blastoconidia (F), and dark brown chlamydoconidia and chains of arthroconidia at 1000× ((G,H), arrowed).
Figure 1. Macro- and microphotographs of A. pullulans AKW: the colony on PSA after 7 days at 30 °C (A), microscopic images of melanized hyphae with melanized conidia at 400× (B,C), conidiogenesis at 1000× ((D,E), arrowed), hyaline and melanized blastoconidia (F), and dark brown chlamydoconidia and chains of arthroconidia at 1000× ((G,H), arrowed).
Polymers 15 01419 g001
Figure 2. Agarose gel electrophoresis of the amplified bands of PCR product (A) and molecular phylogenetic tree based on partial ITS sequence (B) of A. pullulans AKW (OP924554), marked by a red dot.
Figure 2. Agarose gel electrophoresis of the amplified bands of PCR product (A) and molecular phylogenetic tree based on partial ITS sequence (B) of A. pullulans AKW (OP924554), marked by a red dot.
Polymers 15 01419 g002
Figure 3. The plot of the S/N ratio effect based on larger-is-better for pullulan biosynthesis by A. pullulans.
Figure 3. The plot of the S/N ratio effect based on larger-is-better for pullulan biosynthesis by A. pullulans.
Polymers 15 01419 g003
Figure 4. Residual analysis of pullulan production by A. pullulans based on the Taguchi model.
Figure 4. Residual analysis of pullulan production by A. pullulans based on the Taguchi model.
Polymers 15 01419 g004
Figure 5. The R2 vis number of terminal nodes plot generated by a decision tree for pullulan production by A. pullulans.
Figure 5. The R2 vis number of terminal nodes plot generated by a decision tree for pullulan production by A. pullulans.
Polymers 15 01419 g005
Figure 6. The decision tree with a depth of four of the pullulan biosynthesis using A. pullulans. A total of 81 data sets from Taguchi’s experimental design were employed in the decision tree learning algorithm to predict pullulan yield. The data used for node selection are introduced in the squares. The squares involved in the paths of the best variable mixtures for pullulan biosynthesis were predicted with the decision tree.
Figure 6. The decision tree with a depth of four of the pullulan biosynthesis using A. pullulans. A total of 81 data sets from Taguchi’s experimental design were employed in the decision tree learning algorithm to predict pullulan yield. The data used for node selection are introduced in the squares. The squares involved in the paths of the best variable mixtures for pullulan biosynthesis were predicted with the decision tree.
Polymers 15 01419 g006
Figure 7. The relative importance of the seven tested factors on pullulan production based on the decision tree learning algorithm.
Figure 7. The relative importance of the seven tested factors on pullulan production based on the decision tree learning algorithm.
Polymers 15 01419 g007
Figure 8. The FT-IR chart of A. pullulans AKW pullulan with a frequency range from 4000 to 350 cm−1.
Figure 8. The FT-IR chart of A. pullulans AKW pullulan with a frequency range from 4000 to 350 cm−1.
Polymers 15 01419 g008
Figure 9. The 1H-NMR (DMSO-d6/D2O) spectrum of the A. pullulans AKW pullulan.
Figure 9. The 1H-NMR (DMSO-d6/D2O) spectrum of the A. pullulans AKW pullulan.
Polymers 15 01419 g009
Figure 10. The magnification of the aliphatic region in the 1H-NMR (DMSO-d6/D2O) spectrum of the A. pullulans AKW pullulan.
Figure 10. The magnification of the aliphatic region in the 1H-NMR (DMSO-d6/D2O) spectrum of the A. pullulans AKW pullulan.
Polymers 15 01419 g010
Table 1. Orthogonal array and predicted values of Taguchi’s L27 (37) for optimization of pullulan biosynthesis by A. pullulans, as well as the predicted and residual values and terminal nodes of the decision tree learning algorithm at each data point. The experiment was repeated in three separate runs denoted by different colors.
Table 1. Orthogonal array and predicted values of Taguchi’s L27 (37) for optimization of pullulan biosynthesis by A. pullulans, as well as the predicted and residual values and terminal nodes of the decision tree learning algorithm at each data point. The experiment was repeated in three separate runs denoted by different colors.
RunTested VariablePullulan Biosynthesis (%)
Nutritional Component (g/L)pHIncubation
(h)
TaguchiDecision Tree
SucroseK2HPO4NaClMgSO4Yeast
Extract
MeanS/N RatioFittedTypeFittedResidualTerminal
Node
L14040.50.115.5483.81 ± 0.2311.593.70Test2.951.041
L24040.50.126.5723.12 ± 0.109.863.12Test2.950.161
L34040.50.137.5962.32 ± 0.017.302.43Training2.95−0.621
L44051.00.215.5485.37 ± 0.0714.605.53Test5.240.194
L54051.00.226.5724.94 ± 0.0513.884.94Training5.24−0.254
L64051.00.237.5964.41 ± 0.1012.884.26Test4.65−0.195
L74061.50.315.5487.23 ± 0.0917.187.19Training6.980.357
L84061.50.326.5726.60 ± 0.1716.396.61Test6.98−0.577
L94061.50.337.5965.88 ± 0.0915.395.92Training5.760.159
L105041.00.316.5965.43 ± 0.1114.695.50Training5.76−0.269
L115041.00.327.5485.84 ± 0.0815.335.83Training6.01−0.268
L125041.00.335.5726.19 ± 0.0515.846.14Test6.010.138
L135051.50.116.5964.74 ± 0.0613.514.63Test4.76−0.092
L145051.50.127.5484.91 ± 0.0713.814.96Training5.13−0.213
L155051.50.135.5725.22 ± 0.0414.355.27Training5.130.123
L165060.50.216.5964.83 ± 0.0613.674.87Training4.650.235
L175060.50.227.5485.24 ± 0.0514.395.20Training5.240.044
L185060.50.235.5725.51 ± 0.0914.825.51Training5.240.194
L196041.50.217.5725.52 ± 0.0514.845.42Training5.80−0.316
L206041.50.225.5965.54 ± 0.0614.865.67Test5.80−0.326
L216041.50.236.5486.18 ± 0.1215.816.15Training5.800.326
L226050.50.317.5725.53 ± 0.1414.855.56Test6.01−0.638
L236050.50.325.5965.80 ± 0.1215.275.81Test5.76−0.069
L246050.50.336.5486.32 ± 0.1016.016.28Test6.010.298
L256061.00.117.5724.72 ± 0.0813.484.81Training4.76−0.062
L266061.00.125.5965.21 ± 0.1014.345.07Test5.130.073
L276061.00.136.5485.48 ± 0.1514.765.54Training5.130.313
L14040.50.115.548 Training2.950.951
L24040.50.126.572 Training2.950.271
L34040.50.137.596 Training2.95−0.641
L44051.00.215.548 Test5.240.144
L54051.00.226.572 Test5.24−0.354
L64051.00.237.596 Training4.65−0.185
L74061.50.315.548 Test6.980.197
L84061.50.326.572 Training6.98−0.337
L94061.50.337.596 Training5.760.029
L105041.00.316.596 Training5.76−0.289
L115041.00.327.548 Test6.01−0.128
L125041.00.335.572 Training6.010.198
L135051.50.116.596 Training4.76−0.012
L145051.50.127.548 Training5.13−0.303
L155051.50.135.572 Test5.130.113
L165060.50.216.596 Training4.650.185
L175060.50.227.548 Training5.24−0.054
L185060.50.235.572 Test5.240.264
L196041.50.217.572 Training5.80−0.306
L206041.50.225.596 Training5.80−0.276
L216041.50.236.548 Training5.800.306
L226050.50.317.572 Training6.01−0.448
L236050.50.325.596 Training5.760.179
L246050.50.336.548 Training6.010.228
L256061.00.117.572 Training4.760.052
L266061.00.125.596 Training5.130.193
L276061.00.136.548 Test5.130.513
L14040.50.115.548 Test2.950.601
L24040.50.126.572 Training2.950.061
L34040.50.137.596 Test2.95−0.641
L44051.00.215.548 Test5.240.064
L54051.00.226.572 Training5.24−0.304
L64051.00.237.596 Training4.65−0.355
L74061.50.315.548 Training6.980.217
L84061.50.326.572 Training6.98−0.237
L94061.50.337.596 Training5.760.199
L105041.00.316.596 Test5.76−0.469
L115041.00.327.548 Training6.01−0.128
L125041.00.335.572 Test6.010.238
L135051.50.116.596 Training4.760.032
L145051.50.127.548 Training5.13−0.163
L155051.50.135.572 Training5.130.043
L165060.50.216.596 Training4.650.125
L175060.50.227.548 Training5.240.024
L185060.50.235.572 Training5.240.364
L196041.50.217.572 Training5.80−0.226
L206041.50.225.596 Test5.80−0.206
L216041.50.236.548 Training5.800.516
L226050.50.317.572 Test6.01−0.368
L236050.50.325.596 Training5.760.029
L246050.50.336.548 Training6.010.418
L256061.00.117.572 Test4.76−0.102
L266061.00.125.596 Training5.13−0.013
L276061.00.136.548 Test5.130.223
Table 2. ANOVA of means of pullulan biosynthesis by A. pullulans as affected by the seven investigated factors.
Table 2. ANOVA of means of pullulan biosynthesis by A. pullulans as affected by the seven investigated factors.
SourceFreedom DegreeSum of SquareMean SquareF-Valuep-Value
Sucrose22.501.2587.620.000
K2HPO422.531.2788.940.000
NaCl24.862.43170.440.000
MgSO4213.046.52457.600.000
Yeast extract20.010.000.240.793
pH21.700.8559.830.000
Incubation time22.151.0875.470.000
Residual Error120.170.01
Total2626.96
The determination coefficient (R2) 0.9937
Adjusted-R2 0.9863
Table 3. Response analysis of S/N ratio and means of Taguchi array of pullulan biosynthesis by A. pullulans.
Table 3. Response analysis of S/N ratio and means of Taguchi array of pullulan biosynthesis by A. pullulans.
S/N Ratio Response Analysis
LevelSucrose K2HPO4 NaCl MgSO4Yeast Extract Initial pHIncubation Time
113.2313.3513.0912.5614.2714.7614.83
214.4914.3514.4214.4214.2414.2914.26
314.9214.9415.1315.6614.1313.5913.55
Delta1.691.592.043.110.141.171.29
Rank3421765
MeanResponse Analysis
14.8544.8834.7204.3915.2435.5435.597
25.3245.2495.2895.2825.2455.2925.263
35.5895.6345.7576.0935.2784.9314.906
Delta0.7360.7501.0371.7020.0350.6120.691
Rank4321765
Table 4. The predicted fermentation conditions were estimated based on the Taguchi and DT models as well as the predicted and actual values of pullulan biosynthesis by A. pullulans.
Table 4. The predicted fermentation conditions were estimated based on the Taguchi and DT models as well as the predicted and actual values of pullulan biosynthesis by A. pullulans.
ModelNutritional Component (g/L)pHIncubation
(h)
Pullulan, %S/N
Ratio
Terminal
Node
SucroseK2HPO4NaClMgSO4Yeast
Extract
PredictedActual
Taguchi6041.50.315.5487.177.23 ± 0.1317.64-
Decision tree 4041.50.315.5486.987.11 ± 0.07-7
Table 5. The FT-IR spectroscopic data of A. pullulans AKW pullulan.
Table 5. The FT-IR spectroscopic data of A. pullulans AKW pullulan.
Absorption
(cm−1)
AppearanceFunctional GroupCompound ClassRelative Intensity
3309sharpO-H stretchingAlcohol0.250
2924mediumC-H stretchingAlkane 0.066
1642weakO-C-O stretchingAliphatic ether0.073
1417mediumC-H bendingAlkane0.001
1360mediumC-O-H Aliphatic ether0.097
1148strongC-O-C stretchingAliphatic ether0.047
1077strongC-O stretchingAlcohol0.047
995strongC-O stretchingprimary alcohol0.034
930strongC-H bendingtrisubstituted0.034
755strongC-H bendingdisubstituted0.012
706strongC-H bendingdisubstituted0.012
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saber, W.I.A.; Al-Askar, A.A.; Ghoneem, K.M. Exclusive Biosynthesis of Pullulan Using Taguchi’s Approach and Decision Tree Learning Algorithm by a Novel Endophytic Aureobasidium pullulans Strain. Polymers 2023, 15, 1419. https://doi.org/10.3390/polym15061419

AMA Style

Saber WIA, Al-Askar AA, Ghoneem KM. Exclusive Biosynthesis of Pullulan Using Taguchi’s Approach and Decision Tree Learning Algorithm by a Novel Endophytic Aureobasidium pullulans Strain. Polymers. 2023; 15(6):1419. https://doi.org/10.3390/polym15061419

Chicago/Turabian Style

Saber, WesamEldin I. A., Abdulaziz A. Al-Askar, and Khalid M. Ghoneem. 2023. "Exclusive Biosynthesis of Pullulan Using Taguchi’s Approach and Decision Tree Learning Algorithm by a Novel Endophytic Aureobasidium pullulans Strain" Polymers 15, no. 6: 1419. https://doi.org/10.3390/polym15061419

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop