An Artificial Neural Network Assisted Dynamic Light Scattering Procedure for Assessing Living Cells Size in Suspension

Chicea, Dan

doi:10.3390/s20123425

Research Center for Complex Physical Systems, Faculty of Sciences, “Lucian Blaga” University of Sibiu, Dr. Ion Ratiu str. no. 5–7, 550012 Sibiu, Romania

Sensors 2020, 20(12), 3425; https://doi.org/10.3390/s20123425

Submission received: 17 May 2020 / Revised: 15 June 2020 / Accepted: 16 June 2020 / Published: 17 June 2020

(This article belongs to the Special Issue Sensors for Food Safety and Quality 2019-2020)

Abstract

Dynamic light scattering (DLS) is an essential technique used for assessing the size of the particles in suspension, covering the range from nanometers to microns. Although it has been very well established for quite some time, improvement can still be brought in simplifying the experimental setup and in employing an easier to use data processing procedure for the acquired time-series. A DLS time series processing procedure based on an artificial neural network is presented with details regarding the design, training procedure and error analysis, working over an extended particle size range. The procedure proved to be much faster regarding time-series processing and easier to use than fitting a function to the experimental data using a minimization algorithm. Results of monitoring the long-time variation of the size of the Saccharomyces cerevisiae during fermentation are presented, including the 10 h between dissolving from the solid form and the start of multiplication, as an application of the proposed procedure. The results indicate that the procedure can be used to identify the presence of bigger particles and to assess their size, in aqueous suspensions used in the food industry.

Keywords:

artificial neural network; dynamic light scattering; simulated time-series; fermentation; Saccharomyces cerevisiae

1. Introduction

When an incident light beam is directed at a fluid containing suspended particles, each particle scatters light and therefore becomes a secondary light source, as elastic or, more precisely, quasi-elastic scattering occurs. The particles become scattering centers (SC). If the light source is coherent, the scattered waves are coherent as well and therefore interfere in the far field. The particles in suspension undergo a complex motion, which confers a dynamic character to the far interference field, giving it the aspect of “boiling speckles”. Several articles have investigated the variation of physical parameters, such as the average intensity, speckle size and speckle contrast, with the size and the concentration of the SCs; references [1,2,3] are examples. Using the variation of these parameters for particle sizing in dynamic processes, where both the SC diameter and the number of SCs can change in time, is not a good choice [3]. The physical method that uses the dependence between the speckle dynamics and the particle diffusion caused by Brownian motion is called dynamic light scattering (DLS), traditionally known as photon correlation spectroscopy (PCS). The physical explanation of the method can be found in many works, such as [4,5,6,7,8], to name just a few.

DLS is widely used to analyze the size and size distributions of nanoparticles, colloids and proteins in the suspension of various solvents [9]. The DLS technique has been shown to be appealing for biomedical applications [10], such as studying homogeneity of proteins, ribonucleic acid (RNA), and their complexes as it has certain advantages over other experimental methods. Some other methods that are used in the type of investigations mentioned above include analytical ultra-centrifugation, which requires a bigger amount of sample; small angle X-ray scattering (SAXS), which requires a longer acquisition time, in the order of hours in lack of a synchrotron; and static light scattering, which again requires time averaging to compensate for the dynamic light scattering effect. The DLS technique can be used for investigating suspensions in a wide range of sample buffers, over a relatively big range of temperatures and concentrations, as well. Moreover, DLS requires very small sample amounts, even of the order of 10 μL [10] and has the advantage of providing absolute rather than relative results; therefore, it does not require calibration.

The DLS technique has been established for quite some time [4]. In the early days of DLS, photomultiplier tubes were used as detectors, as they have a fast response with good amplification [7]. Later on, photomultiplier tubes were replaced with avalanche photodiodes [7]. P-I-N diodes represent a step forward [9]. Autocorrelators, which contain the hardware to compute the autocorrelation function of the DLS time-series, are another typical part of the DLS setup. A laser diode can be used as a coherent light source, and a data acquisition system can be used to record the DLS time-series on a computer, where it can be processed later. Examples of very simple experimental setups can be found in many papers, [11,12] being just some of them.

The DLS technique can still be improved by making it faster regarding data processing. The artificial neural networks (ANNs hereafter) concept is quite well established and described comprehensively in many works like [13,14], to mention just a few of them. The ANN is computer code that imitates the structure of a brain in its functioning. The code describes artificial neurons and the connections between them, by means of transfer functions and biases. ANNs appear as an appealing alternative to numerical minimization of functions in DLS time-series processing, because they can be much faster, as signal processing in an ANN is done by matrix multiplication and addition, rather than many function and numerical gradient evaluations.

ANNs have been used for data processing in optics and several papers reporting it are presented further on. Reference [15] describes an ANN for detecting amino acids and several solid organic compounds. The work in [16] explains how a trained ANN was successfully used in assessing the size and the refractive index. The paper cited as [17] describes how an ANN was used in measuring the radius of spherical particles. Reference [18] explains how the authors used an ANN trained for pattern recognition to detect the presence of hazardous fibers in the air. Reference [19] reports on using a circular ANN for measuring the particle size and refractive index.

ANNs were successfully used in processing DLS time-series as well. The work reported in [20] used an averaged scattered light intensity frequency spectrum as input for a three artificial neuron layer ANN that produced the average diameter of the suspended particles, in a narrow size range of up to 350 nm. Reference [21] reports a direct continuation of the work in [20] and describes using the autocorrelation of the recorded DLS time-series as input in a three artificial neuron layer ANN, which has the average diameter of the suspended particles as output. The range of particle size was extended up to 1200 nm. Both ANNs reported in [20,21] proved to be several thousand times faster than fitting either the Lorentzian line to the frequency spectrum [20] or the autocorrelation [21] with very small relative errors as compared to the reference method of fitting functions.

Another example of using an ANN in processing Rayleigh light scattering data for protein detection is reported in [22], with a 3-5-1 neuron structure. The work published in [23] reports on real-time determination of the total concentration of various oils and mixtures in water using ANNs. A recent work [24] uses the same input as [20], which is the averaged frequency spectrum of the DLS time signal, and processes it using a 5 layer ANN to produce the particle size distribution over a small particle size range of 1–500 nm, although with big errors for small particles.

The work reported in this paper is a continuation of the previous work in [20,21] done to extend the size range of sizing particles up to 6000 nm with very good precision.

An application of the ANN assisted DLS to detecting the presence of bigger particles in an aqueous suspension is presented as well, as a proof of concept. Such particles, which are big compared with the solvent molecules (water in this case) or with the molecules of glucose or fructose frequently present in fermentation processes, can be bacteria with sizes starting from 0.2–0.4 μm [25], with a common size of several microns and reaching sizes as big as 500 and 750 μm [26]. Yeast cells can also be similarly large particles in suspension. While the presence of big particles, like several micron sized bacteria and yeast cells, can be confirmed by optical microscope examination of the suspension sampled from the fermentation environment and stretched as a thin film on a glass slab, the presence of particles smaller than one micron cannot be made evident with a conventional microscope, as their size is comparable to or smaller than the wavelength of visible light. Under these circumstances, using a technique that is robust and that does not require calibration would be preferable. DLS appears to be suited for such measurements and ANN assisted DLS appears to be a good choice, as data processing is less computation intensive and easier to use, as highlighted in [20,21].

Yeasts are eukaryotic microorganisms and they are classified as fungi. Reference [27] describes approximately 1500 species of yeasts. According to archaeological evidence, yeasts have been widely used for alcohol related brewing and fermentation [28,29,30]. The main fermenting group is the Saccharomyces complex [31]. It contains one of the most important species for the food industry, S. cerevisiae, which is the agent used in wine, bread, beer, and sake fermentation. New technologies and new outcomes in bio-engineering and genetics placed S. cerevisiae as a model for eukaryotic biology [32]. S. cerevisiae became a valuable alternative for diverse chemical production [33], for functional foods, pharmaceuticals, and biofuels [34,35]. As Saccharomyces cerevisiae usage in fermentation is so widespread, the proposed ANN assisted DLS time series processing procedure was tested on fermentation produced by this type of yeast, to detect the presence of big cells in suspension and to assess the variation of the cell size as fermentation carries on.

The next sections of this paper will present the sample preparation procedure, the procedure for generating the set for training the ANN, the results of the ANN on simulated data, and the results of the yeast cells size variation in time during a fermentation process, indicating that the ANN assisted DLS can be used as an almost real-time procedure for detecting the presence of big particles in suspension, as bacteria or yeast, and for monitoring fermentation by assessing the average size of the suspended yeast cell during the fermentation process.

2. Materials and Methods

2.1. Diluted Yeast Suspension

First, 1 g of Saccharomyces cerevisiae in fresh, solid form was added to 10 mL of water and allowed to dissolve, that is, to have the individual cells separate from each other, forming a nontransparent suspension.

While dissolving carried on, a concentrated table sugar syrup was prepared. Sucrose is the common table sugar; it is a disaccharide, a molecule composed of two monosaccharides: glucose and fructose. Sucrose has the molecular formula C₁₂H₂₂O₁₁ [36]. The sucrose syrup concentration was 15% weight in water. A volume of 0.2 cm³ of yeast suspension was added to 3.5 cm³ of syrup in a cuvette that was the target of a laser beam. The cell concentration was chosen in this range after several trials, to ensure the transparency of the suspension.

Time series were recorded at equal time intervals during an experiment. The temperature was maintained constant at 22 °C during the experiment. The time series were processed using the ANN assisted dynamic light scattering procedure and provided the average diameter of the particles that scattered light, as will be presented further on.

2.2. The Reference DLS Procedure

The DLS technique has been developed for many decades [4,5,6,7,8], and is quite well established, therefore the theory behind it is not repeated here, but the experimental setup and the main steps in assessing the average particle size using DLS are presented briefly in the following paragraphs.

The experimental setup is presented in Figure 1 and consists of a laser source (a He-Ne laser, for the work reported here) with a power of 10 mW, working in a continuous regime at a wavelength of 633 nm. A 5 mL circular glass tube was used as the sample container. A detector, a preamplifier with a linear response in the audio frequency range, and a simple, single processor computer were used for recording the time-series. The scattering angle was chosen to be 90°, which is typical for DLS experiments [4,5,7,8,16].

The speckle size was measured using a Philips CCD with the optical system removed, knowing the size of the pixel on the conversion matrix. The average speckle size was assessed following the procedure presented in [3]. The cuvette-detector distance was adjusted to 11 cm, in such a manner that the diameter of the detector equaled the average speckle size.

As the incident light wave is coherent, the scattered waves are coherent as well, therefore they will interfere. What is actually measured using a detector is the intensity, not the electric field of the light wave at the detector location. The detector converts the interference field intensity to an electric signal with the voltage proportional to the intensity, and the data acquisition system (DAS hereafter) converts it to an integer number in a range covered by 16 bits, which is the DAS resolution used in this work. The data acquisition rate was 16 kHz, which was high enough for the particle size range covered in this work.

The DAS produces a sequence of values recorded at equal time intervals Δt, as in Equation (1) and such a succession is called a time-series:

I (0), I (Δ t), I (2 Δ t), I (3 Δ t) \dots

(1)

In the general case, if the suspension contains m different types of particles, having the diffusion coefficients D_1, D_2, …, D_m, the intensity autocorrelation (ACR hereafter) G(τ) has the form [9,19]:

G (τ) = \sum_{i = 1}^{m} A_{i} e^{- 2 D_{i} q^{2} τ} + B

(2)

where the amplitude factors A_i are proportional to the contribution of each particle group, q is the modulus of the scattering vector [7] detailed in Equation (3), and B is a constant.

q = \frac{4 π n}{λ} \sin (\frac{θ}{2})

(3)

In Equation (3) n is the refractive index of the solvent, λ is the wavelength of the incident coherent light in a vacuum, and θ is the scattering angle; the values of the last variables being mentioned at the beginning of this Section.
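As a quick numerical illustration of Equation (3), the sketch below evaluates q for the 633 nm He-Ne line and the 90° scattering angle given in the text. The refractive index n = 1.33 (pure water) is an assumption made only for this example; the sucrose syrup used in the experiment would have a somewhat different value. The function name is hypothetical.

```python
import math

def scattering_vector(n, wavelength_m, theta_deg):
    """Modulus of the scattering vector q from Equation (3), in 1/m."""
    theta = math.radians(theta_deg)
    return 4.0 * math.pi * n / wavelength_m * math.sin(theta / 2.0)

# n = 1.33 is an assumed value (pure water); wavelength and angle are from the text.
q = scattering_vector(n=1.33, wavelength_m=633e-9, theta_deg=90.0)
print(f"q = {q:.3e} 1/m")  # roughly 1.87e7 1/m
```

Under these assumptions q is of the order of 10⁷ m⁻¹, which sets the decay rate 2Dq² of the ACR in the equations that follow.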

If the suspended particles can be considered a single size group, or if they have a relatively narrow distribution and can be considered as a single group, the ACR plot is very close to a simple exponentially decreasing curve. In this case the ACR in Equation (2) can be described by a simplified form, as in Equation (4):

G (τ) = A e^{- 2 D q^{2} τ} + B

(4)

The ACR can be further normalized in two small steps. First, constant B is subtracted. The second step involves an experimental adjustment in such a manner that the detector will cover a single speckle, as described in detail in [18,19], making the spatial coherence factor included in A for this particular case, equal to 1. With this two-step normalization, for a suspension containing one group of particles with a relatively narrow particle size distribution, as stated above, the intensity ACR has a simple form depending on a single parameter, which is D, the diffusion coefficient:

G (τ) = e^{- 2 D q^{2} τ}

(5)

In its turn, the diffusion coefficient D is related to k_B, which is Boltzmann’s constant, to η, the dynamic viscosity coefficient of the solvent, T the absolute temperature of the sample, and the hydrodynamic diameter of the particle, d, as described by the Einstein–Stokes relation [37] (6):

D = \frac{k_{B} T}{3 π η d}

(6)

This procedure was used as the reference for determining the average diameter of the SCs by fitting Equation (5) with D substituted from Equation (6) to each of the generated time-series that were used for training the ANN, as described in the next Section.
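The reference fit can be illustrated with a minimal sketch: Equation (5), with D substituted from Equation (6), has the diameter d as its only free parameter, so a one-dimensional search is enough. The code below uses a golden-section search on log(d), which is a stand-in chosen for this example and not necessarily the minimization algorithm used in the paper. The viscosity (0.955 mPa·s, pure water near 22 °C) and the value of q (computed with an assumed n = 1.33) are assumptions, and all function names are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K

def diffusion_coefficient(d, temperature, viscosity):
    """Equation (6): D = k_B T / (3 pi eta d)."""
    return K_B * temperature / (3.0 * math.pi * viscosity * d)

def acr_model(tau, d, q, temperature, viscosity):
    """Equation (5): normalized ACR for one particle size."""
    D = diffusion_coefficient(d, temperature, viscosity)
    return math.exp(-2.0 * D * q * q * tau)

def fit_diameter(taus, acr, q, temperature, viscosity,
                 d_lo=25e-9, d_hi=6000e-9, iters=80):
    """Least-squares fit of d by golden-section search on log(d)."""
    def cost(log_d):
        d = math.exp(log_d)
        return sum((acr_model(t, d, q, temperature, viscosity) - g) ** 2
                   for t, g in zip(taus, acr))
    a, b = math.log(d_lo), math.log(d_hi)
    phi = (math.sqrt(5.0) - 1.0) / 2.0
    c, e = b - phi * (b - a), a + phi * (b - a)
    fc, fe = cost(c), cost(e)
    for _ in range(iters):
        if fc < fe:               # minimum lies in [a, e]
            b, e, fe = e, c, fc
            c = b - phi * (b - a)
            fc = cost(c)
        else:                     # minimum lies in [c, b]
            a, c, fc = c, e, fe
            e = a + phi * (b - a)
            fe = cost(e)
    return math.exp((a + b) / 2.0)

q = 1.867e7          # 1/m, Equation (3) with assumed n = 1.33
T = 295.15           # K, i.e. 22 degrees C as in the text
eta = 0.955e-3       # Pa*s, assumed viscosity of pure water
dt = 1.0 / 16000.0   # s, sampling interval at 16 kHz
taus = [k * dt for k in range(350)]
true_d = 1000e-9     # hypothetical test diameter
acr = [acr_model(t, true_d, q, T, eta) for t in taus]
d_fit = fit_diameter(taus, acr, q, T, eta)
print(f"fitted diameter: {d_fit * 1e9:.2f} nm")
```

On a noise-free synthetic ACR the search recovers the diameter used to generate it; on measured data the residual would not vanish and the result depends on how well Equation (5) describes the sample.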

2.3. The ANN Assisted DLS Time-Series Processing Procedure

A brief description of the procedure follows, highlighting the differences from, and the improvements over, the previous version presented in [21].

Training the ANN was performed with ACR of the simulated time-series as inputs and the corresponding diameters, computed using a nonlinear minimization procedure of Equation (5), as described at the end of Section 2.2, as targets. Each time-series was computed as the sum of harmonic functions, as in Equation (7):

x (t) = \sum_{i = 1}^{Nf} A (f_{i}) \cdot \sin (2 π f_{i} t + φ_{i})

(7)

In Equation (7), A(f_i) is the amplitude of the i-th component, f_i is the frequency of the i-th component, φ_i is the initial phase of the i-th component, t is the time when we compute that particular data in the DLS time-series and N_f is the number of frequencies used in generating the time-series. A(f_i), the amplitude of the i-th harmonic component, is computed with Equation (8), the Lorentzian line, which describes the frequency spectrum of the intensity of the scattered light, while the φ_i initial phases were generated using random numbers with uniform distribution in (0, 2π). The frequencies of all the harmonic components f_i were generated equally spaced in the interval (0, f_s/2), as pointed out in [21], where f_s is the sampling frequency of the DAS.

S (f) = a_{0} \frac{a_{1}}{{(2 π f)}^{2} + a_{1}^{2}}

(8)

The a₀ parameter is selected to be a fixed value, of the order of tens, the same for all series, while a₁ is calculated from Equation (9) [11,12], with q calculated using Equation (3) for each particle diameter d. A(f_i) was selected to be the square root of S(f_i) computed with Equation (8) [4,7,11,12].

d = \frac{2 k_{B} T q^{2}}{3 π η a_{1}}

(9)

where q is computed with Equation (3).
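Equations (7)–(9) can be put together in a short sketch of the time-series generator. It is a scaled-down illustration under stated assumptions: it uses 512 points and 257 frequencies instead of the 32,768 points and 16,385 frequencies reported in the text, takes a₀ = 10 as one possible value "of the order of tens", and assumes n = 1.33 and the viscosity of pure water. The function names are hypothetical.

```python
import math
import random

K_B = 1.380649e-23  # Boltzmann constant, J/K

def a1_from_diameter(d, q, temperature, viscosity):
    """Equation (9) solved for a1."""
    return 2.0 * K_B * temperature * q ** 2 / (3.0 * math.pi * viscosity * d)

def lorentzian(f, a0, a1):
    """Equation (8): S(f) = a0 * a1 / ((2 pi f)^2 + a1^2)."""
    return a0 * a1 / ((2.0 * math.pi * f) ** 2 + a1 ** 2)

def simulate_series(a1, fs, n_points, n_freq, a0=10.0, seed=0):
    """Equation (7): sum of sines with amplitudes sqrt(S(f_i)), random phases."""
    rng = random.Random(seed)
    # n_freq frequencies equally spaced strictly inside (0, fs/2)
    freqs = [(i + 1) * (fs / 2.0) / (n_freq + 1) for i in range(n_freq)]
    amps = [math.sqrt(lorentzian(f, a0, a1)) for f in freqs]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in freqs]
    series = []
    for k in range(n_points):
        t = k / fs
        series.append(sum(a * math.sin(2.0 * math.pi * f * t + p)
                          for a, f, p in zip(amps, freqs, phases)))
    return series, amps

# Equation (3) with an assumed refractive index n = 1.33
q = 4.0 * math.pi * 1.33 / 633e-9 * math.sin(math.radians(90.0) / 2.0)
# Hypothetical 1000 nm particles in water at 22 degrees C (assumed viscosity)
a1 = a1_from_diameter(d=1000e-9, q=q, temperature=295.15, viscosity=0.955e-3)
x, amps = simulate_series(a1, fs=16000.0, n_points=512, n_freq=257)
print(len(x), f"a1 = {a1:.1f} 1/s")
```

A pure-Python double loop like this is slow at the full size quoted in the text; a vectorized implementation would be the natural choice there.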

Rather than filtering the experimental time-series, the alternative of training the ANN with noisy time-series was chosen. As the power grid operates at 50 Hz, this component and its harmonics are present in the recorded time-series. Noise was therefore added to the generated time-series, consisting of a sum of sine functions, as in Equation (7), with frequencies of type 50 × i, i being a natural number in the range 1–i_max, where i_max is the sampling frequency divided by 50 and then rounded, so that i_max indexes the highest 50 Hz harmonic considered, bounded by f_s. The amplitudes of the 50 Hz noise harmonics decrease exponentially with the number of the harmonic, as in Equation (10), where A_ts is the amplitude of the time-series, assessed as the difference between the maximum and the minimum of that particular time-series, and i is the harmonic number. The initial phases of the harmonics φ_i were generated using random numbers with uniform distribution in (0, 2π). The power grid noise time-series x_h was computed separately from the time-series and then added to it, as in [21].

x_{h}(t) = \sum_{i = 1}^{i_{max}} 0.03 \cdot A_{ts} \cdot \exp(-0.25 \cdot i) \cdot \sin(2 π f_{i} t + φ_{i})

(10)

In addition to the 50 Hz noise, a random noise was added as well. Its time-series, x_noise, was computed separately from the time-series using Equation (11), where N_rnd is the number of frequencies f_i generated using random numbers with uniform distribution in the range [1, f_s]. For the work reported here N_rnd was selected to be 300.

x_{noise}(t) = \sum_{i = 1}^{N_{rnd}} 0.01 \cdot A_{ts} \cdot \exp(-0.005 \cdot i) \cdot \sin(2 π f_{i} t + φ_{i})

(11)

After computing the noise time series with the power grid noise and with the random noise, these series were added to the generated time-series to produce the (realistically) noisy time-series.
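The two noise terms and their addition to a clean series can be sketched as follows. The coefficients (0.03, 0.25, 0.01, 0.005), N_rnd = 300 and the 16 kHz sampling rate are taken from the text; the placeholder clean series (a single 200 Hz sine, 256 samples) and all function names are assumptions made for this illustration only.

```python
import math
import random

def harmonic_sum(freqs, amps, n_points, fs, rng):
    """Sum of sines with the given frequencies/amplitudes and random phases."""
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in freqs]
    return [sum(a * math.sin(2.0 * math.pi * f * (k / fs) + p)
                for a, f, p in zip(amps, freqs, phases))
            for k in range(n_points)]

def grid_noise(a_ts, n_points, fs, rng):
    """Equation (10): 50 Hz harmonics with exponentially decaying amplitude."""
    i_max = round(fs / 50.0)
    idx = range(1, i_max + 1)
    freqs = [50.0 * i for i in idx]
    amps = [0.03 * a_ts * math.exp(-0.25 * i) for i in idx]
    return harmonic_sum(freqs, amps, n_points, fs, rng)

def random_noise(a_ts, n_points, fs, rng, n_rnd=300):
    """Equation (11): n_rnd sines at uniformly random frequencies in [1, fs]."""
    idx = range(1, n_rnd + 1)
    freqs = [rng.uniform(1.0, fs) for _ in idx]
    amps = [0.01 * a_ts * math.exp(-0.005 * i) for i in idx]
    return harmonic_sum(freqs, amps, n_points, fs, rng)

def add_noise(series, fs, seed=0):
    """Return series + x_h + x_noise, with A_ts = max(series) - min(series)."""
    rng = random.Random(seed)
    a_ts = max(series) - min(series)
    x_h = grid_noise(a_ts, len(series), fs, rng)
    x_n = random_noise(a_ts, len(series), fs, rng)
    return [s + h + r for s, h, r in zip(series, x_h, x_n)]

fs = 16000.0
# Placeholder clean series standing in for the output of Equation (7)
clean = [math.sin(2.0 * math.pi * 200.0 * k / fs) for k in range(256)]
noisy = add_noise(clean, fs)
print(len(noisy), max(abs(n - c) for n, c in zip(noisy, clean)))
```

Because both amplitude sequences decay exponentially with i, only the first few grid harmonics contribute noticeably, while the random component is spread over many frequencies.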

The set of time-series was generated with 32,768 data points each, with 16,385 frequencies in Equation (7) and with noise added as described in the previous paragraphs of this Section. The analysis that led to the selection of these values is presented in detail in Section 2.3 of [21] and is therefore not repeated here. The coefficients in Equations (10) and (11) were selected after a process of trial and error, aiming to describe the noise that is present in experimentally recorded DLS time series as realistically as possible. The ANN procedure was revisited with the intention both to extend the range of average diameters that can be assessed using the ANN assisted DLS procedure and to improve the precision.

The targets (diameters corresponding to each generated time series) were computed using the reference method of fitting Equation (5), with D replaced from Equation (6), on the ACR, rather than fitting Equation (8) on the frequency spectrum, as in [21]. Fitting a function to data requires numerical gradient evaluations, with the number of floating point operations increasing with the number of free parameters. Fitting a function with one free parameter, as described by Equation (5), therefore requires fewer floating point operations than fitting a function with two free parameters, such as the function in Equation (8). Moreover, computing the power spectrum using the fast Fourier transform algorithm has the disadvantage of requiring 2ⁿ data points in the time series; otherwise zeros are added to match that number. The addition artificially raises the low frequency part of the spectrum, producing a fake turnover point at very low frequencies and therefore indicating a much bigger diameter than the real one.
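To make the zero-padding remark concrete, the short sketch below counts how many zeros a radix-2 FFT would append to one recording. It assumes a series of 8 s sampled at 16 kHz (the values quoted elsewhere in the text for the experimental recordings); the helper name is hypothetical.

```python
def next_power_of_two(n):
    """Smallest power of two that is >= n (n must be a positive integer)."""
    return 1 << (n - 1).bit_length()

fs = 16000       # sampling rate in Hz
duration_s = 8   # length of one recorded time series in seconds
n_samples = fs * duration_s
n_fft = next_power_of_two(n_samples)
print(n_samples, n_fft, n_fft - n_samples)  # 128000 131072 3072
```

Working on the ACR avoids this padding altogether, whereas the simulated series of 32,768 points are already a power of two.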

Another improvement, as compared with the procedure described in [21], is that the diameters range of the simulated time series was 25–6000 nm instead of 10–1200 nm.

One time-series with noise added to it was generated for each diameter. The step for increasing the diameter was 0.5 nm.

The ANN is a feed-forward type with three layers. The input to the ANN is the autocorrelation of the time-series, more precisely the first 350 lags. Consequently, the input layer has 350 neurons, each corresponding to a value from the ACR data set. The hidden layer has 26 neurons. The output layer has one neuron, as the output is the average diameter. The sigmoid function, implemented in the tansig function in Matlab, was used as a transfer function between the neurons of all three layers [38].
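The 350-26-1 architecture can be sketched as a plain forward pass. The weights below are random placeholders, not the trained values, and the linear mapping of the output from (−1, 1) to the 25–6000 nm range is an assumption about how the network output is scaled; `tanh` is used here as the Python counterpart of the hyperbolic tangent sigmoid. All names are hypothetical.

```python
import math
import random

N_IN, N_HID, N_OUT = 350, 26, 1
D_MIN_NM, D_MAX_NM = 25.0, 6000.0  # diameter range covered by the training set

def make_random_layer(n_out, n_in, rng, scale=0.05):
    """Placeholder (untrained) weights and biases for one layer."""
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [rng.uniform(-scale, scale) for _ in range(n_out)]
    return weights, biases

def layer_forward(weights, biases, inputs):
    """One layer: tanh(W x + b), evaluated neuron by neuron."""
    return [math.tanh(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]

def ann_diameter_nm(acr, hidden, output):
    """Forward pass: 350 ACR lags -> 26 hidden neurons -> 1 output."""
    h = layer_forward(hidden[0], hidden[1], acr)
    y = layer_forward(output[0], output[1], h)[0]  # value in (-1, 1)
    # Assumed output scaling: map (-1, 1) linearly onto the diameter range
    return D_MIN_NM + (y + 1.0) / 2.0 * (D_MAX_NM - D_MIN_NM)

rng = random.Random(0)
hidden = make_random_layer(N_HID, N_IN, rng)
output = make_random_layer(N_OUT, N_HID, rng)
acr = [math.exp(-0.02 * k) for k in range(N_IN)]  # placeholder ACR
print(f"{ann_diameter_nm(acr, hidden, output):.1f} nm (untrained network)")
```

The whole evaluation is two small matrix-vector products, which is why processing a series with a trained ANN is much cheaper than an iterative fit.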

The number of neurons in the hidden layer was found by increasing it till the ANN precision increased sufficiently, thus avoiding ANN over fitting behavior.

Figure 2 brings insight regarding the choice of the number of neurons in the input layer. As mentioned above, each value of the ACR, computed on each time series recorded with the same data acquisition frequency, is fed to a neuron in the input layer.

Figure 2 reveals that the ACRs of the time series corresponding to very small diameters differ only at their first 4 lags, having values that are the same to the first four significant digits beyond 4 lags; therefore we cannot use averages over two, three, or more consecutive lags, as they will differ even less. On the other hand, the ACRs for bigger diameters differ significantly from each other only at bigger lag numbers, being overlapped for the first two hundred lags, as can be seen in Figure 2, where the ACRs of time series for diameters that differ significantly from each other, that is by 0.1 µm, are plotted with the upper (red) lines. Taking a smaller number of lags as input, hence a smaller number of neurons in the input layer, will both decrease the precision and limit the range of diameters that can be assessed using the ANN, because the differences between the data that is input to the ANN are significant only at bigger lag numbers.

The last layer has a single neuron, corresponding to the average diameter.

Another thing that resulted from a trial and error procedure is that the best performance belonged to the ANN with training done on ACRs with noise added. The attempts at training the ANN with clean ACRs produced in a simple and tremendously less computation intensive manner, with Equation (5), led to very big errors when the inputs were ACRs of time series with noise added, as time series recorded in an experiment are, therefore this line was abandoned. This feature of the ANN can be explained by the distortion of the ACR produced by the added noise, as revealed by Figure 2, where the green lines present the ACRs computed for the same diameter, with and without added noise.

The training algorithm was Levenberg–Marquardt [39]. Training stopped after 65 iterations, as the R-value reached 1, when rounded to 5 digits. The training lasted for 85 min on a laptop with an Intel i7-7300 processor, using 70% of the data sets for training the ANN, 15% for testing, and 15% for validation.
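The size of the data set and of the 70/15/15 split can be worked out with a few lines. The sketch assumes that the diameter grid runs from 25 to 6000 nm in 0.5 nm steps with both end points included, and that the split is a random partition of the indices; the exact partitioning used during training is not stated in the text, so this is only one plausible reading.

```python
import random

def split_indices(n, train_frac=0.70, test_frac=0.15, seed=0):
    """Random partition of range(n) into training, testing and validation sets."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    n_train = int(train_frac * n)
    n_test = int(test_frac * n)
    return idx[:n_train], idx[n_train:n_train + n_test], idx[n_train + n_test:]

# One simulated series per diameter, 25..6000 nm in 0.5 nm steps (end points assumed included)
n_series = int(round((6000.0 - 25.0) / 0.5)) + 1
train, test, val = split_indices(n_series)
print(n_series, len(train), len(test), len(val))
```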

Once the ANN was trained, the relative errors of the ANN on the whole set of generated time-series were computed using Equation (12):

err = \frac{d_{ANN} - d_{ref}}{d_{ref}} \cdot 100 (%)

(12)
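Equation (12) translates directly into code. The two diameters used in the example call are hypothetical values chosen only to show the sign convention: a positive error means that the ANN overestimates the reference diameter.

```python
def relative_error_percent(d_ann, d_ref):
    """Equation (12): signed relative error of the ANN output, in percent."""
    return (d_ann - d_ref) / d_ref * 100.0

# Hypothetical pair of diameters in nm, for illustration only
print(relative_error_percent(d_ann=1001.0, d_ref=1000.0))  # about 0.1
```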

As the training data set covered an extended diameter range, the relative errors computed with Equation (12) are very small for the diameter range 500–6000 nm, but are slightly bigger than the relative errors reported by [21] in the very small diameter range. Actually, the relative errors remain smaller than 0.1% for diameters bigger than 500 nm; therefore Figure 3 does not present the whole range, but rather a zoom focusing on the 20–500 nm diameter range.

To conclude this brief error analysis paragraph, the ANN described in this work predicts the average diameter with a relative error up to 2.5% if the particles have a diameter in the range 25–70 nm, less than 1% if the particles have a diameter in the 70–150 nm range, less than 0.5% if the particles have a diameter in the 150–350 nm range, and less than 0.1% for particles having a diameter in the 350–6000 nm range.

Moreover, we notice that the ANN assisted diameter assessment procedure is stable and predictable for an extended particle range, from 25 to 6000 nm. The results are not precise for suspended particles with a diameter smaller than 25 nm, as they can contain relative errors in the range from −14 to 4%, therefore it should not be used to assess the size of the particles in this diameter range.

Furthermore, we should bear in mind that the average diameter mentioned in this work should be viewed as the hydrodynamic diameter, as it results from Equation (6), regardless of whether it is assessed by a least square minimization fit of Equation (5) or as a result of the ANN trained using simulated ACRs. Particles of nonspherical shape, like nanorods (not the case in yeast cells though), diffuse in a solvent and, when subjected to a DLS experiment, produce a time series and therefore a hydrodynamic diameter, considered an average diameter in the DLS sense. The diameter that is the output of the ANN procedure should be viewed as the diameter of spherical particles that diffuse in the same manner as the particles that produce a time series with the same ACR. Moreover, the diameter that is assessed by the ANN assisted DLS described in this work is slightly different from the diameter that can be assessed by other physical procedures, like optical microscopy, transmission electron microscopy, static light scattering, atomic force microscopy with its different modes of operation, and others; a detailed explanation of the differences can be found in [11], on a crystalline substance, Fe₃O₄ nanoparticles, where X-ray diffraction can be an option. Optical microscopy, using visible light with a wavelength in the range 0.380–0.740 μm, cannot be expected to produce precise results on particles with the size of a few μm, because light wave diffraction will manifest. On the other hand, the DLS technique was validated decades ago [4], therefore DLS was used as the reference.

If the suspended particles have a wider size distribution, the ACR is actually a sum of ACRs, well described by Equation (2), but that cannot be described by Equation (5). For the case of yeast cells examined during fermentation the approximation of Equation (5) holds to a certain degree, as illustrated by Figure 4. The upper subplot illustrates one of the ACRs that is very well described by Equation (5), while the lower subplot illustrates one of the ACRs that is described worse by the simple exponential decay of Equation (5).

The distortion from the simple exponential line at bigger lag numbers is caused by the noise in the experimental time series, which is confirmed by the ACRs of the simulated time series, illustrated in Figure 2. The middle lines of Figure 2 illustrate the ACR for the ideal mono-dispersed spheres and the ACR of the same time series after noise addition, as described above in this Section. We notice that the noise increases the ACR at bigger lag numbers, which is the same feature displayed by the ACR of the experimentally recorded time series presented in the upper subplot of Figure 4. Moreover, the upper subplot of Figure 4 reveals a decrease of the ACR that is faster than the simple exponential decrease, which can be caused by the smaller particles in the yeast cell group. The glucose and fructose molecules, having diameters below the lower limit of the range that can be detected by the procedure described in this work, due to the low acquisition frequency used, might also be a cause of the distortion of the ACR at small lag numbers.

The fact that some ACRs are well described and others are not so well described by a simple exponential decay might be the explanation for the fluctuations of the predicted diameters by both the reference and the ANN procedure, described by Figure 5. The decrease in precision is a consequence of the simplicity of both the experiment and of using the description for mono-dispersed particles in training the ANN, resulting in a simple function with the ACR as input and the average hydrodynamic diameter as output. Moreover, as the yeast cell concentration increases, conditions for restricted diffusion might occur and the predicted diameter might be slightly bigger than the real diameter, and this might be another cause of possible errors and fluctuations.

Another question that can arise from ACR examination is related to the origin of the motion of the suspended cells. If micron sized particles have a density different from the density of the solvent, they undergo sedimentation, and in a matter of a few hours the particles can be either on the bottom or at the surface of the cuvette. This was not the case after 164 h though; therefore, the scattered light fluctuations were produced by the yeast cells’ Brownian motion, causing diffusion.

2.4. Experimental Procedure and Time-Series Processing

Matlab code was written and used to record the DLS time-series. The length of the time-series and the delay between consecutive recordings are adjustable parameters.

The experiment lasted for 164 h. A time series lasting for 8 s was recorded every 30 min. Yeast fermentation was initiated in the circular cuvette containing the sucrose solution, as described in Section 2.1. After the experiment ended and data acquisition was completed, the time series were stored in an array, the ACRs for them were computed and the first 350 lags were added as a column to the ACRs array. The ACRs array was used later on as input to the Matlab function having the weights and biases, as generated by the training procedure, contained by the code (hardcoded) and the average diameters were computed as the output of the function corresponding to ANN, as described in Section 2.3.
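The ACR computation that feeds the ANN can be sketched as below. The estimator (mean subtraction followed by normalization to the lag-0 value) is an assumption, since the text does not spell out the exact formula, and so is the convention that the retained lags start at lag 0. The test signal is a placeholder sine and not experimental data, and the function name is hypothetical.

```python
import math

def normalized_acr(series, n_lags=350):
    """Autocorrelation of the mean-subtracted series, normalized so that lag 0 equals 1."""
    n = len(series)
    mean = sum(series) / n
    x = [v - mean for v in series]
    var = sum(v * v for v in x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) / var
            for k in range(n_lags)]

# Placeholder signal: a sine with a period of 50 samples, 1000 samples long
series = [math.sin(2.0 * math.pi * k / 50.0) for k in range(1000)]
acr = normalized_acr(series, n_lags=60)
print(f"lag 0: {acr[0]:.3f}, lag 25: {acr[25]:.3f}, lag 50: {acr[50]:.3f}")
```

For a recording of 8 s at 16 kHz (128,000 samples) and 350 lags, this direct double loop needs tens of millions of multiplications, so a vectorized or FFT-based estimator would be the practical choice.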

3. Results

The average diameters for each time series were computed using the ANN designed and trained as described in Section 2.3 and are plotted in Figure 5 versus t, the time elapsed from the beginning of the first recording. The blue circles represent the diameters assessed using the reference procedure and the squares represent the diameters assessed using the ANN assisted procedure.

Another way of looking at the diameters of the particles in suspension is to render them as a boxplot. The diameters recorded during the three time intervals that present a plateau, albeit with fluctuations, were grouped and are presented as one boxplot per group in Figure 6.

The fluctuations might be fitting artifacts of both the reference and the ANN assisted ACR processing procedures, as the particles in suspension are not mono-dispersed. Examining Figure 5 and Figure 6 we notice that during the time interval 0–10 h the average diameter was around 1510 nm, during the 50–90 h plateau around 4450 nm, and during the 120–164 h plateau around 2150 nm.

Additionally, Figure 5 shows that the diameters computed using the two procedures are very close to each other, which is consistent with the relative error computed on the simulated time series, as presented in Figure 3.
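The grouping behind such a boxplot can be sketched as below. The function name `plateau_summary` is hypothetical, the quartile convention (`method="inclusive"`) is an assumption, and the numbers in the usage lines are synthetic placeholders, not the measured data of this work. Only the plateau limits (0–10 h, 50–90 h and 120–164 h) come from the text.

```python
import statistics


def plateau_summary(times_h, diameters_nm, start_h, end_h):
    """Five-number summary of the diameters recorded in [start_h, end_h]."""
    selected = [d for t, d in zip(times_h, diameters_nm) if start_h <= t <= end_h]
    if len(selected) < 2:
        raise ValueError("need at least two diameters inside the time window")
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, median, q3 = statistics.quantiles(selected, n=4, method="inclusive")
    return {
        "count": len(selected),
        "min": min(selected),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(selected),
    }


# Synthetic placeholder values, NOT the measured data from this work.
times_h = [0.0, 0.5, 1.0, 1.5, 2.0, 60.0]
diameters_nm = [1400.0, 1500.0, 1600.0, 1700.0, 1800.0, 4400.0]
summary = plateau_summary(times_h, diameters_nm, 0.0, 10.0)
print(summary)
```

Calling the function once per plateau gives the values a boxplot would display for each group.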

We also notice that at the beginning of the experiment the size of the particles remained constant for 10 h, with fluctuations. The increase of the diameter occurred after about 10 h.

Yeasts may have asexual and sexual reproductive cycles, a feature that is typical for fungi. Reference [40], like many others, states that the most common mode of population growth in yeast is asexual reproduction by budding. During budding, a small bud is formed on the parent cell, and during this process the nucleus of the cell splits and migrates into the daughter cell [40]. The bud continues to grow and separates from the parent cell, forming a new cell [41], which is smaller than the mother cell and continues to grow after separation. This stage of the yeast cell population evolution is consistent with the increase of the average diameter of some of the SCs during the time interval from 10 h to around 50 h after the beginning of the experiment. During that time interval an increasing number of yeast cells were consuming glucose and multiplying. Their assessed diameter was expected to increase, as the diffusion coefficient decreased, because the volume, and hence the equivalent hydrodynamic diameter, of the mother and daughter cell prior to separation was bigger than the hydrodynamic diameter of a single cell. The average diameter remained on a plateau for up to 90 h from the beginning of the experiment, as the yeast cells continued to consume glucose and to multiply by budding. We also notice from Figure 5 and Figure 6 that the average diameter exhibits fluctuations around 4450 nm. This value of the assessed diameter is consistent with the electron microscope assessment of the S. cerevisiae size reported in [42]. Reference [43] states that the yeast cells are ellipsoidal, measuring approximately 4.76 μm on the long axis and 4.19 μm on the short axis for haploids and 6.01 by 5.06 μm for diploids. We should also note that electron microscopy and DLS are different physical procedures and, therefore, the diameters assessed using them are expected to differ from each other [11,44].

The diploid yeast cell under nutritional starvation gives rise to four haploid meiotic progeny, each of them being encapsulated in a spore. All four products from a single meiosis are wrapped together in a sac, called an ascus [42]. This might be the situation in the small cuvette volume about 90 h after the beginning of the experiment, once the glucose had been consumed by the yeast cells. This is consistent with the decrease of the average diameter as assessed by both the reference procedure and the ANN assisted procedure. The average diameter continued to decrease as the number of haploid cells increased and the number of diploid cells decreased.

Figure 5 also indicates that after about 140 h the diameter remains around 1950 nm. This average diameter is considerably smaller than the size of the haploid cells. Reference [42] indicated that the ascus wall can be digested by degradative enzymes to separate the individual spores from each other. A possible explanation of the decrease of the average diameter to such small values, smaller than the reported size of the haploid cells [42], might be that the ascus wall was disrupted and that the individual spores became free and suspended in the cuvette.

Coming back to the first and the last plateaus, corresponding to the time intervals 0–10 h and 120–164 h, we notice that the median values of the assessed diameter are 1510 nm and 2150 nm, respectively, considerably smaller than the diameter of both diploid and haploid yeast cells. If we make a rough estimate of the volume of the haploid cell, considering it to be a sphere with an average diameter of 4500 nm, divide it by four and compute the diameter of the resulting smaller sphere, we get a value of roughly 2800 nm. This value is bigger than what was measured using both the ANN assisted DLS and the reference procedure on both plateaus. A possible explanation is that when a haploid cell splits into spores, each spore does not receive a quarter of the haploid cell volume, but less. The first plateau, corresponding to the time interval 0–10 h from the beginning of the experiment, presented a median value of 1510 nm, smaller than the median diameter of the last plateau. A possible explanation might be that, in the solid form of the sample that was used, the yeast was in the form of spores, and during the first 10 h the spores hydrated and reached a condition in which they could recombine to form living yeast cells.
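The rough estimate above follows from the fact that the volume of a sphere scales with the cube of its diameter, so dividing the volume by four divides the diameter by the cube root of four. A minimal sketch of this arithmetic, with the hypothetical helper name `diameter_after_equal_split`, is:

```python
def diameter_after_equal_split(parent_diameter_nm, parts):
    """Diameter of each of `parts` equal-volume spheres cut from one sphere.

    Sphere volume is proportional to d**3, so V/parts corresponds to a
    diameter of d / parts**(1/3).
    """
    if parts <= 0:
        raise ValueError("parts must be positive")
    return parent_diameter_nm / parts ** (1.0 / 3.0)


# Values from the text: a 4500 nm sphere whose volume is divided by four.
spore_diameter_nm = diameter_after_equal_split(4500.0, 4)
print(f"{spore_diameter_nm:.0f} nm")
```

The result is a little above 2800 nm, in line with the rounded value quoted in the text, and well above the medians of 1510 nm and 2150 nm found on the two plateaus.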

4. Discussion and Conclusions

This paper presents an ANN assisted procedure for DLS time series processing, with details on the ANN design and training procedure. The procedure has the average diameter of the suspended particles as output. This average diameter should be viewed as a DLS average diameter, which is the diameter of mono-dispersed particles that produce a DLS time series with the same ACR. The upper range for the particle size that can be precisely measured with the ANN assisted procedure has been increased from several hundred nanometers in the version presented in [20], and 1200 nm in the version presented in [21], to 6000 nm in this version. Moreover, an error analysis of this ANN assisted procedure, with respect to the reference method described in Section 2.2, was carried out over an extended particle size range, which is from 25 to 6000 nm. The relative error, as defined in Equation (12), is up to 2.5% if the particles have a diameter in the 25–70 nm range, less than 1% if the particles have a diameter in the 70–150 nm range, less than 0.5% if the particles have a diameter in the 150–350 nm range, and less than 0.1% for particles having a diameter in the 350–6000 nm range.

The ANN assisted DLS time series processing, working on an extended diameter range, remains thousands of times faster than the reference DLS time series processing procedure described in Section 2.2, which in turn is faster than the reference procedure described in [20] because it has one free parameter instead of two. A single, universally valid speed-up factor cannot be stated, because the time needed to complete the reference procedure depends strongly on the starting parameter (an educated or less educated guess) and on the random parameters used by the least squares procedure itself. For example, the whole set of 329 time series recorded on the yeast suspension in this work was processed using the reference DLS procedure in 1.3166 s, while the ANN assisted DLS procedure took 2.0837 × 10^−4 s. Both times refer to processing ACRs already computed and loaded in computer RAM as variables. The outputs of both procedures were retained as variables as well, so no hard disk operations were involved. For this set of data, the ANN assisted DLS procedure was therefore 6319 times faster than the reference DLS procedure. It is worth mentioning that being faster is not a big advantage for the type of experiment described in this work, which lasted for 164 h, but it might be of interest for processes that complete in a matter of seconds, like nanoparticle aggregation, as reported in works like [1,2,43,44].

Moreover, the procedure can be made thousands of times faster if the output is not written to the computer hard drive but retained as a variable. The time for recording the time series remains the same, though; the statement above applies only to the time required for processing the DLS time series once they have been recorded.
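The speed-up factor quoted above is simply the ratio of the two processing times. The short sketch below reproduces that arithmetic from the numbers given in the text; the function name `speedup` and the per-series averages are illustrative additions, not values reported in this work.

```python
def speedup(reference_time_s, candidate_time_s):
    """How many times faster the candidate is than the reference."""
    if candidate_time_s <= 0.0:
        raise ValueError("candidate time must be positive")
    return reference_time_s / candidate_time_s


# Processing times and series count as stated in the text.
reference_time_s = 1.3166
ann_time_s = 2.0837e-4
n_series = 329

factor = speedup(reference_time_s, ann_time_s)
print(f"speed-up factor: {factor:.0f}")
# Average time per time series, derived here for illustration only.
print(f"reference: {reference_time_s / n_series * 1e3:.2f} ms per series")
print(f"ANN:       {ann_time_s / n_series * 1e6:.2f} us per series")
```

As the text notes, this factor holds for this particular data set and run, since the reference procedure's time depends on its starting parameter.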

The ANN assisted DLS time series processing presented here is not intended to replace CONTIN [45] or the maximum entropy method; it is just another approach to time series processing. CONTIN relies on the inverse Laplace transform, being basically an improvement of the inverse Laplace transform method. Calculating the inverse Laplace transform of experimental data is an ill-posed mathematical problem and can lead to ambiguous results. The CONTIN method tries to counter this effect by introducing regularization, driven by a parameter that has a very big influence on the solution found. The maximum entropy method assigns an entropy to the solutions and searches for the solution with maximum entropy, which is computation intensive as well. The ANN based DLS described in this work, with its limitation of providing the average DLS diameter only, was designed to avoid these complications. Moreover, this procedure is a step forward in designing an ANN assisted DLS, a purely computational procedure, that will have the size distribution as output, without any assumption regarding the theoretical function describing it. The method, as presented, is less computation intensive than fitting a function to a set of data because the ANN procedure actually consists of matrix multiplication and addition. It is easy to implement on a low power computation platform like a cell phone or a development platform, as it does not require particular libraries with the functions needed for least squares methods.

The method, simple to use once the ANN has been trained, was used to monitor the variation of the average S. cerevisiae yeast cell size during the evolution of the population, from dissolving from solid form in sucrose solution until the occurrence of nutritional stress. The stages of the cell population evolution, such as hydration, multiplication by budding, haploid formation and spore release, were made evident by the variation of the average diameter assessed using the ANN assisted DLS procedure. With these aspects in mind, the experiment presented in this work can be considered a proof of concept for the proposed method, which is easy to use and low on resources. This qualifies the simple setup and the procedure as a possible sensor for different processes in the food industry where the size of the suspended particles is related to different stages of the process. Moreover, this simple procedure can be used to identify the presence of bigger particles, like bacteria, in a suspension where the expected particles are in a much smaller diameter range, which might be useful in establishing a fast procedure for safety or quality assurance.
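To illustrate why evaluating a trained ANN needs only multiplications and additions, the following sketch implements the forward pass of a small feedforward network using nothing but the Python standard library. The architecture here (one hidden layer with a tanh activation and a single linear output) is an assumption made for illustration; the actual design is the one described in Section 2.3, and the weights shown in the usage lines are placeholders, not the trained values.

```python
import math


def dense(weights, biases, inputs):
    """Matrix-vector product plus bias: one row of `weights` per neuron."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def ann_forward(acr, w_hidden, b_hidden, w_out, b_out):
    """Map an ACR (list of lag values) to a single output (the diameter).

    Assumed architecture: tanh hidden layer followed by a linear output.
    """
    hidden = [math.tanh(v) for v in dense(w_hidden, b_hidden, acr)]
    return dense(w_out, b_out, hidden)[0]


# Placeholder weights for a 2-input, 2-neuron network, NOT trained values.
w_hidden = [[1.0, 0.0], [0.0, 1.0]]
b_hidden = [0.0, 0.0]
w_out = [[2.0, 3.0]]
b_out = [1.0]
print(ann_forward([0.5, -0.5], w_hidden, b_hidden, w_out, b_out))
```

Because the weights and biases are fixed after training, they can be hardcoded as constants, and no least squares or optimization library is needed at evaluation time.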

Funding

The work presented here was financed by the Lucian Blaga University of Sibiu research grant LBUS-IRG-2017-03.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Piederrière, Y.; Le Meur, J.; Cariou, J.; Abgrall, J.F.; Blouch, M.T. Particle aggregation monitoring by speckle size measurement; application to blood platelets aggregation. Opt. Express 2004, 12, 4596–4601.
2. Piederrière, Y.; Cariou, J.; Guern, Y.; Le Jeune, B.; Le Brun, G.; Lotrian, J. Scattering through fluids: Speckle size measurement and Monte Carlo simulations close to and into the multiple scattering. Opt. Express 2004, 12, 176–188.
3. Chicea, D. Speckle size, intensity and contrast measurement application in micron-size particle concentration assessment. Eur. Phys. J. Appl. Phys. 2007, 40, 305–310.
4. Clark, N.A. A study of brownian motion using light scattering. Am. J. Phys. 1970, 38, 575.
5. Goodman, J.W. Statistical properties of laser speckle patterns. In Laser Speckle and Related Phenomena; Springer: Berlin/Heidelberg, Germany, 1984; pp. 9–75.
6. Kataoka, T.; Inoue, H.; Endo, K.; Oshikane, Y.; Mori, Y.; Nakano, M.; Wada, K.; An, H. Light scattering by small particles and small defects on the silicon wafer surface. Calculations of scattering light intensity and optical image through a lens. J. Jpn. Soc. Precis. Eng. 2000, 66, 1716–1722.
7. Berne, B.J.; Pecora, R. Dynamic Light Scattering: With Applications to Chemistry, Biology, and Physics; Dover Publications: Mineola, NY, USA, 2000; pp. 164–206.
8. Xu, R. Particle characterization: Light scattering methods. China Particuol. 2003, 1, 271.
9. Bhattacharjee, S. DLS and zeta potential – What they are and what they are not? J. Control. Release 2016, 235, 337–351.
10. Stetefeld, J.; McKenna, S.A.; Patel, T.R. Dynamic light scattering: A practical guide and applications in biomedical sciences. Biophys. Rev. 2016, 8, 409–427.
11. Chicea, D.; Indrea, E.; Cretu, C.M. Assesing Fe3O4 nanoparticle size by DLS, XRD and AFM. J. Optoelectron. Adv. Mater. 2012, 14, 460–466.
12. Chicea, D. A study of nanoparticle aggregation by coherent light scattering. Curr. Nanosci. 2012, 8, 259–265.
13. Gurney, K. An Introduction to Neural Networks; Taylor & Francis e-Library: London, UK, 2004.
14. Haykin, S. Neural Networks and Learning Machines Third Edition – University Hamilton; Prentice Hall: Ontario, ON, Canada, 2008.
15. Carrieri, A.H. Neural network pattern recognition by means of differential absorption Mueller matrix spectroscopy. Appl. Opt. 1999, 38, 3759–3766.
16. Berdnik, V.V.; Loiko, V.A. Retrieval of size and refractive index of spherical particles by multiangle light scattering: Neural network method application. Appl. Opt. 2009, 48, 6178–6187.
17. Berdnik, V.V.; Mukhamedjarov, R.D.; Loiko, V.A. Characterization of optically soft spheroidal particles by multiangle light-scattering data by use of the neural-networks method. Opt. Lett. 2004, 29, 1019–1021.
18. Kaye, P.; Hirst, E.; Wang-Thomas, Z. Neural-network-based spatial light-scattering instrument for hazardous airborne fiber detection. Appl. Opt. 1997, 36, 6149–6156.
19. Ulanowski, Z.; Wang, Z.; Kaye, P.H.; Ludlow, I.K. Application of neural networks to the inverse light scattering problem for spheres. Appl. Opt. 1998, 37, 4027–4033.
20. Chicea, D. Using neural networks for dynamic light scattering time series processing. Meas. Sci. Technol. 2017, 28, 055206.
21. Chicea, D.; Rei, S. A fast artificial neural network approach for dynamic light scattering time series processing. Meas. Sci. Technol. 2018, 29, 105201.
22. Dong, L.; Chen, X.; Hu, Z. Application of artificial neural networks for the determination of proteins with CPA-pI by rayleigh light scattering technique. J. Lumin. 2007, 124, 85–92.
23. He, L.; Kear-Padilla, L.; Lieberman, S.; Andrews, J. Rapid in situ determination of total oil concentration in water using ultraviolet fluorescence and light scattering coupled with artificial neural networks. Anal. Chim. Acta 2003, 478, 245–258.
24. Shabanov, A.E.; Petrov, M.N.; Chikitkin, A.V. A multilayer neural network for determination of particle size distribution in dynamic light scattering problem. Comput. Res. Model. 2019, 11, 265–273.
25. Zhao, H.; Dreses-Werringloer, U.; Davies, P.; Marambaud, P. Amyloid-beta peptide degradation in cell cultures by mycoplasma contaminants. BMC Res. Notes 2008, 1, 38.
26. Schulz-Vogt, H.N.; Jørgensen, B.B. Big bacteria. Annu. Rev. Microbiol. 2001, 55, 105–137.
27. Kurtzman, C.P.; Fell, J.W. Yeast Systematics and Phylogeny – Implications of Molecular Identification Methods for Studies in Ecology; Springer Science and Business Media LLC: Berlin/Heidelberg, Germany, 2006.
28. McGovern, P.E.; Hartung, U.; Badler, V.R.; Glusker, D.L.; Exner, L.J. The beginnings of winemaking and viniculture in the ancient Near East and Egypt. Expedition 1997, 39, 3–21.
29. Cavalieri, D.; McGovern, P.E.; Hartl, D.L.; Mortimer, R.; Polsinelli, M. Evidence for S. cerevisiae fermentation in ancient wine. J. Mol. Evol. 2003, 57, S226–S232.
30. McGovern, P.E. Ancient Wine: The Scientific Search for the Origins of Viniculture; Princeton University Press: Princeton, NJ, USA, 2003.
31. Vaughan-Martini, A.; Martini, A. Saccharomyces meyen ex reess. In The Yeasts; Elsevier B.V.: Amsterdam, The Netherlands, 1998; pp. 358–371.
32. Botstein, D.; Fink, G. Yeast: An experimental organism for modern biology. Science 1988, 240, 1439–1443.
33. Otero, J.M.; Cimini, D.; Patil, K.R.; Poulsen, S.G.; Olsson, L.; Nielsen, J. Industrial systems biology of saccharomyces cerevisiae enables novel succinic acid cell factory. PLoS ONE 2013, 8, e54144.
34. Chumnanpuen, P.; Brackmann, C.; Nandy, S.K.; Chatzipapadopoulos, S.; Nielsen, J.; Enejder, A. Lipid biosynthesis monitored at the single-cell level in Saccharomyces cerevisiae. Biotechnol. J. 2011, 7, 594–601.
35. Runguphan, W.; Keasling, J.D. Metabolic engineering of Saccharomyces cerevisiae for production of fatty acid-derived biofuels and chemicals. Metab. Eng. 2014, 21, 103–113.
36. Rumble, J.R.; Lide, D.R.; Bruno, T.J. CRC Handbook of Chemistry and Physics, 100th ed.; CRC Press: Boca Raton, FL, USA, 2019.
37. Einstein, A. Über die von der molekularkinetischen theorie der wärme geforderte bewegung von in ruhenden flüssigkeiten suspendierten teilchen. Ann. Phys. 1905, 322, 549–560.
38. Beale, H.D.; Demuth, H.B.; Hagan, M.T.; DeJesus, O. Neural Network Design. Available online: https://hagan.okstate.edu/NNDesign.pdf (accessed on 5 August 2019).
39. Levenberg, K. A method for the solution of certain non-linear problems in least squares. Q. Appl. Math. 1944, 2, 164–168.
40. Balasubramanian, M.K.; Bi, E.; Glotzer, M. Comparative analysis of cytokinesis in budding yeast, fission yeast and animal cells. Curr. Biol. 2004, 14, R806–R818.
41. Yeong, F.M. Severing all ties between mother and daughter: Cell separation in budding yeast. Mol. Microbiol. 2005, 55, 1325–1331.
42. Neiman, A.M. Ascospore formation in the yeast saccharomyces cerevisiae. Microbiol. Mol. Biol. Rev. 2005, 69, 565–584.
43. Chicea, D. Nanoparticles and nanoparticle aggregates sizing by DLS and AFM. J. Optoelectron. Adv. Mater. 2010, 4, 1310–1315.
44. Chicea, D. Revealing Fe3O4 nanoparticles aggregation dynamics using dynamic light scattering. Optoelectron. Adv. Mater. Rapid Commun. 2009, 3, 1299–1305.
45. Provencher, S.W. CONTIN: A General Purpose Constrained Regularization Program for Inverting Noisy Linear Algebraic Integral Equations. Comput. Phys. Commun. 1982, 27, 229–242.

Figure 1. Dynamic light scattering (DLS) experimental setup.

Figure 2. The autocorrelation (ACR) computed on time series generated without noise with Equation (7) for diameters of 10 nm, the continuous lower (blue) line; 15 nm, the lower dashed (blue) line; for 5100 nm, the continuous upper (red) line; and 5200 nm, the upper dashed (red) line. The ACRs for a diameter of 2858 nm computed with Equation (5) are presented with the continuous middle (green) line; and with noise added, Equations (10) and (11), with the dashed middle (green) line.

Figure 3. The variation of the relative errors for the diameter range [20–500] nm, computed with ANN.

Figure 4. The ACRs of two of the time series. The upper subplot illustrates an ACR that is well described by Equation (5) and the lower subplot the ACR that departs the most from the ideal exponential decay described by Equation (5).

Figure 5. The average yeast cell diameters during fermentation. The circles depict the diameters assessed using the reference procedure and the squares depict the diameters assessed using the ANN assisted procedure.

Figure 6. The boxplot of the diameters, in nm, on the three plateaus: 0–10 h, 50–90 h and 120–164 h.

© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
