Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation

Blackshare, Jully; Corman, Brianna; Rajwa, Bartek; Robinson, J. Paul; Bae, Euiwon

doi:10.3390/bios16010057

Open AccessArticle

Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation

by

Jully Blackshare

¹

,

Brianna Corman

²

,

Bartek Rajwa

³

,

J. Paul Robinson

^2,4

and

Euiwon Bae

^1,*

¹

Applied Optics Laboratory, School of Mechanical Engineering, Purdue University, West Lafayette, IN 47907, USA

²

Department of Basic Medical Sciences, Purdue University, West Lafayette, IN 47907, USA

³

Bindley Bioscience Center, Discovery Park, Purdue University, West Lafayette, IN 47907, USA

⁴

Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA

^*

Author to whom correspondence should be addressed.

Biosensors 2026, 16(1), 57; https://doi.org/10.3390/bios16010057

Submission received: 2 December 2025 / Revised: 5 January 2026 / Accepted: 6 January 2026 / Published: 13 January 2026

(This article belongs to the Special Issue Microbial Biosensor: From Design to Applications—2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Foodborne pathogens remain a major global concern, demanding rapid, accessible, and determination technologies. Conventional methods, such as culture assays and polymerase chain reaction, offer high accuracy but are time-consuming for on-site testing. This study presents a portable, smartphone-assisted dual-mode biosensor that combines colorimetric and photothermal speckle imaging for improved sensitivity in lateral flow assays (LFAs). The prototype device, built using low-cost components ($500), uses a Raspberry Pi for illumination control, image acquisition, and machine learning-based signal analysis. Colorimetric features were derived from normalized RGB intensities, while photothermal responses were obtained from speckle fluctuation metrics during periodic plasmonic heating. Multivariate linear regression, with and without LASSO regularization, was used to predict Salmonella concentrations. The comparison revealed that regularization did not significantly improve predictive accuracy indicating that the unregularized linear model is sufficient and that the extracted features are robust without complex penalization. The fused model achieved the best performance (R² = 0.91) and consistently predicted concentrations down to a limit of detection (LOD) of 10⁴ CFU/mL, which is one order of magnitude improvement of visual and benchtop measurements from previous work. Blind testing confirmed robustness but also revealed difficulty distinguishing between negative and 10³ CFU/mL samples. This work demonstrates a low-cost, field-deployable biosensing platform capable of quantitative pathogen detection, establishing a foundation for the future deployment of smartphone-assisted, machine learning-enabled diagnostic tools for broader monitoring applications.

Keywords:

lateral flow assay; portable biosensor; photothermal imaging; colorimetric imaging; machine learning; Salmonella

1. Introduction

Foodborne pathogens remain a persistent global threat to public health, affecting nearly 600 million people each year and imposing substantial economic burdens on healthcare systems worldwide [1,2,3,4]. To minimize the risk of outbreaks, contamination must be detected rapidly and accurately before products reach consumers [5,6]. Regulatory agencies such as the U.S. Department of Agriculture (USDA) and the Food and Drug Administration (FDA) have established strict microbial thresholds for food safety, yet existing diagnostic approaches often fail to meet the speed and accessibility requirements for field deployment [7,8].

Conventional methods such as culture-based assays and polymerase chain reaction (PCR) offer high accuracy but are labor-intensive, time-consuming, and require a laboratory setting, making them impractical for rapid screening at points for production or distribution [9,10]. These limitations underscore the need for portable, low-cost biosensing technologies capable of detecting pathogens at low concentrations in real time.

Lateral flow assays (LFAs) have gained widespread adoption in point-of-care diagnostics due to their simplicity, affordability, and rapid delivery results. They are routinely used for detecting a wide range of analytes, from infectious diseases related targets to contaminants in food and water [11,12,13,14]. However, their low sensitivity remains a major obstacle. Commercial LFAs can produce false-negative results when pathogen concentrations fall below their detection limit, particularly in food samples where early detection of contamination is critical [15]. Furthermore, conventional LFAs only provide qualitative results, a binary outcome indicating a test is positive or negative, offering no quantitative information that could assist in assessing contamination severity. Moreover, interference from complex food matrices can suppress signal and reduce assay sensitivity, making conventional LFAs less reliable for pathogen detection in real food samples [16,17,18].

To address this limitation, extensive research has focused on improving LFA sensitivity through either nanoparticle engineering or signal enhancement strategies. While modified tracers, such as platinum or silver nanoparticles, have been shown to enhance optical responses, they require laboratory synthesis and lack the robustness of commercially available gold nanoparticles (AuNPs) [19,20,21,22]. Consequently, alternative enhancement approaches have shifted toward optical and computational sensing methods that improve signal detection without altering assay chemistry [23,24,25,26].

Two promising optical modalities are colorimetric imaging and photothermal analysis (Figure 1). Colorimetric methods utilize imaging systems, such as smartphone cameras, to capture test lines that may be invisible to the naked eye, followed by image processing or machine learning analysis to quantify the signal. Photothermal sensing, on the other hand, exploits the plasmonic heating properties of AuNPs [27,28]. When irradiated near their resonance wavelength around 520 nm, localized temperature increases can be measured through an infrared camera or photothermal speckle imaging, which tracks refractive-index fluctuations caused by thermal changes [26,28].

In our previous work, we compared these two modalities in a benchtop setup [29]. The results did not show a substantial improvement in detection limit compared to visual inspection. However, they revealed important opportunities for further optimization and integration. This study builds upon those findings, and introduces a portable, smartphone-integrated dual-mode biosensor that combines colorimetric and photothermal sensing into a unified system controlled by a Raspberry Pi microcontroller. The device captures and processes both optical signals, employing machine learning algorithms to quantify test results.

In this work, we investigate whether fusing colorimetric and photothermal modes within a portable platform can enhance sensitivity and reliability beyond what either method might achieve independently. The working hypothesis is that integrating complementary information from each sensing mode will lead to a lower limit of detection (LOD).

The objective of this study is to design, implement, and evaluate a portable dual-mode biosensor for Salmonella Typhimurium detection, emphasizing sensitivity improvement, quantitative performance, and on-field usability. The outcome demonstrates the feasibility of achieving a one-order-of-magnitude improvement in detection limit on a smartphone-operable platform suitable for real-time food safety monitoring.

2. Materials and Methods

2.1. Hardware Architecture

A compact dual-mode biosensing system was developed to integrate optical excitation, image acquisition, and real-time signal processing within a single embedded platform. A Raspberry Pi 5 (Raspberry Pi Ltd., Cambridge, UK) served as the main controller, selected for its processing capability and GPIO interfaces that allow simultaneous control of lighting sources and data acquisition. The Raspberry Pi 5 managed all computation, device communication, and image processing tasks while also serving as a local server to interface with a smartphone application.

A Raspberry Pi camera module 3 (Raspberry Pi Ltd., Cambridge, UK) was selected for image acquisition due to its seamless hardware compatibility and flexible control via the PiCamera2 library [30]. The camera captured images suitable for both colorimetric and photothermal analyses, with adjustable parameters such as exposure time, gain, and frame rate.

Two laser modules were incorporated into the system. A 532 nm 50 mW laser (CivilLaser, Wuhan, China) facilitated photothermal excitation, whereas a 780 nm 30 mW laser (CivilLaser, Wuhan, China) was utilized for speckle generation. The 532 nm laser was transistor-transistor-logic (TTL)-modulated through the Raspberry Pi’s GPIO pins to generate periodic excitation. The 780 nm laser was operated in continuous-wave mode, and its beam diameter was manually adjusted using a dot-focus lens. For colorimetric imaging, two circular white LED rings provided uniform illumination across the testing surface to ensure consistent imaging conditions. Power was supplied through a 5 V DC adapter that distributed current to the laser, camera, and LEDs through a custom-fabricated printed circuit board (PCB) (PCBWay, Shenzhen, China), which was designed using KiCad (KiCad, Geneva, Switzerland).

The Raspberry Pi communicated with a smartphone interface using an HTTP-based protocol, enabling the users to control imaging operations and visualize results in real time. The biosensor communicated with a smartphone interface through an HTTP-based protocol, allowing the users to send operational commands and visualize processed results in real time.

2.2. Portable Device Design

A portable enclosure was designed to integrate all hardware components into a compact and field-deployable system. The housing was modeled in SolidWorks 2025 (SOLIDWORKS 3D CAD, Dassault Systèmes, Waltham, MA, USA) and fabricated using an Adventurer 5M 3D printer (Flashforge 3D Technology, Jinhua, China) with PLA filament. The overall dimensions of the assembled device were approximately 150 mm

\times

140 mm

\times

140 mm, comparable to the size of a modern smartphone (Figure 2).

The mechanical layout was optimized to ensure both measurement reproducibility and ease of operation. Dedicated laser mounts enabled manual adjustment of beam position and incidence angle, allowing precise alignment of illumination on the LFA membrane. A fixed camera mount and sample holder maintained a consistent imaging distance and field of view between measurements. The entire enclosure was light-tight, minimizing ambient light interference for both imaging modes.

A removable LFA cartridge slot was added to the front panel, allowing users to easily insert and replace test strips. To isolate the 780 nm wavelength for photothermal measurements, a bandpass filter holder was placed below the camera module. The holder was designed to be easily inserted and removed between imaging modes.

The custom PCB and Raspberry Pi 5 were housed in an isolated compartment to reduce thermal and electrical interference. The compartment cover was magnetically attached, providing convenient access for maintenance or component replacement. The finalized system is illustrated in Figure 3, which shows the overall device layout, including the computer-aided design (CAD) model and the fabricated prototype. The total material cost for the complete biosensor system was approximately $500, demonstrating the feasibility of a low-cost, portable diagnostic platform.

2.3. Smartphone Integration

A custom smartphone application was developed to provide a user-friendly interface for operating the biosensor and visualizing test results. The application was built in a web-based environment using MIT App Inventor, which hosted an HTML interface with JavaScript and CSS for functionality and styling [31]. The primary purpose of the app was to enable wireless communication with the Raspberry Pi 5, simplifying device control for users.

Communication between the smartphone and Raspberry Pi occurred over a local network using an HTTP-based protocol. Upon establishing a connection, the app transmitted operational commands such as “Start Test” to the Raspberry Pi’s built-in Flask web server [32]. The Raspberry Pi executed the requested operations. All image processing and data analysis were performed locally on the Raspberry Pi to ensure consistent performance independent of smartphone hardware.

The smartphone served as a graphical control and visualization tool, displaying real-time feedback and final test results (Figure 4). The workflow consisted of two primary modes:

Calibration Mode: Captures reference data from negative control LFAs to establish baseline values for each vendor.
Testing Mode: Acquires new colorimetric or photothermal data and compares them to stored calibrated reference.

Calibration data were saved locally on the Pi as vendor-specific JSON files, each containing values derived from the control samples. When testing, the system automatically loaded the corresponding calibration file based on the selected vendor. This configuration allowed users to perform calibration once and reuse it across multiple tests without having to manually perform calibration every time for new tests.

2.4. Colorimetric Analysis

In the colorimetric mode, the biosensor captures images of lateral flow assays (LFAs) using the Raspberry Pi camera under uniform lighting provided by two circular white LED strips mounted inside the enclosure. The imaging distance was fixed to ensure reproducible measurements.

For both calibration and testing, the same acquisition procedure was followed, which is outlined in Figure 5. Once the LFA was inserted into the cartridge slot, the Raspberry Pi captures a single JPEG image that was subsequently decomposed into grayscale and RGB color channels. The red, green, and blue channels were analyzed separately to evaluate which provided the highest contrast between the test line and background.

A line intensity profile was extracted across the test line region using a custom script implemented in Python 3.13.2 with OpenCV and NumPy [33,34]. The test line was identified as a local minimum within the intensity profile, while the background intensity (

I_{background})

was determined by averaging pixel values over a selected area adjacent to the test region. The raw line test intensity

(I_{test})

was normalized to the background using:

I_{norm} = 1 - \frac{I_{test}}{I_{background}}

(1)

This normalization ensured consistent comparison across independent experiments. The resulting normalized intensity values from the selected color channels were then stored as input features for the machine learning analysis described in Section 2.6.

2.5. Photothermal Analysis

The photothermal biosensor used a dual-laser design comprising a 532 nm excitation laser and a 780 nm probe laser. The 532 nm laser was TTL-modulated at 1 Hz (50% duty cycle) through the Raspberry Pi’s GIPO pins to induce periodic plasmonic heating of gold nanoparticles on the LFA membrane. Simultaneously, the continuous 780 nm laser generated a speckle pattern, which was recorded by the Raspberry Pi camera through a 780 nm bandpass filter to isolate the probing wavelength.

For each sample, a total of 150 frames were captured at 30 frames per second. This 5 s duration was selected to capture five full signal cycles, providing sufficient spectral resolution for the 1 Hz target frequency while minimizing data processing time on the selected controller. The region of interest, including the test line, was cropped using a circular mask to minimize background interference and suppress unwanted noise [35,36,37]. Pixels outside the mask were excluded from processing, reducing the contribution of low-contrast boundary regions while improving signal consistency.

The temporal intensity variations in the images were extracted from a speckle-intensity time series. After removing the direct current (DC) offset, a Fast Fourier Transform (FFT) was applied to convert the temporal signal into the frequency domain. The spectral magnitude at the modulation frequency (1 Hz) was identified as the characteristic photothermal response. This magnitude was then averaged across all masked pixels to obtain the representative signal for each test strip.

2.6. Machine Learning

Machine learning models were developed to quantitatively relate the optical features extracted from each sensing modality to Salmonella Typhimurium concentration. Separate regression models were first trained for the colorimetric and photothermal datasets, followed by a fused analysis to assess the benefit of multimodal integration. Each dataset was divided into training and testing subsets, and all features were standardized prior to model fitting to eliminate scale dependency [38,39,40].

The modeling process involved feature selection, normalization, and regression model training, followed by cross-validation to evaluate prediction accuracy. The trained models were then applied to a blind test dataset to assess generalizability and robustness.

2.6.1. Colorimetric Feature Selection

In the colorimetric mode, input features were extracted from normalized color-channel intensities in the test line area. The green, blue, and grayscale channels were utilized for model training, whereas the red channel was omitted. The red hue of the test line exhibited diminished contrast, rendering it seemingly faded. The ultimate feature set comprised normalized intensities from the green, blue, and grayscale channels, which demonstrated a positive correlation with bacterial concentration. These features indicated the alteration in optical density at the test line.

2.6.2. Photothermal Feature Selection

Photothermal features were designed to capture both temporal and spatial variations in the speckle pattern during periodic plasmonic heating [41,42]. The extracted photothermal features described both temporal fluctuations in the speckle pattern arising from periodic heating and the spatial distribution of the corresponding spectral magnitudes in the frequency domain. Before modeling, the extracted features were standardized and Box–Cox transformed to ensure uniform Gaussian distribution [38,39].

Spatial Decorrelation Metrics

Let

I_{t} (x, y)

be the speckle image at time

t

. For every adjacent pair of frames

(I_{t}, I_{t + 1})

two measures of frame-to-frame variation were computed and then averaged over the entire temporal sequence:

Structural Similarity Index (SSIM): Quantifies luminance, contrast, and structural similarity between I_t and I_t+1. Higher SSIM values indicate greater similarity and less temporal decorrelation [43].
Mean Squared Error (MSE): Represents the mean of ${[I_{t} (x, y) - I_{t} (x, y)]}^{2}$ over the ROI. Larger MSE reflects larger intensity fluctuations caused by thermally induced phase shifts [44,45].

Spectral Distribution Metrics

For each pixel

k

in the ROI, the temporal intensity

I_{k} (t)

was Fourier transformed, and the magnitude at the laser modulation frequency (

f_{0} = 1

Hz) was extracted, yielding one value per pixel. The spatial distributions of these magnitudes were then summarized by their four moments:

Mean: Average response strength at f₀ across pixels.
Standard deviation: Spatial uniformity of the response, with a larger value indicating a more heterogeneous distribution.
Skewness: Asymmetry of the magnitude distribution, reflecting non-uniform regions with disproportionately high or low responses.
Kurtosis: Tail heaviness or prevalence of outliers within the distribution.

2.6.3. Data Augmentation

To enhance model generalization and robustness against environment variability, data augmentation was applied to both colorimetric and photothermal datasets [46]. These transformations simulated variations that may arise during field operation, such as changes in lighting, camera exposure, or mechanical alignment.

Colorimetric Data

Colorimetric images were augmented using photometric transformations that replicated changes in lighting and camera exposure [47]. Three operations were applied to each image.

Brightness and contrast adjustment: Pixel intensities were randomly scaled and shifted within defined ranges (−25 to 25 for brightness and −30 to 30 for contrast) to reproduce underexposed or overexposed conditions.
Color temperature modification: RGB channel scaling factors were adjusted to simulate warm or cool lighting conditions, with random values drawn from a range of −20 to 20 relative to predefined reference temperature points.

These transformations created realistic variations in color tone and intensity, helping the model become less sensitive to lighting differences and minor exposure shifts.

Photothermal Data

Photothermal speckle images were augmented to account for variability in temporal alignment, sensor noise, and sample positioning [48]. Each image sequence was subjected to the following operations (Figure 6):

Circular time shift: The frame order was rotated by a random integer k, effectively changing the starting point of the sequence while preserving temporal continuity. This process corresponds to a phase shift in the frequency domain but leaves the amplitude spectrum unchanged [45,49].
Photometric jitter: Frame brightness and contrast were perturbed using a random gain $(g)$ , bias $(b)$ , and Gaussian noise $(η)$ . The following operation was conducted to simulate sensor response variations and low-level noise [50].

$I_{t}^{’} (x, y) = g I_{t} (x, y) + b + η (x, y)$

(2)

where $I (x, y)$ and $I ’ (x, y)$ denote the original and perturbed pixel intensities, respectively.
Minor affine transformation: Each frame was randomly rotated and translated to replicate small shifts in sample positioning or camera misalignment. Reflection padding was used to prevent artificial edge effects [44,50,51].

All augmentations were constrained to small magnitudes to preserve the physical characteristics of the speckle pattern while introducing realistic variability.

2.6.4. Model Construction

To quantitatively relate the extracted optical features to Salmonella concentration, predictive models were constructed using multivariate linear regression. Previous work with the benchtop system explored three models, including polynomial and logistic regression, to capture nonlinear assay responses. Although those methods provided acceptable fits, the linear regression model was selected for the portable system due to its interpretability, computational simplicity, and robust performance. The linear model can represent the near-linear relationship between optical signals and log-transformed bacterial concentrations, providing a physically interpretable framework for implementation.

Model Formulation

The prediction target was defined as the log-transformed bacterial concentration, allowing the model to produce non-negative predictions:

y_{l o g} = \log_{10} (c o n c e n t r a t i o n) .

(3)

For a given input feature vector

x = [x_{1}, x_{2}, \dots, x_{p}]

, the general form of the multivariate linear model is expressed as:

{\hat{y}}_{l o g} = β_{0} + \sum_{i = 1}^{p} β_{i} x_{i},

(4)

where

β_{0}

represents the intercept, and

β_{i}

denotes the coefficient associated with the

i^{t h}

feature.

Separate models were trained for the colorimetric and photothermal datasets, followed by a fused model that combined the standardized features of both modalities. All models were trained using Python’s scikit-learn library [52].

Regularization and Feature Selection

To prevent overfitting and improve model interpretability, Least Absolute Shrinkage and Selection Operator (LASSO) regularization was applied to selected models [53]. The LASSO introduces an

L_{1}

penalty term to the loss function [29]:

{\hat{β}}^{L A S S O} = \arg \min_{β_{0}, β} \{\frac{1}{2 n} \sum_{i = 1}^{n} {(y_{i} - β_{0} - x_{i}^{T} β)}^{2} + λ {‖β‖}_{1}\},

(5)

where

λ

controls the penalty strength. The formulation constrains the magnitude of the regression coefficients, driving less-informative features toward zero and achieving implicit feature selection. The optimal value of

λ

was determined through five-fold cross-validation using the LassoCV function.

For comparison, unregularized linear models were also trained using the same dataset splits and processing steps to evaluate the effect of regularization on prediction performance and generalization.

Training and Validation

All features were standardized and Box–Cox transformed prior to model training to reduce skewness. Following the data augmentation procedure described in Section 2.6.3, a total dataset of 15,000 samples was generated to ensure robust model training. These datasets were randomly split into 80% training (12,000 samples) and 20% test sets (3000 samples), with test samples withheld from the training and cross-validation processes. To assess robustness, a bootstrap resampling (1000 iterations) was performed on the training data to estimate confidence intervals for model performance metrics, including the coefficient of determination (

R^{2}

), Akaike information criterion (AIC), and Bayesian information criterion (BIC) [54,55]. The 2.5th and 97.5th percentiles of these distributions were used to compute the 95% confidence intervals for each metric.

2.7. Sample Preparation and Testing Procedure

A commercially available Salmonella lateral flow assay (STLF-020, BioAssay Works, Imjamsville, MD, USA) was used for all colorimetric and photothermal measurements. The assays are designed for the qualitative detection of broad Salmonella species and were selected due to their use of gold nanoparticle tracers.

The bacterial strain for Salmonella enterica serovar Typhimurium ATCC 14028 was selected as the testing analyte. Prior to testing, bacterial suspensions were serially diluted in phosphate-buffered saline (PBS) to achieve concentrations ranging from 10³ to 10⁷ CFU/mL. All solutions were heat-inactivated at 100 °C for 10 min, to ensure safety. The LFA tests were conducted following the manufacturer’s instructions. For photothermal speckle imaging, test strips were air-dried for approximately 60 min after sample loading to ensure the membrane was dry before measurement, thereby preventing thermal signal loss caused by residual moisture.

For each concentration (10³ to 10⁷ CFU/mL) and negative controls, three independent replicate assays were performed to establish the baseline physical dataset. Colorimetric images were captured after reaction was complete, typically 20 to 25 min after loading the sample. The PBS buffer served as the negative control.

Blind Testing Procedure

To evaluate the objectivity and generalizability of the models, a blind testing protocol was conducted using a separate set of LFA samples prepared independently by a laboratory technician. The operator responsible for imaging and data acquisition was not informed of the bacterial concentrations or content.

Blind samples included additional assays containing Escherichia coli, and Kocuria rhizophila to assess model specificity. All samples were randomized before imaging to eliminate potential bias. The blind test data were processed using the trained regression models. Predicted concentrations were then compared against the true values to assess model accuracy user realistic testing conditions.

3. Results

3.1. Model Performance and Evaluation

Linear regression models were trained and evaluated using datasets derived from each sensing modality to assess predictive performance for Salmonella concentration. Separate models were developed with and without LASSO regularization, following the procedures described in Section 2.6.1. The goal was to determine how feature fusion between colorimetric and photothermal signals affected prediction accuracy and robustness.

3.1.1. Linear Regression Models Without Regularization

Figure 7 shows the predicted versus actual bacterial concentrations obtained from unregularized linear regression models. The colorimetric and fused models exhibited strong linear correspondence with the ideal prediction line, whereas the photothermal model showed larger deviations, particularly at the intermediate range (10⁵–10⁶ CFU/mL), indicating higher variability in the speckle-based features.

Error magnitude plots further highlight this trend. Absolute prediction errors in log₁₀ scale were computed as

|\log_{10} (C_{prediction}) - \log_{10} (C_{true})|

for each sample. The colorimetric dataset yielded low, consistent errors across all concentrations, whereas the photothermal model showed a broader error distribution due to its higher signal-to-noise ratio. When features from both sensing modes were combined, the fused model achieved the most stable performance, with absolute errors consistently below one log unit for nearly all tests. This result suggests that multimodal feature fusion reduces random noise and compensates for variability present in individual sensing modes.

3.1.2. Linear Regression with LASSO Regularization

To evaluate the effect of feature regularization and implicit feature selection, the same datasets were retrained using LASSO-regularized linear regression. The overall predictive trends closely matched those of the unregularized models, indicating that most extracted features contributed meaningfully to the regression performance. As shown in Figure 8, the distribution of absolute prediction errors remained consistent with the unregularized models.

Quantitative performance metrics are summarized in Table 1, including the

R^{2}

, AIC, BIC, and the percentage of predictions within an acceptable error range (

\pm

1 log CFU/mL). This range was selected because bacterial concentrations were labeled and modeled in logarithmic form (log₁₀ CFU/mL), and deviations within

\pm

1 log unit reflect the scale used for model training and evaluation.

Among the individual sensing modes, the colorimetric model achieved the highest reproducibility (

R^{2}

= 0.85) with 96% of predictions falling within the acceptable range. The photothermal model, while less accurate (

R^{2}

= 0.78), maintained reasonable predictive consistency despite inherently higher signal variability. The fused model delivered the best overall performance, achieving

R^{2}

of 0.91 and nearly 99% or predictions within the acceptable error range, confirming the advantage of combining optical and thermal features for enhanced detection ability. The fused model yielded lower AIC and BIC values despite having more input features, indicating that the improvement in model fit outweighed the complexity penalty imposed by the information criteria.

Regularization produced minimal changes in overall

R^{2}

values but reduced model complexity by eliminating less informative coefficients. AIC and BIC values remained comparable between regularized and unregularized models, suggesting that LASSO regularization achieved a similar fit quality while simplifying the feature set. Specifically, the LASSO-regularized models retained five out of six features for the photothermal dataset and eight of nine for fused dataset, indicating that most extracted variables carried meaningful predictive value.

3.2. Blind Testing and Uncertainty Analysis

To evaluate the real-world performance of the trained regression models, blind testing was conducted using eight unknown samples containing varying combinations of Salmonella Typhimurium, Escherichia coli, and Kocuria rhizophila (Table 2). The models described in Section 2.6.4 were used to predict Salmonella concentration for each sample, allowing assessment of model accuracy, specificity, and cross-pathogen robustness.

3.2.1. Prediction of Blind Test Samples

Table 3 summarizes the predicted and actual Salmonella concentrations for all blind test samples. The results demonstrate that colorimetric and fused models reliably predicted Salmonella concentrations down to 10⁴ CFU/mL. The photothermal model exhibited greater deviation, particularly at mid-range concentrations. The fused model produces the most consistent and accurate predictions across the full concentration range, confirming that integrating optical and photothermal features improves robustness and overall detection reliability. However, samples containing only non-target bacteria (E. coli or K. rhizophila) were not classified as negatives, as all models returned low but non-zero concentration estimates of approximately 10³ CFU/mL. This outcome suggests that, at the current sensitivity level, the system cannot yet reliably distinguish between Salmonella presence and background signal near the detection limit.

3.2.2. Model Comparison

Figure 9 illustrates the predicted versus actual Salmonella concentrations for all blind test samples, with shaded green bands representing the

\pm

1 log acceptable error range. Predictions closer to the ideal diagonal line indicate higher accuracy. Both the colorimetric and fused models produced results that largely fell within the acceptable range, whereas the photothermal model showed greater dispersion.

Applying LASSO regularization had minimal impact on overall prediction accuracy, as shown in Figure 9b. The near-identical performance of regularized and unregularized models suggests that the extracted features were not redundant and that each contributed meaningfully to the regression outcome. While fusion of optical and thermal features improved consistency and reduced error variance, none of the models successfully classified negative or non-target samples as true negatives. All produced non-zero outputs, reinforcing that the current detection threshold lies between negative and 10⁴ CFU/mL.

3.2.3. Uncertainty Quantification

To assess prediction reliability and model robustness, uncertainty was quantified using a bootstrap-based confidence interval approach. For each sensing modality, the training dataset was resampled 1000 times with replacement, and a new regression model was refitted at each iteration Predicted concentrations for each blind test sample were then recorded across all bootstrap iterations. The 2.5th and 97.5th percentiles of the resulting distributions defined the lower and upper bounds of the 95% confidence interval in log space. The relative uncertainty of each sample was then calculated as

P e r c e n t u n c e r t a i n t y = (10^{\frac{{l o g (C I}_{upper}) - {l o g (C I}_{lower})}{2}} - 1) \times 100 % .

(6)

The calculated uncertainties, which serve as a measure of assay reproducibility (analogous to the coefficient of variation), ranged between 4% and 13% across all models. The colorimetric model exhibited the lowest uncertainty (4–6%) while the photothermal and fused models exhibited slightly higher uncertainty (7–13%) due to greater temporal variability in speckle-derived measurements and higher feature dimensionality.

3.3. Summary of System Performance and Limit of Detection

The overall performance of the developed biosensor was evaluated by comparing the results from visual inspection, the previous benchtop system, and the final portable dual-mode platform. The limit of detection was defined as the lowest bacterial concentration that could be reliably distinguished from the negative control.

For the visual colorimetric assay, Salmonella test line became indistinguishable to the naked eye at concentrations below 10⁵ CFU/mL, corresponding to a practical LOD of approximately 10⁵ CFU/mL. Similarly, both individual sensing modes of the benchtop system yielded an LOD near 10⁵ CFU/mL.

In contrast, the portable dual-mode biosensor demonstrated improved sensitivity. Using machine learning regression models trained on fused colorimetric and photothermal features, the system consistently differentiated 10⁴ CFU/mL samples from PBS and non-Salmonella controls. Although detection at 10³ CFU/mL remained inconsistent, the portable device maintained stable quantitative predictions at 10⁴ CFU/mL, whereas visual and benchtop analysis failed.

This improvement represents an order-of-magnitude increase in detection capability, from a limit of 10⁵ CFU/mL observed in both visual inspection and our previous benchtop study to 10⁴ CFU/mL with the portable dual-mode system. The observed gain in sensitivity can be attributed to improved optical optimization in the compact device design, robust data fusion of colorimetric and photothermal features, and machine learning-based regression that reduced the influence of user subjectivity and measurement variability. Overall, these findings confirm that the integrated dual-mode machine learning-assisted biosensor enables more quantitative, low-cost, and field-deployable detection of foodborne pathogens.

4. Discussion

This study presents the development of a smartphone-integrated, dual-mode biosensor that combines colorimetric and photothermal speckle imaging to enhance the sensitivity of lateral flow assays. Through regression modeling, the portable system achieved an order-of-magnitude improvement in detection sensitivity compared to visual interpretation and benchtop analysis, reducing the limit of detection from 10⁵ CFU/mL to 10⁴ CFU/mL, as reported in our previous work. The device maintained consistent performance during blind testing, demonstrating its potential for quantitative and field-deployable pathogen detection using standard smartphones.

While the colorimetric sensing mode alone provided high reproducibility and stability, integrating photothermal speckle imaging contributed robustness, compensating for signal variability that can arise under different lighting or surface conditions. The fusion mechanism relies on the complementary nature of two signals. The colorimetric mode provides a low-variance baseline that stabilizes the prediction, while the photothermal mode contributes high-sensitivity features that persist even when the visual test line fades. Mathematically, the regression model leverages the colorimetric stability to constrain the higher variance inherent in the speckle fluctuations, preventing noise from driving false positives while retaining the sensitivity benefits of the thermal signal. Although the improvement in prediction accuracy from multimodal fusion was moderate under current testing conditions, the dual-mode framework may become more impactful under variable environmental or sample conditions.

Several factors currently limit the system’s detection sensitivity and practical implementation. The most notable challenge is the difficulty in differentiating between negative controls and 10³ CFU/mL samples. This may be influenced by residual membrane moisture, which can reduce signal strength during photothermal imaging. In addition, timing differences between colorimetric and photothermal measurements can introduce variability. Since colorimetric imaging is performed immediately after the readout period specified by the vendor, while photothermal imaging requires a drying step, the delay may lead to test line bleeding. To ensure real-world applicability, the proposed system design can be upgraded to include a low-power heating element or fan to accelerate membrane drying after sample application. This would synchronize the optimal signal reading times for both modalities, minimizing the delay and preventing signal degradation caused by residual moisture. Future work will focus on several directions:

Robustness: Test additional LFA vendors to assess reproducibility and model generalizability
Sensitivity: Explore higher-power excitation, improved lighting, and alternative image sensors to achieve lower detection limits
Workflow: Use controlled drying to reduce timing disparities and prevent test line bleeding.
Application generalization: Adapt the platform to multiplexed or non-pathogen assays to expand applicability to clinical, food safety, and environmental diagnostics.

Overall, this study established a foundation for quantitative, low-cost, and field-deployable biosensing that bridges optical sensing and computational power. The integration of machine learning with dual optical modalities is a step toward robust, accessible diagnostic tools capable of providing reliable results outside traditional laboratory settings.

Author Contributions

Conceptualization, J.B., E.B. and J.P.R.; Methodology, J.B. and B.C.; Software, J.B. and B.R.; Formal analysis, J.B., B.R. and J.P.R.; Resources, B.C. and J.P.R.; Writing—original draft preparation, J.B.; Writing—review and editing, B.R., J.P.R. and E.B.; Visualization, J.B.; Supervision, E.B.; Project administration, E.B.; Funding acquisition, B.R., J.P.R. and E.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Center for Food Safety Engineering at Purdue University, funded by the U.S. Department of Agriculture, Agricultural Research Service, under Agreement No. 59-8072-6-001 and by the Technology Innovation Program (RS-2024-00419426, Development of light-electron beam based measurement and analysis instrument technologies for advanced packaging) funded By the Ministry of Trade, Industry & Energy (MOTIE, Korea). Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the view of the U.S. Department of Agriculture.

Data Availability Statement

All data and code supporting the findings of this study are publicly available in the following GitHub repository (Version v1.0.0): https://github.com/jully-blackshare/MDPI_Dual_Mode_Biosensor (accessed on 2 December 2025).

Acknowledgments

The authors acknowledge the use of artificial intelligence tools for assistance in debugging code scripts and for editing to improve grammar and clarity during the preparation of this manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Elbehiry, A.; Abalkhail, A.; Marzouk, E.; Elmanssury, A.E.; Almuzaini, A.M.; Alfheeaid, H.; Alshahrani, M.T.; Huraysh, N.; Ibrahem, M.; Alzaben, F.; et al. An Overview of the Public Health Challenges in Diagnosing and Controlling Human Foodborne Pathogens. Vaccines 2023, 11, 725. [Google Scholar] [CrossRef]
Scallan, E.; Hoekstra, R.M.; Angulo, F.J.; Tauxe, R.V.; Widdowson, M.-A.; Roy, S.L.; Jones, J.L.; Griffin, P.M. Foodborne Illness Acquired in the United States—Major Pathogens. Emerg. Infect. Dis. 2011, 17, 7–15. [Google Scholar] [CrossRef]
Scharff, R.L. Economic Burden from Health Losses Due to Foodborne Illness in the United States. J. Food Prot. 2012, 75, 123–131. [Google Scholar] [CrossRef] [PubMed]
Hoffmann, S.; Batz, M.B.; Morris, J.G. Annual Cost of Illness and Quality-Adjusted Life Year Losses in the United States Due to 14 Foodborne Pathogens. J. Food Prot. 2012, 75, 1292–1302. [Google Scholar] [CrossRef]
Bintsis, T. Department of International Trade, TEI of West Macedonia, Kastoria, Greece Foodborne Pathogens. AIMS Microbiol. 2017, 3, 529–563. [Google Scholar] [CrossRef]
Sharif, M.K.; Javed, K.; Nasir, A. Foodborne Illness: Threats and Control. In Foodborne Diseases; Elsevier: Amsterdam, The Netherlands, 2018; pp. 501–523. ISBN 978-0-12-811444-5. [Google Scholar]
Saravanan, A.; Kumar, P.S.; Hemavathy, R.V.; Jeevanantham, S.; Kamalesh, R.; Sneha, S.; Yaashikaa, P.R. Methods of Detection of Food-Borne Pathogens: A Review. Environ. Chem. Lett. 2021, 19, 189–207. [Google Scholar] [CrossRef]
U.S. Department of Agriculture, Food Safety and Inspection Service. Guideline for E. coli Testing in Slaughter Establishments; USDA’s Food Safety and Inspection Service (FSIS): Washington, DC, USA, 2023.
Valderrama, W.B.; Dudley, E.G.; Doores, S.; Cutter, C.N. Commercially Available Rapid Methods for Detection of Selected Food-Borne Pathogens. Crit. Rev. Food Sci. Nutr. 2016, 56, 1519–1531. [Google Scholar] [CrossRef] [PubMed]
Campbell, V.R.; Carson, M.S.; Lao, A.; Maran, K.; Yang, E.J.; Kamei, D.T. Point-of-Need Diagnostics for Foodborne Pathogen Screening. SLAS Technol. 2021, 26, 55–79. [Google Scholar] [CrossRef]
Bahadır, E.B.; Sezgintürk, M.K. Lateral Flow Assays: Principles, Designs and Labels. TrAC Trends Anal. Chem. 2016, 82, 286–306. [Google Scholar] [CrossRef]
Nguyen, V.-T.; Song, S.; Park, S.; Joo, C. Recent Advances in High-Sensitivity Detection Methods for Paper-Based Lateral-Flow Assay. Biosens. Bioelectron. 2020, 152, 112015. [Google Scholar] [CrossRef] [PubMed]
Min, H.J.; Mina, H.A.; Deering, A.J.; Bae, E. Development of a Smartphone-Based Lateral-Flow Imaging System Using Machine-Learning Classifiers for Detection of Salmonella spp. J. Microbiol. Methods 2021, 188, 106288. [Google Scholar] [CrossRef]
Jung, Y.; Heo, Y.; Lee, J.J.; Deering, A.; Bae, E. Smartphone-Based Lateral Flow Imaging System for Detection of Food-Borne Bacteria E.Coli O157:H7. J. Microbiol. Methods 2020, 168, 105800. [Google Scholar] [CrossRef]
Hristov, D.R.; Rodriguez-Quijada, C.; Gomez-Marquez, J.; Hamad-Schifferli, K. Designing Paper-Based Immunoassays for Biomedical Applications. Sensors 2019, 19, 554. [Google Scholar] [CrossRef]
Wang, C.; Cheng, Y.; Yin, X.; Wu, Q.; Ma, J.; Zhang, Q.; Zhao, L.; Wang, J.; Zhang, D. “Three-in-One”: Ultrasensitive Lateral Flow Immunoassay Driven by Magnetic Enrichment and Photothermal Signal Amplification. J. Agric. Food Chem. 2024, 72, 18171–18180. [Google Scholar] [CrossRef]
Thenuwara, G.; Akhtar, P.; Javed, B.; Singh, B.; Byrne, H.J.; Tian, F. Recent Advancements in Lateral Flow Assays for Food Mycotoxin Detection: A Review of Nanoparticle-Based Methods and Innovations. Toxins 2025, 17, 348. [Google Scholar] [CrossRef] [PubMed]
Li, Q.; Chen, M.; Ren, J.; Li, S.; Bu, T.; Wang, Y.; Huang, X.; Song, L.; Liu, L.; Luo, R.; et al. Evaluation and Reduction of Matrix Effect by Deep Eutectic Solvents in Lateral Flow Assays: A Case Study for Zearalenone Detection in Different Food Matrices. Sens. Actuators B Chem. 2025, 440, 137901. [Google Scholar] [CrossRef]
Jiang, T.; Song, Y.; Wei, T.; Li, H.; Du, D.; Zhu, M.-J.; Lin, Y. Sensitive Detection of Escherichia Coli O157:H7 Using Pt–Au Bimetal Nanoparticles with Peroxidase-like Amplification. Biosens. Bioelectron. 2016, 77, 687–694. [Google Scholar] [CrossRef]
Park, J.-M.; Jung, H.-W.; Chang, Y.W.; Kim, H.-S.; Kang, M.-J.; Pyun, J.-C. Chemiluminescence Lateral Flow Immunoassay Based on Pt Nanoparticle with Peroxidase Activity. Anal. Chim. Acta 2015, 853, 360–367. [Google Scholar] [CrossRef]
Yan, S.; Liu, C.; Fang, S.; Ma, J.; Qiu, J.; Xu, D.; Li, L.; Yu, J.; Li, D.; Liu, Q. SERS-Based Lateral Flow Assay Combined with Machine Learning for Highly Sensitive Quantitative Analysis of Escherichia Coli O157:H7. Anal. Bioanal. Chem. 2020, 412, 7881–7890. [Google Scholar] [CrossRef]
Kang, H.; Buchman, J.T.; Rodriguez, R.S.; Ring, H.L.; He, J.; Bantz, K.C.; Haynes, C.L. Stabilization of Silver and Gold Nanoparticles: Preservation and Improvement of Plasmonic Functionalities. Chem. Rev. 2019, 119, 664–699. [Google Scholar] [CrossRef] [PubMed]
Wei, J.C.; Gao, Y.M.; Li, G.Q.; Wang, Z.-J.; Vai, M.I.; Mak, P.U.; Du, M.; Hang, P.S. A Lateral Flow Immunochromatographic Assay (LFIA) Strip Reader Based on Scheduler and 8051 IP Core. Automatika 2016, 57, 1079–1088. [Google Scholar] [CrossRef]
Mudanyali, O.; Dimitrov, S.; Sikora, U.; Padmanabhan, S.; Navruz, I.; Ozcan, A. Integrated Rapid-Diagnostic-Test Reader Platform on a Cellphone. Lab Chip 2012, 12, 2678. [Google Scholar] [CrossRef]
Pilavaki, E.; Demosthenous, A. Optimized Lateral Flow Immunoassay Reader for the Detection of Infectious Diseases in Developing Countries. Sensors 2017, 17, 2673. [Google Scholar] [CrossRef] [PubMed]
Saisin, L.; Amarit, R.; Somboonkaew, A.; Gajanandana, O.; Himananto, O.; Sutapun, B. Significant Sensitivity Improvement for Camera-Based Lateral Flow Immunoassay Readers. Sensors 2018, 18, 4026. [Google Scholar] [CrossRef] [PubMed]
Qin, Z.; Chan, W.C.W.; Boulware, D.R.; Akkin, T.; Butler, E.K.; Bischof, J.C. Significantly Improved Analytical Sensitivity of Lateral Flow Immunoassays by Using Thermal Contrast. Angew. Chem. Int. Ed. 2012, 51, 4358–4361. [Google Scholar] [CrossRef]
Song, S.; Choi, S.; Ryu, S.; Kim, S.; Kim, T.; Shin, J.; Jung, H.-I.; Joo, C. Highly Sensitive Paper-Based Immunoassay Using Photothermal Laser Speckle Imaging. Biosens. Bioelectron. 2018, 117, 385–391. [Google Scholar] [CrossRef]
Blackshare, J.; Mina, H.A.; Deering, A.J.; Rajwa, B.; Robinson, J.P.; Bae, E. Enhancing Sensitivity of Commercial Gold Nanoparticle-Based Lateral Flow Assays: A Comparative Study of Colorimetric and Photothermal Approaches. Sensors 2025, 25, 4904. [Google Scholar] [CrossRef]
Raspberry Pi Ltd. Picamera2, version 0.3.27; Raspberry Pi Ltd.: Cambridge, UK, 2025.
MIT App Inventor Project. MIT App Inventor, release nb201; MIT: Cambridge, MA, USA, 2025.
The Pallets Projects. Flask, version 3.1.1; The Pallets Projects: London, UK, 2025.
Bradski, G. The OpenCV Library. Dr. Dobb’s J. Softw. Tools Prof. Program. 2000, 25, 120–123. [Google Scholar]
Harris, C.R.; Millman, K.J.; Van Der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array Programming with NumPy. Nature 2020, 585, 357–362. [Google Scholar] [CrossRef]
Koenderink, J.J. The Structure of Images. Biol. Cybern. 1984, 50, 363–370. [Google Scholar] [CrossRef]
Mazzoleni, P.; Matta, F.; Zappa, E.; Sutton, M.A.; Cigada, A. Gaussian Pre-Filtering for Uncertainty Minimization in Digital Image Correlation Using Numerically-Designed Speckle Patterns. Opt. Lasers Eng. 2015, 66, 19–33. [Google Scholar] [CrossRef]
Otsu, N. A Threshold Selection Method from Gray-Level Histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
García, S.; Luengo, J.; Herrera, F. Data Preprocessing in Data Mining; Intelligent Systems Reference Library; Springer International Publishing: Cham, Switzerland, 2015; Volume 72, ISBN 978-3-319-10246-7. [Google Scholar]
Box, G.E.P.; Cox, D.R. An Analysis of Transformations. J. R. Stat. Soc. Ser. B Stat. Methodol. 1964, 26, 211–243. [Google Scholar] [CrossRef]
Rousseeuw, P.J.; Croux, C. Alternatives to the Median Absolute Deviation. J. Am. Stat. Assoc. 1993, 88, 1273–1283. [Google Scholar] [CrossRef]
Hecht, E. Optics, 5th ed.; Global Edition; Pearson: London, UK, 2017; ISBN 978-1-292-09693-3. [Google Scholar]
Goodman, J.W. Speckle Phenomena in Optics: Theory and Applications; Roberts: Englewood, CO, USA, 2007; ISBN 978-0-9747077-9-2. [Google Scholar]
Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef]
Gonzalez, R.C.; Woods, R.E. Digital Image Processing, 4th ed.; global edition; Pearson Education: New York, NY, USA, 2018; ISBN 978-1-292-22304-9. [Google Scholar]
Proakis, J.G.; Manolakis, D.G. Digital Signal Processing: Principles, Algorithms, and Applications, 4th ed.; Pearson/Prentice Hall: Upper Saddle River, NJ, USA, 2007; ISBN 978-0-13-187374-2. [Google Scholar]
Khosla, C.; Saini, B.S. Enhancing Performance of Deep Learning Models with Different Data Augmentation Techniques: A Survey. In Proceedings of the 2020 International Conference on Intelligent Engineering and Management (ICIEM), Moscow, Russia, 17–19 June 2020; IEEE: London, UK, 2020; pp. 79–85. [Google Scholar]
Lee, S.; Kim, S.; Yoon, D.S.; Park, J.S.; Woo, H.; Lee, D.; Cho, S.-Y.; Park, C.; Yoo, Y.K.; Lee, K.-B.; et al. Sample-to-Answer Platform for the Clinical Evaluation of COVID-19 Using a Deep Learning-Assisted Smartphone-Based Assay. Nat. Commun. 2023, 14, 2361. [Google Scholar] [CrossRef]
Iwana, B.K.; Uchida, S. An Empirical Survey of Data Augmentation for Time Series Classification with Neural Networks. PLoS ONE 2021, 16, e0254841. [Google Scholar] [CrossRef]
Chen, S.; Dobriban, E.; Lee, J.H. A Group-Theoretic Framework for Data Augmentation. J. Mach. Learn. Res. 2020, 21, 1–71. [Google Scholar]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015. [Google Scholar]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
Akaike, H. A New Look at the Statistical Model Identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
Schwarz, G. Estimating the Dimension of a Model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]

Figure 1. Schematic overview of the two sensing modalities integrated into the dual-mode biosensor. The photothermal imaging diagram depicts the binding event at the test line, where yellow spheres represent gold nanoparticles conjugated with antibodies. The colorimetric mode quantifies optical intensity changes at the test line using RGB image analysis, while the photothermal mode measure speckle fluctuations induced by plasmonic heating under 532 nm excitation.

Figure 2. Dimensional comparison of the assembled biosensor with a smartphone, highlighting the compact form factor (150 mm × 140 mm × 140 mm).

Figure 3. Overview of the portable dual-mode biosensor design: (a) CAD model showing external view of the 3D-printed enclosure with labeled access ports and cartridge slot; (b) internal optical layout showing the arrangement of optical and electrical components; (c) fabricated prototype of the portable biosensor, highlighting the assembled optical and structural components.

Figure 4. Workflow of the smartphone-integrated biosensor. The smartphone transmits user commands via HTTP to the Raspberry Pi, which controls LED strips, lasers, and camera. Colorimetric or photothermal data are stored locally for processing and results are returned to the smartphone for visualization.

Figure 5. Workflow of colorimetric image processing. (a) Capturing a JPEG image of the latera flow assay and decompose into RGB, and grayscale channels. (b) The yellow line indicates the path used to extract a line intensity profile across the test line, identifying test line region and background area.

Figure 6. Representative photothermal data augmentation workflow. Extreme parameter values are shown here to help visualize the changes, whereas smaller magnitudes were used in actual model raining to maintain realistic variability. (a) Circular time shift simulating temporal phase offset (arrow indicates transformation from original to augmented signal); (b) photometric jitter introducing sensor gain, bias, and Gaussian noise variations; (c) minor affine transformation applying small rotation and translation to mimic sample misalignment.

Figure 7. Performance of unregularized linear regression models on the test set (n = 3000) for predicting Salmonella concentration. (a–c) Predicted versus actual log concentrations for colorimetric, photothermal, and fused datasets. The dashed red line represents the ideal 1:1 prediction. (d–f) Corresponding absolute prediction errors expressed as

|\log_{10} (C_{prediction}) - \log_{10} (C_{true})|

.

Figure 7. Performance of unregularized linear regression models on the test set (n = 3000) for predicting Salmonella concentration. (a–c) Predicted versus actual log concentrations for colorimetric, photothermal, and fused datasets. The dashed red line represents the ideal 1:1 prediction. (d–f) Corresponding absolute prediction errors expressed as

|\log_{10} (C_{prediction}) - \log_{10} (C_{true})|

.

Figure 8. Performance of LASSO-regularized linear regression models on the test set (n = 3000) for predicting Salmonella concentration. (a–c) Predicted versus actual log concentrations for colorimetric, photothermal, and fused datasets. The dashed line represents the ideal prediction line. (d–f) Corresponding absolute prediction errors expressed as

|\log_{10} (C_{prediction}) - \log_{10} (C_{true})|

.

Figure 8. Performance of LASSO-regularized linear regression models on the test set (n = 3000) for predicting Salmonella concentration. (a–c) Predicted versus actual log concentrations for colorimetric, photothermal, and fused datasets. The dashed line represents the ideal prediction line. (d–f) Corresponding absolute prediction errors expressed as

|\log_{10} (C_{prediction}) - \log_{10} (C_{true})|

.

Figure 9. Predicted versus actual Salmonella concentrations for blind test samples. (a) Unregularized linear regression models using colorimetric (yellow circles), photothermal (orange squares), and fused (purple triangles) datasets. (b) Corresponding LASSO-regularized models. Labels A-H correspond to the specific blind test sample listed in Table 2. The dashed line represents ideal prediction, and the shaded green region indicates the

\pm

1 log error range.

Figure 9. Predicted versus actual Salmonella concentrations for blind test samples. (a) Unregularized linear regression models using colorimetric (yellow circles), photothermal (orange squares), and fused (purple triangles) datasets. (b) Corresponding LASSO-regularized models. Labels A-H correspond to the specific blind test sample listed in Table 2. The dashed line represents ideal prediction, and the shaded green region indicates the

\pm

1 log error range.

Table 1. Performance summary of unregularized and LASSO-regularized linear regression models trained on colorimetric, photothermal, and fused datasets. Metrics include the coefficient of determination (

R^{2}

), Akaike information criterion (AIC), Bayesian information criterion (BIC), and the percentage of predictions within ±1 log CFU/mL error range. Values in parentheses indicate 95% confidence intervals obtained from 1000 bootstrap iterations.

Table 1. Performance summary of unregularized and LASSO-regularized linear regression models trained on colorimetric, photothermal, and fused datasets. Metrics include the coefficient of determination (

R^{2}

), Akaike information criterion (AIC), Bayesian information criterion (BIC), and the percentage of predictions within ±1 log CFU/mL error range. Values in parentheses indicate 95% confidence intervals obtained from 1000 bootstrap iterations.

Trained Dataset	Regularized	R²	AIC	BIC	Predictions Within Acceptable Error Range
Colorimetric	No	0.849 (0.848, 0.851)	4926.9 (4896.0, 4958.7)	4957.0 (4926.0, 4988.7)	96.4%
Colorimetric	Yes	0.849 (0.847, 0.850)	4942.0 (4906.9, 4977.0)	4971.9 (4937.0, 5007.0)	95.9%
Photothermal	No	0.778 (0.777, 0.778)	6093.1 (6084.8, 6102.0)	6123.1 (6114.8, 6132.0)	88.3%
Photothermal	Yes	0.778 (0.777, 0.778)	6093.5 (6805.4, 6102.4)	6123.6 (6115.4, 6132.4)	88.3%
Colorimetric + Photothermal	No	0.910 (0.905, 0.915)	3425.7 (3405.3, 3448.0)	3455.7 (3435.4, 3478.0)	98.7%
Colorimetric + Photothermal	Yes	0.908 (0.907, 0.908)	3460.0 (3434.2, 3485.8)	3490.0 (3464.2, 3515.8)	95.9%

Table 2. Composition of blind test samples used for model validation. Each sample contained varying combinations of Salmonella Typhimurium, Escherichia coli, and Kocuria rhizophila. The actual log bacterial concentrations represent the total bacterial load, while the actual log Salmonella concentration indicates only the portion attributable specifically to Salmonella within each sample.

Test	Bacteria	bacterial Concentration (CFU/mL)	Salmonella Concentration (CFU/mL)
A	Salmonella	4	4
B	Salmonella + E. coli	4	4
C	Salmonella	6	6
D	Kocuria rhizophila	5	0
E	Salmonella + E. coli	3	3
F	E. coli	5	0
G	Salmonella	3	3
H	Salmonella + E. coli	5	5

Table 3. Predicted Salmonella concentrations for blind test samples obtained using colorimetric, photothermal, and fused regression models, with and without LASSO regularization. Each value represents the predicted log concentration generated by the trained models described in Section 2.6.4.

	Predicted Log (Salmonella Concentration-CFU/mL)
Salmonella Concentration (CFU/mL)	Colorimetric	Photothermal	Fused	Colorimetric w/LASSO	Photothermal w/LASSO	Fused w/LASSO
4	3.48	5.47	4.04	3.50	5.46	4.10
4	3.85	4.24	3.99	3.85	4.24	3.90
6	5.24	6.82	5.95	5.28	6.81	5.91
0	3.64	3.37	3.36	3.66	3.38	3.33
3	3.63	3.75	3.49	3.64	3.76	3.50
0	3.95	4.20	3.99	3.91	4.21	3.88
3	3.99	4.08	4.06	4.03	4.08	3.97
5	5.06	4.38	4.82	5.05	4.39	4.73

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Blackshare, J.; Corman, B.; Rajwa, B.; Robinson, J.P.; Bae, E. Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation. Biosensors 2026, 16, 57. https://doi.org/10.3390/bios16010057

AMA Style

Blackshare J, Corman B, Rajwa B, Robinson JP, Bae E. Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation. Biosensors. 2026; 16(1):57. https://doi.org/10.3390/bios16010057

Chicago/Turabian Style

Blackshare, Jully, Brianna Corman, Bartek Rajwa, J. Paul Robinson, and Euiwon Bae. 2026. "Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation" Biosensors 16, no. 1: 57. https://doi.org/10.3390/bios16010057

APA Style

Blackshare, J., Corman, B., Rajwa, B., Robinson, J. P., & Bae, E. (2026). Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation. Biosensors, 16(1), 57. https://doi.org/10.3390/bios16010057

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Portable Dual-Mode Biosensor for Quantitative Determination of Salmonella in Lateral Flow Assays Using Machine Learning and Smartphone-Assisted Operation

Abstract

1. Introduction

2. Materials and Methods

2.1. Hardware Architecture

2.2. Portable Device Design

2.3. Smartphone Integration

2.4. Colorimetric Analysis

2.5. Photothermal Analysis

2.6. Machine Learning

2.6.1. Colorimetric Feature Selection

2.6.2. Photothermal Feature Selection

Spatial Decorrelation Metrics

Spectral Distribution Metrics

2.6.3. Data Augmentation

Colorimetric Data

Photothermal Data

2.6.4. Model Construction

Model Formulation

Regularization and Feature Selection

Training and Validation

2.7. Sample Preparation and Testing Procedure

Blind Testing Procedure

3. Results

3.1. Model Performance and Evaluation

3.1.1. Linear Regression Models Without Regularization

3.1.2. Linear Regression with LASSO Regularization

3.2. Blind Testing and Uncertainty Analysis

3.2.1. Prediction of Blind Test Samples

3.2.2. Model Comparison

3.2.3. Uncertainty Quantification

3.3. Summary of System Performance and Limit of Detection

4. Discussion

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI