Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries

Huamanga-Chumbes, Lujan E.; Sacoto-Cabrera, Erwin J.; Lloret, Jaime; Silva-Alvarado, Vinie Lee; Huicho-Mendigure, Alfz; Moreno-Cardenas, Edison

doi:10.3390/s26061756

Open AccessArticle

Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries

by

Lujan E. Huamanga-Chumbes

¹

,

Erwin J. Sacoto-Cabrera

²

,

Jaime Lloret

³

,

Vinie Lee Silva-Alvarado

³

,

Alfz Huicho-Mendigure

¹

and

Edison Moreno-Cardenas

^1,4,*

¹

TESLA Laboratory, Universidad Nacional de San Antonio Abad del Cusco (UNSAAC), Cusco 08003, Peru

²

GIHP4C, Universidad Politécnica Salesiana, Cuenca 010102, Ecuador

³

Instituto de Investigación para la Gestión Integrada de Zonas Costeras, Universitat Politècnica de València, Carrer del Paranimf 1, Grao de Gandia, 46730 Valencia, Spain

⁴

Technology and Engineering Group, EM Research & Tech, Cusco 08003, Peru

^*

Author to whom correspondence should be addressed.

Sensors 2026, 26(6), 1756; https://doi.org/10.3390/s26061756

Submission received: 13 February 2026 / Revised: 6 March 2026 / Accepted: 7 March 2026 / Published: 10 March 2026

(This article belongs to the Section Sensing and Imaging)

Download

Browse Figures

Versions Notes

Abstract

Neonatal health requires precise lipid quantification in human milk to ensure proper nutritional development. Traditional manual methods, such as the creamatocrit, are limited by human-induced bias and significant measurement uncertainty. This study presents a low-cost Computer Vision System acting as an automated optical sensing modality for estimate the cream fraction (c) using advanced Machine Learning regression, which is subsequently used to derive fat and energy quantification through established analytical equations. The system is optimized for the Gold-LED spectrum, which enhances the dynamic range to 226 a.u. for robust feature extraction. We evaluated 28 distinct ML regression models across three feature spaces (Gray Scale, RGB, and Combined). The results, based on 6400 samples, demonstrate that the Rational Quadratic GPR model achieved the highest predictive stability with a coefficient of determination of

R^{2} = 0.867

. This computational framework achieved a 57.5% reduction in relative error compared to manual benchmarks. SHAP analysis indicates that the model selectively attributes higher importance to Red channel intensities and Blue contrast gradients, which correspond to the optical scattering characteristics of lipid globules. These findings validate the system as a stable sensing modality for non-invasive quantification. The proposed architecture integrates cost-effective hardware with high-precision analytical modeling, offering a reagent-free and operationally feasible alternative for standardized nutritional assessment in neonatal intensive care units and milk banks.

Keywords:

human milk; computer vision system; machine learning; neonatal nutrition; image segmentation; uncertainty analysis; clinical informatics

1. Introduction

One of the most essential pillars for the immediate survival of newborns and their long-term neurocognitive and metabolic development is neonatal nutrition. Within the ecosystem of Neonatal Intensive Care Units (NICU), human milk (HM) is recognized as the nutritional gold standard in milk banks (MB) [1,2]. However, its nature is not static; it constitutes a highly dynamic biological matrix linked to a biological signature that reflects maternal factors, gestational age, and circadian rhythms [3,4]. Within this complex matrix, fatty acids (FA) are critical, contributing with neutral lipids and representing the most energy-dense component necessary for neurological and retinal maturation [1,3,5].

The central problem is that lipid content is the most volatile macronutrient in HM, with fluctuations that can compromise postnatal growth trajectories if not accurately quantified prior to administration [6,7]. Recent studies emphasize that the specific fatty acid (FA) profile undergoes critical changes during milk maturation and in response to maternal metabolic status [5]. Factors such as pre-pregnancy body mass index (pBMI) and dietary intake of long-chain polyunsaturated fatty acids (LCPUFA) significantly alter the quality of the milk product [8], leading to essential nutrient deficits in very-low-birth-weight infants [9]. Given this heterogeneity, the practice of using breast milk without precise quantification is insufficient to meet the requirements of preterm neonates who depend on personalized supplementation to avoid extrauterine growth restriction (EUGR) [10,11].

The management of this variability is exacerbated by technical limitations at the point of care. Effective monitoring of caloric density is mandatory [12], but current tools are often costly and inaccessible to many health centers. In particular, there are no devices developed specifically for measuring human breast milk [13], suggesting a significant lack of solutions focused on women’s health needs. Reference methods like infrared spectroscopy [14] need specialized maintenance and large capital expenditures, which are not always accessible in rural areas. Additionally, because components like lipids and insulin vary significantly between feeding phases, the absence of standardized sampling protocols introduces bias [15]. Thus, it is critical to create quick, portable, and easily accessible solutions that optimize the alignment between clinical viability and laboratory standards [16]. In response to this need, computer vision systems (CVS) function as non-invasive optical sensors that translate spatial and intensity data into compositional metrics. This approach provides a high-resolution alternative to manual inspection by utilizing digital image processing as a primary sensing modality [17,18,19,20]. The physical principle of this sensing modality relies on the optical contrast at the interface of milk phases. The lipid-rich cream layer exhibits higher light-scattering and reflection coefficients compared to the aqueous serum, resulting in a distinct pixel intensity distribution. These distribution profiles encode the volumetric information of the sample, making quantitative regression feasible by mapping digital features to physical lipid concentration. The rise of telemedicine and the use of AI to predict complications, such as EUGR [21], demonstrate the viability of digital solutions to optimize neonatal care [22]. In this context, the combination of a minimalist hardware design with robust processing algorithms allows for superior traceability and quality control [23]. Nutrition personalization, supported by digital tools, facilitates the adaptation of nutritional strategies to the individual needs of the most fragile users [24]. Even with aggressive care bundles that include parenteral lipid emulsions to mitigate weight loss, the transition to optimal enteral nutrition remains a critical challenge [25,26]. The development of a system capable of isolating and quantifying the cream interface in microcapillaries in an automated manner not only eliminates the subjectivity of the analog creamatocrit method but also provides a scalable and economical tool.

On the other hand, recent advances in intelligent detection systems enable the integration of low-cost hardware with data-driven algorithms to perform reliable physical measurements in non-laboratory environments [27]. AI-enhanced integrated control platforms have demonstrated stable real-world implementation of sensor-based systems. Likewise, machine learning (ML) [28,29] applied to physical sensor data has enabled the automation of diagnostic tasks that replace subjective human assessment [20]. The combination of image processing with IoT-based acquisition architectures further confirms computer vision as a scalable detection modality in real-world settings [30]. These advances reinforce the feasibility of objective, automated nutritional assessment through intelligent visual detection.

The main objective of this study is twofold. First, to automate the quantification of cream fraction using a low-cost computer vision system, eliminating the subjectivity inherent in the analogue creamatocrit method. Second, to validate a measurement model capable of operating with high fidelity under controlled lighting conditions. This is achieved by integrating a standardized acquisition prototype with microcapillaries, which allows the isolation and quantification of the cream interface. A central aspect of this approach is the implementation of a Rational Quadratic Gaussian Process Regression (GPR) model. This specific ML architecture was selected after a thorough evaluation of 28 different regression algorithms due to its superior ability to model the complex non-linear relationship between pixel-based volumetric measurements and actual lipid concentration, while effectively managing the heteroscedasticity of biological samples. This system provides a robust, economical, scalable, and high-quality tool.

The rest of the study is divided into six additional sections. Section 2 details the most relevant related studies. Section 3 details the Materials and Methods. The results are presented in Section 4, followed by the Discussion in Section 5 and the Conclusions with future perspectives in Section 6.

2. Related Works

The evaluation of the nutritional quality of HM has historically relied on complex laboratory analyses [14]. These have required high-cost equipment, highly specialized personnel, and prolonged processing times that prevent immediate clinical decision-making [23]. Recent studies have underlined that the price of Mid-Infrared (MIR) instruments continues to increase due to their optical complexity and that alternative methods, such as ultrasound, although more economical, demand excessive sample volumes and are susceptible to calibration errors [6]. Beyond traditional analytical chemistry, the emergence of vision-as-sensor frameworks has redefined imaging pipelines as quantitative optical transducers. In these systems, the camera serves as a non-contact probe that captures the interaction between light and the biological sample. Unlike qualitative imaging, this sensing modality relies on the metrological extraction of physical boundaries and optical density variations, where the digital sensor replaces physical gauges to provide higher spatial resolution and repeatability. Given this scenario, computer vision has emerged as a disruptive solution, enabling non-destructive analysis that simulates human inspection capacity while offering superior mathematical objectivity.

At the most advanced level of this technological transition, Deep Learning (DL) and Convolutional Neural Networks (CNN) have demonstrated the ability to transform visual features into predictive metrics with unprecedented accuracy. Investigations by Dahiya et al. [31] have validated the use of models such as InceptionV3 and Vision Transformers to predict yield and lacteal nutritional quality from images, reaching accuracies of up to 85.6%. Specifically in the human field, Jin et al. [32] have developed models capable of predicting macronutrient composition and detecting the risk of low milk production using metabolic fingerprints and ML, achieving an accuracy of 87.9%. These models capture texture patterns and density variations in the cream layer that are invisible to the human eye, consolidating artificial intelligence as an emerging standard in nutritional diagnosis.

The necessity for modernizing analytical procedures in the food industry has driven the adoption of efficient, non-invasive inspection technologies. As reviewed by Baiano [33], conventional techniques for liquid and semi-liquid products are often time-consuming and unsuitable for real-time monitoring. In this context, imaging-based systems have emerged as critical sensing modalities that provide spatial and multi-constituent information regarding the physicochemical properties of products like milk and oils. This transition towards vision-as-sensor frameworks is further supported by recent advancements in the dairy industry, where non-invasive computer vision methods have been successfully deployed to predict milk quality traits, such as fat and protein content, using affordable RGB camera systems [34]. These sensing architectures demonstrate that digitally extracted features from visible spectra can achieve high correlation coefficients with laboratory ground truths, effectively acting as non-contact optical transducers. Furthermore, recent reviews on food safety emphasize that the integration of AI with smart sensors and computer vision is shifting food quality assessment from slow laboratory testing to real-time, low-cost monitoring of liquid products [35]. While these imaging-based metrology systems have matured in agricultural and industrial sectors [36], their application in the clinical environment of neonatal units for human milk analysis remains an underexplored frontier. This study adapts these proven optical sensing principles to the high-precision requirements of neonatal nutritional assessment, where the imaging pipeline is calibrated to function as a high-precision physical gauge for biological samples.

Currently, there is no methodology that integrates low-cost hardware with a hierarchical segmentation process specifically designed to automate creatocrit in breast milk banks. The novelty of this work lies in the proposal of an objective metric derived from CVS that minimizes technical bias through geometric rectification and standardized optical conditioning, offering a robust and scalable alternative for resource-constrained environments.

3. Materials and Methods

This section describes the integrated framework for automated lipid quantification, encompassing the physical experimental design, the hardware deployment of the Computer Vision System (CVS), and the mathematical modeling of measurement uncertainty. The study is structured to bridge the gap between traditional analog creamatocrit methods and automated digital estimation. As shown in Figure 1, the experimental scheme follows a sequential workflow: from the standardized preparation of biological samples to the high-resolution digital acquisition and subsequent predictive analysis.

3.1. Human Milk Samples and Experimental Design

A large-scale dataset of human milk samples (

n = 6400

) was obtained from the Human Milk Bank of the San Antonio Abad del Cusco National Hospital (Cusco, Peru). The 6400 samples were obtained from a diverse donor pool during a standardized relabeling process at the regional hospital. In order to ensure data independence and maintain clinical anonymity, each individual milk unit was treated as a distinct biological unit. Furthermore, each milk unit corresponds to a single, unique digital image in the dataset. The experimental phase was structured to ensure metrological reliability through a standardized sample preparation and acquisition protocol. Individual samples were transferred to uniform glass microcapillaries (75 mm length, 1.1 mm internal diameter) and sealed with non-toxic plasticine. In order to achieve a clear physical separation between the lipid fraction and the serum, all microcapillaries underwent a centrifugation process at 10,000 RPM for 15 min using a dedicated clinical microcentrifuge. The study followed a randomized complete block design under controlled environmental conditions (22 ± 2 °C; 45–55% relative humidity). For the baseline analog measurement, a trained operator measured the cream height (

h_{c}

) and total height (

h_{t}

) using a precision manual caliper (1 mm resolution). Subsequently, for the digital workflow, the experimental setup was established within a controlled optical enclosure using low-intensity LEDs to minimize background noise. A high-resolution 1080p Logitech camera (Logitech, Lausanne, Switzerland) was positioned at the nadir view distance of 12 cm from the sample holder. In order to generate a robust training dataset, the vertical distances between the detected interfaces (seal–serum, serum–cream, and cream–seal) were computed for the digital images and individually corrected by a trained operator. These measurements act as a high-precision reference. The subsequent ML regression models were then developed to estimate the cream fraction (c) by analyzing global pixel intensity distributions, rather than relying solely on the boundary coordinates; this fraction is used to derive fat and energy quantification through established analytical equations.

3.2. Hardware Deployment and Optical Environment

CVS was integrated into a specialized optical enclosure designed to maintain consistency and eliminate environmental interference. The structure was manufactured in 5.5 mm High-Density Polyethylene. The design isolates the sample from ambient light interference. Its external dimensions are 13 cm (height) × 21 cm (length) × 22.5 cm (width) as shown in Figure 2a. In order to ensure optical axis stability, a mounting aperture is located at the top for a 1080p high-definition camera, Logitech StreamCam Plus (Logitech, Lausanne, Switzerland). This ensures the stability of the optical axis. The internal compartment is engineered for precise, repeatable placement of the microcapillary tube. The system uses constant-intensity LED strips (Philips, Amsterdam, The Netherlands). Golden light was established as the optimal light. A regulator with the IRFZ44N MOSFET (Infineon Technologies, Neubiberg, Germany) was implemented. The circuit operates at 12 V. A potentiometer allows manual brightness adjustment (see Figure 2b). The passive filtering stage eliminates noise.

A conditioning chamber with a black background and LED lighting was designed to ensure consistent optical conditions. Capture is performed with a 1080p Logitech camera connected via USB, selected for its precision in segmenting capillary layers. Edge computing is performed, and finally, the results are labelled and stored (see Figure 3).

3.2.1. Light Sensitivity

A series of experimental tests was conducted to optimize the edge definition of the microcapillary and maximize the visualization of its internal content. The parameters were locked at an exposure time of 10 ms, a gain of ISO 100, and a white balance of 4500 K, with the focus manually fixed at the center of the microcapillary axis. Initial evaluations confirmed that ambient light generates erratic reflections that compromise measurement repeatability. By comparing natural capture with the conditioned environment (Figure 4), the necessity of the black-background chamber for stable edge detection was validated.

Subsequently, the interaction between different light spectra and the microcapillary was evaluated using blue, yellow, and gold LED sources. As depicted in Figure 5, each spectrum was tested at high and low intensity levels to identify potential sensor saturation or low-contrast regions. To quantify these responses, a statistical analysis of pixel intensity distribution was performed using 30,000 random subsamples per configuration.

3.2.2. Spectral Optimization

As summarized in Table 1 and visualized through the violin plots in Figure 6, the Gold Low spectrum demonstrated superior performance for artificial vision tasks. While blue and yellow sources exhibited higher noise floors or information loss at low levels, the Gold spectrum maintained a wide Dynamic Range (DR = 226 a.u.) and high homogeneity. Under these standardized conditions, the Dynamic Range (DR) was calculated as the intensity spread between the noise floor and the saturation limit across the RGB channels, reaching a peak of 226 a.u. for the Gold Low configuration. Specifically, the coincidence of

Q_{1}

and

Q_{2}

at 21.0 a.u. indicates a stable sensor response, while the minimum value of 3.0 a.u. facilitates a steeper intensity gradient. This enhanced contrast between the shadows and the microcapillary walls simplifies calibration of hierarchical segmentation algorithms, enabling robust distinction of physical boundaries regardless of minor ambient fluctuations.

3.3. Mathematical Framework and Uncertainty Analysis

The human milk fat percentage (F) and energy density (

E D

) are not measured directly but are derived from the cream layer height (

h_{c}

) and the total column height (

h_{t}

), which includes both the cream and serum layers. According to the standardized methodology established by [37], F and

E D

are calculated using the following Equations (1) and (2). These equations serve as the universal reference for international clinical protocols and technical health standards, ensuring consistency in the nutritional assessment of donor milk across global healthcare systems [38,39].

F = \frac{2950}{73} \frac{h_{c}}{h_{t}}

(1)

E D = 6680 (\frac{h_{c}}{h_{t}}) + 290

(2)

In order to transform the Computer Vision System (CVS) from a simple image processing tool into a reliable metrological instrument, an uncertainty analysis was performed. This stage establishes the correspondence between the digital coordinate space (pixels) and the physical metric system, ensuring that the extracted heights possess physical meaning. By evaluating how resolution limits propagate through the analytical models, we can define the precision boundaries of the system.

3.3.1. Error Propagation Model

In order to propagate the measurement error, a logarithmic transformation and subsequent differentiation were applied to Equation (1) to linearize the relationship between variables. Differentiating both sides yields the relationship between the relative uncertainties. Since uncertainties are additive in the worst-case scenario, the signs are taken as positive to determine the maximum possible absolute error (

Δ F

), resulting in Equation (3):

Δ F = F (\frac{Δ h_{c}}{h_{c}} + \frac{Δ h_{t}}{h_{t}})

(3)

Likewise, for the energy density defined in Equation (2), the constant offset is subtracted before applying the logarithmic transformation and differentiation. The resulting absolute uncertainty for the

E D

is expressed in Equation (4):

Δ E D = (E D - 290) (\frac{Δ h_{c}}{h_{c}} + \frac{Δ h_{t}}{h_{t}})

(4)

3.3.2. Mathematical Model for Uncertainty Analysis

By defining the cream fraction as

c = h_{c} / h_{t}

and substituting the original definitions into Equations (3) and (4), we obtain the final numerical models in Equations (5) and (6).

Δ F = 40.41 (\frac{Δ h}{h_{t}}) (1 + c)

(5)

Δ E D = 6680 (\frac{Δ h}{h_{t}}) (1 + c)

(6)

The resulting expressions in Equations (5) and (6) reveal that the measurement uncertainty is governed by the cream fraction (c). Since the instrument resolution (

Δ h

) constitutes a fixed constraint, defined either by the physical caliper with a nominal resolution of 1 mm (corresponding to an analog uncertainty of

\pm 0.5

mm by definition of half-scale division) or by the digital pixel resolution (±1 px), and the total column height (

h_{t}

) remains constant for a standardized capillary volume, the propagation of error is effectively scaled by a constant coefficient. Consequently, the absolute uncertainty is minimal for samples with lower lipid content and increases linearly as the cream layer occupies a larger fraction of the tube. This mathematical behavior allows for a predictable assessment of sensor precision across the entire range of human milk compositions [40]. In order to fully characterize the digital measurement uncertainty beyond the discrete pixel resolution, three additional factors were addressed. First, lens distortion was minimized by using a fixed-focus assembly and maintaining a perpendicular optical axis relative to the microcapillary, ensuring a linear geometric projection within the Region of Interest (ROI) [41]. Regarding spatial calibration, the system operates directly in the raw pixel domain to maintain data integrity and avoid rounding errors associated with external metric conversions. Since the core variables are derived from the cream fraction (c), which is a dimensionless ratio, any physical scaling factor is mathematically cancelled, rendering the pixel-based measurement intrinsically self-calibrating as long as the sensor-to-sample distance remains constant. Finally, segmentation variability was mitigated through the standardized Gold Low illumination environment, which provides high-contrast interfaces and stable intensity gradients. This controlled setup ensures that the automated detection of

h_{c}

and

h_{t}

is repeatable and that the

\pm 1

px resolution remains the dominant and predictable limit of the system uncertainty.

3.4. Computer Vision Pipeline

The algorithm, developed in Python 3.10, processes images at their native resolution to avoid information loss from interpolation. The operational flow (see Figure 7) is based on segmenting raw units (pixels) to calculate the dimensions of the phases in the microcapillary.

A semiautomated labeling protocol was implemented to ensure the highest metrological standards during the training phase. While an initial segmentation algorithm was developed to estimate

h_{c}

and

h_{t}

, each of the 6400 images captured underwent rigorous manual supervision and refinement. An expert reviewed and adjusted the detected boundaries in each sample to eliminate any residual errors caused by optical artifacts or meniscus distortion. This carefully selected dataset served as the high-precision ground truth (target). Consequently, the ML models were trained not only to replicate the initial segmentation but also to understand the complex relationship between global pixel distributions and achieve a level of robustness that surpasses simple automated edge detection.

3.4.1. Hierarchical Segmentation and ROI Definition

From the original image

I (x, y) \in R^{M \times N \times 3}

, the conversion to greyscale

I_{g} (x, y)

is performed. To isolate the microcapillary, global binarization is applied as described in Equation (7).

B (x, y) = \{\begin{matrix} 255, & si I_{g} (x, y) > T_{b}, \\ 0, & si I_{g} (x, y) \leq T_{b}, \end{matrix}

(7)

where

T_{b} = 20

because it is sufficient to distinguish the microcapillary from the background. This value is determined empirically and standardized by Gold Low illumination environment. The contours (

C_{k}

) are extracted, and the geometry of the area (A) is validated using the shoelace formula derived from Green’s Theorem:

A = \frac{1}{2} |\sum_{i = 1}^{N} (x_{i} y_{i + 1} - x_{i + 1} y_{i})|

(8)

On the other hand, the general Region of Interest (

{ROI}_{g}

) is defined by a bounding rectangle:

{ROI}_{g} = {(x, y) ∣ x_{min} \leq x \leq x_{max}, y_{min} \leq y \leq y_{max}} .

(9)

The system divides the

{ROI}_{g}

using the midpoint

y_{m i d} = ⌊ h / 2 ⌋

. An intensity threshold

T_{w} = 90

is applied to isolate the seal (

{ROI}_{1}

) and the cream (

{ROI}_{2}

). The lower boundary of the seal (

y_{s e a l}

) is defined as:

y_{s e a l_e n d} = y_{b a s e} + Δ y_{s e a l} + h_{s e a l}

(10)

where

y_{b a s e}

is the origin of the capillary,

Δ y_{s e a l}

is the relative displacement, and

h_{s}

is the height of the seal. The upper limit of the cream (

y_{c_s t a r t}

) is located in the second subregion as described in Equation (11) where

Δ y_{c}

is the relative displacement of the cream:

y_{c_s t a r t} = y_{b a s e} + y_{m i d} + Δ y_{c}

(11)

where

Δ y_{c}

represents the displacement of the cream layer relative to the start of the second subregion. This spatial discrimination allows the analysis of the serum phase to be confined to a third region of interest (

{ROI}_{3}

) defined by Equation (12).

R O I_{3} = {(x^{''}, y^{''}) ∣ x_{m i n} \leq x^{''} \leq x_{m a x}, y_{s e a l_e n d} + 1 \leq y^{''} \leq y_{c_s t a r t} - 1}

(12)

A grey range

[T_{min}^{g} = 25, T_{max}^{g} = 90]

is applied to extract the dominant contour of the serum. The final heights of the serum (

h_{s}

) and cream (

h_{c}

) in pixels are obtained by adjusting a rectangle of minimum area:

h_{p x} = max (w, h)

(13)

where w is the width and h is the height. Finally, based on these values, the percentage of cream, the percentage of F, and the total

E D

are calculated using analytical expressions aligned with the technical guidelines established in [39] and based on the formulations proposed by [37].

3.4.2. Adaptive Cropping

To ensure that color analysis is restricted to the biological content of the sample, an adaptive cropping algorithm is implemented. The adaptive cropping is performed only along the vertical axis (y). Once the general Region of Interest (

{ROI}_{g}

) has been defined, the final vertical limits of the sample (see Equations (14) and (15)) are computed by detecting intensity transitions corresponding to the lower boundary of the seal and the lower boundary of the cream layer, respectively.

y_{s e a l_e n d} = y_{b a s e} + {arg max}_{y \in {ROI}_{1}} \{\nabla I_{g} (y) > T_{w}\} + h_{s e a l}

(14)

y_{c r e a m_e n d} = y_{c_s t a r t} + {arg max}_{y \in {ROI}_{2}} \{\nabla I_{g} (y) > T_{w}\} + h_{c r e a m}

(15)

where

\nabla I_{g}

denotes the vertical intensity gradient. No additional offset is applied; therefore, the cropping limits coincide with the detected physical interfaces. The resulting cropped image

I_{c r o p}

is defined as the subset of pixels containing exclusively the serum and cream phases, effectively removing external artefacts and non-biological regions.

The horizontal extent of the crop is defined by the full width of the microcapillary (

x_{min}

to

x_{max}

), ensuring that all columns of the sample are included in the analysis.

3.4.3. Multichannel Statistical Feature Extraction

After isolating the sample, intensity-domain feature extraction is performed to feed the regression models. For each cropped image

I_{c r o p}

, the RGB color channels are separated, and the luminance (Gray Scale) channel is computed. The feature vector X is constructed by computing frequency histograms for each channel

k \in {R, G, B, G r a y}

, as shown in Equation (16).

H_{k} (i) = \sum_{x, y} δ (I_{c r o p, k} (x, y) - i), i \in [0, 255],

(16)

where

δ

is the Kronecker delta function, and i represents the intensity level. The final feature space is defined by concatenating these histograms. Using this procedure, a structured dataset of feature vectors was generated, where each row corresponds to a single sample image and each column represents a specific feature. This organized dataset was then ready for the application of ML algorithms for regression and predictive analysis.

3.5. Predictive Modeling via ML Regression Algorithms

In order to identify the algorithm with the highest predictive capacity for estimating the cream fraction (c), 28 ML regression models were evaluated. These models were grouped into eight main families: linear models (EL), ensemble trees (ESB), Gaussian process regressors (GPR), kernel-based methods (KN/SVM), linear regression models (LR), neural networks (NN), and decision trees (Tree). Table 2 provides the specific configurations and hyperparameters used for each architecture. The total set of images was divided into 65% for training, 25% for validation, and the remaining 10% for testing. Each model was evaluated under identical computational conditions. The selection criterion for the optimal model was based on maximizing the coefficient of determination (

R^{2}

).

4. Results

This section presents the performance of the developed computer vision system (CVS). The system analyzed images of microcapillaries with centrifuged human milk samples. The algorithm automatically segmented the cream and serum layers. Figure 8 shows the system’s performance under different experimental conditions.

4.1. Segmentation Robustness Analysis

The system proved robust against physical variations during acquisition. Sample 1 (Figure 8a) shows a slight axial tilt. The algorithm correctly segmented the phases despite this deviation. Sample 2 (Figure 8b) demonstrates an ideal vertical alignment. Sample 3 (Figure 8c) shows incomplete filling of the capillary. In all cases, the CVS correctly identified the critical fluid boundaries. Regarding the cream fraction (c), we yield a Mean Absolute Error (MAE) of 0.0050 and a Root Mean Square Error (RMSE) of 0.0062. To further assess the discrepancy between methods, the digital sensor was established as the high-precision reference—constrained only by a fixed spatial resolution of 1 px—revealing that manual measurements exhibited a Mean Relative Error (MRE) of 9.52%. A paired t-test confirmed no statistically significant difference between the digital and manual methods (p = 0.572), with a negligible mean bias of 0.06%. This high degree of agreement between the digital and manual methods, characterized by the absence of systematic bias, confirms the clinical relevance of the CVS as a high-precision alternative to traditional crematocrit scales.

4.2. Uncertainty Analysis

Uncertainty was evaluated under real-world conditions. The study included 68 total measurements for validation. While the ML models were trained and validated using the full dataset (n = 6400), the uncertainty analysis and its graphical representation were conducted on a representative subset of 34 samples, selected across the full concentration range to ensure visual clarity and avoid over-plotting. This setup ensures a balanced comparison between both techniques. Experimental mean values were:

{\bar{c}}_{(m m / m m)} = 0.053

,

{\bar{c}}_{(p x / p x)} = 0.0529

,

{\bar{h}}_{t (m m)} = 48.57

mm and

{\bar{h}}_{t (p x)} = 228.20

px. These values were substituted into Equations (5) and (6). The comparative analysis of measurement uncertainties is presented in Figure 9. For fat content, Figure 9a shows that conventional analog quantification results in an uncertainty of

\pm 0.438 %

, whereas the proposed CVS reduces this value to

\pm 0.186 %

. Similarly, for energy density, Figure 9b illustrates a reduction from

\pm 72.41

kcal/L using analog methods to

\pm 30.81

kcal/L with the CVS.

The statistical impact of this improvement is quantified in Table 3. For the mean fat content (

\bar{F} \approx 2.01 %

), the analog relative error of

21.21 %

was reduced to

9.00 %

using the digital approach. Similarly, the mean energy density (

\bar{E D} \approx 622.49

kcal/L) showed a reduction from

11.52 %

to

4.90 %

. These results confirm a relative error improvement of approximately

57.5 %

, validating the CVS as a robust tool for high-fidelity nutrient estimation.

Since both F and

E D

depend on

h_{c}

and

h_{t}

, their linear correlation is expected. Figure 10 illustrates this relationship, where Figure 10a reveals the high uncertainty inherent in analog measurements, characterized by wide error intervals across the experimental range. In contrast, Figure 10b demonstrates a drastic reduction in uncertainty through the CVS system, which presents significantly narrower and more precise intervals.

4.3. Performance Analysis of Regression Models

Figure 11 illustrates the

R^{2}

values achieved using Gray Scale (Figure 11a), RGB (Figure 11b), and combined RGB + Gray Scale (Figure 11c) descriptors with 256, 768, and 1024 features, respectively. The validation phase is represented in blue, while the testing phase is shown in orange. All models are identified by their specific ID and categorized according to the parameters detailed in Table 2.

For the 256 Gray Scale features, the four best models were: Rational Quadratic GPR (GPR c) with

R^{2}

of 0.850 for validation (see Figure 12a) and 0.862 for test (see Figure 12b), Exponential GPR (GPR a) with 0.848 and 0.863, Medium Gaussian SVM (SVM e) with 0.847 and 0.866, and Cubic SVM (SVM b) with 0.827 and 0.834. These results demonstrate that even with limited intensity data, GPR and SVM architectures maintain high stability.

In the 768 RGB configuration, performance increased significantly. The top four models were: Rational Quadratic GPR (GPR c) reaching an

R^{2}

of 0.885 for validation (see Figure 12c) and 0.867 for test (see Figure 12d), followed by Medium Gaussian SVM (SVM e) with 0.878 and 0.859, Linear Regression (LR b) with 0.876 and 0.854, and Cubic SVM (SVM b) with 0.876 and 0.854. The inclusion of color channels improved the model’s ability to distinguish cream fraction-related chromatic patterns.

Finally, for the 1024 RGB + Gray Scale features, the models showed the best generalization. The leading models were: Rational Quadratic GPR (GPR c) with

R^{2}

of 0.870 for validation (see Figure 12e) and 0.871 for test (see Figure 12f), Medium Gaussian SVM (SVM e) with 0.864 and 0.878, Cubic SVM (SVM b) with 0.862 and 0.856, and Quadratic SVM (SVM f) with 0.859 and 0.846. The combination of 1024 descriptors minimized the variance between phases, proving that integrating luminance with color data yields the most reliable quantification.

These findings highlight the ability of the sensor to provide high-quality data for regression modeling. The 768 RGB feature set proved to be the most accurate, achieving a peak test

R^{2}

of 0.867 with the Rational Quadratic GPR model. This performance confirms that chromatic signatures are the most decisive descriptors for quantifying the cream fraction. Furthermore, selecting this configuration ensures that the system maintains a better

R^{2}

with a resolution of ±1 pixel, effectively converting high-dimensional image data into a stable clinical measurement with a 57.5% reduction in final uncertainty.

4.4. Feature Importance Analysis

A feature importance analysis was performed using SHAP (Shapley Additive exPlanations) values. Figure 13 illustrates the importance distribution of the 10 best features across the Gray Scale, RGB, and RGB + Gray Scale.

In the Gray Scale configuration (see Figure 13a), the model shows a high statistical dependency on low-to-mid intensity bins to establish a negative baseline for the prediction. Predictors such as Gray_41, Gray_57, Gray_51, and Gray_58 show that high feature values result in more negative SHAP values, which the model statistically associates with non-lipid regions such as the background or serum phase. Conversely, features like Gray_173 and Gray_177 display a different behavior, where higher intensity values correlate with a positive SHAP impact, suggesting these bins are significant predictors for the cream layer within the learned feature space.

The RGB analysis (see Figure 13b) reveals a complex spectral interaction, where the Blue (B) and Red (R) channels provide the most discriminative data. The predictor B_46 is the most significant; however, its high values correlate with more negative SHAP impacts, suggesting it serves as a statistical boundary marker. A relevant finding is the role of red channel features like R_140 and R_141, where high intensity values result in positive SHAP shifts. This suggests a statistical correlation between these specific spectral bins and the target variable, rather than a causal physical interaction.

In the 1024-features combined space (Figure 13c), the model’s behavior shifts towards a dominant statistical weight on luminance saturation. The feature Gray_255 emerges as the most significant predictor by a wide margin. This statistical association suggests that the model prioritizes maximum brightness values to differentiate the cream layer. By assigning high importance to Gray_255, the regression model effectively minimizes the impact of mid-tone and shadow regions, focusing the mathematical prediction on peak intensity data rather than a direct physical modeling of lipid scattering mechanisms.

5. Discussion

The transition from manual inspection to the proposed CVS represents a fundamental shift in measurement principles, as synthesized in the cross-method evaluation in Table 4. While electrochemical sensors [13,16] rely on ion-selective redox reactions—which offer high chemical sensitivity but are susceptible to thermal noise and electrode fouling—the CVS employs a non-invasive optical reflectance-based methodology. Unlike electrochemical methods that require periodic reagent calibration, the CVS maintains long-term stability through fixed hardware optimization. The 57.5% relative error improvement should be interpreted as a meteorological milestone in interface detection, directly resulting from the peak performance of the Rational Quadratic GPR model using the 768 RGB feature set (

R^{2} = 0.867

). This model has a training time of 573.4 s, a prediction Speed of 2807.9 obs/s, and a model size of 33.9 MB. The computational feasibility was validated on a portable workstation with an Intel Core i7-1255U processor and 12GB of RAM, where the total processing time per sample remained consistently under 1.5 s.

In manual clinical practice, factors such as the parallax effect and subjective boundary identification lead to an MRE of 9.52%. The CVS eliminates this variance by applying a probabilistic sub-pixel analysis to the chromatic signatures of the milk phases. This improvement is not merely a statistical gain but a reduction in the noise floor of the Creamatocrit method, effectively shifting the measurement from a human-centered estimation to a standardized digital quantification. The superior performance of the Rational Quadratic GPR model is rooted in its kernel structure, which is uniquely suited for the physical nature of milk samples. Unlike linear kernels or the rigid architectures of CNNs [31] that may overfit small clinical datasets, the Rational Quadratic kernel acts as a scale mixture of RBF kernels. This allows the model to handle both small-scale chromatic fluctuations and larger-scale trends in the c fraction simultaneously.

The proposed CVS offers a low-cost, automated alternative for Human Milk Banks in resource-limited settings and low-income countries. By replacing manual inspection with a digital scan, the system eliminates inter-operator variability and parallax errors. This allows non-specialized clinical staff to perform precise, real-time nutritional fortification for infants, providing a scalable, standardized tool where expensive infrared analyzers are not feasible.

6. Conclusions

This research successfully developed an optimized CVS for the automated estimation of the cream fraction, which is subsequently used to calculate the quantification of fat and energy in human milk through established analytical equations. The integration of hardware-level optimization and ML demonstrates a measurable advancement in metrological reliability for neonatal care, rather than a mere incremental improvement. This progress is characterized by two distinct technical contributions: first, the hardware optimization and illumination control effectively lowered the measurement noise floor, enabling a 57.5% reduction in final uncertainty; second, the Rational Quadratic GPR model provided the necessary sub-pixel precision and stability (

R^{2} = 0.867

) to maintain a negligible bias of 0.06% across the clinical range.

Despite these advancements, certain limitations must be acknowledged. While the CVS eliminates analog errors such as parallax, its performance remains dependent on the physical quality of the capillary filling and a controlled acquisition environment. Furthermore, the current validation is focused on a specific clinical range, with observed cream fraction values between 0.0429 and 0.0663, which correspond to energy density levels between 576.8 and 732.9 kcal/L. These values correspond to the typical normative range for mature human milk; consequently, further studies are required to ensure the system’s generalization across extreme concentrations, such as those found in colostrum (potentially higher) or in cases of severe maternal malnutrition (potentially lower).

Future work will focus on two key areas. First, we will prioritize scalability and mobile integration by porting the CVS to smartphone platforms [28], enabling home-based nutritional monitoring for lactating individuals. Second, the target biomarker database will be expanded to include the quantification of total proteins and lactose levels [13]. In order to ensure model robustness under real-world conditions, future validation will include non-normative milk samples, considering diverse maternal profiles, seasonal variations, and extreme fat concentrations. Additionally, multimodal sensor fusion and advanced preprocessing techniques will be explored [42] to isolate biological responses from environmental effects. These advancements will not only enhance individual neonatal care but also provide a scalable tool for large-scale epidemiological surveillance of maternal-infant nutritional status, ensuring global clinical applicability even in the most remote regions.

Author Contributions

Conceptualization, E.J.S.-C. and E.M.-C.; methodology, V.L.S.-A. and E.M.-C.; software, L.E.H.-C. and A.H.-M.; validation, J.L. and V.L.S.-A. formal analysis, L.E.H.-C., V.L.S.-A. and A.H.-M.; investigation, L.E.H.-C., A.H.-M. and E.M.-C.; resources, E.J.S.-C., J.L., V.L.S.-A. and E.M.-C.; data curation, L.E.H.-C. and A.H.-M.; writing—original draft preparation, L.E.H.-C., E.J.S.-C. and A.H.-M.; writing—review and editing, J.L., V.L.S.-A. and E.M.-C.; visualization, V.L.S.-A. and E.M.-C.; supervision, E.J.S.-C., J.L., V.L.S.-A. and E.M.-C.; project administration, E.J.S.-C., J.L., V.L.S.-A. and E.M.-C.; funding acquisition, E.J.S.-C., J.L., V.L.S.-A. and E.M.-C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National University of San Antonio Abad of Cusco, through the projects of the Professional School of Electronic Engineering; the Universidad Politécnica Salesiana under the Fog Computing Simulation project; and the Universitat Politècnica de València through the “Programa de Ayudas de Investigación y Desarrollo (PAID-01-24)”.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data provided can be found within the article. The original contributions made in this study are included in the document; any additional inquiries can be directed to the author or authors responsible.

Acknowledgments

We thank the Institutional Laboratory of Renewable Energy, Optical Communications Engineering and Environmental Technology (TESLA) and the Laboratory for Research, Entrepreneurship and Innovation in Automatic Control Systems, Automation and Robotics (LIECAR), both from the Universidad Nacional de San Antonio Abad del Cusco and Cloud Computing Smart Cities & High Performance Computing Research Group (GIHP4C) from the Universidad Politécnica Salesiana. Additionally, the authors acknowledge the Technology and Engineering Group of EM Research & Tech for providing technical support and feedback during the development of this work.

Conflicts of Interest

Author Edison Moreno-Cardenas was employed by the company EM Research & Tech. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

AlSulaiti, B.; Ferguson-Smith, A.C.; Hanin, G. From mammary glands to nutrients: Genetic insights into milk composition. Biol. Reprod. 2025, 114, 442–461. [Google Scholar] [CrossRef]
Herzlich, J.; Frumer, B.; Mandel, D.; Morag, S.; Halperin, A.; Mangel, L. Effects of Donor Human Milk and Formula Supplementation on Bone Metabolism and Clinical Outcomes in Preterm Infants Receiving Mother’s Own Milk. Nutrients 2025, 17, 3263. [Google Scholar] [CrossRef]
Meng, F.; Uniacke-Lowe, T.; Lanfranchi, E.; Meehan, G.; O’Shea, C.A.; Dennehy, T.; Ryan, A.C.; Stanton, C.; Kelly, A.L. A longitudinal study of fatty acid profiles, macronutrient levels, and plasmin activity in human milk. Front. Nutr. 2023, 10, 1172613. [Google Scholar] [CrossRef] [PubMed]
Tesser, F.; Meneghelli, M.; Martino, D.; Pegoraro, L.; Pelosi, M.S.; Sebellin, S.; Verlato, G. Early Optimal Parenteral Nutrition During NICU Stay and Neurodevelopmental Outcomes in Very Preterm Infants: State of the Art. Nutrients 2025, 17, 232. [Google Scholar] [CrossRef] [PubMed]
Bernardo, G.D.; Leone, G.; Izzo, F.; Giovengo, M.; Basilicata, M.G.; Centanni, F.; Morlino, F.; Salviati, E.; Giordano, M.; Perrone, S.; et al. Fatty Acid Profiling of Breast Milk at Different Gestational Ages. Nutrients 2025, 17, 2672. [Google Scholar] [CrossRef]
Davis, A.; Perrin, M.T. Impact of Holder Pasteurization and Preanalytical Handling Techniques on Fat Concentration in Donor Human Milk: A Scoping Review. Adv. Nutr. 2024, 15, 100229. [Google Scholar] [CrossRef]
Honoré, K.D.; Lai, C.T.; Michaelsen, K.F.; Gridneva, Z.; Zachariassen, G.; Geddes, D.T. Concentration and Intakes of Macronutrients from Human Milk Do Not Differ by Infant Sex in Australian and Danish Cohorts. Nutrients 2025, 17, 3647. [Google Scholar] [CrossRef]
Ahmed, T.B.; Eggesbø, M.; Criswell, R.; Demmelmair, H.; Totzauer, M.; Koletzko, B. The Associations of Maternal Prepregnancy Body Mass Index with Human Milk Fatty Acid and Phospholipid Composition in the Observational Norwegian Human Milk Study. J. Nutr. 2025, 155, 1818–1827. [Google Scholar] [CrossRef]
Sjöbom, U.; Cismaan, O.; Hansen-Pupp, I.; Wackernagel, D.; Sävman, K.; Hellström, A.; Nilsson, A.K. Fatty acid composition of milk from mothers giving birth at extremely low gestation in Sweden. Exp. Results 2022, 3, e9. [Google Scholar] [CrossRef]
Lygerou, I.; Ilia, S.; Briassoulis, P.; Manousaki, A.; Koropouli, M.; Hatzidaki, E.; Briassoulis, G. The Impact of Estimated Energy and Protein Balances on Extrauterine Growth in Preterm Infants. Nutrients 2023, 15, 3556. [Google Scholar] [CrossRef] [PubMed]
Pereira, J.; Bresser, L.R.F.; van Riel, N.; Looijesteijn, E.; Schoemaker, R.; Ulfman, L.H.; Jeurink, P.; Karaglani, E.; Manios, Y.; Brouwer, R.W.W.; et al. Bovine Milk Fat Intervention in Early Life and Its Impact on Microbiota, Metabolites and Clinical Phenotype: A Multi-Omics Stacked Regularization Approach. BioMedInformatics 2022, 2, 281–296. [Google Scholar] [CrossRef]
Radmacher, P.G.; Adamkin, D.H. Fortification of human milk for preterm infants. Semin. Fetal Neonatal Med. 2017, 22, 30–35. [Google Scholar] [CrossRef]
Al-Shami, A.; Ma, H.; Banks, M.; Amirghasemi, F.; Mohamed, M.A.; Soleimani, A.; Nejad, S.K.; Ong, V.; Tasso, A.; Berkmen, A.; et al. Mom and Baby Wellness with a Smart Lactation Pad: A Wearable Sensor-Embedded Lactation Pad for on-Body Quantification of Glucose in Breast Milk. Adv. Funct. Mater. 2025, 35, 2420973. [Google Scholar] [CrossRef] [PubMed]
Barrés-Fernández, A.; Arcos-Machancoses, J.V.; Castillo-Corullón, S.; González, S.I.; Fullana-Tur, M.; Ferrando-Monleón, S. Estimating Caloric Intake per Breastfeeding Session in Infants: A Probabilistic Approach. Nutrients 2025, 17, 3136. [Google Scholar] [CrossRef] [PubMed]
Suwaydi, M.A.; Lai, C.T.; Gridneva, Z.; Perrella, S.L.; Wlodek, M.E.; Geddes, D.T. Sampling Procedures for Estimating the Infant Intake of Human Milk Leptin, Adiponectin, Insulin, Glucose, and Total Lipid. Nutrients 2024, 16, 331. [Google Scholar] [CrossRef] [PubMed]
Banks, M.; Soleimani, A.; Al-Shami, A.; Nejad, S.K.; Ma, H.; Amirghasemi, F.; Mohamed, M.A.; Mousavi, M.P. Electrochemical Devices for Accessible Analysis of Human Milk. ECS Meet. Abstr. 2025, MA2025-02, 2923. [Google Scholar] [CrossRef]
Kaushal, S.; Tammineni, D.K.; Rana, P.; Sharma, M.; Sridhar, K.; Chen, H.H. Computer vision and deep learning-based approaches for detection of food nutrients/nutrition: New insights and advances. Trends Food Sci. Technol. 2024, 146, 104408. [Google Scholar] [CrossRef]
Theba, I.; Vegad, S. A Novel Approach for Detection of Food Adulteration Using Deep Learning with Image Processing. In Lecture Notes in Networks and Systems; Springer: Singapore, 2025; Volume 1113, pp. 201–218. [Google Scholar] [CrossRef]
Balasubramaniam, S.; Joe, C.V.; Prasanth, A.; Kumar, K.S. Computer Vision Systems in Livestock Farming, Poultry Farming, and Fish Farming. In Computer Vision in Smart Agriculture and Crop Management; Wiley: Hoboken, NJ, USA, 2025; pp. 221–258. [Google Scholar] [CrossRef]
Quispe-Astorga, A.; Coaquira-Castillo, R.J.; Utrilla Mego, L.W.; Herrera-Levano, J.C.; Concha-Ramos, Y.; Sacoto-Cabrera, E.J.; Moreno-Cardenas, E. Data-Driven Fault Detection and Diagnosis in Cooling Units Using Sensor-Based Machine Learning Classification. Sensors 2025, 25, 3647. [Google Scholar] [CrossRef]
Bozzetti, V.; Dui, L.G.; Zannin, E.; Riccò, S.; Coglianese, P.; Cavalleri, V.; Iozzi, L.; Ventura, M.L.; Ferrante, S. AI to predict extrauterine growth restriction during transitional nutrition of preterm infants: A retrospective study. J. Perinatol. 2025, 1–9. [Google Scholar] [CrossRef]
Luo, L.; Lou, Y.; Wu, C.; He, J.; Jiang, F. Worldwide research performance on telemedicine for newborns and neonatal intensive care units: A bibliometric and visualization study. Digit. Health 2025, 2025, 1–16. [Google Scholar] [CrossRef]
Agudelo-Pérez, S.; Botero-Rosas, D.; Rodríguez-Alvarado, L.; Espitia-Angel, J.; Raigoso-Díaz, L. Artificial intelligence applied to the study of human milk and breastfeeding: A scoping review. Int. Breastfeed. J. 2024, 19, 79. [Google Scholar] [CrossRef] [PubMed]
Hinojosa-Nogueira, D.; Ortiz-Viso, B.; Navajas-Porras, B.; Pérez-Burillo, S.; González-Vigil, V.; de la Cueva, S.P.; Ángel Rufián-Henares, J. Stance4Health Nutritional APP: A Path to Personalized Smart Nutrition. Nutrients 2023, 15, 276. [Google Scholar] [CrossRef]
Res, G.; Bishara, R.F.; Church, P.T.; Rosenthal, R.; Bishara, R.M.; Dupuis, A.; Asztalos, E.; Banihani, R. Growth and Neurodevelopmental Outcomes of Preterm Infants Born <26 Weeks Gestation before and after Implementation of a Nutrition-Care Bundle. Children 2024, 11, 475. [Google Scholar] [CrossRef]
Kim, K.; Kim, N.J.; Kim, S.Y. Safety and Efficacy of Early High Parenteral Lipid Supplementation in Preterm Infants: A Systematic Review and Meta-Analysis. Nutrients 2021, 13, 1535. [Google Scholar] [CrossRef]
Elhallaoui Oueldkaddour, F.Z.; Wariaghli, F.; Brirhet, H.; Yahyaoui, A.; Jaziri, H. Comparison of Machine Learning Models for Real-Time Flow Forecasting in the Semi-Arid Bouregreg Basin. Limnol. Rev. 2025, 25, 6. [Google Scholar] [CrossRef]
Rghioui, A.; Lloret, J.; Sendra, S.; Oumnad, A. A Smart Architecture for Diabetic Patient Monitoring Using Machine Learning Algorithms. Healthcare 2020, 8, 348. [Google Scholar] [CrossRef]
Samal, K.P.; Thakur, R.R.; Panda, A.K.; Nandi, D.; Pati, A.K.; Pegu, K.; Đurin, B. Machine Learning-Enhanced Monitoring and Assessment of Urban Drinking Water Quality in North Bhubaneswar, Odisha, India. Limnol. Rev. 2025, 25, 44. [Google Scholar] [CrossRef]
Quispe-Vilca, J.L.; Moreno-Cardenas, E.; Sacoto-Cabrera, E.J.; Moreno-Cardenas, Y. Integrating IoT and image processing for crop monitoring: A LoRa-based solution for citrus pest detection. Electronics 2024, 13, 4863. [Google Scholar] [CrossRef]
Dahiya, N.K.; Bajaj, S.B.; Ruhil, A.P.; Jaglan, V. Udderly accurate: A deep learning based modelling for determination of dairyness of Sahiwal cow using computer vision. Indian J. Anim. Sci. 2024, 94, 83–87. [Google Scholar] [CrossRef]
Jin, X.; Lai, C.T.; Perrella, S.L.; Zhou, X.; Hassan, G.M.; McEachran, J.L.; Gridneva, Z.; Taylor, N.L.; Wlodek, M.E.; Geddes, D.T. Milk Composition Is Predictive of Low Milk Supply Using Machine Learning Approaches. Diagnostics 2025, 15, 191. [Google Scholar] [CrossRef]
Baiano, A. Applications of hyperspectral imaging for quality assessment of liquid based and semi-liquid food products: A review. J. Food Eng. 2017, 214, 10–15. [Google Scholar] [CrossRef]
Fuentes, S.; Viejo, C.G.; Tongson, E.; Lipovetzky, N.; Dunshea, F.R. Biometric Physiological Responses from Dairy Cows Measured by Visible Remote Sensing Are Good Predictors of Milk Productivity and Quality through Artificial Intelligence. Sensors 2021, 21, 6844. [Google Scholar] [CrossRef]
Balakrishnan, P.; Leema, A.A.; Jothiaruna, N.; Assudani, P.J.; Sankar, K.; Kulkarni, M.B.; Bhaiyya, M. Artificial intelligence for food safety: From predictive models to real-world safeguards. Trends Food Sci. Technol. 2025, 163, 105153. [Google Scholar] [CrossRef]
Fuentes, S.; Viejo, C.G.; Tongson, E.; Dunshea, F.R.; Dac, H.H.; Lipovetzky, N. Animal biometric assessment using non-invasive computer vision and machine learning are good predictors of dairy cows age and welfare: The future of automated veterinary support systems. J. Agric. Food Res. 2022, 10, 100388. [Google Scholar] [CrossRef]
Lucas, A.; Gibbs, J.A.; Lyster, R.L.; Baum, J.D. Creamatocrit: Simple clinical technique for estimating fat concentration and energy value of human milk. BMJ 1978, 1, 1018–1020. [Google Scholar] [CrossRef]
Tie, W.J.; Kent, J.C.; Tat Lai, C.; Rea, A.; Hepworth, A.R.; Murray, K.; Geddes, D.T. Reproducibility of the creamatocrit technique for the measurement of fat content in human milk. Food Chem. 2021, 356, 129708. [Google Scholar] [CrossRef] [PubMed]
Ministerio de Salud del Perú. Norma Técnica de Salud para la Implementación, Funcionamiento y Promoción de Bancos de Leche Humana en el Perú; Technical Report (NTS N° 152-MINSA/2019/DGIESP); Ministerio de Salud (MINSA): Lima, Peru, 2019. [Google Scholar]
Pratap, A.; Sardana, N. Machine learning-based image processing in materials science and engineering: A review. Mater. Today Proc. 2022, 62, 7341–7347. [Google Scholar] [CrossRef]
Ercetin, A.; Der, O.; Akkoyun, F.; Gowdru Chandrashekarappa, M.P.; Şener, R.; Çalışan, M.; Olgun, N.; Chate, G.; Bharath, K.N. Review of Image Processing Methods for Surface and Tool Condition Assessments in Machining. J. Manuf. Mater. Process. 2024, 8, 244. [Google Scholar] [CrossRef]
Silva-Alvarado, V.L.; Lloret, J. Multivariate Gas Sensor E-Nose System with PARAFAC and Machine Learning Modeling for Quantifying and Classifying the Impact of Fishing Gears. Sensors 2026, 26, 6. [Google Scholar] [CrossRef]

Figure 1. Detailed experimental scheme of the proposed Computer Vision System (CVS), illustrating the integrated workflow from biological sample preparation to automated digital quantification.

Figure 2. Lighting subsystem components and general assembly: (a) Chamber design; (b) Electronic control circuit.

Figure 3. Workflow for image acquisition and processing.

Figure 4. Impact of optical conditioning. Photo taken with (a) Ambient light and (b) in the chamber.

Figure 5. Comparison of microcapillaries under different light sources and levels. (a) Blue LED with high illumination. (b) Blue LED with low illumination. (c) Yellow LED with high illumination. (d) Yellow LED with low illumination. (e) Gold LED with high illumination. (f) Gold LED with low illumination.

Figure 6. Distribution of pixel intensity (a.u.) across different spectra.

Figure 7. Proposed sensor operation algorithm.

Figure 8. Automatic segmentation under diverse conditions. (a) Inclined sample. (b) Vertical alignment. (c) Incomplete filling.

Figure 9. Comparison of uncertainty for: (a) Fat content and (b) Energy Density. For

Δ F

, blue and green curves represent analog and CVS measurement, respectively. For

Δ E D

, light blue and red curves denote analog and CVS quantification.

Figure 9. Comparison of uncertainty for: (a) Fat content and (b) Energy Density. For

Δ F

, blue and green curves represent analog and CVS measurement, respectively. For

Δ E D

, light blue and red curves denote analog and CVS quantification.

Figure 10. Uncertainty intervals for: (a) analog and (b) digital measurement.

Figure 11. Performance of ML Algorithms for Estimate Cream Fraction (c) for: (a) Gray Scale, (b) RGB, and (c) RGB + Gray Scale.

Figure 12. Comparison of Regression Models for Cream Fraction (c) estimation using different feature sets: Gray Scale for (a) Validation and (b) Test. RGB for (c) Validation and (d) Test. RGB + Gray Scale for (e) Validation and (f) Test.

Figure 13. Feature importance analysis using SHAP values (

\times 10^{- 3}

) for: (a) Grayscale, (b) RGB, and (c) combined RGB + Gray Scale features.

Figure 13. Feature importance analysis using SHAP values (

\times 10^{- 3}

) for: (a) Grayscale, (b) RGB, and (c) combined RGB + Gray Scale features.

Table 1. Statistical distribution of pixel value (a.u.).

Color	Level	Q1	Q2	Q3	Min	Max	DR ¹
Blue	High	82.0	85.0	89.0	61	232	171
Blue	Low	21.0	21.0	21.0	21	142	121
Gold ²	High	68.0	74.0	80.0	34	253	219
Gold ²	Low	21.0	21.0	34.0	3	229	226
Yellow	High	70.0	76.0	87.0	39	253	214
Yellow	Low	28.0	34.0	39.0	21	228	207

¹ DR is Dinamic Range. ² Bold values indicate the optimal performance of the Gold spectrum.

Table 2. ML models, presets, and hyperparameter configurations used in the analysis.

Model ID	Preset	Hyperparameters
EL a	Efficient Linear Least Squares	Learner: Least squares
EL b	Efficient Linear SVM	Learner: SVM
ESB a	Bagged Trees	Minimum leaf size: 8
ESB b	Boosted Trees	Minimum leaf size: 8
GPR a	Exponential GPR	Basis function: Constant
GPR b	Matern 5/2 GPR	Basis function: Constant
GPR c	Rational Quadratic GPR	Basis function: Constant
GPR d	Squared Exponential GPR	Basis function: Constant
KN a	Least Squares Regression Kernel	Learner: Least Squares Kernel
KN b	SVM Kernel	Learner: SVM
LR a	Interactions Linear	Terms: Interactions
LR b	Linear	Terms: Linear
LR c	Robust Linear	Terms: Linear
LR d	Stepwise Linear	Initial terms: Linear
NN a	Bilayered NN	N° of fully connected layers: 2
NN b	Medium NN	N° of fully connected layers: 1
NN c	Narrow NN	N° of fully connected layers: 1
NN d	Trilayered NN	N° of fully connected layers: 3
NN e	Wide NN	N° of fully connected layers: 1
SVM a	Coarse Gaussian SVM	Kernel function: Gaussian
SVM b	Cubic SVM	Kernel function: Cubic
SVM c	Fine Gaussian SVM	Kernel function: Gaussian
SVM d	Linear SVM	Kernel function: Linear
SVM e	Medium Gaussian SVM	Kernel function: Gaussian
SVM f	Quadratic SVM	Kernel function: Quadratic
Tree a	Coarse Tree	Minimum leaf size: 36
Tree b	Fine Tree	Minimum leaf size: 4
Tree c	Medium Tree	Minimum leaf size: 12

Table 3. Comparison of Relative Errors between Analog Error and Digital Error Measurement.

Parameter	Analog Error (%)	Digital Error (%)	R.E.I. (%) ¹
F	21.21	9.00	57.6
$E D$	11.52	4.90	57.5

¹ R.E.I. is the Relative Error Improvement, representing the percentage reduction in the relative error achieved by the gold spectrum CVS compared to manual quantification.

Table 4. Comparison of methodologies in recent milk and food analysis studies.

Authors	Methodology	Task	Target	Metric	Key Result
[16]	Electrochemical	Monit.	Nutrients	Access.	Biomarker
[13]	Electrochemicals	Quant.	Glucose	Accuracy	96.8–104.1%
[31]	CNN	Classif.	Cow yield	Accuracy	85.64%
[32]	G.B.	Predict.	Production	Accuracy	87.9%
This Work	CVS + ML	Quant.	Fat/Energy	$R^{2}$	0.867

Note: Quant. = Quantification; Monit. = Monitoring; G.B. = Gradient Boosting; Predict. = Prediction; Classif. = Classification; Access. = Accessibility; R.E.I. = Relative Error Improvement.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huamanga-Chumbes, L.E.; Sacoto-Cabrera, E.J.; Lloret, J.; Silva-Alvarado, V.L.; Huicho-Mendigure, A.; Moreno-Cardenas, E. Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries. Sensors 2026, 26, 1756. https://doi.org/10.3390/s26061756

AMA Style

Huamanga-Chumbes LE, Sacoto-Cabrera EJ, Lloret J, Silva-Alvarado VL, Huicho-Mendigure A, Moreno-Cardenas E. Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries. Sensors. 2026; 26(6):1756. https://doi.org/10.3390/s26061756

Chicago/Turabian Style

Huamanga-Chumbes, Lujan E., Erwin J. Sacoto-Cabrera, Jaime Lloret, Vinie Lee Silva-Alvarado, Alfz Huicho-Mendigure, and Edison Moreno-Cardenas. 2026. "Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries" Sensors 26, no. 6: 1756. https://doi.org/10.3390/s26061756

APA Style

Huamanga-Chumbes, L. E., Sacoto-Cabrera, E. J., Lloret, J., Silva-Alvarado, V. L., Huicho-Mendigure, A., & Moreno-Cardenas, E. (2026). Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries. Sensors, 26(6), 1756. https://doi.org/10.3390/s26061756

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning-Driven Computer Vision System for Automated Fat and Energy Quantification in Human Milk Microcapillaries

Abstract

1. Introduction

2. Related Works

3. Materials and Methods

3.1. Human Milk Samples and Experimental Design

3.2. Hardware Deployment and Optical Environment

3.2.1. Light Sensitivity

3.2.2. Spectral Optimization

3.3. Mathematical Framework and Uncertainty Analysis

3.3.1. Error Propagation Model

3.3.2. Mathematical Model for Uncertainty Analysis

3.4. Computer Vision Pipeline

3.4.1. Hierarchical Segmentation and ROI Definition

3.4.2. Adaptive Cropping

3.4.3. Multichannel Statistical Feature Extraction

3.5. Predictive Modeling via ML Regression Algorithms

4. Results

4.1. Segmentation Robustness Analysis

4.2. Uncertainty Analysis

4.3. Performance Analysis of Regression Models

4.4. Feature Importance Analysis

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI