Next Article in Journal
Multiple Stakeholder Partition-Based Interactive-Game Voltage Control for Distribution Networks
Previous Article in Journal
Unravelling the Regulation of Asphaltene Deposition by Dispersants Through Macro-Stability in Micro-Mechanism
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Rapid Detection of Protein Content in Fuzzy Cottonseeds Using Portable Spectrometers and Machine Learning

1
College of Mechanical and Electrical Engineering, Tarim University, Alar 843300, China
2
Cotton Engineering Research Institute, Anhui University of Finance and Economics, Bengbu 233030, China
*
Author to whom correspondence should be addressed.
Processes 2025, 13(10), 3221; https://doi.org/10.3390/pr13103221
Submission received: 9 September 2025 / Revised: 3 October 2025 / Accepted: 8 October 2025 / Published: 10 October 2025
(This article belongs to the Section Automation Control Systems)

Abstract

This study developed a rapid, non-destructive method for the quantitative detection of protein in cottonseed by integrating near-infrared (NIR) fiber spectroscopy with chemometric machine learning. The establishment of this method holds significant importance for the rational and efficient utilization of cottonseed resources, advancing research on the genetic improvement of cottonseed nutritional quality, and promoting the development of equipment for raw cottonseed protein detection. Fuzzy cottonseed samples from three varieties were collected, and their NIR fiber-optic spectra were acquired. Reference protein contents were measured using the Kjeldahl method. Spectra were denoised through preprocessing, after which informative wavelengths were selected by combining Uninformative Variable Elimination (UVE) with Competitive Adaptive Reweighted Sampling (CARS) and the Random Frog (RF) algorithm. Partial least squares regression (PLSR), least-squares support vector machine (LSSVM), and support vector regression (SVR) models were then constructed to predict protein content. Model performance was assessed using the coefficient of determination (R2), root-mean-square error (RMSE), residual predictive deviation (RPD), and range error ratio (RER). The results indicate that the standard normal variate (SNV) is the most effective preprocessing step. The best performance was achieved by the LSSVM model coupled with UVE + CARS, yielding R2 = 0.8571, RMSE = 0.0033, RPD = 2.7078, and RER = 10.72, outperforming the PLSR and SVR counterparts. These findings provide technical support for the rapid detection of fuzzy cottonseed protein and lay the groundwork for the development of related detection equipment.

1. Introduction

In recent decades, China’s rapid development in animal husbandry has spurred corresponding growth in the feed industry. The healthy development of the feed sector relies on a stable raw material supply system. According to a report by the China Feed Industry Association, China’s protein feed gap—including soybeans—is projected to exceed 124 million tons by 2035. Feed demand continues to rise due to growing demand for livestock products and structural shifts, making shortages increasingly prominent [1]. Cottonseed is rich in protein and oil, and the protein content of cottonseed meal after dehulling and oil extraction far exceeds that of major feed crops like corn and wheat; crude protein content varies significantly among different varieties of cottonseed meal, ranging from approximately 39.12% to 59.63% [2]. China’s annual cottonseed production reaches approximately 10 million tons. According to a notice from the General Office of the Ministry of Agriculture and Rural Affairs regarding the “Three-Year Action Plan for Reducing Soybean Meal Replacement,” cottonseed protein has considerable potential to replace soybean meal protein [3,4]. According to an industry report (QYResearch, “Current Status and Future Development Trends of the Global and Chinese Cottonseed Protein Market, 2025–2031”), the global dephenolized cottonseed protein market size reached approximately $441 million in 2024 and continues to grow steadily. Representative domestic enterprises include Xinjiang Chenguang Biotechnology Group Co., Ltd., Tumxuk, China. and Xinjiang Jinlan Plant Protein Co., Ltd., Shihezi, China. When procuring cottonseed, feed enterprises typically classify and price purchases based on nutritional indicators (particularly crude protein content). Consequently, they tend to select cottonseed with higher crude protein levels to enhance cost-effectiveness. However, there is currently a lack of rapid prediction technology for crude protein content in raw cottonseed. When purchasing raw cottonseed, pricing is only roughly determined based on variety and origin. Therefore, it is necessary to develop a cottonseed protein content detection technology to protect the economic interests of feed and cotton seed processing enterprises.
Currently, to determine the protein content of raw cottonseed, the seed’s fluff and hull must be removed, and the cottonseed kernel ground into powder. The protein content is then measured using chemical methods [5]. According to GB 5009.5-2016 “Determination of Protein in Foods” [6], the primary chemical methods for measuring protein content in cottonseed include the Kjeldahl method [7], spectrophotometric method [8], and combustion method. Among these, the Kjeldahl method is the most widely applied. The procedure involves the following steps: First, prepare the necessary chemical reagents. Catalytically heat the protein in the cottonseed to decompose it, producing ammonia that reacts with sulfuric acid to form ammonium sulfate. Alkaline distillation releases the ammonia, which is absorbed by boric acid and then titrated with a standard hydrochloric acid solution. Finally, nitrogen content is calculated based on acid consumption, multiplied by a conversion factor to yield protein content. This method is complex, time-consuming, and destructive, failing to meet the demands of feed enterprises [9].
To achieve rapid non-destructive detection of protein content in cottonseed, researchers have conducted relevant studies using spectroscopic methods. Qin et al. [10] employed partial least squares regression to construct a near-infrared calibration model for protein content in cottonseed meal powder using 207 samples of cottonseed material grown across multiple years and locations, achieving an R2 value of 0.933. Huang et al. [11] collected Fourier transform NIR reflectance spectra of ginned cotton seeds. They employed a nonlinear least-squares support vector machine combined with a Monte Carlo non-informative variable elimination method to construct a near-infrared calibration model for whole cotton seed protein content, achieving an R2 of 0.959 for the protein model and enabling rapid analysis of whole cotton seed nutritional quality. Ma et al. [12] collected Fourier-transform NIR reflectance spectra data from fuzzy cottonseeds. They optimized the model using spectral preprocessing methods such as multiple scattering correction and first-order derivatives, and established a rapid NIR determination model for protein content in raw cotton seeds via random forest analysis, achieving an R2 value as high as 0.9459. Yang et al. [13] determined the chemical values of cotton seed protein content using the Kjeldahl method. They collected spectral characteristic information of three different cotton seed morphologies—raw seeds, polished seeds, and kernels—using a near-infrared spectrometer. After applying various mathematical and scattering corrections to the raw spectra, they established near-infrared spectral models for rapid non-destructive detection of protein content in raw seeds, polished seeds, and kernels using an improved partial least squares method. The models achieved R2 values of 0.957, 0.971, and 0.989, respectively. The models achieved R2 values of 0.957, 0.971, and 0.989, respectively.
NIR reflection spectroscopy is an effective method for determining the protein content of cottonseed. Combining NIR reflectance spectroscopy with machine learning has also been successfully applied to the detection of other indicators in cottonseed, such as oil content [14], fatty acid content [15], gossypol content [16], and phytic acid [17]. The aforementioned studies for detecting cottonseed protein content all used Fourier NIR scanning spectroscopy, which collects more detailed data and provides theoretical support for protein content detection. However, this method has the drawbacks of slow detection speed and high equipment costs. Compared to the FT-NIR scanning method for samples, the sample collection approach using fiber optic probes and multi-grain sample pools employed in this study offers superior detection speed. While FT-NIR boasts higher resolution and stable optical paths, fiber optic NIR instruments are slightly less sophisticated in specifications but are more cost-effective and better aligned with industrial needs. However, there is limited research on its application in cottonseed prediction, primarily due to the difficulty in collecting NIR spectral data from multiple cottonseed samples using fiber optic spectroscopy [18]. Additionally, fuzzy cottonseed significantly increases the difficulty of NIR fiber optic spectroscopy detection due to the presence of cotton lint. Only Li et al. [4] employed near-infrared fiber optic spectroscopy to collect near-infrared spectral data of fuzzy cottonseed. Through spectral preprocessing and feature wavelength selection algorithms, they established a regression model for fuzzy cottonseed oil content, thereby achieving the detection of oil content in fuzzy cottonseed. However, no literature has been found regarding studies on detecting the protein content of fuzzy cottonseed using NIR fiber optic spectroscopy.
To this end, this study focuses on fuzzy cottonseed as the research subject, constructing a multi-sample pool of fuzzy cottonseed and collecting its near-infrared (NIR) spectral profiles of the samples using an NIR fiber optic spectrometer. The NIR spectral profiles of the fuzzy cottonseed are then preprocessed using relevant spectral data preprocessing algorithms to reduce noise during the experimental process. Subsequently, a spectral feature wavelength selection algorithm is used to choose a set of key wavelengths that reflect the protein content of fuzzy cottonseed. Finally, a regression model for detecting the protein content of fuzzy cottonseed is established using machine learning algorithms. Therefore, establishing a rapid, non-destructive, and efficient detection technique for fuzzy cottonseed protein will accelerate the development of corresponding rapid prediction devices, thereby accelerating the development of corresponding rapid detection devices.

2. Materials and Methods

2.1. Fuzzy Cottonseed Sample

To obtain fuzzy cottonseed samples with different protein content distributions, 3600 fuzzy cottonseeds from three varieties (1200 seeds per variety) were selected from Xinlai Da Pengfeng Cotton Industry Co., Ltd. in Alar, China. The three varieties were Xinluzao-50, Xinluzao-49, and Xinluzao-27. To maintain sample consistency, the fuzzy cottonseeds of all three varieties were sourced from Alar, China. To ensure sample uniformity and stability, fuzzy cotton seeds were carefully selected using the boiling water scalding method. Each type of fuzzy cotton seed was scalded with boiling water and stirred for 1 min, then mixed with three times its volume of cool water until evenly distributed. Healthy seeds exhibiting deep brown or dark reddish-brown hues were selected, dried at 38–40 °C, allowed to reach moisture equilibrium for 2 days, and stored in sealed containers. Selected fuzzy cotton seeds were of comparable size. For each variety, 1200 seeds were selected using this method and randomly divided into 40 groups of 30 seeds each, yielding a total of 120 groups across the three varieties. The samples were stored at 20 °C with a relative humidity of 60%, and NIR spectral data were collected while determining the protein content. Table 1 shows the protein content distribution ranges of the fuzzy cottonseed samples from the three varieties. The average protein content for each variety was 39.326%, 40.619%, and 40.851%, respectively. By selecting samples from these three varieties, a range of protein content distributions can be obtained, thereby enhancing the reliability of subsequent modeling results.

2.2. NIR Spectroscopy Acquisition

2.2.1. Spectral Acquisition System

As shown in Figure 1, the NIR spectroscopy data acquisition system for cottonseed consists of four components: a computer, an NIR spectrometer, an optical fiber probe, and a sample cell. The instrument model is the SupNIR-1500 portable NIR analyzer (manufactured by Hangzhou Polytech Co., Ltd., Hangzhou, China), with a wavelength range of 1000 nm to 1800 nm and a data sampling interval of 1 nm. It employs a grating scanning configuration and is equipped with a fiber optic probe (featuring a 10 W NIR light source). The sample cell is a transparent cuvette measuring 6 cm in length, 5 cm in width, and 1 cm in thickness.

2.2.2. Spectral Analysis of Fuzzy Cottonseed

Thirty cottonseed grains were uniformly distributed in a 5 × 6 grid within the sample cell. NIR spectral data were collected from five distinct positions within the cell. The spectral average from these five positions was calculated as the spectral data for a single sample. A total of 120 cottonseed samples yielded NIR reflectance spectral data. Figure 1 indicates the positions where cottonseed spectra were collected, marked by red circles.

2.3. Determination of Protein Content in Samples

After collecting the spectral data of cottonseeds, a sample containing 30 cottonseeds was processed to remove lint and hulls, yielding cottonseed kernels, which were then ground into powder. The protein content of the cotton seed kernel powder was then determined using the Kjeldahl method, following the experimental procedures specified in GB 5009.5-2016 “Determination of Protein in Foods” [6].
The main steps of the Kjeldahl nitrogen determination method are as follows: Using weighing paper, take approximately 0.4 g of each group of cottonseed samples and place them into digestion tubes. Add catalyst tablets (composed of copper sulfate and potassium sulfate) and an appropriate amount of concentrated sulfuric acid to the digestion tubes. Then label them and place them in the digestion furnace for digestion. Digestion occurs in two stages: first, digestion at 200 °C for one hour, followed by digestion at 420 °C for one hour. After digestion, remove the tubes. The liquid inside will appear green and transparent. After cooling for one hour, it turns pale blue and transparent. During the digestion period, prepare the relevant standard solutions required for the automatic Kjeldahl nitrogen analyzer. Finally, use the automated Kjeldahl nitrogen determinator (preheat the instrument for at least 15 min, set the standard solution concentration, configure the desired program, and input the sample weight into the program) to prediction the protein content of the cooled liquid, thereby determining the protein content of the raw cottonseed sample. The Kjeldahl nitrogen determinator model used in this study is an 8400 Kjeltec™ Automated Kjeldahl Nitrogen Determinator, FOSS Analytical A/S, Hillerød, Denmark.

2.4. Dataset Preparation

The partitioning of the fuzzy cottonseed NIR spectral dataset directly affects the reliability of the protein content detection model for fuzzy cottonseed established by subsequent machine learning algorithms. If the number or representativeness of the calibration samples is insufficient, the fuzzy cottonseed protein prediction model will suffer from underfitting; conversely, an excessive number of samples will lead to overfitting of the model [19]. Common dataset partitioning methods in the field of spectral analysis include the Kennard-Stone (K-S) algorithm and the Sample Set Partitioning Based on Joint X-Y Distance (SPXY) method [20]. The K-S algorithm assigns fuzzy cottonseed samples with relatively large Euclidean distances in the variable space to the calibration set; however, this method does not account for the influence of the sample concentration variable. The SPXY method, based on the K-S algorithm, redefines the relative Euclidean distance of fuzzy cottonseed samples by incorporating sample concentration (protein content), thus avoiding the problem of uneven sample distribution caused by the K-S method [21]. Given that SPXY offers advantages in enhancing inter-sample variability and representativeness, this study used the SPXY method [22] to partition the NIR data and protein content data of 120 fuzzy cottonseed samples into calibration and prediction datasets at a 3:1 ratio. The calibration set consisted of 80 samples, with 29, 33, and 18 samples from Xinluzao-50, Xinluzao-49, and Xinluzao-27, respectively, and an independent prediction set included 40 samples, with 11, 7, and 22 samples from Xinluzao-50, Xinluzao-49, and Xinluzao-27, respectively.

2.5. Pretreatment of Spectral Data

During the collection of fuzzy cottonseed NIR spectral data, factors such as changes in ambient temperature and light, as well as the instrument and sample particle size, can result in the inclusion of noisy data in the fuzzy cottonseed NIR spectral data [22,23]. The aforementioned noisy data can cause significant interference in the subsequent establishment of the fuzzy cottonseed protein content prediction model, potentially leading to an unreliable model. Therefore, it is essential to use relevant spectral data preprocessing algorithms to eliminate or reduce noise interference [24]. Common spectral data preprocessing methods include Standard Normal Variable transformation (SNV), Savitzky–Golay convolution smoothing (SG), normalization, first derivative (1D), and Multiplicative Signal Correction (MSC) [25,26]. MSC and SNV are mainly used to eliminate surface scattering noise from fuzzy cottonseed samples. SG effectively removes random noise, 1D eliminates or reduces baseline and background interference, and normalization corrects spectral variations caused by slight optical path differences. This study employs MATLAB R2022a to preprocess spectral data, select characteristic wavelengths, and develop predictive models.

2.6. Feature Wavelength Point Selection

The fuzzy cottonseed NIR spectral data collected in this study consists of 800 dimensions, containing both valid information for protein content detection and a significant amount of irrelevant information. This irrelevant information not only leads to data redundancy but also severely interferes with the speed and accuracy of the fuzzy cottonseed protein content prediction model established subsequently. Therefore, it is necessary to use relevant feature wavelength extraction algorithms to select the information in the NIR spectral data that reflects the protein content of fuzzy cottonseed, thereby reducing the dimensionality of the fuzzy cottonseed NIR data. This study used Uninformative Variable Elimination (UVE), Competitive Adaptive Reweighted Sampling (CARS), and Random Frog (RF) algorithms to extract the feature wavelengths that reflect the protein content of fuzzy cottonseed.
The UVE algorithm introduces random noise of the same dimension into the original NIR spectral matrix of fuzzy cottonseed, constructing a composite feature matrix that combines real spectral variables with irrelevant noise. Subsequently, a model was established using partial least squares regression, with its regression coefficients serving as key indicators for evaluating the stability of each wavelength variable. UVE compares these statistical values with the maximum stability of the random noise variables and eliminates the uninformative variables whose stability is below the threshold [27]. The remaining variables form a new feature subset, which is then used to establish a faster and more accurate prediction model. CARS performs adaptive reweighting sampling based on the exponential decay function, calculating the absolute value weights of regression coefficients. Through the theory of survival of the fit prediction, it eliminates wavelength variables with smaller weight values from the model. Finally, as the number of samples increases, the cross-validation root mean square error (RMSECV) value continuously decreases. When RMSECV reaches its minimum, the corresponding wavelength variable set constitutes the extracted spectral feature wavelength variables [28]. The RF algorithm is a feature wavelength selection method based on probabilistic sampling [29]. It first randomly initializes a set of wavelength points to construct the initial prediction model, and then changes the wavelength combination state through operations such as “addition,” “deletion,” and “exchange” of variables, simulating the jumping behavior of frogs. After each jump, the acceptance probability is calculated based on the model’s prediction performance, and the new wavelength combination is accepted or rejected according to the Metropolis–Hastings criterion. Finally, the frequency at which each wavelength point is selected during the sampling process is counted, and the most characteristic wavelength set is chosen.

2.7. Modeling Methods

The feature wavelengths selected by the aforementioned algorithms still cannot directly predict the protein content of fuzzy cottonseed; machine learning algorithms are required to achieve the prediction of protein content. This study repeatedly attempted to use three regression algorithms—Partial Least Squares Regression (PLSR), Least Squares Support Vector Machine (LSSVM), and Support Vector Regression (SVR)—to establish a protein content prediction model for fuzzy cottonseed. PLSR is an algorithm that combines principal component analysis and multivariate linear regression. It projects spectral data and fuzzy cottonseed protein content data into a new space, where it seeks a multivariate linear regression model between the independent and dependent variables to predict protein content [30]. LSSVM transforms the inequality constraints in SVM into equality constraints, thereby simplifying the solution of Lagrange multipliers. This converts the quadratic programming problem into a linear equation system, reducing computational complexity and improving calibration speed and convergence accuracy. Therefore, this paper employs a radial basis function (RBF) kernel with a penalty parameter of 100 and a kernel width of 25 to achieve an LSSVM with good generalization capability. Ten-fold cross-validation is used to tune the hyperparameters [31,32], with the 10-fold cross-validation applied exclusively within the calibration set for hyperparameter optimization. The SVR model employs the concept of support vectors to perform a nonlinear mapping of low-dimensional data into a high-dimensional space, thereby enabling regression analysis within that high-dimensional space. The advantage of the SVR model lies in its lack of restrictions on the distribution of the data, effectively addressing challenges associated with small sample sizes, nonlinear relationships, and high dimensionality [33]. In this study, the SVR uses the RBF kernel.

2.8. Model Evaluation Indicators

The evaluation metrics for regression models primarily include the coefficient of determination (R2), Root Mean Square Error (RMSE), Residual Predictive Deviation (RPD), and Range Error Ratio (RER). R2 reflects the model’s ability to explain total variation, with values ranging from 0 to 1. The closer R2 is to 1, the better the model’s performance. RMSE primarily measures the deviation between the observed and predicted values. A smaller RMSE indicates better predictive performance of the model. RMSEC/RMSEP represent the root mean square error for the calibration/prediction sets, respectively. RPD is the ratio of the standard deviation of the sample’s true values to RMSE, which characterizes the model’s predictive ability. RPD is the ratio of the standard deviation of the true sample values to the RMSE, characterizing the model’s predictive capability. Generally, an RPD value exceeding 1.4 indicates the model can be used for rough sample evaluation, while an RPD value exceeding 2 signifies excellent predictive capability. RER is the ratio of the true range of the sample to the RMSE, reflecting the model’s performance across the entire data range. When 7 ≤ RER < 10, the model is suitable for screening and calibration; when RER ≥ 10, the model is suitable for quantitative analysis
R 2 = 1 i 1 n y ^ i y i 2 i 1 n y ^ i y m c a n
R M S E = 1 n i 1 n y ^ i y i
R P D = s t d p / R M S E p
R E R = m a x y p m i n y p R M S E p
In this equation, the variable n denotes the total number of samples, which may refer to either the calibration or prediction set, depending on whether the calculation is based on the calibration set or the prediction set. y ^ i Is the predicted value of the i -th cottonseed sample from the model, y i is the actual value of the i -th cottonseed sample, y m c a n indicates the mean of all actual cottonseed sample values, and s t d p denotes the standard deviation of these actual values in the prediction set. Additionally, R M S E p represents R M S E for the prediction set. y p corresponds to the protein content of fuzzy cottonseed in the prediction set.

3. Results and Analysis

3.1. Analysis of Spectral Data from Fuzzy Cottonseeds

Figure 2 displays the average spectra for the three cottonseed samples listed in Table 1. The findings suggest that an increase in protein content within the cottonseed corresponds to a stronger reflectance in the 1000~1800 nm wavelength range. In general, the spectral trends of different cottonseed samples remain consistent, but there are significant differences in reflectance levels, indicating that protein content significantly affects the spectral reflectance characteristics. All samples exhibited distinct valleys in the 1100~1140 nm, 1290~1330 nm, and 1380~1420 nm bands. Specifically, the 1100~1140 nm band minimum may be associated with the second harmonic of C–H bonds in amino acids within cottonseed; The 1290~1330 nm trough may be associated with the second harmonic of N–H bonds in cotton seed proteins; The 1380~1420 nm trough may be related to the first harmonic of O–H bonds in cotton fiber or water/proteins within cotton seeds.

3.2. Comparison of Different NIR Spectroscopy Preprocessing Methods

The raw spectral data, as shown in Figure 3a, exhibits significant baseline drift and noise. To address these issues in the raw spectra, this study applied MSC, SG, 1D, Normalization, and SNV preprocessing techniques to the raw spectra. A PLSR model was established to predict crude cottonseed protein content, and the aforementioned preprocessing algorithms were evaluated. Specific parameters are detailed in Table 2. Figure 3 displays the results of these five preprocessing methods. The MSC-processed spectrum differs significantly from the raw spectrum. It exhibits an overall increase in spectral values. Furthermore, while the raw spectrum in Figure 3 is scattered across the y-value range with indistinct peaks, the processed spectrum shows a tendency toward clustering and forms more pronounced troughs. SNV processing yields results similar to MSC, with the key difference being a substantial shift in the distribution range of y-values. SG produces smoother lines with more pronounced peaks while preserving the original spectral shape. One-dimensional processing reduces most absorbance values to near zero, highlighting essential information. Normalization reduces interference from irrelevant data, further enhancing the visibility of characteristic peaks.
In Table 2, the PLSR model after SNV preprocessing performed the best, with an R2 of 0.6948 for the prediction set and an RPD of 1.8445, indicating that the SNV-preprocessed fuzzy cottonseed spectral data have the potential for protein detection. Therefore, this study selects SNV as the preprocessing algorithm for the fuzzy cottonseed NIR spectral data, which is applied to the subsequent data analysis. It is noteworthy that, regardless of whether preprocessing is applied, the R2 of the prediction set is always greater than 0.5, and the RPD exceeds 1.5, further confirming the strong correlation between the fuzzy cottonseed NIR spectral data and its protein content. Therefore, this study demonstrates the feasibility of using NIR spectroscopy to predict and detect the protein content of fuzzy cottonseed.

3.3. Sensitive Band Analysis of Cottonseed

After performing SNV preprocessing, this study used the UVE algorithm for preliminary selection of feature wavelength points and identified spectral ranges related to the protein content of fuzzy cottonseed. The results of the feature selection process are illustrated in Figure 4, where the blue lines depict the stability values of the cottonseed spectral variables, while the green lines represent the stability values corresponding to noise variables. The red circles highlight the spectral features identified by the UVE algorithm. The two black lines set the threshold for the variable selection process. Variables located outside these black lines are regarded as relevant, whereas those within the lines are deemed irrelevant. Ultimately, UVE identified a total of 260 feature wavelengths, predominantly distributed in the ranges of 1070~1130 nm, 1160~1200 nm, 1250~1280 nm, 1380~1460 nm, 1480~1630 nm, and 1660~1740 nm.
After UVE screening, 260 feature wavelength points were selected. Given the large number of feature wavelengths extracted by UVE, some wavelengths may contain little or no information, potentially affecting the performance of the prediction model. To establish a more stable prediction model, further optimization of the model combination is necessary. Building upon the UVE-screened feature wavelength dataset, this study employs CARS and RF to further identify new feature wavelength points. The CARS was set to 100 iterations. Figure 5 illustrates the changes in the number of feature wavelengths selected by CARS, RMSE, and regression coefficients with the number of runs. As shown in the figure, when the number of runs increased from 1 to 24, the decrease in the number of feature wavelengths slowed down, and RMSECV gradually decreased. This indicates that during the 1–24 runs, a significant number of irrelevant spectral variables were eliminated, and the model accuracy continuously improved. As the number of runs continued to increase, RMSECV either slowly or rapidly increased, while the regression coefficient steadily grew. At 24 runs, RMSECV reached its minimum value, with 37 wavelengths retained. The results indicate that the characteristic wavelength points reflecting crude protein content in cottonseed are primarily distributed between 1070–1130 nm, 1160–1200 nm, and 1380–1460 nm. These bands may be associated with the second harmonics of C–H and N–H bonds in proteinaceous substances within cottonseed [15].
Following the selection of feature wavelengths associated with fuzzy cottonseed protein content using the UVE algorithm. The feature wavelengths selected by UVE were used as the input for RF to proceed with the next step of feature wavelength selection. The RF method utilizes a data centralization technique, integrated with 10-fold cross-validation and 1000 simulation iterations. The wavelengths chosen by the RF algorithm are illustrated in Figure 6, and the red dots in the figure represent the feature wavelengths selected by RF. Ultimately, 15 spectral wavelength points were identified to construct the prediction model for cottonseed protein content. This result aligns with the characteristic wavelength distribution selected by UVE-CARS and may be related to the second harmonics of C-H and N-H bonds in cotton seed proteins.

3.4. Model Development and Comparison

This study employed a combination of UVE, CARS, and RF algorithms, or each method individually, to screen characteristic wavelength points. These were integrated with PLSR, LSSVM, and SVR methods to construct predictive models for crude cottonseed protein content. Model evaluation was conducted using the validation set, with results presented in Table 3. The optimal model was the regression prediction model established by LSSVM based on the UVE-CARS combination algorithm, yielding an R2 of 0.8571, with an RSEM of 0.0033, RPD of 2.7078, and RER of 10.72. This indicates the model’s high accuracy and reliability in predicting raw cottonseed protein content. Among PLSR and LSSVM models, the regression prediction model based on UVE-CARS demonstrated the best performance. Among the SVR models, the UVE-RF-based model yielded the best results. This outcome demonstrates that the raw cottonseed protein content prediction model constructed by first preliminarily screening feature wavelength points using UVE, then further refining them with CARS and RF, outperforms models built using UVE, CARS, or RF alone. This finding indicates that individual UVE, CARS, or RF algorithms cannot effectively screen optimal feature wavelengths. However, the combined algorithm maximizes the screening and retention of feature wavelengths related to protein content from the original spectrum. Comparing the three modeling methods—PLSR, LSSVM, and SVR—revealed LSSVM as the optimal model. This stems from LSSVM converting the inequality constraints in SVM to equality constraints, thereby simplifying the solution of the Lagrange multipliers. This transformation converts the quadratic programming problem into solving a system of linear equations, reducing computational complexity while enhancing training speed and convergence accuracy [32].

4. Discussion

In this study, NIR fiber optic spectral data for fuzzy cottonseed were first collected, and the SNV algorithm was applied to remove surface scattering and noise from the fuzzy spectra. Following this, the UVE, CARS, and RF algorithms, either in combination or individually, were used to select feature wavelengths from the fuzzy NIR spectra of cottonseed, accurately identifying wavelength points associated with fuzzy cottonseed protein content. Subsequently, PLSR, LSSVM, and SVR machine learning methods were employed to construct the corresponding models for the protein content in cottonseed, effectively tackling the challenge of rapid protein content detection. Numerous studies have examined the detection of protein content in cottonseed. For instance, Qin et al. [10] constructed a NIR calibration model for protein content in cottonseed meal using 207 samples of cottonseed materials grown across multiple years and locations. The model employed partial least squares regression and achieved an R2 value of 0.933. Qin et al. enhanced the detection accuracy by eliminating spectral interference from cottonseed hulls and fibers, thereby improving the characterization of key chemical bond signals in cottonseed meal powder. In comparison to this study, the absence of interference from cottonseed shells and fibers facilitates the detection of chemical bonds within cottonseed kernel powder, thereby improving detection accuracy. Huang et al. [11] gathered Fourier NIR spectral data from light cottonseed and developed a protein content prediction model using MC-UVE and LSSVM, achieving an R2 of 0.959. Compared to studies employing FT-NIR, the LSSVM model based on the combination of UVE and CARS in this research achieved R2 = 0.8571, RPD = 2.7078, and RER = 10.72, meeting the criteria for quantitative analysis while offering distinct advantages in terms of speed and low cost. Therefore, the methodology adopted by this research will play a significant role in the feed processing industry, effectively addressing the key issues in the detection of protein content in cottonseed.
Due to the small size of cottonseeds, determining the protein content of individual cottonseeds using an NIR spectrometer or chemometrics presents significant challenges. Therefore, protein content can typically only be measured for a group of cottonseeds. Collecting the NIR spectra of individual fuzzy cottonseeds in each group is not only time-consuming but also requires considerable manpower. In this study, 30 fuzzy cottonseeds were grouped as a single sample, and five NIR spectra were collected from different positions of each sample. The mean value of these spectra was used as the representative spectral data for the sample. Subsequently, these spectral data underwent preprocessing, followed by feature wavelength selection. Based on the evaluation of the constructed models, this data collection approach proves to be highly effective in detecting fuzzy cottonseed protein content while ensuring operational efficiency.
Due to the light-dispersing effect of cotton seed fibers, direct PLSR modeling of raw cotton seed spectra after SNV preprocessing yielded poor protein content prediction results, indicating substantial redundant information persists even in preprocessed spectra. However, the predictive models established after UVE-CARS screening yielded significantly superior results compared to those built using the full spectrum. This indicates that feature spectra filtered by characteristic wavelengths possess greater interpretability than the full spectrum. Moreover, the number of wavelengths screened by UVE-CARS was reduced by over ninety percent relative to the full spectrum. This maximally minimized redundant information within the spectrum, thereby streamlining the predictive models.
This study employed only three cotton seed varieties, with a limited number of experimental samples. The portable spectrometer used supported only the 1000~1800 nm wavelength range, resulting in certain limitations. In future studies, if conditions permit, we plan to collect a larger number of cottonseed samples and conduct systematic comparisons across different seed kernel counts. This will enable quantitative assessment of accuracy at varying kernel densities. Additionally, we intend to acquire an advanced portable near-infrared spectrometer with a broader spectral coverage. These measures will help further optimize the existing model, enhancing its performance and generalization capabilities. The algorithms employed in this paper are relatively few. For preprocessing, techniques such as wavelet transforms could be explored. Regarding feature wavelength extraction algorithms, methods like the Successive Projection Algorithm (SPA) and Principal Component Analysis (PCA) could be tested. For modeling, subsequent studies could experiment with algorithms like Random Forests. Chemometrics research is advancing rapidly, with algorithms continually refined and optimized. Future research may explore new algorithmic modeling approaches.

5. Conclusions

This study combines NIR fiber spectroscopy with machine learning to establish a predictive model capable of accurately and effectively forecasting the protein content of fuzzy cottonseed. Prediction models for raw cottonseed protein content were developed using PLSR, LSSVM, and SVR. Results indicate that SNV serves as the optimal preprocessing algorithm. The model based on UVE-CARS demonstrated the highest predictive performance, with LSSVM yielding superior results compared to PLSR and SVR. The LSSVM model based on UVE-CARS demonstrated the highest performance, with model R2, RMSE, RPD, and RER values of 0.8571, 0.0033, 2.7078, and 10.72, respectively. The feature wavelengths reflecting cotton seed protein content in the 1000 nm~1800 nm near-infrared band were primarily distributed between 1070~1130 nm, 1160~1200 nm, 1250~1280 nm, 1380~1460 nm, 1480~1630 nm, and 1660~1740 nm. Furthermore, compared to FT-NIR, near-infrared fiber spectroscopy offers advantages such as faster detection speed and lower cost, making it highly suitable for the rapid, non-destructive testing requirements of feed enterprises. Therefore, this study provides an effective method for determining crude protein content in cottonseed. In future applications, excluding ineffective spectral bands can reduce the cost of developing future instruments, laying a foundation for related equipment research and development.

Author Contributions

X.D.: Writing—original draft, Methodology, Investigation, Formal analysis, Data curation, and Conceptualization. Q.L.: Methodology, Writing—review and editing, and conceptualization. Z.L.: Data curation. S.Z.: Data curation. H.Z.: Writing—review & editing, Supervision, and Project administration. G.J.: Writing—review & editing. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the National Natural Science Foundation of China (61701334); Tarim University Chancellor’s Fund Program (Grant No. TDZKSS202434 and TDZKSS202304); the Humanity and Social Science Research Project of Anhui Educational Committee (Grant No. 2024AH052138); and the University-Industry Collaborative Education Program of the Ministry of Education (Grant No. 231100751213206) for supporting this research.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

This research was supported by the College of Mechanical and Electrical Engineering, Tarim University, and the Cotton Engineering Research Institute, Anhui University of Finance and Economics.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Du, Z.; Zheng, P.; Yu, J.; Yan, H.; Wu, A.; Yu, B.; Chen, D. Rapeseed Protein as a Sustainable Source of Protein in Humans: Modification of Rapeseed Meal by Physical and Chemical Methods. Crit. Rev. Food Sci. Nutr. 2025, 1–11. [Google Scholar] [CrossRef]
  2. Tao, A.; Wang, J.; Luo, B.; Liu, B.; Wang, Z.; Chen, X.; Zou, T.; Chen, J.; You, J. Research Progress on Cottonseed Meal as a Protein Source in Pig Nutrition: An Updated Review. Anim. Nutr. 2024, 18, 220–233. [Google Scholar] [CrossRef]
  3. Xu, B.; Gao, M.; Wang, S.; Wang, Y.; Zhou, Z.; Hu, W. Elevated Temperature Further Inhibited Cottonseed Protein Synthesis under Severe Drought, but Promoted Cottonseed Protein Synthesis under Mild Drought. Agric. Water Manag. 2022, 272, 107871. [Google Scholar] [CrossRef]
  4. Li, Q.; Zhang, H.; Zhou, W.; Liu, R.; Dong, X.; Jin, G. Rapid and Non-Destructive Determination of Oil Content in Fuzzy Cottonseeds via near-Infrared Reflectance Spectroscopy. Microchem. J. 2024, 207, 112005. [Google Scholar] [CrossRef]
  5. Bellaloui, N.; Stetina, S.R.; Turley, R.B. Cottonseed Protein, Oil, and Mineral Status in near-Isogenic Gossypium Hirsutum Cotton Lines Expressing Fuzzy/Linted and Fuzzless/Linted Seed Phenotypes under Field Conditions. Front. Plant Sci. 2015, 6, 137. [Google Scholar] [CrossRef] [PubMed]
  6. GB 5009.5-2016; National Food Safety Standard—Determination of Protein in Foods. China Standards Press: Beijing, China, 2016.
  7. Zhuang, T.; Xin, M.; Wang, Q.; Wang, Y.; Saeed, M.; Xing, H.; Zhang, H.; Zhang, Y.; Deng, Y.; Zhang, G.; et al. Determination of Protein and Fatty Acid Composition of Shell-Intact Upland Cottonseed Using near-Infrared Reflectance Spectroscopy. Ind. Crops Prod. 2023, 191, 115909. [Google Scholar] [CrossRef]
  8. Ehnert, S.; Seehase, J.; Müller-Renno, C.; Hannig, M.; Ziegler, C. Simultaneous Quantification of Total Carbohydrate and Protein Amounts from Aqueous Solutions by the Sulfuric Acid Ultraviolet Absorption Method (SA-UV Method). Anal. Chim. Acta 2021, 1174, 338712. [Google Scholar] [CrossRef] [PubMed]
  9. Bridges, C.; Hardin, R.; McClurkin-Moore, J. Changes in Cottonseed Meal Quality during Post-Harvest Processing of Cottonseed. J. Stored Prod. Res. 2024, 108, 102371. [Google Scholar] [CrossRef]
  10. Qin, L.; Shen, X.; Chen, J.; Zhu, S. Determination of protein and gossypol content in cotton kernel powder with near infrared reflectance spectroscopy. Spectrosc. Spectr. Anal. 2010, 30, 635–639. [Google Scholar] [CrossRef]
  11. Huang, Z.; Sha, S.; Rong, Z.; Chen, J.; He, Q.; Khan, D.M.; Zhu, S. Feasibility Study of near Infrared Spectroscopy with Variable Selection for Non-Destructive Determination of Quality Parameters in Shell-Intact Cottonseed. Ind. Crops Prod. 2013, 43, 654–660. [Google Scholar] [CrossRef]
  12. Ma, Y.; Li, L.; Shi, Y.; Yin, H.; Zhao, T.; Chen, J.; Zhu, S. Determination of protein and oil content in fuzz cottonseed based on near infrared spectroscopy. Cotton Sci. 2023, 35, 211–219. [Google Scholar] [CrossRef]
  13. Yang, S.; Huang, Y.; Zhou, D.; Huang, L.; Wu, Y.; Fu, S.; Xu, Q.; Peng, J.; Kuang, M. Establishment of Rapid and Non-destructive Determination of Protein Content in Cottonseed by Near Infrared Spectroscopy. J. Chin. Cereals Oils Assoc. 2024, 39, 166–172. [Google Scholar] [CrossRef]
  14. Quampah, A.; Huang, Z.; Wu, J.; Liu, H.; Li, J.; Zhu, S.; Shi, C. Estimation of Oil Content and Fatty Acid Composition in Cottonseed Kernel Powder Using Near Infrared Reflectance Spectroscopy. J. Am. Oil Chem. Soc. 2012, 89, 567–575. [Google Scholar] [CrossRef]
  15. Wang, Q.; Xing, H.; Liu, X.; Mao, L.; Wei, Z.; Zhang, H.; Wang, L.; Wang, H.; Saeed, M.; Zhang, G.; et al. Estimation of Protein and Fatty Acid Composition in Shell-Intact Cottonseed by Near Infrared Reflectance Spectroscopy. J. Am. Oil Chem. Soc. 2020, 97, 331–340. [Google Scholar] [CrossRef]
  16. Li, C. Feasibility Study on the Use of Near-Infrared Spectroscopy for Rapid and Nondestructive Determination of Gossypol Content in Intact Cottonseeds. J. Cotton Res. 2021, 4, 13. [Google Scholar] [CrossRef]
  17. Yin, H.; Mo, W.; Li, L.; Ma, Y.; Chen, J.; Zhu, S.; Zhao, T. Near-Infrared Spectroscopy Analysis of the Phytic Acid Content in Fuzzy Cottonseed Based on Machine Learning Algorithms. Foods 2024, 13, 1584. [Google Scholar] [CrossRef] [PubMed]
  18. Wang, S.; Zhang, M.; Zhao, Z.; Qiu, X.; Zhang, R. Optical Properties and Monte Carlo Simulation of Cotton Fibers with Different Micronaire Values. Ind. Crops Prod. 2025, 223, 120254. [Google Scholar] [CrossRef]
  19. Nieto-Ortega, S.; Olabarrieta, I.; Saitua, E.; Arana, G.; Foti, G.; Melado-Herreros, Á. Improvement of Oil Valorization Extracted from Fish By-Products Using a Handheld near Infrared Spectrometer Coupled with Chemometrics. Foods 2022, 11, 1092. [Google Scholar] [CrossRef]
  20. Zaukuu, J.-L.Z.; Attipoe, N.Q.; Korneh, P.B.; Mensah, E.T.; Bimpong, D.; Amponsah, L.A. Detection of Bissap Calyces and Bissap Juices Adulteration with Sorghum Leaves Using NIR Spectroscopy and VIS/NIR Spectroscopy. J. Food Compos. Anal. 2025, 141, 107358. [Google Scholar] [CrossRef]
  21. Tian, H.; Zhang, L.; Li, M.; Wang, Y.; Sheng, D.; Liu, J.; Wang, C. Weighted SPXY Method for Calibration Set Selection for Composition Analysis Based on Near-Infrared Spectroscopy. Infrared Phys. Technol. 2018, 95, 88–92. [Google Scholar] [CrossRef]
  22. Wang, R.; Fang, Y.; Luo, W.; Chen, M.; Li, Z.; Yu, Y.; Ren, Z.; Huang, Y.; Dong, H. Quantitative Analysis of Camellia Oil Binary Adulteration Using near Infrared Spectroscopy Combined with Chemometrics. Microchem. J. 2025, 217, 115018. [Google Scholar] [CrossRef]
  23. He, C.; Jiang, X.; Li, B.; Liu, Y. A Method Based on Local Linear Reconstruction Combined with Spectral Information Entropy for Constructing and Updating Apple Origin Discrimination Models. Food Meas. 2024, 18, 6365–6377. [Google Scholar] [CrossRef]
  24. Gao, S.; Xu, J. Hyperspectral Image Information Fusion-Based Detection of Soluble Solids Content in Red Globe Grapes. Comput. Electron. Agric. 2022, 196, 106822. [Google Scholar] [CrossRef]
  25. Liu, R.; Li, Q.; Zhang, H. Rapid Identification of the Geographical Origin of the Chinese Mitten Crab (Eriocheir sinensis) Using Near-Infrared Spectroscopy. Foods 2024, 13, 3226. [Google Scholar] [CrossRef]
  26. Li, Q.; Zhou, W.; Zhang, H. Integrating Spectral and Image Information for Prediction of Cottonseed Vitality. Front. Plant Sci. 2023, 14, 1298483. [Google Scholar] [CrossRef]
  27. Liu, C.; Yu, H.; Liu, Y.; Zhang, L.; Li, D.; Zhang, J.; Li, X.; Sui, Y. Prediction of Anthocyanin Content in Purple-Leaf Lettuce Based on Spectral Features and Optimized Extreme Learning Machine Algorithm. Agronomy 2024, 14, 2915. [Google Scholar] [CrossRef]
  28. Yao, K.; Sun, J.; Cheng, J.; Xu, M.; Chen, C.; Zhou, X. Nondestructive Detection of S-ovalbumin Content in Eggs Using Portable NIR Spectrometer and MPA-CARS. J. Food Process Eng. 2023, 46, e14186. [Google Scholar] [CrossRef]
  29. Sun, J.; Yang, W.; Feng, M.; Liu, Q.; Kubar, M.S. An Efficient Variable Selection Method Based on Random Frog for the Multivariate Calibration of NIR Spectra. RSC Adv. 2020, 10, 16245–16253. [Google Scholar] [CrossRef]
  30. Zeng, S.; Zhang, Z.; Cheng, X.; Cai, X.; Cao, M.; Guo, W. Prediction of Soluble Solids Content Using Near-Infrared Spectra and Optical Properties of Intact Apple and Pulp Applying PLSR and CNN. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2024, 304, 123402. [Google Scholar] [CrossRef]
  31. Su, P.; Liang, W.; Zhang, G.; Wen, X.; Chang, H.; Meng, Z.; Xue, M.; Qiu, L. Quantitative Detection of Components in Polymer-Bonded Explosives through Near-Infrared Spectroscopy with Partial Least Square Regression. ACS Omega 2021, 6, 23163–23169. [Google Scholar] [CrossRef] [PubMed]
  32. Mo, L.; Chen, H.; Chen, W.; Feng, Q.; Xu, L. Study on Evolution Methods for the Optimization of Machine Learning Models Based on FT-NIR Spectroscopy. Infrared Phys. Technol. 2020, 108, 103366. [Google Scholar] [CrossRef]
  33. Deng, J.; Zhao, Y.; Wang, Z.; Jiang, H. Quantitative Analysis of Zearalenone in Wheat Leveraging Support Vector Machine and Olfactory Visualization Technology. Microchem. J. 2024, 206, 111470. [Google Scholar] [CrossRef]
Figure 1. NIR Spectroscopy Data Acquisition System for Cottonseed.
Figure 1. NIR Spectroscopy Data Acquisition System for Cottonseed.
Processes 13 03221 g001
Figure 2. Spectrum of Cottonseed. (a) Xinluzao-50, (b) Xinluzao-27, (c) Xinluzao-49.
Figure 2. Spectrum of Cottonseed. (a) Xinluzao-50, (b) Xinluzao-27, (c) Xinluzao-49.
Processes 13 03221 g002
Figure 3. Average spectrum of hairy cottonseed. (a) The original NIR spectra, (b) the NIR spectra processed using MSC, (c) the NIR spectra processed using SG, (d) the NIR spectra processed using 1D, (e) the NIR spectra processed using Normalization, (f) the NIR spectra processed using SNV.
Figure 3. Average spectrum of hairy cottonseed. (a) The original NIR spectra, (b) the NIR spectra processed using MSC, (c) the NIR spectra processed using SG, (d) the NIR spectra processed using 1D, (e) the NIR spectra processed using Normalization, (f) the NIR spectra processed using SNV.
Processes 13 03221 g003
Figure 4. Feature wavelength point selection using UVE.
Figure 4. Feature wavelength point selection using UVE.
Processes 13 03221 g004
Figure 5. CARS selected feature wavelength points. (a) number of sampled variables, (b) RMSECV, (c) Regression coefficients path.
Figure 5. CARS selected feature wavelength points. (a) number of sampled variables, (b) RMSECV, (c) Regression coefficients path.
Processes 13 03221 g005
Figure 6. RF selected feature wavelength points.
Figure 6. RF selected feature wavelength points.
Processes 13 03221 g006
Table 1. The Protein content of three types of cottonseeds.
Table 1. The Protein content of three types of cottonseeds.
VarietyNumber of CottonseedsProtein Content Range (%)Average Protein Content (%)
Xinluzao-504038.059~40.23239.326
Xinluzao-494039.195~41.84240.619
Xinluzao-274039.321~42.46640.851
Table 3. Comparison of Evaluation Metrics for Different Models in Predicting Cottonseed Protein Content.
Table 3. Comparison of Evaluation Metrics for Different Models in Predicting Cottonseed Protein Content.
ModelScreening MethodsVNR2CRMSECR2PRMSEPRPDRER
PLSRUVE-CARS550.77750.00410.69400.00461.84217.7493
UVE-RF200.69370.00480.66590.00481.76317.4169
CARS770.74490.00440.66310.00481.75567.3854
RF150.70140.00470.69130.00461.83427.7161
UVE4120.78430.00400.64410.00501.70817.1854
LSSVMUVE-CARS400.91230.00250.85710.00332.707810.72
UVE-RF200.89890.00280.80190.00372.28989.6324
CARS890.91070.00250.74780.00442.03818.07
RF150.90360.00270.74600.00422.02198.5056
UVE1950.79060.00400.67570.00481.78947.5276
SVRUVE-CARS410.84850.00330.73200.00461.97727.82
UVE-RF100.76820.00420.74820.00422.03088.54
CARS660.78070.00410.74000.00431.99858.4071
RF200.79230.00390.61290.00521.63786.8897
UVE3130.88410.00290.68220.00471.80777.6047
Table 2. Parameters of PLSR Models Established for Raw Cotton Seed Protein Content Using Different Pretreatments.
Table 2. Parameters of PLSR Models Established for Raw Cotton Seed Protein Content Using Different Pretreatments.
Pretreatment
Methods
Number of WavelengthsCalibration SetPrediction Set
R2CRMSECR2PRMSEPRPD
None8000.60430.00590.55140.00361.5205
MSC0.67210.00490.63280.00511.6805
SG0.62700.00570.59900.00341.6081
1D0.71810.00470.62740.00451.6683
Normalization0.61620.00560.60180.00461.6139
SNV0.70190.00470.69480.00461.8445
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dong, X.; Li, Q.; Luo, Z.; Zhang, S.; Zhang, H.; Jin, G. Rapid Detection of Protein Content in Fuzzy Cottonseeds Using Portable Spectrometers and Machine Learning. Processes 2025, 13, 3221. https://doi.org/10.3390/pr13103221

AMA Style

Dong X, Li Q, Luo Z, Zhang S, Zhang H, Jin G. Rapid Detection of Protein Content in Fuzzy Cottonseeds Using Portable Spectrometers and Machine Learning. Processes. 2025; 13(10):3221. https://doi.org/10.3390/pr13103221

Chicago/Turabian Style

Dong, Xiaofeng, Qingxu Li, Zhenwei Luo, Sun Zhang, Hongzhou Zhang, and Guoqiang Jin. 2025. "Rapid Detection of Protein Content in Fuzzy Cottonseeds Using Portable Spectrometers and Machine Learning" Processes 13, no. 10: 3221. https://doi.org/10.3390/pr13103221

APA Style

Dong, X., Li, Q., Luo, Z., Zhang, S., Zhang, H., & Jin, G. (2025). Rapid Detection of Protein Content in Fuzzy Cottonseeds Using Portable Spectrometers and Machine Learning. Processes, 13(10), 3221. https://doi.org/10.3390/pr13103221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop