Next Article in Journal
Crop Water Productivity: Within-Field Spatial Variation in Irrigated Alfalfa (Medicago sativa L.)
Previous Article in Journal
Scale-Up and Development of a Community Industrial Prototype for Red Palm Oil Production Using Advanced Microwave Technology
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Growth Curve Models and Clustering Techniques for Studying New Sugarcane Hybrids

by
Carlos David Carretillo Moctezuma
1,
María Guzmán Martínez
1,*,
Flaviano Godínez-Jaimes
1,*,
José C. García-Preciado
2,
Ramón Reyes Carreto
1,
José Terrones Salgado
3 and
Edgar Pérez Arriaga
4
1
Facultad de Matemáticas, Universidad Autónoma de Guerrero, Av. Lázaro Cárdenas S/N, Chilpancingo de los Bravo 39087, Guerrero, Mexico
2
Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Campo Experimental Tecomán, Tecomán 28930, Colima, Mexico
3
Decanato de Ciencias de la Vida y la Salud, Escuela de Ingeniería en Agronomía, Centro de Investigación en Horticultura y Plantas Nativas, Universidad Popular Autónoma del Estado de Puebla, 21 sur No. 1103, Puebla 72410, Puebla, Mexico
4
Tecnológico Nacional de México, Instituto Tecnológico de Ciudad Victoria, División de Estudios de Posgrado e Investigación, Ciudad Victoria 87010, Tamaulipas, Mexico
*
Authors to whom correspondence should be addressed.
AgriEngineering 2025, 7(4), 114; https://doi.org/10.3390/agriengineering7040114
Submission received: 24 January 2025 / Revised: 29 March 2025 / Accepted: 7 April 2025 / Published: 9 April 2025

Abstract

:
Sugarcane (Saccharum spp.) is a crop of significant industrial and nutritional value, essential for producing various products. Due to its importance, genetic improvement programs involve a rigorous selection process. In this study, growth curve models were used to analyze the maturity curves of 33 hybrids (currently in the adaptability testing phase) and 6 control varieties (MEX 69-290, ITV 92-1424, CP 72-2086, COLMEX 94-8, COLMEX 95-27, RB 85-5113) during both plant and ratoon periods at the Melchor Ocampo Sugar Mill fields in Jalisco, México. With the use of clustering techniques, the materials were classified into four maturity groups: early, early–intermediate, intermediate–late, and late. Hybrids with a larger intercept and smaller slope were classified as having early and early–intermediate maturity. Conversely, hybrids with a smaller intercept and larger slope were classified as having intermediate–late and late maturity. According to the Connectivity and Dunn indexes, the DBSCAN algorithm provides the best clustering structure for materials in the plant cycle, while for the ratoon cycle, the k-means algorithm offers the best clustering structure. This highlights the versatility of each algorithm in the context of hybrid and varietal maturity analysis. These results are crucial for optimizing the productivity and sustainability of the crop, with significant implications for the sugar industry.

1. Introduction

The development of new sugarcane varieties provides a solution to the genetic deterioration experienced by commercial varieties. This deterioration manifests as a decrease in field performance, which impacts its industrial use. The lack of new genotypes to replace the deteriorated varieties is the main factor limiting yield [1,2].
To develop new sugarcane varieties, materials adapted to environmental conditions and the needs of the agro-industry are sought, with resistance to pests, diseases, and drought, good adaptation to different soil types, low post-harvest deterioration, lower production costs, and high sucrose concentration [3,4]. The development of an outstanding commercial variety for a specific region requires approximately ten to twelve years of constant evaluation in the field and in an industrial environment. The main stages that new materials must go through include the selection of strains, trials in furrows, multiplication I, II, and III, plot establishment, adaptability, agro-industrial evaluation, and semi-commercial trials [5].
During the adaptability phase, new hybrids are evaluated in different soils and climates using agronomic variables (germination, height, density, and health) and industrial variables (sucrose percentage (PS), Brix degrees, purity, fiber, and reducing sugars). Destructive sampling is performed to analyze the maturity curve, which constitutes a key criterion for selecting the most promising materials [6].
In the agro-industrial phase, variables such as polarization (PP), Brix degrees, purity, fiber, juiciness, and stem hardness are studied. PP is a key maturity indicator, as it equates to apparent sucrose [7,8]. While biological maturity is reached when the sucrose content is optimal, this is identified through the maturity curve, which is crucial for the selection of genotypes in the sugar industry [9,10].
Since data related to the maturity curve exhibit temporal autocorrelation, it is necessary to apply appropriate statistical methodologies for analysis. In this regard, growth curve models, a class of linear mixed models (LMMs), capture variability between and within genotypes over time [11,12].
Smith’s work [13] highlights the use of LMMs in multi-environment trials to evaluate varietal performance in different geographical locations, being particularly useful for handling incomplete data, as they improve precision when modeling error variation. In plant breeding, LMMs are commonly used in multi-environment trials to select lines adapted to the climate, resistant to pests, and with higher yield and quality [12,14,15,16].
In sugarcane, commercial and new hybrids have been compared using repeated measurements of Brix degrees, PP, and PS. Quadratic regression has been applied to model sucrose accumulation, determine the optimal harvest time, and maximize yield [17]. Additionally, genotypes have been evaluated using analysis of variance and linear regression, identifying the best agro-industrial response and classifying cultivars as having early, intermediate, and late maturity [7].
Another study in Tucumán, Argentina, analyzed sucrose accumulation curves and clustering techniques to classify sugarcane genotypes using segmented polynomials. To group the materials, the estimated accumulation curve and two clustering techniques, UPGMA and k-means, were used, facilitating the selection of the most promising genotypes [10]. According to [10], clustering algorithms can be used in agriculture to classify crops. In this regard, Seta [18] applied k-means to classify sugarcane-producing regions, while Mbukwa [19] used k-means and k-medoids to identify high-yield districts for maize and bean in Tanzania.
This study used growth curve models to analyze sucrose accumulation in 33 sugarcane hybrids under adaptability trials, along with six commercial control varieties. The 39 materials were grouped using k-means, k-medoids, and DBSCAN to identify their maturity type and select the most promising for the next selection phase.

2. Materials and Methods

A total of 39 materials from the experimental fields of the National Institute of Forestry, Agricultural and Livestock Research (INIFAP) were studied. They form two groups: the control materials and the new materials. The control group corresponds to six commercial varieties, and the new materials correspond to 33 varieties of Mexican and foreign origin (Table 1). The new materials are in the adaptability phase, and all the materials were evaluated in plant and ratoon crop cycles under irrigation conditions in both seasons.
The experiment was established on 10 March 2011, at Las Pilas experimental field (19°46′48.85″ N, 104°13′57.65″ W) and the Melchor Ocampo Sugar Mill (19°47′18.0″ N, 104°14′24.3″ W), which is located in the Zacapala area of Autlán de Navarro, Jalisco, México [6]. The experimental field had loam soil, pH 5.60 , and organic matter of 1.74 with an accumulated annual precipitation of 998 mm, an altitude of 860 m above sea level, and an average annual temperature of 22.3 °C.
Regarding agronomic management, the local technological package was used, which consists of fertilization per hectare of 200-100-200 units of N-P-K, respectively, divided into two applications, plus the application of sugarcane compost (1.5 t ha 1) in one application. Irrigation was carried out every two months during the rainfed period from October to June. For pest control, quarterly releases of Trichogramma sp. were used against the sugarcane borer (Diatraea saccharalis and Eoreuma loftini). For rodent control, poisoned baits were used in two applications per year. There was no disease control, and mechanical weeding was used to combat weeds during the first three months after planting or sprouting.
The experimental unit consisted of four rows, with each row measuring 15.0 m in length and 1.4 m in width. The useful plot area consisted of the two central rows, with 2 m removed from the ends.
The SP of the materials was measured through destructive sampling on four dates (Table 2), with 30 to 34 days between the measurement dates. Seven months elapsed from planting to the first measurement date in the plant cycle, and six months from harvesting to the first measurement date in the ratoon cycle.
Accordingly, by measuring the sucrose percentage in the materials at four different times (Table 2), we have what is known as repeated measurements, which are not independent. Therefore, the variation recorded in repeated measurements will reflect the variation in the response of the material over time, hence the relevance of using growth curve models to model repeated measurements. In this case, the PS is the variable of study.
Depending on environmental, geographical, and soil conditions, commercial varieties show a specific type of maturity: early, early–intermediate, intermediate–late, or late. According to [20], Mex 69-290 and CP 72-2086 exhibit late–intermediate and early maturity, respectively. ITV 92-1424 and COLMEX 95-27 exhibit early maturity; on the other hand, COLMEX 94-8 exhibits early–intermediate maturity [21], these varieties develop in clayey (AR), loam, silty crumb, and clayey crumb (FAR), and clay–loam (ARL) soils [22] under the conditions of temperature and altitude described in Table 3. Finally, RB 85-5113 has early–intermediate maturity and grows in medium-textured soils (loam = F, FAR, silt loam = FLI) [23].

2.1. Growth Curve Model

Growth curve models are useful for studying variability between and within subjects over time, as their covariance structure accounts for the correlation between observations from the same subject and the independence between different subjects. This type of data, known as repeated measures or longitudinal data, is characterized by observations ordered over time. The main advantage of these models is that, with up to four measurements per experimental unit, it is possible to analyze the temporal response and the variation in trends between experimental units.
The matrix form of the growth curve model is given by
y = X β + Zu + ϵ ,
where y is the n × 1 response vector; X and Z are design matrices of dimensions n × p and n × q , respectively; β is the fixed-effect parameter vector of dimensions p × 1 ; u is the random-effects vector of dimensions q × 1 ; and ϵ is the error vector of dimensions n × 1 . In Model (1), X β is the fixed component, and Zu + ϵ is the random component. It is assumed that u and ϵ are independent, that is, C o v ( u , ϵ ) = 0 , and that they each have a multivariate normal distribution:
u N n ( 0 , D ) , ϵ N n ( 0 , R ) ,
where D and R are the covariance matrices of the random-effects vector and the error vector, respectively; both positive definite matrices. D denotes the variance and covariance matrix of the random effects, and R can be decomposed as R = σ 2 Λ C Λ , where Λ is a covariance matrix and C is a correlation matrix. From these assumptions, we have that the response variable has the following distribution:
y N n ( X β , V ) ,
where V = C o v ( y ) = ZDZ T + R . Both components allow for modeling the correlation and heteroscedasticity in subject measurements. In using the Henderson equations [24] to estimate the Model (1) parameters, the best unbiased linear estimator for β and the best unbiased linear predictor for u are obtained:
β ^ = X T V 1 X 1 X T V 1 y , u ^ = DZ T V 1 ( y X β ^ ) .
In the growth curve models, y i j is the j-th observation of the i-th material, j = 1 , , n i denote the number of measurements, and i = 1 , , n denote the number of materials; the growth curve model is given by
y i j ( t i j ) = β 0 + β 1 t i j + u 0 i + u 1 i t i j + ϵ i j .
where t i j indicates the time at which the data y i j were observed, and ϵ i j is the error associated with that observation. β 0 and β 1 are unknown parameters representing the intercept and slope of the model, respectively; u 0 i is the random effect associated with the intercept; and u 1 i is the random effect associated with the slope of the model (Appendix A.1). The fixed component of Model (2) is β 0 + β 1 t i j , and the random component is u 0 i + u 1 i t i j + ϵ i j . According to this model and considering four measurement dates (Table 2), the design matrices for each material i are of dimensions 4 × 2 and are given by
X i = Z i = 1 t i 1 1 t i 2 1 t i 3 1 t i 4 .
The fixed-effects parameter vector, the random-effects vectors, and the error vectors are given by
β = β 0 β 1 , u i = u 0 i u 1 i , ϵ i = ϵ i 1 ϵ i 2 ϵ i 3 ϵ i 4 .
Then, the model of growth curves for the i-th material is given by
y i = X i β + Z i u i + ϵ i , i = 1 , , 39 .
Random effects u i , i = 1 , , 39 are assumed to be independent and identically distributed, that is, u i N n i ( 0 , D ) , with
D = V a r ( u i ) = σ u 0 i 2 σ u 1 i u 0 i σ u 0 i u 1 i σ u 1 i 2 .
There are several covariance structures for D that provide a more accurate representation of the variability and correlation between observations within a group or subject [25] that enables more flexible and appropriate model fitting to longitudinal data.
The error vectors ϵ i , i = 1 , , 39 are also assumed to be independent and identically distributed, that is, ϵ i N n i ( 0 , R i ) , with
R i = C o v ( ϵ i ) = V a r ( ϵ i 1 ) C o v ( ϵ i 1 , ϵ i 2 ) C o v ( ϵ i 1 , ϵ i 3 ) C o v ( ϵ i 1 , ϵ i 4 ) C o v ( ϵ i 1 , ϵ i 2 ) V a r ( ϵ i 2 ) C o v ( ϵ i 2 , ϵ i 3 ) C o v ( ϵ i 2 , ϵ i 4 ) C o v ( ϵ i 1 , ϵ i 3 ) C o v ( ϵ i 3 , ϵ i 2 ) V a r ( ϵ i 3 ) C o v ( ϵ i 3 , ϵ i 4 ) C o v ( ϵ i 1 , ϵ i 4 ) C o v ( ϵ i 4 , ϵ i 2 ) C o v ( ϵ i 4 , ϵ i 3 ) V a r ( ϵ i 4 ) .
This matrix can be decomposed in the form
R i = σ 2 Λ i C i Λ i ,
where Λ i = d i a g ( λ 1 , , λ n i ) is a diagonal matrix with nonnegative diagonal elements for the i-th material, with λ j , j = 1 , , n i , representing functions of variance [25]; C i is a correlation matrix for the i-th material. This way, the presence of heteroscedasticity in the elements of the vector ϵ i is explained by Λ i , and C i explains the correlation between the observations of the i-th material. Some structures for C i include autoregressive processes of order 1 and autoregressive structures [26].
For the plant and ratoon crop cycles, Model (3) was used. For the plant cycle, the specification of R i assumes that C i = I n i × n i , and the elements of Λ i are given by λ j = | t i j | δ (varPower variance function), where δ is a constant; this variance function indicates that the magnitude of the variance error changes nonlinearly over time (Appendix A.2.1). Then, for the plant cycle
R i = σ 2 t i 1 2 δ 0 0 0 0 t i 2 2 δ 0 0 0 0 t i 3 2 δ 0 0 0 0 t i 4 2 δ ,
and
D = σ u 0 i 2 σ u 1 i u 0 i σ u 0 i u 1 i σ u 1 i 2 .
For the plant cycle, the intraclass correlation coefficient ρ I C C was calculated; it indicates the degree of correlation that exists between the observations of the same group. It is calculated using the following equation:
ρ I C C = σ ( u 0 , u 1 ) σ u 0 σ u 1 ,
where σ ( u 0 , u 1 ) is the covariance between the random effects of the intercept and slope, and σ u 0 and σ u 1 are the standard deviations of the random intercept and the random slope, respectively.
For the ratoon cycle, u 1 i = 0 i , i = 1 , 2 , , 39 . For the specification of R i , C i = I n i × n i was assumed, and the dependence of the elements of Λ i in time are given by λ j = δ S i j (varIdent variance function); this variance function is suitable when the data exhibit less variation within a group (Appendix A.2.2). We set δ S i j = 1 i and j; then, for the ratoon cycle,
R i = σ 2 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 .
If the growth curve model correctly explains the variability of the materials, then the assumptions of normality, homogeneity of variances, and independence in the residuals should be satisfactorily met. The Kolmogorov–Smirnov (normality), Bartlett (homogeneity of variances), and Box–Pierce (independence) statistical tests were used to verify the fulfillment of these assumptions.
The normality hypothesis was tested with the ks.test function, the independence hypothesis was tested with the Box.test function, and the homogeneity of variances was tested with the bartlett.test function; all three functions are available in the stats package [27].
The modeling of the growth curves was performed with the lme function of the nlme package [28].

2.2. Grouping Techniques

After performing the statistical modeling, an estimated growth curve was obtained for each of the 39 materials, capturing both the variability among materials and within each material over time. This estimated curve consists of an intercept value and a slope value. These two values represent the newly available information for each of the 39 materials and will be used for the clustering analysis. The maturity type of the 33 new materials is determined by observing the biological maturity of the commercial varieties present in each of the formed groups.
The clustering techniques of k-means, k-medoids, and DBSCAN were employed to classify genotypes with similar sucrose accumulation patterns. k-means was chosen for its efficiency in data partitioning [29], k-medoids for its robustness against noise and outliers [30], and DBSCAN for its ability to identify non-spherical clusters and detect outliers [31].
k-means and k-medoids are partition-based algorithms [32], whereas DBSCAN is a density-based method that is robust to outliers [33].

2.2.1. k-means

The k-means algorithm is a non-hierarchical clustering method used to analyze numerical data, with the goal of partitioning a data set into K groups, each represented by a centroid. The value of K must be defined in advance. The algorithm iteratively adjusts the position of the centroids in the object space, seeking the best grouping of similar individuals. To calculate the distance between objects and determine their similarity, several distance functions can be used, with Euclidean distance being one of the most commonly used, as it measures the shortest distance between two points [34,35,36].
For this research, the value of K was set to four, as four maturity types were considered for the 33 materials in the study. Additionally, an inflection point was identified using the elbow method. This analysis revealed that the optimal number of clusters was four, aligning with the expectation based on the four previously defined maturity types: early, early–intermediate, intermediate–late, and late. The metric used in k-means was Euclidean distance.

2.2.2. k-medoids

The k-medoids algorithm uses a partitioning method to group a set of n objects into K clusters. The process starts by setting the number of clusters and randomly selecting the initial centroids. Then, the distance of each object to the centroid of each cluster is calculated. Subsequently, an iteration is performed to adjust the centroids according to the assignment of the data [37]. For this method, the value of K was also equal to four to consider four degrees of maturity in the 33 new materials. The metric used was the Euclidean distance.

2.2.3. DBSCAN

DBSCAN is a density-based clustering algorithm that identifies clusters as dense regions separated by lower-density areas. It performs well with nonlinear data and distinguishes between high- and low-density clusters. The process starts by selecting an unvisited point; if its ε neighborhood contains enough points, a cluster is formed; otherwise, it is classified as noise. The key parameters are ε (maximum distance between neighbors) and η (minimum number of points to form a dense region) [38]. In this study, ε was determined using the kNN function from the dbscan package [39], and several values of η were tested.

2.3. Validation Indices

The evaluation of the clusters identified was carried out using the Silhouette index, the Dunn index, and the Connectivity index.
The Silhouette index measures the confidence with which an observation is assigned to a cluster, providing information about the cohesion and separation of the clusters. This index varies in the range from 1 to 1, where a value close to 1 indicates that the point has been correctly assigned to the cluster and is well separated from other clusters [40]. Values from 0.71 to 1 indicate a strong cluster structure has been found; from 0.51 to 0.70 , a reasonable structure; from 0.26 to 0.50 , the structure is weak; and less than or equal to 0.25 , no substantial structure [41]. Negative values have a similar interpretation and indicate that the object may be incorrectly assigned and is closer to points from another cluster [42]. The Silhouette index is given by
I S = 1 n k = 1 K i C k b ( i ) a ( i ) max { a ( i ) , b ( i ) }
where n is the total number of objects in the dataset, K is the number of clusters, C k is the set of objects in cluster k, a ( i ) is the average distance between object i and all other objects in the same cluster, b ( i ) is the average distance between object i and all objects in the nearest cluster, and max { a ( i ) , b ( i ) } is the maximum value of a ( i ) and b ( i ) .
The Dunn index is a measure of the quality of a partition in terms of the ratio between the minimum distance between cluster centroids and the maximum distance between points and their respective centroids [43]. This index is given by
I D = min k = 1 , , K d C k , C 0 max x i Ω d x i , c k
where K is the number of clusters, min k = 1 , , K d ( C k , C 0 ) is the minimum of the distances between the centroids of each cluster C k and the global center C 0 , and max x i Ω d ( x i , c k ) is the maximum distance between each point x i in the dataset Ω and its respective centroid C k in its corresponding cluster. The minimum distance is a measure of the internal coherence of the clusters, and the maximum distance is a measure of the separation between clusters.
The Connectivity index is calculated by constructing a neighbor matrix. Define n n i ( j ) as the j-th nearest neighbor of observation i, and let x i , n n i ( j ) be zero if i and n n i ( j ) are in the same cluster and 1 / j otherwise. The Connectivity index has a value between 0 and and should be minimized [44]. This index is given by
I C = i = 1 n j = 1 L x i , n n i ( j )
where n is the total number of objects in the dataset, and L is a parameter that determines the number of neighbors that contribute to the connectivity measure.
The quality of the clustering was determined with the Silhouette (Silhouette function of the package [45]), Dunn (Dunn function of the package [44]) and Connectivity (Connectivity function of the package [44]) indices.

3. Results

3.1. Plant Cycle

To effectively model the heteroscedasticity observed in the data, the varPower function was implemented, setting the δ parameter to 0.2 , which allowed for a better fit of the model to the variability of the data. After that, the fitted model for the plant cycle satisfies the assumptions of normality (Kolmogorov–Smirnov, p-value = 0.85 ), homogeneity of variance (Bartlett, p-value = 0.34 ), and independence (Box–Pierce, p-value = 0.23 ).
Table 4 presents the parameter estimates of the fitted model (Model (3)), along with their respective 95% confidence interval (CI), showing both fixed and random effects. The fixed effects include the estimated parameters β ^ 0 and β ^ 1 , both significant, which represent the intercept and slope, respectively. The estimated intercept, β ^ 0 = 6.37 , reflects that all materials have an average SP value of 6.37 when time is zero. The slope indicates the change in SP for each unit change in time; thus, a value of 1.50 signifies that, on average, SP increases by 1.50 units for each additional unit of time, which according to Table 2, on average, is 31 days.
The intraclass correlation coefficient, ρ I C C = 0.89 (Table 4), indicates the degree of correlation that exists between the observations of the same group.
Since both random effects were statistically significant, specific estimated intercepts and slopes were obtained for each of the 39 materials from Las Pilas (Table 5). The variety COLMEX 94-8, which is known to have an early maturity, has an estimated intercept of 8.97 and an estimated slope of 1.22 ; similar values are observed for the variety RB 85-5113, which has an estimated intercept of 8.23 and an estimated slope of 1.23 . The variety MEX 69-290, which is known to have an intermediate–late maturity, has an estimated intercept of 5.67 and an estimated slope of 1.82 . From the estimations of all materials, it can be deduced that materials with an early maturity tend to have a higher intercept. Since both the intercept and the slope are random, their values vary depending on the specific sugarcane material being evaluated. Thus, materials that have a late maturity tend to have a smaller intercept and a larger slope.
Figure 1 presents the estimates of the intercept and slope for each material, where a high intercept indicates an early accumulation of sucrose (typical of early materials), and a steep slope reflects a higher accumulation rate over time (characteristic of late materials). The red color in the intercept indicates materials with a higher intercept value, while the blue color represents those with a lower intercept. Similarly, color coding is applied to the slope: blue highlights materials with a steeper slope, while red indicates materials with a gentler slope. This color scheme provides a clear way to identify which materials are positioned above or below others based on these parameters. Together, this explains the behavior of the materials according to their respective fitted models. This is the graphical representation of Table 5.

Clustering Materials in the Plant Cycle

Clustering was performed using the intercept and slope estimates of the estimated maturity curve (Table 5). Since sugarcane materials can express any of the four types of maturity in the sugar industry, early, early–intermediate, intermediate–late, and late, the number of groups was set at 4.
The clustering of the materials using the k-means algorithm with K = 4 is shown in Figure 2a and Table 6. The early-maturity group has four early-maturing commercial varieties—ITV 92-1424, RB 85-5113, CP 72-2086, and COLMEX 95-27—and ten new materials. The early–intermediate-maturity group contains four new materials and variety COLMEX 94-8. The group designated with intermediate–late maturity comprises variety MEX 69-290, which is known to exhibit intermediate–late maturity and 11 new materials. The l-te maturity group has eight new materials.
The validation indices for clustering the materials with k-means are I S = 0.63 , I D = 0.12 , and I C = 13.41 . The value of I S = 0.63 indicates that the grouping of materials shows moderately good cohesion and appropriate separation between the clusters. The value of I D = 0.12 indicates relatively low separation between clusters and a potential for improvement in cohesion within the clusters. Meanwhile, the value of I C = 13.41 indicates good separation between clusters and appropriate compactness within the clusters. Therefore, two out of three indexes indicate that the k-means clustering is good.
The clustering of the materials using the k-medoids algorithm with K = 4 is shown in Figure 2b and Table 6. The early-maturity group has 12 materials, including commercial varieties ITV 92-1424, RB 85-5113, CP 72-2086, and COLMEX 94-8, all of which are reported to have an early maturity. The early–intermediate-maturity group has seven new materials. The intermediate–late-maturity group has 11 as it incorporates variety MEX 69-290, which has intermediate–late maturity. The late-maturity group contains eight new materials.
The validation indices for clustering the materials with k-medoids clustering are I S = 0.55 , I D = 0.12 , and I C = 13.85 . The value of I S = 0.55 indicates that the clustering exhibits moderate cohesion and appropriate separation among the groups, but there is still room for improvement in terms of group separation. The value of I D = 0.12 indicates relatively low separation between clusters and a potential for improvement in cohesion within the clusters. The value of I C = 13.85 indicates good separation between the groups and appropriate compactness within the clusters.
For the DBSCAN clustering algorithm with ϵ = 0.5 and a η = 2 , we have K = 4 groups with 15, 5, 17, and 2 materials, as shown in Figure 2c and Table 6. The early group consists of 11 new materials and 4 commercial varieties: ITV 92-1434, RB 85-5113, CP 72-2086, and COLMEX 95-27. The early–intermediate group includes four materials and commercial variety COLMEX 94-8. The intermediate–late group comprises 16 new materials, along with commercial variety MEX 69-290. Lastly, the late group consists of two new materials, ITSAMEX 07-7501 and ITSAMEX 07-121115.
The validation indices for clustering the materials with DBSCAN are I S = 0.64 , I D = 0.24 , and I C = 12.36 . The values of I S = 0.64 and I D = 0.24 indicate that the clustering exhibits moderately good cohesion and adequate separation between the groups. The value of I C = 12.36 suggests reasonable connectivity between the groups, implying that there is internal connectedness within each group and good relationships between observations in each group. Thus, it can be concluded that the clustering performed with DBSCAN has achieved satisfactory separation and cohesion among the groups, along with suitable connectivity.
The k-means and DBSCAN algorithms are more similar to each other than to k-medoids, as they coincide in the maturity type assignment for 32 materials, while k-means and k-medoids coincide for 26, and k-medoids and DBSCAN for 20. According to the index values (Table 7), DBSCAN is the most suitable method for characterizing the 33 materials by maturity, and its use should be considered in the adaptation phase of the plant cycle. Thus, given the grouping by DBSCAN, it is up to the researcher to choose the materials that will pass to the next phase, given the type of maturity they present.
A graphical representation of Table 6 is shown in Figure 3.
The heatmap displays the classification of the 39 sugarcane materials according to three clustering algorithms, based on the four defined maturity types (early (1), early–intermediate (2), intermediate–late (3), late (4)). The color distribution allows for visualizing the correspondence between the methods used, highlighting a general consistency in the assignment of materials to the different maturity groups. This coherence between the algorithms suggests the stability and robustness of the classification patterns obtained, as shown in Figure 2.

3.2. Ratoon Cycle

The model satisfies the assumptions of normality (Kolmogorov–Smirnov, p-value = 0.85), homogeneity of variances (Bartlett, p-value = 0.34), and independence (Box–Pierce, p-value = 0.23).
For this cycle, incorporating only the random intercept was sufficient to improve the model fit (Model 3), which allowed for capturing the variation in the intercept, indicating notable differences in the initial SP among the different genotypes. Table 8 shows that both the intercept and the slope of the fitted model for the ratoon cycle are significant, with narrow confidence intervals, indicating high precision in the estimates. The significant variability among the subject intercepts ( σ ^ u 0 = 1.18 ) and the moderate residual variability ( σ ^ e = 1.05 ) suggest that the model adequately captures the variation present in the data.
Since only the intercept has a statistically significant random effect, specific estimated intercepts were obtained for each of the 39 materials from Las Pilas (Table 9). Similarly to in the plant cycle, a higher intercept is observed in varieties known to have an early maturity: ITV 92-1424 had an intercept of 8.30 , CP 72-2086 had an intercept of 9.35 , and COLMEX 95-27 had an intercept of 9.37 . Higher intercepts were also observed for the early–intermediate-maturity variety: COLMEX 94-8 had an intercept of 9.20 . A lower intercept was observed in the MEX 69-290 variety, which had an intercept of 7.16 .
The estimates of the models for the plant and ratoon cycles show some differences in terms of the estimated intercept. For the early-maturing varieties, the following results are obtained: the ITV 92-1434 variety had an SP of 8.39 in the plant cycle and an SP of 8.30 in the ratoon cycle, a difference of 0.09 units; the CP 72-2086 variety had an SP of 7.92 in the plant cycle and an SP of 9.35 in the ratoon cycle, a difference of 1.43 units; and the COLMEX 95-27 variety had an SP of 7.56 in the plant cycle and an SP of 9.37 in the ratoon cycle, a difference of 1.87 units. This indicates that the three early-maturing varieties showed higher sucrose percentage yields in the ratoon cycle. The early–intermediate-maturity varieties showed the following results; the COLMEX 94-8 variety had an SP of 8.97 in the plant cycle and an SP of 9.20 in the ratoon cycle, a difference of 0.23 units, and the RB 85-5113 variety had an SP of 8.23 in the plant cycle and an SP of 8.30 in the ratoon cycle, a difference of 0.07 units. Finally, the late–intermediate-maturity variety, MEX 69-290, presented an SP of 5.67 in the plant cycle and an SP of 7.16 in the ratoon cycle, a difference of 1.49 units. This variety also showed higher yields in sucrose percentage in the ratoon cycle.
An illustration of the behavior of the materials from Table 9, based on their estimated parameters, is shown in Figure 4. It can be observed that the slope remains constant for all 39 materials, while the intercept varies from one material to another. Furthermore, it is clearly noticeable that some materials exhibit a significant difference in their initial value (intercept). The red color indicates materials with a higher intercept, reflecting a greater initial accumulation. In contrast, the blue color highlights materials with a lower intercept, indicating a lower initial accumulation. This color coding facilitates the interpretation of material behavior based on the estimated parameter.

Clustering Materials in the Ratoon Cycle

The clustering of materials using the k-means algorithm is shown in Figure 5a and Table 10. The early group consists of five new materials and five commercial varieties: ITV 92-1424, RB 85-5113, CP 72-2086, COLMEX 95-27 and COLMEX 94-8. The early–intermediate group is composed of 14 materials, including the MEX 69-290 variety. Despite being identified as intermediate–late in the plant cycle, the MEX 69-290 variety was assigned to the early–intermediate cluster in this cycle due to its higher SP in the ratoon cycle. Regarding the absence of the COLMEX 94-8 variety in the early–intermediate-maturity group, its maturity is determined based on the average SP value that the hybrids in this group have ( 7.44 % ). We have a group of 14 materials that are of intermediate–late maturity. Their maturity is inferred from the average value ( 6.42 % ) of sucrose that the hybrids exhibit. Finally, the late group contains only ITSAMEX 07-4954 material.
According to the Silhouette index ( I S = 0.56 ), the clustering performed with the k-means algorithm indicates moderate cohesion and satisfactory separation between the clusters. However, it is observed that the separation between groups could be improved, as the Dunn index was low ( I D = 0.16 ). Finally, according to the Connectivity index ( I C = 12.08 ), the groups are well defined and internally connected.
The clustering of materials using the k-medoids algorithm is shown in Figure 5b and Table 10. The early group is composed of eight materials, including commercial varieties ITV 92-1434 and RB 85-5113. The early–intermediate group contains three new hybrids and commercial varieties CP 72-2086, COLMEX 94-8, and COLMEX 95-27. Similarly, the intermediate–late group consists of commercial variety Mex 69-290 and ten new hybrids, while the late group contains 14 hybrids.
These groups had clustering validation indices with the following values: I S = 0.59 , I D = 0.06 , and I C = 13.68 . The results indicate that the clustering carried out through the k-medoids algorithm forms groups in which there are similar characteristics among the materials within the same group in terms of β 0 and β 1 . The groups show good cohesion and appropriate separation.
For this cycle, the DBSCAN algorithm with ϵ = 0.24 and η = 2 identified four groups with 24, 8, 4, and 2 materials. In this cycle, one material was left out of the four groups. In Figure 5c and Table 10, some groups are more compact, with a low point density (early–intermediate and late groups), while others are more dispersed, with a high point density (intermediate–late and early groups). Thus, the early-maturity group consists of commercial varieties ITV 92-1434 and RB 85-5113 and six new materials. The early–intermediate-maturity group contains commercial varieties CP 72-2086, COLMEX 94-8, and COLMEX 95-27 and one new hybrid. The intermediate–late-maturity group includes 23 new materials and the Mex 69-290 variety. Lastly, the late group contains two new materials. Finally, the DBSCAN algorithm did not assign hybrid ITSAMEX 07-4954 to any group. That is, according to the algorithm, this material presents a maturity type different from the four established in the analysis. For this clustering, the validation indices were I S = 0.64 , I D = 0.14 , and I C = 16.11 . These values indicate good cohesion and appropriate separation between the groups. The separation between the groups is reflected in the high value of the Dunn index. These results suggest that the formed groups represent different maturity types of sugarcane and exhibit distinctive characteristics in terms of their density.
Table 10 presents a comparison of the clustering algorithms for the ratoon cycle. It can be observed that the k-medoids and DBSCAN algorithms have more coincidences in the assigned maturity type (23 materials); k-means and k-medoids share five assignments, and k-means and DBSCAN have 17 shared assignments. The intermediate–late and early–intermediate groups have more materials according to the k-means algorithm, while according to the k-medoids algorithm, the late and intermediate–late groups have more assigned materials. Finally, the intermediate–late and early groups have more assigned materials according to the DBSCAN algorithm.
Similarly to the previous graph, Figure 6 is a graphical representation of Table 10, complemented by Figure 5. In both cases, the material that was not classified into any group can be observed. Furthermore, the color distribution, which intuitively indicates the maturity type of each material, is clearly depicted.
The k-means algorithm achieved the best values for the I D (highest) and I C (lowest) indices, while DBSCAN obtained the best value for the I S index (Table 11). Therefore, k-means is the most suitable method for clustering materials in the ratoon cycle, allowing for the selection of materials that will advance to the next evaluation phase.

4. Discussion

Improving sugarcane varieties with high yield and higher sucrose content is essential for the sugar industry. Previous studies, such as [46,47], have emphasized the importance of these goals. The present research analyzes how sucrose accumulation varies among different hybrids and varieties, and how this variability is observable in both the plant cycle and the ratoon cycle.
Variations in sucrose accumulation depend on genetic, soil, climatic, and management factors [48,49]. Ref. [50] observed higher sucrose yields in the plant cycle, with decreases in the ratoon and second ratoon cycles, highlighting the importance of agronomic management and soil conditions.
The selection of adapted varieties and agronomic management are key to optimizing sucrose production [51]. The development of new varieties is complex and requires multiple evaluations before commercialization. Growth curve models can model maturity in the adaptability phase with at least four measurements per material, explaining their variability over time.
The Silhouette index exceeded 0.5 for all three algorithms evaluated, surpassing the 0.34 reported in [10] with k-means. However, the Connectivity and Dunn indices indicated that DBSCAN was the best method for the plant cycle, while k-means was the most appropriate for the ratoon cycle, aligning with its recognition in clustering [52].
DBSCAN correctly identified the maturity of the commercial varieties in the plant cycle, validating the estimation of their maturity curve and the methodology for characterizing the 33 hybrids. In the ratoon cycle, k-means also correctly identified maturity, ensuring proper characterization.
Some materials changed classification between cycles due to variations in sucrose caused by environmental factors, agronomic management, and algorithm differences. This underscores the importance of management and harvest timing to maximize sucrose content [53], demonstrating the usefulness of these methods in characterizing materials based on maturity.
Different maturity types impact crop yield and performance by influencing sucrose accumulation, the optimal harvest time, and industrial processing efficiency [54,55]. Early-maturing varieties allow for an earlier harvest and resource optimization, while intermediate- and late-maturing varieties can achieve higher yields over time, benefiting logistical planning and preventing processing plant overload. Thus, maturity classification not only enhances the understanding of the behavior of the evaluated materials but also has practical implications for agricultural and industrial management.
These results are crucial for identifying genetically adapted materials and developing varieties with high sucrose content [56].
This study opens up various opportunities to extend the application of statistical models to future research. In addition to sugarcane, mixed models that incorporate the interaction between genotype and environment could be applied to a wide range of agricultural crops, such as sugar beet, sweet sorghum, forage maize, rice, soybeans, and oilseed crops like sunflower and canola. These models can be useful not only for studying variations in sugar production but also other variables such as grain yield, water use efficiency, disease resistance, and crop quality, all of which are influenced by environmental conditions.
Furthermore, applying these models in different geographical regions with varying climatic conditions and soil types can provide a more precise understanding of how environmental factors impact crop performance and quality. For instance, in tropical or subtropical areas, the models could offer insights into how crops respond to consistent temperature and humidity conditions, while in more arid or temperate zones, they could analyze how seasonal fluctuations, water availability, or droughts affect performance. The ability of mixed models to capture the spatial and temporal variability of data would be particularly useful in tailoring agricultural practices to local environments, enabling farmers to make more informed decisions to optimize resource use.
In conjunction with the inclusion of additional variables, such as fertilizer use, pest control, crop rotation, and soil management practices, these models could provide a more comprehensive view of the factors affecting agricultural productivity. This multifaceted approach could improve sustainability, efficiency, and profitability in the production of a wide variety of crops, contributing to agriculture that is better adapted to local conditions and more resilient to climate change.

5. Conclusions

The sucrose accumulation curves of 33 new sugarcane hybrids were analyzed using growth curve models, enabling the modeling of their variability over time and sucrose yield across two cycles. Based on the estimated intercept and slope, the materials were characterized and grouped according to their maturity type (early, early–intermediate, intermediate–late, and late), facilitating the selection of the most promising hybrids for the next evaluation stage.
This characterization in the adaptability testing phase is valuable for plant breeders and geneticists in the search for new commercial sugarcane varieties. The methodology employed can serve as a reference for studying and selecting genotypes in advanced selection stages, optimizing agricultural management and harvest planning.
Growth curve models, combined with k-means, k-medoids, and DBSCAN clustering algorithms, allowed for the classification of genotypes based on their agronomic and industrial performance, contributing to the selection of more productive varieties. In the agro-industrial phase, these materials can be further evaluated to confirm their stability under different environmental conditions.

Author Contributions

Conceptualization, M.G.M. and C.D.C.M.; methodology, M.G.M., C.D.C.M. and F.G.-J.; software, C.D.C.M., M.G.M. and F.G.-J.; validation, M.G.M., C.D.C.M., J.C.G.-P., E.P.A. and R.R.C.; formal analysis, M.G.M., C.D.C.M. and J.C.G.-P.; investigation, J.C.G.-P.; resources, J.C.G.-P.; data curation, C.D.C.M., J.T.S. and J.C.G.-P.; writing—original draft preparation, C.D.C.M., M.G.M. and F.G.-J.; writing—review and editing, M.G.M., C.D.C.M., F.G.-J., J.C.G.-P., J.T.S. and R.R.C.; visualization, C.D.C.M., M.G.M., E.P.A. and F.G.-J.; supervision, M.G.M., J.C.G.-P., J.T.S. and F.G.-J.; project administration, J.C.G.-P.; funding acquisition, J.C.G.-P. All authors have read and agreed to the published version of the manuscript.

Funding

The project entitled “Agro-industrial selection and transfer of new sugarcane varieties for the state of Jalisco”, led by García Preciado José Concepción, was carried out from 30 March 2010, to 31 March 2013. It was funded by Fundación Produce de Jalisco, while land availability and technical monitoring of the research were supported by the Sugarcane Production and Quality Committees of the Melchor Ocampo Sugar Mill and INIFAP. The project was conducted at the Tecomán Experimental Station, within the agricultural subsector and at the state level, and was classified as applied research. Currently, the project is listed as inactive.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available datasets were analyzed in this study. These data can be found at https://github.com/arimagm/New-Sugarcane-Hybrids (accessed on 2 September 2024).

Acknowledgments

The authors express their thanks to the technical field department of Ingenio Melchor Ocampo S.A. de C.V., to the engineer Filemón Zavalza Garcia, and to the sugarcane production and quality committee of the mentioned mill, as well as special thanks to Gabriel Ricardo Blackaller Ayala and Jesús Zúniga Mendoza.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. Growth Curve Model

The model y i j ( t i j ) = β 0 + β 1 t i j + u 0 i + u 1 i t i j + ϵ i j analyzes how the observations of a set of materials vary over time, considering not only the effect of time but also differences among materials (genotypes). We use a random-effects structure to account for individual differences between genotypes and how these affect both the initial value ( u 0 i , i.e., the starting point of each genotype), and the rate at which observations change over time ( u 1 i , the variation in the slope of each genotype).
In this model, β 0 represents the average response value at t i j = 0 for the entire population, while u 0 i captures the specific deviation in each genotype from this value. Similarly, β 1 represents the average rate of change over time, whereas u 1 i describes how this rate varies among genotypes. Thus, the combination β 0 + u 0 i defines the actual intercept for each genotype, and β 1 + u 1 i determines its specific slope. This approach captures both the general trend of observations over time and the individual variations in each genotype, providing a more precise evaluation of how time and genotype influence crop measurements.

Appendix A.2. Variance Structure

In this study, two variance structures were used to model how errors vary across time for each hybrid. These structures help us understand how errors in the measurements may be correlated between each other and over time.

Appendix A.2.1. Power Variance Model (varPower)

The structure varPower is used to model a nonlinear relationship between errors and time. The variance structure is given by
R i = σ 2 t i 1 2 δ 0 0 0 0 t i 2 2 δ 0 0 0 0 t i 3 2 δ 0 0 0 0 t i 4 2 δ
  • σ 2 : This is the common variance of the errors.
  • t i j : This represents the time at which the measurement is taken for hybrid i.
  • δ : This is the parameter controlling the relationship between the errors and time. If δ is 0.5, the relationship is a square root in time, while if it is 1, the relationship is linear. With δ = 0.2 , the errors increase with time, but more gradually compared to higher values of δ .
This model allows for errors to either increase or decrease over time, depending on the value of δ . For example, if δ = 0.2 , the errors still increase with time, but at a slower rate compared to when δ = 0.5 or δ = 1 .
This model allows for errors to either increase or decrease over time, depending on the value of δ .

Appendix A.2.2. Identity Variance Model (varIdent)

The structure varIdent is simpler and assumes that the errors have the same variance over time. The variance structure for this model is
R i = σ 2 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1
  • σ 2 : As in the previous model, this is the common variance of the errors.
  • 1: This indicates that the variance of the errors is constant across all time points.

References

  1. Alejandre Rosas, J.A.; Galindo Tovar, M.E.; Lee Espinosa, H.E.; Alvarado Gómez, O.G. Variabilidad genética en 22 variedades híbridas de caña de azúcar (Saccharum spp. Híbrido). Phyton (Buenos Aires) 2010, 79, 87–94. Available online: https://revistaphyton.fund-romuloraggio.org.ar/vol79/Alejandre-Rosas.pdf (accessed on 23 May 2024).
  2. Arntzen, C.J.; Ritter, E.M. Encyclopedia of Agricultural Science Volume 2: EL, 1st ed.; Academic Press: San Diego, CA, USA, 1994; Available online: https://www.cabidigitallibrary.org/doi/full/10.5555/19951403738 (accessed on 4 January 2023).
  3. CONADESUCA. Nota Informativa Sobre Innovaciones en Materia de Productividad del sector. Nuevas Variedades de Caña de Azúcar; Sagarpa: Ciudad de México, México, 2016; Available online: https://www.gob.mx/cms/uploads/attachment/file/136406/NotaNuevasVariedadesd_Cana_deAzucar.compressed.pdf (accessed on 22 February 2023).
  4. Que, Y.; Wu, Q.; Zhang, H.; Luo, J.; Zhang, Y. Developing new sugarcane varieties suitable for mechanized production in China: Principles, strategies and prospects. Front. Plant Sci. 2024, 14, 1337144. [Google Scholar] [CrossRef] [PubMed]
  5. Sentíes-Herrera, H.E.; Valdez-Balero, A.; Loyo-Joachin, R.; Gómez-Merino, F.C. Fases experimentales en el mejoramiento genético de la caña de azúcar (Saccharum spp.) En México. AgroProductividad 2017, 10, 93–99. Available online: https://go.gale.com/ps/i.do?p=IFME&u=anon~3406d702&id=GALE|A530914354&v=2.1&it=r&sid=googleScholar&asid=76e56977 (accessed on 15 January 2024).
  6. García-Preciado, J.C. Evaluación de variables de calidad en híbridos de Saccharum Spp. En Difer. Ambient. Agroecol. De Jalisco, México. AgroProductividad 2017, 10. Available online: https://www.revista-agroproductividad.org/index.php/agroproductividad/article/view/55 (accessed on 16 April 2023).
  7. Rodríguez Gross, R.; Puchades Izaguirre, Y.; Abiche Maceo, W.; Rill Martínez, S.; Suarez, H.J.; Salmón Cuspineda, Y.; Gálvez, G. Estudio del rendimiento y modelación del período de madurez en nuevos cultivares de caña de azúcar. Cultiv. Trop. 2015, 36, 134–143. Available online: http://scielo.sld.cu/scielo.php?pid=S0258-59362015000400019&script=sci_arttext (accessed on 25 August 2023).
  8. Hernández, O.L.; García, S.S.; Nataren, E.H.; Espinoza, L.C.L.; Oliva, A.C.; Sanchez, S.C.; Romero, E.R.; Zossi, S. La espectroscopía de infrarrojo cercano (NIRS) en el seguimiento de la madurez del cultivo de la caña de azúcar (Saccharum spp.). Agro Product. 2019, 12, 107–113. [Google Scholar] [CrossRef]
  9. Larrahondo, J.E.; Cassalett, C.; Torres, J.; Issacs, C. El cultivo de la caña de azúcar en la zona azucarera de Colombia. In Centro de Investigación de la Caña de Azúcar de Colombia (Cenicaña); 1995; pp. 337–354. Available online: https://www.cenicana.org/pdf_privado/documentos_no_seriados/libro_el_cultivo_cana/libro_p3-394.pdf (accessed on 19 June 2023).
  10. Ostengo, S.; Rueda Calderón, M.A.; Bruno, C.; Cuenya, M.I.; Balzarini, M. A protocol for identifying characteristic sucrose accumulation curves of sugarcane genotypes (Saccharum spp.). Sugar Tech 2021, 23, 519–523. [Google Scholar] [CrossRef]
  11. Palmito dos Santos, D.; Nascimento, M.; Ferreira, D.F.; Verardi, C.K. Optimal Designs in Plant Breeding Experiments: A Simulation Study Comparing Grid-Plot and Partially Replicated (p-Rep) Design. Sugar Tech 2024, 26, 387–395. [Google Scholar] [CrossRef]
  12. Crossa, J.; Pérez-Rodríguez, P.; Jarquin, D.; Cuevas, J.; Montesinos-López, O. Genome and Environment Based Prediction Models and Methods of Complex Traits Incorporating Genotype × Environment Interaction. In Genomic Prediction of Complex Traits; Ahmadi, N., Bartholomé, J., Eds.; Methods and Protocols; Humana: New York, NY, USA, 2022; pp. 245–283. [Google Scholar] [CrossRef]
  13. Smith, A.B.; Cullis, B.R.; Thompson, R. The analysis of crop cultivar breeding and evaluation trials: An overview of current mixed model approaches. J. Agric. Sci. 2005, 143, 449–462. [Google Scholar] [CrossRef]
  14. Jarquin, D.; Crossa, J.; Lacaze, X.; Cheyron, P.; Pérez-Rodríguez, P. A Reaction Norm Model for Genomic Selection Using High-Dimensional Genomic and Environmental Data. Theor. Appl. Genet. 2014, 127, 595–607. [Google Scholar] [CrossRef]
  15. Cuevas, J.; Crossa, J.; Montesinos-López, O.; Burgueño, J.; Pérez-Rodríguez, P.; de los Campos, G. Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models. G3 Genes Genomes Genet. 2017, 7, 41–53. [Google Scholar] [CrossRef] [PubMed]
  16. Burgueño, J.; Crossa, J.; Cotes, J.M.; Vicente, F.S.; Dasgupta, S. Genomic Prediction of Breeding Values When Modeling Genotype × Environment Interaction Using Pedigree and Dense Molecular Markers. Crop Sci. 2012, 52, 707–719. [Google Scholar] [CrossRef]
  17. Gilbert, R.A.; Shine, J.M., Jr.; Miller, J.D.; Rice, R.W. Sucrose accumulation and harvest schedule recommendations for CP sugarcane cultivars. Crop Manag. 2004, 3, 1–7. [Google Scholar] [CrossRef]
  18. Seta, P.T.; Hartomo, K.D. Mapping land suitability for sugar cane production using k-means algorithm with leaflets library to support food sovereignty in central java. Khazanah Inform. J. Ilmu Komput. Dan Inform. 2020, 6, 15–25. [Google Scholar] [CrossRef]
  19. Mbukwa, J.N.; Anjaneyulu, G.V.S.R. Application of K-Means and Partitioning Around Mediods (PAM) clustering techniques on Maize and Beans yield in Tanzania. KY Publ. 2016, pp. 146–158. Available online: http://bomsr.com/4.4.16/146-158%20JUSTINE%20NKUNDWE%20MBUKWA.pdf (accessed on 9 June 2024).
  20. Cervantes-Preciado, J.F.; Milanés-Ramos, N.; Castillo, M.A. Evaluation of 11 hybrids of sugar cane (Saccharum spp.) in the Central region of Veracruz, México. Agroproductividad 2019, 12, 69–73. Available online: https://www.cabidigitallibrary.org/doi/full/10.5555/20203314828 (accessed on 25 August 2023). [CrossRef]
  21. Moreno Torres, V.M. Nuevas Variedades de Caña de Azúcar; Deschamps, L., Ed.; Deschamps & Escamilla: Colima, México, 2010; Fundación Produce Colima A.C., Instituto Interamericano de Cooperación para la Agricultura (IICA); pp. 1–59. [Google Scholar] [CrossRef]
  22. Villa Godoy, S.; De Teresa y Polignac, F. Manual Azucarero Mexicano, Edición 2024; Cía. Editora del Manual Azucarero, S.A. de C.V.: Ciudad de México, México, 2024; pp. 1–500. Available online: https://www.manualazucarero.com/_files/ugd/fc2095_b4e9661cd898438aadc336a83992d281.pdf (accessed on 12 November 2023).
  23. Fernandes, A.M.; de Queiroz, A.C.; Pereira, J.C.; Lana, R.d.P.; Barbosa, M.H.P.; da Fonseca, D.M.; Detmann, E.; Cabral, L.d.S.; Pereira, E.S.; Vittori, A. Composição químico-bromatológica de variedades de cana-de-açúcar (Saccharum spp L.) com diferentes ciclos de produção (precoce e intermediário) em três idades de corte. Rev. Bras. De Zootec. 2003, 32, 977–985. [Google Scholar] [CrossRef]
  24. McCulloch, C.E.; Searle, S.R. Generalized, Linear, and Mixed Models, 1st ed.; Wiley-Interscience: Hoboken, NJ, USA, 2001. [Google Scholar] [CrossRef]
  25. Gałecki, A.; Burzykowski, T. Linear Mixed-Effects Models Using R, 1st ed.; Springer: New York, NY, USA, 2013; pp. 245–273. [Google Scholar] [CrossRef]
  26. Aguirre-Calderón, O.A. ¿Cómo corregir la heterocedasticidad y autocorrelación de residuales en modelos de ahusamiento y crecimiento en altura? Rev. Mex. De Cienc. For. 2018, 9, 28–59. [Google Scholar] [CrossRef]
  27. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org/ (accessed on 18 July 2024).
  28. Pinheiro, J.; Bates, D.; R Core Team. nlme: Linear and Nonlinear Mixed Effects Models, R package version 3; R Foundation for Statistical Computing: Viena, Austria, 2023; pp. 1–163. Available online: https://CRAN.R-project.org/package=nlme (accessed on 3 February 2024).
  29. Zhang, L.; Lu, F.; Liu, A.; Guo, P.; Liu, C. Application of K-means clustering algorithm for classification of NBA guards. Int. J. Sci. Eng. Appl. 2016, 5, 1–6. Available online: https://ijsea.com/archive/volume5/volume5issue1.pdf (accessed on 13 October 2024). [CrossRef]
  30. Kaur, N.K.; Kaur, U.; Singh, D. K-Medoid clustering algorithm—A review. Int. J. Comput. Appl. Technol. 2014, 1, 42–45. [Google Scholar]
  31. Ester, M.; Kriegel, H.-P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, OR, USA, 2–4 August 1996; Volume 96, pp. 226–231. Available online: https://cdn.aaai.org/KDD/1996/KDD96-037.pdf?source=post_page (accessed on 30 November 2024).
  32. Zhang, C.; Huang, W.; Niu, T.; Liu, Z.; Li, G.; Cao, D. Review of Clustering Technology and Its Application in Coordinating Vehicle Subsystems. Automot. Innov. 2023, 6, 89–115. [Google Scholar] [CrossRef]
  33. Khan, M.M.R.; Siddique, M.A.B.; Arif, R.B.; Oishe, M.R. ADBSCAN: Adaptive density-based spatial clustering of applications with noise for identifying clusters with varying densities. In Proceedings of the 2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT), Dhaka, Bangladesh, 13–15 September 2018; pp. 107–111. [Google Scholar] [CrossRef]
  34. Franco-Árcega, A.; Sobrevilla-Sólis, V.I.; Gutiérrez-Sánchez, M.J.; García-Islas, L.H.; Suárez-Navarrete, A.; Rueda-Soriano, E. Sistema de enseñanza para la técnica de agrupamiento k-means. Pädi Boletín Científico De Cienc. Básicas E Ing. Del ICBI 2021, 9, 53–58. [Google Scholar] [CrossRef]
  35. Zhang, Q.; Chen, Q.; Xu, L.; Xu, X.; Liang, Z. Wheat Lodging Direction Detection for Combine Harvesters Based on Improved K-Means and Bag of Visual Words. Agronomy 2023, 13, 2227. [Google Scholar] [CrossRef]
  36. Gupta, M.K.; Chandra, P. Effects of similarity/distance metrics on k-means algorithm with respect to its applications in IoT and multimedia: A review. Multimed. Tools Appl. 2022, 81, 37007–37032. [Google Scholar] [CrossRef]
  37. Juliantho, D.A.; Hendrik, B. Komparasi Algoritma K-Means Dan K-Medoids Dalam Clustering Penyebaran Kasus Covid 19. J. Inf. Syst. Educ. Dev. 2023, 1, 30–32. Available online: https://journal.mwsfoundation.or.id/index.php/jised/article/view/12 (accessed on 7 December 2024).
  38. Ansari, Z. Discovery of web user session clusters using DBSCAN and leader clustering techniques. Int. J. Res. Appl. Sci. Eng. Technol. (iJRASET) 2014, 2, 209–217. [Google Scholar]
  39. Hahsler, M.; Piekenbrock, M. dbscan: Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Related Algorithms, R package version 1.2-0; R Foundation for Statistical Computing: Vienna, Austria, 2024. Available online: https://CRAN.R-project.org/package=dbscan (accessed on 30 October 2023).
  40. Ben Ncir, C.-E.; Hamza, A.; Bouaguel, W. Parallel and scalable Dunn Index for the validation of big data clusters. Parallel Comput. 2021, 102, 102751. [Google Scholar] [CrossRef]
  41. Kaufman, L.; Rousseeuw, P.J. Finding Groups in Data: An Introduction to Cluster Analysis; illustrated, revised, reprint ed.; John Wiley & Sons: Hoboken, NJ, USA, 2009; Available online: https://books.google.es/books?hl=es&lr=&id=YeFQHiikNo0C&oi=fnd&pg=PR11&dq=Finding+groups+in+data:+an+introduction+to+cluster+analysis&ots=5CofD1KFAE&sig=CaZq_JCe5uV8RB_ymUHSuad8bKQ#v=onepage&q=Finding%20groups%20in%20data:%20an%20introduction%20to%20cluster%20analysis&f=false (accessed on 10 November 2024).
  42. Videla, M.E.; Bruno, C. Validación de agrupamientos para representar estructura genética poblacional. Agriscientia 2022, 39, 1–10. [Google Scholar] [CrossRef]
  43. Luna-Romera, J.M.; del Mar Martinez-Ballesteros, M.; Garcia-Gutierrez, J.; Riquelme-Santos, J.C. An approach to silhouette and dunn clustering indices applied to big data in spark. In Advances in Artificial Intelligence: 17th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2016, Salamanca, Spain, 14–16 September 2016; Proceedings 17; Springer: Cham, Switzerland, 2016; pp. 160–169. [Google Scholar] [CrossRef]
  44. Brock, G.; Pihur, V.; Datta, S.; Datta, S. clValid: An R Package for Cluster Validation. J. Stat. Softw. 2008, 25, 1–22. Available online: https://www.jstatsoft.org/v25/i04/. (accessed on 12 November 2024). [CrossRef]
  45. Maechler, M.; Rousseeuw, P.; Struyf, A.; Hubert, M.; Hornik, K. cluster: Cluster Analysis Basics and Extensions; R package version 2.1.4; 2022. Available online: https://CRAN.R-project.org/package=cluster (accessed on 8 November 2024).
  46. Zhao, Y.; Liu, J.; Huang, H.; Zan, F.; Zhao, P.; Zhao, J.; Deng, J.; Wu, C. Genetic improvement of sugarcane (Saccharum spp.) contributed to high sucrose content in China based on an analysis of newly developed varieties. Agriculture 2022, 12, 1789. [Google Scholar] [CrossRef]
  47. Cheavegatti-Gianotto, A.; De Abreu, H.M.C.; Arruda, P.; Bespalhok Filho, J.C.; Burnquist, W.L.; Creste, S.; di Ciero, L.; Ferro, J.A.; de Oliveira Figueira, A.V.; de Sousa Filgueiras, T.; et al. Sugarcane (Saccharum X officinarum): A reference study for the regulation of genetically modified cultivars in Brazil. Trop. Plant Biol. 2011, 4, 62–89. [Google Scholar] [CrossRef]
  48. Marcano, M.; Manrique, U.; Garcia, M.; Salcedo, F. Prueba de ocho variedades de caña de azúcar (Saccharum sp.) bajo condiciones de secano en un suelo de sabana del estado Monagas, Venezuela. Rev. Científica UDO Agrícola 2005, 5, 54–61. Available online: https://dialnet.unirioja.es/servlet/articulo?codigo=2221593 (accessed on 14 September 2023).
  49. Vasantha, S.; Kumar, R.A.; Tayade, A.S.; Krishnapriya, V.; Ram, B.; Solomon, S. Physiology of sucrose productivity and implications of ripeners in sugarcane. Sugar Tech 2022, 24, 715–731. [Google Scholar] [CrossRef]
  50. García, S.S.; Escobar, R.N.; Alanis, L.B. Determinación de la dosis óptima económica de fertilización en caña de azúcar. Terra Latinoam. 2003, 21, 267–272. Available online: http://www.redalyc.org/articulo.oa?id=57315595012 (accessed on 6 April 2023).
  51. Gómez-Merino, F.C. Manual para la identificación varietal de caña de azúcar, 1st ed.; Colegio de Postgraduados: Texcoco, México, 2015; pp. 5–7. Available online: https://www.researchgate.net/profile/Fernando-Gomez-Merino/publication/271647569_Manual_para_la_Identificacion_Varietal_de_Cana_de_Azucar/links/54f483720cf2eed5d734bf55/Manual-para-la-Identificacion-Varietal-de-Cana-de-Azucar.pdf (accessed on 6 April 2023).
  52. Mei, H.; Mao, L.; Zhang, Y.; Chen, M. BDT-ADBSCAN: Adaptive Density-Based Spatial Clustering of Applications with Noise Based on Bayesian Decision Theory for Identifying Clusters with Multi-Densities. In Proceedings of the 2022 IEEE 10th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), Chongqing, China, 17–19 June 2022; pp. 1510–1516. [Google Scholar] [CrossRef]
  53. Jackson, P.A. Breeding for improved sugar content in sugarcane. Field Crops Res. 2005, 92, 277–290. [Google Scholar] [CrossRef]
  54. Delgado Mora, I.; Jorge Suarez, H.; Vera, A.; Cornide Hernández, M.T.; Díaz Mujica, F.R.; Gómez Pérez, J.R.; Suárez Sanchez, O.; Puchades Isaguirre, Y. Influencia de la edad y cultivar de caña de azúcar en el momento de la cosecha. Cent. Agrícola 2016, 43, 59–65. Available online: http://scielo.sld.cu/scielo.php?pid=S0253-57852016000200008&script=sci_arttext (accessed on 3 July 2024).
  55. Espinoza, J.G. Maduración de la caña de azúcar y floración de la caña de azúcar y su manejo. Cengicaña. 2012, pp. 262–281. Available online: https://dialnet.unirioja.es/servlet/articulo?codigo=5149559 (accessed on 3 July 2024).
  56. Mendoza Batista, Y.; Cruz Sarmiento, R.; Vaillant Cáceres, Y.; Luis Martínez, O.; Céspedes Argota, M. Comportamiento de los cultivares de caña de azúcar C97-445 y C95-416 en localidades de la provincia Holguín. Cent. Agrícola 2019, 46, 49–53. Available online: http://scielo.sld.cu/scielo.php?pid=S0253-57852019000100049&script=sci_arttext&tlng=en (accessed on 6 July 2023).
Figure 1. Estimated curves for materials based on intercept and slope for the plant cycle.
Figure 1. Estimated curves for materials based on intercept and slope for the plant cycle.
Agriengineering 07 00114 g001
Figure 2. Clustering of Las Pilas materials in the plant cycle.
Figure 2. Clustering of Las Pilas materials in the plant cycle.
Agriengineering 07 00114 g002
Figure 3. Heatmap of material maturity based on clustering algorithms for the plant cycle.
Figure 3. Heatmap of material maturity based on clustering algorithms for the plant cycle.
Agriengineering 07 00114 g003
Figure 4. Estimated curves for materials based on intercept and slope for the ratoon cycle.
Figure 4. Estimated curves for materials based on intercept and slope for the ratoon cycle.
Agriengineering 07 00114 g004
Figure 5. Clustering of materials in the ratoon cycle, Las Pilas.
Figure 5. Clustering of materials in the ratoon cycle, Las Pilas.
Agriengineering 07 00114 g005
Figure 6. Heatmap of material maturity based on clustering algorithms for the ratoon cycle.
Figure 6. Heatmap of material maturity based on clustering algorithms for the ratoon cycle.
Agriengineering 07 00114 g006
Table 1. Materials evaluated during the plant and ratoon crop cycles, Las Pilas experimental field.
Table 1. Materials evaluated during the plant and ratoon crop cycles, Las Pilas experimental field.
IDMaterialOriginTypeIDMaterialOriginType
1COSTA JALMexicanN21ITSAMEX 07-7259MexicanN
2RB 85-5035ForeignN22ITSAMEX 06-6395MexicanN
3ITV 92-1424ForeignC23ITSAMEX 06-4863MexicanN
4RB 85-5113ForeignC24ITSAMEX 07-4954MexicanN
5LAICA 92-13ForeignN25ITSAMEX 07-4387MexicanN
6CP 72-2086ForeignC26ITSAMEX 07-121115MexicanN
7COLMEX 94-8MexicanC27ITSAMEX 07-246MexicanN
8ATEMEX 99-48MexicanN28ITSAMEX 07-12116MexicanN
9ATEMEX 99-1MexicanN29ITSAMEX 07-12113MexicanN
10ATEMEX 99-61MexicanN30ITSAMEX 07-2963MexicanN
11MEX 70-486MexicanN31ITSAMEX 07-99711MexicanN
12TCP 89-3505ForeignN32ITSAMEX 07-9886MexicanN
13MEX 80-1521MexicanN33ITSAMEX 07-86810MexicanN
14MEX 69-290MexicanC34ITSAMEX 07-12119MexicanN
15ITSAMEX 07-44814MexicanN35ITSAMEX 07-1903MexicanN
16ITSAMEX 06-3049MexicanN36ITSAMEX 07-44813MexicanN
17ITSAMEX 07-8681MexicanN37ITSAMEX 07-12118MexicanN
18ITSAMEX 07-20810MexicanN38CP 85-1382ForeignN
19ITSAMEX 07-7501MexicanN39COLMEX 95-27MexicanC
20ITSAMEX 07-1107MexicanN
N: new material; C: commercial variety.
Table 2. Sucrose percentage evaluation dates (month-day-year) of the 39 materials studied, Las Pilas.
Table 2. Sucrose percentage evaluation dates (month-day-year) of the 39 materials studied, Las Pilas.
Dates
Cycle Period D1 D2 D3 D4
Planting
Plant03-04-201110-08-201111-08-201112-08-201101-11-2012
Elapsed days D 1 D 1 = 0 D 2 D 1 = 31 D 3 D 2 = 30 D 4 D 3 = 34
Harvest
Ratoon05-21-201211-16-201212-19-201201-22-201302-25-2013
Elapsed days D 1 D 1 = 0 D 2 D 1 = 33 D 3 D 2 = 34 D 4 D 3 = 34
Table 3. Biological maturity of the six commercial materials.
Table 3. Biological maturity of the six commercial materials.
Conditions
Commercial Variety Type of Maturity Temperature Altitude masl Soil
CP 72-2086early19.5–35.40–1300FAR, AR, ARL
ITV 92-1424early19.5–35.430–1200FAR, AR, ARL
COLMEX 95-27early19.5–35.40–1300FAR, AR, ARL
COLMEX 94-8early–intermediate19.5–35.450–1200FAR, AR, ARL
RB 85-5113early–intermediate15.0–29.5 F, FAR, FLI
MEX 69-290intermediate–late19.5–35.40–1300FAR, AR, ARL
masl: meters above sea level.
Table 4. Estimated parameters of fixed and random effects of Model (3) for the plant cycle.
Table 4. Estimated parameters of fixed and random effects of Model (3) for the plant cycle.
Fixed EffectsRandom Effects
Parameter 95 % CI Parameter 95 % CI Parameter 95 % CI
β ^ 0 = 6.37 ( 5.50 , 7.24 ) σ ^ u 0 = 2.21 ( 1.56 , 3.14 ) ρ ^ I C C = 0.89 ( 0.98 , 0.50 )
β ^ 1 = 1.50 ( 1.25 , 1.75 ) σ ^ u 1 = 0.47 ( 0.23 , 0.94 ) σ ^ e = 1.23 ( 1.05 , 1.44 )
CI: confidence interval.
Table 5. Estimated parameters of Model (3) for the 39 materials for the plant cycle.
Table 5. Estimated parameters of Model (3) for the 39 materials for the plant cycle.
IDMaterialType β ^ 0 β ^ 1 IDMaterialType β ^ 0 β ^ 1
26ITSAMEX 07-121115N3.012.1432ITSAMEX 07-9886N6.651.41
19ITSAMEX 07-7501N3.032.1735ITSAMEX 07-1903N6.781.39
24ITSAMEX 07-4954N3.621.811COSTA JALN7.041.43
27ITSAMEX 07-246N3.702.002RB 85-5035N7.051.34
12TCP 89-3505N3.791.855LAICA 92-13N7.111.24
21ITSAMEX 07-7259N4.072.0039COLMEX 95-27C7.561.31
18ITSAMEX 07-20810N4.211.6636ITSAMEX 07-44813N7.561.37
33ITSAMEX 07-86810N4.211.8813MEX 80-1521N7.791.19
25ITSAMEX 07-4387N4.681.7615ITSAMEX 07-44814N7.801.06
29ITSAMEX 07-12113N5.011.686CP 72-2086C7.921.28
34ITSAMEX 07-12119N5.041.7911MEX 70-486N7.971.34
20ITSAMEX 07-1107N5.131.569ATEMEX 99-1N8.081.01
28ITSAMEX 07-12116N5.451.644RB 85-5113C8.231.23
23ITSAMEX 06-4863N5.511.733ITV 92-1424C8.391.21
37ITSAMEX 07-12118N5.521.847COLMEX 94-8C8.971.22
16ITSAMEX 06-3049N5.601.7022ITSAMEX 06-6395N9.190.98
30ITSAMEX 07-2963N5.671.5910ATEMEX 99-61N9.480.94
14MEX 69-290C5.671.828ATEMEX 99-48N9.790.98
17ITSAMEX 07-8681N5.921.6138CP 85-1382N9.860.92
31ITSAMEX 07-99711N6.431.42
N: new material; C: commercial variety.
Table 6. Comparison of the clustering results for the plant cycle, Las Pilas.
Table 6. Comparison of the clustering results for the plant cycle, Las Pilas.
Groups
ID Material Type k-means k-medoids DBSCAN
3ITV 92-1424CEarlyEarlyEarly
4RB 85-5113CEarlyEarlyEarly
6CP 72-2086CEarlyEarlyEarly
9ATEMEX 99-1NEarlyEarlyEarly
11MEX 70-486NEarlyEarlyEarly
13MEX 80-1521NEarlyEarlyEarly
15ITSAMEX 07-44814NEarlyEarlyEarly
1COSTA JALNEarlyEarly–IntermediateEarly
2RB 85-5035NEarlyEarly–IntermediateEarly
5LAICA 92-13NEarlyEarly–IntermediateEarly
32ITSAMEX 07-9886NEarlyEarly–IntermediateEarly
35ITSAMEX 07-1903NEarlyEarly–IntermediateEarly
36ITSAMEX 07-44813NEarlyEarly–IntermediateEarly
39COLMEX 95-27CEarlyEarly–IntermediateEarly
7COLMEX 94-8CEarly–IntermediateEarlyEarly–Intermediate
8ATEMEX 99-48NEarly–IntermediateEarlyEarly–Intermediate
10ATEMEX 99-61NEarly–IntermediateEarlyEarly–Intermediate
22ITSAMEX 06-6395NEarly–IntermediateEarlyEarly–Intermediate
38CP 85-1382NEarly–IntermediateEarlyEarly–Intermediate
31ITSAMEX 07-99711NIntermediate–LateEarly–IntermediateEarly
14MEX 69-290CIntermediate–LateIntermediate–LateIntermediate–Late
16ITSAMEX 06-3049NIntermediate–LateIntermediate–LateIntermediate–Late
17ITSAMEX 07-8681NIntermediate–LateIntermediate–LateIntermediate–Late
20ITSAMEX 07-1107NIntermediate–LateIntermediate–LateIntermediate–Late
23ITSAMEX 06-4863NIntermediate–LateIntermediate–LateIntermediate–Late
25ITSAMEX 07-4387NIntermediate–LateIntermediate–LateIntermediate–Late
28ITSAMEX 07-12116NIntermediate–LateIntermediate–LateIntermediate–Late
29ITSAMEX 07-12113NIntermediate–LateIntermediate–LateIntermediate–Late
30ITSAMEX 07-2963NIntermediate–LateIntermediate–LateIntermediate–Late
34ITSAMEX 07-12119NIntermediate–LateIntermediate–LateIntermediate–Late
37ITSAMEX 07-12118NIntermediate–LateIntermediate–LateIntermediate–Late
12TCP 89-3505NLateLateIntermediate–Late
18ITSAMEX 07-20810NLateLateIntermediate–Late
21ITSAMEX 07-7259NLateLateIntermediate–Late
24ITSAMEX 07-4954NLateLateIntermediate–Late
27ITSAMEX 07-246NLateLateIntermediate–Late
33ITSAMEX 07-86810NLateLateIntermediate–Late
19ITSAMEX 07-7501NLateLateLate
26ITSAMEX 07-121115NLateLateLate
N: new material; C: commercial variety.
Table 7. Validation indices for the plant cycle.
Table 7. Validation indices for the plant cycle.
MethodConnectivityDunnSilhouette
k-means13.410.120.65
k-medoids13.850.120.55
DBSCAN12.360.240.64
Table 8. Estimated parameters of fixed and random effects of Model (3) for the ratoon cycle.
Table 8. Estimated parameters of fixed and random effects of Model (3) for the ratoon cycle.
Fixed EffectsRandom Effects
Parameter 95 % CI Parameter 95 % CI
β ^ 0 = 7.34 ( 6.79 , 7.90 ) σ ^ u 0 = 1.18 ( 0.90 , 1.54 )
β ^ 1 = 1.95 ( 1.25 , 1.75 ) σ ^ e = 1.05 ( 0.93 , 1.20 )
CI: confidence interval.
Table 9. Estimated parameters for the 39 materials in the ratoon cycle, Las Pilas.
Table 9. Estimated parameters for the 39 materials in the ratoon cycle, Las Pilas.
IDMaterialType β ^ 0 β ^ 1 IDMaterialType β ^ 0 β ^ 1
24ITSAMEX 07-4954N4.481.9523ITSAMEX 06-4863N7.331.95
25ITSAMEX 07-4387N5.801.9513MEX 80-1521N7.371.95
5LAICA 92-13N6.031.959ATEMEX 99-1N7.401.95
12TCP 89-3505N6.191.9515ITSAMEX 07-44814N7.441.95
2RB 85-5035N6.231.9532ITSAMEX 07-9886N7.571.95
28ITSAMEX 07-12116N6.361.9531ITSAMEX 07-99711N7.821.95
27ITSAMEX 07-246N6.451.9534ITSAMEX 07-12119N7.831.95
36ITSAMEX 07-44813N6.461.9538CP 85-1382N7.871.95
29ITSAMEX 07-12113N6.511.9533ITSAMEX 07-86810N8.031.95
17ITSAMEX 07-8681N6.511.9522ITSAMEX 06-6395N8.221.95
18ITSAMEX 07-20810N6.531.9510ATEMEX 99-61N8.231.95
20ITSAMEX 07-1107N6.621.953ITV 92-1424C8.301.95
16ITSAMEX 06-3049N6.711.954RB 85-5113C8.301.95
26ITSAMEX 07-121115N6.731.9535ITSAMEX 07-1903N8.691.95
21ITSAMEX 07-7259N6.781.9511MEX 70-486N8.831.95
1COSTA JALN7.001.957COLMEX 94-8C9.201.95
19ITSAMEX 07-7501N7.001.958ATEMEX 99-48N9.201.95
30ITSAMEX 07-2963N7.111.956CP 72-2086C9.351.95
14MEX 69-290C7.161.9539COLMEX 95-27C9.371.95
37ITSAMEX 07-12118N7.261.95
N: new material; C: commercial variety.
Table 10. Comparison of the algorithms for the ratoon cycle.
Table 10. Comparison of the algorithms for the ratoon cycle.
Groups
ID Material Type k-means k-medoids DBSCAN
3ITV 92-1424CEarlyEarlyEarly
4RB 85-5113CEarlyEarlyEarly
10ATEMEX 99-61NEarlyEarlyEarly
22ITSAMEX 06-6395NEarlyEarlyEarly
6CP 72-2086CEarlyEarly–IntermediateEarly–Intermediate
7COLMEX 94-8CEarlyEarly–IntermediateEarly–Intermediate
8ATEMEX 99-48NEarlyEarly–IntermediateEarly–Intermediate
39COLMEX 95-27CEarlyEarly–IntermediateEarly–Intermediate
35ITSAMEX 07-1903NEarlyEarly–IntermediateLate
11MEX 70-486NEarlyEarly–IntermediateLate
31ITSAMEX 07-99711NEarly–IntermediateEarlyEarly
33ITSAMEX 07-86810NEarly–IntermediateEarlyEarly
34ITSAMEX 07-12119NEarly–IntermediateEarlyEarly
38CP 85-1382NEarly–IntermediateEarlyEarly
1COSTA JALNEarly–IntermediateIntermediate–LateIntermediate–Late
9ATEMEX 99-1NEarly–IntermediateIntermediate–LateIntermediate–Late
13MEX 80-1521NEarly–IntermediateIntermediate–LateIntermediate–Late
14MEX 69-290CEarly–IntermediateIntermediate–LateIntermediate–Late
15ITSAMEX 07-44814NEarly–IntermediateIntermediate–LateIntermediate–Late
19ITSAMEX 07-7501NEarly–IntermediateIntermediate–LateIntermediate–Late
23ITSAMEX 06-4863NEarly–IntermediateIntermediate–LateIntermediate–Late
30ITSAMEX 07-2963NEarly–IntermediateIntermediate–LateIntermediate–Late
32ITSAMEX 07-9886NEarly–IntermediateIntermediate–LateIntermediate–Late
37ITSAMEX 07-12118NEarly–IntermediateIntermediate–LateIntermediate–Late
2RB 85-5035NIntermediate–LateIntermediate–LateIntermediate–Late
5LAICA 92-13NIntermediate–LateLateIntermediate–Late
12TCP 89-3505NIntermediate–LateLateIntermediate–Late
16ITSAMEX 06-3049NIntermediate–LateLateIntermediate–Late
17ITSAMEX 07-8681NIntermediate–LateLateIntermediate–Late
18ITSAMEX 07-20810NIntermediate–LateLateIntermediate–Late
20ITSAMEX 07-1107NIntermediate–LateLateIntermediate–Late
21ITSAMEX 07-7259NIntermediate–LateLateIntermediate–Late
25ITSAMEX 07-4387NIntermediate–LateLateIntermediate–Late
26ITSAMEX 07-121115NIntermediate–LateLateIntermediate–Late
27ITSAMEX 07-246NIntermediate–LateLateIntermediate–Late
28ITSAMEX 07-12116NIntermediate–LateLateIntermediate–Late
29ITSAMEX 07-12113NIntermediate–LateLateIntermediate–Late
36ITSAMEX 07-44813NIntermediate–LateLateIntermediate–Late
24ITSAMEX 07-4954NLateLateNot classified
N: new material; C: commercial variety.
Table 11. Validation indices for the ratoon cycle.
Table 11. Validation indices for the ratoon cycle.
MethodConnectivityDunnSilhouette
k-means12.080.160.56
k-medoids13.680.060.59
DBSCAN16.110.140.64
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Carretillo Moctezuma, C.D.; Guzmán Martínez, M.; Godínez-Jaimes, F.; García-Preciado, J.C.; Reyes Carreto, R.; Terrones Salgado, J.; Pérez Arriaga, E. Growth Curve Models and Clustering Techniques for Studying New Sugarcane Hybrids. AgriEngineering 2025, 7, 114. https://doi.org/10.3390/agriengineering7040114

AMA Style

Carretillo Moctezuma CD, Guzmán Martínez M, Godínez-Jaimes F, García-Preciado JC, Reyes Carreto R, Terrones Salgado J, Pérez Arriaga E. Growth Curve Models and Clustering Techniques for Studying New Sugarcane Hybrids. AgriEngineering. 2025; 7(4):114. https://doi.org/10.3390/agriengineering7040114

Chicago/Turabian Style

Carretillo Moctezuma, Carlos David, María Guzmán Martínez, Flaviano Godínez-Jaimes, José C. García-Preciado, Ramón Reyes Carreto, José Terrones Salgado, and Edgar Pérez Arriaga. 2025. "Growth Curve Models and Clustering Techniques for Studying New Sugarcane Hybrids" AgriEngineering 7, no. 4: 114. https://doi.org/10.3390/agriengineering7040114

APA Style

Carretillo Moctezuma, C. D., Guzmán Martínez, M., Godínez-Jaimes, F., García-Preciado, J. C., Reyes Carreto, R., Terrones Salgado, J., & Pérez Arriaga, E. (2025). Growth Curve Models and Clustering Techniques for Studying New Sugarcane Hybrids. AgriEngineering, 7(4), 114. https://doi.org/10.3390/agriengineering7040114

Article Metrics

Back to TopTop