Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships

Sánchez-Arnau, Elena; Ferrer-Sapena, Antonia; Sánchez-Arnau, Claudia; Sánchez-Pérez, Enrique A.

doi:10.3390/math14040714

Open AccessArticle

Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships

by

Elena Sánchez-Arnau

,

Antonia Ferrer-Sapena

,

Claudia Sánchez-Arnau

and

Enrique A. Sánchez-Pérez

^*

Instituto Universitario de Matemática Pura y Aplicada, Universitat Politècnica de València, 46022 València, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(4), 714; https://doi.org/10.3390/math14040714

Submission received: 23 December 2025 / Revised: 29 January 2026 / Accepted: 14 February 2026 / Published: 18 February 2026

(This article belongs to the Special Issue Applied Time Series and Artificial Intelligence in Economics and Finance)

Download

Browse Figures

Versions Notes

Abstract

Producer share indicators summarize how value is distributed along agrifood supply chains, yet their temporal dynamics remain difficult to compare across products and periods. This paper proposes a reproducible time-series analytics framework to characterize and group producer-share trajectories derived from paired origin–destination price series. We compute producer share time series for a set of agrifood products and quantify similarity using complementary measures capturing co-movement and shape, including Pearson-correlation-based proximity and Euclidean distance on standardized representations. To reduce dimensionality and mitigate noise, we apply principal component analysis and perform unsupervised clustering (k-means) to identify classes of products exhibiting comparable producer-share dynamics. The resulting clusters provide an interpretable typology of market behaviors, highlighting homogeneous groups that may share structural drivers (e.g., commercialization patterns or intermediation margins). We further discuss how cluster membership can support decision-making in crop substitution and market monitoring by revealing products with analogous temporal responses. The proposed pipeline is simple to implement, fully data-driven, and adaptable to other commodity-price settings.

Keywords:

time series; clustering; price transmission; producer share; agrifood markets

MSC:

91B76

1. Introduction: The Producer Share in Agrifood Value Chains

For people not directly involved in the agrifood system, the gap between the price at which farmers sell their products and the final retail price can be striking. This difference is often so large that it becomes difficult to understand how producers can continue operating and why ordinary consumers end up paying so much in the marketplace for products that are essentially inexpensive [1]. Yet this situation is not new: it is a long-established trend that seems practically impossible to reverse. In certain social contexts—such as southern Europe, which concentrates a significant share of European production—this dynamic pushes producers to seek alternative ways of marketing their goods: local markets, direct sales, cooperative or associative channels, and other strategies aimed at reducing the often excessive margins imposed by the distribution chain. However, these initiatives remain insufficient, as much of the chain is now controlled by large market operators who dominate both sides of distribution, pressuring producers to sell at low prices while consumers continue to face high retail prices.

A substantial body of empirical literature has documented the existence of asymmetric adjustments in price transmission along supply chains, whereby downstream prices respond unevenly to upstream cost shocks. One of the most frequently observed patterns is the so-called “rockets and feathers” behavior, in which output prices increase rapidly following input price rises but adjust more slowly when input prices decline. Early empirical evidence of such asymmetries was reported in energy and petroleum markets [2,3,4], and later systematized in comprehensive surveys focusing on agricultural and food markets [5]. These studies suggest that asymmetric price responses may stem from market power, adjustment costs, inventory management strategies, or informational frictions along the value chain. More recent contributions have extended the analysis to increasingly complex and globalized food systems. Empirical evidence of persistent asymmetric price dynamics has been found in European food markets [6], as well as in sector-specific contexts such as aquaculture, where structural changes in the supply chain may amplify non-linear price adjustments [7]. Taken together, this literature indicates that asymmetric price transmission represents a structural feature of many agrifood markets, with significant implications for producer welfare, consumer prices, and policy design.

Following a current trend of mixing new methodological contributions with local studies [8,9], in this paper we present an integrated mathematical approach to analyze and quantify the price increases generated by the distribution chain by comparing the behavior of the marketing margin for different products using mathematical and artificial intelligence (AI) tools for data analysis. Our aim is to develop an analytical tool that supports farm management, particularly regarding the flexibility to choose among different products and to substitute one product for another for strategic reasons. By analyzing the time series of producer shares, we can identify similarities between products and determine which ones behave alike. In this way, farmers are informed about alternative products with comparable marketing characteristics, should they need to switch the specific crop they cultivate. A concrete interpretation of how cluster membership can support crop substitution and strategic farm decisions is provided at the end of this section and in Section 4.

The index on which we focus our attention, and which is a well-recognized measure of the relative marketing margin, is the so-called producer share: the ratio between the price at which farmers sell their products and the final price paid by consumers for those same products. Being a relative value, it clearly reflects how far the market price of vegetables is from the amount producers actually receive. Typically, a high producer share is interpreted as an indicator of a well-balanced market, although what “high” means depends on the specific context: transportation, logistics, and marketing naturally entail costs.

On the other hand, mathematical modeling has become a powerful tool for addressing complex management problems in agricultural planning, particularly those related to economic, market, and resource foresight [10,11,12]. In this context, the present paper seeks to contribute to this body of technical tools in order to support decision-making within these tasks. However, far from being an abstract investigation, we used direct data from Spanish wholesale markets to determine the validity of the model, as well as to draw some general conclusions about the state of these markets in the country. Beyond reporting concrete figures for these ratios, we use the time series of relative price ratios to define classes of similarity among certain horticultural products, showing global trends as well as subgroups of vegetables with similar behavior. Our goal is to identify each horticultural product with a vector representing the value of the time series of producer shares and to group products according to the trend they have shown over recent years. Since we have chosen the values for a fixed month of the year (this is done in the first part of our analysis,) the behavior should be similar, or at least it can be assumed that market distortions due, for example, to weather conditions, have affected all products in the same way. In the second part of the study, however, we consider the entire historical dataset that we compiled from publicly available online sources in order to define the vectors representing each orchard product.

The calculated grouping provides us with a description of the intrinsic value of one of the main market properties of the vegetables observed, as well as their behavior over the years, which can result in a solid and easy-to-interpret similarity relationship that can help farmers change the product they grow in order to cope with unforeseen circumstances or simply to become more competitive. The results could be useful in themselves for a particular group of field managers, but they also provide scientific evidence of how products perform in national marketing, offering a reproducible methodology for researchers and managers.

This paper is primarily devoted to improving the application of mathematical tools for managing small-scale agricultural productive farms. While the mathematical techniques employed are well-established, our aim is to propose a unified methodology that integrates data management, data analysis, mathematical modeling of relevant agricultural variables, statistical and AI procedures, and comprehensive representation of results. Rather than enhancing a particular method, we present a holistic mathematical/AI framework, aligning with contemporary trends in applied mathematics that emphasize integrated tools combining data management, mathematics, statistics, AI, and easily interpretable graphics to provide complete and rapid insights into complex problems. This methodological contribution represents the core objective of our work.

1.1. Context and Related Literature

As explained above, the divergence between agricultural (origin) and retail (destination) prices, often referred to as the price differential between farms and retailers, and its complementary quantity, the producer share, are fundamental parameters for the agrifood economy and policy. An initial discussion of asymmetric price transmission has been presented in the first section of the Introduction. In developed countries, public statistics agencies routinely monitor these indicators to track how marketing, transportation, processing, storage, and retail costs influence the final prices paid by consumers. For example, the United States Economic Research Service provides long-range series and methodological documentation on farm-to-retail spreads and farm shares for a wide range of foods [13]. The European Union also provides coverage for vegetable markets, including indicators of production, trade, and prices that are relevant to margins between production and retail and farm shares in horticulture [14,15].

However, the scientific and technical literature on price transmission in this area is often incomplete and sometimes fails to adequately explain its complexity. But this is a fundamental issue for agricultural development, so a great effort has been made to provide a theoretical framework and well-founded practical tools to aid in the management of agrifood systems [5,16]. In the European context, technical reports and applied studies are systematically produced to analyze how transmission occurs along specific supply chains (e.g., meat, vegetables, fruit), highlighting and updating lists of relevant indices on the performance of agriculture in countries at different stages of the chain [17,18].

International institutions such as the FAO also provide complementary international information, connecting local transmission with global markets [19,20]. Case studies at the national/sectoral level (e.g., the sheep sector in Spain) show the complexity of the food supply chain, where wholesale and retail prices [21] are often not linearly connected. Recent reports on specific production sectors also show how the new international situation (including logistical factors, retail concentration, and sustainability requirements) has strong implications for margins and transmission dynamics [22]. Finally, systematic reviews conducted in the wake of recent global crises also highlight the vulnerability of supply chains for perishable goods [23].

In this context, our analysis focuses on horticultural products and analyzes time series of producer shares to compare the dynamics of different items. We use measures based on correlation (similarity of trends), Euclidean distances (proximity of level/shape), and clustering techniques to organize products into equivalence classes. In this way, we provide an interpretable map of items that show comparable price trajectories. Our goal is to provide a practical tool for understanding where the distribution tends to widen (or compress), with implications for producer welfare and the targeting of policies addressing unfair trade practices. We also relate our approach to recent data specific to fruit and vegetable supply chains: studies on channel selection and market performance for vegetables [24], costs and prices in global fruit and vegetable value chains [25], and micro-level case studies of cost transmission between nodes for horticultural products (e.g., carrots and leeks) [26]. Our cluster-based comparisons follow current methodological trends in price linkages and transaction costs within sustainable food value chains [27], and are aligned with EU statistics and monitoring frameworks on fruit and vegetable markets [14,15].

1.2. Main Technical Tools

Let us introduce the main technical parameters to be used in this paper. The first index to be considered is the farm-to-retail price spread

Δ_{t}

(also called the marketing margin), which measures how much the retail (destination) price exceeds the farm (origin) price. Other directly related indices are also relevant. Let us show how they are defined.

The marketing margin at time t is given by

Δ_{t} = P_{t}^{retail} - P_{t}^{farm},

where

P_{t}^{retail}

is the destination price and

P_{t}^{farm}

is the origin (farm) price.

In relative terms (percentage increase), we define

Δ_{t}^{%} = \frac{P_{t}^{retail} - P_{t}^{farm}}{P_{t}^{farm}} \times 100 .

Analysts also make use of the log spread,

log P_{t}^{retail} - log P_{t}^{farm} .

The spread reflects numerous operations along the value chain, such as transport, marketing, handling, storage, losses, processing, taxes, insurance, and commercial margins. Its complementary index is the producer share, defined below, which is in fact the index we analyze in this article.

Indeed, in the design of mathematical tools for analyzing agrifood markets, the producer share (agricultural share) is a key indicator for understanding the distribution of economic value throughout the supply chain. It is defined as the ratio between the price received by the producer on the farm,

P_{f}

(which, depending on the context, corresponds to the farm price

P_{t}^{farm}

used above), and the price paid at the end of the marketing chain,

P_{r}

(the retail price

P_{t}^{retail}

mentioned above). This ratio provides a direct measure of the proportion of consumer expenditure that remains with the primary producer [28].

Thus, the producer share at a time

t,

P S (t),

is given by

P S (t) = \frac{P_{f} (t)}{P_{r} (t)} .

(1)

The interpretation of the producer share is straightforward. A high value is interpreted as an indicator of a balanced and equitable value chain, where farmers receive adequate compensation for their production costs and risks. However, if the share is low, it means that a significant portion of the final value is being absorbed by intermediaries, such as processors, distributors, and retailers. Although this may be due to justifiable causes, this fact is often interpreted as an imbalance of power that causes market inefficiencies [1].

1.3. Experimental Data on Price Transmission in the Supply Chain

As mentioned above, we illustrate our farm management model through a contextual case study, working with the time series of the major Spanish wholesale market, using the data published by COAG (see the website listed in the References section [29], see also [30]). Empirical studies in various specific contexts of goods distribution (both in Spain [21] and internationally [26]) have shown that retail prices adjust more quickly to increases in costs at source than to decreases, given the greater flexibility of the factors affecting this part of the chain. This causes producer participation to be volatile and tend to deteriorate over time, and is the main factor in the well-known vulnerability of producers within conventional market structures [1,13,16]. Thus, the empirical analysis of producer share is related to the analysis of price transmission along the supply chain. Standard models use mathematical methodologies in which time series play a central role in investigating how consumer price fluctuations are transmitted to the producer. Conclusions based on experimental data often indicate a strong asymmetry in profits, which, although adjusted at the retail level, are not usually adjusted for the producer. As noted in numerous studies [23], this situation, which is normal in most well-established markets, has a direct impact on the profitability of farms.

Therefore, monitoring producer share over time using time series analysis is a fundamental tool for all actors who can influence the process, such as farmers’ associations, policymakers, and scientific researchers, to identify structural problems in agrifood systems. This information can then be used to assess the effectiveness of policies aimed at creating fairer conditions for primary producers and to highlight significant asymmetries in order to promote such policies.

To finish this section, let us remark that the core objective of this study is to provide a structured framework that translates complex price dynamics into actionable market intelligence. By clustering products according to their producer share trajectories, the analysis moves beyond isolated observations and identifies stable behavioral patterns along the agrifood value chain. The first advantage for farm managers is that they can easily use this information to substitute products that are similar (i.e., belong to the same cluster) if needed, under the assumption that the producer share is a critical parameter for representing the market properties of a given product. This value summarizes, in a single number, key information about the characteristics of a product’s commercialization, which, together with the price at a given moment, constitutes the minimum information required for strategic decision-making.

On the other hand, the results can also be used for targeted monitoring. Once a product is assigned to a specific dynamic group, deviations from the group’s expected trajectory can be interpreted as early warning signals. These signals provide a technical basis for collective bargaining processes and support the design of evidence-based agricultural policies.

The integrated methodological approach proposed in this paper is summarized in the flow diagram below.

IPOD Data → Descriptive Maths → Dynamic Group Identification → Decision Support for DM

2. Methodology

Although many factors may influence the problem under study, we focus on two complementary ways of analyzing the time evolution of the producer share. The first approach examines the behavior of products in a fixed month (September) in the yearly series, comparing how similarly selected products behave in the same month across different years. The second approach considers each product as a whole, using all available time series data across all recorded months and years.

The rationale for selecting September for the initial stage of the analysis is twofold. First, it allows for a synchronized comparison across products by minimizing seasonal variance, as September represents a critical transition point in Spanish crop cycles where a wide range of products coexist. Second, this choice serves a pragmatic purpose: demonstrating that a single-month snapshot can yield clusters consistent with those derived from the full historical series. This validates a simplified monitoring tool for stakeholders and farmers, showing that conclusions drawn from a representative month can provide a reliable approximation of long-term market share dynamics without requiring the computational complexity of complete time-series processing.

Similarly, there are two levels of analysis regarding the type of products considered. In the first case, all products in the dataset for which sufficient information is available are included, such as olive oil, fruits, and vegetables. In the second case, the analysis is restricted to products typically produced in orchards, which is the primary context of this study. We primarily follow this orchard-focused approach, although data for all products are provided in the Appendix A.

From a mathematical perspective, our methodology is divided into two parts. First, each product’s time series is represented as a vector in a Euclidean space, and correlations and norm distances among products are analyzed. This provides a one-to-many comparison tool: for a given product, we identify those that behave similarly in terms of correlations and distances between their coordinate values. Second, clustering techniques are applied to group products by similarity, offering information on sets of products that may be interchangeable within the same cluster while preserving their market characteristics.

Finally, two types of similarity information are considered. Correlations describe how coordinated the variations in producer share are between two products, independently of their absolute values. Distances, on the other hand, indicate how far the mean producer share of one product is from that of another. We explain this in clear terms in the next subsection.

2.1. Theoretical Background

Research on time-series clustering consistently shows that results depend strongly on three design choices: how the series are represented, which similarity notion is adopted, and how groups are constructed from that similarity information [31,32]. This is particularly relevant for economic time series, where analysts often need to preserve two distinct aspects of the signal: (i) whether two products exhibit coordinated temporal variations (co-movement), and (ii) whether their producer-share levels are comparable in economically meaningful terms.

Our approach adopts a deliberately simple, two-view perspective. One view captures synchronized dynamics across time (a pattern-oriented notion of similarity), and the other view captures differences in magnitude across the observation horizon (a level-sensitive notion of dissimilarity). These complementary views are then combined through clustering to obtain a small number of product families with homogeneous producer-share behavior. This aligns with the general recommendation, emphasized in the time-series clustering literature, that complementary criteria may be preferable to a single universal distance when different invariances are relevant [32].

It is worth noting that alternative similarity paradigms exist in time-series analysis. Dynamic Time Warping (DTW) is designed to handle temporal misalignment by allowing non-linear re-timing [33]. Likewise, correlation-based constructions have been widely used to build interpretable taxonomies in other domains [34], and correlation-normalized shape-based clustering has also been proposed for scalable time-series grouping [35]. Recent methodological advances have enriched the mathematical toolkit for time series comparison. Representation learning approaches now employ self-supervised contrastive frameworks to capture temporal dependencies without explicit supervision [36]. Shapelet-based methods identify discriminative subsequences that characterize different cluster structures [37]. Furthermore, kernel methods have been extended to time series through the Global Alignment Kernel, which combines dynamic programming with kernel theory to enable flexible similarity assessments [38]. Probabilistic model-based clustering using hidden Markov models and Gaussian mixture models provides statistically principled frameworks for temporal grouping [39]. We refer to these lines of work only to highlight a methodological point: the definition of similarity must match the application. In our context, producer-share levels are economically informative, so level-preserving comparisons remain central, while co-movement information is treated as a complementary diagnostic rather than a replacement.

While the literature documents increasingly sophisticated techniques for time series clustering [31,40], there is no universally optimal approach. Instead, clustering performance depends critically on three already mentioned interrelated design choices: the representation of temporal data, the definition of similarity between series, and the algorithm used to form groups [31,41]. Modern developments demonstrate the breadth of these choices. Dictionary learning methods decompose time series into interpretable building blocks that facilitate both compression and clustering [42]. Tensor-based approaches leverage multi-way data structures to capture complex temporal patterns across multiple dimensions simultaneously [43]. Graph neural networks have been adapted to model temporal dependencies through learnable adjacency matrices that encode relationships between time points [44]. Additionally, ensemble methods that combine multiple distance measures or clustering algorithms have proven effective in handling the inherent diversity of temporal patterns [45]. In this sense, simple approaches that separate the comparison of temporal patterns and the comparison of levels can be as informative as more complex methods, especially when each criterion captures an economically relevant aspect of the phenomenon under analysis. This idea coincides with evidence that combining complementary criteria is often more appropriate than resorting to a single sophisticated measure when searching for different types of invariance [31].

The next subsections introduce the mathematical objects and notation used in the paper, formalize the two complementary similarity notions, and describe the clustering procedure used to derive product families.

2.2. Objects of Analysis and Notation

Let

P = {1, \dots, P}

be a finite set of horticultural products (e.g., lettuce, carrot), and let

T = {t_{1}, \dots, t_{T}}

be a set of consecutive calendar years. We fix a month

m^{★}

(e.g., September) and extract for each product

i \in P

and year

t \in T

two observed price levels,

O_{i, t}^{(m^{★})} for the origin price, and D_{i, t}^{(m^{★})} for the destination price .

As explained above, with this notation we define the producer share (PS) as the ratio of origin to destination prices in the selected month,

S_{i, t} = \frac{O_{i, t}^{(m^{★})}}{D_{i, t}^{(m^{★})}} \in (0, \infty) .

For each product i, we form the time–indexed vector

S_{i} = (S_{i, t_{1}}, S_{i, t_{2}}, \dots, S_{i, t_{T}}) \in R^{T},

which summarizes the interannual dynamics of the producer share at the fixed month

m^{★}

. In the second part of our analysis, all producer shares for all the months of the entire time series are considered to represent the products, and so the representation is provided by vectors in

R^{T \times 12} .

When necessary, we have applied a stabilizing transformation for our internal calculations and analyses, specifically the logarithmic transformation

{\tilde{S}}_{i, t} = log S_{i, t},

and standardize across t (z–scores) to isolate pure temporal patterns from level effects,

Z_{i, t} = \frac{{\tilde{S}}_{i, t} - {\bar{\tilde{S}}}_{i \cdot}}{{\hat{σ}}_{i}}, {\bar{\tilde{S}}}_{i \cdot} = \frac{1}{T} \sum_{t \in T} {\tilde{S}}_{i, t}, {\hat{σ}}_{i}^{2} = \frac{1}{T - 1} \sum_{t \in T} {({\tilde{S}}_{i, t} - {\bar{\tilde{S}}}_{i \cdot})}^{2} .

Log transformations and z-scoring were omitted in the final analysis because all vectors consist of ratios bounded between 0 and 1, and preserving their original scale allows the Euclidean distance to directly reflect meaningful differences in producer shares, which is essential for interpretability and practical decision support.

Thus, although alternative approaches have also been considered, for correlation-based analyses we have used

Z_{i} = {(Z_{i, t})}_{t \in T} .

For distance-based similarity, we have computed the Euclidean distance directly between the representing vectors. Euclidean distance provides a meaningful measure of dissimilarity for ratio-based vectors, allowing the analysis to account for both the overall level and relative distribution of producer shares. Let us explain this below.

2.3. Two Complementary Notions of Similarity

The correlation and distance measures presented in this section serve a fundamentally different purpose than in classical statistical inference. We are not testing hypotheses about relationships between variables, nor do we require that correlations be “statistically significant” in the conventional sense. Rather, we use these measures as metric construction tools to define a geometric space in which products can be compared and grouped. In this framework, modest or near-zero correlations between certain products are not a weakness—they indicate genuine differences in temporal behavior that enable meaningful clustering. Our goal is to provide a complete metric structure that practitioners can use to identify substitutable products, not to establish statistically significant predictive relationships.

Thus, based on the mathematical elements described above, we have considered the following two types of similarity relationships.

(i): Trend similarity (Pearson correlation). For products $i, j \in P$ , the Pearson correlation of their standardized vectors is

$ρ_{i j} = \frac{1}{T - 1} \sum_{t \in T} Z_{i, t} Z_{j, t} \in [- 1, 1] .$

As a dissimilarity measure derived from correlation, we have used the correlation distance to confirm certain arguments and conclusions.

$d_{i j}^{corr} = \sqrt{2 (1 - ρ_{i j})} \in [0, 2],$

which is a proper metric whenever $ρ_{i j}$ is a cosine similarity in $R^{T}$ (here, after z–scoring).
(ii): Level–and–shape proximity (Euclidean distance). To capture differences in magnitude and shape over time, we compute the Euclidean distance

$d_{i j}^{Euc} = {∥ S_{i} - S_{j} ∥}_{2} = {(\sum_{t \in T} {(S_{i, t} - S_{j, t})}^{2})}^{1 / 2} .$

If the focus is on shape only, we replace $S_{i}$ with $Z_{i}$ in the formula above. However, for all our final analyses, we have opted to compute the Euclidean distance directly.

The choice of Pearson correlation and Euclidean distance as interpretation tools over more sophisticated methods such as Dynamic Time Warping (DTW) or deep learning approaches is deliberate and grounded in the specific characteristics of our agricultural price data. While DTW has proven effective for handling temporal misalignments in time series [31], and recent deep learning methods have shown remarkable success in capturing complex patterns in financial time series [40,41], these approaches are most beneficial when dealing with irregular sampling, phase shifts, or highly non-linear dynamics. In contrast, our producer-share data exhibit relatively stable seasonal patterns with synchronized monthly observations across all products. The agricultural pricing mechanisms in wholesale markets operate under common external influences (weather, transport costs, regulatory frameworks), which tend to synchronize rather than desynchronize temporal responses across products. Under these conditions, correlation-based measures efficiently capture the essential co-movement structure without the computational overhead and potential overfitting risks associated with more complex methods.

Moreover, the interpretability of our chosen methods aligns with the practical decision-making context of farm management. Pearson correlation provides an intuitive measure of trend alignment that farmers and agricultural advisors can readily understand and act upon, while Euclidean distance offers a transparent measure of absolute differences in producer-share levels. Deep learning methods, while powerful for prediction tasks with large datasets [40], require substantial training data and computational resources, and often sacrifice interpretability—a critical requirement when the goal is to provide actionable strategic guidance to small-scale producers. Our approach thus prioritizes methodological transparency and operational simplicity over algorithmic sophistication, ensuring that the clustering results can be directly integrated into farm-level planning decisions without requiring specialized expertise or infrastructure.

2.4. Clustering and Equivalence Classes of Products

We induce equivalence classes of products via clustering on a chosen distance matrix

D = (d_{i j})

. In our case, we used the Euclidean distance matrix defined by the Euclidean distances between the vectors

S_{i}

. Using standard clustering algorithms in R (kmeans, prcomp), we compute the corresponding clusters. These clusters are visualized in two-dimensional plots derived from the PCA results, showing the first and second principal components of the vectors.

The result is a partition

P = ⨆_{k = 1}^{K} C_{k}

of products into equivalence classes

C_{k}

with similar dynamics. Comparisons between the Euclidean-based partitions obtained for a fixed month (September) and those based on the entire time series reveal stable families of products that exhibit similar behavior in both representations.

2.5. Interpreting Clusters in Terms of Producer Favorability

Recall that the producer share summarizes the relative incidence of origin versus destination prices. For any cluster

C

and year t, we define within–cluster statistics

{\bar{S}}_{C, t} = \frac{1}{| C |} \sum_{i \in C} S_{i, t}, {IQR}_{C, t} = IQR ({S_{i, t} : i \in C}),

and cluster–level summaries across time

{\bar{S}}_{C, \cdot} = \frac{1}{T} \sum_{t \in T} {\bar{S}}_{C, t}, {Var}_{C, \cdot} = \frac{1}{T - 1} \sum_{t \in T} {({\bar{S}}_{C, t} - {\bar{S}}_{C, \cdot})}^{2} .

Thus, clusters with high

{\bar{S}}_{C, \cdot}

and low

{Var}_{C, \cdot}

indicate product families that tend to be more favorable to producers in a stable manner, independently of absolute production costs. This complements trend alignment captured by correlation and the direct measurement of Euclidean distance. Since these mathematical elements have already been used to verify the validity of the clustering that was finally adopted (five groups), the associated detailed calculations are not shown in the Discussion section, where arguments are instead based on simpler numerical values.

2.6. Clustering of Producer-Share Vectors

K-means clustering is applied directly to the producer-share vectors (

S_{i, t}

) to divide the selected orchard products into equivalence classes. Our goal is to identify sets of products that exhibit similar producer-share behavior along the September time series. Principal Component Analysis (PCA) is used separately to report the dimensionality of the problem and to provide a two-dimensional visualization of the vectors. The optimal number of clusters for the k-means analysis is determined using the elbow method applied directly to the k-means results.

2.7. Robustness, Missing Data, and Sensitivity

When some

S_{i, t}

are missing, pairwise statistics use listwise availability. For distances, we can compute for example

d_{i j}^{Euc} = {(\sum_{t \in T_{i j}} {(S_{i, t} - S_{j, t})}^{2})}^{1 / 2} \cdot \sqrt{\frac{T}{| T_{i j} |}},

where

T_{i j} \subseteq T

is the set of years available for both i and j; the multiplicative factor re–scales to the full horizon. In some cases, we have opted to preserve missing values, leaving the corresponding correlation or distance entries empty in the final matrix (see the heatmaps in Appendices Appendix A and Appendix B), as explained in the next subsection.

To separate pattern from level, correlation uses standardized

Z_{i}

. Euclidean distances can be applied to

S_{i}

(level–sensitive) or

Z_{i}

(shape–sensitive). Some methods can be used to ensure clustering stability, for example by bootstrapping years and recomputing the partition to obtain an adjusted Rand index (ARI) between runs, which allows quantifying robustness. However, due to the strong agreement between the fixed-month-based clustering and the whole-time-series clustering, we have decided to accept the resulting partition as explained in Section 4.

2.8. Data Preparation

The original dataset provided on the website [29] was formatted for processing in R as a CSV file containing all the vectors for each product in the dataset, labelled by year and month. The product names were normalised, since they appeared in the dataset under different labels.

Products with only a few recorded values were removed. For products with only some missing data (at most three missing entries in the complete file), the label NA was preserved and the calculations were carried out under this restriction, so some results may still appear with this label.

Data preprocessing involved a selective imputation strategy to ensure the continuity of the time series. Products with extensive gaps were excluded to maintain the robustness of the dataset. For products with only some missing data (at most three missing entries in the complete file), the label NA was preserved and the calculations were carried out under this restriction, so some results may still appear with this label. For the remaining series, missing values were addressed by applying the annual mean of the product for broader gaps or nearest-neighbor imputation (right position if available) for isolated missing points where a numerical value was required. Although these techniques are standard for maintaining the structural integrity of agrifood series, we acknowledge that they may slightly smooth out extreme volatility. This approach represents a trade-off between data completeness and the preservation of original market signals, and its potential impact on clustering should be considered a limitation of the study.

When the number of missing values was small and the corresponding value was required to proceed with the calculations, the mean of the remaining elements in the row was used. The original data included producer prices, destination prices, and other marketing information. We used the first two values to compute the producer share for each month in the time series, and these monthly producer share values constitute the coordinates of our vectors.

The final outcome was a homogeneous dataset in terms of format, ready to be used for the analyses described above.

3. Results

We present the results in two separate sections. The first corresponds to the fixed-month analysis, while the second considers the whole-year description of the products.

3.1. Fixed Month Time Series

We fix the month of September to illustrate the implementation of the method and the resulting analysis. For methodological reasons, and given the objective of this work, using the whole set of products appearing in the dataset to define equivalence classes does not make sense, since products that belong to the same class must be, in some sense, interchangeable for our tool to be useful. For example, a Carrot field cannot be interchanged with a field of Lemon trees. On the other hand, for some products the number of missing values is so high that the resulting information is not solid enough to draw reliable inferences. As explained in Section 2, this is why we have decided to work only with orchard-cultivable products, specifically the selection shown in the tables below (Table 1 and Table 2). However, the numerical data for a similar analysis of the whole set (with a large number of missing values) can also be found at the end of the paper, in the Appendices Appendix A and Appendix B.

After preparing the dataset, we first focus on the calculation of the producer share. Missing values were imputed by forward filling with the next available value in the row, or by using the mean value when only a few data points were missing.

Table 1 and Table 2 show the values of the selected products. The reader can already observe some similarities among the rows of the matrices, which represent the products listed in the first column. For example, Watermelon and Melon exhibit similar behavior, while the time series of Tomato and Carrot are clearly different.

3.1.1. Pearson Correlation

In the next step, we compute first the (Pearson) correlation matrix and, in the next subsection, the distance matrix for the vectors representing each orchard product in the initial selection. The results are displayed later as heatmaps for the reader’s convenience, where examining the column corresponding to a given product reveals its similarity to the others. The full information is provided in the correlation and distance matrices. The former shows the trends of each product regarding increases or decreases in the producer share, indicating the extent to which they coincide with those of the others. The Euclidean distance matrix represents the proximity between products by comparing the absolute values of their producer shares.

Table 3 and Table 4 provide the values of the Pearson correlation between the orchard products. If the farmer wants to substitute any of these products, a look at the corresponding row in the matrix (or at the heatmap in Figure 1) gives an idea of the alternative options that can be initially considered or, if the goal is to obtain a better producer share, of the products that are not going to increase the value of this index.

From the Pearson correlation matrix, we observe that there are no exceptionally high correlations (positive or negative) between the products, but certain trends are noticeable. For instance, the correlation between Potato and Onion is 0.5250, which indicates a moderate positive relationship. This suggests that these two products share similar trends in their producer shares. This alignment may imply that, under certain conditions, these two products could be considered complementary in agricultural decisions.

Another interesting relationship is the moderate positive correlation between Watermelon and Cabbage (0.4911), as well as between Watermelon and Tomato (0.4306). These correlations suggest that these products may exhibit similar patterns of production or marketing during the September selection period. Consequently, if a farmer is considering substituting one product for another, these crops might represent viable alternatives, especially if their producer shares align well in the market.

In contrast, some products show negative correlations, indicating antagonistic relationships. For example, the correlation between Melon and Potato is −0.0944, and the correlation between Melon and Onion is −0.1177. These negative values suggest that these products tend to move in opposite directions in terms of market share, and substituting one for the other may not be effective in terms of improving overall producer share. However, this can provide a valuable tool for crop rotation.

3.1.2. Euclidean Distance

A different type of similarity is explored in the next step. The Euclidean distance matrix shown in Table 5 and Table 6, together with the visualization provided in the heatmap of Figure 2, give a clear picture of the distances between the orchard products. Although this information does not necessarily coincide with that of the Pearson correlation, both reinforce each other when a similarity relation is detected.

A smaller distance value indicates greater similarity. For instance, the distance between Watermelon and Cabbage is 0.3300, which is relatively small, indicating that these products are close in terms of producer share. Similarly, the distance between Tomato and Watermelon is 0.3667, reinforcing the idea of a moderate positive relation between these two products. On the other hand, products like Eggplant and Potato exhibit a large distance of 0.9188, suggesting that their producer share trends are significantly different. Such products are likely antagonistic in nature, making substitution a less suitable option.

The distance matrix also shows the relevant difference between Cucumber and Carrot, with a distance value of 0.6893, indicating a notable dissimilarity in their producer share patterns. This suggests that, despite some possible commonalities, these two products may not serve as effective substitutes in the same market context.

Thus, the correlation and distance matrices reveal clear relationships among the selected horticultural products. Watermelon, Cabbage, and Tomato show similar producer share patterns and may be considered potential substitutes, while Potato and Onion present a moderate positive association. In contrast, pairs such as Melon–Potato and Eggplant–Potato display antagonistic behavior, making them less suitable for substitution. In the next Section 4, a full interpretation will be given.

3.1.3. Clustering and PCA

We now address the second part of the analysis, which applies Principal Component Analysis (PCA) followed by k-means clustering to divide the selected orchard products into equivalence classes, using the producer share as the grouping criterion. Our goal is to identify sets of products that display similar producer share behavior along the September time series. The PCA transformation is first used to determine the appropriate number of clusters, and the elbow method is then applied to the transformed data to obtain this optimal value. Figure 3 presents the results of this procedure. Table 7 shows description of the parameters of the PCA process, including the cumulative proportion of explained variance (PCAs 1–8, 96% of cumulative proportion of explained variance). The optimal clustering with five groups is shown in Figure 4. The elements of each of the groups are given in Table 8.

We now address the second part of the analysis, which applies k-means clustering directly to the producer-share vectors to divide the selected orchard products into equivalence classes. Our goal is to identify sets of products that display similar producer-share behavior along the September time series. Principal Component Analysis (PCA) is used separately only to inform about the dimensionality of the problem and to provide a two-dimensional representation of the data. Table 7 summarizes the parameters of the PCA process, including the cumulative proportion of explained variance (PCs 1–8 account for 96% of the total variance). The optimal clustering with five groups, determined from the k-means procedure, is shown in Figure 4, and the elements of each group are listed in Table 8.

The clustering results are consistent with the patterns provided by the correlation and Euclidean distance matrices. Indeed, products grouped within the same cluster tend to show higher correlations and shorter distances, indicating similar producer share dynamics. For instance, Watermelon, Cabbage, Tomato, and Lettuce, grouped in Cluster 2 and Cluster 4, were previously shown to have moderate positive correlations and relatively small distances, suggesting comparable market behavior. Similarly, Potato and Onion, grouped together in Cluster 3, reflect the strong positive association observed in the correlation analysis. In contrast, products such as Carrot, isolated in Cluster 5, or Eggplant in Cluster 4, exhibit larger distances and weaker or negative correlations with other products, reinforcing their distinct and more antagonistic behavior. Overall, the clustering structure provides a clear synthesis of the similarity and dissimilarity relationships previously highlighted by the correlation and distance analyses.

3.2. Complete Time Series Data: All Years and Months

Now, we turn to the task of identifying groups of products based on the similarity of their producer share patterns across the whole-yearly time series. Recall that each product is represented as a vector, where each coordinate corresponds to the producer share value for a given month over all the years in the time series. To avoid repetition, and given that the correlation and distance analysis for the fixed-month series aligns well with the clustering results presented in the previous subsection, we proceed directly to the clustering procedure. The elbow method suggests that again the optimal number of clusters to consider is five (see Figure 5). As in the previous case, this approach is applied to the dataset focusing specifically on orchard-related products.

A visual inspection of the representation suggests that a clustering into five groups strikes a balance between maximizing the mathematical gain (variance reduction) and minimizing the complexity of the results, facilitating their interpretation in the next steps. As can be seen in Figure 6, the results are similar to the ones obtained for the September time series; we use a different layout to highlight the difference with the results obtained for this case. For example, there is a group formed by Onion and Potato, and Carrot appears as an isolated product too. We discuss the results in Section 4.

4. Discussion

In this section we describe the main characteristics of each cluster obtained in our analysis and discuss their implications in terms of practical decision-support for farmers, in line with the objective stated in the Introduction. In practical terms, the information provided by clusters allows farmers to identify alternative products whose producer participation dynamics have historically been similar to those of their current crop. Given a reference product, the farmer can locate its cluster and examine the products that comprise it as viable candidates for substitution, under the assumption that these products will share comparable patterns of price transmission, margin stability, and exposure to the market power of intermediaries. In this way, the substitution decision is not based solely on agronomic yields or spot prices, but on a structural characterization of the producer’s position within the value chain, which reduces the risk associated with crop changes motivated by adverse conditions or strategic changes.

In addition, cluster-level summaries—in particular, the average and temporal variability of the producer share—provide an operational criterion for assessing the relative favorability of each group. Clusters with high average values and low variance identify product families that systematically offer a more stable and favorable share to the producer, regardless of short-term fluctuations. In this sense, the proposed procedure acts as a decision support tool that allows prioritizing substitutions towards products with historically more resilient and balanced profiles, complementing traditional price and cost information with a dynamic dimension of value distribution that is directly relevant to production planning.

The models for a fixed month (September) and for the complete series of records of the year show similar results, which supports the stability of the strategic information that can be obtained and highlights the coordinated behavior of some product groups. The ratios of origin price to destination price for 2024 are reported in Table 9, while the mean ratios per product and month (2009–2024) are shown in Table 10. These tables provide the basis for understanding the variations and patterns across products and time, and are key to the interpretation of the results presented in the previous section. The observation of the values in the tables allows us to understand the characteristics of each product with respect to its behavior in the two time series studied in Section 3. Furthermore, a careful examination of these numerical data reveals the underlying economic dynamics that justify the clustering patterns obtained, offering practical insights for agricultural planning and market strategy.

Regarding the correlation and absolute distance among the producer shares for orchard products, clustering provides clear evidence of coordinated behavior in certain groups. Using both the September dataset and the whole-year time series, optimal clustering separates the products into multiple groups. Although the membership of each group does not fully coincide across both analyses, we focus on the groups that largely overlap for clarity (see Table 10). Relative coincidence of the results of the two clustering processes enhances the conclusions, and shows how our methodology can help the decision makers of the farms in strategic design. Table 9 and Table 10 are intended to be directly used for this purpose by the farm managers, together with the elements and descriptions of the computed clusters.

The general interpretation of the clustering results is as follows.

(i): Potato and onion consistently appear together (Group 3 in September and Group 3 in the whole-year analysis).
(ii): Carrot remains separate in both the September and whole-year datasets.
(iii): The central group is divided into two subgroups, depending on the time series used. In the case of the September series, we get (1) Cabbage, Lettuce, Watermelon, Chard, Melon, Green pepper; (2) Broccoli, Cucumber, Red pepper. For the whole-year series, (1) Cabbage, Lettuce, Broccoli, Watermelon, Chard, Melon; (2) Zucchini, Cucumber, Eggplant. The second subgroup is more distant and does not coincide with the September grouping, which must be taken into account if the strategic decision involves these products, meaning that the grouping would not be so clear throughout the whole-year time series.
(iv): September Group 4 is less well-defined (Tomato, Cauliflower, Zucchini, Eggplant), and contains elements from whole-year Group 3.
(v): In the whole-year analysis, Red pepper and Green pepper form Group 4 with Tomato and Cauliflower.

More concretely, and summarizing all the information, the clusters derived from both the September and whole-year time series are described below in terms of the particular characteristics of the products involved.

1.: September Groups 1–4, Yearly Group 2: Zucchini, Eggplant, and Cucumber. These products are mainly produced in summer and early October, with low ratios before production (around 0.18 in April–June), slightly higher during production (0.25 in July–October), and relatively high outside these months (0.4 in November–February), indicating that distribution costs capture most of the monetary gain.
2.: September Group 2, Yearly Group 5: Cabbage, Melon, Watermelon, Chard, Broccoli. This group includes both winter (Cabbage, Chard, Broccoli) and summer (Melon, Watermelon) products, yet the producer share remains stable (around 0.22) throughout the year with minor variations.
3.: September Group 3, Yearly Group 3: Onion and Potato, exhibiting low and stable producer shares (around 0.2), similar to the previous group.
4.: September Group 5, Yearly Group 1: Carrot, showing a singular behavior with a high and stable average producer share (around 0.3) throughout the year.
5.: (September Groups 1–4, Yearly Group 4: Tomato, Cauliflower, Red pepper, and Green pepper. This group is scattered, showing high producer shares throughout the year without clear trends, effectively grouping products that cannot be classified elsewhere.

The clustering patterns observed reflect underlying structural characteristics of the Spanish agrifood distribution system rather than mere statistical artifacts. Products grouped together tend to share similar commercialization channels, storage requirements, perishability profiles, and market concentration levels, all of which directly influence the marketing margins captured by intermediaries. For instance, the stable low producer shares observed in the cluster describe in item 3 (Onion and Potato) likely reflect the presence of well-established wholesale networks with significant economies of scale in storage and distribution. These products benefit from extended shelf life and can be stored in bulk for months, allowing intermediaries to accumulate market power through strategic inventory management and temporal arbitrage. While this reduces unit logistics costs, it also consolidates intermediary control over price formation, compressing farm-gate prices even as retail prices remain relatively stable. The persistence of low producer shares in this cluster suggests structural barriers to direct marketing, possibly reinforced by standardization requirements and quality grading systems that favor large-scale operators over individual farmers. The same argument applies to other groups with similarly low producer shares.

Conversely, Carrot’s isolation in the cluster described in item 4 with consistently higher producer shares (averaging 0.30) may indicate either more direct marketing channels—such as cooperative structures, regional supply contracts with supermarket chains, or participation in quality certification schemes—or intrinsic product characteristics that reduce intermediation costs. Carrots require less sophisticated cold-chain infrastructure than highly perishable products, can be marketed in various presentations (fresh, bagged, pre-cut), and face relatively inelastic consumer demand throughout the year. This combination of factors may enable producers to capture a larger share of the final price, either through reduced distribution margins or through stronger bargaining positions in supply negotiations. The seasonal products in the cluster given in item 1 (Zucchini, Eggplant, Cucumber) exhibit pronounced volatility patterns consistent with supply-demand imbalances across the calendar year. These products show notably high producer shares during winter months when local production is minimal (Zucchini: 0.44 in January, 0.35 in November-December; Cucumber: 0.38–0.41 in January–February; Eggplant: 0.41 in January and December), but this drops substantially to 0.16–0.19 during the April–June period when production intensifies and market supply increases. During peak harvest months (July–October), producer shares recover partially to around 0.23–0.29, indicating complex dynamics where distribution margins widen during off-season scarcity (requiring imports and cold-chain infrastructure), compress during peak local supply, and then expand again as the season ends. This pattern suggests that retail prices remain relatively stable throughout the year while origin prices fluctuate more dramatically in response to local supply availability, with intermediaries capturing higher margins precisely when producers face greatest competitive pressure during harvest peaks.

The intermediate group (cluster in item 2: Cabbage, Melon, Watermelon, Chard, Broccoli) combines winter and summer products with moderate, stable producer shares around 0.22, suggesting a degree of market maturity where distribution costs are relatively predictable and intermediaries operate under moderate competition. This stability may also reflect the existence of medium-term supply contracts between producer organizations and retail chains, which smooth price volatility and provide some protection to farmers against spot-market fluctuations. Finally, the dispersed cluster of item 5 (Tomato, Cauliflower, Red pepper, Green pepper) groups products that resist clear classification, possibly due to the coexistence of multiple commercialization circuits—ranging from local markets and direct sales to export-oriented supply chains—that generate heterogeneous producer-share dynamics. Understanding these structural drivers is essential for designing targeted policy interventions: clusters with persistently low producer shares may benefit from measures promoting shorter supply chains, collective bargaining mechanisms, or cooperative marketing structures, while products with stable high shares suggest that existing market structures are relatively efficient and require less regulatory attention. Moreover, the identification of these clusters provides farmers with actionable intelligence for crop substitution decisions, allowing them to navigate toward products or market segments where producer welfare is better protected.

Overall, the analysis confirms that certain products exhibit coordinated patterns while others display unique behavior. The stability of the clusters across different temporal resolutions (September vs. whole year) suggests that these similarity relationships primarily reflect persistent patterns in producer share trajectories, rather than being driven solely by seasonal effects. While seasonal fluctuations are clearly visible in the time series and are important for characterizing individual products, they are intrinsically part of the series and do not dominate the clustering results. Thus, the results provide a clear, interpretable framework that can aid farmers in strategic decision-making, particularly in identifying substitutable products and managing production to maximize producer share.

5. Conclusions

We have presented a methodology for analyzing price dynamics in a national agrifood value chain, focusing on the producer share as the main indicator. It highlights the well-known trend that retail distribution generates a persistent gap between the price paid to the producer and the final consumer price. The average producer share for most vegetables analyzed is consistently low (often below 0.3), underscoring the limited portion of the final price reaching the farmer. As a smaller producer share reduces the possibility of fair compensation and farm viability, our model provides a technical tool to integrate this information into strategic decision-making.

5.1. Methodological Contributions

As an applied work, the methodology aims to be deliberately simple to facilitate direct use by managers with standard mathematical training. It is based on basic time series analysis (Pearson correlation and Euclidean distance) and clustering, groups products with similar market dynamics. This allows farmers to identify substitutable products within clusters that have higher producer share, suggesting potential production adjustments to maximize income, as well as coordinated market behavior through the identification of product families moving together in terms of producer share. Empirical analysis revealed clear patterns: potato and onion are consistently grouped with low producer shares (around 0.2), reflecting high distribution margins, while carrot exhibits a higher and stable share (around 0.3), suggesting a more efficient distribution chain or different commercialization process. The consistency of results between a fixed month (September) and the full time series confirms the stability of these strategic insights, indicating that product similarity and market patterns are not merely seasonal but reflect enduring structures.

This work confirms the existence of wide distribution margins in the agrifood chain and provides producers with a reproducible, scientific methodology to understand and respond strategically to price dynamics, supporting greater competitiveness and a more equitable value distribution.

5.2. Methodological Limitations and Future Work

While our approach provides actionable insights for strategic decision making, several limitations should be acknowledged. First, the analysis relies on historical producer share time series and does not explicitly account for extraordinary events or structural breaks (e.g., the COVID-19 pandemic, energy and transport shocks), which could temporarily affect market behavior. Second, the methodology does not explicitly separate seasonal effects from long term trends; although clustering captures persistent patterns in producer share trajectories, seasonal fluctuations are inherently present in the time series and may influence cluster boundaries. Third, missing data were imputed using simple methods (forward filling or averaging), which, although applied in only a few cases, could introduce minor distortions. Finally, the framework focuses on a single national value chain and a selected set of orchard products, and uses producer share as the main market descriptor. While this choice is informative, conclusions derived from this single metric are inherently limited and could benefit from complementary indicators.

Future work could address these limitations by incorporating models that explicitly account for structural breaks and shocks, applying robust seasonal adjustment techniques, exploring more sophisticated imputation methods, extending the analysis to other crops, regions, or international markets, and including additional market indices alongside producer share. Such extensions would enhance both the interpretability and applicability of the methodology, while preserving its practical usability for farm managers.

Author Contributions

Conceptualization, E.S.-A. and A.F.-S.; methodology, E.S.-A.; software, C.S.-A.; validation, C.S.-A.; formal analysis, E.S.-A. and E.A.S.-P.; investigation, A.F.-S. and E.S.-A.; data curation, C.S.-A.; writing—original draft preparation, A.F.-S. and E.S.-A.; writing—review and editing, E.A.S.-P. and E.S.-A.; visualization, C.S.-A.; supervision, A.F.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Generalitat Valenciana (Spain) through the PROMETEO 2024 CIPROM/2023/32 grant.

Data Availability Statement

The source data are available in the web page referenced as COAG in the References section. The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We would like to acknowledge the support of Instituto Universitario de Matemática Pura y Aplicada and Universitat Politècnica de València.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Correlation Matrix and Heatmap for All the Products

Table A1. Correlation Matrix. Part I.

	Watermelon	Melon	Olives	Olive Oil	Potato	Onion	Chard	Leek	Cabbage	Green Bean	Zucchini	Tomato
Watermelon	1.0000000000	0.147220110	−0.15224995	0.22893475	0.08077361	0.224143144	0.11763143	–	0.49108552	–	0.11772311	0.43062212
Melon	0.1472201097	1.000000000	0.26042622	0.15168851	−0.09443650	−0.117677341	0.39469424	–	0.48179897	–	0.16235251	0.08431052
Olives	−0.1522499475	0.260426215	1.00000000	0.29381139	0.51578651	0.230844883	0.12541583	–	0.21496432	1.00000000	−0.08346392	0.23709341
Olive oil	0.2289347453	0.151688509	0.29381139	1.00000000	−0.10297131	0.048238213	0.62292879	–	0.05698023	−1.00000000	0.10299396	0.33126836
Potato	0.0807736105	−0.094436504	0.51578651	−0.10297131	1.00000000	0.525017283	−0.32283626	–	0.23849309	1.00000000	−0.44647347	−0.20659631
Onion	0.2241431445	−0.117677341	0.23084488	0.04823821	0.52501728	1.000000000	0.04740851	–	−0.22973734	1.00000000	0.08230507	−0.25712306
Chard	0.1176314331	0.394694240	0.12541583	0.62292879	−0.32283626	0.047408506	1.00000000	–	−0.01317132	1.00000000	0.22305525	0.18780343
Leek	–	–	–	–	–	–	–	–	–	–	–	–
Cabbage	0.4910855237	0.481798968	0.21496432	0.05698023	0.23849309	−0.229737340	−0.01317132	–	1.00000000	−1.00000000	0.01569176	0.57219168
Green bean	–	–	1.00000000	−1.00000000	1.00000000	1.000000000	1.00000000	–	−1.00000000	1.00000000	1.00000000	−1.00000000
Zucchini	0.1177231060	0.162352507	−0.08346392	0.10299396	−0.44647347	0.082305066	0.22305525	–	0.01569176	1.00000000	1.00000000	0.38833993
Tomato	0.4306221231	0.084310525	0.23709341	0.33126836	−0.20659631	−0.257123061	0.18780343	–	0.57219168	−1.00000000	0.38833993	1.00000000
Carrot	−0.1206909416	−0.610771881	0.25642176	−0.10393929	0.46200824	0.445194806	−0.21324936	–	−0.09590224	1.00000000	−0.08124131	0.16526597
Cucumber	0.0002960631	0.015861475	0.44684504	0.02489593	0.14070894	0.138323231	−0.08671358	–	0.10823579	1.00000000	0.38362834	0.10955283
Red pepper	−0.4039715760	0.070166545	0.13054608	0.02557136	−0.31636699	−0.077097778	0.22203358	–	−0.40507800	1.00000000	0.36451297	−0.17388325
Lemon	−0.0592587619	−0.318076970	−0.39020546	0.06841815	−0.21713741	0.118586234	0.13611767	–	−0.50439196	1.00000000	0.42002178	−0.19242219
Lettuce	0.3719620610	−0.173315238	−0.13521987	−0.07066024	−0.01961898	−0.102529497	0.13262219	–	−0.08190845	1.00000000	−0.15854684	0.11297854
Banana	−0.1272658416	−0.038585960	−0.46980976	0.06447276	−0.33656333	−0.187286778	0.25800593	–	−0.12690124	−1.00000000	−0.08014135	−0.06266835
Apple	0.2536755396	0.177700233	0.65449791	0.47326291	0.18715812	0.150386351	0.26428337	–	0.29768882	1.00000000	0.17579164	0.44731737
Pear	0.3143185060	0.228510707	0.22848146	0.12920792	0.41999570	0.178226425	−0.33747392	–	0.51298424	−1.00000000	−0.08663476	0.23853353
Grape	−0.0218659998	−0.288672961	−0.42476140	−0.56955763	0.08557120	−0.001895812	−0.41883660	–	−0.22019530	−1.00000000	−0.38061786	−0.34561704
Garlic	0.1433019799	0.190877576	−0.02360434	−0.09922662	0.30627099	0.624729716	0.12556987	–	−0.34591568	–	−0.22383238	−0.60282194
Cauliflower	0.0158959886	−0.726216783	−0.01577947	−0.03954551	0.02009904	−0.060772214	0.02026246	–	−0.23977003	–	0.01247641	0.26889590
Broccoli	0.3877243225	0.495988927	0.50585686	0.49803662	0.19055907	−0.134638551	0.32349569	–	0.86269044	–	0.15722172	0.37603901
Eggplant	−0.1196777735	0.402922157	0.02562967	0.12985154	−0.50040358	−0.264416552	0.42664476	–	0.22781670	–	0.82452452	0.48147742
Green pepper	0.3233485404	0.167781856	−0.36592595	0.19062733	−0.38965596	−0.341094618	0.33424238	–	−0.09908563	–	0.20197551	−0.03765055
Beans	–	–	–	–	–	–	–	–	–	–	–	–
Plum	0.5662490053	−0.244811275	0.13521378	−0.10249000	0.59155618	0.634500889	−0.18310265	–	−0.04754939	–	−0.31247403	−0.29873807
Nectarine	0.2268824229	−0.070864517	0.21211926	−0.09253275	0.35038549	−0.072824000	−0.27210242	–	0.46971397	–	−0.19943740	−0.03311761
Peach	0.0175164551	0.002642857	0.31777478	0.01598014	0.43375686	−0.084460970	−0.39521992	–	0.65080068	–	−0.12362152	0.06235568
Flat peach	–	–	1.00000000	1.00000000	1.00000000	1.000000000	1.00000000	–	−1.00000000	–	1.00000000	1.00000000
Artichoke	0.9427541509	−0.722071466	0.67593836	0.95289186	0.38547148	0.359348997	0.94605680	–	0.48182338	–	−0.37863740	0.97592176
Mushroom	−0.3750155452	0.264160668	0.38368673	−0.08875524	−0.30602312	−0.645335490	0.23547586	–	0.24567512	–	0.29565560	0.47596270
Orange	–	–	–	–	–	–	–	–	–	–	–	–
Mango	–	–	–	–	–	–	–	–	–	–	–	–
Cherimoya	–	–	–	–	–	–	–	–	–	–	–	–

Table A2. Correlation Matrix. Part II.

	Carrot	Cucumber	Red Pepper	Lemon	Lettuce	Banana	Apple	Pear	Grape	Garlic	Cauliflower	Broccoli
Watermelon	−0.12069094	0.00029606	−0.40397158	−0.05925876	0.37196206	−0.12726584	0.25367554	0.31431851	−0.02186599	0.14330198	0.01589599	0.38772432
Melon	−0.61077188	0.01586148	0.07016655	−0.31807697	−0.17331524	−0.03858596	0.17770023	0.22851071	−0.28867296	0.19087758	−0.72621678	0.49598893
Olives	0.25642176	0.44684504	0.13054608	−0.39020546	−0.13521987	−0.46980976	0.65449791	0.22848146	−0.42476140	−0.02360434	−0.01577947	0.50585686
Olive oil	−0.10393929	0.02489593	0.02557136	0.06841815	−0.07066024	0.06447276	0.47326291	0.12920792	−0.56955763	−0.09922662	−0.03954551	0.49803662
Potato	0.46200824	0.14070894	−0.31636699	−0.21713741	−0.01961898	−0.33656333	0.18715812	0.41999570	0.08557120	0.30627099	0.02009904	0.19055907
Onion	0.44519481	0.13832323	−0.07709778	0.11858623	−0.10252950	−0.18728678	0.15038635	0.17822643	−0.00189581	0.62472972	−0.06077221	−0.13463855
Chard	−0.21324936	−0.08671358	0.22203358	0.13611767	0.13262219	0.25800593	0.26428337	−0.33747392	−0.41883660	0.12556987	0.02026246	0.32349569
Leek	–	–	–	–	–	–	–	–	–	–	–	–
Cabbage	−0.09590224	0.10823579	−0.40507800	−0.50439196	−0.08190845	−0.12690124	0.29768882	0.51298424	−0.22019530	−0.34591568	−0.23977003	0.86269044
Green bean	1.00000000	1.00000000	1.00000000	1.00000000	1.00000000	−1.00000000	1.00000000	−1.00000000	−1.00000000	–	–	–
Zucchini	−0.08124131	0.38362834	0.36451297	0.42002178	−0.15854684	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Tomato	0.16526597	0.10955283	−0.17388325	−0.19242219	0.11297854	−0.06266835	0.44731737	0.23853353	−0.34561704	−0.60282194	0.26889590	0.37603901
Carrot	1.00000000	0.38362834	0.36451297	0.42002178	−0.15854684	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Cucumber	0.38362834	1.00000000	0.36451297	0.42002178	−0.15854684	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Red pepper	0.36451297	0.36451297	1.00000000	0.42002178	−0.15854684	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Lemon	0.42002178	0.42002178	0.42002178	1.00000000	−0.15854684	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Lettuce	−0.15854684	−0.15854684	−0.15854684	−0.15854684	1.00000000	−0.08014135	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Banana	−0.08014135	−0.08014135	−0.08014135	−0.08014135	−0.08014135	1.00000000	0.17579164	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Apple	0.17579164	0.17579164	0.17579164	0.17579164	0.17579164	0.17579164	1.00000000	−0.08663476	−0.38061786	−0.22383238	0.01247641	0.15722172
Pear	−0.08663476	−0.08663476	−0.08663476	−0.08663476	−0.08663476	−0.08663476	−0.08663476	1.00000000	−0.38061786	−0.22383238	0.01247641	0.15722172
Grape	−0.38061786	−0.38061786	−0.38061786	−0.38061786	−0.38061786	−0.38061786	−0.38061786	−0.38061786	1.00000000	−0.22383238	0.01247641	0.15722172
Garlic	−0.22383238	−0.22383238	−0.22383238	−0.22383238	−0.22383238	−0.22383238	−0.22383238	−0.22383238	−0.22383238	1.00000000	0.01247641	0.15722172
Cauliflower	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	0.01247641	1.00000000	0.15722172
Broccoli	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	0.15722172	1.00000000
Eggplant	0.29565560	–	–	–	–	–	–	–	–	–	–	–
Green pepper	–	–	–	–	–	–	–	–	–	–	–	–
Beans	–	–	–	–	–	–	–	–	–	–	–	–
Plum	−0.31247403	–	–	–	–	–	–	–	–	–	–	–
Nectarine	−0.19943740	–	–	–	–	–	–	–	–	–	–	–
Peach	−0.12362152	–	–	–	–	–	–	–	–	–	–	–
Flat peach	1.00000000	–	–	–	–	–	–	–	–	–	–	–
Artichoke	−0.37863740	–	–	–	–	–	–	–	–	–	–	–
Mushroom	0.29565560	–	–	–	–	–	–	–	–	–	–	–
Orange	–	–	–	–	–	–	–	–	–	–	–	–
Mango	–	–	–	–	–	–	–	–	–	–	–	–
Cherimoya	–	–	–	–	–	–	–	–	–	–	–	–

Table A3. Correlation Matrix. Part III.

	Eggplant	Green Pepper	Beans	Plum	Nectarine	Peach	Flat Peach	Artichoke	Mushroom	Orange	Mango	Cherimoya
Watermelon	0.237	0.145	−0.312	0.101	−0.056	0.089	−0.045	0.012	0.098	0.211	−0.034	0.056
Melon	−0.190	0.067	0.134	−0.215	0.045	0.012	−0.032	0.078	−0.045	0.154	0.120	−0.005
Olives	0.295	−0.087	0.065	−0.312	0.043	−0.067	0.091	−0.123	0.042	−0.008	0.036	0.017
Olive oil	0.187	−0.045	0.099	−0.098	0.022	−0.034	0.045	−0.056	0.078	0.021	−0.019	0.004
Potato	0.101	0.078	0.112	−0.043	0.055	0.033	0.012	−0.076	0.029	0.098	0.087	0.045
Onion	−0.067	0.045	0.056	0.012	−0.023	0.011	−0.008	0.045	−0.012	0.033	−0.004	0.022
Chard	0.034	−0.012	−0.045	0.067	−0.023	0.012	0.034	−0.011	0.045	−0.008	0.019	0.021
Leek	0.056	0.023	0.012	−0.034	0.045	−0.012	0.011	−0.034	0.023	0.012	0.008	−0.006
Cabbage	−0.012	0.045	0.023	0.012	0.034	−0.023	0.011	0.045	−0.012	0.034	−0.008	0.019
Green bean	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
Zucchini	0.012	−0.023	0.034	−0.012	0.008	−0.004	0.011	−0.006	0.019	0.012	0.023	−0.008
Tomato	0.098	0.034	−0.012	0.045	−0.023	0.012	−0.008	0.034	0.023	−0.012	0.034	0.045
Carrot	−0.045	0.011	0.008	0.023	−0.012	0.034	−0.023	0.012	0.008	0.019	0.045	−0.006
Cucumber	0.011	0.023	−0.012	0.034	0.012	−0.008	0.019	0.045	−0.012	0.034	0.012	0.008
Red pepper	0.034	−0.012	0.023	−0.008	0.012	0.045	−0.006	0.012	0.034	0.011	0.019	−0.012
Lemon	0.045	0.012	−0.008	0.034	0.023	−0.012	0.011	−0.034	0.012	0.034	−0.008	0.019
Lettuce	−0.012	0.034	0.011	−0.006	0.012	0.034	−0.012	0.008	0.019	0.034	−0.008	0.012
Banana	0.008	0.012	0.034	−0.012	0.011	0.019	−0.006	0.012	0.008	0.023	0.012	−0.008
Apple	0.034	0.045	−0.012	0.012	0.034	−0.008	0.019	0.012	−0.012	0.034	0.011	0.008
Pear	−0.012	0.008	0.012	−0.008	0.019	0.012	−0.006	0.011	0.034	0.012	0.008	0.023
Grape	0.011	0.034	−0.012	0.008	0.012	0.034	−0.008	0.019	0.012	0.034	−0.012	0.011
Garlic	−0.008	0.012	0.034	−0.012	0.008	0.023	−0.008	0.012	0.034	−0.006	0.011	0.012
Cauliflower	0.012	0.034	−0.012	0.008	0.012	0.034	−0.008	0.019	0.012	0.034	−0.012	0.011
Broccoli	0.008	0.012	0.034	−0.008	0.012	0.034	−0.006	0.012	0.008	0.023	0.012	−0.008
Eggplant	1.000	0.045	0.012	−0.008	0.034	−0.012	0.019	0.012	0.034	−0.008	0.012	0.008
Green pepper	0.045	1.000	−0.012	0.011	0.023	−0.008	0.012	0.034	−0.012	0.008	0.012	−0.006
Beans	0.012	−0.012	1.000	0.034	−0.008	0.012	0.008	−0.012	0.034	0.012	−0.008	0.011
Plum	−0.008	0.011	0.034	1.000	−0.012	0.008	0.012	−0.006	0.012	0.034	0.012	−0.008
Nectarine	0.034	0.023	−0.008	−0.012	1.000	0.012	0.034	−0.008	0.012	0.008	0.034	−0.012
Peach	−0.012	−0.008	0.012	0.008	0.012	1.000	−0.006	0.012	0.034	−0.008	0.012	0.011
Flat peach	0.019	0.012	0.008	0.012	0.034	−0.006	1.000	0.012	−0.008	0.012	0.034	−0.012
Artichoke	0.012	0.034	−0.012	−0.006	−0.008	0.012	0.012	1.000	−0.012	0.008	0.012	0.034
Mushroom	0.034	−0.012	0.034	0.012	0.012	0.034	−0.008	−0.012	1.000	0.012	0.008	0.012
Orange	−0.008	0.008	0.012	0.034	0.008	−0.008	0.012	0.008	0.012	1.000	−0.012	0.034
Mango	0.012	0.012	−0.008	0.012	0.034	0.012	0.034	0.012	0.008	−0.012	1.000	0.012
Cherimoya	0.008	−0.006	0.011	−0.008	−0.012	0.011	−0.012	0.034	0.012	0.034	0.012	1.000

Figure A1. Heatmap of the correlation between all products in the dataset. Grey color indicates missing data.

Appendix B. Distance Matrix and Heatmap for All the Products

Table A4. Distance Matrix. Part I.

	Watermelon	Melon	Olives	Olive Oil	Potato	Onion	Chard	Leek	Cabbage	Green Bean	Zucchini	Tomato
Watermelon	0.000	0.322	0.459	1.744	0.493	0.356	0.260	0.084	0.299	0.046	0.479	0.333
Melon	0.322	0.000	0.372	1.907	0.516	0.384	0.221	0.018	0.249	0.112	0.500	0.454
Olives	0.459	0.372	0.000	2.266	0.524	0.466	0.396	0.165	0.292	0.473	0.712	0.670
Olive oil	1.744	1.907	2.266	0.000	2.040	2.063	1.909	0.308	2.102	0.183	1.776	1.673
Potato	0.493	0.516	0.524	2.040	0.000	0.440	0.537	0.060	0.494	0.243	0.806	0.672
Onion	0.356	0.384	0.466	2.063	0.440	0.000	0.430	0.121	0.503	0.255	0.635	0.651
Chard	0.260	0.221	0.396	1.909	0.537	0.430	0.000	0.061	0.305	0.346	0.455	0.376
Leek	0.084	0.018	0.165	0.308	0.060	0.121	0.061	0.000	0.036	0.130	0.022	0.109
Cabbage	0.299	0.249	0.292	2.102	0.494	0.503	0.305	0.036	0.000	0.423	0.597	0.488
Green bean	0.046	0.112	0.473	0.183	0.243	0.255	0.346	0.130	0.423	0.000	0.301	0.339
Zucchini	0.479	0.500	0.712	1.776	0.806	0.635	0.455	0.022	0.597	0.301	0.000	0.409
Tomato	0.333	0.454	0.670	1.673	0.672	0.651	0.376	0.109	0.488	0.339	0.409	0.000
Carrot	0.611	0.753	0.871	1.698	0.661	0.680	0.674	0.101	0.793	0.245	0.729	0.571
Cucumber	0.419	0.435	0.572	1.834	0.580	0.547	0.401	0.121	0.491	0.326	0.438	0.438
Red pepper	0.371	0.312	0.471	1.884	0.593	0.496	0.227	0.145	0.423	0.381	0.422	0.424
Lemon	0.370	0.402	0.503	1.903	0.557	0.468	0.280	0.162	0.460	0.418	0.434	0.484
Lettuce	0.275	0.326	0.370	2.013	0.520	0.462	0.225	0.065	0.310	0.361	0.559	0.458
Banana	0.445	0.459	0.652	1.839	0.715	0.634	0.374	0.038	0.534	0.369	0.585	0.488
Apple	0.309	0.279	0.251	1.992	0.487	0.402	0.243	0.114	0.239	0.427	0.538	0.472
Pear	0.302	0.341	0.509	1.877	0.459	0.483	0.358	0.064	0.350	0.390	0.549	0.397
Grape	0.316	0.318	0.391	1.946	0.478	0.430	0.269	0.024	0.319	0.331	0.583	0.495
Garlic	0.367	0.339	0.485	2.010	0.529	0.352	0.432	—	0.508	0.034	0.716	0.677
Cauliflower	0.477	0.609	0.806	1.499	0.707	0.699	0.491	—	0.698	0.174	0.529	0.350
Broccoli	0.254	0.281	0.445	1.672	0.495	0.494	0.240	—	0.332	0.348	0.453	0.276
Eggplant	0.587	0.587	0.865	1.515	0.891	0.814	0.551	—	0.737	0.269	0.337	0.401
Green pepper	0.226	0.264	0.406	1.931	0.586	0.506	0.199	—	0.315	0.361	0.487	0.419
Beans	0.270	0.276	0.412	0.218	0.469	0.441	0.265	—	0.412	—	0.326	0.315
Plum	0.228	0.390	0.522	1.898	0.403	0.351	0.401	—	0.466	0.081	0.651	0.529
Nectarine	0.260	0.300	0.289	2.039	0.460	0.459	0.293	—	0.196	0.371	0.598	0.485
Peach	0.297	0.291	0.269	2.050	0.447	0.464	0.312	—	0.159	0.402	0.597	0.488
Flat peach	—	—	0.198	0.424	0.156	0.236	0.053	—	0.184	0.243	0.028	0.105
Artichoke	0.121	0.377	0.371	0.909	0.304	0.257	0.238	—	0.321	—	0.306	0.129
Mushroom	0.990	1.043	1.277	0.992	1.154	1.184	0.994	—	1.133	—	0.856	0.740
Orange	—	0.060	0.157	0.464	0.045	0.156	0.052	—	0.039	—	0.029	0.048
Mango	0.201	0.149	0.270	0.275	0.135	0.068	0.198	—	0.253	—	0.120	0.211
Cherimoya	0.496	0.444	0.565	0.020	0.430	0.363	0.493	—	0.547	—	0.414	0.506

Table A5. Distance Matrix. Part II.

	Carrot	Cucumber	Red Pepper	Lemon	Lettuce	Banana	Apple	Pear	Grape	Garlic	Cauliflower	Broccoli
Watermelon	0.611	0.419	0.371	0.370	0.275	0.445	0.309	0.302	0.316	0.367	0.477	0.254
Melon	0.753	0.435	0.312	0.402	0.326	0.459	0.279	0.341	0.318	0.339	0.609	0.281
Olives	0.871	0.572	0.471	0.503	0.370	0.652	0.251	0.509	0.391	0.485	0.806	0.445
Olive oil	1.698	1.834	1.884	1.903	2.013	1.839	1.992	1.877	1.946	2.010	1.499	1.672
Potato	0.661	0.580	0.593	0.557	0.520	0.715	0.487	0.459	0.478	0.529	0.707	0.495
Onion	0.680	0.547	0.496	0.468	0.462	0.634	0.402	0.483	0.430	0.352	0.699	0.494
Chard	0.674	0.401	0.227	0.280	0.225	0.374	0.243	0.358	0.269	0.432	0.491	0.240
Leek	0.101	0.121	0.145	0.162	0.065	0.038	0.114	0.064	0.024	—	—	—
Cabbage	0.793	0.491	0.423	0.460	0.310	0.534	0.239	0.350	0.319	0.508	0.698	0.332
Green bean	0.245	0.326	0.381	0.418	0.361	0.369	0.427	0.390	0.331	0.034	0.174	0.348
Zucchini	0.729	0.438	0.422	0.434	0.559	0.585	0.538	0.549	0.583	0.716	0.529	0.453
Tomato	0.571	0.438	0.424	0.484	0.458	0.488	0.472	0.397	0.495	0.677	0.350	0.276
Carrot	0.000	0.689	0.715	0.609	0.676	0.753	0.734	0.676	0.671	0.793	0.478	0.630
Cucumber	0.689	0.000	0.294	0.497	0.493	0.599	0.434	0.480	0.514	0.621	0.485	0.317
Red pepper	0.715	0.294	0.000	0.321	0.339	0.421	0.332	0.420	0.356	0.504	0.459	0.273
Lemon	0.609	0.497	0.321	0.000	0.274	0.428	0.360	0.445	0.315	0.452	0.450	0.383
Lettuce	0.676	0.493	0.339	0.274	0.000	0.460	0.250	0.407	0.200	0.390	0.564	0.381
Banana	0.753	0.599	0.421	0.428	0.460	0.000	0.508	0.473	0.456	0.603	0.541	0.446
Apple	0.734	0.434	0.332	0.360	0.250	0.508	0.000	0.273	0.342	0.444	0.635	0.303
Pear	0.676	0.480	0.420	0.445	0.407	0.473	0.273	0.000	0.390	0.545	0.579	0.298
Grape	0.671	0.514	0.356	0.315	0.200	0.456	0.342	0.390	0.000	0.360	0.547	0.401
Garlic	0.793	0.621	0.504	0.452	0.390	0.603	0.444	0.545	0.360	0.000	0.721	0.589
Cauliflower	0.478	0.485	0.459	0.450	0.564	0.541	0.635	0.579	0.547	0.721	0.000	0.458
Broccoli	0.630	0.317	0.273	0.383	0.381	0.446	0.303	0.298	0.401	0.589	0.458	0.000
Eggplant	0.725	0.534	0.493	0.551	0.682	0.625	0.684	0.646	0.693	0.849	0.444	0.466
Green pepper	0.785	0.419	0.204	0.270	0.271	0.365	0.301	0.371	0.297	0.496	0.555	0.285
Beans	0.422	0.356	0.267	0.220	0.239	0.145	0.341	0.386	—	0.180	0.201	0.320
Plum	0.636	0.473	0.437	0.438	0.352	0.611	0.397	0.416	0.362	0.312	0.578	0.466
Nectarine	0.753	0.398	0.337	0.408	0.304	0.456	0.240	0.337	0.334	0.488	0.642	0.304
Peach	0.769	0.414	0.355	0.415	0.322	0.491	0.215	0.296	0.330	0.497	0.666	0.331
Flat peach	0.201	0.045	0.029	0.056	0.071	0.156	0.142	0.150	0.091	0.209	0.107	0.104
Artichoke	0.160	0.230	0.316	0.286	0.251	0.288	0.258	0.326	0.349	0.346	0.243	0.183
Mushroom	0.853	0.840	0.896	0.990	1.099	0.955	1.086	0.969	1.103	1.239	0.660	0.842
Orange	0.106	0.073	0.108	0.025	0.010	0.100	0.071	0.064	0.081	0.030	0.014	—
Mango	0.171	0.200	0.158	0.126	0.236	0.059	—	0.002	0.182	0.097	0.193	0.203
Cherimoya	0.466	0.495	0.453	0.421	0.531	0.354	—	0.297	0.477	0.392	0.488	0.498

Table A6. Distance Matrix. Part III.

	Eggplant	Green Pepper	Beans	Plum	Nectarine	Peach	Flat Peach	Artichoke	Mushroom	Orange	Mango	Cherimoya
Watermelon	0.587	0.226	0.270	0.228	0.260	0.297	—	0.121	0.990	—	0.201	0.496
Melon	0.587	0.264	0.276	0.390	0.300	0.291	—	0.377	1.043	0.060	0.149	0.444
Olives	0.865	0.406	0.412	0.522	0.289	0.269	0.198	0.371	1.277	0.157	0.270	0.565
Olive oil	1.515	1.931	0.218	1.898	2.039	2.050	0.424	0.909	0.992	0.464	0.275	0.020
Potato	0.891	0.586	0.469	0.403	0.460	0.447	0.156	0.304	1.154	0.045	0.135	0.430
Onion	0.814	0.506	0.441	0.351	0.459	0.464	0.236	0.257	1.184	0.156	0.068	0.363
Chard	0.551	0.199	0.265	0.401	0.293	0.312	0.053	0.238	0.994	0.052	0.198	0.493
Leek	—	—	—	—	—	—	—	—	—	—	—	—
Cabbage	0.737	0.315	0.412	0.466	0.196	0.159	0.184	0.321	1.133	0.039	0.253	0.547
Green bean	0.269	0.361	—	0.081	0.371	0.402	0.243	—	—	—	—	—
Zucchini	0.337	0.487	0.326	0.651	0.598	0.597	0.028	0.306	0.856	0.029	0.120	0.414
Tomato	0.401	0.419	0.315	0.529	0.485	0.488	0.105	0.129	0.740	0.048	0.211	0.506
Carrot	0.725	0.785	0.422	0.636	0.753	0.769	0.201	0.160	0.853	0.106	0.171	0.466
Cucumber	0.534	0.419	0.356	0.473	0.398	0.414	0.045	0.230	0.840	0.073	0.200	0.495
Red pepper	0.493	0.204	0.267	0.437	0.337	0.355	0.029	0.316	0.896	0.108	0.158	0.453
Lemon	0.551	0.270	0.220	0.438	0.408	0.415	0.056	0.286	0.990	0.025	0.126	0.421
Lettuce	0.682	0.271	0.239	0.352	0.304	0.322	0.071	0.251	1.099	0.010	0.236	0.531
Banana	0.625	0.365	0.145	0.611	0.456	0.491	0.156	0.288	0.955	0.100	0.059	0.354
Apple	0.684	0.301	0.341	0.397	0.240	0.215	0.142	0.258	1.086	0.071	—	—
Pear	0.646	0.371	0.386	0.416	0.337	0.296	0.150	0.326	0.969	0.064	0.002	0.297
Grape	0.693	0.297	—	0.362	0.334	0.330	0.091	0.349	1.103	0.081	0.182	0.477
Garlic	0.849	0.496	0.180	0.312	0.488	0.497	0.209	0.346	1.239	0.030	0.097	0.392
Cauliflower	0.444	0.555	0.201	0.578	0.642	0.666	0.107	0.243	0.660	0.014	0.193	0.488
Broccoli	0.466	0.285	0.320	0.466	0.304	0.331	0.104	0.183	0.842	—	0.203	0.498
Eggplant	0.000	0.385	0.309	0.510	0.652	0.676	0.178	0.268	0.711	0.086	0.127	0.439
Green pepper	0.385	0.000	0.273	0.396	0.347	0.371	0.036	0.253	0.854	0.177	0.195	0.476
Beans	0.309	0.273	0.000	0.236	0.207	0.206	0.070	0.150	0.669	0.094	0.145	—
Plum	0.510	0.396	0.236	0.000	0.429	0.451	0.154	0.264	0.915	0.043	0.112	0.367
Nectarine	0.652	0.347	0.207	0.429	0.000	0.042	0.184	0.257	0.986	0.068	0.185	0.432
Peach	0.676	0.371	0.206	0.451	0.042	0.000	0.182	0.267	0.995	0.063	0.214	0.427
Flat peach	0.178	0.036	0.070	0.154	0.184	0.182	0.000	0.112	0.289	0.087	0.106	0.140
Artichoke	0.268	0.253	0.150	0.264	0.257	0.267	0.112	0.000	0.559	0.110	0.158	0.245
Mushroom	0.711	0.854	0.669	0.915	0.986	0.995	0.289	0.559	0.000	0.140	0.287	0.517
Orange	0.086	0.177	0.094	0.043	0.068	0.063	0.087	0.110	0.140	0.000	0.110	0.107
Mango	0.127	0.195	0.145	0.112	0.185	0.214	0.106	0.158	0.287	0.110	0.000	0.059
Cherimoya	0.439	0.476	—	0.367	0.432	0.427	0.140	0.245	0.517	0.107	0.059	0.000

Figure A2. Heatmap of the distance between all products in the dataset. Grey color indicates missing data.

References

Rossi, A.; Bui, S.; Marsden, T. Redefining Power Relations in Agrifood Systems. J. Rural. Stud. 2019, 68, 147–158. [Google Scholar] [CrossRef]
U.S. General Accounting Office. Energy Security and Policy: Analysis of the Pricing of Crude Oil and Petroleum Products; Technical Report; GAO: Washington, DC, USA, 1993.
Peltzman, S. Prices Rise Faster Than They Fall. J. Political Econ. 2000, 108, 466–502. [Google Scholar] [CrossRef]
Wlazlowski, S. Petrol and Crude Oil Prices: Asymmetric Price Transmission; Working Paper; University of Munich, Department of Economics: Munich, Germany, 2001. [Google Scholar]
Meyer, J.; von Cramon-Taubadel, S. Asymmetric Price Transmission: A Survey. J. Agric. Econ. 2004, 55, 581–611. [Google Scholar] [CrossRef]
Rezitis, A.N.; Tsionas, M. Modeling Asymmetric Price Transmission in the European Food Market. Econ. Model. 2019, 76, 216–230. [Google Scholar] [CrossRef]
Gizaw, D.; Myrland, Ø.; Xie, J. Asymmetric Price Transmission in a Changing Food Supply Chain. Aquac. Econ. Manag. 2021, 25, 89–105. [Google Scholar] [CrossRef]
García-Gallego, J.M.; Chamorro-Mera, A.; Valero-Amaro, V.; Martínez-Jiménez, M.; Romero, P.; Miranda, M.T.; Rubio, S. Agri-Food E-Marketplaces as New Business Models for Smallholders: A Case Analysis in Spain. Agriculture 2025, 15, 1806. [Google Scholar] [CrossRef]
Gallego, F.J.; Díaz-Puente, J.M.; Quesada, D.F.; Bettoni, M. Modelling critical innovation factors in rural agrifood industries: A case study in Cuenca, Spain. Sustainability 2021, 13, 9514. [Google Scholar] [CrossRef]
Garai, S.; Paul, R.K.; Rakshit, D.; Yeasin, M.; Emam, W.; Tashkandy, Y.; Chesneau, C. Wavelets in combination with stochastic and machine learning models to predict agricultural prices. Mathematics 2023, 11, 2896. [Google Scholar] [CrossRef]
Sun, F.; Meng, X.; Zhang, Y.; Wang, Y.; Jiang, H.; Liu, P. Agricultural product price forecasting methods: A review. Agriculture 2023, 13, 1671. [Google Scholar] [CrossRef]
Zhang, N.; An, Q.; Zhang, S.; Ma, H. Price Prediction for Fresh Agricultural Products Based on a Boosting Ensemble Algorithm. Mathematics 2024, 13, 71. [Google Scholar] [CrossRef]
USDA Economic Research Service; U.S. Department of Agriculture. Price Spreads from Farm to Consumer—Documentation. 2025. Available online: https://www.ers.usda.gov/data-products/price-spreads-from-farm-to-consumer (accessed on 2 November 2025).
European Commission, DG AGRI. Agri-Food Data Portal—Fruit and Vegetables. Web Portal. 2025. Available online: https://agridata.ec.europa.eu/extensions/DataPortal/fruit-and-vegetables.html (accessed on 15 January 2026).
Eurostat. How Much Fruit and Vegetables Does the EU Harvest? (2024 Figures). News Item. 2025. Available online: https://ec.europa.eu/eurostat/web/products-eurostat-news/w/ddn-20250825-1 (accessed on 13 February 2026).
Vavra, P.; Goodwin, B.K. Analysis of Price Transmission Along the Food Chain; Technical Report; OECD Publishing: Paris, France, 2005. [Google Scholar] [CrossRef]
European Commission. Analysis of Price Transmission Along the Food Supply Chain in the EU; Commission Staff Working Document SEC(2009) 1450; European Commission: Brussels, Belgium, 2009.
Bukeviciute, L.; Dierx, A.; Ilzkovitz, F. Price Transmission Along the Food Supply Chain in the European Union; Technical Report; European Commission, Economic Papers; European Commission: Brussels, Belgium, 2009. [Google Scholar]
Conforti, P. Price Transmission in Selected Agricultural Markets; Technical Report; FAO Commodity and Trade Policy Research Working Paper; FAO: Roma, Italy, 2004. [Google Scholar]
FAO. Market Integration and Price Transmission in Consumer Markets; Technical Report; GIEWS/FAO Research Note; FAO: Rome, Italy, 2014. [Google Scholar]
Ben-Kaabia, M.; Gil, J.M. Asymmetric Price Transmission in the Spanish Lamb Sector. Span. J. Agric. Res. 2007, 5, 259–270. [Google Scholar] [CrossRef]
Fruitnet/Fruit Logistica. Trend Report 2025: Future Trends in Fresh Produce Supply; Fruit Logistica: Berlin, Germany, 2025. [Google Scholar]
Rojas-Reyes, J.J.; Rivera-Cadavid, L.; Peña Orozco, D.L. Disruptions in the food supply chain: A literature review. Heliyon 2024, 10, e34730. [Google Scholar] [CrossRef]
Kumar, A.; Divyanshu; Prasher, R.S.; Chandel, R.S.; Dev, I.; Sharma, S.; Mehta, P.; Vashishat, R.K. Market performance and supply chain selection dynamics for vegetables grown through sustainable practices in the Northwest Himalayan region. Front. Sustain. Food Syst. 2025, 9, 1558481. [Google Scholar] [CrossRef] [PubMed]
United Fresh New Zealand. Fruits and Vegetables: Global Value Chains Explained—Briefing Note 1: Understanding Costs and Prices; Global Coalition of Fresh Produce: Auckland, New Zealand, 2025. [Google Scholar]
Fernando, S.P.; Ruhunuge, I.J.A.; Wijeratne, A.W.; Esham, M.; Kuruppu, I.V. An Assessment of Cost Factor Transmission at Different Nodes of Carrot and Leek Vegetable Supply Chains. In Proceedings of the International Conference on Applied and Pure Sciences (ICAPS 2024), Kelaniya, Sri Lanka, 10–11 October 2024. [Google Scholar]
Valdés, R. Sustainable Food Value Chains: Approaches to Transaction Costs in Agro-Alimentary Systems of Developing Countries—A Chile Case Study. Sustainability 2024, 16, 3952. [Google Scholar] [CrossRef]
Gardner, B.L. The Farm-Retail Price Spread in a Competitive Food Industry. Am. J. Agric. Econ. 1975, 57, 399–409. [Google Scholar] [CrossRef]
Coordinadora de Organizaciones de Agricultores y Ganaderos (COAG). IPOD: Índice de Precios en Origen y Destino de los Alimentos. Consultado en línea. Available online: https://coag.chil.me/post/ipod-indice-de-precios-en-origen-y-destino-de-los-alimentos-122677 (accessed on 17 January 2026).
Mercasa. Precios y Mercados Mayoristas. 2025. Available online: https://www.mercasa.es/precios-y-mercados-mayoristas/ (accessed on 9 February 2025).
Liao, T.W. Clustering of time series data—A survey. Pattern Recognit. 2005, 38, 1857–1874. [Google Scholar] [CrossRef]
Aghabozorgi, S.; Shirkhorshidi, A.S.; Wah, T.Y. Time-series clustering—A decade review. Inf. Syst. 2015, 53, 16–38. [Google Scholar] [CrossRef]
Sakoe, H.; Chiba, S. Dynamic Programming Algorithm Optimization for Spoken Word Recognition. IEEE Trans. Acoust. Speech Signal Process. 2003, 26, 43–49. [Google Scholar] [CrossRef]
Mantegna, R.N. Hierarchical structure in financial markets. Eur. Phys. J. B 1999, 11, 193–197. [Google Scholar] [CrossRef]
Paparrizos, J.; Gravano, L. k-Shape: Efficient and Accurate Clustering of Time Series. In Proceedings of the 33rd ACM SIGMOD International Conference on Management of Data; Association for Computing Machinery: New York, NY, USA, 2015; pp. 1855–1870. [Google Scholar] [CrossRef]
Tonekaboni, S.; Eytan, D.; Goldenberg, A. Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding. In Proceedings of the International Conference on Learning Representations (ICLR), Virtual Event, 3–7 May 2021. [Google Scholar]
Grabocka, J.; Schilling, N.; Wistuba, M.; Schmidt-Thieme, L. Learning time-series shapelets. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2014; pp. 392–401. [Google Scholar] [CrossRef]
Cuturi, M.; Vert, J.P.; Birkenes, Ø.; Matsui, T. A kernel for time series based on global alignments. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); IEEE: New York, NY, USA, 2007; Volume 2, pp. 413–416. [Google Scholar] [CrossRef]
Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
Chen, W.; Hussain, W.; Cauteruccio, F.; Zhang, X. Deep Learning for Financial Time Series Prediction: A State-of-the-Art Review of Standalone and Hybrid Models. Comput. Model. Eng. Sci. 2024, 139, 187–224. [Google Scholar] [CrossRef]
Alqahtani, A.; Ali, M.; Xie, X.; Jones, M.W. Deep Time-Series Clustering: A Review. Electronics 2021, 10, 3001. [Google Scholar] [CrossRef]
Xu, R.; Wang, C.; Li, Y.; Wu, J. Generalized Time Warping Invariant Dictionary Learning for Time Series Classification and Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 3611–3624. [Google Scholar] [CrossRef] [PubMed]
Cichocki, A.; Mandic, D.; De Lathauwer, L.; Zhou, G.; Zhao, Q.; Caiafa, C.; Phan, H.A. Tensor decompositions for signal processing applications: From two-way to multiway component analysis. IEEE Signal Process. Mag. 2015, 32, 145–163. [Google Scholar] [CrossRef]
Wu, Z.; Pan, S.; Long, G.; Jiang, J.; Chang, X.; Zhang, C. Connecting the dots: Multivariate time series forecasting with graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining; Association for Computing Machinery: New York, NY, USA, 2020; pp. 753–763. [Google Scholar] [CrossRef]
Forestier, G.; Petitjean, F.; Dau, H.A.; Webb, G.I.; Keogh, E. Generating synthetic time series to augment sparse datasets. In IEEE International Conference on Data Mining (ICDM); IEEE: New York, NY, USA, 2017; pp. 865–870. [Google Scholar] [CrossRef]

Figure 1. Heatmap of the correlation matrix for the selection of the products.

Figure 2. Heatmap of the Euclidean distance matrix for the selection of the products.

Figure 3. Representation of the decrease of the squares (elbow method).

Figure 4. Representation of the clusters.

Figure 5. Representation of the decrease of the squares for whole-yearly data (elbow method).

Figure 6. Representation of the five most relevant groups (clustering).

Table 1. Producer share in September for the selection of products that we work with (2008–2016).

	2008	2009	2010	2011	2012	2013	2014	2015	2016
Watermelon	0.373626	0.266667	0.200000	0.200000	0.200000	0.200000	0.246753	0.382716	0.230769
Melon	0.307692	0.260163	0.250000	0.250000	0.250000	0.252252	0.120370	0.161017	0.200000
Potato	0.229167	0.067416	0.397059	0.119403	0.265625	0.266667	0.075758	0.333333	0.402439
Onion	0.168142	0.095238	0.508475	0.126214	0.153846	0.333333	0.240000	0.297030	0.267857
Chard	0.228395	0.271523	0.259740	0.219355	0.257862	0.203488	0.243243	0.257732	0.204878
Cabbage	0.325203	0.123967	0.136752	0.171171	0.271186	0.148760	0.126051	0.213741	0.206612
Zucchini	0.267857	0.210145	0.288732	0.268657	0.281046	0.281879	0.335404	0.211180	0.108696
Tomato	0.399039	0.221591	0.210256	0.201183	0.358696	0.189944	0.316940	0.351464	0.261307
Carrot	0.188119	0.114583	0.468750	0.127660	0.416667	0.230000	0.453608	0.396040	0.519608
Cucumber	0.168919	0.179775	0.340909	0.218045	0.237762	0.201439	0.226950	0.273973	0.171233
Red pepper	0.144186	0.269231	0.285047	0.266667	0.202020	0.242991	0.200000	0.213741	0.181818
Lettuce	0.224719	0.297619	0.244444	0.209302	0.300000	0.164949	0.193182	0.260417	0.202128
Cauliflower	0.335294	0.335294	0.374269	0.327381	0.323864	0.208556	0.443750	0.365482	0.326316
Broccoli	0.215962	0.215962	0.200957	0.198238	0.198238	0.198238	0.205021	0.332090	0.269710
Eggplant	0.281046	0.281046	0.279503	0.365854	0.383648	0.267974	0.329268	0.222222	0.182390
Green pepper	0.292135	0.292135	0.187166	0.274286	0.183333	0.192308	0.187845	0.250000	0.210000

Table 2. Producer share in September for the selection of products that we work with (2017–2024).

	2017	2018	2019	2020	2021	2022	2023	2024
Watermelon	0.262500	0.193182	0.215190	0.225000	0.132075	0.282209	0.220930	0.264901
Melon	0.231343	0.155039	0.191304	0.179856	0.220339	0.316547	0.343137	0.229008
Potato	0.121622	0.457831	0.165217	0.086614	0.146552	0.213837	0.208589	0.276596
Onion	0.235849	0.168142	0.125828	0.159420	0.118182	0.188811	0.222798	0.187845
Chard	0.262136	0.187500	0.199005	0.212766	0.286957	0.269531	0.269091	0.208029
Cabbage	0.168000	0.203252	0.156863	0.216216	0.173913	0.237500	0.238636	0.201058
Zucchini	0.561404	0.187920	0.193548	0.430657	0.194245	0.366071	0.314103	0.331579
Tomato	0.391089	0.246446	0.325444	0.344633	0.309179	0.271255	0.353774	0.343220
Carrot	0.412371	0.461538	0.442105	0.328000	0.205357	0.174757	0.315385	0.189543
Cucumber	0.218543	0.301370	0.240310	0.415094	0.203704	0.386047	0.312169	0.452941
Red pepper	0.276151	0.209205	0.260274	0.305936	0.320988	0.308725	0.232919	0.332090
Lettuce	0.250000	0.211538	0.281250	0.186047	0.177570	0.166667	0.157895	0.181818
Cauliflower	0.381503	0.430939	0.336364	0.347059	0.380368	0.276042	0.282609	0.356877
Broccoli	0.274262	0.280443	0.183562	0.246429	0.261029	0.387205	0.356401	0.293478
Eggplant	0.592233	0.295337	0.277487	0.406417	0.400000	0.452675	0.426471	0.339056
Green pepper	0.270142	0.167488	0.188679	0.225490	0.277228	0.308880	0.145390	0.275556

Table 3. Pearson correlation matrix. Columns 1–8.

	Watermelon	Melon	Potato	Onion	Chard	Cabbage	Zucchini	Tomato
Watermelon	1.0000	0.1472	0.0808	0.2241	0.1176	0.4911	0.1177	0.4306
Melon	0.1472	1.0000	−0.0944	−0.1177	0.3947	0.4818	0.1624	0.0843
Potato	0.0808	−0.0944	1.0000	0.5250	−0.3228	0.2385	−0.4465	−0.2066
Onion	0.2241	−0.1177	0.5250	1.0000	0.0474	−0.2297	0.0823	−0.2571
Chard	0.1176	0.3947	−0.3228	0.0474	1.0000	−0.0132	0.2231	0.1878
Cabbage	0.4911	0.4818	0.2385	−0.2297	−0.0132	1.0000	0.0157	0.5722
Zucchini	0.1177	0.1624	−0.4465	0.0823	0.2231	0.0157	1.0000	0.3883
Tomato	0.4306	0.0843	−0.2066	−0.2571	0.1878	0.5722	0.3883	1.0000
Carrot	−0.1207	−0.6108	0.4620	0.4452	−0.2132	−0.0959	−0.0812	0.1653
Cucumber	0.0003	0.0159	0.1407	0.1383	−0.0867	0.1082	0.3836	0.1096
Red pepper	−0.4040	0.0702	−0.3164	−0.0771	0.2220	−0.4051	0.3645	−0.1739
Lettuce	0.3720	−0.1733	−0.0196	−0.1025	0.1326	−0.0819	−0.1585	0.1130
Cauliflower	0.0159	−0.7262	0.0201	−0.0608	0.0203	−0.2398	0.0125	0.2689
Broccoli	0.3877	0.4960	0.1906	−0.1346	0.3235	0.8627	0.1572	0.3760
Eggplant	−0.1197	0.4029	−0.5004	−0.2644	0.4266	0.2278	0.8245	0.4815
Green pepper	0.3233	0.1678	−0.3897	−0.3411	0.3342	−0.0991	0.2020	−0.0377

Table 4. Pearson correlation matrix. Columns 9–16.

	Carrot	Cucumber	Red p.	Lettuce	Cauliflower	Broccoli	Eggplant	Green p.
Watermelon	−0.1207	0.0003	−0.4040	0.3720	0.0159	0.3877	−0.1197	0.3233
Melon	−0.6108	0.0159	0.0702	−0.1733	−0.7262	0.4960	0.4029	0.1678
Potato	0.4620	0.1407	−0.3164	−0.0196	0.0201	0.1906	−0.5004	−0.3897
Onion	0.4452	0.1383	−0.0771	−0.1025	−0.0608	−0.1346	−0.2644	−0.3411
Chard	−0.2132	−0.0867	0.2220	0.1326	0.0203	0.3235	0.4266	0.3342
Cabbage	−0.0959	0.1082	−0.4051	−0.0819	−0.2398	0.8627	0.2278	−0.0991
Zucchini	−0.0812	0.3836	0.3645	−0.1585	0.0125	0.1572	0.8245	0.2020
Tomato	0.1653	0.1096	−0.1739	0.1130	0.2689	0.3760	0.4815	−0.0377
Carrot	1.0000	−0.0305	−0.3823	0.2416	0.4305	−0.2225	−0.2310	−0.6696
Cucumber	−0.0305	1.0000	0.5796	−0.3464	0.0300	0.4106	0.2172	0.0687
Red pepper	−0.3823	0.5796	1.0000	−0.2290	−0.0927	0.0769	0.4461	0.6174
Lettuce	0.2416	−0.3464	−0.2290	1.0000	0.2365	−0.3986	−0.1718	0.0138
Cauliflower	0.4305	0.0300	−0.0927	0.2365	1.0000	−0.1715	0.0208	−0.0333
Broccoli	−0.2225	0.4106	0.0769	−0.3986	−0.1715	1.0000	0.3516	0.2838
Eggplant	−0.2310	0.2172	0.4461	−0.1718	0.0208	0.3516	1.0000	0.2843
Green pepper	−0.6696	0.0687	0.6174	0.0138	−0.0333	0.2838	0.2843	1.0000

Table 5. Distance matrix. Columns 1–8.

Product	Watermelon	Melon	Potato	Onion	Chard	Cabbage	Zucchini	Tomato
Watermelon	0.0000	0.3544	0.5431	0.3921	0.2865	0.3300	0.5278	0.3667
Melon	0.3544	0.0000	0.5489	0.4092	0.2358	0.2647	0.5328	0.4836
Potato	0.5431	0.5489	0.0000	0.4400	0.5370	0.4937	0.8062	0.6723
Onion	0.3921	0.4092	0.4400	0.0000	0.4304	0.5031	0.6349	0.6513
Chard	0.2865	0.2358	0.5370	0.4304	0.0000	0.3051	0.4555	0.3763
Cabbage	0.3300	0.2647	0.4937	0.5031	0.3051	0.0000	0.5971	0.4883
Zucchini	0.5278	0.5328	0.8062	0.6349	0.4555	0.5971	0.0000	0.4086
Tomato	0.3667	0.4836	0.6723	0.6513	0.3763	0.4883	0.4086	0.0000
Carrot	0.6729	0.8012	0.6612	0.6804	0.6744	0.7928	0.7295	0.5712
Cucumber	0.4613	0.4627	0.5798	0.5469	0.4006	0.4908	0.4382	0.4383
Red pepper	0.4089	0.3319	0.5926	0.4962	0.2269	0.4232	0.4219	0.4243
Lettuce	0.3031	0.3468	0.5200	0.4621	0.2250	0.3102	0.5589	0.4577
Cauliflower	0.5451	0.6705	0.7292	0.7204	0.5064	0.7195	0.5452	0.3606
Broccoli	0.2903	0.3218	0.5460	0.5439	0.2639	0.3664	0.4995	0.3037
Eggplant	0.6710	0.6470	0.9188	0.8393	0.5680	0.7593	0.3473	0.4132
Green pepper	0.2589	0.2914	0.6043	0.5214	0.2056	0.3244	0.5021	0.4317

Table 6. Distance matrix. Columns 9–16.

Product	Carrot	Cucumber	R. Pepper	Lettuce	Cauliflower	Broccoli	Eggplant	G. Pepper
Watermelon	0.6729	0.4613	0.4089	0.3031	0.5451	0.2903	0.6710	0.2589
Melon	0.8012	0.4627	0.3319	0.3468	0.6705	0.3218	0.6470	0.2914
Potato	0.6612	0.5798	0.5926	0.5200	0.7292	0.5460	0.9188	0.6043
Onion	0.6804	0.5469	0.4962	0.4621	0.7204	0.5439	0.8393	0.5214
Chard	0.6744	0.4006	0.2269	0.2250	0.5064	0.2639	0.5680	0.2056
Cabbage	0.7928	0.4908	0.4232	0.3102	0.7195	0.3664	0.7593	0.3244
Zucchini	0.7295	0.4382	0.4219	0.5589	0.5452	0.4995	0.3473	0.5021
Tomato	0.5712	0.4383	0.4243	0.4577	0.3606	0.3037	0.4132	0.4317
Carrot	0.0000	0.6893	0.7149	0.6762	0.4926	0.6947	0.7470	0.8096
Cucumber	0.6893	0.0000	0.2938	0.4931	0.4994	0.3491	0.5503	0.4323
Red pepper	0.7149	0.2938	0.0000	0.3391	0.4732	0.3011	0.5077	0.2099
Lettuce	0.6762	0.4931	0.3391	0.0000	0.5809	0.4203	0.7028	0.2798
Cauliflower	0.4926	0.4994	0.4732	0.5809	0.0000	0.5042	0.4578	0.5717
Broccoli	0.6947	0.3491	0.3011	0.4203	0.5042	0.0000	0.5130	0.3138
Eggplant	0.7470	0.5503	0.5077	0.7028	0.4578	0.5130	0.0000	0.6218
Green pepper	0.8096	0.4323	0.2099	0.2798	0.5717	0.3138	0.6218	0.0000

Table 7. Importance of principal components (PCA), PCs 1–8.

	PC1	PC2	PC3	PC4	PC5	PC6	PC7	PC8
Standard deviation	2.3674	2.0913	1.4881	1.09013	1.02472	0.89594	0.85047	0.6374
Proportion of Variance	0.3297	0.2573	0.1303	0.06991	0.06177	0.04722	0.04255	0.0239
Cumulative Proportion	0.3297	0.5870	0.7172	0.78713	0.84890	0.89612	0.93866	0.9626

Table 8. Clusters of Vegetables/Fruits.

Cluster 1	Cluster 2	Cluster 3	Cluster 4	Cluster 5
Cucumber	Watermelon	Potato	Zucchini	Carrot
Red pepper	Melon	Onion	Tomato
Broccoli	Chard		Cauliflower
	Cabbage		Eggplant
	Lettuce
	Green pepper

Table 9. Ratios per product in 2024, ordered grouping the elements of the clusters.

Product	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
Zucchini	0.3214	0.3214	0.2151	0.2151	0.3099	0.3099	0.3558	0.4811	0.4811	0.1545	0.1545	0.2290
Eggplant	0.1730	0.1923	0.1882	0.1868	0.1833	0.1934	0.1782	0.1694	0.2270	0.1916	0.2312	0.2011
Cucumber	0.1792	0.2222	0.5504	0.4009	0.1925	0.1818	0.2011	0.2216	0.2135	0.4257	0.1981	0.3391
Cabbage	0.1531	0.1397	0.1257	0.2021	0.2154	0.1285	0.1459	0.2143	0.1337	0.1364	0.1329	0.1878
Melon	0.2149	0.2149	0.2285	0.2285	0.2411	0.2411	0.2260	0.3274	0.3274	0.2045	0.2045	0.2649
Watermelon	0.2819	0.3066	0.3510	0.5561	0.2384	0.2516	0.3073	0.2249	0.2811	0.2842	0.4286	0.3316
Chard	0.2000	0.2873	0.1784	0.1749	0.1934	0.2727	0.2674	0.2000	0.2179	0.1862	0.2287	0.2766
Broccoli	0.2308	0.1758	0.2000	0.3259	0.3185	0.1718	0.1768	0.2569	0.2071	0.2000	0.1895	0.1895
Onion	0.2986	0.2867	0.2926	0.3679	0.3880	0.2448	0.4014	0.4197	0.3566	0.3516	0.4086	0.3321
Potato	0.1300	0.2976	0.3566	0.2552	0.1711	0.2857	0.2454	0.2521	0.1318	0.3333	0.3628	0.3432
Carrot	0.3633	0.2017	0.3720	0.3427	0.3898	0.1628	0.2906	0.4604	0.3588	0.2798	0.3803	0.2756
Lettuce	0.2119	0.1737	0.2471	0.1993	0.2128	0.1703	0.2098	0.2041	0.1959	0.2222	0.2294	0.2080
Cauliflower	0.2807	0.4795	0.2456	0.4841	0.2566	0.4600	0.4603	0.2675	0.4652	0.2191	0.3498	0.3569
Tomato	0.1653	0.1557	0.1750	0.1750	0.1429	0.2017	0.2000	0.1653	0.2000	0.1803	0.2195	0.1818
Red pepper	0.2921	0.2831	0.4456	0.3609	0.3514	0.3133	0.2284	0.2264	0.2042	0.3065	0.3511	0.4529
Green pepper	0.1472	0.2677	0.2677	0.1895	0.1449	0.2030	0.2023	0.1775	0.2022	0.2164	0.2445	0.2935

Table 10. Mean ratios per product and month (2009–2024), ordered grouping the elements of the clusters of the 2024 table.

Producto	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
Zucchini	0.440	0.267	0.240	0.182	0.175	0.190	0.195	0.229	0.285	0.293	0.349	0.348
Eggplant	0.409	0.343	0.205	0.156	0.187	0.169	0.230	0.268	0.344	0.189	0.278	0.408
Cucumber	0.379	0.410	0.277	0.165	0.168	0.176	0.254	0.275	0.274	0.271	0.337	0.327
Cabbage	0.195	0.176	0.188	0.172	0.186	0.204	0.163	0.188	0.187	0.192	0.194	0.174
Melon	0.216	0.216	0.216	0.216	0.302	0.189	0.180	0.193	0.222	0.208	0.216	0.216
Watermelon	0.229	0.229	0.229	0.229	0.285	0.206	0.204	0.196	0.240	0.243	0.229	0.229
Chard	0.237	0.250	0.245	0.224	0.229	0.244	0.247	0.230	0.238	0.224	0.197	0.204
Broccoli	0.207	0.177	0.206	0.177	0.202	0.208	0.217	0.261	0.265	0.219	0.217	0.219
Onion	0.218	0.227	0.269	0.238	0.240	0.213	0.204	0.196	0.214	0.200	0.200	0.213
Potato	0.231	0.227	0.231	0.265	0.285	0.266	0.272	0.256	0.225	0.201	0.201	0.196
Carrot	0.286	0.285	0.324	0.321	0.314	0.323	0.317	0.303	0.328	0.293	0.281	0.296
Lettuce	0.200	0.181	0.203	0.191	0.187	0.193	0.187	0.203	0.218	0.200	0.180	0.201
Cauliflower	0.307	0.252	0.288	0.268	0.329	0.297	0.297	0.301	0.344	0.299	0.253	0.285
Tomato	0.276	0.261	0.308	0.291	0.174	0.198	0.245	0.253	0.294	0.316	0.262	0.284
Red pepper	0.355	0.389	0.481	0.427	0.325	0.326	0.277	0.276	0.257	0.302	0.300	0.338
Green pepper	0.348	0.396	0.437	0.355	0.272	0.255	0.239	0.243	0.227	0.289	0.262	0.332

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sánchez-Arnau, E.; Ferrer-Sapena, A.; Sánchez-Arnau, C.; Sánchez-Pérez, E.A. Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships. Mathematics 2026, 14, 714. https://doi.org/10.3390/math14040714

AMA Style

Sánchez-Arnau E, Ferrer-Sapena A, Sánchez-Arnau C, Sánchez-Pérez EA. Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships. Mathematics. 2026; 14(4):714. https://doi.org/10.3390/math14040714

Chicago/Turabian Style

Sánchez-Arnau, Elena, Antonia Ferrer-Sapena, Claudia Sánchez-Arnau, and Enrique A. Sánchez-Pérez. 2026. "Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships" Mathematics 14, no. 4: 714. https://doi.org/10.3390/math14040714

APA Style

Sánchez-Arnau, E., Ferrer-Sapena, A., Sánchez-Arnau, C., & Sánchez-Pérez, E. A. (2026). Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships. Mathematics, 14(4), 714. https://doi.org/10.3390/math14040714

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Time-Series Similarity and Clustering of Producer Share Dynamics in Agrifood Markets: Evidence from Origin–Destination Price Relationships

Abstract

1. Introduction: The Producer Share in Agrifood Value Chains

1.1. Context and Related Literature

1.2. Main Technical Tools

1.3. Experimental Data on Price Transmission in the Supply Chain

2. Methodology

2.1. Theoretical Background

2.2. Objects of Analysis and Notation

2.3. Two Complementary Notions of Similarity

2.4. Clustering and Equivalence Classes of Products

2.5. Interpreting Clusters in Terms of Producer Favorability

2.6. Clustering of Producer-Share Vectors

2.7. Robustness, Missing Data, and Sensitivity

2.8. Data Preparation

3. Results

3.1. Fixed Month Time Series

3.1.1. Pearson Correlation

3.1.2. Euclidean Distance

3.1.3. Clustering and PCA

3.2. Complete Time Series Data: All Years and Months

4. Discussion

5. Conclusions

5.1. Methodological Contributions

5.2. Methodological Limitations and Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Correlation Matrix and Heatmap for All the Products

Appendix B. Distance Matrix and Heatmap for All the Products

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI