2.1. Overview
It is important to note that at the global level, the most reliable component of trade data is the monetary value of traded goods. This is because in most countries, trade data originates from customs declarations, where monetary values are essential for determining tariffs and duties. While quantity data is often included, it can be less reliable and has only been systematically available since 2006—insufficient for the longitudinal analysis conducted in this study.
Traditional material flow analysis typically relies on physical quantity data combined with estimates of material concentrations to derive flows. Monetary-based flows are more common when multi-regional input–output data is used (MRIO). This allows the disaggregation of flows on a sectoral level—but for many applications, including the P flows before biomass production, it is not detailed enough, see also [
24,
25]. In contrast, our method is designed to work directly with monetary trade values. Since we aim to estimate relative shares of global trade rather than absolute quantities, using value data is sufficient for our purposes, as detailed in
Section 2.3. Additionally, trade values often implicitly reflect product quality or concentration (e.g., higher-grade materials usually command higher prices), which can serve as a proxy for material content.
Trade data is reported at varying levels of detail. For cross-country comparability, we use the Harmonized System (HS) at the 6-digit level, which includes about5400 product categories. While this level ensures broad international comparability, individual product categories still contain a degree of heterogeneity.
Trade data is not without its challenges: gaps, inconsistencies, and valuation differences—including those arising from exchange rate fluctuations—are well-documented [
26,
27]. To address these issues, we use a cleaned and reconciled version of the UN Comtrade database, published as Atlas [
28], where import and export values are harmonized and reported on an FOB (free on board) basis.
In our analysis, we approximate the flow of phosphorus between countries using annual aggregate trade values. To derive physical flows, we estimate the average phosphorus content per USD for each relevant product category. These estimates allow us to convert value-based flows into material flows. The total of these category-specific flows is constrained to match known global phosphate flows for each year, as defined by the boundaries of our system.
Our system boundaries are on the one side set by P production, approximated by phosphate rock mining data by country. On the other side, they are set by the amount of P used in fertilizer application by country. To align data across production, trade, and use, we normalize country-level values relative to total global values in each year. This produces a flow matrix where each entry in row i represents the fraction of global phosphorus mined in country i that is ultimately used in country j. Row sums of F represent the share of global phosphorus mined in a given country. Column sums represent the share of global phosphorus used in a given country. The entire matrix is normalized such that all elements sum to 1. To convert these shares into absolute quantities (e.g., in metric tons of ), the values can be multiplied by the total global phosphate availability in each year (e.g., as reported by a geological survey). This yields a detailed country-by-country breakdown of the origins and destinations of phosphorus applied as fertilizer.
2.2. Data
Data on the mining of phosphate rock (in the following abbreviated as PR) for the period from 2001 to 2022 was obtained from the World Mining database [
29]. This dataset contains information on global mining activity; in particular, this includes 39 countries with the current ability to mine PR, of which 37 reported mining in 2022. We are aware that other data sources exist, especially for data on mining, yet we choose World Mining because this source is very comprehensive, especially for smaller countries. For an overview of data sources on phosphate and the related problems, the reader is referred to [
30]. The total amount mined in 2022 was 71.1 million metr. t of
, see the left panel of
Figure 1.
For the data on trade, we utilize the Comtrade database [
31] in the version provided as Atlas [
28]. We use data from those product categories that are known to contain significant amounts of P (verified in discussions with industry experts). We utilize data on the trade of products classified (according to the HS classification) in the categories 251010, 251020, 280910, 280920, 283510, 283521, 283522, 283523, 283524, 283525, 283526, 283529, 283531, 283539, 310310, 310320, 310390, 310510, 310520, 310530, 310540, 310551, 310559, 310560, and 310590. These 25 categories can be hierarchically aggregated into 5 more general categories, namely, natural calcium phosphate (phosphate rock), phosphoric acid, phosphates (mostly industrial use), phosphatic fertilizers, and mixed fertilizers.
The database contains information on the amounts traded on a bilateral basis in USD for 234 countries. For example, in 2022, the total trade volume for all the goods in the categories mentioned above amounted to USD 62.1 billion. Of this, USD 4.68 billion was traded in natural calcium phosphates, USD 7.25 billion was traded in phosphoric acid, USD 5.66 billion was traded in phosphates, USD 3.72 billion was traded in phosphatic fertilizers, and USD 40.82 billion was traded in mixed fertilizers. However, in this study, we will consider a weighted sum of these categories (details in
Section 2.4). For an overview of global trade, the reader is referred to
Figure A1 in the Appendix.
The use of phosphate can be approximated by data from the FAO [
32]. This dataset contains information on the import, export, production, and agricultural use of fertilizer in 167 countries (these countries are responsible for roughly 95% of P-related trade). This data is typically available with a delay of 2 years, which determines that the current endpoint for our analysis is 2022. For example, the agricultural use of P in fertilizer for 2022 is reported at 41.9 million metr. t of
, see also the right panel of
Figure 1. Hence, when one compares the figures of PR mining and P fertilizer use (and averages these over a few years), one could come to the conclusion that about 70% of mined phosphate rock ends up as fertilizer. Considering losses in the production processes of fertilizer and a tendency toward under-reporting in fertilizer use, we know that the actual share is in fact higher. Therefore, studies have looked at the technical processes that are involved in fertilizer production. For example, we know that about 90% of processed mined phosphate is used in a chemical wet process and mostly converted to phosphoric acid, out of which about 82% is used to produce fertilizer. Considering further that 15% of P fertilizers are not made from phosphoric acid, we can approximate a lower bound of 80% of total P used in fertilizer [
33]. Other studies estimate a higher fraction of P fertilizer use; at the upper end of the spectrum, the FAO estimated that 90% of mined PR is used by the fertilizer industry [
34]. These differences can partly be explained by slightly varying approximations of the losses in the fertilizer production processes, as well as by differences in the accounting for animal feed supplements (∼7%).
In any case, we note that in our analysis we make the simplifying assumption that the entire PR production (and no other source) is used as fertilizer. This means that the P used for other purposes, e.g., via phosphorus compounds, see [
35], is handled as if it flows in the same way. We also implicitly assume that mining and use take place in the same year, and we neglect the effect of changes in stocks as well as losses.
2.3. Trade-Based Flows
The phosphate trade network reflects large global dissimilarities between the mining and the agricultural uses of P. The shares in P-related trade over time are visualized in
Figure 2. China, Morocco, and Russia are large exporters of P-related goods. India and Brazil are the largest importers. The position of the USA is unique because of the very high share of locally used P. Still, the USA is a net exporter, since it exports large amounts of fertilizer, which dominates its value-based P trade statistics.
The trade network is, however, not sufficient to draw conclusions from about the flow of P in a material sense (see the bottom panels of
Figure 2). One reason is that goods in the different product categories contain P in very different amounts per USD value. The second reason is that countries can appear as net exporters of P while not mining any phosphate rock at all. This can happen because they act as a hub for the trade and production of P fertilizer. Third, we must account for flows within countries; this concerns fertilizer that is used in the same country where phosphate rock is mined.
In the following, we will show that these problems can, however, be overcome by transforming the matrix of imports and exports into a matrix of shares of net flows. We achieve this by applying some accounting identities and by re-evaluating indirect P flows between countries where necessary.
First of all, let us start by defining two vectors that carry the information on PR mining and on the agricultural use of P-based fertilizer by country,
and
. The data is taken from the World Mining and FAOSTAT databases, respectively. Both row vectors contain
N entries, where
N is the number of countries, in our case 234, plus 1 dummy for trades to undeclared places. For further calculations, it will be useful to normalize these vectors such that they represent the shares in global mining and use, respectively. For the countries for which no data on fertilizer use is available we assume that their share is the same as their share of P net imports. Hence, we define
and
For each pair of countries, the dataset contains the traded value in USD for the product (goods) categories. These can be represented by separate matrices with dimensions . In these matrices, each entry in row i represents exports of country i towards the country in column j. Since the imports and exports in the database have been harmonized, the matrices can equivalently be interpreted in a column-wise fashion; thus, entries in column j represent imports to country j from country i.
To utilize the information on P trade to inform us about actual P flows, it is useful to first consider only the net in P trade. For this, we can first calculate a matrix of net exports by subtracting imports from exports for each pair of countries and by dropping negative values; hence,
To account for the fact that each of these categories represents goods with a specific average P content, a weighting for these categories is necessary. Hence, in the following we will work with matrices of weighted P trade
,
where
is the weight given to product category
c and
. For the moment, however, we assume equal weights for all product categories. We will discuss optimized weighting schemes in
Section 2.4.
We further remove reciprocal flows between countries from
through different product categories by applying Equation (
3) equivalently. We then normalize the matrix of net exports by the total and obtain
To derive a complete picture of trade-based flows, we further must realize that we are missing information on P that is not traded but that stays within the country where it is mined. We can, however, approximate this data by assuming that exported P must originate from mined P that has not been used domestically or from net imports from other countries. The net imports can be calculated from , where corrects for the overall share of traded vs. locally used P. A value of was approximate from the aggregate data.
Hence, we can calculate a vector with the implied share of exported P,
as
From this we can calculate a vector
with implied shares of P that remain locally as
This now gives us the opportunity to include local P use as the diagonal entries into the normalized trade matrix, which (after re-scaling) results in the flow matrix
, the first version of a P flow representation. In this matrix, each element represents the material share of P moved from country
i to country
j.
where
I is an
identity matrix.
2.4. Optimal Category Weights and Fit
To validate whether any flow matrix F is a good approximation of the true flow of materials in the system, we have to evaluate if it resembles the distribution of PR mining and P used in agriculture correctly. Note that a mathematical solution for an optimal flow is , which would imply flows proportional to local use from all countries that mine PR. We are, however, interested in a flow matrix that resembles the actual material flows, which are distinct from and for which trade flows are known to be a latent version.
Hence, we will assume that the reported data on PR mining and the data on agricultural use of P represent the true shares of P origination and use, which of course is a simplification. For the analysis of the fit, it makes sense to use the relationships between the matrix
F and the vectors representing the shares of P mining and use,
M and
U. We can derive a vector of predicted P use
by multiplying with the flow matrix
where
J is a vector of ones with dimensions
. Similarly, we can predict the implied mining by using the transposed value of
F and calculating
These relationships can also be expressed as the row and column sums of
F. We note that Equation (
7) implies that the entries on the main diagonal of
F contain information on
U and
M by construction. We can use the predictions and compare
with
U and
with
M to evaluate the accuracy of
F. We can in fact use this evaluation to solve one remaining problem in the calculation of
F in the first place.
In particular, we have to investigate if the weighting scheme in the calculation of the trade matrix (see Equation (
4)) can be improved. In an ideal setting, we would try to calculate the exact P content of each trade relationship that each country has with each other country for each product category. This, however, is not feasible, due to resource constraints and data availability. We can, however, obtain the approximate P content indirectly by estimating the optimal weighting scheme that results in the flow matrix that gives us the most likely actual P flows between countries.
For this purpose, we define a function
D that evaluates the difference between the estimated and the observed quantities in terms of a weighted sum of absolute errors.
Since we are interested in minimizing the misallocation of P flows globally, we found that absolute errors are in this case a better choice than squared errors, since the latter would (due to the size distribution of countries) lead to an undesirable focus mainly on fitting a few large countries’ P data. We obtain optimal weights by employing a non-linear optimization of
D by choosing positive weights
c in Equation (
4) such that
D is minimized. Repeating the calculations described in
Section 2.3 generates an improved flow matrix
.
We found that it is useful to give the mining part of Equation (
11) a slightly higher weight than the use part. This is caused by the fact that the fertilizer use data is rather noisy, because for many countries figures are estimated or approximated. In particular, we chose
and
. We found that the exact weighting of
has only negligible effects on our results (see also
Section 2.8). Values of
will, however, lead to noisy estimation results. Product categories that contain only very few entries were omitted to guarantee that an optimum for (
11) could be found. We found that it is sufficient to include the 11 product categories with the most volume (which account for 90 % of the overall trade in the 25 P-related categories). While it is numerically possible to include up to around 16 categories in the estimation, this does not improve the overall fit of the estimated P flows.
The difference between the flow matrices
and
is that the latter is now based more heavily on product categories that are responsible for a high material P flow, irrespective of the monetary value. In the calculation of the flow, we now mainly rely on the information from the trade in phosphoric acid, DAP and MAP fertilizers, and calcium phosphate. To a lesser degree, superphosphates, sodium triphosphate, and other fertilizer forms are also considered. The optimal weighting scheme shows some variation from year to year. For an example, see
Figure A2 in the Appendix.
Table 1 gives an overview of the fit. We note that in all our analyses, we separately estimate the weights and calculate the flow matrices for the years 2001–2022. While using net exports already produces a good fit of the flow matrix with the data, we observe that just by optimizing the weights, we can improve the average
from 94% to 98% (first vs. second column). In the following, we will investigate how much further adjustments to the trade data can improve these results.
2.5. Correcting for PR Origination
In our analysis, we rely on the fact that overall, the recorded net exports must resemble the actual flow of P to a large extent. Some distortion by trade activity, i.e., the fact that P may travel through several countries and possibly several product categories before being used, is inevitable. While we cannot correct for the entirety of these effects, it is possible to make noticeable improvements by using information on the mining activity in different countries.
When we evaluate the row sums of , we observe that some countries show up as net exporters of P without having sufficient mining activity. To derive a flow matrix that is consistent with the qualitative aspects of the mining data, this aspect must be corrected. This means that we have to shift these exports to the most likely actual origin, i.e., the countries that mine PR.
To achieve this, we use the following algorithm. For each country
i that is not mining PR, we consider its excess P exports
given by entries in row
i in
,
We then calculate the import shares of country
i from countries
j that do mine. We define the set of these countries as
.
Equivalently, the export shares to all countries
j are given by
Further, we calculate the amount of imports into country
i that are used locally and are not exported, which is
In the flow matrix, we then add the excess exports of country
i to the exports of countries that mine PR, given by
. Here we make use of the fact that
is a row vector and
is a column vector. We then remove these exports from the export of the non-mining countries. Further, we must scale down the imports from the mining countries to the excess exporter by multiplying these imports by
, which means that we correct the imports to the amount that the country itself uses. We denote the resulting matrix as
. In total, this procedure removes about 5% of the flows accounted for in
. Since this matrix was normalized (see Equation (
8)), we must reinstate this normalization after our correction algorithm. To achieve this and to derive our improved flow matrix
, we have to consider that we should only re-scale the trade (off-diagonal) part of
, since the entries on the main diagonal were calculated from sources for which the absolute value in terms of P was known. The normalization that takes this into account is given by
The statistics in
Table 1 show that these approximate corrections do in fact improve the fit of the model significantly. Interestingly, the fit improves not only with the mining data but also with the use data, which indicates that the correction also has a positive indirect effect on the accuracy of the column sums of the flow matrix.
2.6. Correcting for Trade
In the previous section, we showed how we correct the data for countries that do not mine phosphate rock. For these countries, it was self-evident that they could not be the source of relevant phosphate flows. For countries that do mine PR, a similar procedure can be used to approximately correct for trade flows where an intermediary is involved in a trade chain. The most relevant case here is probably the USA, which is both an importer and an exporter of phosphate, as well as a producer. Until now, our calculations implicitly assumed that, for example, imports to the USA have completely been used in the USA and that all exports from the USA originate from locally mined PR. A more realistic assumption is, of course, that imported P is both used domestically and re-exported (albeit likely in processed form). The share that is re-exported must be attributed to the original source country.
To analyze such trade chains, it is useful to look at a flow matrix with the domestic markets removed and start by defining a matrix
. We will now look at all the countries with PR mining (countries with less than 1% of global PR production were excluded from this correction) and again label this set of countries as
m. We can calculate how much P can
potentially flow through a country
i as
The in- and outflows for a country
i in terms of shares of the total flow are given by the vectors
and
We define the share of the potential through-flow that we think should be re-accounted for as
. We can now correct the flows by adding the through-flow of each country under consideration to the outflow of the original exporter by the operation
We then have to reduce the outflow of country
i by
and reduce the inflow by
. Re-normalization of the resulting matrix (see Equation (
16)) gives
.
The parameter can be approximated by evaluating the fit of the model for values . We found the optimum to be , which means that countries that mine PR re-export on average 40% of the P that they import in addition to their own production. We note that in reality, longer chains of trade than considered here exist; however, we found that these are very likely not relevant for our results. This has two reasons. The first is that our analysis is already based on shares of net exports. The second reason is that for longer chains to be relevant, we would have to observe two countries with large trade volume in the middle of such a chain, which we do not. In fact, the only case where we do suspect longer chains of trade are smaller, remote countries (for example in Africa), which have only small P inflow. Also, for such countries, the trade data is often inaccurate, which makes 2nd-order corrections impractical.
The statistics of the improvement in fit from this step are shown in
Table 1. The difference between
and
is, in fact, relatively small compared with the previous correction steps.
2.7. Scaling Outflow to Mining Data
The last correction deals with the overall differences between the PR production data and the implied mining of
. Assuming that the PR mining data has a good overall level of accuracy (and is less distorted by price variations than the trade data), we can scale the entries in the rows of the mining countries in
in such a way that their sum will closely match
M. This operation is, of course, only useful as long as we do not negatively impact the column sums in terms of its prediction of fertilizer use. We note that this optional step is, in principle, an extension to the steps taken in
Section 2.3, with the difference that here we condition the entire row sums (and not only the diagonal elements) of the flow matrix on
M.
Again, we define a partial flow matrix without the local use as
. We then calculate how much P outflow should be available in each country. This share is given by
i.e., the mining in country
i minus the local use. We then calculate a correction factor
based on the ratio of observed versus expected outflow of country
i, given by
where
is a damping factor.
in this case. The damping factor is an optional parameter that can protect against adverse effects on the fit with the use data if necessary, see also the next section. A new flow matrix
can now be calculated by dividing the entries of rows
i in
by the corresponding correction factor
. We avoid correcting countries with very low PR production and trade flows by demanding that
and
. The matrix is then re-normalized equivalent to Equation (
16). The statistics in
Table 1 show that this correction in fact does improve the overall fit significantly. In particular, we increase the consistency of the flow matrix with the PR mining data without losing consistency with the fertilizer use data. We note that when comparing the fit with other models, one should consider that in this step, we endogenized
.
2.8. Simplifying the Estimation
A final consideration is whether the setup for finding the optimal weights of the product categories can be simplified. This is important in order to gain insights about the applicability of this method to other resources or goods. While for the case of P both the origin and the (preliminary) end use are relatively easy to determine, we can assume that for many other cases, use data will not be available in comparable accuracy, since the variety in terms of applications is typically much higher than it is for P.
This means that one will typically be in a situation where a product category weighting will have to be estimated based only on data on the origination. For the case of P, this would mean that we would modify Equation (
11), i.e., set the weight for the fertilizer use part
to 0.
We re-estimated the model described in
Section 2.7 with this setting and found that the resulting flow matrix
has a fit that is comparable with variants discussed in previous sections. We found that in this case, a slight dampening of the correction factor in Equation (
22) is useful by setting
. The fit with the mining data is very similar to that of
, while the fit with the use data is only slightly worse. This might be seen as a good indication that an application of the presented model to other cases with more limited data quality should be feasible.