Colombian Export Capabilities: Building the Firms-Products Network

In this paper, we analyse the bipartite Colombian firms-products network, throughout a period of five years, from 2010 to 2014. Our analysis depicts a strongly modular system, with several groups of firms specializing in the export of specific categories of products. These clusters have been detected by running the bipartite variant of the traditional modularity maximization, revealing a bi-modular structure. Interestingly, this finding is refined by applying a recently proposed algorithm for projecting bipartite networks on the layer of interest and, then, running the Louvain algorithm on the resulting monopartite representations. Important structural differences emerge upon comparing the Colombian firms-products network with the World Trade Web, in particular, the bipartite representation of the latter is not characterized by a similar block-structure, as the modularity maximization fails in revealing (bipartite) nodes clusters. This points out that economic systems behave differently at different scales: while countries tend to diversify their production—potentially exporting a large number of different products—firms specialize in exporting (substantially very limited) baskets of basically homogeneous products.


Introduction
Exporting activities of countries have remarkable signals of complexity. By tradition, the understanding of the international trade has been of interest to politicians and economists. More recently, with the surge of the complex networks theory, the understanding of international trade has been enriched, providing information about the structure of industries and how it relates with countries growth, income, and development [1,2]. This paper provides new and inspiring evidence on the study of the productive capacity of a nation based on its exports. More precisely, we analyze the bipartite network of Colombian exports using data at the firm level for the period 2010-2014 by employing tools developed within the field of information theory and complex networks analysis [3].
The bipartite network derives from considering the type of products exported by firms, ending up with two layers: firms and products. We are interested in the understanding of the projections on those layers, carefully dealing with the statistical significance of the similarity measure employed. Our approach is based on the maximization of the constrained Shannon entropy with the available information about the system at hand: since we are interested in detecting common patterns of economic activities characterizing Colombian firms, we employ properly defined constraints to define benchmarks for testing the statistical significance of the observed patterns. As a result, the building blocks in our analysis rely on connecting nodes by their similarity in both layers: firms are connected because they export a significantly large number of common products and products are connected because they are jointly exported by a significantly large set of firms.
Diversification of products is related to the growth of firms by the expansion into new activities or markets. It has been argued that firms accumulate specific capabilities that can be used to produce different products or to enter different industries [4]. From the perspective of cost minimizing firms, economies of scope are revealed when the cost of joint production of different products is less than the cost of producing all of them separately [5]. In this sense, considering that firms' capabilities are revealed in production intensity and product portfolio, the focus has been on the understanding on why firms diversify, how diversification emerges, and its relation to firms' performance [6][7][8].
At the level of international trade, the network analysis made possible to build taxonomies of products and countries that allowed a better understanding of countries' exporting capabilities [9][10][11]. Challenging the Ricardo's fundamental comparative advantage theory, the surprising outcome was that the bipartite matrix was triangular (sorting by countries capabilities) rather than block-diagonal [12]. Nevertheless, developed countries have the capabilities to produce and export a wide variety of sophisticated and unsophisticated products, while the developing countries reveal much more restricted capacities that are related to the exports of less complex products.
There are important differences at comparing the results of the present paper with those in international trade. A firm is certainly much more constrained in its production, both in scale (volumes) and scope (number of different products). Therefore, the expected outcome of the present study must be by construction different. Obviously, we attempt to go deeply in the well-studied complexities of the economic systems giving important insights of the fitness of countries. In addition, more precisely, we aim to provide inputs to understand the process of firms product diversification as a consequence of combinations of (unobserved) capabilities. In this sense, we could assume that the higher the capabilities of a firm, the higher the number of the exported products. Thus, we attempt to discover if also at the country level firms and products get together in meaningful communities, and also we aim to recognize the differences and similarities between the micro and macro levels: firms versus countries.
The paper is organized as follows. In Section 2 we describe the dataset and the data cleaning process. In Section 3 we provide a detailed explanation of the methods employed for the present analysis. In Section 4 we illustrate the results of our study of the Colombian firms-products network and compare them with the corresponding ones, observed in the World Trade Web. Finally, we conclude with a general discussion of our results.

Colombian Export Data
We study the Colombian exports as a bipartite, undirected, binary network: firms and products constitute the nodes of the two different layers and intra-layer links are not permitted. For the description of the Colombian firms-products (CFP in what follows) network, we use all export transactions of manufacturing products reported at the Colombian Customs Office (Dirección de Impuestos y Aduanas Nacionales, DIAN) and collected by the Colombian Bureau of Statistics (Departamento Administrativo Nacional de Estadística, DANE), for the period 2010 and 2014. We removed all transactions related to re-exports of products elaborated in other countries. Each shipment has a unique seller ID, which we use as the firm identifier, the date, a 6-digit harmonized system (hs) characterizing the product, the destination and the US dollar value of the transaction. For the 2010-2011 period, products are described by hs2007 coding while, in the following period, by hs2012 coding (6-digits in both cases). For the comparison, we used the conversion table provide by the UN [13].

World Trade Web Data
Data concerning the World Trade Web (WTW) are represented as a bipartite, undirected, binary network as well: countries and products constitute the nodes of the two different layers and intra-layer links are not permitted. For our analyses we use the BACI World Trade Database [14]. Products are described by hs2007 coding at 4 digits.

Data Cleaning Procedure
We filter out small firms in the CFP and small countries in the WTW, since they would bring very little information. In the CFP, we removed firms with a total yearly export volume lower than current 10 4 USD (results do not change upon varying such a threshold). In addition, a Revealed Comparative Advantage (RCA) [15] threshold is applied. This procedure, which is standard in the analysis of international trade, consists in comparing, for every firm/country, the share of each export product value with the global (i.e., over the entire dataset) analogous. In formulas, if w ip is the value of the export of the firm/country i for the product p, the RCA reads: where we have defined the strength of firm/country i as s i = ∑ p w ip , the strength of product p as t p = ∑ i w i p and W = ∑ i ,p w i p is the total, exported volume. If the firm/country share is greater than the global share, i.e., if RCA ≥ 1 (or, equivalently, w ip ≥ s i t p W ) its "exporting performance" on the given commodity is interpreted as being above-average and the entry is validated. The output of this cleaning procedure is a rectangular binary matrix M (i.e., the biadjacency matrix of our bipartite, undirected, binary network). In the case of the CFP biadjacency matrix M CFP , the number of firms and products will be indicated, respectively, as F and P. For simplicity, we omitted temporal subscripts, but in the case of the CFP both F and P vary over time. The matrix generic entry m ip is 1 if firm/country i exports an amount of product p above the RCA threshold; otherwise, m ip = 0. Each row represents the basket of products of a given firm/country and, similarly, each column represents the set of exporters of a given product.
Interestingly, the obtained bipartite networks have different connectances depending on the system analyzed: the WTW density of links ranges from 0.09 to 0.13, while in the Colombian dataset its order of magnitude is steadily 10 −3 . The percentage of links validated by the RCA thresholding procedure differs for the Colombian national exports and for the WTW, passing from 0.9 for the CFP to 0.2 for the WTW (see also Figure 1). This implies that there is no such a big difference between the topological structure of the Colombian export network and of its binarized version obtained by employing the RCA threshold. In turn, this means that there is relatively low competition from national producers in the products exported by firms.
As a general observation, while the number of products of the CFP network remains practically constant throughout the considered temporal period (P 3100), the number of firms keeps increasing, moving from F 4800 in 2010 to F 5100 in 2014. Moreover, while the number of persistent firms (i.e., the ones that are present throughout the entire time period) is 2240, the number of persistent products is 2400, i.e., a high percentage of the total. For the sake of the comparison, the WTW is characterized by several countries 140 and several products 1131 throughout all years.

Methods
To analyse the Colombian firms-products network we apply a recently proposed algorithm to obtain monopartite representations of bipartite networks. Since the method is perfectly general, in what follows we will index the nodes corresponding to the rows of the biadjacency matrix with the letter r = 1, . . . , R and those corresponding to the columns of it with the letter c = 1, . . . , C. In our specific case, firms lie along the rows and products lie along the columns.
As in [3], we implement the following four-steps recipe, prescribing to (a) choose a specific pair of nodes belonging to the layer of interest, say r and r , and measure their similarity, (b) quantify the statistical significance of the measured similarity with respect to a properly defined null model, (c) link nodes r and r if, and only if, such a similarity is found to be significant, (d) repeat the steps above for every pair of nodes.

Measuring Nodes Similarity
The most straightforward approach to quantify nodes similarity is counting the number of common neighbors V rr shared by nodes r and r and defined as: In (2) we adopted the definition V c rr ≡ m rc m r c for the basic quantity of our approach, satisfying the relationship V c rr = 1 if, and only if, both r and r share the (common) neighbor c. In other words, we quantify the similarity between any two Colombian firms (products) by counting the number of their co-exported products (co-exporting firms). As discussed in [8,16], alternative measures for quantifying similarity-also known with the name of relatedness-indeed exist. Our choice has been dictated by the intuitive meaning of the number of co-occurrences, beside its analytical tractability under the null hypothesis described below.

Quantifying the Significance of Nodes Similarity
The second step of our algorithm prescribes to quantify the statistical significance of the similarity of r and r . To this aim, a benchmark is needed: a natural choice leads to adopt the Exponential Random Graph (ERG) class of null-models [17][18][19][20][21]. Briefly speaking, the ERG formalism rests upon the constrained maximization of Shannon entropy, a procedure aimed at maximizing the uncertainty about the system at hand except for what is known about it (represented by the constraints). Such a recipe ensures that a maximally unbiased inference is carried out, thus minimizing the risk of drawing unsupported conclusions about our data. , whence the name of the formalism. The numerical value of P(M) is, then, determined by the vector C(M) of topological constraints [17]. To determine the unknown parameters θ, the likelihood-maximization recipe can be adopted: given an observed biadjacency matrix M * , it translates into solving the system of equations C ( θ) = ∑ M P(M) C(M) = C(M * ), which prescribes to equate the ensemble averages C ( θ) to their observed counterparts, C(M * ) [18,19]. The null model we have considered in the present paper is known as the Bipartite Configuration Model (BiCM) [21] and is defined by constraining the degrees of nodes belonging to both layers. Enforcing the nodes degrees allows us to write P(M) in a factorized form, i.e., as the product of pair-specific probability coefficients: depending on the firms' degree d r , r = 1, . . . , R and on the products degree u c , c = 1, . . . , C (whose names, diversification and ubiquity, we mutuate from the country-specific analogue [10]) x r and y c being the Lagrange multipliers associated with the constrained degrees. Under the BiCM, the probability p rc that a link exists between firm r and product c) reads: its numerical value being determined by the likelihood-maximization conditions: prescribing that the average values of firms diversification and products ubiquities match the observed counterparts { d * } and { u * }.
Since ERG models with linear constraints treat links as independent random variables, the presence of each V c rr can be regarded as the outcome of a Bernoulli trial: f Ber (V c rr = 0) = 1 − p rc p r c ; it follows that, once r and r are chosen, the events describing the presence of the single V c rr patterns are independent random experiments: this, in turn, implies that each V rr is a sum of independent Bernoulli trials, each one described by a different probability coefficient. The distribution describing the behavior of each V rr turns out to be the so-called Poisson-Binomial [3]. Measuring the statistical significance of the nodes similarity r and r translates into calculating a p-value on the aforementioned Poisson-Binomial distribution, i.e., the probability of observing several patterns greater than, or equal to, the observed one (which will be indicated as V * rr ): Upon repeating such a procedure for each pair of nodes, we obtain a R × R matrix of p-values [3,22]. Since this operation is computationally costly, for sparse networks it is possible to rest upon the approximation of the Poisson-Binomial distribution prescribing to substitute the variable V rr with a Poisson variable having the same expected value µ. The error of such an approximation is quantified by Le Cam's theorem [22][23][24]: A comparison of the BiCM with alternative null models has been carried out in a similar-in-spirit analysis [25].

Validating the Projection
To understand which p-values are to be retained, a statistical procedure accounting for the fact that we are testing multiple hypotheses at a time is needed. In the present paper we apply the so-called False Discovery Rate (FDR) procedure [26]. Whenever M different hypotheses, H 1 , . . . , H M , characterized by M different p-values, must be tested at a time, FDR prescribes to, first, sort the M p-values in increasing order, p-value 1 ≤ · · · ≤ p-value M and, then, to identify the largest integerî satisfying the condition with t representing the usual single-test significance level (e.g., t = 0.05 or t = 0.01). The third step of the FDR procedure prescribes to reject all the hypotheses whose p-values are less than, or equal to, p-valueˆi (i.e., all p-values satisfying p-value 1 ≤ · · · ≤ p-valueˆi). Notably, FDR allows one to control for the expected number of false "discoveries" (i.e., incorrectly rejected null hypotheses), irrespectively of the independence of the hypotheses tested [27]. In our case, the FDR prescription translates into adopting either the thresholdît/( R 2 ) or the thresholdît/( C 2 ) depending on the layer whose projection we are interested in. For the sake of clarity, each pair of nodes whose corresponding p-value passes the FDR validation is joined by a binary, undirected link in the corresponding projection. In what follows, we have used a single-test significance level of t = 0.05.

Testing the Projection Algorithm
To test the performance of our method, the Louvain algorithm [28] has been run on the validated projections of the real networks considered for the present analysis. Since Louvain algorithm is known to be order-dependent [29], we considered several outcomes of the former equal to the size of the projected network-each one obtained by randomly reshuffling the order of nodes taken as input-and chose the one providing the maximum value of the modularity [3,29] 3.1. 4

. Statistical Analysis
Our ensemble method allows the statistical significance of a large number of topological quantities to be tested. To quantify to what extent the considered null model is able to capture the real structure of the network, one can compare the observed and expected value of any quantity of interest X via the so-called z-score, defined as are the sampling moments, computed according to the sampling frequenciesP(M). The latter are the sampling-induced approximations of the ensemble frequencies P(M), computed by explicitly generating a sufficiently large number of network configurations (in our case, 1000). Whenever the ensemble distribution of the quantity X closely follows a Gaussian, z-scores can be attributed the usual meaning of standardized variables, enclosing the 99.7% of the probability distribution within the range z X ∈ [−3, 3]: any discrepancy between observations and expectations leading to values |z X | > 3 can thus be interpreted as statistically significant.
However, when the ensemble distribution of the quantity X deviates from a Gaussian, z-scores cannot be interpreted in the aforementioned way and an alternative procedure is needed: here, we have computed the box plots. Box plots are intended to sum up a whole probability distribution by showing no more than five percentiles: the 25th percentile, the 50th percentile and the 75th percentile (usually drawn as three lines delimitating a central box), plus the 0.15th and the 99.85th percentiles (usually drawn as whiskers lying at the opposite sides of the box). Box plots can, thus, be used to assess the statistical significance of the observed value of X against the null value output by the BiCM.

Results
The results of our analysis refer to the year 2010 for both the CFP network and the WTW. However, they are robust across years.

Node Degree and Strength Distributions
Let us start by describing some empirical findings about the system under analysis, concerning the distribution of firms and products strengths. As Figure 2 shows, both distributions seem to be well fitted by a log-normal (as confirmed by a Kolmogorov-Smirnov (KS) test that does not reject such an hypothesis) in agreement with the the evidence for the monopartite WTW [30] and also at the firm level [31,32]. Let us now move to the description of the degree distributions of firms and products. Heavy-tailed distributions for the values of the aforementioned quantities can be observed, across all considered years: the KS test does not reject the hypothesis that both distributions are compatible with power-laws (see the top panels of Figure 3). As a comparison, let us consider the distributions describing the countries and products degrees of the WTW: as the bottom panel of Figure 3 shows, the distribution of the products degrees is compatible with a log-normal. On the other hand, although the distribution of the firms degrees appears to be a bit more noisy, a KS test does not reject the hypothesis that it follows a power-law.

Nodes Degrees and Strengths Correlation
The relationship between nodes degrees and strengths is found to be close related in several economic and financial networks, be they monopartite [33] or bipartite [34]. Briefly speaking, strengths are found to be positively correlated with the degrees, reflecting the fact that countries with a larger number of neighbors are also the ones exporting a larger volume of products. This seems to hold true also for the CFP network, as Figure 4 shows. There is huge heterogeneity, especially for firms with medium to large strength. While it is expected that firms with relatively low total exports have a low scope, firms with higher total trade can be very specialized in a few products (even a single product) or highly diversified. As a consequence, this pattern is observed in the product projection, some products are intensively produced by a few firms and some are ubiquitous and produced intensively by many firms.

Specialization vs. Diversification at a National and International Level
From the times of Adam Smith and David Ricardo, it is recognized that international trade increases specialization, leading to bilateral benefits, despite countries' differences in technologies and wages. This vision can explain fairly well the inter-industry trade, but it leaves out one of the most important trade modes: the intra-industry trade. The modern approaches aim to solve this by introducing the love for variety [35,36]. Indeed diversification of the product basket has been shown to be a good symptom of economic development: developed countries export more intensively and also export a wider basket of goods than their developing counterparts [37].
The very first analyses of the bipartite representation of the WTW showed an unexpected triangular shape of the biadjacency matrix [9,38]. Indeed, the result was striking, since it showed for the first time the presence of a tendency of countries to diversification: even the most developed countries do not abandon the production of the most basic products, while they enlarge their export basket towards most sophisticated goods. Actually, the picture is less simple than that: although countries tend to diversify their exports, a signal of specialization is still present. Otherwise stated, the observed level of diversification (quantified by the country node degrees) cannot explain a residual tendency of firms to focus on certain classes of products [39].
To capture the productive capabilities of countries, several measures were proposed: the very first proposals [9,38] show several flaws and defects [2]. The Fitness and Complexity algorithm [11] (FiCo, in the following) outperforms other competitors in terms of accuracy of predictions [40]. The FiCo procedure accounts for the non-linearity of the system via a recursive algorithm: the performance of a country (quantified by a fitness) depends both on the "quality" of the exported products (described by shades of complexity) and on the fitness of their exporters. Indeed, the success of the FiCo algorithm relies upon the structure of the bipartite WTW representation: the algorithm rewards countries according to the variety and complexity of their export baskets.
The FiCo algorithm is also able to highlight the "triangularity" of the biadjacency matrix describing a given system [41]: as Figure 5 shows, if rows are re-ordered by fitness and columns by complexity, the non-zero entries of the biadjacency matrix appear as "packed" together. When considering the WTW this becomes particularly evident, as the top panel of Figure 5 shows; analogously, when the FiCo algorithm is applied to the Colombian national export dataset, a triangular structure is revealed as well, as shown in the central panel of Figure 5 [42,43].  central and bottom panel). The ratio between the x-and y-axes was modify to permit an easier comparison between the shape o the matrices. Columns represent products and rows countries (WTW) or firms (CFP). In the central panel, rows and columns of the biadjacency matrix are ordered according to the FiCo ranking, while in the bottom panel the (bipartite) communities found via the Barber algorithm are highlighted [44]. The FiCo algorithm, thus, hides the block-structure characterizing the national exports of Colombia.
Interestingly, the triangular structure revealed by the re-ordering induced by fitness and complexity co-exists with a quite different structure. Upon running the Barber community detection algorithm [29,44], based on the bipartite extension of Newman's modularity, a block-wise structure, in fact, emerges (see the bottom panel of Figure 5): in a sense, thus, the FiCo algorithm covers the specialization signal present in the original dataset. While at the firm level we indeed expect to observe specialization, since it is unlikely that a company may export all possible products (or even a large percentage of them), this is not true for the WTW, to which the application the same algorithm does not lead to detect any block-wise structure.

Nestedness
The analysis of nestedness (here we adopted the so-called NODF, i.e., Nestedness metric based on Overlap and Decreasing Fill [45]) allows the picture provided by the FiCo algorithm to be further refined. As Figure 6 shows, the z-score of nestedness is steadily negative across our temporal snapshots, i.e., z nestedness −11: in other words, the observed CFP network configurations are significantly less nested than expected, a result that confirms our previous finding concerning the (bipartite) block-structure of the system under analysis. This result further points out that constraining the nodes degrees leads to enforcing some kind of nestedness as well, whose value is (significantly) larger than the observed one. Figure 6. Evolution of the empirical nestedness (NODF) values and of the BiCM-induced ensemble distributions of the same quantity, compactly represented by the box-plots (showing the 0.15th, the 25th, the 50th, the 75th and the 99.85th percentiles). The CFP network is characterized by a nestedness whose empirical value is significantly less than expected.

Projecting the Colombian Firms-Products Network
Let us now move to considering the projections of the CFP network. This kind of analysis complements the results found by running the bipartite community detection shown in Figure 5, by making the hidden relationships between nodes belonging to the same layer explicit.
For what concerns the projection of the CFP network on the layer of products, a persistent structure is observable throughout the years, with the main communities remaining approximately the same (see Figure 7, showing the projections corresponding to the years 2010, 2014 and the related partitions). More in detail, while the total number of connected components is always 100 (some of them are so small that can be neglected for all practical purposes), several larger connected components (1-3), characterized by an internal community structure, is observable: additionally, while smaller communities are more homogeneous, the larger ones are more heterogeneous. In any case, the following clusters of products are observed across all temporal snapshots: clothes, industrial supplies, bodycare products and related chemicals, fabrics and textiles, cotton fabrics, food, electronic devices, metal products, construction companies supplies, domestic appliances, leather and footwear, stationery, wood and glass objects.
It is also interesting to notice how different communities, characterizing the CFP network projected on the products layer and partitioning the same connected component, are linked. Some examples follow: in 2011 and in 2014, the "food" community and the "bodycare" community are connected through the product "organic soaps and essential oils"; in 2013, instead, the "food" community is linked to the "medicines and other chemicals" community through the product "vitamins"; in 2014 the "fabrics" community is connected to the "clothes" community by a single link, joining the "girls' undershirts" product and the "knitted fabrics" product. Comparing the communities found in this way on the projected network, our partition is pretty consistent with the one found via the Barber algorithm on the original bipartite network (Figure 7). The Barber communities are usually made up of one or more of the projected communities, plus some of the other nodes not belonging to the giant component (they are indeed connected to their community in the original network). The correlation between the two partitions is measured by the Variation of Information [46], that in our case is between 0.52 and 0.58 for all five years considered and for both firms and products networks. Since the Variation of Information is not easy to interpret, it is also possible to measure the fraction of nodes that get included from one of our communities in a Barber one, making a "best correspondence" between them. We measure the inclusion of our partition A into the Barber partition B as In our case, this quantity is between 0.85 and 0.92 for all of the years, which means that about 90% of the nodes are in a community that is consistent in the two partitions.
The projected CFP network on the firms layer is characterized by a persistent structure throughout our temporal interval as well: this is however denser and composed by larger, isolated components than the projection on the products layer (see Figure 8).

Comparison with the WTW
Let us conclude this section with some remarks: in [3,39] the validated projection of the WTW was presented. While the BiCM-induced validation on the layer of countries outputs a clear structure in which countries sharing similar productive capabilities gather in communities, it is necessary to relax the constraints defining the entropy maximization to have a similar projection on the products layer [3,39]. From an information-theoretic point of view, the imposed constraints seem to be enough to explain the actual co-occurrences between products: this may be due to the large asymmetry between the cardinality of the countries-and the products-layer, letting the heterogeneity of countries degrees encode all relevant information [3]. Employing a less complex null model, in fact, leads to a projection with a rich structure whose communities of products can be related to the industrialization level of the related exporters: in other words, communities are not defined by homogeneous products but by those that can be efficiently exported by countries with strong industrial capabilities (e.g., metal products, tramway locomotives, tires, and turbines belong to the same cluster).
The case of the national exports of Colombia is essentially different in two main respects: first, we used the BiCM as a benchmark for the projection on both layers; second, the product categories are more clearly defined. This behavior is partly due to the block-wise structure of the system, as already noticed in the previous sections. Finally, let us stress that an intrinsic difference between the bipartite blocks detected by the Barber algorithm and the validated communities characterizing the projections exists. Barber's bipartite modularity compares the local link density with its expected value, thus considering as contributions to the network community structure even small, although positive, fluctuations; the detection of communities on the projections is, instead, enhanced by the preliminary validation implemented via the algorithm introduced in [3].

Discussion
In this article we performed a pioneering analysis of the international trade patterns using country firm-level information and employing a complex network methodology. More precisely, we studied the bipartite Colombian firms-products network, using an approach based on the maximization of the constrained Shannon entropy with the available information about the system at hand. This allowed us to detect interesting patterns of economic activities characterizing Colombian firms and products. With the aim of better understanding our results, we kept as a benchmark what we know about the World Trade Web, whose bipartite network (countries-products) has been widely studied in recent years [3,39,47].
Both systems have remarkable different organizational structures. The matrix associated with the CFP network is comparatively much sparser than the matrix associated with the WTW network. In fact, while in the WTW network the industrialized countries have the capabilities to export a large number of diverse products, this would be impossible even for the largest companies in Colombia. We showed that the matrix of the CFP network can be re-organized in a block-wise fashion, suggesting the presence of firms specialized in the production of product sets. This is in sharp contrast with the WTW network, whose matrix representation has a genuinely triangular structure.
In this way, we obtained projections in which it was possible to distinguish that both firms and products were organized in communities. We also explicitly notice that the product communities observed in the CFP resemble the WTW communities (although the latter are composed by less homogeneous products than the former ones), being the main similarity between these networks. Although it may be obvious for the economic thinking, it turns out to be interesting verifying that non observable capabilities to produce different products can be recognized at different levels of aggregation, i.e., at both the firm and country levels.
The evidence that diversifying is much more difficult for firms than for countries, in turn leading to little competition at the Colombian, national scale, also affects the performance of the FiCo algorithm in providing information about the "quality" of firms and products. Indeed, the FiCo algorithm takes advantage of the triangular shape of the biadjacency matrix. Indeed, it is able to go beyond the degree sequence and highlight those countries that are able to export items that only countries with similar factor endowments and technological capabilities are able to export. In the case of the CFP, there is another crucial point, which is the strong bipartite community structure. Even going beyond the degree sequence, still the community structure is not taken into account. Because of this, the predictions of the FiCo algorithm are going to be weaker on the CFP, with respect to those on the WTW. This is particularly evident when comparing the correlation between the firms' degree and fitness with that of countries degree and fitness (and analogously for what concerns the products-see Figure 9). Generally speaking, the different behavior of economic systems at different scales is reminiscent of the behavior of ecosystems, with more massive species being characterized by a larger metabolic activity. From this point of view, countries behave like massive species, capable of diversifying their production: firms, on the other hand, are characterized by a much more limited activity, focusing on sectors of products. Indeed, this reflects into the different topological structure of the considered systems, as proven by nestedness, whose observed value is compatible with the BiCM prediction in the case of the WTW [21]) but is not for the Colombian national export dataset.
The present paper is intended to be the first step of a more extended analysis of national export trade networks. Indeed, due to the recent interest in local nestedness [48], it is intriguing the possibility of the analysis in greater details of the structure of the blocks found in Figure 5. We leave these studies for further research, due to the implications on the understanding of the whole system.