Over- and Underyielding in Time and Space in Experiments with Mixed Stands of Scots Pine and Norway Spruce

Pine-spruce forests are one of the commonest mixed forest types in Europe and both tree species are very important for wood supply. This study summarized nine European studies with Scots pine and Norway spruce where a mixed-species stand and both monocultures were located in an experimental set-up. Overyielding (where growth of a mixed stand was greater than the average of both monocultures) was relatively common and often ranged between 0% and 30%, but could also be negative at individual study sites. Each individual site demonstrated consistent patterns of the mixing effect over different measurement periods. Transgressive overyielding (where the mixed-species stand was more productive than either of the monocultures) was found at three study sites, while a monoculture was more productive on the other sites. Large variation between study sites indicated that the existing experiments do not fully represent the extensive region where this mixed pine-spruce forest can occur. Pooled increment data displayed a negative influence of latitude and stand age on the mixing effect of those tree species in forests younger than 70 years.


Introduction
Norway spruce (Picea abies (L.) Karst.) and Scots pine (Pinus sylvestris L.) are among the commonest tree species in northern European forests and together cover more than 150 million hectares [1,2]. Both tree species can co-exist on a wide range of sites, but late-successional Norway spruce prevails Figure 1. The overlapping area of the natural distribution of Norway spruce and Scots pine in Northern and Central Europe (dark grey color), and the distribution of study sites: White squares with black dots indicate sites containing all three treatments, white squares indicate sites with age up to 10 years (FIN) or lacking the treatment "Norway spruce monoculture" (POL, SE4 1090, SE4 1092). Source: EUFORGEN 2013 GIS database [5]. The overlapping area of the natural distribution of Norway spruce and Scots pine in Northern and Central Europe (dark grey color), and the distribution of study sites: White squares with black dots indicate sites containing all three treatments, white squares indicate sites with age up to 10 years (FIN) or lacking the treatment "Norway spruce monoculture" (POL, SE4 1090, SE4 1092). Source: EUFORGEN 2013 GIS database [5]. Recently, there has been increasing interest in tree species mixtures as they seem to provide a more complete suite of forest ecosystem services [6,7]. Felton et al. [8] anticipated biodiversity benefits from the conversion from spruce monocultures to pine and spruce mixtures. In order to quantify the potential impacts on wood production and the trade-offs in ecosystem services, it is important to know how the growth and yield of mixed pine-spruce stands can differ from those of monocultures on similar sites. In addition, as wood production per unit area is relevant to carbon storage and climate change policies, and is a major driver of silvicultural decisions, better knowledge about the comparative growth performance of mixed and monospecific stands on sites that are suitable for both tree species is crucial. Niche complementarity could occur between Scots pine and Norway spruce on the basis of certain traits such as the light-demanding pine versus the shade-tolerant spruce, or because of different rooting depth (Table 1). Furthermore, resistance can increase in these mixtures, for example, a lower incidence of Heterobasidion annosum in mixed stands [9]. This trend is probably not true of all pests; for example, browsing damage was higher or similar in mixed stands [10,11]. Table 1. Possible tree species interactions in mixture according to Forrester [12], and evidence for such interactions from investigations in Scots pine-Norway spruce stands.

Possible Species Interactions Interactions Found in Mixed Scots Pine-Norway Spruce Forests
With influence on nutrient availability Mostly overlapping fine-roots [13], increased soil nitrogen when spruce was grown in mixture [14] With influence on water availability Overlapping fine-roots [13], higher water stress in boreal mixed forest, and reduced water stress in hemiboreal mixed forest [15] Influencing light absorption and use Temporal effects and interactions between resources Lower incidence of Heterobasidion annosum [9], higher or similar browsing damage [10] Stand structure Less storm damage [16] Comparisons of productivity between mixed-and single-species stands can refer to the most productive monoculture or the average productivity of the corresponding monocultures. Reports on mixed pine-spruce experiments in Scandinavia [17][18][19] found 7%-26% higher productivity in mixed species forests compared with the average growth of both monocultures (common overyielding). However, growth models widely applied in these regions (e.g., [20,21]) assume no positive interactions between species growing in mixture.
No evidence of transgressive overyielding, i.e., higher productivity in a mixed stand compared to the most productive monoculture, was found in Scandinavia [17][18][19] (although this was found by Pretzsch and Schütze in another study [22]). Lindén and Agestam [18] found temporal changes in the relative yield in mixed stands, which increased considerably during the last measurement periods on two sites while it decreased on another site. By contrast, reports on the first [14] and second rotation [23] of two experiments on the same site in northern England indicated a positive interaction between both tree species, resulting in a transgressive overyielding in the mixture. Similar conclusions were drawn by [24] who analyzed Polish yield data monitored for more than 100 years.
Although mixing effects at single sites (or ages) can represent significant increases in productivity, the mean mixing effect for the same mixture across a wide range of sites (or over a whole rotation) can be smaller and nonsignificant [25]. In order to understand how the mixing of these tree species affects productivity at larger scale, we analyzed the accessible increment data from known Scots pine-Norway spruce mixed forest experiments in Europe. As contrasting effects of mixing on stand growth have been reported for varying climate, soil conditions, and mixture proportions [12,26], we included an evaluation of site factors potentially influencing growth that were available from the experiment description.
We particularly searched for evidence of transgressive overyielding since this aspect is most important for any forest manager who aims to maximize wood production (Question 1 below). However, we also compared stand growth of mixed pine-spruce stands with the average stand growth of both monocultures (over-or underyielding, Question 2). The latter approach corresponds with most of the recently published growth studies in mixed forests regarding biodiversity-productivity relationships, e.g., [27,28]. The main objective of this study was to analyze the effect of species mixing of Scots pine and Norway spruce on forest productivity at a stand level in order to answer the following three research questions:

1.
Are Scots pine-Norway spruce mixed stands more productive than the most productive monoculture of either tree species (transgressive overyielding)? 2.
Are Scots pine-Norway spruce mixtures more productive than the average of both corresponding monocultures (overyielding)? 3.
Does the mixing effect change significantly over stand age or with site conditions?

Material: Study Sites
We collected information about stand growth from nine studies in Northern and Central Europe ( Figure 1 and Table 2), most containing the following three treatments: a Scots pine-Norway spruce mixed stand, a pure Scots pine stand, and a pure Norway spruce stand. All treatments within a study were of comparable age and growing in similar site conditions. However, some studies contained several study sites, but not every site contained all treatments (see below). One study was lacking the treatment "pure Norway spruce" [24]. To our knowledge, there are no other long-term experiments containing pure and mixed Scots pine and Norway spruce stands in Europe.  1  4  1  3  1  1  3  1  3  N of plots with pure pine  10  4  10  3  3  3  6  1  6  N of plots with pure spruce  10  4  10  1  3  3  -1  6  N of plots with mixture  10  4  10  4  3  3  Six of the nine experiments have already been reported: SE1 [18], SE3 [17], SE4 [19], UK1 [14,29], repeated on the same site UK2 [23], and POL [24]. The study SE3 was complemented by additional measurements by the original authors in 2013. The experimental layout for SE2 was described by [30] and for FIN by [31]. The study site GER was named study site SRO in [32].
Four of the nine studies contained replicated treatments on more than one study site. Hence, in total, 18 study sites with rectangular study plots were included in this survey. Growth and mortality of Scots pine and Norway spruce had been monitored for single measurement periods of 3 to 33 years. On five of the 18 sites, pure Norway spruce stands were not included in the experiment (two sites of SE4 and the three sites of POL). In the study GER, the monospecific Norway spruce plot was only established in 2015. On this plot, former dendrometric measures for each tree were derived by diameter and height reconstruction (according to Pretzsch et al. [26]).
Most of the studied stands were planted ( Table 2). One mixed-species stand in POL and all SE3 stands were seeded. In all experiments, the even-aged mixtures contained tree species intermingled either on a tree-by-tree basis or in very small groups. Some spruce trees in the mixed-species plots in Germany and Poland could have originated from natural regeneration and may differ slightly in age. Except SE4, management was excluded or minimized after stand establishment in all experiments. In the British, Swedish, and Finnish experiments, the various treatments in each experiment were established using a common management regime. By contrast, the Polish and German study plots were established in existing mixed-and single-species stands; these plots were on the same sites and of similar age, but stand history before establishment of the study plots is unknown. Thinnings were carried out in SE4, keeping an equal total stem number in all treatments after thinning ( Table 2). In POL and GER, only study plots which had been unthinned or very lightly thinned in the past were selected (i.e., A-and B-grade thinning intensities, see Pretzsch [33], pp. 157-158). That means that they grew close to the self-thinning line, and the growth was largely unaffected by thinning. Table 2 summarizes the experimental design that varied between the studies. The experiments UK1, UK2, SE1, SE2, SE3, SE4, and FIN were designed as randomized block experiments to study the effect of one factor (the forest type-in our case, pine, spruce, and mixed pine-spruce) on stand growth. However, no monospecific spruce forest was included on two of the three sites of SE4, because pure spruce did not occur on these sites. The number of study sites and replicated treatments can be seen in Table 2. In SE1 and SE3, subplots within each 35 × 40 m study plot for the replicated treatments were established [17,18]. UK1, UK2, and FIN contained three replicates of each treatment at each study site. POL included four replicated treatments of monospecific pine and mixed forest on the site Maskulinskie, but there was no replication at other sites in Poland or Germany. Also, the selection of plots for each treatment was not randomized in the studies POL and GER.
We compensated for any shortcomings in different study designs in further statistical analyses by calculating mean growth and yield parameters individually, as well as standard deviations for each of 18 study sites. The studies SE2, SE4, POL, and FIN comprised blocks on three or four separate study sites. While most study plots per study site were located adjacent to each other (with a buffer zone), the largest distances between study plots were 50-800 m on Polish sites and 400 m on the German site.
The size of sample plots was largest in POL and smallest in FIN and SE1-3 (Table 2). In SE1 and SE3, each study plot was measured by 12 sample plots of 28 m 2 [17,18]. In most cases, the mixture proportion was equal when the experiment was established, except in POL where spruce represented one-third or less of the total basal area (BA) and in SE2 where it varied from 50% to 90%.
The studies covered a large latitudinal range with growing seasons varying from 130 days per year in Northern Sweden to more than 200 in the U.K. (Table 3). The understorey vegetation was dominated by Vaccinium myrtillus on mainly mesic and moist forest sites. The study site in Table 3 refers to the same site with adjacent study plots. The experimental site in the U.K. differed in vegetation and the soil moisture regime can be characterized as very moist or wet. Podzolic soils were the main soil types present across the sites, but gleys and brown earths were also found. Mean annual temperature ranged from −0.5 • C to +8 • C. Table 3. Selected site and stand characteristics of the 18 study sites within the nine study areas. The site index SI was calculated specifically describing dominant trees in monospecific stands in each country, using the methods by Hägglund and Lundmark [34] for Swedish sites, by Edwards and Christie [35] for the site in the UK, by Hynynen et al. [21] for the Finnish site, Assmann and Franz [36] and Wiedemann [37] for the German site, and Bruchwald [38] for Polish sites. As further notes, the sites Maskulinskie and Strzałowo (POL) were affected by a large Lymantria monacha outbreak that heavily reduced the growth of pine: growth rates more than two times higher were observed in the mixture during the subsequent measurement period due to the response of spruce to the improved light conditions. Wind damage forced the early closure of UK1. Slight single-tree wind damages were also reported from other experiments.

Methods
To answer our three research questions, we performed two analyses: an analysis (A) of stand growth on each study site showing absolute and relative production values within each experiment (Question 1 and 2), and an analysis (B) of mixing effects combining the ten study sites that contain all three treatments with single measurement periods into one data set (Question 3).

Quantifying of Mixing Effects
For the comparison of stand productivity P of pine and spruce in mixed stands with the most productive monoculture, Equation (1) was used to provide a measure of transgressive overyielding [39].
In the equations, the suffix X indicates a tree species growing in a mixed stand and 0 indicates a tree species growing in a monospecific stand. For the Norway spruce-Scots pine mixture with respect to average productivity of both monocultures of each species, Equation (2) was used to calculate the mixing effect ME which provides a measure of common overyielding [39].
The ratios ME Pine and ME Spruce in Equations (3) and (4) were calculated to quantify a tree-species-specific mixing effect for Norway spruce or Scots pine respective to one type of monoculture (Equations (3) and (4)).

Statistical Analysis and Models
In addition to the productivity ratios observed in single experiments (Analysis A), we studied the influence of the following site and stand variables on the overyielding parameters (Equation 1 and 2) for each measurement period of all experiments in a pooled Analysis B: latitude, length of growing season, mean annual temperature, annual precipitation, the Martonne index of aridity [40], site index for spruce and pine (i.e., dominant height at the reference age of 100 years), vegetation type, soil type, and soil moisture (see Table 3), as well as stand age (6 to 132 years), BA (pine: 0.3 to 46, spruce: 0.1 to 39, and mixture: 0.4 to 47 m 2 ha −1 ), stem number (pine: 181-2640, spruce: 863-3880, mix: 181-2980), Stand density index SDI (pine: 141-1842, spruce: 108-1192, mix: 150-1268; a density measure when comparing stands of different age and site [41]), mean diameter ratio of pine and spruce trees in the mixture (0.86 to 3.04, based on stem diameter at 1.3 m height), and initial mixture proportion of the measurement period (1 to 59% of spruce, based on BA, standing volume, or stem number in descending order of preference). See the supplementary data for more details. To calculate the stand productivity ratio, stand BA growth (or stand volume growth, in this order of preference) was used. While most results of Analysis B refer to ME without the consideration of tree species proportions, an additional statistical model was tested to compare ME and ME mix% , where ME in Equation (2) was weighted by the tree species proportions m Pine and m Spruce . The mixture proportion at the beginning of each measurement period was used (Equation (5)).
Given the hierarchical structure of the data, linear mixed models were chosen to account for autocorrelations of measurements within the studies. The following model was used to explore the influence of covariables on a given productivity ratio: where y ij is a productivity ratio according to Equations (1), (2), or (5) on the study site i in the observation period j, µ is a general mean, α i is a random effect for the location, x is a tested dependent variable, β 1 -β n are fixed effect coefficients, and ε is a random error with normal distribution. When exploring the relationships between mixing effect and the other factors (see the 16 site and stand variables above), observations during stand initiation (defined as the period within 10 years of stand establishment) were excluded to avoid extremely high yield ratios due to small tree sizes (<3 cm mean stem diameter). Thus, the first two measurement periods in UK2 and the young experiment FIN were excluded from pooled analyses. Single measurement periods were analyzed to assess the variation of ME values and the conformity of observed growth trends on individual studies and sites.
When linear mixed models were used to identify statistically significant factors, only growth observations were included in the analysis. Akaike's Information Criterion modified for small samples (AICc) and the likelihood test were used to compare different models. All statistical calculations were done with the R package 'nlme' [42].
For the discussion, the analysis was repeated when growth estimates for monospecific spruce on the study sites 1090 and 1092 in SE4 and POL were also included. The estimates were based respectively on the growth simulator for Swedish forests [20] and yield tables for Norway spruce [43] that are widely used in forest practice under Swedish and Polish conditions. The aim of this exercise was to detect signals that could be useful for future mixed forest growth modeling to improve yield predictions in the northern half of Europe.
In addition to linear mixed models, Pearson correlations were used to explore positive or negative relationships between single variables and mixture effects.

Over-and Underyielding in Single Experiments (Analysis A)
Different levels of production per treatment on each site are illustrated in Figure 2. Transgressive overyielding was observed in UK1, UK2, and SE4, while common overyielding occurred in five experiments (also described by the ME values in Table 4).
While the stand ages in the studies in northern Europe often covered only the first half of the rotation period, the Polish and German studies contained more mature stands. In the experiments SE1, SE2, and SE3, monospecific pine was the most productive at half rotation age while in SE4, UK1, UK2, and POL, mixed pine-spruce stands were most productive. In contrast, in GER, monospecific spruce was the most productive treatment. In SE1, SE2, and SE3, volume productions of pure pine were 19% (±4% confidence interval), 113-285%, and 25% (±10% CI) higher than in the mixture. In SE4, UK1, UK2, and POL, the productions by the mixture were 12%, 15% (±26% CI), 22% (±11% CI), and 4-41% higher than the most productive monoculture (Table 4). In GER, the monospecific spruce produced  Table 4 shows the production values per experiment and study site with the corresponding mixing effects calculated according to the Equations (1)-(4). Compared to the most productive monoculture, the mixture effect ME max ranged from 0.3 to 1.2. In three out of ten study sites, transgressive overyielding was observed where the mixture was 8-22% more productive. The lowest ME max was found in experiment SE2 where the mixed stands produced only 40% compared with monospecific pine after 40 years. In other cases, the mixture was 15-20% less productive than the most productive monoculture. Compared with the average productivity of the corresponding monocultures, the mixture effect ME ranged from 0.7 to 1.4 across all experiments.
Concerning the productivity of pine-spruce mixtures relative to pine, the experiments revealed a large range of effects on stand growth: from 0.3 to 1.4 on single sites (ME pine in Table 4). Interestingly, these values ranged between 0.3 and 0.8 on sites north of latitude 60, and between 0.9 and 1.4 in the south. Often, the productivity of mixtures relative to spruce monocultures was more than double on the northern sites (see ME spruce in Table 4).  Cumulative production from the first to last measurements of the experimental sites (volume growth for SE, POL, GER in grey bar color, or basal area growth in UK with black color or contour). The three bars per study site illustrate the production achieved in the three different treatments: pine (left bar in light grey), tree species mixture (center), and spruce (third bar to the right, if available on the site). The lines above the bars indicate the standard deviation if the treatment was replicated on the site (see number of observations in Table 1). The numbers above the bars show the stand age covered by the observations. Table 4 shows the production values per experiment and study site with the corresponding mixing effects calculated according to the Equations (1)-(4). Compared to the most productive monoculture, the mixture effect ME max ranged from 0.3 to 1.2. In three out of ten study sites, transgressive overyielding was observed where the mixture was 8-22% more productive. The lowest ME max was found in experiment SE2 where the mixed stands produced only 40% compared with Cumulative production from the first to last measurements of the experimental sites (volume growth for SE, POL, GER in grey bar color, or basal area growth in UK with black color or contour). The three bars per study site illustrate the production achieved in the three different treatments: pine (left bar in light grey), tree species mixture (center), and spruce (third bar to the right, if available on the site). The lines above the bars indicate the standard deviation if the treatment was replicated on the site (see number of observations in Table 1). The numbers above the bars show the stand age covered by the observations. Table 4. Stand productivity (with standard deviation on site if replicated) and mixture effects ME max, ME, ME pine and ME spruce (with standard deviation if both treatments were replicated) over the entire observation period of the study sites. Grey values indicate estimated growth of pure spruce. The ME estimates of single measurement periods vary within single experiments and on individual sites from the reported mean values in Table 4. For example, the ME max values for the first and last measurement period of the experiments SE3, GER, and UK2 equaled 0.97 and 1.05 (5-20  In Figure 3 we contrast the different periodic annual increments of mixed and pure stands based on basal area or volume growth per measurement period. The increment of BA (as well as stem volume increment) varied considerably between 0.1 and 5 m 2 ha −1 a −1 (or 1 and 30 m 3 ha −1 a −1 ). The values above the 1:1 bisector line represent better growth performance of the tree species mixture, while values below the line display a better performance of the monoculture. In Figure 3 we contrast the different periodic annual increments of mixed and pure stands based on basal area or volume growth per measurement period. The increment of BA (as well as stem volume increment) varied considerably between 0.1 and 5 m 2 ha −1 a −1 (or 1 and 30 m 3 ha −1 a −1 ). The values above the 1:1 bisector line represent better growth performance of the tree species mixture, while values below the line display a better performance of the monoculture.
Merging the different stands and measurement periods in Figure 3 did not reveal a general pattern of superior stand growth either in the mixed or the monospecific stands. Although the majority of points in Figure 3a,d indicate that the growth between single measurements was somewhat larger for pine-spruce than for pure pine, this was not true in the most northern experiments SE1, SE2, and SE3. The eight observations of BA growth (12 observations of volume growth) from those three experiments lie under the bisector line in Figure 3a,d, while the observations in POL dominate the graph. As the yield tables used for pure spruce in POL did not provide BA estimates, they are not included in Figure 3b,c. Both figures reveal that most measurement periods indicate better growth in the mixed stands compared with monospecific spruce or the average of both monocultures. Also, both graphs suggest that this is the case on sites with generally higher site productivity. This pattern Merging the different stands and measurement periods in Figure 3 did not reveal a general pattern of superior stand growth either in the mixed or the monospecific stands. Although the majority of points in Figure 3a,d indicate that the growth between single measurements was somewhat larger for pine-spruce than for pure pine, this was not true in the most northern experiments SE1, SE2, and SE3. The eight observations of BA growth (12 observations of volume growth) from those three experiments lie under the bisector line in Figure 3a,d, while the observations in POL dominate the graph.
As the yield tables used for pure spruce in POL did not provide BA estimates, they are not included in Figure 3b,c. Both figures reveal that most measurement periods indicate better growth in the mixed stands compared with monospecific spruce or the average of both monocultures. Also, both graphs suggest that this is the case on sites with generally higher site productivity. This pattern is not evident for periodic increment of volume during the single measurements in Figure 3e,f. On the contrary, when volume was measured (black dots), monospecific spruce or the average growth of both monocultures performed slightly better than the mixture. In GER, the stem volume productivity of monospecific spruce was significantly and constantly higher than that of the mixed or the monospecific pine. This was not so evident for the periodic annual increment of BA. The pattern would disappear if volume growth estimates (POL and SE4) were included (Figure 3e,f).

Influence of Stand Age, Tree Species Proportion, Climate, and Other Factors on the Mixture Effects
Excluding estimated growth values for monospecific spruce (POL, SE4), a statistically significant negative influence of latitude and stand age upon the productivity ratio ME was found using linear mixed-effects models ( Table 5). In a similar model with stand age, latitude, and mean tree size ratio between mixed pine and spruce trees as independent variables (data not shown), only a positive effect of stand age was found to be significant (p = 0.026). Tree size ratio was not statistically significant (p = 0.053). The model with all three independent variables was not better than a model with only stand age and tree size ratio in terms of the likelihood ratio test, parsimony, and AICc. Variation between the study sites accounted for ca. 25% of the residuals in the models. Table 5. Three linear mixed-effects models that related different site variables (e.g., annual precipitation) and stand characteristics (e.g., stand age, ratio of mean diameter of pine to spruce trees in the mixture) of each measurement period and site to the productivity ratio ME between mixed stands and the average of the corresponding monocultures. While Model 1 excludes study sites without monospecific spruce, others include estimated growth of spruce based on yield tables. Model 3 included the initial mixture proportion of spruce at the beginning of the measurement period. When mixture proportions were included in the calculation of ME (Equation (5)), the model fit did not improve (Table 5). However, the data were dominated by equal tree species proportions (Figure 4d).

Model
Pearson correlations between ME ratios and single explanatory variables were small and nonsignificant (p > 0.05). Site index for spruce and latitude resulted in R 2 values of only 0.2 (Figure 4b).
Correlations between stand age and ME ratio resulted in R 2 = 0.4. This weak indication of a positive relationship between ME and stand age was strongly influenced by the observations from single studies, e.g., POL. When the data were restricted to actual increment measurements and, thereby, to stand ages of <70 years, a negative correlation was found ( Figure 5). Again, the correlation did not improve when mixture proportions were included in the calculation of ME. ranked lower according to Akaike [43]). If the experiments in UK with high ME and precipitation were excluded from the data set, none of the climate factors would have been statistically significant.    ranked lower according to Akaike [43]). If the experiments in UK with high ME and precipitation were excluded from the data set, none of the climate factors would have been statistically significant.   For further discussion, our Analysis B using linear mixed-effects models restricted to observed growth values was complemented by growth estimates from yield tables and models (thereby including POL and two additional sites of SE4). This indicated a statistically significant positive influence of stand age, precipitation, and mean size ratio between pine and spruce trees (Table 5). There was no improvement from using the Martonne index instead of precipitation (this model ranked lower according to Akaike [43]). If the experiments in UK with high ME and precipitation were excluded from the data set, none of the climate factors would have been statistically significant.

The Mixing Effect of Observed Increment without Spruce Growth Estimates
Compared with the average production of both monocultures, the majority of studies showed yields of 97-136% in mixed stands. While overyielding was more common, transgressive overyielding was found in three out of ten cases. Two of these three cases (UK1 and UK2) occurred in the repeated experiment at Gisburn [23]. In these three cases, there was up to 20% higher wood production in mixture than in the most productive monoculture.
Our correlations over the study sites indicated decreasing overyielding with higher latitude. Especially on the sites in northern latitudes, monocultures of pine were more productive, while at southern latitudes, spruce monocultures were more productive. Thus, a large range of possible mixing effects could be envisaged for the study region, consistent with [25]. Introducing a spruce admixture to pine in northern latitudes decreased yield, because spruce grew poorly while pine thrived. In study SE2, the most productive monoculture pine yielded two to four times more than the mixture on sites where the site index for spruce was 5-6 m lower than that for pine. It is not surprising that adding spruce on such sites will lower production. By contrast, in Germany, mixing spruce with pine is often expected to lead to an overall higher growth performance [44].
However, generalizations from models with the weak relationships found in this study are difficult and will need to be better informed by both existing and new experiments. The paucity of data from long-term experiments in mixed forests highlights a need for modeling efforts, but experiments will also be needed to validate and develop such models [45]. Therefore, two important conclusions are (1) to maintain the existing experiments for future validation and development of management-related growth models, and (2) to complement the limited number of experiments with new study approaches (e.g., [25,26]).

The Mixing Effect of Studied Stands When Complemented by Simulated Growth Estimates for Lacking Treatments
To compensate for the lack of spruce monospecific stands in SE4 and POL, we used data derived from a growth simulator [20] and yield tables [43]. It was not possible to examine how well the growth simulator and yield tables applied to these specific sites, and the estimates obtained should be treated with caution.
The mixing effect thus predicted for the two study sites 1090 and 1092 in SE4 fitted well to the purely observed increment and would not change the previous conclusions. When the experiment POL with estimated growth of monospecific spruce was included, stand age, precipitation, and mean size ratio of pine and spruce trees in the mixture were highlighted. All three of them were positively correlated with the mixing effect. However, removing the single study POL from the pooled data resulted in a negative correlation between stand age and mixing effect.
We encountered the following problems when merging observed and simulated increment: There was little variation in some covariates describing site factors (latitude, site index), and the pattern of variation of simulated increment was different from that of observed values (Figure 4). In addition, there was confounding between the use of growth estimates and stand age, since POL was the only study with trees older than 70 years but pure spruce growth was estimated. Figure 5 illustrates that the mixing effect of the first two measurement periods of all three sites in POL agreed with expectations based on other stands, but the trend changed in subsequent measurements. Lacking other stands older than 70 years, the available data are too few to reach a firm conclusion about the general trend across larger forest regions. Basically, we could alter the results of our analysis by including or excluding the study POL. This modeling exercise only gave a hint on a possible changing mixing effect at the time when both tree species reach approximately the same tree heights. The pattern would remain if a growth underestimation of 20% by the yield table is assumed.
It remains unclear if the mixing effect becomes stronger in mature pine-spruce stands as recently indicated by global forest data [46,47]. To answer this important question, maintenance and future measurements of the existing long-term experiments should be given a very high priority.
In both analyses (restricted to observed increment in the results or including growth estimates for discussion), the influence of the site index on the mixture effect was not clear (Figure 4b), although this is certainly a primary driver over the whole rotation. Only the site index ratio between spruce and pine in Figure 4f indicates a positive, but nonsignificant trend. We conclude that there are several influences driving the "mixing effect", sometimes inter-correlated (e.g., latitude and site index), but also that there are more factors than the variables used in the limited data set. This resulted in poor modelling performances. Local climate conditions (during single growing seasons and monthly variation) and the differences between mixed and pure plots due to micro-site variation, previous management, and stand structure could have caused additional variability not explained by the models.
The Finnish experiment with early height measurements [48] was too young and not included in any comparison as average tree heights can be driven by species ontogeny and may correspond to a levelling process between species [49]. On medium-fertility sites in North Karelia in Finland, another growth model fitted using data from temporary plots [50] suggested that volume productivity at mid-rotation may be 10-15% greater in mixed compared to monospecific Scots pine and Norway spruce stands of the same stand age and BA. However, this is not in line with the experimental data from the northern sites, but rather with the overall difference across all sites of this study.
The positive influence of mean tree sizes on productivity in Table 5 is in line with the finding by Bielak et al. [24], but we could not detect a relation between site fertility and tree size ratio, unlike Pretzsch and Dieler in mixed spruce-beech stands [51]. Contrary to our results with a limited range of tree species proportions, the mixture proportion is important to estimate the productivity of mixed stands [33]. Eventually, together with stand density, such additional modifiers could be useful in future growth models, calibrated by single observation plots only [21].