You are currently viewing a new version of our website. To view the old version click .
Econometrics
  • Article
  • Open Access

6 January 2026

A Theory-Based Formal-Econometric Interpretation of an Econometric Model

Department of Economics, University of Oslo, 0317 Oslo, Norway

Abstract

The references of most of the observations that econometricians have are ill defined. To use such data in an empirical analysis, the econometrician in charge must find a way to give them economic meaning. In this paper, I have data and an econometric model, and I set out to show how economic theory can be used to interpret the variables and parameters of my econometric model. According to Ragnar Frisch, that is a difficult task. Economic theories reside in a Model World and the econometrician’s data reside in the Real World; the rational laws in the model world are fundamentally different from the empirical laws in the real world; and between the two worlds there is a gap that can never be bridged To accomplish my task, I build a bridge between Frisch’s two worlds with applied formal-econometric arguments, invent a pertinent model-world economic theory, walk the bridge with the invented theory, and use it to give economic meaning to the variables and parameters of my econometric model. At the end I demonstrate that the invented theory and the bridge I use in my analysis are empirically relevant in the empirical context of my econometric model.
JEL Classification:
A12; B23; B41; C01; C18; D01

1. Introduction

In empirical studies that aim at a unification of the theoretical-quantitative and the empirical-quantitative approaches to economic problems, economic theory can be used in different ways. One way is to give economic meanings to the variables and parameters of an econometric model. On pp. 32–38 in his 1930 Yale lectures Ragnar Frisch describes how such an interpretation is to be carried out (Bjerkholt & Qin, 2011).
In his lectures, Frisch assumed that an economic theory resides in a Model World, and that the data reside in the Real World. In addition, he claimed that the rational laws in the model world are fundamentally different from the empirical laws in the real world. Between the two worlds there is a gap that can never be bridged. In this paper I add axioms for a theory universe, axioms for a data universe, bridge principles that link variables in the theory universe with variables in the data universe, and statistical arguments to Frisch’s analysis. With these additions, I use his ideas to search for an economic theory that can provide economic meanings to the variables and parameters of a given econometric model.
The given econometric model has three variables, y 1 , y 2 , and y 3 , and a name, Commodity-exchange. I have 400 observations of each variable, and I believe that the observations describe the equilibrium configurations of a perfectly competitive market for one commodity. To see if that makes sense, I will search for an economic theory in Frisch’s Model World that I can use to underpin such an interpretation of Commodity-exchange.
A theory-based interpretation of an econometric model is a description of a theory-based model of the data generating process. The data generating process has one true model, which I denote by the acronym TPD, with T short for true, P for probability, and D for distribution. The data generating process has many theory-based models, each one of which I denote by the acronym MPD, where M is short for marginal, P for probability, and D for distribution. An MPD is a probability distribution of the data variables that is induced by the probability distribution of the theory variables and a model of the bridge principles. Such an MPD may be very different from the TPD.
An MPD has many interesting properties. For example, in a given empirical context, a model of the MPD is data admissible if its parameters lie in a 95% confidence band around an estimate of MPD parameters. Moreover, a data admissible MPD is congruent if it encompasses the true data generating process, the TPD. The congruence of an MPD implies that the MPD can account for the relevant statistical results obtained by the TPD (Hendry & Richard, 1989; Mizon, 1995).
These properties play an important role in my inventing a theory-based interpretation of Commodity-exchange. Roughly speaking, the bridge principles are valid in the given empirical context if and only if all the data admissible MPDs are congruent. When the bridge principles are valid, my interpretation of Commodity-exchange is empirically relevant only if the set of parameter vectors of data admissible MPDs has a non-empty intersection with the 95% confidence band around the estimated values of the corresponding TPD parameters.
I have added axioms of a theory universe, axioms of a data universe, bridge principles, and statistical arguments to Frisch’s analysis. When the bridge principles are empirically relevant, they provide a bridge between the theory universe and the data universe. In addition, the two universes, my bridge principles, and my statistical arguments, when fitted together, constitute a bridge between the model world and the real world. In the paper, I shall walk this bridge with the searched-for theory and give economic meanings to the variables and parameters of Commodity-exchange.
The bridge I build in this paper has several analogues. Two of them are relevant here: Trygve Haavelmo’s imaginary bridge (Haavelmo, 1944), and Aris Spanos’ completion of Haavelmo’s Bridge Pier (Spanos, 2015, p. 177). Haavelmo’s imaginary bridge linked theoretical variables in Frisch’s model world with the corresponding variables in his observational world. In his mind Haavelmo traversed the bridge by identifying the values of theoretical variables with the true values of the corresponding data variables. In (Stigum, 2025) I have with applied formal-econometric arguments built a bridge that has all the properties of Haavelmo’s imaginary bridge. Haavelmo can walk across that bridge with a model-world theory and give economic meaning to the variables and parameters of a given observational-world econometric model.
In two very interesting papers on the intrinsic properties of simultaneous equation systems (Spanos, 1989, 2015) Spanos builds a bridge between an observational world of data generating processes and a world of economic theories whose variables roam in Spanos’ observational world. Spanos seems to accept the Cowles Commission’s idea that a properly formulated economic theory is a simultaneous equation system with two parts, a structural form and a reduced form, but he interprets the reduced form primarily as a statistical model. His bridge links the variables in the reduced form with the variables in the structural form. I do not believe that a Cowles Commission researcher with an economic theory about a few observational-world variables can cross a Spanos-kind of bridge with his theory and use it to give economic meaning to the variables and parameters of an econometric model whose variables are the same as his own theoretical variables.
There are good reasons to believe that Frisch’s idea of an unbridgeable gap between the model world and the observational world and the Cowles Commission’s gallant skip of Haavelmo’s imaginary bridge have caused mainstream econometricians to base their empirical analyses on economic theories about observational-world variables. Both the economic-theoretical and the statistical analysis become easier when the basic theory resides in the real world instead of in a model world. I hope that now when there exists a prototype of a bridge between a model world and an observational world, econometricians will mend their ways.
The empirical relevance of the searched-for theory and the theory-based interpretation is proved at the end of Section 2. The empirical validity of the bridge principles that I use in my theory-based interpretation of Commodity-exchange is established in Section 3. I generated the data with the help of STATA’s Release 16 Number Generators.

2. A Theory-Based Interpretation of an Econometric Model

According to Haavelmo (1944, p. 4), it is “never possible—strictly speaking—to avoid ambiguities in classifications and measurements of real phenomena. Not only is our technique of physical measurements imprecise, but in most cases we are not even able to give an unambiguous description of the method of measurement to be used, nor are we able to give precise rules for the choice of things to be measured in connection with a certain theory.” Thus, in the words of Sismondo (1993, p. 516) the references of the data that economists have belong in a socially constructed World of Ideas—a world that has little in common with the real world.
The import of Haavelmo’s comments in this context is as follows: In an empirical analysis, the econometrician in charge knows what the data variables supposedly measure, but has only vague ideas about what they actually measure. The econometrician gives the data variables names, and uses the means of applied econometrics to ascertain whether the ideas have empirical relevance. The present simple example shows how an econometrician goes about that.

2.1. Commodity-Exchange: An Empirical Model for Three Variables

I have 400 observations of three variables, y 1 , y 2 , and y 3 . To me they are observations of a perfectly competitive market’s equilibrium configurations of the demand and supply of a single commodity, y. I take y 1 t to be an observation of the actual sales of y in period t, t = 1 , , 400 . The corresponding price of y is y 2 t . The third variable, y 3 , I take to be an auxiliary variable that has a significant influence on the supply of y. Whether this interpretation is empirically relevant remains to be seen.
I will use ordinary least-squares analysis to determine what is the most reasonable description of the data generating process. I begin by regressing y 1 on y 2 and y 3 , and by regressing y 2 on y 1 and y 3 . These regressions suggest that y 3 has a statistically significant positive effect on y 1 , and a statistically significant negative effect on y 2 . Hence, it is possible that the components of y 1 measure the supply of y, and that the components of y 2 measure the price of y during the respective periods.
With these comments in mind, I decided to adopt an ad-hoc econometric model for the three data variables that I call Commodity-exchange. The model describes for t = 1 , , 400 , how the variables relate to one another in the data generating process:
y 1 t = a 1 + b 1 y 3 t + δ 1 t ,
y 2 t = a 2 + b 2 y 3 t + δ 2 t .
In addition, I decide to keep the original interpretation of the variables; i.e., in the respective periods, the components of y 1 measure the sales of y, the components of y 2 record the price of y, and y 3 details the quantity of an auxiliary variable.
In the Introduction I described the idea of a true model of the data generating process, the TPD. For the present analysis I assume that my sample of observations is purely random, and that the data variables have finite means and finite positive variances in the TPD. If that is correct, one may obtain good estimates of TPD parameters. Below, Table 1 records estimates of the means and the correlation matrix of y 1 , y 2 , and y 3 , and Table 2 records the least-squares estimates of Equations (1) and (2). I obtained the estimates in the two tables with Stata Release 16.
Table 1. Means and Correlation Matrix of y.
Table 2. OLS Parameter Estimates of Equations (1) and (2).
In Table 2, RMSE is short for root mean squared error of the residual, R-sq is short for R 2 , F designates the F-statistic, and P is short for Prob. > F . The true values of the estimates in Table 1 and Table 2 are TPD parameters. They lie with 95% certainty in their confidence intervals.
The estimated versions of Equations (1) and (2) are displayed in Equations (3) and (4) for t = 1 , , 400 .
y 1 t = 9.2182 + 0.5049 y 3 t + f e 1 t ,
y 2 t = 6.3591 0.3812 y 3 t + f e 2 t ,
The error terms in (3) and (4) have mean zero, finite positive variances, and are not normally distributed.
The non-normality of the distributions of the error terms makes my claim that the confidence intervals in Table 1 and Table 2 with 95% certainty contain the true values of the respective TPD parameters suspect Davidson and MacKinnon (1993, pp. 71–77). However, here as in the remainder of the paper, the numerical accuracy of my claims about the confidence intervals does not matter in the context of this paper. It is how I use the intervals in my arguments that is important.

2.2. A Search for a Theory-Based Interpretation of Commodity-Exchange

To provide a theory-based interpretation of Commodity-exchange, I invent an economic theory whose variables roam in Frisch’s model world, build a bridge between the model world and the world in which my data reside, walk the bridge with my theory, and use the theory to give economic meaning to the variables and parameters of my econometric model.

2.2.1. The Searched-For Economic Theory

Let x denote a commodity, and let p denote the price of x. I assume that the equations in (5) depict the equilibrium configurations of the market for x in period t.
x 1 t = A + B p t , x 2 t = C p t + D x 3 t , x 1 t = x 2 t , t = 1 , , 400 ,
where x t R + 3 , p t R + + , A > 0 , B < 0 , C > 0 , and D > 0 .
Here, the first equation describes the demand for x, x 1 , as a linear function of its price. The second equation depicts the supply of x, x 2 , as a linear function of its price and an auxiliary variable, x 3 . The third equation insists that the values of p and x 3 ensure that in each period the demand for x equals the supply of x.
The equations in (5) constitute a simple simultaneous-equation system. They can be solved for x 1 t and p t . Theorem 1 bears witness to that.
Theorem 1.
Consider the commodity market that the equations in (5) describe. In an equilibrium configuration of the market
x 1 t = α 1 + β 1 x 3 t , p t = α 2 + β 2 x 3 t , t = 1 , , 400 ,
where α 1 = A C / ( C B ) , β 1 = ( B D ) / ( C B ) , α 2 = A / ( C B ) , and β 2 = ( D / ( C B ) ) .
Econometricians refer to the equations in (5) as the structural model of the theory. They refer to the equations above as the reduced form of the structural model.
Next, I will build a bridge between the model world and the world in which my data reside. I begin with the axioms for a theory universe.

2.2.2. The Axioms of a Theory Universe

The first axiom claims that the theory universe has eight variables, x R + 3 , p R + + , z R , and η R 3 , that run around in the theory universe. The x and p have the same denotation as the x and p in the preceding theory, and the components of η are error terms. The meaning of z is explicated in the third axiom.
The second axiom assumes that the ( x 1 t , x 2 t , p t , x 3 t ) in the theory universe satisfy the equations in (5).
The third axiom insists there exist two triples of constants, ( e 1 , e 2 , e 3 ) and ( c 1 , c 2 , 1 ) , and a sequence of purely random values of z, z t , with mean zero and finite positive variance such that
( x 1 t , p t , x 3 t ) = ( e 1 , e 2 , e 3 ) + ( c 1 , c 2 , 1 ) z t , t = 1 , , 400 ,
In formal econometrics, the probability distribution of ( x 1 t , p t , x 3 t ) which the equations in (6) and the z t in (7) induce, is the RPD distribution of ( x 1 t , p t , x 3 t ) , where R is short for researcher, P for probability, and D for distribution.
The fourth axiom assumes that the RPD distributed vectors ( x 1 t , p t , x 3 t , z t , η 1 t , η 2 t , η 3 t ) , t = 1 , , 400 , constitute a purely random sample, and their components have finite means and finite positive variances. The error terms are distributed independently of each other and of z, and they have mean zero.
So much for the theory universe. Next, the three axioms of the data universe. The axioms are in accord with my description of Commodity-exchange.

2.2.3. The Axioms of a Data Universe

The first axiom insists that the data universe contains four hundred observations of each one of three variables, which I denote by y 1 t , y 2 t , and y 3 t , t = 1 , , 400 .
The second axiom assumes that the vectors ( y 1 t , y 2 t , y 3 t ) are TPD distributed and constitute a purely random sample. In the TPD the variables have finite means and finite positive variances.
The third axiom insists that there exist constants a 1 , b 1 , a 2 , b 2 , and four hundred TPD distributed purely random pairs ( ξ 1 t , ξ 2 t ) , such that in the data generating process,
y 1 t = a 1 + b 1 y 3 t + ξ 1 t ,
y 2 t = a 2 + b 2 y 3 t + ξ 2 t .
The ξ 1 t and ξ 2 t are distributed independently of the y 3 t and of each other.
It remains to record the one and only axiom for the bridge between the two universes. Here it is:

2.2.4. The Axiom of a Bridge Between the Two Universes

The variables in the theory and data universes are linked as follows:
x 1 t = y 1 t η 1 t , p t = y 2 t η 2 t , x 3 t = y 3 t η 3 t , t = 1 , , 400
The RPD distribution of the two x’s and p and the three η ’s in (10) induce a probability distribution of the components of y, the MPD, that I introduced in the Introduction. The MPD may differ from the TPD. However, the components of y have finite means and finite positive variances in both distributions. Moreover, later in Theorem 2 I will show that there are models of a data admissible MPD that satisfy the third data axiom. Consequently, a data admissible MPD has a model that encompasses the TPD.
The equations in (6), (7) and (10), the assumptions about the probability distribution of the sequence of vectors, ( z t , η 1 t , η 2 t , η 3 t ) , and the assumption ( e 1 , e 2 , e 3 ) = ( m y 1 , m y 2 , m y 3 ) , provide a family of theory-based interpretations of the y in (1) and (2). A member of this family is empirically valid only if it satisfies several interesting conditions on the models of the market in (5) and on the models of the MPD distribution of y t in (10). Theorem 2 exhibits all of them, and the end of this section shows that there is at least one empirically valid member.
Theorem 2.
Suppose that the sequence of vectors, ( x 1 t , p t , x 3 t , z t ) , satisfies the equations in (6) and (7). Assume, also, that ( x 1 t , p t , x 3 t ) , ( y 1 t , y 2 t , y 3 t ) , and ( η 1 t , η 2 t , η 3 t ) satisfy the equations in (10), and that ( e 1 , e 2 , e 3 ) = ( m y 1 , m y 2 , m y 3 ) . Then, there is a model of the equations in (6) such that, for t = 1 , , 400 ,
( y 1 t m y 1 , y 2 t m y 2 , y 3 t m y 3 ) = ( c 1 , c 2 , 1 ) z t + ( η 1 t , η 2 t , η 3 t ) x 1 t = ( m y 1 c 1 m y 3 ) + c 1 x 3 t
p t = ( m y 2 c 2 m y 3 ) + c 2 x 3 t , a n d x 3 t = z t + m y 3 .
y 1 t = ( m y 1 c 1 m y 3 ) + c 1 y 3 t + ( η 1 t c 1 η 3 t ) , a n d y 2 t = ( m y 2 c 2 m y 3 ) + c 2 y 3 t + ( η 2 t c 2 η 3 t ) .
Finally, the ( α 1 , β 2 , α 2 , β 2 ) in (6), and the ( a 1 , b 1 , a 2 , b 2 ) in (1) and (2) with the MPD distribution of y, are such that,
( α 1 , β 2 , α 2 , β 2 ) = ( a 1 , b 1 , a 2 , b 2 ) = ( c 3 , c 1 , c 4 , c 2 ) w h e r e c 3 = ( m y 1 c 1 m y 3 ) a n d c 4 = ( m y 2 c 2 m y 3 )
In an empirically valid model of Theorem 2’s assumptions, it follows from the equations in (13) that the vector of coefficients in (6) and (11) must satisfy the conditions,
α 1 = ( m y 1 c 1 m y 3 ) ,   α 2 = ( m y 2 c 2 m y 3 ) ,   β 1 = c 1 and β 2 = c 2
In addition, the inequalities in (5) insist that
( m y 1 / m y 3 ) c 1 , c 1 > 0 ,   ( m y 2 / m y 3 ) c 2 , and c 2 < 0 .
Finally, since ( m y 1 , m y 2 , m y 3 ) is an estimate of ( e 1 , e 2 , e 3 ) , it must also be the case that the true values of e 1 , e 2 , and e 3 with high probability lie in the confidence intervals in Table 1, and satisfy the conditions,
e 1 > 10.55084 ,   e 2 > 5.11722 , and e 3 < 3.1003 .
To show that the given theory-based interpretation of Commodity-exchange has an empirically relevant model, I begin by estimating the coefficients in Equation (13).

2.2.5. The Statistical Arguments

I assume that the assumptions I made about ( x 1 t , p t , x 3 t ) , ( y 1 t , y 2 t , y 3 t ) , and ( η 1 t , η 2 t , η 3 t ) in Theorem 2 are valid. In addition, I assume that the assumptions I have made about the probability distribution of the ( z t , η 1 t , η 2 t , η 3 t ) are valid. Finally, I assume that the covariance matrix of ( η 1 t , η 2 t , η 3 t ) is diagonal, and that z t , and ( η 1 t , η 2 t , η 3 t ) are Gaussian. With these assumptions, the model in (13) becomes a restricted factor analysis model (see Lawley and Maxwell (1971, pp. 86–104) and Anderson and Rubin (1956)), and I can use factor analysis to estimate c 1 and c 2 . Moreover, if l insist that the third component of the factor is 1 and not 1 , c 1 and c 2 and the four variances of the model are identifiable.
With the given assumption about the probability distribution of ( z t , η 1 t , η 2 t , η 3 t ) , the sample covariance matrix, S ( n ) , multiplied by ( n 1 ) , has the Wishart distribution with ( n 1 ) degrees of freedom and the log-likelihood function,
Log ( S ( n ) , Σ ) = g ( y ( 1 ) , , y ( n ) ) ( ( n 1 ) / 2 ) log | Σ | + t r . ( S ( n ) Σ 1 )
where g ( . ) is a function of the observations, | ( . ) | and t r . ( . ) denote, respectively, the determinant and trace of ( . ) , Σ = E S ( n ) in the probability distribution of ( z t , η 1 t , η 2 t , η 3 t ) , m y ( n ) = n 1 1 t n y ( t ) , and
S ( n ) = ( n 1 ) 1 1 t n y ( t ) m y ( n ) ] [ y ( t ) m y ( n )
One obtains factor analytic estimates of c 1 , c 2 , and the variances of the components of ( z t , η 1 t , η 2 t , η 3 t ) by minimizing the value of Log ( S ( n ) , Σ ) | S ( n ) | 3 with respect to the six parameters subject to the conditions in Equation (19), where the entries of the three-dimensional diagonal matrix must be non-negative.
Σ = λ σ z 2 λ + γ , λ = ( c 1 , c 2 , l ) , and γ = σ η 1 2 0 0 0 σ η 2 2 0 0 0 σ η 3 2
The Wishart distribution in (18) has many models. In a case where the σ η 3 2 in (19) is zero, the equations in (19) will have a solution only if Σ 12 = Σ 13 Σ 32 / Σ 33 .
Table 3 records the Stata Release 16 maximum-likelihood estimates of c 1 , c 2 and the variances of z and η . The estimated chi-squared statstic is, Chi2(3,0) = 0.00, for an ‘LR test of model vs. saturated.’ Moreover, the estimated log likelihood equals −1940.9322.
Table 3. Factor loadings and variances of z and the error terms.
The table and the equations in (13) imply that, for all t = 1 , , 400 ,
y 1 t = m y 1 + 0.5190 z t * + η 1 t * ; y 2 t = m y 2 0.3919 z t * + η 2 t * ; y 3 t = m y 3 + z t * + η 3 t * .
Moreover, Table 3 suggests that I cannot reject the possibility that the variance of η 3 * is zero; i.e., that my observations of x 3 may be accurate.

2.2.6. A Theory-Based Interpretation of Commodity-Exchange

With the searched-for economic theory I have crossed the bridge I built between Frisch’s two worlds, and I am about to enter the real world to give economic meaning to the variables and parameters of Commodity-exchange.
Here I find that Table 3 suggests that I cannot reject the possibility that the variance of η 3 * is zero; i.e., my observations of x 3 may be accurate. Since y 3 is an auxiliary variable, I decide to assume that my observations of x 3 are accurate.
If I accept that I have accurate observations of x 3 and note that, with c 3 * = ( m y 1 0.5190 m y 3 ) = 9.1762 , and c 4 * = ( m y 2 + 0.3919 m y 3 ) = 6.3910 , I find that, for t = 1 , , 400 ,
y 1 t = 9.1762 + 0.5190 y 3 t + η 1 t * , y 2 t = 6.3910 0.3919 y 3 t + η 2 t * ,
But, if that is so, I can—with the x and p in the searched-for theory—identify x 1 t with y 1 t η 1 t * , p t with y 2 t η 2 t * , x 3 t with y 3 t , and deduce that
x 1 t = 9.1762 + 0.5190 x 3 t , p t = 6.3910 0.3919 x 3 t , t = 1 , , 400 .
When I identify x 1 t with y 1 t η 1 t * , p t with y 2 t η 2 t * , and x 3 t with y 3 t , I take y 1 t η 1 t * , y 2 t η 2 t * , and y 3 t to be possible observed true values of the respective components of y in the MPD distributed data generating process. As such, the identification becomes the first step in the process of giving economic meaning to the components of y and the parameters of Commodity-exchange.
The next step consists in demonstrating that the equations in (22) constitute a reduced form of a structural form that satisfies the inequality constraints that I listed in (5). For that purpose I need Theorem 3.
Theorem 3.
The quadruple ( α 1 , β 1 , α 2 , β 2 ) in the equations in (6) is the associated quadruple of a model of the commodity market in (5) if and only if the coefficients in (5) satisfy the equations
C = α 1 α 2 , B = β 1 β 2 , C B = α 1 β 2 β 1 α 2 α 2 β 2 A = α 1 β 2 β 1 α 2 β 2 , D = α 1 β 2 β 1 α 2 α 2
From Theorem 3 and easy calculations it follows that the ( c 3 * , c 1 * , c 4 * , c 2 * ) in (22) comprises the reduced-form parameters of a model of the equations in (5) in which
A = 17.6398 , B = 1.3243 , C = 1.4358 , D = 1.0817 ,
i.e., in which
x 1 t = 17.6398 1.3243 p t , x 2 t = 1.4358 p t + 1.0817 x 3 t , and x 1 t = x 2 t , t = 1 , , 400 .
The economic meanings which the first two steps in the interpretation process give to the variables and parameters of Commodity-exchange is economically meaningful. It remains to show that it, also, is statistically adequate. That takes two more steps—one for the empirical relevance of the searched-for economic theory, and one for the empirical relevance of the theory-based interpretation of Commodity-exchange.
In the third step I note that with z t * = y 3 t m y 3 and ( c 1 * , c 2 * ) = ( 0.5190 , 0.3919 ) , and with the estimates of the three means in Table 1 and the four variances in Table 3, the components of the vector,
( c 1 * , c 2 * , m y 1 , m y 2 , m y 3 , σ z * 2 , σ η 1 * 2 , σ η 2 * 2 , σ η 3 * 2 ) ,
are the parameters of a data admissible MPD distribution of y. Consequently, being an economic theory with a data admissible model of its MPD, the searched-for theory is empirically relevant in the empirical context of Commodity-exchange.
In the fourth step I will demonstrate that my theory-based explication of Commodity-exchange is, also, empirically relevant. The preceding arguments demonstrate that an MPD with the parameter vector in (25) is statistically meaningful. Hence, to show that my theory-based explication is empirically relevant, it remains to show that the reduced-form parameters in (23), ( 9.1762 , 0.5190 , 6.3910 , 0.3919 ) , belong in the confidence intervals for ( a 1 , b 1 , a 2 , b 2 ) in Table 2. To wit: 9.17616 ( 8.8248 , 9.6115 ) ; 0.5190 ( 0.3829 , 0.6269 ) ; 6.3910 ( 6.1080 , 6.6102 ) ; 0.3919 ( 0.4592 , 0.3033 ) .

3. The Empirical Validity of the Bridge Principles in (10)

The interpretation of Commodity-exchange that I give in Section 2 looks good, but it is not well founded. The bridge principles seem arbitrary since the probability distribution of η * comes from nowhere. Besides, the probability distribution of y in an MPD may be very different from its probability distribution in the TPD. In this section I will take care of both problems by showing that all the statistically meaningful data admissible MPDs are congruent. The fact that all the statistically meaningful data admissible MPDs are congruent implies that the MPD in my interpretation of Commodity-exchange can account for all the relevant statistical results of the TPD. In addition, it implies (Stigum, 2016, p. 7) that the bridge principles that I use in my theory-based interpretation of the parameters of Commodity-exchange are empirically valid.
To establish the empirical validity of my bridge principles, I must introduce new notation, define the concepts of encompassing and congruence properly, and record a useful mathematical detail. First, the mathematical detail: Formally, the data variables constitute a vector-valued random process, Y = { y ( t , ω ) ; t N } , on a probability space, ( Ω p , p , P p ( . ) ) , where N = 0 , 1 , 2 , , Ω p is a subset of a vector space, p is a σ field of subsets of Ω p , and P p ( . ) : p [ 0 , 1 ] is a probability measure. The family of finite-dimensional probability distributions of the members of Y relative to P p ( . ) is the true probability distribution of the data variables, the TPD. Now, according to Kolmogorov’s consistency theorem (Kolmogorov, 1933), one can, also, with an MPD associate a probability measure on ( Ω p , P ) , P M ( . ) : p [ 0 , 1 ] , such that Y has the probability distribution MPD relative to P M ( . ) , and such that Y becomes a vector-valued random process on the probability space, ( Ω p , p , P M ( . ) ) .
Next the notation: Let s n denote a finite set of observations of y 1 , y 2 , y 3 , and let m M 0 ( . ) and m P 0 ( . ) , respectively, be the factor-analytic estimator of the MPD and the least-squares estimator of the TPD. Then, in the given empirical context m P 0 ( s n ) records the least-squares estimates of the parameters of Commodity-exchange in Equations (1) and (2); i.e., the values of
( a 1 , b 1 , a 2 , b 2 , m y 1 , m y 2 , m y 3 , σ y 1 2 , σ y 2 2 , σ y 3 2 ) ,
which according to Table 1 and Table 2 equal
( 9.2182 , 0.5049 , 6.359 , 0.3812 , 10.722 , 5.223 , 2.979 , 1.5079 , 0.0058 , 0.0024 ) .
Moreover, m M 0 ( s n ) records the MPD parameters of Commodity-exchange in Equations (13) and (15); i.e., the values of
( c 3 , c 1 , c 4 , c 2 , m y 1 , m y 2 , m y 3 , σ z 2 , σ η 1 2 , σ η 2 2 , ( σ η 3 2 = 0 ) ) ,
which according to Table 1, Table 2 and Table 3–with σ y 3 * 2 = σ z * 2 —equal
( 9.1762 , 0.519 , 6.391 , 0.3919 , 10.722 , 5.223 , 2.979 , 1.4668 , 2.302 , 0.937 , 0 ) .
I assume that m M 0 ( . ) and m P 0 ( . ) are both consistent estimators. In addition, I let ( M 1 , m M 0 ( . ) ) and ( M 2 , m P 0 ( . ) ) be two econometric models in which M1 denotes a model of MPD and M2 denotes a model of TPD. I assume that the parameters of M1 and M2 are, respectively, as specified in (28) and (26), and that the M2 is the TPD. Finally, let θ M and θ P denote different values of the parameters in (28) and (26). I assume that there are two subsets of R 10 , AM and AP, such that θ M AM and θ P AP, and I let θ M 0 and θ p 0 be, respectively, the P lim of m M 0 ( s n ) and m P 0 ( s n ) in P P 0 ( . ) -measure.
Now, the definitions of encompassing and congruence: Let θ M * be the parameter of a statistically meaningful data admissible MPD, and let P M * ( . ) be the probability measure on ( Ω p , p ) that corresponds to this MPD. Then, in the present empirical context ( M 1 , m M 0 ( . ) ) encompasses ( M 2 , m P 0 ( . ) ) if and only if there exists a mapping, Γ ( . ) : A M A P , such that
Γ ( θ M * ) = P lim m P 0 ( s n ) in P M * ( . ) - measure , and Γ ( θ M 0 ) = θ P 0
If M1 is coherent with the a-priori theory of Commodity-exchange and if ( M 1 , m M 0 ( . ) ) encompasses ( M 2 , m P 0 ( . ) ) and M 2 is the TPD, then ( M 1 , m M 0 ( . ) ) is a congruent model of TPD.
The C-component of Definition 1 on page 6 in Stigum (2016) gives a precise definition of the pertinent idea of encompassing. In the present proof, Γ ( . ) is taken to be the identity matrix, I ( . ) . The corresponding definition of congruence is part of Definition 2 on page 6 in Stigum (2016). For a comprehensive discussion of encompassing and congruence see Mizon (1995), Bontemps and Mizon (2003, 2008), and pages 322–333 in Stigum (2015).
With these definitions in mind I can show that my bridge principles are valid in the present empirical context. In a given sample in which I have accurate observations of the x 3 t , the functions which estimate the respective components of the vector in (26), are the same as the functions which estimate the respective components of the vector in (28). The estimates differ because the functions have the TPD distribution in (26) and the estimated MPD distribution in (28). To show that the functions, which estimate the respective components of the vector in (26), are the same as the functions, which estimate the respective components of the vector in (28), it suffices to show that the functions that estimate c 1 , c 2 and σ z 2 in (28) are the same as the functions that estimate b 1 , b 2 and σ y 3 2 in (26).
In a model of the Wishart distribution in (18) in which the σ η 3 2 in (19) is zero, the equations in (19) will have a solution only if Σ 12 = Σ 13 Σ 32 / Σ 33 . In addition, it is a fact that, if the equations in (31) have a solution, my factor-analytic estimates of λ , σ z 2 , and γ will satisfy the equations in (31).
S ( n ) = λ σ z 2 λ + γ ;   λ = ( c 1 , c 2 , l ) ,   and γ = σ η 1 2 0 0 0 σ η 2 2 0 0 0 σ η 3 2 0 ( σ η 1 2 , σ η 2 2 , σ η 3 2 ) and 0 = σ η 3 2 .
Consequently, when I have accurate observations on the x 3 t , it follows from the equations in (31) that the factor-analytic estimates of c 1 , c 2 , and σ z 2 satisfy the equations
c 1 = 1 t 400 ( y 1 t m y 1 ) ( y 3 m y 3 ) / 1 t 400 ( y 3 t m y 3 ) 2 ; c 2 = 1 t 400 ( y 2 t m y 2 ) ( y 3 m y 3 ) / 1 t 400 ( y 3 t m y 3 ) 2 ;   and σ z 2 = ( n 1 ) 1 1 t 400 ( y 3 t m y 3 ) 2 .
These equations are—as was to be shown—the least-squares equations for the estimates of b 1 , b 2 , and σ y 3 2 .
Next, let M2 be the TPD, let Ml be a statistically meaningful data-admissible MPD, let θ M be the corresponding values assigned to the parameters in (31), and let P M ( . ) be the probability measure on ( Ω P , P ) determined by this ML. Then it is the case that in both P M ( . ) -measure and P p ( . ) -measure, m M 0 ( s n ) = m P 0 ( s n ) almost everywhere. Hence, θ M = P lim m P 0 ( s n ) in P M ( . ) -measure, and θ M 0 = θ P 0 in P P ( . ) -measure. Consequently, ( M 1 , m M 0 ( . ) ) encompasses ( M 2 , m P 0 ( . ) ) . Moreover, M 1 is coherent with the a-priori theory of Commodity-exchange in the sense that there is an MPD model of Equations (1) and (2). Finally, since M2 is the TPD, ( M l , m M 0 ( . ) ) is a congruent model of the TPD. It follows that M l is congruent. Since Ml was an arbitrarily chosen statistically meaningful data-admissible MPD, it follows that all statistically meaningful data-admissible MPDs are congruent, and that my bridge principles in (10) are valid in the present empirical context.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

I am the sole author of the paper. In writing it, I have benefitted from interesting and constructive criticisms by Econometrics’ three referees, and of helpful and constructive criticisms of earlier versions of the paper by Aris Spanos, Jennifer Castle, and Jo Thori Lind. With K. Waagan’s help, I used Stata for my statistical calculations. Konrad Kristofer Reksjø Almås prepared the LaTex manuscript for submission.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Anderson, T. W., & Rubin, H. (1956). Statistical inference in factor analysis. In J. Neyman (Ed.), Third Berkeley symposium on mathematical statistics and probability (Vol. 5, pp. 111–150). University of California Press. [Google Scholar]
  2. Bjerkholt, O., & Qin, D. (2011). A dynamic approach to economic theory: Lectures by Ragnar Frisch at Yale University. Routledge. [Google Scholar]
  3. Bontemps, C., & Mizon, G. E. (2003). Congruence and encompassing. In B. P. Stigum (Ed.), Econometrics and the philosophy of economics: Theory-data confrontations in economics (pp. 354–378). Princeton University Press. [Google Scholar]
  4. Bontemps, C., & Mizon, G. E. (2008). Encompassing: Concepts and implementation. Oxford Bulletin of Economics and Statistics, 70(s1), 721–750. [Google Scholar] [CrossRef]
  5. Davidson, R., & MacKinnon, J. G. (1993). Estimation and inference in econometrics. Oxford University Press. [Google Scholar]
  6. Haavelmo, T. (1944). The probability approach in econometrics. Econometrica, 12, iii–vi. [Google Scholar] [CrossRef]
  7. Hendry, D. F., & Richard, J. F. (1989). Recent developments in the theory of encompassing. In B. Cornet, & H. Tulkens (Eds.), Contributions to operations research and economics. The XXth anniversary of CORE (pp. 393–440). MIT Press. [Google Scholar]
  8. Kolmogorov, A. N. (1933). Grundbegriffe der wahrscheinlichkeitsrechnung. Springer. [Google Scholar]
  9. Lawley, D. N., & Maxwell, A. E. (1971). Factor analysis as a statistical method. American Elsevier. [Google Scholar]
  10. Mizon, G. E. (1995). Progressive modelling of macroeconomic time series: The LSE methodology. In K. D. Hoover (Ed.), Macroeconomics: Developments, tensions and prospects (pp. 107–169). Kluver Academic Press. [Google Scholar]
  11. Sismondo, S. (1993). Some social constructions. Social Science Studies, 23(3), 515–553. [Google Scholar] [CrossRef]
  12. Spanos, A. (1989). On rereading Haavelmo: A retrospective view of econometric modeling. Econometric Theory, 5(3), 405–429. [Google Scholar] [CrossRef]
  13. Spanos, A. (2015). Revisiting Haavelmo’s structural econometrics: Bridging the gap between theory and data. Journal of Economic Methodology, 22(2), 171–196. [Google Scholar] [CrossRef]
  14. Stigum, B. P. (2015). Econometrics in a formal science of economics: Theory and the measurement of economic relations. MIT Press. [Google Scholar]
  15. Stigum, B. P. (2016). The status of bridge principles in applied econometrics. Econometrics, 4(4), 50. [Google Scholar] [CrossRef]
  16. Stigum, B. P. (2025). A lost world of applied econometrics with the founding ideas of Ragnar Frisch and Trygve Haavelmo. University of Oslo. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.