Documentos de Académico
Documentos de Profesional
Documentos de Cultura
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.
http://www.jstor.org
GERHARD TINTNER
1. THE PROBLEM
ID
GGERHARD TINTNER
JUSTIFICATION
G(zi(Z Z2, ..
IZQ)
=0,
i =1, 2,
,q
<
Q.
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
Stability conditions will also be taken as fulfilled. It should be mentioned that Leontief" has recently attempted to verify such static systems in a somewhat simplified form.
We can also consider system (1) as representing a nonstatic Walrasian system of general equilibrium. Anticipations have then to be
introduced. Such ideas have been treated extensively by Hicks12 and
recently by Lange.'3 The present author has shown that the assumption
of specific routines of anticipations leads to "dynamic" terms in the
equations (1).14 This is similar to procedures adopted earlier by Evans'5
and Roos.'6 That is to say, some of the zi in (1) have now to be interpreted as referring to different points in time, as derivatives with respect to time, integrals, etc.
It seems to us that such an approach is similar to the one implied in
certain mathematical business-cycle theories, especially those of Tinbergen,'7 Kalecki,'8 Davis," and the author.20 Empirical verifications
are available in the extensive work of Tinbergen2l and also on a much
smaller scale in a publication by the author of this article.22
It has already been pointed out by Pareto2' and emphasized by later
writers24that the labor of determining empirically the system (1) would
10 A. Wald, "Ueber einige Gleichungssysteme der mathematischen Oekonomie,"
Zeitschriftfir Nationalokonomie, Vol. 7, 1936, pp. 637-670.
11 W. Leontief, The Structure of American Economy, 1919-1929, Cambridge,
Mass., 1941, 181 pp.
12 J R. Hicks, Value and Capital, Oxford, 1939, 331 pp.
13 0. Lange, Price Flexibility and Employment, Bloomington, Ind., 1944, 114 pp.
1 G. Tintner, "A Contribution to the Nonstatic Theory of Production," in
Studies in MathematicalEconomicsand Econometrics,In Memoryof Henry Schultz,
pp. 92-109, esp. pp. 106 ff.; "A Contribution to the Non-Static Theory of
Choice," QuarterlyJournal of Economics, Vol. 56, February, 1942, pp. 274-306,
esp. pp. 302 ff.
1"G. C. Evans, Mathematical Introduction to Economics, New York, 1930,
pp. 36 ff.
18 C. F. Roos, Dynamic Economics, Bloomington, Ind., 1934, pp. 14 ff.
17 J. Tinbergen, "Annual Survey: Suggestions on Quantitative Business Cycle
Theory," ECONOMETRICA, Vol. 3, July, 1935, pp. 241-308.
18 M. Kalecki, Essays in the Theory of Economic Fluctuations, London, 1935;
Studies in Economic Dynamics, New York, 1944.
19 H. T. Davis, The Theory of Econometrics, Bloomington, Indiana, 1941,
pp. 408 ff.
20 G. Tintner, "A 'Simple' Theory of Business Fluctuations," ECONOMETRICA,
Vol. 10, July-October, 1942, pp. 317-320.
21 J. Tinbergen, Business Cycles in the United States, 1919-32, Geneva, 1939.
22 G. Tintner, "The 'Simple' Theory of Business Fluctuations: A Tentative
Verification," Review of Economic Statistics, Vol. 26, August, 1944, pp. 148-157.
23 V. Pareto, Cours d'gconomiepolitique, Vol. 2, Lausanne, 1897, pp. 364 ff.
24 F. A. von Hayek, "The Present State of the Debate" in CollectivistEconomic
Planning, London, 1935, pp. 201-243, esp. pp. 207 ff.
GERHARD
TINTNER
be prohibitive because of the great number of variables actually entering into such a system of general equilibrium. This is true even if observations for all the z; were available which is certainly not the case.
Hence we are forced to investigate the possibilities of simplification.
In economic theory, such simplified models have very often been constructed. Fisher's famous equation of exchange25can be considered as
a very simplified model of the complete Walrasian system. This assumes of course that this equation or the corresponding "Cambridge"
equation is not a mere tautology. The equations in Keynes's earlier
Treatise26are of a similar nature. Many other models have been constructed, some of which are represented in Lundberg's book.27 Keynes's
General Theory28can be thought of as a simplified model of the Walrasian system. It is mathematically formulated in the articles by
Hicks,2"Lange,30and Modigliani,3' to mention only a few. We want to
proceed along similar lines.
We replace a certain subset of the variables and parameters
Z1, Z2, . . *, ZQ by one new variable Mh (h=l1, 2, * * *, p). p is now
a much smaller number than Q. At the same time we reduce the number of equations in the system.
The substitution of Mh for a subset of our original variables and
parameters amounts to the following: We replace for instance the various wheat prices by one representative wheat price or by the average
of all wheat prices. We replace the quantities of all the producers' goods
in the system (like iron, coal, copper, etc.) by an index of the quantity
of producers' goods. We replace the various short-term interest rates
by one representative short-term rate or by the average of all shortterm rates, etc. Time will probably also enter explicitly into our equations.
After replacing subsets of the original variables and parameters in
(1) by the new set of variables M1, M2, * * * , Mp we may still have
variables that are not represented. Write w for a new variable which
stands for those variables which lack representation.
25
26 J.
27
28
London, 1936.
29 J. R. Hicks, "Mr. Keynes and the 'Classics'; A Suggested Interpretation,"
ECONOMETRICA,
Vol. 5, April, 1937, pp. 147-159.
30 0. Lange, "The Rate of Interest and the Optimum Propensity to Consume,"
Economica, Vol. 5, New Series, February, 1938, pp. 12-32.
31 F. Modigliani, "Liquidity Preference and the Theory of Money," ECONOMETRICA, Vol. 12, January, 1944, pp. 45-88. See also D. M. Fort, "A Theory of
General Short-Run Equilibrium," ECONOMETRICA, Vol. 13, October, 1945, pp.
293-310.
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
G. Haberler, Der Sinn der Indexzahlen, Tilbingen, 1927, pp. 125 ff.
G. Tintner, "A Note on Economic Aspects of the Theory of Errors in Time
Series," Quarterly Journal of Economics, Vol. 53, November, 1938, pp. 141-149.
31 R. Frisch, Statistical Confluence Analysis by Means of Complete Regression
10
GERHARD
TINTNER
The economic meaning of the assumption that the Mi are the mathematical expectatiQns of the observed variables Xi is the following: We
assume that in the long run the averages (arithmetic means) of the disturbances or deviations mentioned above tend to be zero. The consequences are not very serious if this assumption should not be strictly
fulfilled. The resulting bias will only influence the constant terms if
we consider linear systems, as we propose to do later. The more important regression coefficients (which represent, for instance, elasticities),
are not affected. Since we are primarily interested in the estimation of
these coefficients we may assume that the biases will not impair the
significance of our results considerably, even if they should exist.
Write Xit for the actual observation of the theoretical variables Mi
at the point in time t. Our observations extend over the period
t=l, 2,**,
N.
We have;
(3)
Xit -Mit
+ Yit)
i =
1, 2, ..,
p; t-=1, 2, ..,
N.
We have by definition Mt = EXi. Mit is the mathematical expectation or systematic part and yit the random term or the disturbance
(Frisch). These disturbances result from lack of representation, frictions, and errors of measurement as indicated above. The last term
may be absent with some of the variables, especially the one representing time. The mathematical expectations are not random variables. We
will also assume that the term wt (t= 1, 2, * * * , N) which stands for the
variables not represented in the system is random, with Ewt =0. It is
also possible that sometimes some kinds of frictions may give rise to
this type of stochastic terms.
Now we finally assume the system (2) linear in first approximation.
p
(4)
k,o + Z
k,jM2t = w,t,
v = 1, 2,
.,
R; t=1,
2,
, N.
j=1
MULTIPLE
REGRESSION
FOR SYSTEMS
11
OF EQUATIONS
(5)
ka + k,ktMit = O,
N.
12
GERHARD
TINTNER
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
13
tioned above and also with the nonmathematical theories of the cycle.
There is one feature in nonstatic systems that may occasionally bring
about abrupt and sudden changes which may appear as discontinuities.
This is the "kink" in the (real or imagined) demand curve of the firm
under oligopoly. It implies a discontinuity in the marginal-revenue
curve. This was recently pointed out by Lange47 on the basis of the
earlier work of P. Sweezy.48 It does not seem, however, that these discontinuities are actually very large and that they are widespread in the
economic system. Continuity and momentum seem to us outstanding
characteristics of our economy.
The recent book by von Neumann and Morgenstern49 also stresses
discontinuities and indeterminacies arising in our economy from oligopolistic and similar situations. These are not unlike phenomena observed in games of strategy. But the actual determination of prices,
etc., under such conditions of indeterminacy may depend upon noneconomic factors like bargaining power, politics, etc. as in the much
discussed problem of bilateral monopoly.50 Hence it does not seem to
us that under normal conditions important discontinuities must necessarily arise, as long as the total social situation is reasonably stable.
It is important to note that the stochastic features of our system
are introduced only through the disturbances yit. Economically this is
equivalent to the following assertions: If we knew all our data perfectly
and if the system (1) was frictionless and represented perfectly "rational" behavior then the zi would appear as smooth (and predictable)
functions of time. This is not unlike the famous assertion of Laplace"
in classical mechanics.
Stochastic processes would appear only if assumption A above is
adopted or if we attempted to determine the complete system (4). This
would in some measure correspond to Frisch's business-cycle theory,52
the problem considered by Slutsky,53 the analysis of Hurwicz,54etc.
Lange, Price Flexibility and Employment, pp. 40 ff.
M. Sweezy, "Demand under Conditions of Oligopoly," Journal of Political Economy, Vol. 47, August, 1939, pp. 568-573.
49 J. von Neumann and 0. Morgenstern, Theory of Games and Economic Behavior, Princeton, 1944, pp. 43 if.
50 See, e.g., G. Tintner, "Note on the Problem of Bilateral Monopoly,"
Journal of Political Economy, Vol. 47, April, 1939, pp. 263-270, and the literature
quoted there, esp. p. 270.
51 P. S. de Laplace, Essai philosophique sur les probabilit6s, 4th ed., Paris, 1819,
p. 4.
52 R. Frisch, "Propagation Problems and Impulse Problems ...
," cited in
note 46.
53 E. Slutzky, "The Summation of Random Causes as the Source of Cyclical
Processes," ECONOMETRICA, Vol. 5, April, 1937, pp. 105-146.
54 L. Hurwicz, "Stochastic Models of Economic Fluctuations," ECONOMETRICA,
Vol. 12, April, 1944, pp. 114-124.
40 O.
48 P.
14
GERHARD
TINTNER
3. THE STATISTICAL
METHOD
A. Estimation
We give now a short description of our method. It deals essentially
with a case discussed by Frisch55and is in a sense an extension of the
work of Koopmans,56 who considered the existence of only one relationship existing between the Mit. This corresponds to the case R =1. The
methods presented here are also similar to the ones given by the author
in an earlier article.57 They are definitely connected with modern work
in multivariate analysis.58
Under the assumptions stated above we endeavor to estimate the
coefficients k, in equation (5). In order to do this we have first to
"estimate" R itself, i.e., the number of independent linear relationships
actually existing among our p variables Mit in the population.
The possibility of estimating the "true dimensionality" of our problem in statistical terms (a problem already envisaged by Frisch59 and
contemplated by Fisher60 and others in "discriminant analysis") distinguishes our approach from the one of Haavelmo and his followers.
These simply assume R, the number of independent linear relationships
between the variables in the population as given. Hence it seems possible that they may endeavor to accomplish too much, i.e., to determine a greater number of equations than actually exist in the data.
This would by necessity lead to nonsensical results. The reason for this
distinction between their method and our procedures lies in the different
role of economic theory and the reliance put into a priori assumed economic relationships, which seem to be greater with Haavelmo and his
school than with us.
a. Estimation of the Variances of the Disturbances. In order to estimate R we have first to estimate the variances of the disturbances yit.
Since we have assumed above that the Mit are smooth functions of
time we can use the variate difference method for this purpose. This
seems to us preferable because we do not have to assume that the systematic parts of our variables are specific functions of time, e.g., polyR. Frisch, Statistical ConfluenceAnalysis.
T. Koopmans, Linear RegressionAnalysis in Economic Time Series, Haarlem,
1937.
57G. Tintner, "An Application of the Variate Difference Method to Multiple
Regression," ECONOMETRICA, Vol. 12, April, 1944, pp. 97-113.
58 H. Hotelling, "Relations between Two Sets of Variates," Biometrika, Vol. 28,
December, 1936, pp. 321-377.
69 Statistical Confluence Analysis . . .,
cited in note 34. See also, R. Stone.
T'heAnalysis of Market Demand, London, National Institute of Economic and
Social Research, 1945.
60 R. A. Fisher, "The Statistical Utilization of Multiple Measurements," Annals of Eugenics, Vol. 8, 1938, pp. 376 ff.
55
56
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
15
63
Journal of the American Statistical Association, Vol. 40, March, 1945, pp. 38-56.
16
GERHARD
TINTNER
(6)
al2
. .
alp
XV1
a12
alp
a22 -XV2
.
.
a2
a2p
.
= O,
* * app - XVp
Ar = (N-
1)(Xl + X2 +
Xr)
i.e., the sum of the r smallest roots of (6) multiplified by (N - 1). This
quantity (7) is distributed like x2 with (N-I-p
+r) r degrees of freedom. It may be used to estimate R, i.e., the true number of independent
relationships of the type (5) which actually exist in the population.
The determinantal equation (6) is fundamental for our procedures.
64 P. L. Hsu, "On the Problem of Rank and the Limiting Distribution of
Fisher's Test Function," Annals of Eugenics, Vol. 11, 1941, pp. 39 fif.
66 T. Koopmans, Linear Regression Analysis
in Economic Time Series, pp. 87 ff.
6B H. Hotelling, "Simplified Calculation of Principal Components," Psychometrika, Vol. 1, March, 1936, pp. 27-35.
MULTIPLE
17
OF EQUATIONS
FOR SYSTEMS
REGRESSION
(8)
Q= E
E
t=1 i=1
(xt -M,t)2/V,
where mit is the deviation of Mit from its arithmetic mean. We have to
minimize Q under the R conditions (5).
These conditions involve actually Rp+R coefficients k,, (v=1,
2, * * , R; j=O, 1, 2, * * , N) which have to be estimated. The constants k,o are evidently given by the condition that the best fit has to
pass through the means of all the variables :68
p
(9)
kao+
v = ly 2, ..
k,ikXi = 0,
I R.
. . .
m2t,
.
.,
Xmp-R
tin
terms
18
GERHARD TINTNER
k,ikwiVi =
(10)
v, w = 1, 2, * *,
vw,,
i=l1
t=1
E ktjx1)
E3 i==l
( j=1
kwix1t) =
E
O,
v p w; v, w = 1, 2, *
, f.
These conditions (10) and (11) normalize and orthogonalize the coefficients klcv(v-1, 2, . * *, R; j= 1, 2, . . *, p).
The procedure is then essentially the same as weighted regression
with just one equation.69 We determine first the most advantageous
values of the mit (i= 1, 2, * * *, p; t= 1, 2, * , N). It can be demonstrated that under the given conditions this leads to a sum of squares
in the form:
R
9= E
(12)
t=l
P
(
j=l
v=l
Xtx)2.
kvj
This expression is very similar to the one appearing in the earlier analysis with just one relation.
The complete solution in mathematical form is given elsewhere.70 It
can be shown that the minimization of the expression (12) under conditions (10) and (11) leads again to the determinantal equation (6).
The system of linear homogeneous equations that yields the solutions is as follows:
(all
+ aipkvp = 0,
+ a2,kvp = 0,
XvV,)kv,+ al2kv2+
*
al2kv1+ (a22 XvV2)kv2+
(13)
.
aJpkvc+ a2pkv2+
* + (app - XvVp)kvp= 0,
v l= 21,
2,
R.
This system of equations can evidently only have a nontrivial solution if its determinant is zero. [A trivial solution is excluded by condition (10).] This is achieved by inserting for Xvone of the roots of the
determinantal equations (6). Then the computation of the k,j is possible, if one additional condition is imposed, e.g., condition (10) for v =w,
69T. Koopmans, Linear Regression Analysis. . . , cited in note 56.
70 G. Tintner, "A Note on Rank, Multicollinearity and Multiple Regression."
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
19
20
GERHARD
TlNTNER
j =1, 2, .
21
+- aklpvkp,v = 0,
+.
- a2p',kvpv,' =
0,
+ (apvpv - Xv')kvpv,' = 0,
v-1,
2,
, R.
This is again a system of linear homogeneous equations in the variables kv'. A nontrivial solution is possible only if the determinant is
equal to zero. In order to accomplish this we have to modify the determinantal equation (6) in such a fashion, that the rows and columns
corresponding to the sv variables that do not now appear in the system
(14) are left out. Denote the smallest latent root of the modified equation (6) by Xv'. Then we insert this solution Xv'into (14) and assume
22
GERHARD TINTNER
C. Tests of Significance
We will now give tests of significance that should enable us to get
an idea about the goodness of the fit achieved and to estimate the loss
of statistical accuracy suffered by imposing new conditions for the purpose of identification.
- * - +SR=
Let 81+82+
S' be the total number of conditions imposed
in the process of identification. Define a quantity analogous to (7):
(15)
AR'
(N
1) (X1' + X2 + - - * + XR )
which is now the sum of the R smallest roots of the modified determinantal equations (6) multiplified by N-1. Each one of the roots Xv'
is computed by leaving out in (6) the appropriate s, rows and columns.
de-
grees of freedom.
The following test of significance is analogous to a procedure given
This is apby Wilks.73 We form F = (N-1-P
+R)R (AR'-AR)/S'AR.
proximately distributed like Snedecor's F with (N-1 -p+R)R and S'
degrees of freedom. We fix a level of significance and test the following
hypothesis: That the difference between AR and AR' is zero, i.e., that
the introduction of the S' new conditions necessary for identification
has not materially deteriorated our fit.
There is no equivalent test for the individual equations. The reason
for this is that in the original system (5) the equations are not identified
and we hence do not know which one corresponds to a given meaningful
equation of our new set-up. Hence we must always determine a complete system before we can make a test of significance. There is, however,
a method by which we can compare identified systems that involve
different sets of conditions.
73S.
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
23
Assume the number of conditions again S' in the first system and S in
the second. Then we form a quantity ARcorresponding to (15). The quanis again
tity F=[(N-1-p+R)R+S]AR'/[(N-1-p+R)R+S']AR
like
distributed
F
Snedecor's with (N-1-p+R)IR+S'
approximately
and (N-1-p+R)R+S
degrees of freedom. The hypothesis to be
tested is again: That there is no essential difference in the sum of
squares of residuals divided by the appropriate number of degrees of
freedom resulting from the two sets of conditions (or in the two systems).
It is remarkable and entirely in the spirit of our approach to the
problem that it is not possible to test individually the difference between the k,j and ki'. It shows again that it is really not individual
equations that are determined in our case but that the entire system is
estimated as a whole.
Tests of significance for the individual kv' can be given that are
approximations and based upon an idea indicated by Koopmans.74 In
order to do this we have to assume a different normalization rule from
the one given above.
This new normalization rule is also more appropriate and useful in
economics. It consists in making one of the variables Mit, say Mlt,
the dependent variable. E.g., in the theory of demand we frequently
consider the quantity demanded as the dependent variable and the
price, income, etc. as independent variables.
It should be stressed, however, that now this distinction is made only
for the purpose of normalization and that it has definitely nothing to
do with the distinction between dependent and independent variables
made in classical regression analysis. There the assumption is that the
dependent variable alone is subject to disturbances while the fit is carred through for purposes of prediction assuming fixed values for the
so-called independent variables.75 Here our purpose is the estimation
of the structural coefficients themselves and not prediction. Disturbances enter into all our variables, not only into Xit which now assumes the role of "dependent" variable for purpose of normalization
only.
Let our new normalized solutions be denoted by kc,i"(v = 1, 2, * * , R;
j=2, 3, * , pv'). The system of equations becomes now:
(a22-Xv'V2)kv2"t+a23kv3"+
(16)
(16)
a23k,2"+- (a33-X'V3)kC3"-+
a2pvkv2/"
+a3pkv3+
74 T. Koopmans,
75
**
+a2pv
+a3pk ,p.'t
= a12,
=a13,
24
GERHARD
TINTNER
This system of linear homogeneous equations will again have a nontrivial solution if we substitute for Xv'a solution for the modified determinantal equation (6). The modification consists in leaving out the
s, rows and columns corresponding to the variables that by assumption
do not enter into the specific vth equation.
Using an approximation given by Koopmans76 we can now compute
the variances and covariances of the kq"' under the assumption that
the variances of the systematic parts are much larger than the ones of
the disturbances.
The covariance of ku"i"and kj'" is then given by the formula:
(17)
Ekcvi"kvi"= Xv/Cij( E
kh"Vh
V1) (N
pv').
MULTIPLE
REGRESSION
FOR SYSTEMS
25
OF EQUATIONS
TO AGRICULTURAL
DATA
A. The Data
We will now apply our method to an endeavor to fit a demand and a
supply curve for agricultural products in the United States. Our data
cover the period 1920-1943 (N = 24). The data are annual figures.
They are the following:80 X1, Prices received by farmers, 1910-14 = 100;
X2, National income (billion $); X3, agricultural production, 1935-39
= 100; X4, time, origin between 1931 and 1932; X5, prices paid by
farmers, 1910-14=100. The means of the variables are presented in
Table 1.
TABLE 1
ARITHMETIC MEANS (X,)
Mean
Variable
Symbol
127.417
72.3975
100.625
0.000
136.460
Xi
X2
X3
X4
X5
X1
X2
X3
X4
X2
1212.593
536.182
557.636
MATRIX (aii)
X3
87.594
207.147
106.418
X4
-72.696
75.022
54.543
50.000
X5
X5
532.000
220.043
30.261
-38.543
240.652
26
GERHARD
TINTNER
Difference
Xi
X2
X3
X5
1
2
3
4
5
337.885
130.649
47.901
28.580
24.349
72.346
19.068
9.319
7.349
7.056
J4.156
10.698
7.961
7.160
6.827
62.863
25.221
12.472
9.812
8.987
Tests indicate that the variances probably become stable for the fifth
difference series. Hence we will take the items in the last line of Table 3
as the estimates of the error variances Vi of our variables.
The variable time (X4) is evidently not subject to error and hence we
have V4 =0.
C. Estimation of the Number of Independent Linear Relationships
In order to estimate the number of independent relationships among
our variables we form the determinantal equation corresponding to
formula (6):
1212.593-24.349X
-72.696
532.000
536.182
87.594
557.636 -7.056X
207.147
75.022
220.043
106.418 -6.827X
54.543
30.261
(18)
50.000
=0.
-38.543
240.652 -8.987X
(Variables
Number
X1,
X2,
Roots
XI.
1
2
3
0.420
1.258
6.373
** Significant
X3,
X4,
X5)
Sum of Squares
9.660
38.594
185.173**
MULTIPLE
REGRESSION
FOR SYSTEMS
27
OF EQUATIONS
(19)
k12m2 t+
k13m3 t+
kl4M4t +
kl5m5t = 0,
0,k21Mt+k22M2t+k23M3t+k24M4t+k25M5t
O,
t
1, 2,
* , 24,
where the mit are the deviations of the mathematical expectations Mit
of the empirical variables Xit from their respective means. Formula (19)
corresponds to (5), if we take deviations from the means. The kXsare
restricted by the normalizing and orthogonalizing conditions (10) and
(11).
To compute first k1i we use the smallest latent root Xi =0.420 from
Table 4. Inserting this in formula (13) we get the following system of
homogeneous linear equations :81
1202.366k11+536.182kl2+ 87.594kl3- 72.696k14+ 532.000k16= 0,
536.182k11+ 554.672k12+ 207.147k13+ 75.022k14+ 220.043 k15= 0,
(20)
-
87.594k11+207.147kl2+103.551kl3+54.543kl4+
30.261k15=0,
72.696k11 +
38.543k15 = 0,
75.022k12+
54.543k13+50.000k14-
532.000k1l+220.043k12+ 30.261k13-38.543kl4+236.877k15
0.
81 For computational detail, see the author's earlier paper: "An Application of
the Variate Difference Method to Multiple Regression."
28
GERHARD
TINTNER
The two equations (21) and (22) solve our problem of estimation: They
are our estimates of the two relationships existing among our 5 variables.
E. Identification
The two equations (21) and (22) represent in an orthogonalized and
normalized form the two linear relations existing among our variables.
But evidently they cannot be interpreted in an economically meaningful way. We do not know which one of the two equations is the demand
curve, which one the supply curve. Also we can construct an infinite
number of equivalent relationships by multiplying each by an arbitrary
constant and combining them.
In order to solve the problem of identification we will proceed as
follows: To get the demand curve (v=1) we will assume that kC5=0
and in order to find the supply curve (v = 2) we will assume that k22= 0.
Then the two relationships to be fitted are:
k131m3t + kl4'm4t = 0,
(23)
kul'mlt + kl2'n2t +-
(24)
536.812
557.636 -7.056X'
(25)
v~~~~~~~(25)
/106.418-6.827X'
87.594
207.147
-72.696
75.022
54.543
50.000
=0.
29
Ar'
are pre-
TABLE 5
ROOTS OF EQUATION(25)
(Variables X1, X2, X3, X4)
Number
Root
Sum of Squares
Xr
Ar
r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1
2
** Significant
0.278
5.121
6.394
124.177**
Number
Root
Sum of Squares
1
2
** Significant
10.695
79.971**
0.465
3.012
at the 1-per-cent level of significance.
Number
Root
Sum of Squares
1
2
3
**
Ar
0.799
1.115
49.001
18.377
44.022
1071.045**
30
GERHARD
TINTNER
TABLE
(Variables X1,
Number
X2,
X3)
Root
Sum of Squares
Ar
Nr
1.218
27.244
2
**
28.014
654.626**
Number
Root
Ar
XI
1
2
**
Sum of Squares
0.473
14.302
10.879
339.756**
We replace now the two equations (23) and (24) by the following two
which are more convenient for economic purposes:
(26)
(27)
M3 = k2l''ml + k2411m4
+ k2511m5
[we have suppressed the subscript t in (26) and (27) for the sake of
convenience]. Inserting the smallest root of (25) from Table 5
(X"'=0.278) in (16) we have to solve the system of equations:
1205.824k1l'" + 536.182kl21' - 72.696kl41' =
(28)
87.594,
75.022kl21' + 50.000k141'=
54.543.
The solutions k,i" appear in the following equation (demand fun ction):
(29)
X3=
mates of X3 for fixed x1, X2, x4. It is valid for prediction,but we believe
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
31
M3
1.721m, + 0.809m4
3.611m5.
X3=
M3 =
0.174m, + 0.547m2.
X3 =
- 0.141xi + 0.525x2.
M3=
5.055m,
11.248m5.
X3
= 0.561xl
1.115x5.
32
GERHARD
TINTNER
any combination of them) would have served just as well: Cost of agricultural production, agricultural wage rates, rents, technological coefficients, etc.
Hence we have to be very careful in using the equations (29) and
(31) or (33) and (35) in economic policy. Apart from statistical shortcomings which will be discussed in the following section we must bear
in mind that they are not unique. A different choice of the identifying
variables would possibly have given entirely different results. This is a
consequence of the fact that we have replaced the complete Walrasian
system by a much more restricted model of general equilibrium.
F. Tests of Significance
We want now to show some tests of significance. Let us first investigate whether we have lost anything in the process of identification.
That is to say, we want to inquire whether t e substitution of the economically meaningful relationships (29) and (31) has been accompanied
by some statistical disadvantages as compared to the orthogonalized
and normalized, but economically meaningless, relationships (21) and
(22).
In order to make the comparison we observe that from Table 4 it
appears that the sum of squares A2=38.594. This represents 40 degrees
of freedom. On the other hand we have from Tables 5 and 6: A1= 6.394
and A1= 10.695 respectively. Each has 20 degrees of freedom. Their sum
is 17.089 and this represents 40 degrees of freedom. So we see that the
introduction of identification has actually improved our fit. We get for
a test of significance: F = 2.258 and this is significant since at the 5-percent level F= 1.74, at the 1-per-cent level F= 2.20. These tests are of
course only approximations.
Another question we can now answer is for instance the following:
Was it necessary to include the variable X4 (time) in our analysis? If
we modify the determinantal equation (18) by leaving out column 4
and row 4 we have: X1=0.799 and X2=1.115 (Table 7). The sum of
squares A2=44.022 and this is based upon 20 degrees of freedom.
Compare this with A2 from Table 4 which is 38.594 with 19 degrees of freedom. We have the test function: F = (44.022 -38.594) X
(19)/(39.594) (1) = 2.605. But we have F=4.38 at the 5-per-cent level
for 1 and 19 degrees of freedom. Hence the difference is not significant
and the inclusion of X4 has not significantly improved our fit. It should
be remembered that this test is also only an approximation.
Finally we will indicate an approximate test of significance for the
individual regression coefficients in the weighted-regression equations.
Let us first consider (29). The inverse matrix to the matrix of the
coefficients on the left-hand side of equations (28) is:
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
33
TABLE 10
MATRIX
INVERSE
(cii)
XI
XI
X2
00.006228
X2
-0.009069
0.015469
X4
0.022663
-0.036401
0.107582
X,3
From our data we compute the standard error of the weighted-regression coefficients in (29). We have for instance for the coefficient of
mln:V/ [0.0972(24.349) +0.4242(7.056) +6.82730.006228 [0.278/(24 - 4)1)
=0.027. The corresponding t' is -0.097/0.027=
-3.592. It is significant for the level of significance of 1 per cent and 20 degrees of freedom. The required t is 2.845. In a similar fashion we get t' = 10.095 for
the coefficient of M2 in (29) and t'=2.820 for the coefficient of m4. The
first is significant at the 1-per-cent level, the second only at the 5-percent level of significance. The tentative nature of these results has been
indicated above.
We can also determine fiducial limits (or confidence limits) for the
regression coefficients in (29). The coefficient of ml can fall into the
interval -0.153 and -0.041 if the fiducial coefficient is 95 per cent
(t=2.086 for 20 degrees of freedom). The coefficient for m2 can vary
under the same conditions between 0.336 and 0.512. The coefficient
of M4, finally, may fall between 0.081 and 0.545. All these results have
to be considered as only approximately true.
We note the rather large intervals that are permitted for the variation of the regression coefficients representing our demand curve. This
emphasizes the very tentative nature of our conclusions.
All the regression coefficients in the weighted regression equation (31)
turn out to be not significant at the 5-per-cent level of significance.
Hence it appears that we are not able to determine to any reliable degree the conditions of agricultural supply from our data. The reason
for this is probably that some factors that are most important for the
supply of agricultural commodities, like for instance the weather, have
not been included in our analysis. This emphasizes the observations of
Henry Schultz82 and E. J. Working83that in agricultural data there is
probably a stable demand curve and a shifting supply curve. We can
determine the first, but not the second.
All regression coefficients in (33) are significant at the 1-per-cent level
of significance (demand curve without time trend). The coefficients in
(35) are significant at the 5-per-cent level, but not at the 1-per-cent
82
83
34
GERHARD
TINTNER
level. We are not able to say very much about the supply curve with
any degree of accuracy.
G. Tentative Applications to Policy
We have emphasized before that the main reason why we have to
deal with problems of identification is that we need economically meaningful relationships for purposes of economic policy. We will tentatively
utilize some of our results in this fashion. It should be emphasized,
however, that these applications are only in the nature of illustrations
and should not be taken too seriously.
Let us consider our demand curve, which can be established with
some accuracy. We utilize first equation (29) (weighted regression with
time trend) to estimate the elasticity of demand (M3) with respect to
price (ml). This elasticity is evaluated at the means given in Table 1.
Hence it is only valid for conditions that are the same as the average ones in the period covered by our data (1920-43). The elasticity
with respect to price is: -0.097(127.417/100.625) = -0.123. That is to
say: Other things being equal, if the price increases by 1 per cent the
demand for agricultural commodities as a whole will decrease by approximately 0.123 per cent or about i of one per cent. This figure
should be compared with the elasticity derived under similar assumptions from the ordinary regression equation (30). This elasticity is
- 0.109. We believe for reasons indicated above that our estimate is
preferable.
We can also estimate the elasticity of demand with respect to price
from the weighted-regression equation (33) which does not include a
time trend. We get then for the price elasticity of demand: -0.222.
This should be compared to the elasticity of demand derived from the
ordinary regression equation (34), which is -0.179.
Further we can derive the income elasticities of demand. From equation (29), the income elasticity computed at the means in Table 1 is
0.307. That is to say: Other things equal and conditions approximately
the same as in the period 1920-43, an increase in national incomneby
1 per cent will make for an increase in demand for agricultural products
by about 0.307 per cent or not quite 3 of one per cent.
This income elasticity should be compared to the one derived from
the ordinary regression equation (30) which is 0.293.
Income elasticity can also be derived from the weighted-regression
equation without time trend (33). There it appears as 0.396. This
should be compared to the corresponding ordinary regression equation
(34). The income elasticity is there 0.378.
We can derive tentatively fiducial limits for the price elasticities
from our weighted-regression equations. They appear for equation (29)
MULTIPLE
REGRESSION
FOR SYSTEMS
OF EQUATIONS
35
Records,"
G. Tintner and
36
GERHARD
TINTNER