Chapter 10
Multicollinearity: What Happens If the Regressors Are Correlated?
341
consumption example discussed in the previous section (Example 10.2). This is a four-by-
four box diagram because we have four variables in the model, a dependent variable
(C) and three explanatory variables: real disposable personal income (Yd), real wealth (W),
and real interest rate (I).
First consider the main diagonal, going from the upper left-hand corner to the lower
right-hand corner. There are no scatterpoints in these boxes that lie on the main diagonal. If
there were, they would have a correlation coefficient of 1, for the plots would be of a given
variable against itself. The off-diagonal boxes show intercorrelations among the variables.
Take, for instance, the wealth box (W). It shows that wealth and income are highly corre-
lated (the correlation coefficient between the two is 0.97), but not perfectly so. If they were
perfectly correlated (i.e., if they had a correlation coefficient of 1), we would not have been
able to estimate the regression (10.6.6) because we would have an exact linear relationship
between wealth and income. The scatterplot also shows that the interest rate is not highly
correlated with the other three variables.
Since the scatterplot function is now included in several statistical packages, this diag-
nostic should be considered along with the ones discussed earlier. But keep in mind that
simple correlations between pairs of variables may not be a definitive indicator of collinear-
ity, as pointed out earlier.
To conclude our discussion of detecting multicollinearity, we stress that the various
methods we have discussed are essentially in the nature of “fishing expeditions,” for we
cannot tell which of these methods will work in any particular application. Alas, not much
can be done about it, for multicollinearity is specific to a given sample over which the
researcher may not have much control, especially if the data are nonexperimental in
nature—the usual fate of researchers in the social sciences.
Again as a parody of multicollinearity, Goldberger cites numerous ways of detecting
micronumerosity, such as developing critical values of the sample size,
n
*
,
such that micron-
umerosity is a problem only if the actual sample size,
n,
is smaller than
n
*
.
The point of
Goldberger’s parody is to emphasize that small sample size and lack of variability in the
explanatory variables may cause problems that are at least as serious as those due to
multicollinearity.
Do'stlaringiz bilan baham: