PRACTITIONER'S CORNER*: Logarithmic Dependent Variables and Prediction Bias

DOIhttp://doi.org/10.1111/j.1468-0084.1983.mp45004006.x
Date01 November 1983
AuthorPeter Kennedy
Published date01 November 1983
PRACTITIONER'S CORNER*
Logarithmic Dependent Variables and Prediction Bias
Peter Kennedy
It is common in econometrics to find estimation undertaken via a regres-
sion in which the regressand is the logarithm of the original dependent
variable. In many such studies interest focuses exclusively on the slope
coefficients and under the usual assumptions regarding how the data
were generated, the slope coefficient estimates are best linear unbiased.
But often attention is also paid to prediction of Y, the original depend-
ent variable. In such cases the usual method of estimating Y, taking the
exponential of the estimate j of y = ln Y produces biased predictions;
although this result has been known for some time, its application to
the prediction context has not been stressed, and an examination of
recent econometric studies suggests that practitioners do not realize
that this bias can in some circumstances be sufficiently large as to
warrant an adjustment. Such circumstances are most likely to arise in
the context f simulation studies, since the variance of the predicted y
can be large when prediction is undertaken outside the bounds of the
data set.
Consider for expository purposes the estimating problem:
Y = aX13 exp(e) (1)
where X is an independent variable and a and í3 are fixed parameters.
(Generalization to the case of Y = aX1X2 .. Xf,K exp(e) is straight-
forward. It should also be clear that a similar bias exists for the semi-
logarithmic estimating form). Exp() is assumed to be a lognormally-
distributed multiplicative error term with mean one, so that the mean
of Y conditional on X is aX13, the usual interpretation attached to such
algebraic specifications. (See, e.g., Goldberger (1968a), p. 3.) If is often
the case that researchers assume that Fe O, implying that the condi-
tional expectation of Y is EY = aX13 exp (a2). However, the results to
follow do not depend on which of these two specifications is adopted.
The model (1) gives rise to the log-linear estimating form:
y = y + ßx + e
* The purpose of Practitioner's Corner is to publish brief methodological notes of interest
to applied economists. The Editors welcome submissions of this sort.
389

To continue reading

Request your trial

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT