Keywords: Measurement error models, news and noise
Abstract:
This paper proposes a simple generalization of the classical measurement error model and studies its implications for ordinary least squares (OLS) regression. The usual model starts with the true variable of interest and adds noise, which we call the classical measurement error (CME) term; see Fuller (1987) or virtually any econometrics textbook. The generalization discussed here incorporates a different kind of measurement error that subtracts signal from the true variable; this new error term is called the Lack of Signal Error, or LoSE for short. This additional term adds some much-needed flexibility to the classical measurement error model: it allows the mismeasured variable to have either more or less variance than the true variable of interest, in contrast to the classical model which imposes that the mismeasured variable have greater variance. This restriction does not hold in some important applications in macroeconomics and elsewhere.
The implications of LoSE for OLS regression are opposite the usual intuition about measurement error, which is applicable to CME only. The CME intuition says that measurement error in the dependent variable of a regression poses no real problems for standard estimation and inference. Parameter estimates are unbiased and consistent, while hypotheses are more difficult to reject because CME increases the variance of regression residuals and parameter estimates; inference under these circumstances has a cautious slant. CME in the explanatory variables causes the real problems for OLS regression, namely attenuation bias and inconsistency. However with LoSE these results are reversed. For the baseline case considered here, LoSE in the explanatory variables produces no bias or inconsistency, and similar to CME in , LoSE in boosts the variance of regresson residuals and hence standard errors. And, LoSE in the dependent variable introduces an attenuation-type bias and inconsistency into the regression under some circumstances (in particular, when the explanatory variables contain some signal missing from the dependent variable). In addition, LoSE in shrinks the variance of the regression residuals, thus shrinking parameter standard errors compared to what they would be without this type of mismeasurement. Standard errors are potentially misleadingly small, complicating inferences about the relation between true and . Under these conditions, a more cautious approach to inference than has been taken in the past may be warranted.
Mismeasurement in many types of data used for empirical work in economics and other disciplines may be better described by the generalized measurement error model with LoSE than by the pure classical measurement error model. This paper focuses on US macroeconomic quantity data such as gross domestic product (GDP) and gross domestic income (GDI), which attempt to measure the same underlying concept using different source data; see Fixler and Nalewaik (2007) and Nalewaik (2007a). These data pass through numerous revisions, and the more poorly-measured initial estimates have less variance than the revised estimates, providing a concrete example of measurement error that cannot be CME; see also Mankiw and Shapiro (1986). Section 2 of the paper makes this point while motivating the generalization of the CME model outlined here.
Section 3 of the paper discusses the nature of the source data used to compute US macroeconomic quantity data, and points out some reasons why LoSE may be present in the estimates even after they have passed through all their revisions. The fact that GDP growth should equal GDI growth, but does not in the fully revised quarterly or annual frequency data, proves that some mismeasurement remains in either GDP or GDI growth. The evidence in Fixler and Nalewaik (2007) and Nalewaik (2007a,b) supports the notion that this mismeasurement is largely LoSE, with more LoSE in GDP growth than in GDI growth. Those arguments are recounted briefly in this section, and some simple calculations comparing GDP and GDI growth show that the LoSE in GDP growth is likely substantial: after 1984, at least 30% of the variance of the true growth rate of the economy appears to be missing.
Some of the implications of substantial LoSE in GDP growth are fairly obvious. Realizations of GDP growth are simply less informative about the true growth rate of the economy than many macroeconomists currently believe, given the common but incorrect presumption that fully-revised GDP growth is measured with little error. In a macro forecasting context, true forecast errors are larger, on average, than forecast errors computed using data mismeasured with LoSE. The implications for regression estimation are more complicated, and working out those implications is the main contribution of the paper. This is done in section 4, which illustrates the effect of LoSE in the dependent and explanatory variables under different sets of assumptions. For concreteness, examples of popular regressions from macroeconomics that may conform to each set of assumptions are provided.
In a wide variety of econometric specifications employed in the macroeconomics and finance, variables like GDP growth, investment growth and consumption growth are regressed on asset prices - interest rates, stock price changes, exchange rate changes, etc. These regressions are of particular interest because asset prices potentially capture some signal missing from the mismeasured quantities, implying attenuation-type biases in the coefficients. Section 4 tests for these biases, regressing different output growth measures contaminated with more or less LoSE on a fixed set of stock or bond prices. When the dependent variable is contaminated with more LoSE, the regression coefficients are smaller, and the differences across regressions are often statistically significant. For example, the coefficients increase when we switch the dependent variable from the early GDP growth estimates based on limited source data to later GDP growth estimates based on more-comprehensive data. Tellingly, the coefficients increase again when we switch the dependent variable from GDP growth to GDI growth. The hypothesis that measurement error in the dependent variable does not bias OLS regression coefficients, a core piece of conventional wisdom in the profession, is rejected by the data, just as the paper predicts if the measurement error is LoSE. Section 5 concludes the paper.
Let be the true value of the variable of interest, while is an vector of possibly stochastic variables used to construct . The mismeasured estimate of the variable of interest is . In many cases a government statistical agency or some other organization computes based on information from surveys, administrative records, and other data sources (source data for short); then will be variables drawn from the source data, possibly including non-linear functions of the original source data.
Under the classical measurement error model,
Under the generalized model of mismeasurement considered here, the mismeasured estimate is as in Fixler and Nalewaik (2007):
(1) |
Define the deviation of the variable of interest from its conditional expectation as:
(2) |
(3) |
Since the CME is independent of , it is naturally independent of the LoSE as well. However the LoSE is clearly correlated with , with
in fact. Taking variances of (3):
(4) |
The restrictions imposed by the CME model on variances and covariances have obvious drawbacks, as it is easy to think of hypothetical and actual counterexamples. As a hypothetical counterexample, assume that has positive variance, while the estimate is just a constant for all . The estimate is clearly mismeasured, but the classical measurement error model cannot handle this case. The growth rates of US GDP and GDI provide an actual counterexample. These estimates pass through numerous revisions that plausibly reduce measurement error, since they incorporate more comprehensive and higher-quality source data. For example, suitable source data is simply unavailable for many components of the "advance" current quarterly GDP estimate released about a month after the quarter ends. Source data for some of those components is incorporated into the revised "final" current quarterly estimate released about two months later, and higher-quality data are incorporated at subsequent annual and benchmark revisions, likely bringing the estimate closer to its true value.1 Then an early estimate of GDP growth or GDI growth can be modelled as a later revised estimate (the analog to ) plus a measurement error term, which disappears with revision. Table 1 shows that the initial estimates have less variance than the revised estimates, violating the variance restrictions of the CME model.2 The generalized model here implies that the bulk of the measurement error is LoSE, as noted by Mankiw and Shapiro (1986).
While the generalized model here is less restrictive than the CME model, some restrictions do remain. Writing:
(5) |
Before concluding this section, it is worth emphasizing that need not be an exhaustive information set - i.e. it need not contain all available relevant pieces of information about . Resource and other constraints certainly preclude this from being the case, and the sections below considering the implications of LoSE allow for this possibility.
Each estimated growth rate of a macro quantity such as gross domestic product (GDP) is an attempt at measuring the change in the value of all relevant economic transactions, in the entire economy, from one fixed time period to the next. For an entity as large as the U.S. economy, this is a daunting, almost mind-boggling task, as the number of transactors and transactions is typically enormous, with little or no information recorded about many of them at high frequencies. Attempts to measure changes in these macro quantities are much more ambitious than attempts to measure similar changes for a single person, household, or even company. Simply due to their broad, universal nature, estimates of macro quantities are likely to miss more information - i.e. be contaminated with more LoSE - than are estimates of micro quantities (although some micro data sources may be contaminated with LoSE as well).6
Of course, the nature of the available source data determines the information content of macro quantity growth rates of interest, and frequency is important in this regard in the case of data from the U.S. National Income and Product Accounts (NIPA).7 The most comprehensive data on GDP and other major NIPA aggregates are only available at the quinquennial frequency (every five years), at the time of the major economic censuses. Even then, resource constraints make true census counts impossible. Many transactions in the underground economy remain unobserved and must be estimated, and some "above-ground" transactions are simply missed by any census.8 At the annual frequency, the GDP source data are typically samples drawn from the census universe. These samples can be quite large, capturing a sizeable fraction of the relevant value of transactions, but they are typically skewed towards measuring the transactions of larger businesses. As such, they may miss variation arising from the transactions of small companies and from businesses starting, shutting down, and operating in the underground economy. The lack of representation of these segments of the economy may add or substract variance to the official estimates, depending on the relative variance of the non-measured segments and their covariance with the measured segments, but this mismeasurement has the potential to add some LoSE to the data. At the annual frequency, and also at the quarterly frequency to some extent, government tax and administrative records are used as an additional source of information about the value of transactions, especially on the income (GDI) side of the accounts. These data can be informative, but underreporting makes them less than fully comprehensive.
At the quarterly and monthly frequency,9 reliance on samples is more pronounced, and the samples are less comprehensive. Smaller samples introduce larger sampling errors, which have traditionally been thought of as introducing CME into the estimates. The samples are typically random, after all, so part of the difference between the population and sample moments is likely random variation uncorrelated with the variation in the population moments. However smaller samples may introduce some LoSE as well, since smaller samples are simply less informative than larger samples: when different segments of the economy behave quite differently, small samples which are not fully representative may miss variation arising from some segments.10In addition, usable data on the value of transactions at a frequency higher than annual is simply unavailable for a substantial share of some NIPA aggregates such as GDP; many of the services categories of personal consumption expenditures (PCE) lack usable source data, for example.11 Quarterly and monthly growth rates are typically interpolated from annual totals, or estimated as "trend extrapolations." The lack of hard information for these categories must introduce some LoSE into the quarterly and monthly estimates.
Some users of quarterly or annual US NIPA data take the view that the variance of CME is negligible after the data have passed through its sequence of revisions, particularly for the more highly aggregated NIPA quantities like GDP growth where any remaining CME variance in its subcomponents may be diminished by averaging. Absent knowledge of the possible existence of LoSE, this view would imply that the variance of overall mismeasurement is close to zero, since in the pure CME model.
However there is ample evidence that the variance of overall mismeasurement in the most aggregated NIPA aggregate, GDP, is not close to zero, especially since the mid-1980s. GDI is an alternative estimate of the same quantity, so examining the relation between GDP and GDI provides some direct evidence on mismeasurement. Table 2 shows that, prior to 1984, the variance of each estimate is close to the covariance between the two, for both annual and annualized quarterly growth rates. The two estimates diverge very little, providing little direct evidence of mimeasurement. However after 1984, when the variance of both estimates drops dramatically (see McConnell and Perez-Quiros (2000)), the correlation between the estimates also falls, as the covariance falls relative to the variances on average. This is especially true for the quarterly growth rates, where the correlation falls from 0.93 to 0.68.12Interestingly, the variance of GDI growth also increases relative to the variance of GDP growth. Under the generalized CME model of section 2, this relatively large GDI variance may stem from some combination of two possible sources: (1) a relatively large amount of CME in GDI growth, boosting its variance, and (2) a relatively large amount of LoSE in GDP growth, damping its variance. The evidence favors the latter as the more important source of mismeasurement.
First, consider the results in Nalewaik (2007a), who estimates a two-state bivariate Markov switching model where mean GDP and GDI growth switch with the state; the low-growth states identified by the model encompass recessions as defined by the NBER. The conditional variance of GDI in that model, conditional on the estimated state of the world, is actually slightly lower than the conditional variance of GDP, despite the fact that the unconditional variance of GDI growth is higher. The higher unconditional variance stems from GDI growing faster than GDP in high-growth periods and slower than GDP in slow-growth periods in and around recessions. In other words, GDI growth appears to contain more signal about the state of the world than GDP growth: the larger spread between its high- and low-growth means implies greater informativeness about the state. Greater signal in GDI growth implies some LoSE in GDP growth, relatively more than in GDI growth.
Second, table 1 shows that the variance of GDI growth becomes relatively large only after the data pass through annual and benchmark revisions; in the earlier current quarterly estimates, the variance of GDP growth actually exceeds the variance of GDI growth. Since the revisions plausibly bring estimated GDI growth closer to the truth, they must either reduce LoSE, which would increase its variance, or reduce CME, which would decrease its variance. The increase in variance from the revisions, then, is likely a decrease in LoSE, suggesting the relatively large variance of revised GDI stems from relatively less LoSE. The annual and benchmark revisions appear to add more signal to GDI growth than GDP growth, increasing the variance of GDI growth relative to GDP growth.13 Fixler and Nalewaik (2007) discuss the revisions evidence in more detail, testing the hypothesis that the idiosyncratic variation of GDI growth is purely CME and rejecting at conventional significance levels. This again implies some LoSE in GDP growth.
To get a sense of the magnitude of the potential variance missing from GDP growth due to LoSE, assume that the CME variance in each estimate is negligible, so the differences between GDP and GDI growth stem entirely from differential LoSE:
and: | |||
and the covariance is: | |||
and: | |||
so: | |||
Going forward, if these post-1984 variances and covariances are the norm, the implications of a potentially non-trivial amount of LoSE in macro data such as GDP growth should be taken seriously. For estimation and inference, the post-1984 portion of many samples will become increasingly large and important. The next section explores the implications of LoSE for estimation and inference using the most ubiquitous tool in econometrics: OLS regression.
Consider ordinary least squares estimation of the relation between a mismeasured variable and a
set of mismeasured explanatory variables , using a
sample of length . When stacking together the observations, time subscripts are dropped for convenience:
(6) | |||
(7) |
and: | (8) | ||
(9) |
The inconsistency of can be corrected by instrumenting with a set of instruments , with , if the instruments meet the following set of assumptions:
(10) |
cross terms |
Given the traditional focus on mismeasurement in on regression estimation, we begin with this subsection making the following assumption, in addition to assumption 1:
The OLS regression estimator in this case is:
Of course, the CME in produces the usual attenuation bias. By way of review, and for comparison with later results:
and: | (11) | ||
(12) |
To focus more tightly on the implications of LoSE, the remainder of this subsection considers the case of no CME in :
Since nominal asset prices are measured with virtually no error, they provide a candidate variable meeting the assumptions of this section.16 Given the evidence from the prior section indicating that macroeconomic quantities are measured with LoSE, a specification such as the human capital CAPM, essentially a regression of stock prices on labor income growth, may be well represented by the regression model developed here.
In addition to assumption 1, this subsection makes the following assumptions:
Since
, we have:
in this case. The LoSE impacts only , so
, and
. The OLS regression estimates
as:
The standard errors around the point estimates are more interesting. For the variance of the point estimates,
, and:
LoSE in the dependent variable has an opposite effect, decreasing the variance of the regression residuals and parameter estimates. Measurement error of this type actually increases the power of hypothesis tests. Is more power a good thing? I would argue not. If a hypothesis is rejected at a prescribed significance level, an immediate concern is that the rejection was due to LoSE-induced power. If the data were free of mismeasurement, the variance of the parameter estimates would be larger and the rejection may not have occured.
Note that the parameter variances here are correct for the parameters governing the relation between mismeasured and , and are not biased down in the same sense as they would be if, for example, we ignored positive autocorrelation in the residuals of the regression. However in most applications the parameters of interest are those governing the relation between true and , and the econometrician uses the mismeasured data to make inferences about that true relation out of necessity, because the mismeasured data are all that is available. Under the assumptions of this subsection, the parameters estimated using the mismeasured data are unbiased and consistent for the parameters governing the true relation, but the parameter variances are smaller than they would be without the LoSE in . Standard errors computed using data mismeasured in this way then give a misleading sense of precision about the relation between true and , and the power of hypothesis tests is misleadingly large.
An example clarifies some of these issues. Imagine a situation where the econometrician has access to true , , and a list of other variables that are orthogonal to but related to . Hypotheses about the relation between and are of interest. Testing hypotheses using parameter estimates and standard errors from a regression of on is the natural and correct way to procede. But consider the following two step procedure: (1) regress on and some subset of , and compute predicted values which we call , and then (2) regress on , testing hypotheses about the relation between and using standard errors from this second regression. The parameter estimates in the second regression are the same as in a regression of on , but the standard errors are smaller and tests have greater power because the first stage generates LoSE in . In fact, by making arbitrarily small, the standard errors can be made arbitrarily small and any hypothesis may be rejected; setting to the null set drives the standard errors to zero. Artificially generating LoSE in in this fashion is clearly ridiculous, and makes hypothesis tests meaningless.
In reality, if the data meet the conditions of this section, the econometrician starts out in stage (2) of this two step procedure, sadly without access to . Something analogous to step (1) has already taken place with the creation of the data , by a government statistical agency or some other entity. The main issue is not materially different if it is a government statistical agency rather than the econometrician who has generated the LoSE in .
If the variance of LoSE were known, the most prudent course of action would be to increase the estimated variance of by . This essentially brings us back the pure CME case, where econometricians are comfortable using standard procedures for inference.19Unfortunately, is typically unknowable; the results in sections 3.2 and 4.3.1 for measured GDP growth provide only an approximate lower bound on . The bottom line is that inferences are fundamentally less definitive when the dependent variable is contaminated with LoSE.
In a forecasting context, it should be noted that the LoSE also shrinks the variance of out-of-sample forecast errors. The actual variance of the out-of-sample forecast error for the true variable of interest, , with estimated using mismeasured , is . However the LoSE reduces this variance by , and if the increase in variance from CME does not offset this increase, the forecast errors computed using mismeasured give a misleading sense of precision: the deviations of from the forecasts are larger than those mismeasured forecast errors suggest. For example, table 1 shows that for GDP growth forecasts, forecast errors computed using the "advance" GDP growth estimates will be smaller, on average, than the true forecast errors.
Regressions of mismeasured macroeconomic quantities like GDP growth on time trends, dummies, and other deterministic variables certainly meet the conditions outlined in this subsection. These types of regressions are run most frequently in a forecasting context, where the primary goal is to forecast , not to estimate the deep structural parameters of an economic model relating to .20 Many other types of variables that are mismeasured in the latter context can be treated as measured without error in the former forecasting context, where the explanatory variables are just forecasting tools. A good example is source data used to estimate (certainly in the relevant information set). The source data may be a mismeasured estimate of some other variable , but is not relevant when using solely as a forecasting tool; can be treated as an estimate of itself measured without error. Another potential example is lags of the dependent variable;21 lagged is by definition a mismeasured estimate of lagged , but this is irrelevant for purposes of pure forecasting. The lags are certainly in the information set of the agency estimating , although it is not necessarily clear to what extent the estimating agency relies on the lags in computing current period estimates. If the agency uses the lags optimally in constructing the estimates, or if the lags contain no information above and beyond the source data used to compute the current period estimates, then the lags meet this subsection's conditions for variables in the context of a pure forecasting regression. Any variable that adds no information about the dependent variable above and beyond that contained in its source data would qualify.
In addition to assumption 1, this subsection makes the following assumptions:
(13) | |||
(14) | |||
Instruments that meet the conditions of assumption 2 in this case are those for which
converges in probability to zero, for example if
, so that is independent of the information about missing from . Instruments typically thought of as valid based on other
considerations may not meet this condition; see the discussion of Euler equation estimation at the end of the section. The asymptotic distribution of the IV regression estimates
is:
A wide range of regression specifications may employ explanatory variables that are not mismeasured and contain information about the dependent variable above and beyond that contained in the source data used to estimate the dependent variable. Some such specifications are employed for forecasting using , some are employed to estimate the structural parameters of an economic model relating to , and some are employed for both purposes; the bias results apply in all these situations as long as is the object of interest.
Regressions of macroeconomic quantities on asset prices are particularly interesting examples likely to meet the conditions of this subsection. It is entirely possible that some of the variation missed by LoSE-contaminated macroeconomic quantities appears in asset prices. Of course, some variation in asset prices likely arises from misinformation, rational or irrational bubbles, and other factors unrelated to fundamentals, but this does not imply that other more informative types of variation are not present as well. Dynan and Elmendorf (2001) and Fixler and Grimm (2006) show that asset prices predict revisions to economic aggregates like GDP growth, showing that asset prices contain information missed by the earlier vintages of the government estimates. Asset prices may contain information missed by the fully revised, later-vintage estimates as well, and that information may appear in asset prices in at least two ways.
First, there almost surely exists publicly-available information about the state of the economy that is not fully incorporated into GDP growth or its subcomponents, information which is observed by the vast majority of asset market participants and is thus likely incorporated into asset prices. The various pieces of data used to compute GDI may be part of this information, including GDI's source data on corporate profits and employee compensation. Not all of this information appears to be incorporated into measured GDP growth, as section 3.2 illustrates, and financial markets certainly react to this information; see Faust et al (2003), for example.
Second, asset prices aggregate the private information of vast numbers of market participants, private information that is likely correlated with current or future economic activity. For example, the stock price of a company may reflect numerous pieces of private information about that company's cash flow prospects. Aggregating across all firms, the idiosyncratic variation in firms' stock prices averages out, so an aggregate stock market index contains the signal about aggregate economic activity dispersed in all the pieces of private information.22 Bond prices and exchange rates are fundamentally tied to economic activity, with market participants placing bets with real money about current and future economic prospects; see Evans and Lyons (2005, 2007) for a description of how private information about the economy becomes embedded in exchange rates through the market's filtering of order flow information.
This points to parameter estimates from regressions of macro quantities on asset prices that are biased towards zero. To examine this, consider a regression of several of our quarterly output growth measures on current and lagged growth rates of the Wilshire 5000 stock price index; Fama (1990) studied a similar specification.23 The relation between economic growth and lagged stock prices can be interpreted in at least two non-mutually-exclusive ways. First, stock prices may be responding to news about current and future economic growth and its effect on expected cash flow to firms.24 Second, stock price variation may have a causal effect on current and future economic growth, through wealth effects on consumer spending, for example. For our purposes here, differentiating between these two stories is not necessary; it suffices that a relation between true GDP growth and stock prices does exist, governed by a true parameter vector .
The first column of table 3 shows regression results from the post-1984 sample, using as the dependent variable the time series of "advance" GDP growth estimates. Recall that since a substantial amount of data is missing for the "advance" GDP estimate, it is likely contaminated with a particularly large amount of LoSE. The second column shows results using the latest available estimates of GDP growth, which have passed through revisions incorporating more-comprehensive data, reducing LoSE and increasing the variance of the estimates (see table 1).
In comparing these two columns of table 3, first note that each stardard error in the second column exceeds its counterpart in the first (these are Newey-West (1987) corrected for heteroskedasticity and second-order autocorrelation). The LoSE in "advance" GDP growth biases down the standard errors. Second, note that most of the coefficients in the second column exceed those in the first.25 The last column reports the sum of the coefficients and its standard error, giving a sense of the average downward bias in the coefficients in the first column relative to the second. The difference between the sums in the two columns, 0.072, is statistically significant, with a standard error of 0.036 correcting for cross-correlation and second-order cross-autocorrelation between the two sets of residuals. We reject the hypothesis that the LoSE in "advance" GDP growth does not bias down the regression coefficients. The size of the downward bias is about what we would expect given the summary statistics in table 1: if all the variance in each estimate is signal, the ratio of the variance of the two GDP growth estimates provides a reasonable guess as to the expected size of the downward bias.26 This signal-to-signal ratio is about three-fourths, with the ratio of the sum of the coefficients about two-thirds.
The third column of table 3 reports results using latest GDI growth as the dependent variable. Most striking is the large increase in the sum of the regression coefficients compared to column two, 0.116 with a standard error of 0.040. A straightforward interpretation of this result is that GDI growth contains more information about true output growth than does GDP growth, leading to a greater LoSE-induced attenuation bias in regressions using GDP growth. However, direct measures of corporate profits are included in estimates of GDI growth, opening up other possible interpretations.
One alternate interpretation is that estimates of corporate profits are noisy, and stock prices react to some of that noise. Then is positively correlated with in regressions using GDI growth as the dependent variable, biasing the coefficients up. If this were the case, the problem should be more severe for the early estimates of GDI growth, before profits have been benchmarked to administrative and tax records. Yet the fourth column of table 3 shows that the sum of the coefficients using the early estimates of GDI growth is less than half the sum of the coefficients using the revised estimates released several years later; this noise interpretation does not fit the facts.
A second alternate interpretation is that GDI growth contains more information about corporate profits than does the profits variation implicit in measured consumption, investment, and the other components of GDP growth, but that superior information about profits does not translate into superior information about output growth. This could occur if the profits information is negatively correlated with the remainder of GDI growth. Since profits are likely the relevant variable for determining stock prices, relatively large regression coefficients may occur using GDI growth even if it does not contain superior information about output growth. This is unlikely for a number of reasons, but can be examined most directly by regressing the growth rate of GDI minus corporate profits (deflated by the GDP deflator) on the stock price changes. If this alternate interpretation is correct, stripping out profits should reduce the sum of the coefficients, but the last column of the table shows that the sum of the coefficients actually increases.27
As tables 1 and 2 show, the ratio of the variances of latest GDP growth over latest GDI growth is 0.87, but the ratio of the sum of the coefficients in table 3 is considerably less, again about two-thirds. One possible explanation is that not all of the variance of GDP growth is signal; some noise in GDP growth would bring down the signal-to-signal ratio of GDP to GDI growth closer the observed downward bias in the regressions. To investigate this, table 3A switches to a univariate regression framework where the explanatory variable is the average stock price change over the current and six previous quarters; table 3B then reverses the regression, using the average stock price change as the dependent variable. The coefficient using GDP growth as the explanatory variable is about three-fourths the size of the coefficient using GDI growth, an attenuation bias that is indicative of noise in GDP growth. While the statistical significance of the difference between the slopes is marginal (the 0.146 difference has a standard error of 0.088), this evidence is suggestive.
Assuming that one-fourth of the variance of GDP growth is noise, then GDP growth captures at most 64 percent of the variance of true GDP growth, recomputing the upper bound on from the end of section 3.2. In this case the variance of signal in GDP growth is about equal to its covariance with GDI growth, and the ratio of this signal variance to the variance of GDI growth gives the upper bound. If all information about output growth is reflected in GDI growth, so , this bound holds and the coefficients in the third column of table 3 are the true parameter vector . However, if some information about true output growth is missing from GDI growth, these coefficients are themselves biased down, and unfortunately we do not know the size of this downward bias.
These results using stock prices are largely confirmed by regressions of the different output growth measures on bond prices, as shown in table 4. The explanatory variables are TERM, the difference in yield between 10-year and 2-year treasury notes, and DEF, the difference in yield between corporate bonds and 10-year treasury notes.28 Numerous papers have used similar variables to forecast output growth; see for example Chen (1991) and Estrella and Hardouvelis (1991). The table examines regressions at forecasting horizons ranging from one- to eight-quarters ahead; DEF has substantial explanatory power at shorter horizons, while TERM shows some explanatory power at longer horizons. All of the coefficients except one and all of the standard errors increase when we switch from "advance" GDP as the dependent variable to latest GDP. Switching from latest GDP to latest GDI, the coefficients again all increase, except for TERM at the one- and two-quarter ahead horizons when its statistical significance and marginal explanatory power are weakest. The last column reports p-values from an F-test of equal coefficients from the GDP and GDI regressions; equality is rejected at the three- and four- quarter ahead horizons. Similar results obtained from univariate regressions using either TERM or DEF, although the standard errors around the TERM coefficients were larger making definitive statements from those regressions difficult. Using DEF as the dependent variable in reverse regressions, coefficients were smaller using GDP growth as the explanatory variable than using GDI growth, supporting evidence of some noise in GDP growth. The coefficients using GDP growth were between 12 and 42 percent smaller, depending on horizon.
The evidence here and in section 3 shows that GDP growth at the quarterly frequency is contaminated with significant LoSE, implying its major subcomponents are contaminated with significant LoSE as well. As discussed in section 3.1, a large fraction of consumption lacks any real source data at the quarterly frequency, so this component is likely to be particularly contaminated. Consider the implications for Euler equation estimates of the relation between quarterly macro consumption growth and interest rates ; see Campbell and Mankiw (1989). True consumption growth may have substantial covariance with interest rates, but mismeasured consumption growth misses some of this variation, biasing the OLS regression coefficient towards zero. Lagged variables such as lagged interest rates are almost universally assumed to be valid instruments in estimating the Euler equation, and they may be valid for dealing with expectational errors and some other forms of endogeneity. However if interest rates contain information about actual contemporaneous consumption growth missed by measured consumption growth, lagged interest rates likely contain just as much if not more of this missing information, since interest rates are basically forward-looking. Instrumenting current interest rates with lagged interest rates does not meet Assumption 2, and the instrumental variables parameter estimates remain biased towards zero.
Again for simplicity, and to focus on the effects of LoSE, this section considers the case of no CME in , so assumption 4 holds, as well as assumption 1. Three special cases are illuminating. The first is where the information sets used to construct and coincide in the universe of variables correlated with , so (see footnote 14). Then , so their difference in (5) and (6) disappears, leaving unbiased and consistent regression parameter estimates. The variance and asymptotic distribution of , and the probability limit of , are as in subsection 4.2. The main concern under these circumstances is the shrinking effect of LoSE on standard errors.
The second illuminating case is where , so contains all the information about in , plus additional information. The difference is independent of ; substituting this difference for in subsection 4.3 then leaves the results of that section unchanged. The estimate is biased and inconsistent, with the bias towards zero; some variation in measured that appears in is missed by measured , biasing down the covariance between and relative to the variance of . Valid instruments must be in the information set used to compute the more-poorly measured .
The last illuminating case is where contains all the information about in plus additional information, so . Then is independent of and , and if this difference replaces in subsection 4.1, the results in that subsection carry over to this case, except LoSE in shrinks the error and parameter variances. The estimates are unbiased and consistent.
These cases should help provide some intuition about the potential effects of LoSE in particular regression applications where the econometrician has some knowledge of the relative degree of mismeasurement in the explanatory and dependent variables. For each application, whether , , or provides the best description of reality determines which results are most relevant, those from subsection 4.1 (augmented with LoSE in ), 4.2, or 4.3. For example, the extent of any bias in the parameter estimates depends on the degree to which the mismeasured explanatory variables contain signal missing from the dependent variable.
Regressions of mismeasured macro quantities on other mismeasured macro quantities meet the conditions of this subsection, when the goal is to estimate the relation between and , the true values of the quantities. The permanent income hypothesis (PIH), for example, is about the relation between the true values of consumption and income, not mismeasured estimates of consumption and income, so regressions of macro consumption on macro income that attempt to estimate the PIH parameters meet the conditions of this subsection. The evidence in sections 3.2 and 4.3.1 suggests that the results on bias may be operational in such a PIH regression.29
This paper has shown that the canonical classical measurement error (CME) model is too restrictive to handle some important cases of mismeasurement. That model assumes that the mismeasured variable of interest is equal to the true variable plus a noise term uncorrelated with the true variable, implying that the variance of the mismeasured variable must exceed the variance of the true variable. It is easy to think of hypothetical examples where this implication is not met, and some important examples of mismeasurement in macroeconomic time series do not meet this implication either. For example, time series such as US GDP growth and GDI growth pass through numerous revisions, and the first-available vintage of a variable can be thought of as a mismeasured estimate of subsequent revised vintages. The variance of the first-available vintage is often less than the variance of better-measured subsequent vintages, a form of mismeasurement that does not fit into the CME box.
The paper proposes a simple generalization of the CME model that is mathematically tractable, embeds the CME model as a special case, and adds useful flexibility, allowing the mismeasured variable to have either more or less variance than the true variable of interest. The key to the generalization is that instead of just allowing mismeasurement that adds noise to the true variable of interest, it also permits mismeasurement that subtracts signal; I label this reduction of signal from mismeasurement the Lack of Signal Error, or LoSE for short.
In some ways, this generalization of the CME model provides the second half of the story about errors in variables and their effect on ordinary least squares (OLS) regression, as the results here exhibit a symmetry that is intuitively pleasing. CME in the dependent variable of a regression does not bias parameter estimates, and its only effect is to increase standard errors compared to what they would be without the mismeasurement. Given this lack of bias, and the caution in rejecting hypotheses introduced by the relatively large standard errors, the consensus in the profession is that CME in the dependent variable poses no serious problems for OLS regression and inference. LoSE in the explanatory variables , has the same effect on OLS regression: the LoSE does not introduce bias into the parameter estimates, and it increases their standard errors compared to what they would be without the mismeasurement.
CME in the explanatory variables , of course, does cause problems for OLS regression, namely bias in the parameter estimates, towards zero in the univariate case. LoSE in the dependent variable introduces a similar bias under some circumstances: when some of the signal missing from the dependent variable is captured by the explanatory variables , such an attenuation-type bias appears. LoSE in is also problematic since it shrinks the variance of the regression residuals, decreasing the size of standard errors compared to what they would be without this type of mismeasurement. This raises the possibility that some rejections of hypotheses may stem from the existence of LoSE in , so without this measurement error, the hypotheses would not be rejected. Under such circumstances, additional caution in making inferences about the relation between true and seems appropriate, as conventional standard errors may be misleadingly small.
The paper provides a taxonomy of different types of LoSE and their impact on OLS parameter estimates and standard errors, and provides examples of common regressions in macroeconomics that may fit into the different categories considered. The paper reviews recent evidence in Fixler and Nalewaik (2007) and Nalewaik (2007a,b) that US GDP growth is mismeasured with LoSE even after the data has passed through all its revisions. A comparison with GDI growth yields a lower bound on the variance of LoSE in GDP growth: since the mid-1980s, quarterly or annual GDP growth has captured at most 70% of the variance of the true growth rate of the economy. US GDP growth and its subcomponents like consumption growth have served as the dependent variables in many regression studies in macroeconomics and finance; the potential for biases stemming from mismeasurement of the dependent variable, and misleading standard errors - these issues have not been contemplated in a serious way prior to this paper.
Asset prices are a set of variables that may capture some of the signal missing from GDP growth and its subcomponents, implying attenuation-type biases in regressions of the mismeasured quantities on those prices. The empirical results here confirm that. The government's first estimates of GDP growth are likely contaminated with more LoSE than their revised GDP growth estimates, and those GDP growth estimates are likely contaminated with more LoSE than GDI growth. In regressions of these different measures of output growth on either stock prices or bond prices, the measures of output growth contaminated with more LoSE have smaller coefficients, and the changes in the coefficients across regressions are often statistically significant. The set of explanatory variables is fixed from regression to regression; the only thing changing is the degree of measurement error in the dependent variable. We reject the CME intuition that measurement error in the dependent variable does not bias regression coefficients. It can cause biases if it is LoSE, and does cause biases in this important set of regressions of macro quantities on asset prices.
On a positive note, the results derived here provide some clear prescriptions for handling different types of mismeasurement, in terms of choice of instruments, and also choice of which variable is dependent , and which is explanatory . As an example, consider estimation of an Euler equation relation between macro data on consumption growth and the interest rate. If consumption growth is mismeasured with LoSE, and the interest rate is free from mismeasurement as it may be under some circumstances, then the results here recommend using the interest rate as the dependent variable. The common practice of using consumption growth as the dependent variable, and instrumenting the interest rate with lagged interest rates, does not produce consistent estimates when consumption growth contains LoSE. Such insights are opposite the current conventional wisdom in the profession. The generalized measurement error model with LoSE is likely applicable in a wide variety of econometric specifications beyond the few considered here, and our results should provide helpful insights for making appropriate modifications to econometric practice.
Vintage | ||
---|---|---|
Current Quarterly, "Advance" | 3.1 | . |
Current Quarterly, "Final" | 4.1 | 4.0 |
Latest Vintage Available | 4.2 | 4.8 |
Upper bound on: | ||||
---|---|---|---|---|
Quarterly, 1947-1984Q2 | 24.1 | 24.1 | 22.4 | 0.93 |
Annual, 1947-1984 | 8.2 | 8.5 | 8.2 | 0.97 |
Quarterly, 1984Q3-2004 | 4.2 | 4.8 | 3.1 | 0.70 |
Annual, 1985-2004 | 1.6 | 2.6 | 1.9 | 0.69 |
Measure & Vintage: | ("Advance") | (Latest) | (Latest) | ("Final") | (Latest) |
---|---|---|---|---|---|
: | 0.012 | 0.014 | 0.032 | 0.031 | 0.003 |
(standard error) | (0.019) | (0.027) | (0.026) | (0.023) | (0.025) |
: | 0.048 | 0.054 | 0.084 | 0.038 | 0.083 |
(standard error) | (0.018) | (0.024) | (0.023) | (0.023) | (0.030) |
: | 0.021 | 0.065 | 0.056 | 0.036 | 0.059 |
(standard error) | (0.024) | (0.027) | (0.027) | (0.027) | (0.028) |
: | 0.058 | 0.057 | 0.067 | 0.044 | 0.093 |
(standard error) | (0.017) | (0.022) | (0.023) | (0.020) | (0.031) |
: | 0.011 | 0.015 | 0.051 | 0.024 | 0.064 |
(standard error) | (0.023) | (0.028) | (0.029) | (0.024) | (0.028) |
: | -0.003 | -0.007 | 0.038 | -0.019 | 0.070 |
(standard error) | (0.025) | (0.026) | (0.022) | (0.022) | (0.028) |
: | 0.002 | 0.023 | 0.008 | 0.002 | 0.032 |
(standard error) | (0.016) | (0.017) | (0.027) | (0.018) | (0.030) |
: | 0.149 | 0.221 | 0.337 | 0.156 | 0.403 |
(standard error) | (0.055) | (0.060) | (0.069) | (0.069) | (0.079) |
Measure & Vintage: | ("Advance") | (Latest) | (Latest) |
---|---|---|---|
: | 0.142 | 0.214 | 0.325 |
: (standard error) | (0.060) | (0.068) | (0.073) |
Measure & Vintage: | ("Advance") | (Latest) | (Latest) |
---|---|---|---|
: | 0.411 | 0.454 | 0.600 |
: (standard error) | (0.194) | (0.182) | (0.169) |
Measure: | , "Advance": | , "Advance": | , Latest: | , Latest: | , Latest: | , Latest: | p-val., equal s |
---|---|---|---|---|---|---|---|
k=1 | 0.20 | -0.50 | 0.31 | -0.61 | 0.23 | -0.79 | 0.10 |
k=1 (standard error) | (0.26) | (0.13) | (0.26) | (0.13) | (0.29) | (0.10) | |
k=2 | 0.42 | -0.44 | 0.48 | -0.53 | 0.43 | -0.69 | 0.13 |
k=2 (standard error) | (0.26) | (0.12) | (0.31) | (0.12) | (0.33) | (0.13) | |
k=3 | 0.58 | -0.38 | 0.60 | -0.40 | 0.68 | -0.65 | 0.00 |
k=3 (standard error) | (0.30) | (0.12) | (0.36) | (0.15) | (0.37) | (0.15) | |
k=4 | 0.62 | -0.23 | 0.57 | -0.28 | 0.70 | -0.50 | 0.01 |
k=4 (standard error) | (0.32) | (0.15) | (0.39) | (0.17) | (0.40) | (0.17) | |
k=5 | 0.59 | -0.19 | 0.67 | -0.29 | 0.75 | -0.41 | 0.40 |
k=5 (standard error) | (0.35) | (0.14) | (0.38) | (0.14) | (0.44) | (0.19) | |
k=6 | 0.72 | -0.27 | 0.76 | -0.32 | 0.92 | -0.39 | 0.54 |
k=6 (standard error) | (0.35) | (0.10) | (0.38) | (0.13) | (0.41) | (0.16) | |
k=7 | 0.73 | -0.19 | 0.81 | -0.20 | 0.96 | -0.39 | 0.14 |
k=7 (standard error) | (0.35) | (0.10) | (0.36) | (0.13) | (0.38) | (0.15) | |
k=8 | 0.66 | -0.10 | 0.72 | -0.15 | 0.94 | -0.27 | 0.27 |
k=8 (standard error) | (0.34) | (0.13) | (0.36) | (0.14) | (0.37) | (0.15) |