Keywords: Inflation, core prices.
Abstract:
This paper examines a number of alternative PCE price inflation measures including overall PCE inflation, PCE inflation excluding food and energy, trimmed mean PCE inflation, component-smoothed inflation, variance-weighted inflation, inflation with weights based on disaggregated regressions, and survey measures of inflation expectations. When averaging across a handful of specifications based on the primary uses of a core inflation measure three conclusions arise: 1. Inflation rates for nearly all the measures best track ex-post trend inflation or predict future overall inflation when they are averaged over a considerable number of months. Overall PCE price inflation should be averaged over 18 months or longer. A shorter averaging period is appropriate for core measures, often on the order of 12 months. 2. Even after appropriately averaging each index, core inflation indexes generally perform better than overall inflation. 3. Exclusion indexes, such as PCE excluding food and energy, perform slightly worse than many other possible core inflation measures; trimmed mean PCE, or a variance-weighted index, may be better choice for a summary inflation measure
Just as there are multiple uses of core inflation measures, there are multiple ways to construct one. The most popular US core inflation measure, consumer price inflation excluding food and energy, is an exclusion index-it excludes some items and takes the weighted average inflation rate of the remaining items. Alternative approaches to constructing core inflation abound. These include, but are not limited to, trimmed means, persistence weighting, variance weighting, component smoothing, exponential smoothing, and dynamic factors.
From an empirical perspective these various core inflation measures are useful only if they provide something that cannot be found in the overall inflation measure, for example, if the core measure is a better predictor of future overall inflation, or if the core measure better matches in real time an ex-post measure of current trend overall inflation. If the core measure cannot perform these tasks better than overall inflation over a significant period of time then there is little reason to use it.
This note examines a number of measures of core inflation to provide guidance to the media, policymakers, and researchers about which measures may be the most useful to follow. In many ways this note is an expansion of Rich and Steindel (2007) and other examinations of US core inflation measures.3 Rich and Steindel selected a small group of the most common exclusion and trimmed mean inflation indexes and compared the ability of these measures to track trend and predict future overall inflation using quarterly data. They concluded that no one index is clearly preferred over the other indexes. This note builds on Rich and Steindel's results, and results from other core inflation comparisons, by examining a larger set of core inflation measures over a wider span of sampling intervals and time periods. Combining results across many different specifications leads to three main conclusions:
Since at least the 1970s, the overall consumer price index published by the Bureau of Labor Statistic in the US has been recognized to be susceptible to swings caused by large movements in a few items. Up and down swings in the prices of a handful of items leads to substantial short-term volatility in the inflation index. Over the past few decades a large number of different core inflation measures have been advanced to reduce this transitory volatility while still maintaining the signal of longer-run inflation movements.
For the most part, these core measures of inflation are rule-based recombinations of the price changes observed in the overall price index and serve as a tool for communicating the stance of current or future inflation in real-time. This paper will follow that convention and define a core inflation measure as a rule-based recombination of the data used to construct the PCE price index that can be followed each month as the data is released.4 Thus a core inflation measure is not the outcome of a model which includes information not in the primary consumer price data, such as asset prices, wages, exchange rate changes, estimates of the output gap, or the judgmental forecasts of experts. Inclusion of information from those items may help provide a better forecast of inflation, but, by their very nature and complexity, they would be less useful in communication. As a result, a core inflation measure is not substitute for a fully-specified forecast of inflation, though it is sometimes used as one informally.
As noted above, the most common core inflation measures are exclusion indexes. They are constructed by identifying a handful of items which are considered to be the cause of the excess volatility and then building a new price index which excludes those same items throughout history. The most popular exclusion index is consumer price inflation excluding food and energy, though official agencies in the United States construct many additional exclusion indexes, and other countries sometimes use different exclusion indexes.5
The most common core inflation measures after exclusion indexes are central tendency statistical measures such as the trimmed mean and weighted median. These indexes were developed by Bryan and Pike (1991) and Bryan and Cecchetti (1994) for the US CPI and later by Dolmas (2005) for the US PCE price index. These indexes are based on the idea that given the structure of the price changes, large changes in a few items can cause excess movement in the average price level. As a result, the items with the largest positive and negative prices change in the month are removed. These indexes differ from exclusion indexes in that the items removed can change each month, whereas the same items are removed every month in exclusion indexes. Following the recommendations of Cecchetti and Bryan, the Federal Reserve Bank of Cleveland each month removes the 16 percent of consumption bundle with the largest price increases and the 16 percent of the consumption bundle with the largest price decreases (or smallest price increases) when creating the trimmed mean price index for the US CPI. Different trim points are used by the Federal Reserve Bank of Dallas when creating the trimmed mean PCE index. In contrast, the weighted median inflation rate removes all except the single item at the center of the distribution of price changes.
Several additional methods of creating core inflation have been suggested. Some of these are based on re-weighting items in the index in a less discrete fashion than the exclusion and trim measures, which simply include or exclude an item. For example, a variance-weighted inflation index lowers the weight placed on highly volatile items and raises the weights placed on low volatility ones. A regression-based price index uses component weights determined by a regression for predicting overall inflation from the disaggregated components. The cost-of-nominal distortions price index (CONDI) places more weight on items which tend to change price less frequently.
Some other methods do not reweight the subcomponents of the inflation index but merely smooth the inflation rate over time following a formula. Examples following this technique that are examined in this roundup include Cogley's exponentially-smoothed inflation (Cogley 2002), and the prediction of Stock and Watson's UC-SV model (Stock and Watson 2008), both of which are weighted averages of past overall inflation. A component smoothed index smoothes the price changes of the individual items over time before the individual items are aggregated to find a top line inflation index. Highly volatile items are smoothed more than low volatility items.
Table 1 lists the various inflation measures examined in this note with a very brief description of each. A more detailed description of some of the measures is contained in the appendix. Silver (2007) and Wynne (2008) provide additional background on different types of core inflation measures.
This list of core measures examined in this note is large, but it is not comprehensive. Measures not listed on table 1 or examined in this note include indexes based on persistence weights, which have not fared well in other comparisons Clark 2001; Cutler 2001; Bilke and Stracca 2007; Smith 2007); and measures which are likely to be too complex for use in communication with the general public such as dynamic factor indexes (Cecchetti 1997; Kapetanios 2004; Reis and Watson 2007; Giannone and Matheson 2007), common trends (Bagliano and Morana 2003), forecasts from multi-equation models of overall inflation (Folkertsma and Hubrich 2001; Domenech and Gomez 2006; Velde 2006) and other techniques (Arrazola and de Hevia 2002).
Also included in the round-up of measures in this note are values from inflation expectations surveys. Specifically, inflation expectations from the Reuters/University of Michigan Survey of Consumers are examined. Other inflation expectations surveys lack the history or frequency needed for inclusion here. Unlike the other measures examined in this note, inflation expectations are not re-combinations of actual inflation data and hence are not truly core inflation measures, but rather they are averages of what individuals expect inflation to be. Nonetheless, the growing prominence of inflation expectations as an anchor for the inflation process suggests that including expectations in this round-up is merited. Additional support for including expectations comes from Ang, Bekaert, and Wei's (2007) finding that survey measures of inflation expectations out-perform some conventional measures of core inflation.
This note focuses on inflation in the United States personal consumption expenditures price index (PCE prices) published by the Department of Commerce's Bureau of Economic Analysis.6 Much of the data used in constructing the PCE price index comes directly from to the more well-known Consumer Price Index (CPI) published by the Department of Labor's Bureau of Labor Statistics, and the two price indexes generally follow each other, so the main results of this paper would likely apply to CPI inflation measures as well.
In constructing the different core inflation measures used in this paper, out-of-sample methods are used whenever possible.7
Different studies have proposed a large number of desirable criteria for a core inflation measures, but this study confines the evaluation criteria to three varieties of tests of tracking trend inflation or predicting future inflation. These tests derive from the most common uses for a core inflation measure, and therefore are the most important criteria on which to judge a core inflation measure.8
1. The first test flows from the use of core inflation to measure the current inflation rate purged of transitory factors. In this context underlying inflation (and hence the core inflation measure) should closely track an ex-post estimate of trend inflation. Such ex-post estimates of trend inflation are generally constructed using a band-pass filter, a Hodrick-Prescott filter, or a 36-month centered moving average of headline inflation.9
2. The second test derives from the use of core inflation measures to informally suggest where overall inflation will move to in the coming period. In this context headline inflation is considered to be volatile and mean-reverting to underlying inflation. As a result underlying inflation (and hence the core inflation measure) should be a good predictor of future overall inflation.
3. The third test derives from the Phillips curve often used in more formal inflation forecasting. Like the second test this one also uses core inflation to predict overall future inflation, but it adds variables (like the unemployment rate gap and import prices) to explicitly account for factors which may cause underlying inflation to change over time.
Within each of these tests the fit of the core measure is examined over different time periods and various sampling intervals. These sampling intervals range from the rate of change in the core measure over one month up to its change over 24 months.
Results for each of these tests are shown for the most common core inflation measures before the tests are averaged and a wider array of core measures are examined.
The ability of the core measures to track trend inflation can be quantified by the root mean square error of the core inflation measure from trend inflation. It is unclear, however, how to treat a long-standing bias in the core measure when constructing goodness-of-fit measures, such as when a core measure persistently has an inflation rate 1 percentage point lower than overall inflation. Using the root mean square error to measure fit would always penalize for bias even though relatively constant and predictable bias between the core inflation indicator and overall inflation would only require the user to add an adjustment factor. On the other hand, using the standard deviation of the difference between the core measure and trend inflation would correct for the average bias over the time period, even if that bias were not predictable at the time.
This note takes an intermediate approach by assuming that users of the core inflation measure can adjust for bias by looking at the average difference between overall inflation and core inflation over the preceding 10 years. Thus in this note the ability of the core measure to track a measure of trend inflation is given by the root mean square error adjusted for expected bias:
Figure 1 displays the standard error of some prominent core inflation indexes using a centered 36-month moving average of headline PCE inflation as trend inflation, Figure 2 shows the same data, but uses a Baxter-King band pass filter to measure trend inflation.12 The vertical axes in the figures use a logarithmic scale to emphasize the percentage, rather than the absolute, differences in fit across the core inflation indexes. The horizontal axis in each panel lists the number of months used in constructing the core measure's inflation rate (i.e., the sampling interval, s in formula 1). The two panels of each figure cover the time periods starting in 1980 and in 1991, with the core inflation measures being constructed through the summer of 2009, which is the most recent data available while still allowing enough additional data points to construct relatively reliable benchmarks for the comparison.13
Figures 1 and 2 show that of these common measures, the trimmed mean (the dotted line) performs generally better than the other measures at tracking trend inflation over most intervals, but the preference for that measure diminishes as the interval lengthens out. In the more recent time period (the bottom panel) there is little difference between the trimmed mean and prices excluding food and energy once the interval is around 9 months or longer. Overall inflation also performs quite well with the Baxter-King filter in the longer time period when the interval is around twelve months, but it performs less well at short intervals and when the 36-month moving average is used as to measure trend inflation. The performance of survey measures of inflation expectations, unlike the indexes constructed actual price data, do not improve much with intervals over a few months. Finally, the best interval for almost all of the measures tends to be longer in the more recent period, starting in 1991, than in the full sample, starting in 1980. This is likely a result of inflation in the past 20 years having been low with very little long-run trend.
The ability of the core inflation measure to predict future inflation is tested in the same way as the ability of the measures to track trend inflation. The only difference is the benchmark is now future inflation rather than trend inflation.14 Two different measures of future inflation are tested: overall inflation during the next twelve months, and overall inflation in the second half of the next 24 months (i.e. inflation 12 to 24 months ahead). Forecast horizons shorter than 12-months are not examined as those horizons would promote core measures that are good at predicting the transitory movements that core inflation measures are intended to filter out.
Results for the core measures ability to predict future inflation at these two horizons are shown in the two panels of figure 3 and of figure 4. They are similar to the results for the core measures ability to track trend inflation, though overall inflation performs somewhat worse relative to the other core measures, particularly in the longer period. Michigan median inflation expectations for the next twelve months perform fairly well at predicting future inflation in the longer period, but not so well in the more recent period. On the other hand Michigan median inflation expectations for the next five to ten years predict inflation very well in the more recent period, but data is not available far enough back to evaluate it over the longer period. As before overall PCE tends to require longer sampling intervals than the alternative inflation measures.
A more general test of the forecasting ability of alternative core inflation measures runs a regression which includes some variables to explicitly account for movements in underlying inflation:
This test broadens the framework used in the second test by allowing to take on values different from 1, and allowing for the additional right-hand side variables, X. These changes create a simplified version of the expectations-augmented Phillips curve often used in inflation forecasting, with expected inflation equal to . For this note the X variables added to the regression are the unemployment rate gap and share-weighted non-fuel import and oil price changes, as well as their lagged prices changes15:
A series of regressions using rolling ten-year windows, with starting and ending dates incremented one month at a time, are run to create out-of-sample forecasts for inflation over the next twelve-months. The first time period is shortened to begin at the start of 1985 to allow for enough data to obtain coefficient estimates.16
Results for this out-of-sample Phillips curve test are shown in figure 5. The errors in this figure are noticeably smaller than the comparable errors in the previous test, shown on figure 3, which do not include the additional right hand side Phillips curve variables. Since out-of-sample forecasts are used here, the reduction in errors is not simply a result of adding variables to improve regression fit. However, they are based on future values of the unemployment rate, oil prices, and import prices. These variables, while they cause changes in overall inflation, would not be available in real-time to a forecaster.
Figure 5 suggests that longer sampling intervals for the core measures are strongly preferred in Phillips curves. A long sampling interval is consistent with the usual practice of including multiple lags of inflation among the exogenous variables in a Phillips curve. Among the different measures shown in this figure the overall PCE inflation, PCE inflation excluding food and energy, and the Dallas Fed trimmed mean all perform roughly similarly. Michigan median 12-month inflation expectations, however, perform poorly over the longer sampling intervals.
A next step is to combine the different tests and time periods to find a core measure at an interval which performs generally well over the many uses of a core inflation measure. While these combined results are dependent on the weights given to the various benchmarks and time periods they should still give a rough idea of which core measures work well and which do not.
The combination is created by taking geometric average of the error of the inflation measure under 15 different specifications, where the errors were defined earlier. The specifications comprise the 5 benchmarks from figures 1 through 5 (a centered 36-month moving average of overall inflation, a Baxter-King band-pass filtered version of overall inflation, overall inflation over the next 12 months, overall inflation in the 12 months following the next 12 months, and a Phillips curve for overall inflation in the next 12 months), and three time periods (January 1980 to September 200917, March 1991 to September 2009, and January 2000 to September 2009).
The choice of time periods results in performance since 2000 being weighted more heavily than earlier performance. Such a weighting is appropriate if there have been structural changes in inflation data. One reason to think that there have been such changes is the continual updating of the methodology and structure for the CPI data which underlie most of the PCE prices series. For example, the third time period was chosen to correspond with the period following the major CPI revision in 1999.
Figures 6 through 9 display the average performance over the 15 specifications for the core inflation measures listed in table 1. As before, a logarithmic vertical axis is used to emphasis differences across the measures. For comparison across the types of core measures figure 10 takes some of the best performing measures from each type and displays them on a non-logarithmic axis.
Figure 6 displays the results for overall PCE inflation and various exclusion methods.18 Overall PCE performs by far the worst of the various measures at short sampling intervals. The performance of overall PCE inflation improves significantly as it is averaged over periods up to about 18 months. At that point its standard error is still above that of all of the exclusion indexes; however, the difference is relatively small. The exclusion indexes, except for including only services, all perform rather similarly and the performance of all of them improves until the interval reaches around 9 months.
The trimming-based measures, figure 7, do much better than exclusion indexes at short sampling intervals, but still improve with averaging the inflation rate up to about 7 or 8 months. All the trimmed measures perform similarly, with a very slight preference for the Dallas Fed trimmed mean.
Also on figure 7 are the Michigan 12-month-ahead inflation expectations measures. Other expectations measures do not go far enough back to be included in this round-up. These measures perform well at the one-month interval, but do not improve much with longer sampling intervals. By the nine-month interval the expectations measures perform notably worse than the trimmed measures. Median inflation expectations perform better than mean expectations.
Figure 8 shows the results from methods that recombine the current inflation rates of the various components using other weighting schemes. In variance weighting, components with more variable relative prices or inflation rates receive a correspondingly lower weight. In many ways this idea is a more refined version of the trim- and exclusion- based measures depending on how the variance is calculated. Here the variance is calculated from relatively recent price changes, so the variance of an item is allowed to change over time. This makes the variance-weighted indexes more similar to the trim-based measures than to exclusion indexes, and many of the variance weighted measures perform similarly to the trimmed-based measures. Results for two of these variance-weighted measures tested are shown in figure 8. Using the variance of price changes to set the item weight tends to work slightly better than using the variance of the relative price level or the variance of the change in the price change, but the results are not dramatically different. (For space reasons, only one of these additional permutations is shown.) Also on figure 8 are the results from Eusepi, et al's cost of nominal distortions price index (CONDI), which essentially gives greater weight to items who frequency of price change is lower (among other factors). This index performs similarly many of the exclusion-based price indexes. The final type of measures shown in figure 8 use component weights from a regression of the disaggregated components on overall PCE inflation. These regression-based models do not to fit terribly well, and lengthening the interval had little effect on their performance.
Among the techniques that are based on smoothing inflation over time, shown on figure 9, the results from a couple low-order ARIMAs using rolling 15-year windows perform well. The other smoothing techniques also perform decently, but not as good as the ARIMAs. Component smoothed inflation and exponentially smoothed inflation performed slightly worse than the UC-SV measure. It is notable, though not surprising, that the smoothing methods, other than the ARIMA(2,0,1), do not require a long interval. Presumably this is because they are already averaged over time.
Figure 10 displays some of the best performing measures in each category, along with overall PCE and an average of the three most common PCE inflation measures (overall PCE, prices excluding food and energy, the Dallas Fed trimmed mean). As noted above the best interval for overall PCE inflation seems to be at least 18 months, and the average error keeps falling slightly through 20 months.
On average across all the intervals, excluding food and energy performs somewhat worse than the potential other indexes, and overall PCE performs considerably worse than any of the measures including excluding food and energy. This suggests that core inflation measures are useful and there may be better measures of core inflation than excluding food and energy.
One the other hand, when each measure is examined at its best interval the difference across measures is much less pronounced. Overall PCE inflation still performs the worst, then excluding food and energy performs next worst, and the other measures perform better than these two measures, but the difference is small. The error from using excluding food and energy at its best interval (0.79 at an interval of 13 months) is only slightly higher than the error when using the best measure, the Dallas Fed trimmed mean at its best interval (0.77 at an interval of 15 months), and only slightly lower than using overall PCE inflation at its best interval (0.82 at an interval of 20 months). Assessing whether the differences across measures are statistically significant in a formal test would be difficult since the 15 different specifications are not independent.19
Also as can be seen on figure 10 combining inflation measures only leads to a small improvement. The averaging of overall inflation, excluding food and energy, and the trimmed mean performs only a couple basis point better at most intervals than simply using the trimmed mean. The small improvement when averaging suggests that the measures tend to make similar errors when tracking underlying inflation or predicting future inflation.
As noted in the introduction, a core measure is only useful if it can perform the goals of tracking current trend inflation or predicting future overall inflation better than overall inflation itself over a significant period of time. The results above show that this may be the case, but they could merely be the result of the aggregation placing a large amount of weight on the post-2000 time period. To demonstrate that this is not the case, figures 11 through 14 take most of the measures from figure 10 and display how they fit a geometric average of the 5 benchmarks over rolling 10-year windows using a interval of 3 months (figure 11), 6 months (figure 12), 12 months (figure 13), or each index's best interval from the previous section (figure 14). One substitution is made from figure 10: the Dallas Fed trimmed mean is replaced by the average trimmed mean because the Dallas Fed does not publish their measure prior to 1977. Over the period where both trimmed mean measures are available they perform quite similarly.
Over this longer sample, the trimmed mean, the variance weighted, and the average of overall PCE, PCE excluding food and energy, and the trimmed mean all perform similarly regardless of the interval used. These three measures generally perform notably better than excluding food and energy or overall PCE prices at sampling intervals of 6 months or less. At a twelve month interval there some periods where overall PCE inflation performs the best, while at each index's best interval the differences across measures is relatively small and swamped by the difference in performance of all the measures across time.
There is no universally "best" core inflation measures, just as prior studies have found. Nonetheless, a number of conclusions can still be drawn from the results here:
First, short sampling intervals should be avoided. Almost all the inflation measures, except survey-based measures and already smoothed indexes, must be averaged over a significant time interval to best track underlying inflation or predict future inflation. The best interval depends on the measure and the time period, but at present for overall PCE inflation sampling intervals shorter than 18 months should be avoided. For other measures of PCE prices inflation the best sampling intervals in the tests here were often around 12 months.
Second, at sampling intervals of 12 months or less since the mid-1980s overall PCE inflation performs worse at predicting future inflation or matching ex-post measures of trend inflation than many other core measures including excluding food and energy, variance weighted inflation, or the Dallas Fed trimmed mean. Even when each measure is evaluated at its best interval, overall PCE inflation still performs slightly worse than the majority of potential core measures, though the difference may not be statistically or economically significant.
Third, trimmed mean or variance weighted price indexes perform slightly better than exclusion indexes such as PCE prices excluding food and energy, particularly at short intervals. Though, again, the difference between measures when each is evaluated at its best interval may not be statistically or economically significant. However, it does suggests that if we desire a single real-time measure of core inflation there may be better choices than simply excluding food and energy.
This appendix gives an overview of some of the core inflation measures examined in this note.
Exclusion Indexes
Inflation indexes resulting from removing certain items throughout history are referred to as exclusion indexes. To account for the removed items, the weights of the items remaining in the index are scaled up and their relative weights are unaffected. Exclusion indexes are the most popular measures of core inflation for their simplicity and transparency: They are easily constructible in real time, communicable to the public, and replicable with published inflation data. Clark (2001) prefers CPI prices excluding energy in his overview of core inflation measures.
One variant of deciding what items should be excluded is to include only "sticky" prices (Aoki 2001). Following a similar procedure to that used by Bryan and Meyer (2010) for the US CPI, a sticky price index for this note is created by examining a list of the frequency of price changes from Bils and Klenow (2004) and excluding the 30 percent of the consumption basket with the most frequent price changes.
Statistical Central Tendency Measures
When the cross-sectional distribution of price change has fat tails, the mean may be very sensitive to outliers. As a result, Bryan and Pike (1991) and Bryan and Cecchetti (1994) advocated the use of weighted medians and trimmed means to measure underlying inflation. These indexes exclude different items each month depending on their place in the cross-sectional distribution of price changes. The monthly price change for the index is determined by ordering the items by their price change over the month, from most negative to most positive, and then removing the components in the tails of the distribution up to a certain share of the consumption basket. The price change is then computed from the remaining items. The basic formula for these types of measures is:
where is the inflation rate of the i item in the ordered cross sectional inflation distribution, is trim on the lower side, is the trim on the upper side, is the item at the p-point in the cross sectional inflation distribution at time t, is the weight of item i in the consumption basket at time t, and is the inflation rate of item i in month t. Twelve-month changes are constructed as the cumulation of the one-month changes.
Dolmas (2005) created a version of the trimmed mean for the PCE price index, which is published monthly by the Dallas Federal Reserve Bank. Dolmas' version is not symmetric; it uses =.194, =.254. Thus, the measure excludes 44.8 percent of the consumption basket. In the weighted median ==.5, so only the inflation rate of the item at the very middle of the cross-sectional distribution is used. Smith (2004) finds this index performs quite well.
The trimmed mean requires the designer to determine cut-off points. For the Dallas Fed trimmed mean the cut-off points were designed so that the measure would best fit benchmarks over the years 1977 to 2004. To avoid possible contamination from the cut-off points being determined ex-post, an alternative version that averages across all possible symmetric trims from no trim (==0) to the largest possible trim (==.5) was constructed for this paper. This average trimmed mean takes a more agnostic view of the appropriate cut-off points. In contrast to the normal trimmed mean or weighted median, this formulation does not completely ignore the information in the tails, but instead down-weights those observations. The formula for this type of measure is:
The average trimmed mean measure performs quite similarly to the normal trimmed mean produced by the Dallas Fed except for being somewhat more biased.20The trimmed measures used here all trim based on the one-month price changes. However, there is no reason that longer sampling intervals, such as price changes over three or twelve month periods, could not be used to determine which items to trim.
Trimmed mean of volatile components
The trimmed mean of volatile components (TMVC) proposed in Pedersen (2009) combines aspects of the trimmed mean approach and the exclusion index approach. The TMVC removes a fixed share of the consumption basket each month based on the volatility of the inflation rate of the item. Like the other central tendency measures, items are ranked according to certain criteria and a fixed percentage of the consumption basket is removed, the items removed each month may vary, and information on inflation from the current month is used in constructing the ranking. Like the exclusion index approach, the history of the items does matter. Pedersen ranks CPI components by the variance of their monthly inflation rates over the past 6 to 24 months and then removes the most volatile components. He finds the best indexes for the placecountry-regionUnited States exclude between 3 and 47 percent of the consumption basket. The TMVC specification used in this paper removes the most volatile 25 percent of the consumption basket based on monthly inflation over the prior 24 months (including the most recent observation). The same 205-component disaggregation of overall PCE as used in the average trimmed mean is used to construct the TMVC.
Variance-weighted (Neo-Edgeworth Indexes)
Instead of completely removing volatile components, a neo-Edgeworth index down-weights the volatile components (Diewert 1995). Often each item is weighted by the inverse of its variance. In fact, if each component had equal shares in the consumption basket, there were no long-run relative price changes across the components, and shocks to a component's price change were not correlated across components or across time, then weighting each item by the inverse of that item's variance would be optimal. However, these restrictions are not observed in practice.
Variance-weighted price indexes may give a lot of weight to items that have a low variance but make up a small share of the consumption bundle. To reduce this problem, rather than strictly using the inverse of the monthly variance of the item's price change to weight the item, the inverse of 1 plus the monthly variance of the item's price change is multiplied by the item's nominal share. These weights are normalized to sum to one across all items. The variance of the monthly percentage change in price for each component is taken over backward-looking 60-month windows.
The same 205-component disaggregation of PCE prices used for constructing a number of the other core measures was used to construct the variance-weighted indexes.
A large number of additional formulations of variance-weighted inflation were also tested. The formulations differed by what variance was constructed and by the weight given to past observations in constructing that variance. First, beyond just taking the variance of the one-month price change of the component, three additional variances were constructed: 1. The variance of the one-month price change of the component relative to the one-month price change of overall PCE, 2. The variance of the component price level relative to the price level of overall PCE, and 3. The variance of the change in the monthly inflation rate (i.e. the second moment of the price level). As before these were multiplied by the items nominal share.
Further formulations were constructed by changing the treatment of older observations for each of these three variance measures. Older observations were down-weighting when constructing the variance in geometrically-declining fashion. Monthly discount rates ranging from 0.05 percent (very slowly declining weights on past observations) to 50 percent (very little weight on past observations) were evaluated.
In nearly all cases the different permutations had only a small effect on the results, leaving the majority of variance weighted measures to perform similarly.
Smoothed versions of headline inflation
The indexes above all rely on looking at the disaggregated components. On the other hand much work has been done simply focusing on ways of pulling a trend out of the headline inflation and eschewing any information in the components.
Cogley (2002) suggests exponentially smoothing headline inflation. This method sets the core inflation rate to a long moving average of past inflation: . This can also be written as core inflation equals weighted average of current inflation and lagged core inflation, , or as an IMA(1,1) process, where . Cogley suggests setting equal to .875 for quarterly CPI inflation (i.e. a 12.5 percent quarterly discount rate).
Stock and Watson (2008) suggest what amounts to a generalization of Cogley's procedure that allows the moving average term in the IMA(1,1) to vary over time following an approximate logarithmic random walk.21 They find that their unobserved components-stochastic volatility (UC-SV) model fits quarterly inflation quite well, and it out-performs traditional Phillips curves since 1985.
They suggest that the projection from an IMA(1,1) using rolling 10-year windows performs only slightly worse than their UC-SV model. Similarly, for the monthly data used in this paper rolling 15-year windows of many ARIMAs with at least one MA term, at least one autoregressive or degree of integration, but no more than 1 degree of integration, perform quite well.
Component Smoothed Inflation
Rather than throwing away the underlying component data and simply smoothing headline inflation, Gillitzer and Simon (2006) suggest smoothing individual components by a degree appropriate for that item and then aggregating these smoothed components. This concept has a couple of advantages: First, the concept is easily grasped by the lay person: certain items are volatile, therefore these items need to be smoothed more than other items. Second, each item receives its normal weight in the consumption basket. This makes the index relatively immune to both the populist complaint that "the central bank throws out the items which are increasing when looking at inflation", and problems associated with diverging trend inflation rates across components, which can cause the index to be biased one way or the other.
Component smoothing has not been widely explored, and there are many potential alternative methods for smoothing the components that might improve the technique but have not yet been examined. These possible methods include using a simple average of the component over the prior x-months, applying Stock-Watson's UC-SV model to each item, or allowing changes in the degree of smoothing to be correlated across items. The technique used for smoothing in this paper differs from Gillitzer and Simon. Here the degree of smoothing is determined for each item by a single exponential smoother calibrated to predict the item's twelve-month-ahead inflation rate over the previous ten years. The smoothing component varies across the items from .01 to .16 with a median value of .07, which implies the median item is smoothed with a mean lag of 14 months.
The level of component disaggregation is likely to be important for the fit of a component smoothed index. This uses the same 205-component disaggregation of PCE to construct component smoothed inflation that was used to construct the weighted median, the TMVC, and variance-weighted inflation.
Regression weights
Smith (2007) suggests that the forecast from a regression of headline inflation on the lagged disaggregated components of inflation provides a good measure of core inflation. Specifically, the basic regression form is:
, where is headline inflation rate over the past twelve months, and is the inflation rate of the i component over the twelve months prior to the past twelve months. Using the coefficients from this regression a forecast for headline inflation over the next twelve months, is constructed and considered the estimate of core inflation at time t.Following Smith's basic methodology 48 regression-based variants were constructed. Most of these results are not shown for space reasons. The variants differed according to:
1. The degree of disaggregation: Both a 17-component disaggregation of PCE prices based on NIPA table 2.3.4 and 51-component disaggregation suggested by Smith were used. The 17-component disaggregation performed somewhat better than the 51-item disaggregation.
2. Whether the inflation rates of the components were multiplied by their share in the consumption basket in the regression. This allows the contribution of an item's inflation to headline inflation to evolve as expenditure patterns change over time. This multiplication did improve the results slightly.
3. Whether the observations were equally weighted (with a start date of 1972) or instead older observations were down-weighted in a geometrically declining fashion (with a start date of 1959). One fixed weighted and eleven geometrically declining permutations, with discount rates of 0.05 percent to 5 percent per month, were constructed. There was no clear preference for either discounting or fixed weighting.
Additional variants, such as restricting the regression constant to zero, the sum of the coefficients to 1, and the coefficients to be non-negative, were not examined.