The Federal Reserve Board eagle logo links to home page

Skip to: [Printable Version (PDF)] [Bibliography] [Footnotes]
Finance and Economics Discussion Series: 2009-41 Screen Reader version

Education's Role in China's Structural Transformation*


Soohyung Lee
University of Maryland and MPRC
Benjamin A. Malin
Federal Reserve Board


September 2, 2009


Keywords: Returns to education, structural transformation, China, labor reallocation, economic growth

Abstract:

We explore education's role in improving the allocation of labor between China's agricultural and nonagricultural sectors and measure the portion of China's recent growth attributable to this channel. Building from micro-level estimates, we find that education's impact on labor reallocation between sectors accounts for about 9 percent of Chinese growth, whereas its impact on within-sector human capital growth explains only 2 percent. Our findings suggest that, when frictions cause large productivity gaps across sectors and returns to education are greater in the higher-productivity sectors, education policy may be a useful tool for increasing efficiency.



1 Introduction

A burgeoning literature in development economics suggests that barriers to the efficient allocation of inputs are important for explaining why some countries have much lower income per capita than others. Because such barriers - whether to technological adoption (Parente & Prescott, 1994), labor force mobility (Hayashi & Prescott, 2008), or the flow of physical and human capital to their most productive uses (Vollrath, 2009) - depress income levels, it follows that reductions in these barriers can lead to periods of rapid economic growth. Such growth may also be accompanied by a structural transformation of the economy as the employment and output shares of low-productivity sectors decrease, and factors of production move into high-productivity sectors.

The recent growth experience of China appears to be consistent with the effects of reduced barriers. Since 1978, when the Open Door policy reforms began, output per worker has increased at a very rapid pace.1 Moreover, this growth has been accompanied by a declining employment share for the agricultural sector from 71 percent in 1978 to 47 percent in 2004 (China Statistical Yearbooks), a phenomenon called labor reallocation. Given that large productivity differences exist between the agricultural and nonagricultural sectors, estimates of the fraction of growth in aggregate output per worker accounted for by labor reallocation are quite sizable, ranging between 25 and 33 percent (Brandt, Hsieh and Zhu (2008) and Dekle and Vandenbroucke (2006), respectively).

Although many factors undoubtedly contributed to labor reallocation, in this paper we use detailed micro-level data to quantify the role of one particular driving force: increased educational attainment. Our focus on education stems from the ability of educated workers to overcome the labor market restrictions present in China (e.g., migration regulations based on the hukou system). Our hypothesis is that increased education effectively reduced the inefficiencies caused by these restrictions, thus spurring growth. We undertake a growth accounting exercise to measure education's impact on growth through enabling labor reallocation between sectors. To provide a metric for assessing the importance of this channel, we also measure education's contribution through raising human capital within sectors.2

We utilize a model with finite-lived individuals in which income is a function of an individual's observable (e.g, gender) and unobservable (e.g, sector-specific ability) characteristics. Following Mincer (1974), we assume education increases the human capital of workers, possibly at a different rate in each sector. In addition, education affects the cost of working in the nonagricultural sector relative to the agricultural sector. Individuals choose their education level and sector of employment to maximize the expected net present discounted value of their lifetime income. We estimate the model using simulated method of moments and multiple waves of the China Health and Nutrition Survey, a household-level survey containing data on the occupation, education, and income of all household members.

The estimated model provides three key pieces of information which we employ in our growth accounting exercise. These include sector-specific returns to education, the extent to which education increases an individual's probability of working in the nonagricultural sector, and the productivity gain associated with having a nonagricultural job. Note that the latter two effects of education depend upon an individual worker's characteristics and ability. For example, the marginal benefit of education in terms of lowering the cost of working in the nonagricultural sector can be negligible for a worker born in an urban area, where nonagricultural jobs are abundant, but can be substantial for a worker who must migrate to improve his chances of finding a nonagricultural job.

To quantify education's within-sector contribution to growth, we first combine the estimated sector-specific returns to education with measures of educational attainment to calculate the growth of human capital in each sector. The underlying assumption, following Young (2003), is that relative incomes of individuals within a sector reflect their relative marginal products. This assumption is consistent with the presence of the aforementioned labor market barriers if those barriers only impede workers' mobility across sectors but workers can move freely between competitive firms within each sector. We then weight each sector's human capital growth by the sector's share of aggregate output to compute the contribution of education to aggregate growth through increased human capital.

Turning to education's contribution to growth due to labor reallocation, we multiply our individual-specific measure of education's impact on the likelihood of working in the nonagricultural sector by the expected productivity differential between a nonagricultural and agricultural job. Note that the differential is predicted using our estimated income processes and individual characteristics. We then average this product over all educated workers to get a measure of the average productivity gain from reallocation due to education. Finally, we multiply this average by the change in the fraction of educated workers to get the contribution to aggregate growth.

Our main findings are that (a) education accounts for a large fraction of the productivity growth associated with labor reallocation and (b) education's contribution to growth through labor reallocation is at least as large as its contribution through raising human capital within sectors. For example, comparing our baseline results to the growth decomposition of Brandt, Hsieh and Zhu (2008), increased educational attainment accounts for over one-third of the growth attributable to labor reallocation in China from 1978 to 2004. Moreover, roughly 11 percent of the overall growth in output per worker is accounted for by increased education, with 9 percent coming through the labor reallocation channel and 2 percent being accounted for by education's role in increasing human capital. Our main qualitative findings are robust to a number of different robustness checks. Evidently, to fully understand education's impact on economic growth in China, it is important to consider education's role in facilitating labor reallocation.

These findings suggest that education policy may be a useful tool for achieving a more efficient allocation of labor in China. Although the Chinese government could reduce labor market barriers directly by eliminating internal migration restrictions, this may not be desirable given concerns about the ability of urban areas to accommodate massive labor inflows. On the other hand, because migration regulations are less restrictive for educated workers, policies that reduce the cost of education can facilitate a gradual reduction in the effective restrictiveness of the regulations, even if the regulations themselves do not change. Note that the Chinese government has worked to reduce the (private) cost of education over the years by introducing free compulsory education systems and school-building programs (Pepper 1996 and Hannum 1999).

Our paper is related to several strands of research in development and labor economics. Our emphasis on education as a driving force of structural transformation draws from the insights of Caselli and Coleman (2001). They build a model in which declining education costs allow an increasing proportion of the labor force to become educated and work in the nonagricultural sector and use their model to explain features of the U.S. structural transformation. As for China's more recent structural transformation, Brandt et al. (2008) and Dekle and Vandenbroucke (2006) have quantified the importance of labor reallocation for growth but have not focused on the role of education per se. All three studies take a "top-down" approach by calibrating structural models to aggregate data and then conducting counterfactual experiments to gauge the importance of various driving forces for reallocation and growth.3In this paper, we do not attempt to cover the same breadth of driving forces as the top-down approach, but instead take a "bottom-up" approach, which allows for a careful quantitative investigation of education's impact on labor reallocation built on micro-level estimates of its effects.

Our work is also motivated by studies of the importance of the efficient allocation of inputs for understanding both within-country growth and income differences across countries. For example, Hsieh and Klenow (forthcoming) use micro data on manufacturing establishments to calculate potential productivity gains in China and India that could result from an improved allocation of capital and labor.4 Looking at a broader set of countries, Restuccia, Yang and Zhu (2008) and Vollrath (2009) emphasize the depressing effect an inefficient sectoral allocation of inputs (i.e., too many inputs in the lower-productivity agricultural sector) has on aggregate output and total factor productivity. These empirical studies present general measures of the potential gain from better allocations but do not focus on providing measures of how potential policies might improve efficiency. Our finding that education helps individuals overcome labor market barriers has implications for policies aimed at facilitating labor reallocation, even for countries without explicit migration barriers like China. As long as returns to education are sufficiently higher in higher-productivity sectors,5 a government subsidy which reduces the cost of education can induce workers to increase their educational attainment and move into these sectors.

Finally, our work is closely related to micro-level empirical studies on the relationship between education and income. Our empirical model and specification draws from many papers including Mincer (1974) for modelling education's impact on earnings, Roy (1951) for determining sector choice given a worker's unobserved sector-specific ability, and Willis and Rosen (1979) and Heckman and Honore (1990) for specifying the choice of education as a function of a worker's unobserved ability. Our contributions to this literature include providing a new estimate of returns to education in China by controlling for a worker's choice of both education and sector, and ultimately linking these estimates with changes in the aggregate distribution of worker characteristics to better understand China's economic growth.

The rest of the paper proceeds as follows. In Section 2, we describe a two-sector growth accounting framework and establish which key variables we need to estimate from micro-level data to account for education's impact on growth. Section 3 describes the data and discusses why increased educational attainment was likely an important driving force of labor reallocation in China. In Section 4, we present our empirical model and report baseline estimates of the effects of education on reallocation and growth. Section 5 presents the results of various robustness checks and discusses how our approach relates to others in the growth accounting literature. Section 6 concludes.


2 Growth Accounting Framework

We begin by presenting a growth accounting framework for an economy with two sectors, agriculture  (a) and non-agriculture  (n), in which there are barriers to labor mobility between sectors. As a result, wages (assumed to equal marginal products of labor) are not necessarily equated across sectors. We will consider the contribution of increased educational attainment to growth in this economy. Education affects growth through increasing the human capital of workers - possibly at different rates in each sector - and also through lowering the cost of mobility, thus enabling workers to move into the higher-productivity sector.

We assume sectoral value production functions of the form

\displaystyle Y_{st} = p_{st} (A_{st} H_{st})^{\alpha_{s}} K_{st}^{1-\alpha_{s}},    

where  \alpha_{s} denotes the labor share in sector  s,  K_{st} denotes physical capital,  H_{st} denotes the amount of human-capital-augmented labor, and  A_{st} is a measure of total factor productivity. The inclusion of  p_{st}, the price of the sector  s good relative to the aggregate price level, means  Y_{st} denotes a value product rather than the quantity of goods produced. It also allows us to simply express aggregate output  Y_t as
\displaystyle Y_{t} = Y_{at} + Y_{nt}.    

We follow the growth accounting literature by decomposing aggregate labor productivity (i.e., output per worker) growth into within-sector productivity growth and growth resulting from reallocation of labor between sectors. Specifically, let  y_{at},  y_{nt}, and  y_{t} denote output per worker in agriculture, non-agriculture, and the aggregate, respectively, and  l_{at} and  l_{nt} denote agricultural and nonagricultural shares of total employment, so  l_{at} + l_{nt} = 1. Aggregate output per worker can then be expressed as the weighted average of output per worker in each sector

\displaystyle y_{t} = y_{at} l_{at} + y_{nt} l_{nt},    

and aggregate labor productivity growth is given by
\displaystyle d\ln y_{t} = \frac{y_{at}l_{at}}{y_{t}}d\ln y_{at} + \frac{y_{nt}l_{nt}}{y_{t}} d\ln y_{nt} + \frac{\left( y_{nt}-y_{at} \right) }{y_{t}} dl_{nt}. (1)

Equation (1) shows three channels through which increased education can account for growth in output per worker: (1) growth in agricultural labor productivity  d\ln y_{at}, (2) growth in nonagricultural labor productivity  d\ln y_{nt}, and (3) labor reallocation  d l_{nt}.

Let's first consider how to construct the within-sector productivity growth attributable to increased education. The production function can be rewritten in terms of output per worker and human capital per worker,  h_{st}, as

\displaystyle y_{st} = p_{st}^{\frac{1}{\alpha_s}} A_{st} h_{st} \left( \frac{K_{st}} {Y_{st}} \right) ^ {\frac{1-\alpha_{s}} {\alpha_{s}} }.    

Decomposing output per worker in terms of the capital-output ratio rather than the capital-labor ratio is useful because it accounts for both the direct impact of increased human capital on output per worker and the potential indirect impact through raising physical capital per worker (see, for example, Klenow and Rodriquez-Clare (1997) for further discussion). As a result, in our accounting, the growth rate of human capital per worker has a one-for-one impact on the growth rate of output per worker:
\displaystyle d\ln y_{st} = d\ln h_{st} + \frac{1}{\alpha_s} d\ln p_{st} + d\ln A_{st} + \frac{1-\alpha_{s}} {\alpha_{s}} d\ln \left( \frac{K_{st}} {Y_{st}} \right).    

To construct sectoral measures of the growth in human capital per worker due to educational attainment, we follow Young (2003) and assume human capital is a constant returns to scale function of differentiated labor types: in our baseline case, educated ( edu) and uneducated ( uned). Human capital growth may then be expressed as

\displaystyle d\ln h_{st} = \beta_{st}     \displaystyle d \ln l^{uned}_{st} + (1 - \beta_{st})     \displaystyle d \ln l^{edu}_{st}, (2)

where  l^{uned}_{st}  (l^{edu}_{st}) denotes the fraction of uneducated (educated) workers in sector  s at time  t, and  \beta_{st} is the share of sectoral wages paid to uneducated workers:
\displaystyle \beta_{st} \equiv \frac{w^{uned}_{st} l^{uned}_{st}} {w^{uned}_{st} l^{uned}_{st} + w^{edu}_{st} l^{edu}_{st}} = \frac{l^{uned}_{st}/l^{edu}_{st}} {l^{uned}_{st}/l^{edu}_{st} + exp\left( \gamma_{st} \right)}, (3)

where  \gamma_{st} \equiv ln(w_{st}^{edu}) - ln(w_{st}^{uned}) denotes sectoral returns to education. To convert our estimates of returns to education into measures of human capital, we are assuming that relative incomes of individuals within a sector reflect their relative marginal products. This assumption is consistent with the presence of labor market barriers as workers can be paid their marginal product and, at the same time, not necessarily be working in the sector in which they would be most productive. In summary, to compute education's contribution to within-sector human capital growth, it is sufficient to have returns to education (  \gamma_{st}) and the shares of educated and uneducated workers (  l^{edu}_{st}, l^{uned}_{st}) in each sector.

Turning to labor productivity growth due to labor reallocation, first note that in the absence of labor market barriers and under the assumption that both sectors have the same labor share, sectoral (average) labor productivities would be equated and reallocation would make no contribution to aggregate labor productivity growth.6 In the presence of labor market barriers, however, labor reallocation from low- to high- productivity sectors can generate sizeable growth in aggregate productivity. Isolating the portion of this growth that comes from increased educational attainment requires two measures: the change in the nonagricultural share of employment due to education and the associated productivity gain.

Suppose that the nonagricultural sector's share of total employment ( l_{nt}) is a function of the share of educated workers in the labor force ( l^{edu}_t) and other factors ( X_t). Education can affect the nonagricultural employment share through various channels such as reducing the barriers to working in non-agriculture and, to the extent returns to education are higher in non-agriculture, making workers more likely to choose a nonagricultural job. We can write

\displaystyle d l_{nt} = \frac{\partial l_{nt}}{\partial l^{edu}_t} d l^{edu}_t + \frac{\partial l_{nt}}{\partial X_t} dX_t,    

and want to measure the first term on the right side. Using micro-level data, we can construct the effect of education on sector choice as follows
\displaystyle \frac{\partial l_{nt}}{\partial l^{edu}_t} = \int \left[ P^{edu}_{nt}(i) - P^{uned}_{nt}(i) \right] dF^{edu}_{t}(i),    

where  i denotes a worker's type (based on observable characteristics such as education, gender, etc.),  P^{E}_{nt}(i) is an individual/time-specific probability of working in the nonagricultural sector conditional on education status  E, and  F^{edu}_{t} is the cumulative density of educated workers' types. That is, education's impact on sector choice is given by the average (across educated workers) change in the probability of working in the nonagricultural sector that results from obtaining education. Multiplying this measure by the change in the fraction of educated workers,  d l^{edu}_t, produces the change in the nonagricultural employment share due to education.

To construct the contribution to aggregate productivity growth, we must weight education's impact on the nonagricultural employment share by the productivity gain associated with reallocation. From the third term of Equation (1), one might consider simply weighting by the difference in average productivity in the two sectors:  \frac{\left( y_{nt}-y_{at} \right) }{y_{t}} \frac{\partial l_{nt}}{\partial l^{edu}_t} d l^{edu}_t. The problem with this weighting, however, is that it does not adequately isolate the productivity gain due solely to reallocation. Instead, we construct individual-specific measures of the productivity gain as the difference between being an uneducated nonagricultural worker and an uneducated agricultural worker.7 We also account for the possibility that the productivity gain may be correlated with the individual-specific change in the probability of working in the nonagricultural sector that results from obtaining education. Thus, our measure of education's contribution to productivity growth through reallocation is

\displaystyle \left [ \int \frac{\left[y^{uned}_{nt}(i) - y^{uned}_{at}(i) \right] \left[ P^{edu}_{nt}(i) - P^{uned}_{nt}(i) \right]}{y_t} dF^{edu}_{t}(i) \mbox{ } \right] d l^{edu}_t. (4)

In Section 4, we show how alternative measures that ignore the aforementioned correlation or rely on aggregate measures of productivity affect our results.

To recap, the key message of this section is that sector-specific returns to education (  \gamma_{st}) and individual-specific measures of education's impact on the likelihood  \left ( P^{edu}_{nt}(i) - P^{uned}_{nt}(i) \right ) and productivity gain  \left( \frac{y^{uned}_{nt}(i) - y^{uned}_{at}(i)} {y_t}\right) from working in the nonagricultural sector are needed to compute the contribution of education to labor productivity growth in a two-sector economy with barriers to labor mobility across sectors. In subsequent sections, we describe how the Chinese economy fits into the framework we have described, use micro-level data from China to estimate these measures, and then decompose the contribution of education to growth into the within- and between-sector channels.


3 Data and Institutional Background

3.1 China Health and Nutrition Survey

Quantifying education's role in accounting for both labor reallocation between the agricultural and nonagricultural sectors and increased human capital within sectors requires micro-level data with detailed information about the income, education and sector of workers. We use data from the China Health and Nutrition Survey (CHNS),8 an unbalanced panel household survey with refreshment that contains this key information along with other socioeconomic variables for each household member. It surveys the nine Chinese provinces shown in Figure 1: Guangxi, Guizhou, Heilongjiang, Henan, Hubei, Hunan, Jiangsu, Liaoning, and Shandong. We use six waves of the CHNS, ranging from 1989 to 2004, which allows us to also analyze time trends and recent changes in the Chinese labor market.

Figure 1: Map of Surveyed Chinese Provinces. Chinese provinces represented in the China Health and Nutrition Survey. A map of China in which the Guangxi, Guizhou, Heilongjiang, Henan, Hubei, Hunan, Jiangsu, Liaoning, and Shandong provinces are shaded.


The main advantages of the CHNS relative to other publicly-available data sets are that it includes data on both rural and urban areas and that it has several, including recent, survey years.9 One concern that arises, given our aggregate focus, is whether the nine provinces are representative of China as a whole on the dimensions most crucial to our analysis, which include educational attainment, the sectoral decomposition of workers, and the income differential between agricultural and nonagricultural jobs. To allay this concern, in Table 1 we compare summary statistics from the multiple waves of the CHNS (first panel) to those from national surveys - specifically, the census and China Statistical Yearbooks (CSYs) (second panel). Because the census only occurs every five years (1990, 1995, 2000, and 2005), Table 1 entries that rely on census data are reported for the year closest to the census survey year. In addition, for comparison with the CHNS, we construct annual statistics by fitting a second-order polynomial time trend to the census data.

Comparison of the two panels of Table 1 shows that the CHNS oversamples workers in the nonagricultural sector in early waves of the survey and undersamples them in later years. Therefore, in each survey year, we weight the CHNS sample so that sectors have the same weights as in the national surveys. The share of workers completing middle school (a proxy for educational attainment) in our weighted sample remains roughly in line with census data.

For the income statistics, although the CSYs do not provide a breakdown of income by sector, household income per capita is reported by rural and urban areas and we will use this decomposition for our comparison. Because 75 percent of nonagricultural workers live in urban areas and 86 percent of agricultural workers live in rural areas, the comparison of urban-rural income differentials probably gives a good picture of the nonagricultural-agricultural income differentials. Appendix A provides a detailed description of our income measures, and Table 1 displays annual income per household member by area in the CHNS. The income differential in the CHNS is slightly smaller than that in the CSY in most years, although the discrepancy increases in 2000 and 2004. A partial explanation of the discrepancy is that we use gross annual income measures for agricultural workers in the CHNS, whereas the CSY measures correspond to net total income. We did not use net income for agricultural workers because subtracting reported expenses from gross revenues led to negative reported net income for some households, which apparently is somewhat of a common problem with survey data on revenues and expenses (de Mel et al., 2007). In our view, using gross incomes for agriculture workers is a better choice. The main implication is that we will find a smaller benefit from labor reallocation out of the agricultural sector, but this works against our main finding that education has a large impact through the labor reallocation channel.

Next, we examine whether the CHNS data exhibits reasonable income patterns across educated and uneducated workers, which are important for our estimates of returns to education in Section 4. To do so, we regress income on education measures (and other control variables) and compare the coefficients to related studies. The first four columns of Table 2 contain results from regressing the log of per capita household income on household head characteristics. Regressions of this type are often used in the agricultural sector in the absence of individual income data, but they have also been run for nonagricultural workers. We find that one additional year of schooling for an agricultural (nonagricultural) household head is associated with a 0.02 (0.05) percent increase in income per household member, consistent with findings from a number of studies (e.g., Meng and Wu (1994), Li and Zhang (1998), Benjamin et al. (2000), and Yang and An (2002)). The regressions in columns (2) and (4) replace years of schooling with a dummy variable for whether the household head completed middle school. We postpone discussion of these results until Section 5, where we compare them to our main results, which report individual-level returns to education. The final column in Table 2 follows Young (2003) by regressing the average labor incomes of (sex x age x education) groups on appropriate dummies. We find returns to education similar to his estimates (although a bit higher for tertiary education).

3.2 Education and Barriers to Labor Mobility

Previous studies have documented the sizeable contribution of labor reallocation to China's recent growth (Brandt et al. 2008 and Dekle and Vandenbroucke 2006). Although many factors undoubtedly played a part in driving the reallocation, we have chosen to focus our efforts on quantifying the impact of one factor: increased educational attainment. One might wonder, "why is education a likely suspect for explaining the movement of workers into the nonagricultural sector?" In this subsection, we document Chinese migration regulations that created barriers to working in non-agriculture. We then argue that educated workers were more likely to overcome these restrictions and present evidence consistent with this hypothesis. If our hypothesis is correct, then from a macro perspective, increased education could have played a major role in making the allocation of labor more efficient.

3.2.1 Chinese Migration Regulations

Throughout our sample period, rural residents formed the majority of the Chinese population, ranging between 82 percent in 1978 and 57 percent in 2005, while most of the nonagricultural jobs could be found in urban areas. As a result, rural-to-urban migration was often necessary for a Chinese worker to obtain a nonagricultural job. A review of Chinese internal migration policy suggests that barriers to migration existed during the past three decades and were larger for uneducated individuals.

Migration from rural to urban areas is regulated primarily by China's household registration system, called the hukou system. Just as foreigners must acquire a working visa to legally work in many countries, rural Chinese residents should obtain a permit to work in urban areas. There are three types of permits: temporary residence permits lasting three to six months, medium-term permits (blue-stamped hukous) and permanent urban hukous. To obtain a medium-term permit or permanent urban hukou, an applicant must satisfy certain restrictive qualification criteria, largely based on wealth and education, and pay an urban entry fee set by the respective local government.10Thus, education affects the ability to migrate in two ways: directly through the qualification criteria and indirectly by reducing the real cost (in terms of hours worked) of the urban entry fee.

Despite changes to the hukou system that have made rural-to-migration easier, obtaining either a medium-term permit or permanent urban hukou has remained important for a rural migrant seeking employment.11 For example, a person without a local hukou needs special approval for her children to attend local primary or secondary school, and even with such approval, must pay additional fees that are often ten times the normal fees for attending primary or secondary school (Wang, 2005).

3.2.2 Empirical Evidence of Migration Barriers

The previous subsection described regulations which, if followed, make it relatively easier for educated workers to migrate to urban areas and work in the nonagricultural sector. However, if most migrants have temporary permits or no permit at all, then education might not actually be empirically linked with migration. In what follows, we check whether there is in fact a relationship (i.e., positive correlation) between education and migration. To be clear, we are not trying to identify a causal effect of education on outcomes - that exercise will be undertaken in the next section - but rather we are simply looking for evidence consistent with our hypothesis that Chinese migration regulations have real effects that are larger for the less educated.

To examine this hypothesis, we run regressions of the form

\displaystyle M(i) = \lambda_0 + \lambda_1 E(i) + \lambda_2 X(i) + \epsilon(i), (5)

where  M(i) indicates whether worker  i has migrated,  E(i) is an indicator of the worker's education level, and  X(i) is a vector of other characteristics. If education does help workers to migrate and work in the nonagricultural sector,  \lambda_1 will be positive. For these regressions, we use two measures of migration, inter-provincial migration and migration from anywhere (including intra-provincial migration). We also consider two different measures of education level, "completed middle school" and "completed high school". Other control variables,  X(i), include both individual characteristics (gender, race, birth-year fixed effects) and province-level controls, defined for the worker's birth province at the time the worker was 12 years old. The provincial controls include fixed effects and proxies for socioeconomic conditions which may affect the migration decision, such as the share of urban hukuo holders, the share of nonagricultural employment, and the share of the state sector.

The sample we use to estimate Equation (5) consists only of household heads because they are the only individuals for whom the CHNS provides migration status. We further restrict the sample to nonagricultural sector workers because the migration regulations restrict migration from rural to urban areas (i.e., into nonagricultural employment). Table 3 displays the results. Columns (1)-(2) are based on the "completed middle school" education measure, while columns (3)-(4) repeat these regressions using "completed high school". The odd-numbered columns present results for all migrants, while the even columns focus on between-province migrants. We also experimented with various combinations of the provincial controls (not shown in Table 3) but found that, regardless of the specification, educated workers in the nonagricultural sector are more likely to be migrants than uneducated workers, especially if they have completed high school.

One caveat concerning the applicability of these results to the entire Chinese population is that the CHNS may not representatively sample individuals based on their hukou status. That is, if migrants with temporary permits or no permit at all are under-represented in the CHNS, our finding above could simply reflect the effect of education on obtaining a medium-term or permanent permit.12 Although this could possibly drive the results above, other pieces of evidence increase our confidence that educated workers are, in fact, more likely to migrate.

First are the results of studies based on household surveys which contain information on household members that have migrated to another location. Using data for the Changping county of suburban Beijing from 1979-1988, Zhao (1997) found that schooling played an important role in raising the probability of rural-to-urban migration. De Brauw and Giles (2008) use data collected from four provinces (Shanxi, Jiangsu, Anhui, and Henan) from 1986 to 2003. They exploit policy changes which made it easier for rural migrants to obtain a temporary permit and find that relaxed migration barriers lead to reduced educational attainment for rural migrants. One interpretation of this finding is that because temporary permits do not include education as a qualification criteria, making temporary permits easier to obtain leads to a reduction in migration costs especially for the uneducated. Education's role in reducing the cost of migration is thus weakened, and educational attainment may decrease.

Second, previewing some of the results reported in the next section, we find that the income difference between working in the nonagricultural and agricultural sector is greater for educated workers. Because educated workers have more to gain from working in the nonagricultural sector, they have a greater incentive to migrate in search of these jobs.


4 Empirical Framework and Results

In this section we develop and estimate a tractable model to quantify the effect of education on workers' sector choice, their earnings within each sector, and their total productivity gains from education. The key variables of interest are sector-specific returns to education (  \gamma_{s}), education's impact on the probability of working in the nonagricultural sector  \left ( P^{edu}_{nt}(i) - P^{uned}_{nt}(i) \right ) and the associated productivity gain  \left( \frac{y^{uned}_{nt}(i) - y^{uned}_{at}(i)} {y_t}\right). We then use these estimates to construct measures of education's contribution to labor reallocation and economic growth.

4.1 The Model

We specify a model in which individuals choose their level of education and the sector in which they work to maximize the expected net present discounted value (PDV) of their lifetime income. We assume that individuals first decide whether to obtain an education  \left(E \in \{0,1\} \right), then choose to work in the agricultural or nonagricultural sector  \left( s \in \{a,n\} \right), and finally enter the work force, remaining in the same sector until retirement.

We denote  V_{s}^{E}(i) as the expected PDV of lifetime income for individual  i with education level  E who works in sector  s

\displaystyle V_{s}^{E}(i)=\mathbb{E}\left( \underset{t=t_{E}+b}{\overset{T+b}{\sum }}\rho ^{t-(t_{0}+b)}Y_{s,t}^{E}(i)\right) ,    

where  t denotes calendar year,  b denotes  i's birth year,  t_{E} is the age at which  i starts work conditional on having education level  E,  T denotes retirement age,  \rho is the discount factor, and  Y_{s,t}^{E}(i) is individual  i's earnings at time  t. Following the literature started by Roy (1951) and Mincer (1974), we posit that individual earnings take the form
\displaystyle Y_{s,t}^{E}(i)=\exp (\alpha_{s}+\beta _{s}X_{t}(i)+\gamma _{s}E(i)+\mu_{s}(i)+\epsilon _{s,t}(i)). (6)

That is, earnings of individual  i at time  t depend on her educational attainment  E(i), other observable characteristics  X_{t}(i), her individual-sector-specific productivity  \mu _{s}(i) which is not observed by researchers, and random shocks  \epsilon _{s,t}(i). We assume that the unobserved heterogeneity and the random shocks are independent of each other and are drawn from iid normal distributions with mean zero and variances  \sigma _{\mu,s}^{2} and  \sigma_{\epsilon,s}^{2}, respectively.

An individual's choice of education and sector will maximize her discounted earnings net of the costs associated with the schooling and sector choice. We denote  C^{E}_{n}(i) as the cost of working in the nonagricultural sector (relative to the agricultural sector) for an individual with education level  E

\displaystyle C_{n}^{E}(i) = \widetilde{ \theta} Z(i) + \widetilde{\delta}E(i) + \widetilde{ \eta}(i),    

where  Z(i) are some observable characteristics and  \widetilde{\eta}(i) is a normal random variable with mean zero and variance  \sigma_{\eta}^2. This cost function is meant to capture barriers to labor mobility between sectors, such as the migration regulations, and we allow for the possibility that education can alter these costs. In particular, if the cost of working in the nonagricultural sector is relatively low for the educated,  \widetilde{\delta} will be negative. In addition, let  C_{edu}(i) denote the cost of education for individual  i
\displaystyle C_{edu}(i) = \widetilde{\kappa} W(i) + \widetilde{\nu}(i),    

where  W(i) are some observable characteristics and  \widetilde{\nu}(i) is a normal random variable with mean zero and variance  \sigma_{\nu}^2.

For a given educational attainment, individual  i will choose to work in the nonagricultural sector if and only if the difference between the expected PDV of lifetime income in the nonagricultural and agricultural sector is greater than the cost of working in non-agriculture

\displaystyle \frac{\left[V^{E}_{n}(i) - V^{E}_{a}(i)\right]} {\sigma_{\eta}} > \theta Z(i) + \delta E(i) + \eta(i), (7)

where  \theta \equiv \widetilde{ \theta}/\sigma_{\eta}, etc. Moving to education choice, we can express an individual's expected net PDV of lifetime income conditional on education as
\displaystyle V^{E}(i) = \left( V^{E}_{n}(i)-C_{n}^{E}(i)\right)     \displaystyle \Pr [s=n\vert E(i)]     \displaystyle +     \displaystyle V^{E}_{a}(i) \Pr [s=a\vert E(i)].    

Individual  i obtains education if and only if the difference between the expected PDV of net lifetime income with education and that without education is greater than the cost of education:
\displaystyle \frac{\left[ V^1(i) - V^0(i) \right]}{\sigma_{\nu}}> \kappa W_{i} + \nu(i)   , where  \kappa=\frac{\widetilde{\kappa}}{\sigma_{\nu}} &  \nu(i)=\frac{\widetilde{\nu}(i)}{\sigma_{\nu}}. (8)

We now describe our baseline specification of Equations (6) - (8), postponing discussion of alternative specifications until Section 5. We exclude individuals who change education or sector from the sample for our baseline analysis but will include these observations in a subsequent robustness exercise.

(i) Income process: Education  E(i) is a dummy variable taking a value of 1 if the individual completes middle school. Other worker characteristics  X_{t}(i) which affect earnings include gender, race, marital status, age, age squared, dummies for provinces, and linear province-specific time trends where year 1989 is normalized to zero. The estimated income processes, one for each sector, allow us to predict a worker's expected income over time. Then, to calculate the worker's predicted present discounted value of lifetime income, we assume a discount rate of 0.95, that workers choose whether to attend middle school at age 12, that they enter the labor force at age 15 if educated and age 12 otherwise, and that they retire at age 65.

(ii) Sector choice: The cost of working in the nonagricultural sector is a function of characteristics  Z(i): gender, race, dummies for birth groups, dummies for provinces, and province-level employment shares of the state and nonagricultural sector. Birth groups are defined as workers born in (1) 1947 or earlier, (2) from 1948 to 1955, (3) from 1956 to 1965 and (4) 1966 or later. Workers in the fourth birth group made their education and sector choice in or after 1978, the beginning of the Open Door policy reforms. The other dates were chosen to equally divide the remaining cohorts into three groups. We use dummies for birth groups to control for cohort effects but also consider a more flexible form of accounting for such effects in our alternative specifications. The final two variables, the province-level employment share of the state and nonagricultural sector, are for the individual's original province at the time she makes her education/sector choice (age 12). We include the nonagricultural employment share because workers from provinces with a large supply of nonagricultural sector jobs likely face a lower cost of working in that sector. The state sector employment share is included because provinces with large state sectors may be particularly slow to respond to the rapid expansion of nonagricultural jobs, and thus, workers in these provinces likely face higher costs of working in the nonagricultural sector.

(iii) Education choice: Variables  W(i) which affect the cost of education include gender, race, dummies for provinces, dummies for birth groups, and the number of students enrolled in secondary school, which is constructed for the individual's original province at the time she was age 12. Higher enrollment may proxy for lower schooling costs or positive peer pressure to attend school.

(iv) Attrition: Finally, we also assume that the distribution of unobserved heterogeneity in the CHNS sample remains the same as the population distribution, assumed to be normal  (0, \sigma _{\mu,s}^{2}). This implies that either worker attrition from our sample was random in terms of unobserved sector-specific ability or that these workers were replaced by workers with, on average, the same ability.

Our model specification obviously invokes some restrictive assumptions, such as assuming that workers make their education and sector choice just once, specifying educational attainment as a binary variable (as opposed to allowing for multiple levels), and assuming random attrition in terms of unobservable sector-specific abilities. The main reason we employ these rather simple assumptions is because of data limitations that would make it difficult to get identification in a more general model.

Specifically, regarding our assumptions on education and sector choice, although the CHNS panel covers a fairly long time period (1989 - 2004), less than 18 percent of workers ever change sector or education status. This reflects the fact that over 87 percent of workers were older than 22 (i.e., above schooling age) when they entered the survey, and the average time span for a surveyed individual is only 4.5 years. Moreover, the survey does not provide information on workers' employment histories nor the timing of their school attendance. Given these data limitations, estimating a more general model would require additional assumptions which would likely raise as many concerns as our current assumptions. However, although we do not attempt to estimate a more complicated model, we do undertake some robustness checks in Section 5 to partially address the concerns regarding education and sector choice.

We make our assumption of random attrition for two reasons. First, some attrition appears to be generated by technical issues regarding survey design as opposed to workers' choices. For example, households in Liaoning province were sampled in 1989 and 1991, but not in 1993 and 1997. Second, our data provides little information about why workers leave the sample and where they go, and thus, we cannot estimate a worker's decision to leave without relying on additional assumptions regarding the worker's outside options for residential choice.13

A detailed explanation of the identification of model parameters and our estimation method (simulated method of moments) is relegated to Appendix B, but we did think our procedure for estimating the income process of individuals in the agricultural sector deserved some explanation here. A challenge arises because the majority of workers in the agricultural sector are self-employed and multiple household members often work in the family farm. In other words, we observe household-level rather than individual-level income for much of the agricultural sector. To estimate individual-specific parameters using the household-level data, we aggregate the individual income processes within households and assume all household members receive the same random shocks. That is, for household  h_{j}:

\displaystyle ln \left( \sum_{i\in h_{j}}Y_{s,t}^{E}(i) \right) = ln \left( \sum_{i\in h_{j}}\exp (\alpha_{s} + \beta_{s} X_{t}(i) + \gamma_{s} E(i) + \mu_{s}(i)) \right) + \epsilon _{s,t}(h_{j}), (9)

where  \epsilon _{s,t}(i) = \epsilon _{s,t}(h_{j}) for all  i\in h_{j}. Thus, the identification of the agricultural sector income process comes from variation in the composition of agricultural households.

4.2 Estimates from Micro-level Data

Tables 4 and 5 contain the estimated income processes for the agricultural and nonagricultural sectors, with results for the baseline specification reported in column (1). We estimate returns to education of 10.1 percent in the agricultural sector and 14.2 percent in the nonagricultural sector. One interpretation of these findings is that formal education produces human capital which is more useful in nonagricultural production.

Although the estimated returns to education in the agricultural sector are not statistically significant at the conventional level, we do not think the lack of statistical power necessarily implies no returns to education. Because our model allows for correlated unobservable income shocks across time within a household (i.e., from the unobserved sector-specific ability), our "effective" number of observations is closer to the number of unique households (around 2,500) rather than the almost 8,000 household-survey-year observations reported in Table 4. When we do not allow for correlation of unobservable income shocks (see Section 5), the estimated returns to education are somewhat smaller but statistically significant at the 5 percent level (column (2) of Table 4).

Table 6 shows the parameters governing sector choice. In our baseline specification in column (1), education decreases the cost of working in the nonagricultural sector and, for the marginal worker, raises the probability of working in the nonagricultural sector by more than 50 percent. Tables 7 and 8 show the rest of the results from our baseline model estimation, including parameters governing education choice (Table 7) and the shock processes (Table 8). Although we will not use these estimates for subsequent analysis, we note a few results: workers are more likely to obtain education the larger the difference between the expected PDVs with and without education ( V^1 - V^0) and the smaller is the cost of education, which was proxied for by province-level school enrollment at the time individuals chose schooling. Finally, because all the equations in the baseline specification (column (1) of Tables 4-7) are simultaneously estimated, the model-fit statistics (i.e., sum of squared residuals) are reported just once, in Table 8.

4.3 Education's Contribution to Economic Growth

We can now assess the role of increased educational attainment in accounting for China's economic growth from 1978 to 2004.14 Education led to greater labor productivity in two ways: it increased human capital, and it facilitated the reallocation of labor from agricultural jobs to higher-productivity nonagricultural jobs. Our main finding is that the second channel was at least as important as the first in accounting for recent Chinese growth.

Table 9 shows the results of our calculations and compares them with other growth accounting exercises for China. The first panel of the table shows estimates of the growth in output per worker from various studies of the Chinese economy. For studies which considered multiple sectors of the economy, the second panel weights the sectoral growth rates by the sector's share of the economy to decompose aggregate growth according to Equation (1). Finally, the third panel displays our estimates of the contribution of education to aggregate growth through each term in Equation (1).

Our baseline estimates of agricultural and nonagricultural returns to education of 10.1 and 14.2 percent, respectively, imply within-sector annual growth in human capital per worker of 0.147 and 0.142 percent. Multiplying these estimates by the income share of each sector, Table 9 (first row of third panel) shows that human-capital growth from increased education in agriculture (non-agriculture) accounts for 0.034 (0.107) percentage points of growth per year. That is, the direct effect of increased educational attainment on human capital accounts for about 2 percent (0.49 by agriculture and 1.54 by non-agriculture) of overall growth in Chinese output per worker from 1978 to 2004.

Turning to education's contribution to productivity through labor reallocation, we use our estimated income processes and sector choice equation to construct the productivity gain resulting from the increased probability educated individuals have of obtaining nonagricultural jobs. Column (4) of Table 9 contains results for our baseline measure - see Equation (4) - which we now describe in more detail. For each educated worker  i, we construct the productivity difference between being an uneducated nonagricultural worker and an uneducated agricultural worker based on our estimated income processes. We then multiply this difference by the change in the individual-specific probability of working in the nonagricultural sector that results from attaining education. Note that the probability difference could be zero for some workers, meaning educational attainment has no impact on their sector choice. Finally, we calculate the average (across individuals) of this product for each of the six CHNS waves and then average over the six waves. We multiply our estimate of the average productivity gain by the growth in the share of educated workers, which averaged 1.11 percentage points per year from 1978 to 2004.

The result is an increase in output per worker of 0.652 percentage points, which is just over 9 percent of the total growth in output per worker of 7.0 percent reported by Brandt et al. (2008). Thus, rising educational attainment accounts for 38 percent of the contribution of labor reallocation to Chinese economic growth. Moreover, it appears education's impact on economic growth through facilitating labor reallocation is relatively more important than its impact through increasing human capital (a combined 0.141 percentage points per year).

Columns (5)-(8) of Table 9 contain alternative measures of education's contribution to growth through labor reallocation that are based on different constructions of the productivity gain associated with moving into the nonagricultural sector. Our baseline measure took into the account that individuals who were more likely to work in the nonagricultural sector might also have higher productivity gains:  \int_i \left( y^{uned}_n(i) - y^{uned}_a(i) \right)\left( P^{edu}_n(i) - P^{uned}_n(i) \right). If we do not allow for this interaction,  \int_i \left( y^{uned}_n(i) - y^{uned}_a(i) \right) \times \int_i \left( P^{edu}_n(i) - P^{uned}_n(i) \right), column (5) (first row of third panel) shows our estimate of education's contribution to growth is slightly lower (0.636 vs 0.652). The measure in column (6) allows for the interaction but does not isolate the productivity gain due solely to reallocation. Specifically, it allows a change in education status in addition to the sector change,  y^{edu}_n(i) - y^{uned}_a(i), and as expected, education's contribution through reallocation increases (to 0.761). Finally, columns (7) and (8) show the effect of simply using the average productivity difference between nonagricultural and agricultural,  y_n - y_a, rather than controlling for individual characteristics. The measure in column (7) is based on average sectoral labor incomes from our CHNS sample, scaled by sectoral labor shares to produce measures of output per worker, while column (8) uses direct measures of output per worker from the CSYs. The difference between the measures based on alternative sources of aggregate data is small (0.896 vs. 0.862) compared to the difference between using aggregate and micro-level data (0.652). This clearly shows the importance of controlling for individual characteristics to isolate education's contribution to growth through reallocation.


5 Robustness Checks

In this section we examine the robustness of our growth accounting results. We first consider four alternative specifications of our empirical model and re-compute education's contribution through the within- and between-sector channels. We then make some back-of-the-envelope calculations to relate our approach to others from the growth accounting literature.

5.1 Alternative Empirical Specifications

5.1.1 No Unobserved Heterogeneity

Our first alternative specification omits the unobserved individual-sector-specific ability. Formally, we assume that  \mu _{s}(i) is drawn from a degenerate distribution, and thus, the error terms in the wage equation will be uncorrelated with the shocks affecting sector choice and education. This simpler model will produce biased estimates, but we think it is interesting to investigate the magnitude of the bias. Note that the sign of the bias is not clear ex-ante. For example, if high productivity workers face low costs of education and thus stay in school to signal their types to potential employers, then returns to education will be overestimated (ability bias). On the other hand, if the opportunity cost of schooling is high for extremely productive entrepreneurs, then returns to education will be underestimated (selection bias).

Column (2) of Tables 4-7 show the resulting estimates under this specification.15 The returns to education are fairly similar to our baseline estimates in column (1), just a bit smaller for agriculture and somewhat larger for non-agriculture. The small differences are consistent with studies that find returns to education based on OLS regressions are quantitatively similar to those that control for the correlation between educational attainment and unobserved ability, either by using instrumental variables (e.g., Angrist and Krueger (1991)) or by focusing on identical twins (e.g., Ashenfelter and Rouse (1999)). Education's effect on sector choice is also similar to the baseline case (Table 6), and as a result, the growth accounting analysis without unobserved heterogeneity is quantitatively close to the baseline: rows (1) and (2) of the third panel of Table 9 show total contributions of education to growth of 0.793 and 0.798 percent per year, respectively.

Thus, the results without unobservable heterogeneity introduce only a small bias. At the same time, the computational intensity of the estimation procedure is greatly diminished because we can estimate the model equations sequentially, as described in Appendix B. We will therefore omit unobservable heterogeneity for the rest of our robustness checks to take advantage of the reduced computational burden.

5.1.2 Time-varying Effects of Education

We next allow for the possibility that education's impact on income and the cost of working in the nonagricultural sector may have changed over time. Specifically, we add interaction terms  E_{i} \times \tau to the income processes ( \tau = survey year) and sector-choice equation ( \tau = birth-cohort dummy). We are particularly interested in whether education had a smaller impact on sector choice as migration restrictions were loosened over time. Column (3) of Table 6 shows only slight evidence of this. The coefficients on the interaction terms are positive, meaning that education led to a smaller decrease in the cost of working in the nonagricultural sector over time, but statistically insignificant. We also find that returns to education increase over time in both sectors. Turning to our growth accounting, the bottom line from Table 9 is that education's contribution to growth is a little smaller than in the previous specification (0.740 vs 0.798), and the reallocation channel still accounts for most of the contribution.

5.1.3 Changes of Education and Sector

We also examine how our findings change when we include individuals who change their sector and educational attainment over time. 5.8 percent of workers in our sample increase their education from "not completed middle school" in an earlier survey to "middle school attainment or higher" in a later survey.16In addition, 1,084 workers (12.6 percent of all workers) in our sample change sectors at some point. 600 workers move from agricultural to nonagricultural, 235 move the other direction, and the rest change sectors multiple times.

We include individuals whose status changes by using their latest reported educational attainment and sector as their education and sector throughout their survey participation. Although a simple way of including changers, this method will cause us to underestimate the contribution of education through reallocation because it classifies a large number of uneducated agricultural workers in early surveys as uneducated nonagricultural workers. Because we continue to use these workers' agricultural incomes while classifying them as nonagricultural workers, the income gain from moving from agriculture to non-agriculture will be underestimated. Related, nonagricultural returns to education will be biased upwards. In fact, we find that nonagricultural returns to education (Table 5) are 33.6 percent,17 about double the baseline case, while the gain from labor reallocation is only 60 percent of the baseline case. Even though the contribution of education through labor reallocation is biased downwards, row (4) of Table 9 shows that the impact of the reallocation channel continues to be more important than within-sector human capital growth (0.386 vs 0.237).

5.1.4 Definition of Education

A final alternative specification defines educated individuals as those who have completed high school rather than middle school. The last row of Table 9 shows that economic growth resulting from labor reallocation due to education is 0.168 percentage points per year, significantly lower than the baseline "no heterogeneity" specification (0.650). This change is largely the result of much slower growth in the share of high-school educated workers (0.33 vs 1.11 percentage points per year for those completing middle-school). Our estimates of economic growth within sectors also show small changes in the direction one would expect. Although education's total contribution to growth is lower, the labor reallocation continues to be the primary channel through which education affects growth.

5.2 Relation to Other Approaches in Literature

5.2.1 Household- vs Individual-Level Returns to Education

As discussed in Section 3, studies in the literature often estimate returns to education in the agricultural sector at the household-head level. We find that our estimates of household-head returns to education (columns (2) and (4) of Table 2) are higher than our baseline estimates of individual-level returns to education (column (2) of Tables 4 and 5). The reason for this is that 61 percent of workers in multiple-worker households have the same education level as their household head. Returns to education measured at the household-head level can thus be interpreted as individual-level returns weighted by the number of educated workers in the household.

5.2.2 Level of Schooling

To this point in our analysis we have simply classified individuals as either educated or uneducated rather than considering a finer gradation of education levels. We do this primarily to keep our estimation of the Roy model tractable but are aware that it may dampen our within-sector measures of human capital growth. We therefore consider a back-of-the-envelope calculation to examine the extent of the bias.

Specifically, we follow Young (2003) in considering four levels of education: none, primary, secondary, and tertiary. We use Young's estimates of returns to education in the nonagricultural sector and the number of nonagricultural workers of each education type (taken from the census) to construct the growth rate of human capital as described by Equations (2) and (3) but for four labor types. The resulting growth rate of human capital due to education is 0.258 percentage points. We then combine the two lowest and the two highest education groups and redo the calculation. The growth rate of human capital becomes 0.187 percentage points.18 Although within-sector human capital growth is smaller when we consider only two education levels, the increase in growth from allowing more education levels (0.07 percentage points) would not change our conclusion about the relative importance of the reallocation and human capital channels.

5.2.3 Comparison with One-Sector Growth Accounting

In the absence of evidence on sectoral levels of education in China, previous studies often employed a single education index and aggregate production function to assess education's contribution to growth (e.g., Bosworth and Collins 2008). Our finding that labor reallocation is an important channel through which education impacts growth raises the question of whether not explicitly taking multiple sectors into account will significantly alter the estimate of education's contribution. Although it is not possible to derive an analytical relationship showing how the results of the two-sector and one-sector approaches differ, we can use our data and estimates to make a rough calculation.19

Using our measures of sectoral returns to education and the distribution of workers across sector-education groups, we first construct an average (across sectors) return to education.20 This is simply the unconditional (on sector) expected income differential between an educated and uneducated worker. We then compute aggregate human capital growth by applying Equations (2) and (3) to a one-sector model. Our resulting estimate of human capital growth due to education, which equals the total contribution of education to growth in a one-sector setup, is 0.47 percent, whereas education's contribution in our two-sector model was 0.76. This may indicate that a one-sector framework does not fully capture education's impact on growth through moving workers into the higher-productivity sector.


6 Conclusion

This paper examines education's role in accounting for recent Chinese economic growth through two channels: facilitating labor reallocation from the agricultural sector to the nonagricultural sector and increasing human capital within sectors. We find that increased education accounts for more than one-third of the growth attributable to labor reallocation. Overall, it accounts for about 11 percent of aggregate growth, and the labor reallocation channel (9 percent) was more important than the direct human capital channel (2 percent). We thus conclude that enabling labor reallocation is an important channel through which education can affect economic growth.

Our findings raise some interesting questions for future research. The first is assessing how important education's impact on labor reallocation has been for the growth experience of countries other than China. In order for education to affect growth through the labor reallocation channel, all that is needed is some labor market frictions which are less restrictive for educated workers. In particular, explicit regulations on labor mobility (as found in China) are not necessary for this channel to be operative. Although education's impact on labor reallocation may be exceptionally large in China because of the migration regulations, there is suggestive evidence that this channel could be quantitatively important in other developing countries. Many empirical papers, following Schultz (1971) and Todaro (1976), have documented substantial income differences between urban and rural areas in developing countries and have shown that educated workers are more likely to migrate into urban areas.

A second question considers the question of optimal government education and migration policy. Although allowing increased rural-to-urban migration may yield substantial income gains (and a more efficient allocation of labor inputs), large-scale migration could also impose substantial negative externalities on city dwellers, such as overcrowding and congestion effects. For a policymaker weighing the trade-off between increased economic growth and the negative externalities associated with migration, an optimal migration policy may condition on education as a way of gradually allowing unrestricted labor movement. Although we make no claim on its optimality, especially in terms of how restrictive the migration policy should be, this is exactly the type of policy employed in China.


Bibliography

`Plant-Size Distribution and Cross-Country Income Differences'.
NBER WP 14060.
`Does Compulsory School Attendance Affect Schooling and Earnings'.
Quarterly Journal of Economics 106(1):976-1014.
`The Payoff to Education'.
Working paper, Princeton University.
`Cross Country Differences in Productivity: The Role of Allocative Efficiency'.
Working Paper, University of Maryland.
`Markets, Human Capital, and Inequality: Evidence from Rural China'.
Davidson Institute Working Paper 298.
`A Decade Lost and Found: Mexico and Chile in the 1980s'.
Revies of Economic Dynamics 5(1):166-205.
`Does Schooling Cause Growth'.
American Economic Review 90(5):1160-1183.
`Accounting for Growth: Comparing China and India'.
Journal of Economic Perspectives 22(1):45-66.
`Growth and Structural Transformation in China'.
In L. Brandt & T. Rawski (eds.), China's Great Economic Transformation, pp. 683-728. Cambridge University Press, New York.
`The U.S. Structural Transformation and Regional Convergence: A Reinterpretation'.
Journal of Political Economy 109(3):584-616.
`Is China Abolishing the Hukou system?'.
The China Quarterly 195.
`The Hukou System and Rural-Urban Migration in China: Processes and Changes'.
The China Quarterly 160:818-855.
`Migrant Opportunity and the Educational Attainment of Youth in Rural China'.
Working paper, Michigan State University.
`Measuring Microenterprise Profits: Don't Ask How the Sausage is Made'.
World Bank Policy Research WP 4229.
`A Quantitative Analysis of China's Structural Transformation'.
Working paper, University of Iowa.
`Political Change and the Urban-Rural Gap in Basic Education in China, 1949-1990'.
Comparative Education Review 43(2):193-211.
`The Depressing Effect of Agricultural Institutions on the Prewar Japanese Economy'.
Journal of Political Economy 116(4):573-632.
`The Empirical Content of the Roy Model'.
Econometrica 58(5):1121-1149.
`Misallocation and Manufacturing TFP in China and India'.
Quarterly Journal of Economics .
`The Neoclassical Revival in Growth Economics: Has It Gone Too Far?'.
In B. S. Bernanke & J. J. Rotemburg (eds.), NBER Macroeconomics Annual 1997. MIT Press, Cambridge, MA.
`Returns to education under collective and household farming in China'.
Journal of Development Economics 56(2):307-335.
`A Practitioner's Approach to Estimating Intertemporal Relationships Using Longitudinal Data: Lessons from Applications in Wage Dynamics'.
In Z. Griliches & M. Intriligator (eds.), Handbook of Econometrics, vol. 6, pp. 4057-4167. Elsevier Science/North-Holland, Amsterdam.
`Household Income Determination and Regional Income Differentiation in Rural China'.
Working paper, Munich Personal RePEc Archive.
Schooling, Experience, and Earnings.
Columbia University Press, New York.
`Barriers to Technology Adoption and Development'.
Journal of Political Economy 102(2):298-321.
'Population Census of China'.
China Statis. Pub. House, Beijing.
'Chinese Statistical Yearbook'.
China Statis. Pub. House, Beijing.
Radicalism and Education Reform in 20th-century China.
Cambridge University Press, New York.
`Agriculture and Aggregate Productivity: A Quantitative Cross-Country Analysis'.
Journal of Monetary Economics 55(2):234-250.
`Some Thoughts on the Distribution of Earnings'.
Oxford Economic Papers 3(2):135-146.
`Rural-Urban Migration in Colombia'.
The Review of Economics and Statistics 53(2):157-163.
Internal Migration in Developing Countries.
International Labour Office, Geneva.
`How Important are Dual Economy Effects for Aggregate Productivity'.
Journal of Development Economics 88:325-334.
`Reformed Migration Control and New Targeted People: China's Hukou System in the 2000s'.
The China Quarterly 177:115-132.
Organizing Through Division and Exclusion.
Stanford University Press.
`Education and Self-Selection'.
Journal of Political Economy 87(5):S7-S36.
Econometric Analysis of Cross Section and Panel Data.
The MIT Press, Cambridge, MA.
`Human capital, entrepreneurship, and farm household earnings'.
Journal of Development Economics 68(1):65-88.
`Gold into Base Metals: Productivity Growth in the People's Republic of China during the Reform Period'.
Journal of Political Economy 111(6):1220-1261.
`Labor Migration and Returns to Rural Education in China'.
American Journal of Agricultural Economics 79(4):1278-1283.


Appendix

A. Definition of Variables

  • Sector: An individual is classified as an agricultural worker if her primary occupation is farming, fishing, or hunting.
  • Annual Income: We construct income measures at the household level in the agricultural sector and at the individual level in the nonagricultural sector. In the agricultural sector, gross annual income includes both income earned from selling products and the market value of farm products consumed by the household. In the nonagricultural sector, income includes wages, bonuses, and subsidies for each individual. Individual subsidy income is constructed by dividing the household-level subsidy income, which aggregates subsidies for food, housing and other items, by the total number of workers in the household.

    The reason we construct household rather than individual income in the agricultural sector is that, given our data, there is no satisfactory method for splitting household income between working members. For example, although we have a measure of the number of hours spent in agricultural production by each household member, we cannot construct individual-specific incomes by allocating household income proportionally to the number of hours worked by each member because this would imply no returns to education within a household.

B.1 Unobserved Heterogeneity

We use simulated method of moments to estimate the baseline model. We group the moments into five categories corresponding to agricultural- and nonagricultural-sector income, sector choice, education choice and the distribution of unobservable heterogeneity. Let  M denote a vector of moments partitioned into five groups:  M = [M^{1}, M^{2}, M^{3}, M^{4}, M^{5}]'. For the first four groups of moments, we use the difference between actual and predicted outcomes, multiplied by instruments that are observable variables in the corresponding equations. Assuming unobservable productivity  \mu \equiv \{[\mu_a(i),\mu_n(i)]'\}_{i=1}^I is known, we can compute the first four groups of moments as follows:

\displaystyle M^{j}(\mu) = \frac{1}{I_j} \sum_{i} w_i X_{i}^{j} 1_{i}^{j} ( Y_i^{j}- \widehat{Y}_i^{j}(\mu))    for  j=1,2,3,4\displaystyle \\    

where for  j = 1,2,  Y_i^{j} and  \widehat{Y}_i^{j} are actual and predicted income of agricultural ( j=1) and nonagricultural ( j=2) workers, for  j = 3, they are indicator functions that take a value of one if person  i works (or is predicted to work) in the nonagricultural sector, and for  j=4, they take a value of one if person  i received (or was predicted to receive) education.  1_{i}^{j} is an indicator variable for whether person  i contributes to moment  j (i.e., for  j=1,  1_{i}^{j} = 1 if  i is an agricultural worker), and  I_j is the total number of individuals who contribute to moment  j. As discussed in Section 3, the observations weights  w_i are set so that the sectoral composition of our sample matches that from national surveys.

For the last group of moments  M^5, we use five moments related to the model's random variables (i.e., random shocks to income and unobservable heterogeneity). The first two moments are the variance (over time) of individual income in each of the two sectors, where income is demeaned by an individual's average income across years. The other three moments include the variance of mean income across workers in each sector (2 moments) and the mean income difference between the two sectors (1 moment).

Our computational algorithm works as follows. We first make an initial guess for the parameters, including the variances of  \mu _{a} and  \mu _{n}. Given these variances, we draw 20 sets of  \{[\mu_a(i),\mu_n(i)]'\}_{i=1}^I (i.e., unobservable productivities for all individuals). Given the productivity draws and parameters, we then compute the objective function, that is the sum of the squared residuals of the moments (i.e.,  M'M). We use optimization algorithms to search for the parameter values which minimize the objective function, using 87 moments to pin down 87 parameter values of which 82 are coefficients of observable variables and 5 govern the distribution of random variables in the model. Note that because the model is exactly identified, the choice of weighting matrix does not affect the asymptotic efficiency of the estimates. The inference of the estimation follows Wooldridge (2001) and MaCurdy (2007).

B.2 No Unobserved Heterogeneity

Our alternative specifications assume the error terms in Equations (6), (7) and (8) are uncorrelated. Specifically, the errors in the income processes are mean independent of an individual's education and sector. For the nonagricultural sector, we observe individual incomes and can estimate Equation (6) by taking the log of both sides of the equation and using weighted ordinary least squares. For the agricultural sector, we observe household-level income and therefore estimate Equation (9) using weighted nonlinear least squares.

After estimating the income process in both sectors, we construct fitted values of the present discounted value of income given the education and sector of each individual. We let  \widehat{V}_{s}^{E}(i) denote the fitted value. Then, given education and these fitted values, we can estimate the sector choice equation (Equation 7) as follows:

\displaystyle I(s = n\vert E) \displaystyle \iff \displaystyle \widehat{V}_{n}^{E}(i) - \widehat{V}_{a}^{E}(i) - C_{n}^{E}(i) > 0  
  \displaystyle \iff \displaystyle \frac{1}{\sigma _{\eta}} \left( \widehat{V}_{n}^{E}(i) - \widehat{V}_{a}^{E}(i)\right) - \theta Z(i) - \delta E(i) > \eta(i),  

where  I(s = n\vert E) is an indicator of sector choice. The estimated coefficient on  \widehat{V}_{n}^{E}(i) - \widehat{V}_{a}^{E}(i) is thus the inverse of the square root of the variance of  \eta.

We can then estimate a worker's choice of education as follows:

\displaystyle I(E = 1) \displaystyle \iff \displaystyle \left\{ \widehat{V}^{1}(i) - \widehat{V}^{0}(i) \right\} - C_{edu}(i)>0  
  \displaystyle \iff \begin{displaymath}\frac{1}{\sigma _{\nu}} \left\{ \begin{array}{l} \left( \widehat{V}_{n}^{1}(i)\times \widehat{\Pr} (n\vert 1)(i) + \widehat{V}_{a}^{1}(i) \times \widehat{\Pr} (a\vert 1)(i)\right) - \ \left( \widehat{V}_{n}^{0}(i) \times \widehat{\Pr} (n\vert)(i) + \widehat{V}_{a}^{0}(i) \times \widehat{\Pr} (a\vert)(i)\right) \end{array}\right\} -\kappa W(i)>\nu(i).\end{displaymath}  

Finally, because regressors in the sector and educational choice equations are constructed from fitted values of the earnings equations, we compute bootstrapped standard errors.


C.1 Within-sector growth from increased human capital

We construct annual growth in human capital per worker due to increased educational attainment  \left ( d\ln h_{st} \right ) as follows. As explained in Sections 3 and 4, we use CHNS data to estimate time-invariant sectoral returns to middle-school education  (\gamma_s). Using census data, we construct the share of sector employees in each census year that were middle-school educated, and we interpolate to get annual shares  (l^{edu}_{st}, l^{uned}_{st}). We use these inputs in Equations (2) and (3) - approximating  \gamma_{st} with  \gamma_{s} - to compute the annual human capital per worker growth resulting from increased attainment of middle-school education.

As described in Section 2, the growth rate of human capital per worker translates one-for-one into the growth rate of output per worker  \left ( d \ln y_{st} \right ). To construct the contribution to aggregate growth, we weight these growth rates by the respective sector's share of the economy  \frac{y_{st} l_{st}} {y_{t}} \equiv \frac{Y_{st}}{Y_t}, which is constructed from annual data from the China Statistical Yearbooks (CSY). The resulting within-sector growth rates reported in the third panel of Table 9 are averages (across time) of these weighted growth rates.

C.2 Labor Reallocation

Recall that growth from labor reallocation is equal to the growth in the employment share of the nonagricultural sector weighted by the productivity differential across the sectors:  \frac{\left( y_{nt}-y_{at} \right) }{y_{t}} dl_{nt} in Equation (1). For our purposes, we need to isolate the portion of the increased number of jobs and associated productivity gain that is attributable to increased educational attainment. We do this by multiplying the average growth in the share of educated workers by our estimates of education's impact on the probability of obtaining a nonagricultural sector job and the productivity gain that accompanies the job.

We construct the share of workers with a middle-school (or above) education from 1982 and 2005 Census data and then assume a constant growth rate of the educated share over that time period. We use CHNS data to estimate sector-specific income processes and a sector-choice equation and then use these estimates and our cross-sectional data to construct individual-specific counterfactual incomes and the increased probability of working in the nonagricultural sector associated with increased educational attainment. The income measures are then converted to productivity measures using average sectoral labor shares and individual-specific productivity gains from switching sectors are constructed. We take the product of the increased probability and the productivity gain, take the mean of this product across individuals, and take the average of this mean over time (six CHNS waves). Finally, we multiply this average by the average growth rate of the share of educated workers to arrive at our estimate of education's contribution to growth through labor reallocation.


Table 1: CHNS Descriptive Statistics I
  1989 1991 1993 1997 2000 2004
[1] CHNS: Number of Provinces 8 8 8 8 9 9
[1] CHNS: Number of Workers * 5,725 6,635 5,462 5,727 5,341 3,308
[1] CHNS: Share of male workers 55.65 54.77 51.99 52.51 54.92 53.90
[1] CHNS: Share of workers completing middle school 48.67 48.51 52.48 55.33 62.05 64.22
[1] CHNS: Share of workers in the nonagricultural sector 44.85 42.06 45.10 46.29 48.93 52.85
[1] CHNS: Urban Income per Household Member** (unit: Yuan) 1229 2219 2212 4761 4407 8284
[1] CHNS: Rural Income per Household Member** (unit: Yuan) 653 800 1007 2305 2552 4287
[1] CHNS: Ratio of Urban Income/Rural Income 1.88 2.77 2.20 2.07 1.73 1.93
[2] National Surveys ***: Number of Provinces 31 31 31 31 31 31
[2] National Surveys ***: Share of male workers 55.04   54.26   54.66 54.58
[2] National Surveys ***: Share of male workers fitted values 55.09 54.84 54.65 54.43 54.41 54.58
[2] National Surveys ***: Share of workers completing middle school 45.26   50.15   59.20 63.20
[2] National Surveys ***: Share of workers completing middle school fitted values 43.43 46.19 48.87 54.07 57.69 62.34
[2] National Surveys ***: Share of workers in the nonagricultural sector 39.90 40.30 43.60 50.10 50.00 53.10
[2] National Surveys ***: Urban Income per Household Member**** (unit: Yuan) 1374 1701 2577 5160 6280 9422
[2] National Surveys ***: Rural Income per Household Member**** (unit: Yuan) 602 709 922 2090 2253 2937
[2] National Surveys ***: Ratio of Urban Income/Rural Income 2.28 2.40 2.80 2.47 2.79 3.21
* "Workers" refers to currently employed individuals for whom we have information on income and education Prior to 2004, any household member could complete the survey for all household members, but in 2004, the adult participant questionnaires from which we get information on income and education had to be completed by the individuals themselves. The 2004 sample has a comparable number of households to other survey years but a drop in adult workers.
** Authors' calculation using gross household income in the agricultural sector and individual wages, bonuses, and subsidies in the nonagricultural sector. See Appendix A for further discussion.
*** Source: Census - fraction of male workers and workers completing middle school. Because census years are 1990, 1995, 2000, and 2005, table entries are reported for year closest to census year. Fitted values correspond to fitting a second-order polynomial time trend through the four data points. CSY - fraction of workers in the nonagricultural sector and income per household member.
**** We use net income for rural households and per capita disposable income for urban households.


Table 2: CHNS Descriptive Statistics II
  (1) (2) (3) (4) (5)
  Agriculture Agriculture Nonagriculture Nonagriculture Nonagriculture
Years of Schooling 0.019   0.051    
Years of Schooling Standard Errors [0.005]***   [0.003]***    
Primary School or Lower   Base   Base  
Middle School or Higher   0.082   0.245  
Middle School or Higher Standard Errors   [0.034]**   [0.022]***  
None         -0.247
None Standard Errors         [0.031]***
Primary         Base
Secondary         0.258
Secondary Standard Errors         [0.020]***
Tertiatry         0.659
Tertiatry Standard Errors         [0.028]***
Observations 6,006 6,006 5,428 5,428 15,258
R-squared 0.21 0.21 0.47 0.48 0.11

Notes: Specifications (1) (4) regress the log of income per household member on household head characteristics. Specifications (1) and (3) include years of schooling as a control variable, while specifications (2) and (4) have a dummy variable for whether the household head completed middle school. For specifications (1) - (4), covariates not listed in the table include age, age-squared, dummies for gender, race, marital status, and provincial fixed effects and provincial time trends. Specification (5) regresses individual incomes on dummy variables associated with educational degree, sex, and birth cohort, and coefficients are relative to the base group. Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%).


Table 3: Migration and Education
  (1) (2) (3) (4)
  Migrant

(Inter or Intra Province)

Migrant

(Inter Province)

Migrant

(Inter or Intra Province)

Migrant

(Inter Province)

  OLS OLS OLS OLS
Middle School Educated 0.120 0.043    
Middle School Educated Standard Errors [0.014]*** [0.008]***    
High School Educated     0.147 .049
High School Educated Standard Errors     [0.012]*** [0.006]***
Female 0.068 -0018 0.062 - 0.21
Female Standard Errors [0.020]*** [0.010]* [0.020]*** [0.010]**
Minority 0.075 -0021 .079 -0.020
Minority Standard Errors [0.020]*** [0.01]** [0.019]*** [0.010]*
Fraction of urban hukou holders -0.529 -0.483 -0.540 -0.487
Fraction of urban hukou holders standard errors [0.182]*** [0.095]*** [0.181]*** [0.095]***
Fraction of nonagr workers 1.068 1.927 1.038 1.918
Fraction of nonagr worker standard errors [0.225]*** [0.118]*** [0.224]*** [0.118]***
Fraction of state firms 0.083 -1.273 0.135 -1.257
Fraction of state firms standard errors [0.188] [0.099]*** [0.187] [0.098]***
         
Observations 5,350 5,350 5,350 5,350
R-squared 0.09 0.16 0.1 0.16

Notes: Other control variables include provincial and cohort fixed effects. The unit of observation is (household head x year). Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%).


Table 4: Returns to Education in the Agricultural Sector
  (1) (2) (3) (4) (5)
  Heterogeneity No Heterogeneity Time Varying Incl. Changers High school
  SMM NLS NLS NLS NLS
Education 0.101 0.068 -0.089 0.089 0.066
Education Standard Errors [0.099] [0.027]** [0.044]** [0.028]*** [0.053]
Education*year     0.019    
Education*year Standard Errors     [0.005]***    
Age/100 9.508 9.230 8.831 9.264 9.231
Age/100 Standard Errors [1.184]*** [0.045] *** [0.683]*** [0.157]*** [0.053]***
Age2/1000 -10.447 -10.567 -10.123 -10.558 -10.567
Age2/10000 Standard Errors [0.536]*** [0.048] *** [0.806]*** [0.168]*** [0.053]***
Female 0.245 0.234 0.238 0.114 0.236
Female Standard Errors [0.022]*** [0.035] *** [0.032]*** [0.034]*** [0.035]***
Minority 0.013 0.157 0.157 0.131 0.158
Minority Standard Errors [0.026] [0.034] ** [0.030]*** [0.032]*** [0.035]***
Married 0.093 0.216 0.224 0.198 0.221
Married Standard Errors [0.064]* [0.034] *** [0.035]*** [0.033]*** [0.036]***
Year 0.069 0.113 0.108 0.109 0.114
Year Standard Errors [0.075] [0.006] *** [0.006]*** [0.005]*** [0.006]***
Observations 7,715 7,715 7,715 8,286 7,715
R-squared - 0.23 0.23 0.22 0.23

Notes: The dependent variable is the log of household agricultural income, and the unit of observation is (household x year). Covariates not listed in the table include provincial fixed effects and provincial time trends. Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%). Standard errors and significance level for models (2) (5) are based on a bootstrap with 100 replications.


Table 5: Returns to Education in the Nonagricultural Sector
  (1) (2) (3) (4) (5)
  Heterogeneity No Heterogeneity Time Varying Incl. Changers High school
  SMM OLS OLS OLS OLS
Education 0.142 0.171 0.091 0.336 0.163
Education Standard Errors [0.066]*** [0.022]*** [0.024]*** [0.025]*** [0.012]***
Education*Year     0.014    
Education*Year Standard Errors     [0.004]***    
Age/100 4.973 4.964 4.923 6.776 3.972
Age/100 Standard Errors [0.086]*** [0.431]*** [0.401]*** [0.582]*** [0.411]***
Age2/1000 -5.276 -5.276 -5.246 -6.911 -4.175
Age2/10000 Standard Errors [0.131]*** [0.547]*** [0.492]*** [0.700]*** [0.505]***
Female -0.139 -0.139 -0.140 -0.034 -0.148
Female Standard Errors [0.009]*** [0.010]*** [0.010]*** [0.012]*** [0.010]***
Minority -0.066 -0.034 -0.034 -0.065 -0.067
Minority Standard Errors [0.012]*** [0.023]* [0.020]** [0.025]** [0.020]**
Married -0.012 0.010 0.012 -0.053 0.030
Married Standard Errors [0.022] [0.019] [0.020] [0.027]*** [0.021]
Year 0.103 0.102 0.090 0.106 0.104
Year Standard Errors [0.000]*** [0.002]*** [0.005]*** [0.003]*** [0.003]***
Observations 13091 13091 13091 15855 13091
R-squared - 0.57 0.57 0.44 0.56

Notes: The dependent variable is the log of individual income, and the unit of observation is (individual x year). Covariates not listed in the table include provincial fixed effects and provincial time trends. Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%). Standard errors and significance level for models (2) (5) are based on a bootstrap with 100 replications.


Table 6: Choice of Sector
Panel a: Benefit from working in nonagriculture
  (1)   (2) (3) (4) (5)
  Heterogeneity   No Heterogeneity Time Varying Incl. Changers High School
  SMM [marginal] Probit Probit Probit Probit
V(S=N \vert E) - V(S=A \vert E) 0.025 0.007 0.359 0.470 0.245 0.220
V(S=N \vert E) - V(S=A \vert E)
Standard Errors
[0.069]   [0.088]*** [1.026]*** [0.098]*** [0.065]***

Table 6: Choice of Sector
Panel b: Cost of working in nonagriculture
  (1)   (2) (3) (4) (5)
  Heterogeneity   No Heterogeneity Time Varying Incl. Changers High School
  SMM [marginal] Probit Probit Probit Probit
Education -1.734 -0.563 -1.569 -1.624 -1.280 1.672
Education Standard Errors [0.140]***   [0.049]*** [0.085]*** [0.049]*** [0.053]***
Education*Birth Group1       0.050    
Education*Birth Group1 Standard Errors       [0.109]    
Education*Birth Group2       0.062    
Education*Birth Group2 Standard Errors       [0.131]    
Education*Birth Group3       0.186    
Education*Birth Group3 Standard Errors       [0.124]**    
Female 0.073 0.019 0.214 0.143 0.157 0.315
Female Standard Errors [0.076]   [0.032]*** [0.031]*** [0.032]*** [0.029]***
Minority 0.363 0.095 0.509 0.418 0.351 0.378
Minority Standard Errors [0.078]***   [0.070]*** [0.067]*** [0.068]*** [0.067]***
Nonagricultural share -0.585 -0.152 -1.349 -0.869 -0.392 0.656
Nonagricultural share standard errors [0.901]   [0.537]*** [0.603]* [0.527] [0.556]
State employment share 0.653 0.168 0.791 0.847 0.279 .325
State employment share standard errors [0.668]   [0.591]* [0.626] [0.567] [0.615]
Observations 12,024   24,819 24,819 32,006 24,819
R-squared -   0.35 0.32 0.26 0.33

Notes: The dependent variable is an indicator of whether the worker has a nonagricultural job. The unit of observation is the individual for specification (1) and (individual x year) for the rest. All specifications include constants and dummy variables for birth-year group. Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%). Standard errors and significance level for models (2) (5) are based on a bootstrap with 100 replications. For specification (3), the omitted birth group includes people born before 1948, and birth groups 1 - 3 include people born between 1948 and 1955, 1956 and 1965, and after 1965, respectively.


Table 7: Choice of Education
Panel a: Benefit from education
  (1)   (2) (3) (4) (5)
  Heterogeneity   No Heterogeneity Time varying Incl. Changes High School
  SMM [Marginal] Probit Probit Probit Probit
V1 - V0 1.072 0.300 1.181 1.595 3.027 0.376
V1 - V0 Standard Errors [20.932]   [0.329]*** [0.309]*** [0.286]*** [0.158]*

Table 7: Choice of Education
Panel b: Cost of education
  (1)   (2) (3) (4) (5)
  Heterogeneity   No Heterogeneity Time varying Incl. Changes High School
  SMM [Marginal] Probit Probit Probit Probit
Female 0.480 0.147 0.467 0.389 0.487 0.162
Female Standard Errors [1.251]   [0.081]*** [0.046]*** [0.042]*** [0.068]***
Minority -0.214 -0.067 -0.239 -0.294 0.040 0.093
Minority Standard Errors [6.974]   [0.148]*** [0.107] [0.076] [0.098]*
No enrolled student -0.196 -0.061 -0.002 -0.002 -0.146 -0.000
No. enrolled student standard errors [0.227]   [0.031]*** [0.035]*** [0.025]*** [0.028]
Observations 12,024   24,819 24,819 32,006 24,819
R-squared -   0.24 0.24 0.22 0.07

Notes: The dependent variable is an indicator of whether the worker has obtained a certain level of education (high school for (5); middle school for the rest). The unit of observation is the individual for specification (1) and (individual x year) for the rest. All specifications include constants and dummy variables for birth-year groups. Standard errors are in brackets (*significant at 10%; ** significant at 5%; *** significant at 1%). Standard errors and significance levels for models (2) (5) are based on a bootstrap with 100 replications.


Table 8: Additional Parameter Estimates for Model with Unobserved Heterogeneity
Variable Estimates se
 \sigma _{\mu ,a} 3.235 [0.248]***
 \sigma _{\mu ,n} 0.388 [0.322]
 \rho (\mu _a ,\mu _n ) 0.388 [22.201]
 \sigma _{\varepsilon ,a} 3.640 [0.033]***
 \sigma _{\varepsilon ,n} 0.145 [0.000]***
Sum of squared residuals: 0.00046


Table 9: Education's Contribution to Growth
  (1) (2) (3) (4) (5) (6) (7) (8)
  Aggregate Agriculture Nonagriculture Reallocation

Baseline

Reallocation

No Corr

Reallocation

y ^{ed}(i) -
y ^{un}(i)

Reallocation

y _{n} - y _{a}

CHNS

Reallocation

y _{n} - y _{a}

CSY

Growth rate of output per worker: Young (1978 \sim 1998) 5.2%   3.6%          
Growth rate of output per worker: Brandt et al. (1978 \sim 2004) 7.0% 6.8% 4.7%          
Growth rate of output per worker:

Dekle & Vandenbrouke (1978 \sim 2003)

5.7% 4.5% 4.1%          
Growth Decomposition: Brandt et al. (1978 \sim 2004) 7.0% 1.9% 3.4% 1.7%        
Growth Decomposition: Brandt et al. (1978 \sim 2004)

% of aggregate growth

(100.0) (27.3) (48.1) (24.6)        
Growth Decomposition: Dekle & Vandenbrouke (1978 \sim 2003) 5.7% 0.8% 3.0% 1.9%        
Growth Decomposition: Dekle & Vandenbrouke (1978 \sim 2003)

% of aggregate growth

(100.0) (14.1) (52.6) (33.3)        
Contribution of Education (1978 \sim 2004): (1) Heterogeneity 0.793% 0.034% 0.107% 0.652% 0.636% 0.761% 0.896% 0.862%
Contribution of Education (1978 \sim 2004): (% of the avg. growth) (11.40) (0.49) (1.54) (9.37) (9.14) (10.94) (12.87) (12.39)
Contribution of Education (1978 \sim 2004):

(% of aggregate growth in Brandt et al)

  [1.79] [3.20] [38.08] [37.15] [44.47] [52.32] [50.36]
Contribution of Education (1978 \sim 2004) Alternative Model:

(2) No Heterogeneity

0.798% 0.026% 0.122% 0.650% 0.584% 0.772% 0.822% 0.792%
Contribution of Education (1978 \sim 2004) Alternative Model:

(3) Time varying

0.740 0.005 0.114 0.621 0.506 0.671 0.712 0.686
Contribution of Education (1978 \sim 2004) Alternative Model:

(4) Incl. Changers

0.623% 0.031% 0.206% 0.386% 0.498% 0.566% 0.702% 0.677%
Contribution of Education (1978 \sim 2004) Alternative Model:

(5) High school

0.311% 0.00% 0.143% 0.168% 0.172% 0.239% 0.243% 0.235%

Notes: The aggregate contribution of education (column (1) of the third panel) is simply the sum of columns (2) - (4). Columns (5) - (8) present alternative measures of growth due to reallocation as described in Section 4.3.



Footnotes

* An earlier version of this paper circulated under the title "The Role of Education in Economic Growth through the Sectoral Reallocation of Labor". We have benefitted from discussions with participants of the Stanford macro bag lunch, Stanford labor-development seminar, 2006 SED, 2007 PACDEV, and Federal Reserve Board lunch seminar. We thank Dongfang Shao for providing access to a digital version of the CSY data sets and Shufa Du for answering numerous questions regarding the CHNS. Lee acknowledges the Taube Fellowship for financial support. The views expressed here are those of the authors and do not necessarily reflect the views of the Federal Reserve System. Return to Text
1. For example, Young (2003) estimates growth in aggregate output per worker of 5.2 percent from 1978 to 1998; Brandt, Hsieh, and Zhu (2008) report growth of 6.96 percent from 1978 to 2004; and Dekle and Vandenbroucke (2006) report growth of 5.7 percent from 1978 to 2003. Return to Text
2. Education's impact on human capital has long been considered an important way in which education affects income. See, for example, labor studies following Mincer (1974) and cross-country studies such as Bils and Klenow (2000). Return to Text
3. "Aggregate" data refers to national- or sector-level data, in contrast to the micro-level (household) data we use. Driving forces considered by the Chinese studies include sectoral total factor productivity growth, increases in the rate of fixed investment, changes in the size of the public sector, and labor market barriers, but education is not modelled explicitly. Return to Text
4. See Bergoeing, Kehoe, Kehoe, and Soto (2002), Bartelsman, Haltiwanger, and Scarpetta (2008), and Alfaro, Charlton, and Kanczuk (2008) for similar empirical evidence on the importance of resource misallocation in other countries. Return to Text
5. For example, Schultz (1971) reports that returns to education appear to be higher in urban areas (with higher-wage jobs) than in rural areas for Colombia. Return to Text
6. The absence of labor market barriers implies marginal products are equated across sectors and technologies with identical factor shares implies the ratio of average products in the two sectors is equal to the ratio of marginal products. Return to Text
7. This measure is based on the following decomposition. Expected income  Y conditional on education level  E is  \mathbb{E}(Y\vert E)  =  Pr(s=n\vert E) \times \mathbb{E}(Y^{E}_{n}) + Pr(s=a\vert E) \times \mathbb{E}(Y^{E}_{a}). The difference in expected income between an educated ( edu) and uneducated ( uned) worker can then be written as  \mathbb{E}(Y\vert edu)-\mathbb{E}(Y\vert uned) = Pr(n\vert edu)\times \left [\mathbb{E}(Y^{edu}_{n}) - \mathbb{E}(Y^{uned}_{n}) \right ] + Pr(a\vert edu) \times \left [\mathbb{E}(Y^{edu}_{a}) - \mathbb{E}(Y^{uned}_{a})\right]
 + \left [ Pr(n\vert edu) - Pr(n\vert uned) \right] \times \left[ \mathbb{E}(Y^{uned}_{n}) - \mathbb{E}(Y^{uned}_{a}) \right]. The first two terms reflect gains from education within a sector, while the third term consists of gains from a change in the probability of working in the nonagricultural sector. Return to Text
8. The CHNS is jointly conducted by the Carolina Population Center at the University of North Carolina at Chapel Hill and the National Institute of Nutrition and Food Safety at the Chinese Center for Disease Control and Prevention. More detailed information about the CHNS can be found at the CPC UNC website: http://www.cpc.unc.edu/projects/china Return to Text
9. The Chinese Household Income Project survey is another publicly-available Chinese household income data set, but only its 1988 and 1995 vintages are publicly available. The National Bureau of Statistics of China conducts household surveys in both rural and urban areas, but only the urban surveys are publicly available. Return to Text
10. Urban entry fees vary by location, but to give some idea of their magnitude, the fee ranged from 10,000 to 40,000 Yuan in Shanghai in 1999 (Chan and Zhang 1999), which was 4.5 to 18.1 times the annual net income per rural household member in China in 1999. Return to Text
11. Chan and Zhang (1999), Wang (2004, 2005) and Chan and Buckingham(2008) provide detailed explanations of hukou reforms. Among recent reforms, a movement to remove agricultural and nonagricultural classifications took place in 2001 but was suspended in mid-2002 due to lack of funding and an inability to accommodate new urban residents. Wang (2005) concludes that while hukou reforms since 1997 relaxed restrictions, most migrants are still unable to obtain a permanent urban hukou. Chan and Buckingham (2008) conclude, "The cumulative effect of these reforms (reported often in the newspapers in late 2005) is not abolition of the hukou but devolution of responsibility for hukou policies to local governments, which in many cases actually makes permanent migration of peasants to cities harder than before." Return to Text
12. We cannot directly check whether the distribution of hukou holders in the CHNS represents the population distribution because there are no good official statistics for checking the number and characteristics of illegal/temporary migrants. We would not only need to know the number of urban and rural hukou holders but also where they actually reside. Return to Text
13. For example, one reason workers may leave the sample is that they choose to migrate to an unsurveyed area. To model a worker's choice of residency, we would need information on his choice set (i.e., possible residences and their characteristics, such as the type of job opportunities) and ultimate choice, which we do not have. Return to Text
14. Please see Appendix C for a more detailed description of our calculations. Return to Text
15. We also considered various alterations of our model specification but do not show these results as the estimates did not change much. These alterations included using different sets of control variables (e.g., adding year-squared and an education-year interaction term) for the income equations, not controlling for differences in the PDV of lifetime income in the sector and education choice equations, and using weighted OLS rather than weighted probit to estimate the sector and education choice equations. Return to Text
16. Because these individuals are older than 16, we suspect their increased educational attainment is due to participation in the non-regular education system. Young (2003) argues that the quality of such institutions is questionable and excludes this type of education when quantifying the growth of human capital. Return to Text
17. Because we only observe agricultural income at the household level, we calculate individual incomes for workers who ultimately move into non-agriculture by dividing the household income across workers in proportion to their hours worked. This will overstate the income of uneducated agricultural workers classified as uneducated nonagricultural because it implicitly assumes no returns to education. However, because returns to education in the agricultural sector are less that 10 percent while the nonagriculture/agriculture income ratio is a factor of two to five, the estimated returns to education for nonagricultural workers primarily depends on the income differential. Return to Text
18. Our baseline measure of human capital growth due to education was 0.141. We suspect our returns to education are lower than those estimated by Young (2003) because we add more control variables such as province-level characteristics. Return to Text
19. The analytical relationship cannot be derived because it is not possible to cobmine two sectoral production functions into one aggregate Cobb-Douglas production function if the labor share differs across sectors. We also can not directly compare our results to those of Bosworth and Collins (2008) because they assume a returns to education rather than estimating it. Return to Text
20. It can be shown that our approach of re-weighting sector-specific returns to education to get a measure of aggregate returns is equivalent to estimating aggregate returns by running one OLS regression on observations pooled together from both sectors. Return to Text

This version is optimized for use by screen readers. Descriptions for all mathematical expressions are provided in LaTex format. A printable pdf version is available. Return to Text