RESEARCH ARTICLE Growth Rate Models: Emphasizing Growth

3 downloads 0 Views 1MB Size Report
Nov 3, 2011 - Kounin, and H.F. Wright, eds., New York: McGraw-Hill Book Company, Inc., 1943, pp. 147–160. [15] K.J. Grimm, J.S. Steele, A.J. Mashburn, ...
November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011”

RESEARCH ARTICLE Growth Rate Models: Emphasizing Growth Rate Analysis through Growth Curve Modeling Zhiyong Zhanga⇤ , John J. McArdleb , and John R. Nesselroadec a

University of Notre Dame; b University of Southern California; c University of Virginia (Received 00 Month 200x; in final form 00 Month 200x)

To emphasize growth rate analysis, we develop a general method to reparametrize growth curve models to analyze rates of growth for a variety of growth trajectories, such as quadratic and exponential growth. The resulting growth rate models are shown to be related to rotations of growth curves. Estimated conveniently through growth curve modeling techniques, growth rate models have advantages above and beyond traditional growth curve models. The proposed growth rate models are used to analyze longitudinal data from the National Longitudinal Study of Youth (NLSY) on children’s mathematics performance scores including covariates of gender and behavioral problems (BPI). Individual differences are found in rates of growth from age 6 to 11. Associations with BPI, gender, and their interaction to rates of growth are found to vary with age. Implications of the models and the findings are discussed.

Keywords: Growth curve models, growth rate models, reparameterization and rotation, mathematics ability development

1.

Introduction

In the past half century, growth curve models have quickly evolved into one of the most powerful tools in facilitating the analysis of change [e.g. 24, 27]. Growth curve models are also one of the few methods that can directly address a primary goal of longitudinal research - to analyze both intra-individual change and inter-individual differences in intraindividual change [1]. Many researchers have shown interest in using growth curve models to analyze rates of growth or change [e.g. 2, 22, 38]. Except for the linear one, growth curve models in their original forms are generally unable to analyze rates of growth directly. For example, in the linear growth curve model, the linear slope is equal to the growth rate. Thus, the rate of growth can be analyzed directly. However, for the other (nonlinear) models, the rate of growth does not appear in the models explicitly and therefore the rate of growth cannot be analyzed directly. The rate of growth is an important concept in studying change. If the level of growth is viewed as the current status of a process at a specific time, the rate of growth measures how fast the process is changing at that time. Individual differences in the rate of growth are of obvious importance. For example, Freeman and Flory [14] pointed out that two persons of the same ability at a given time may differ markedly in their ability at a future time if their rates of growth are different [see also 13, 34]. Cattell [9] also emphasized that even for the same individual, the rate of growth at one level of a response variable can be ⇤ Correspondence should be addressed to Zhiyong Zhang ([email protected]), Department of Psychology, 118-B Haggar Hall, University of Notre Dame, Notre Dame, IN 46556.

November 3, 2011

13:16 2

Journal of Applied Statistics

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

very different from that at another level of the variable. Therefore, when studying growth and development, the analysis of rates of growth should provide useful information about the process under investigation. Techniques for analyzing rates of growth are still relatively rare compared to the evolvement of growth curve models. Other than the linear growth curve model, there are only a few important methods for analyzing rates of growth. Sandland and McGilchrist [33] proposed a stochastic growth curve model based on stochastic differential equations. This method focused on the analysis of a single subject. Boker and colleagues developed a set of dynamical systems models based on differential equations and the estimation of derivatives [3]. These models can be used to investigate complex dynamical systems by modeling the relationship among derivatives of a system. McArdle and colleagues also developed latent difference score models for the analysis of change [e.g. 26]. These models can be used as a method to analyze rates of growth with discrete time intervals. Finally, Ramsay and colleagues [e.g. 30] have developed functional data analysis techniques, which can also be used to analyze rates of growth. However, none of the methods has directly investigated inter-individual differences in the rates of growth. In this study, we develop a general method to reparametrize growth curve models so that rates of growth can be analyzed for a variety of growth trajectories, such as quadratic and exponential growth. The merits of this method and the resulting growth rate models include, but not limited to, the following. First, they can be used to analyze rates of growth explicitly. Second, individual differences in rates of growth and their relationship to covariates can be investigated directly. Third, it is a general form for analyzing rates of growth for both linear and nonlinear growth trajectories. Fourth, the model estimation and selection methods of growth curve models are used when analyzing rates of growth. In the following, we review a general form of growth curve models. Then, we will discuss how to reparametrize growth curve models to analyze rates of growth connecting the reparametrization method with the rotation of growth curves. After that, an empirical example is presented to demonstrate how to use the proposed method, and simulation studies are conducted to evaluate the performance of the growth rate models. Finally, implications of the models and the application are discussed.

2.

Growth Curve Models

In hierarchical modeling terms, a typical growth curve model is a two-level model. The first level involves fitting a growth curve with random coefficients for each individual and the second level models the relations between the random coefficients and covariates of interest. Formally, the first level of a growth curve model can be written as, yit = f (timeit , bi ) + eit ,

t = 1, . . . , T, i = 1, . . . , N,

(1)

where yit is the observed response for individual i at trial or occasion t, timeit denotes the measurement time for the tth measurement occasion on the ith individual, f (timeit , bi ) is a function of time representing the growth curve with individual coefficients bi = (bi1 , bi2 , . . . , bip ), a p ⇥ 1 vector of random coefficients, and eit is the residual and the whole vector ei = (ei1 , ei2 , . . . , eiT )t is assumed to follow a multivariate normal distribution with the mean vector E(ei ) = 0T⇥1 and the covariance matrix Cov(ei ) = ⌃T⇥T . Usually, it is assumed that Cov(ei ) = I i2 or even Cov(ei ) = I 2 where I is a T ⇥ T identity matrix. The function f (timeit , bi ) can take either a linear or a nonlinear form of time although the current study focuses on the nonlinear functions. The function f determines the growth

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

3

trajectory representing how an individual changes over time.1 Because bi are latent random coefficients, the f (timeit , bi ) can be viewed as the true score at measurement occasion t for individual i. The second level of a growth curve model can be expressed as, bi = B Xi + ui , p⇥1 p⇥q q⇥1 p⇥1

(2)

where Xi is the ith column (data for the ith individual) of the design matrix Xq⇥N with ones in the first row and data for the q 1 covariates in the other rows, B is the fixed-effects coefficient matrix, and ui represents the multivariate normal errors with E(ui ) = 0p⇥1 and Cov(ui ) = Dp⇥p . The ui is also called the random effects from a mixed-effects model perspective. The estimated parameters B provide information on the estimated growth trajectory of an “average” person, an individual whose coefficients are equal to the population mean values [12]. Each bi determines an individual growth trajectory. However, many other features such as rates of growth related to the growth phenomenon that are important and interesting to substantive researchers are not directly emphasized. We argue that this is mainly because the parameters (random coefficients) of a growth function may not have an intuitive meaning especially in terms of rates of growth before reparametrization. This problem has also been pointed out by Cudeck and du Toit [11] and Stimson, Carmines, and Zeller [37] when studying quadratic growth. This study attempts to address the interpretation limitations by directly defining and modeling rates of growth. As mentioned earlier, rates of growth are an important feature of a growth process that are of special interest to substantive researchers and should be analyzed more broadly [e.g., 21, 38]. It is not surprising to see studies that have adopted the linear growth curve model simply because it can be easily interpreted in terms of rates of growth although the linear growth curve model does not seem to fit the data very well [e.g. 20, 32]. For quadratic growth curve models, Cudeck and du Toit [11] proposed a reparametrization method that transforms the intercept, slope, and quadratic terms to initial performance, maximizer, and maximum parameters that have attractive interpretations. Intrigued by the reparametrization idea, we propose to analyze rates of growth through the reparametrization of growth curve models. 3.

Growth Rate Models and Reparametrization of Growth Curve Models

With the growth function in Equation (1), rates of growth can be calculated conveniently using the first derivative of the growth function. Let rit denote the rate of growth for the ith participant at occasion t or time timeit . It is easy to see that 0 rit = yit =

d d f (timeit , bi ) + eit . dtimeit dtimeit

(3)

If we further assume that the residuals are time-independent as in many growth curve d analysis procedures and therefore the derivative of the residuals dtime eit is equal to zero, it the above rate of growth can be simplified to 0 rit = yit =

1 To

d f (timeit , bi ). dtimeit

(4)

be precise, the trajectory used in the current study is a particular path in an individual’s state space. A state space is a space in which all possible states of an individual are represented, with each possible state of the individual corresponding to one unique point in the state space. A trajectory is the path that connects the points in a state space along time.

November 3, 2011

13:16

Journal of Applied Statistics

4

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

Another underlying assumption is that the growth function is differentiable at occasion t. Many widely used growth functions such as polynomial, exponential, and logistic growth functions are differentiable. Also note that rit is the instantaneous rate of growth at occasion t, and in general, it varies with time. However, one may consider it as the average rate of growth within a small time interval around occasion t. As in growth curve analysis, we are often interested in predicting rates of growth such that rit =

t Xi

+ vit

where t is 1 ⇥ q vector of regression coefficients of rates of growth on covariates at occasion t and vit , E(vit ) = 0 and V ar(vit ) = t2 , is the normal residual part that is not explained by the covariates. Given that the focus of the analysis is on rates of growth, the key here is to estimate t and t2 . Intuitively, one can first “estimate” the random coefficients bi for each individual through growth curve models and then calculate rates of growth using rˆit =

d ˆi ). f (timeit , b dtimeit

Then, in a second step, one can estimate the parameters that are related to rates of growth. However, approaches, such as the maximum likelihood method used in SEM software like Mplus and the empirical Bayesian method used in multilevel modeling software like PROC MIXED, for estimating the random coefficients are not optimal because they are sequential as opposed to simultaneous. Furthermore, the random coefficients bi are difficult to estimate accurately especially when the number of measurement occasions is small [e.g. 16, 17, 24, 40]. Given the fact that it is not necessary to estimate bi in estimating the growth curve model parameters of B and D , we seek a method to estimate t and t2 without estimating rit . From Equation (4), the rate of growth is a function of the time variable timeit and random coefficients bi which can be expressed as rit = g(timeit , bi ) =

d f (timeit , bi ), dtimeit

where g denotes the first derivative function. Without loss of generalization, we can find a bik , k 2 [1, p] that can be solved from the equation above as bik = g

1

(timeit , bki , rit ),

(5)

where bki = (bi1 , . . . , bik 1 , bik+1 , . . . , bip )0 . Equation (5) means that we can express at least one random coefficient to be a function of timeit , the rate of growth rit , and the other random coefficients. By substituting bik in the growth curve models in Equations (1) and (2), we have 8 y = f ⇤ (timeit , rij , bki ) + eit > > < it > > :

bki = B k Xi + uki

rij =

j Xi

,

t = 1, . . . , T, i = 1, . . . , N,

(6)

+ vij

where B k = (B1 , . . . , Bk 1 , Bk+1 , . . . , Bp ), uki = (ui1 , . . . , ui,k 1 , ui,k+1 , . . . , uip ), and f ⇤ denotes a function that is in general different from f in growth curve models. Note

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

5

especially, rij = rit |t=j =

d f (timeit , bi )|t=j dtimeit

because rates of growth generally vary across time. Here j is a time index indicating that the growth rate is evaluated at occasion t = j . Note also that through the reparametrization, we can express the growth function as a function of rates of growth. Furthermore, Equation (6) has the same form as growth curve models and thus it can be estimated using growth curve modeling techniques. To facilitate presentation, the models in Equation (6) will be called growth rate models henceforth in this article although they can also be viewed as growth curve models.

3.1

Examples

For illustrations, we examine several examples of constructing growth rate models from corresponding growth curve models. In these examples, we assume that there are two covariates. For comparison, we summarize the example growth curve models and corresponding growth rate models in Table 1. 3.1.1

Linear growth rate model

A linear growth curve model with two covariates is given in (1a) in Table 1. Taking the first derivative of the linear growth function, rit =

d d f (timeit , bi ) = (bi1 + bi2 timeit ) = bi2 , dtimeit dt

which is a constant. Thus, we can directly replace the random coefficient bi2 with the rate of growth rit ⌘ ri and then interpret the linear growth curve model as a growth rate model as shown in (1b) in Table 1. Actually, the bi2 in the linear growth curve model is often interpreted in terms of rates of growth [e.g. 2, 4, 29, 32]. 3.1.2

Quadratic growth rate model

A quadratic growth curve model with two covariates can be written as (2a) in Table 1. The rate of growth for quadratic growth at occasion j(j = 1, . . . , T ) is rij =

d (bi1 + bi2 timeit + bi3 time2it )|t=j = 0bi1 + bi2 + 2timeij bi3 . dtimeit

(7)

From Equation (7), we have bi2 = rij 2timeij bi3 . Note that we can also solve for bi3 equivalently. Plugging it into the quadratic growth curve model, we can get the quadratic growth rate model as shown in (2b) in Table 1. When timeij = 0, the rate of growth rij = bi2 . This is why bi2 has been interpreted as the instantaneous rate of growth at time zero [e.g. 4, 35]. A possible limitation of this instantaneous rate of growth is that time zero is usually not part of the measurement process or may not exist and thus centering has often to be used to make time zero meaningful. Instead, the quadratic growth rate model can analyze the rate of growth at any interesting time point without changing the meaning of the initial level parameter bi1 and the acceleration parameter bi3 . Individual differences in the rate of growth can be also evaluated and predicted through covariates. Note that for quadratic growth phenomena, the model discussed by [2, p. 42] based on time coding method is similar to our quadratic growth rate model.

Exponential

Quadratic

Linear

Growth rate models 8 > > yit = bi1 + ri timeit + eit < (1b) bi1 = 01 + 11 xi1 + 21 xi2 + ui1 > > : ri = 02 + 12 xi1 + 22 xi2 + ui2 8 yit = bi1 + rij timeit + bi3 (time2it 2timeij timeit ) + eit > > > > > < bi1 = 01 + 11 xi1 + 21 xi2 + ui1 (2b) > > bi3 = 03 + 13 xi1 + 23 xi2 + ui3 > > > : rij = j0 + j1 xi1 + j2 xi2 + vij 8 yit = bi1 rij ⇥ exp[ (timeit timeij )⇢]/⇢ + eit > > < (3b) bi1 = 01 + 11 xi1 + 21 xi2 + ui1 > > : rij = j0 + j1 xi1 + j2 xi2 + vij

Journal of Applied Statistics

Growth curve models 8 yit = bi1 + bi2 timeit + eit > > < (1a) bi1 = 01 + 11 xi1 + 21 xi2 + ui1 > > : bi2 = 02 + 12 xi1 + 22 xi2 + ui2 8 yit = bi1 + bi2 timeit + bi3 time2it + eit > > > > > < bi1 = 01 + 11 xi1 + 21 xi2 + ui1 (2a) > > bi2 = 02 + 12 xi1 + 22 xi2 + ui2 > > > : bi3 = 03 + 13 xi1 + 23 xi2 + ui3 8 yit = bi1 bi2 ⇥ exp[ (timeit timei1 )⇢] + eit > > < (3a) bi1 = 01 + 11 xi1 + 21 xi2 + ui1 > > : bi2 = 02 + 12 xi1 + 22 xi2 + ui2

6

Comparison of growth curve models and growth rate models

13:16

Table 1.

November 3, 2011

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

3.1.3

7

Exponential growth rate model

One form of the exponential growth curve model [5] can be expressed as (3a) in Table 1. The rate of growth for exponential growth at time j(j = 1, . . . , T ) is rij =

d (bi1 dtimeit

bi2 ⇥ exp[ (timeit

= 0bi1 + bi2 ⇢ exp[ (timeij

timei1 )⇢])|t=j

(8)

timei1 )⇢].

From Equation (8), bi2 = rij /{⇢ exp[ (timeij

timei1 )⇢]}.

By plugging it into the exponential growth curve model, we can get the exponential growth rate model with the form in (3b) in Table 1. Although in the exponential growth curve analysis the parameter ⇢ has been called the rate of growth, it should be clear that this rate is the instantaneous rate of growth per capita or the rate of approach to the asymptote. In this study, the parameter ⇢ has been estimated as a constant in the population as typically done in the application of the exponential growth curve model. If we estimate it as a random coefficient ⇢i as in Browne [5] and Neale and McArdle [29], the relation between this rate and our rates of growth is rit = ⇢i /f (timeit , bi ) [see also 7]. While the rate of growth ⇢i monitors the overall growth trajectory for person i, the rate of growth rit reveals the subtleties of growth at each developmental phase.

3.2

Centering and reparametrization for analysis of growth rates

For growth curve analysis, centering the time variable can improve the interpretability of model parameters [e.g. 31, 36]. For example, if one centers time at a certain value for a linear model, then the intercept term becomes the level of attribute at that point in time which could improve the interpretability of the intercept term. For other polynomial growth curve models, one can also study rates of growth through centering. This can be illustrated using the quadratic growth curve model. From Equation (7), the rate of growth is a linear combination of the linear slope and quadratic slope terms. If the combination coefficient (timeij ) is equal to 0, then the linear slope term is equivalent to the rate of growth at occasion j . Thus, if we center time at occasion j and then let timeij = 0, we can study the rate of growth through the linear slope term. Furthermore, by centering time at different time points, we can study the rate of growth across time. However, centering only works for certain models, typically polynomial models, in order to study growth rates. For other nonlinear models, centering may not achieve the goal of growth rate analysis. For example, for the exponential growth curve model, no matter how to center the time variable (according to Equation 8), the term bi2 cannot be interpreted as the rate of growth directly. However, the reparametrization method described here can be applied to analyze rates of growth for the exponential growth curve model. Even for the quadratic growth curve model where centering can be applied to study growth rates, the reparametrization method can be viewed as the underlying mechanism of the centering method for the analysis of the rate of growth. From Equation (9), to express the linear slope term bi2 as the rate of growth, one expects that rij = bi2 . Thus, we should let timeij = 0 and the center for the centering method should be at occasion j . Therefore, the reparametrization method proposed in this study is more general and useful for the analysis of rates of growth than the centering method.

November 3, 2011

13:16

Journal of Applied Statistics

8

3.3

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

Growth rate models and rotation of growth curves

Growth rate models can also be derived through the rotation of growth curves. Growth curve models can be viewed as confirmatory factor models with factors already identified. For example, for a linear growth curve model, we may hypothesize and fit a level factor and a slope factor. For the quadratic growth curve model, we have yit = bi1 + bi2 timeit + bi3 time2it + eit ,

where bi1 , bi2 , and bi3 can be viewed as factor scores for the ith individual of three factors, level, linear slope, and quadratic slope terms, and (1, timeit , time2it ) are the factor loadings. Sometimes, when the growth curve analysis is used solely to find the curves that best fit the data, the obtained factors may be difficult or even impossible to interpret in a meaningful way. Researchers have suggested rotating factors to enhance their meaning in the framework of factor analysis [see recent reviews by 6, 19]. Tucker [39] suggested that rotation to simple structure may not be meaningful in the framework of learning or growth curves because the resulting factors could still be difficult to interpret. Our construction of growth rate models provides a way to rotate growth curves by rotating factors of growth curves to predefined meaningful target factors. More specifically, after rotation, one of the factors will be the rate of growth at a specific point in time. The rotation is conducted by ✓

bki rij



=



Ik cij



bi = Qbi ,

where Q here represents the rotation matrix and I k is a matrix with the k th row removed from the identity matrix Ip⇥p and cij is a vector of combination coefficients which can be determined by Equation (4). Using the quadratic model as an example, the rotation procedure for the factors is 0

1 0 10 1 0 1 bi1 10 0 bi1 bi1 @ rij A = @ 0 1 2timeij A @ bi2 A = Q @ bi2 A . bi3 00 1 bi3 bi3

With this rotation matrix, the factors bi1 and bi3 remain the same but the second factor now is the rate of growth at occasion j . When factors are rotated, factor loadings need to be rotated correspondingly in order to retain the covariance structure. For the quadratic growth example, one has 0

1 0 1 timei1 time2i1 1 timei1 B 1 timei2 time2 C B 1 timei2 i2 C B B B 1 timei3 time2 C B 1 i3 ⇤=B C Q = B 1 timei3 B .. B .. .. .. C .. @. @. . . A . 2 1 timeiT timeiT 1 timeiT 0 1 1 timei1 time2i1 2timei1 timeij B 1 timei2 time2 2timei2 timeij C i2 B C 2 B C = B 1 timei3 timei3 2timei3 timeij C , B .. C .. .. @. A . . 1 timeiT time2iT 2timeiT timeij

1 time2i1 0 time2i2 C C 10 time2i3 C C@0 1 .. C 0 0 . A time2iT

1 0 2timeij A 1

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

9

where ⇤ denotes the factor loadings after rotation. Note that the elements in this matrix correspond to the factor loadings (or design matrix) in Equation (3.1). The same procedure also applies to the exponential growth rate model. The growth rate factor is obtained by ✓

bi1 rij



=



1 0 0 ⇢ exp[ (timeij

timei1 )⇢]

◆✓

bi1 bi2



=Q



bi1 bi2



.

Thus, the factor loading matrix for the exponential growth rate model is 0

1 B1 B B ⇤ = B1 B .. @.

1 exp[ (timei2 exp[ (timei3 .. .

1

exp[ (timeiT

1 B1 B B = B1 B .. @. 1

exp[ (timei1 exp[ (timei2 exp[ (timei3 .. .

0

exp[ (timeiT

1

timei1 )⇢] C ◆ C✓ 1 0 timei1 )⇢] C C C 0 exp[(timeij timei1 )⇢]/⇢ A timei1 )⇢] 1 timeij )⇢]/⇢ timeij )⇢]/⇢ C C timeij )⇢]/⇢ C C. C A timeij )⇢]/⇢

This factor loading matrix corresponds to the one in model (3b) in Table 1. 3.4

Estimation of growth rate models

Because they are reparametrizations and rotation of growth curve models, growth rate models can be estimated using the usual methods and software for growth curve analysis. For example, one can estimate the models as mixed-effects models using the maximum likelihood estimation method in SAS PROC MIXED and NLMIXED. Alternatively, one can estimate them as structural equation models using readily available structural equation modeling software through the specification of the rotated factors and factor loading matrix. Furthermore, the growth rate model fit can be evaluated using the fit statistics and fit indices developed for growth curve models such as the chi-square statistics, the root mean squared error of approximation (RMSEA), and the goodness of fit index (GFI). When multiple competing growth rate models can be fitted to the data, one can also use the likelihood ratio test, Akaike information criterion (AIC), and Bayesian information criterion (BIC) to select the best-fitting models. Actually, the fit statistics and fit indices are identical for the growth rate models and their corresponding growth curve models. Therefore, the evaluation of model fit and model selection can be conducted directly based on growth curve models. For the linear model, the linear slope term – also the linear rate of growth – is constant for each person. However, for other polynomial and nonlinear models, the growth rate varies with time. There are two approaches to analyzing varying growth rates over time. First, one can investigate the rate of growth at each time point separately. In this way, at each time point, the term for the growth rate is a random effect in the usual sense. Second, one can put some equivalence constraints on the rates of growth at different times according to the dependence among growth rates so that the rates of growth can be estimated simultaneously in one model. For example, the growth rate of the quadratic model at any time point depends on the linear and quadratic terms as shown in Equation (9). In other words, give that growth is quadratic, the growth rate at one time point can be determined by those at other points in time. Therefore, Equation (9) can be used to impose

November 3, 2011

13:16

Journal of Applied Statistics

10

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

the constraints for the estimation of growth rates. Practically, the first approach is easier to implement using the available software and arrives at the same results as the second approach. In the current study, the first approach is used for data analysis. Specifically, we estimate the rate of growth at each time point and also investigate the relationship between the growth rate and covariates separately at each time point. 4.

Application

We have discussed how to construct growth rate models through the reparametrization of growth curve models to analyze rates of growth. Next we use empirical data to demonstrate how to apply these models to answer substantive questions. 4.1

Background

Mathematical ability is an extensively studied cognitive attribute. The development of mathematics has been examined in several longitudinal studies [e.g. 23, 41]. Associations between mathematical performance and selected covariates have also been studied. Gender differences in mathematics performance, especially, have been investigated extensively in many studies. A review by Hyde, Fennema, and Lamon [18] showed that males outperformed females significantly on mathematics tests. Associations between cognitive abilities and various behavior problems have also been investigated. In a recent longitudinal study, Bub, McCartney, and Willett [8] found that children with higher initial levels of internalizing and externalizing behaviors at 24 months had lower cognitive ability and achievement scores in the first grade. However, a couple of perspectives on the nature of mathematical ability development and change are still missing from the literature. First, a systematic analysis of rates of growth of mathematical ability is needed. Previous research has found that the development of mathematics is not linear over time [e.g. 15, 23, 41]. Thus, rates of growth of mathematics cannot be adequately analyzed using linear growth curve models. To shed additional light on this matter, the proposed growth rate models will be used to analyze rates of mathematical ability development. Second, how rates of growth of mathematics are related to covariates such as gender and behavior problems also invites more investigation. Therefore, to contribute further to the substantive literature while illustrating our models, we examine individual differences in rates of mathematical ability growth and how they are related to gender and a general measure of behavior problems. 4.2

Data

Data for our empirical example are drawn from the National Longitudinal Surveys of Youth. Specifically, participants were the children of female respondents to the National Longitudinal Surveys of Youth 1979 cohort [NLSY79, 10]. The survey on the children was sponsored and directed by the U.S. Bureau of Labor Statistics and the National Institute for Child Health and Human Development. The data used in the current study are from N = 1, 233 children ranging in age from 6 to 15 years with a 10-year age span. One cognitive variable – the Peabody Individual Achievement Test (PIAT) mathematics assessment – was used as a measure of mathematical ability and analyzed by means of growth rate models. The PIAT is a wide-ranging measure of academic achievement for children aged five and over and is widely used in research. The Mathematics (PIAT Math) subtest consists of 84 multiple-choice items arranged in increasing order of difficulty. Possible scores range from 0 to 84. For computational convenience, the scores were rescaled by dividing by 10. Two covariates, gender and Behavior Problems Index (BPI), are included

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

11

in the analysis as time-invariant covariates. 4.3

Descriptive statistics

Descriptive statistics for gender, BPI, and PIAT Math at each age are given in Table 2. About 48% of the children in the sample were girls. The average BPI was 7.42. The mean scores for PIAT math increased with age but the rates of increase seemed to be slowing over age. A longitudinal plot of PIAT math is shown in Figure 1. From the plot, one can see that the growth trajectory of mathematics performance is not linear with age. Furthermore, it appeared that there was not much of a mean difference between boys and girls in mathematical ability; however, children with higher BPI had lower scores on PIAT math than those with lower BPI. Table 2.

Descriptive Statistics

Variable Gender BPI

Math

Age

N 1,233 1,233

Mean 0.48 7.42

Std 0.50 5.16

Min 0 (M) 0

Max 1 (F) 28

6 7 8 9 10 11 12 13 14 15

630 498 601 499 585 459 458 302 212 80

1.34 2.00 2.81 3.71 4.43 4.83 5.11 5.35 5.58 5.62

0.51 0.77 0.99 1.04 1.07 1.08 1.10 1.11 1.16 1.16

0 0.5 0.5 0.1 0.1 0.3 0.6 0.1 0.8 2.8

3.6 5.2 6.6 6.5 7 7.5 8.2 7.6 8.3 8.1

Note. Std: standard deviation; Min: minimum; Max: maximum; M: male; F: female; Math: mathematical performance.

For this application, age, instead of measurement occasions, was used as the time scale for the following reasons. First, age is a natural scale for child development analysis. Second, using age as the time scale makes the interpretation of the results easier than the use of the measurement occasions.1 In the data set, information on children’s age in months and age in years when the test was administered was available. The age of children in years was used in the current analysis. The analysis also assumes that the time interval is approximately one year. Furthermore, in order to make the growth curve analysis more interpretable, we centered the age variable at 10. Data were missing at each age. At age 8, about 48.9% of data were missing. At age 15 about 93.5% of data were missing. The total percentage of missingness was about 64.9%. Furthermore, 29 children only participated in the study at one occasion, 96 participated at two occasions, 533 participated at three occasions, 399 participated at four occasions, 183 participated at five occasions, and no children participated in 6 or more occasions. In all analyses, missing data were assumed to be missing at random [MAR; e.g., 25] and the full information maximum likelihood (FIML) estimation method was used to obtain the growth curve and growth rate model parameter estimates. 4.4

Research questions

Two main questions identified earlier are answered in the empirical data analysis. First, are there individual differences in rates of growth of mathematical ability? Second, how

1 Another

possibility is to use grade as the time variable. The reason to use age as the time variable instead of grade is that grade was missing for some PIAT scores, whereas age was always recorded.

13:16

Journal of Applied Statistics

8

4

6

8

10

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

10

12

”JAS resubmission Nov 2011”

2

Gender

0

M F

7

8

9

10 11 12 13 14 15

6

6

4

2

6

8

10

4

PIAT math

2

BPI Low High

0

0

November 3, 2011

6

6

7

8

9

10

11

12

7

8

13

9

10 11 12 13 14 15

14

15

Age Figure 1. The longitudinal plot of PIAT Math for a random sample of 100 children from the sample. The thick line is the average trajectory. The subplot on the top left corner gives the average trajectories for male and female groups. The subplot on the bottom right corner gives the average trajectories for children with low and high BPI.

are individual differences in growth rates of mathematical ability related to gender and BPI and the interaction between the two? 4.5

Data analysis and results

In order to apply growth rate models to the data, we needed to determine what kind of growth curves mathematical ability development follows. Here, we first fitted linear, quadratic, and exponential growth curve models to the data to determine the best-fitting curve for the data based on several fit indices. Fit statistics, including Chi-square, AIC, BIC, and RMSEA, are summarized in Table 3. Based on AIC, BIC and RMSEA, the quadratic growth curve model fitted the current mathematical ability data better than the alternatives. Thus, the quadratic growth rate model was used to analyze mathematical ability growth rates.1 1 The

program scripts for the data analysis can be obtained from the first author or downloaded from http://nd. psychstat.org/research/johnny_zhang/grm.

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

Table 3.

Fit indices for the linear, quadratic, and exponential growth rate models

Chi-square d.f.a AICb BICb RMSEAc CI for RMSEAc a Degrees b Akaike

(BIC)

13

Linear

Quadratic

Exponential

950 49 10728 10804 0.122 [0.115, 0.129]

231 45 10017 10114 0.058 [0.051, 0.065]

294 48 10074 10156 0.064 [0.058, 0.072]

of freedom information criterion (AIC) and Bayesian information criterion

c Root

mean squares error of approximation (RMSEA) and confidence interval (CI)

To investigate individual differences in rates of mathematical ability development, a quadratic growth rate model without covariates was first fitted to the data. The output of the quadratic growth rate model naturally includes all the information for the corresponding quadratic growth curve model. Table 4 summarizes the parameter estimates that are the same for both the quadratic growth rate model and the corresponding growth curve model. The estimated mean and variance of rates of growth at each observed age are summarized in Table 5. Note that in Table 4, the slope estimate was the instantaneous rate of growth, 0.56, for the quadratic growth curve model at age 10. Furthermore, because the estimated variability of the slope was significant (0.012, S.E.=0.001), there existed individual differences in terms of growth rate at age 10. More information about rates of growth can be seen in Table 5. Clearly, the average rates of growth were not a constant but declined linearly with age. Furthermore, the variability in rates of growth first decreased and then increased. There were significant individual differences in rates of growth before age 12. From age 6 to age 11, there was significant growth in mathematical ability and there were also significant individual differences in the rate because both the mean and variance of rates of growth were significant. However, after age 12 there was only significant mean growth in mathematical ability; individual differences in rates of growth disappeared. That is, different children did not develop at different rates on mathematical ability after age 12 because there was no individual difference in rates of growth of mathematical ability. Note that the slope parameter in Table 4 gives the instantaneous rate of growth at age 10 based on the quadratic growth curve model after centering the age variable at 10. The slope parameter mean and variance estimates were exactly the same as those from the growth rate analysis. This is because for polynomial growth curve models, rates of growth can also be analyzed through centering as discussed earlier. However, if we only look at the results of the instantaneous rate of growth at one certain age from quadratic growth curve model, we cannot see the dynamic change in the rates of mathematical ability development. Especially, the significant result at one time point does not imply the significance at other time points.1 The covariates – gender and BPI variables – were then included to predict rates of growth in children’s mathematics performance. In addition to the main effects of gender and BPI, their interaction was also included as a covariate in the models. The results are summarized in Table 6, Table 7, and Figure 2. Again, the quadratic growth rate model provides a better picture on how the covariates are related to the rates of growth over time. From the analysis, the following conclusions can be drawn. First, the effects of the covariates on the rate of growth were linearly related to age because the best-fitted model 1 For the quadratic growth curve model and more general polynomial growth curve models, one could center age at different

values to study growth rates over age. However, this strategy is rarely applied in practice. A typical growth curve analysis still focuses on one certain time.

November 3, 2011

13:16 14

Journal of Applied Statistics

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade Table 4. Results same for both the quadratic growth curve model and the quadratic growth rate model (age is centered at 10)

Fixed-effects parameters Level Slope Quadratic Random-effects parameters Level – Level Level – Slope Level – Quadratic Slope – Slope Slope – Quadratic Quadratic – Quadratic

Residual variances

6 7 8 9 10 11 12 13 14 15

Estimates

S.E.

4.257 0.56 -0.046

0.029 0.005 0.002

0.785 0.075 -0.014 0.012 -0.002 3.73E-04 0.042 0.322 0.44 0.5 0.367 0.317 0.297 0.34 0.33 0.219

0.042 0.006 0.002 0.001 3.29E-04 1.59E-04 0.033 0.029 0.031 0.041 0.034 0.037 0.037 0.047 0.068 0.12

Table 5. Estimated means and variances of rates of growth for mathematics performance at each age

Age

Mean

Variance

6 7 8 9 10 11 12 13 14 15

0.931 (0.012) 0.838 (0.010) 0.745 (0.007) 0.653 (0.005) 0.560 (0.005) 0.467 (0.007) 0.374 (0.009) 0.282 (0.012) 0.189 (0.015) 0.096 (0.018)

0.061 (0.011) 0.044 (0.007) 0.030 (0.004) 0.020 (0.002) 0.012 (0.001) 0.007 (0.002) 0.005 (0.004) 0.007 (0.008) 0.011 (0.012) 0.018 (0.018)

was a quadratic growth model. Second, the standard errors of the estimated coefficients first decreased and then increased. Third, BPI was negatively related to the rates of growth from age 6 to 13. Children with higher BPI scores tended to grow more slowly on mathematics performance. However, at age 14 and 15, BPI was no longer related to rates of mathematics growth. Fourth, gender differences in rates of growth appeared only at age 8 and 9. At those ages, boys seemed to show quicker improvement in their performance on mathematics. Fifth, the interaction between gender and BPI had a significant effect on the growth rate between age 8 and 11. Closer examination revealed a positive interaction. Thus, for the same amount of increase in BPI, girls showed less decrease in rates of growth. Similarly, the dynamic associations between rates of growth and gender and BPI cannot be directly studies based on the use of a quadratic growth curve model. 4.6

Illustration of the exponential growth rate model

For the purpose of demonstration, we also applied the exponential growth rate model to analyze the mathematical ability data although the quadratic growth rate model fitted the data better. Figure 3 portrays the associations between rates of growth and covariates BPI and gender and their interaction across age. From Figure 3, the coefficients for the covariates and interaction were significant across the measured age range. Furthermore, the coefficients for both BPI and gender and their interaction changed in a nonlinear way with age. Note that for the exponential growth analysis, the rate of growth cannot be

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” 15

0.00 -0.02 -0.04 -0.06 -0.08

Effect of BPI on growth rate

0.02

Journal of Applied Statistics

6

8

10

12

14

12

14

12

14

0.05 0.00 -0.05

Effect of Gender on growth rate

Age

6

8

10

-0.04

0.00

0.04

0.08

Age

Effect of BPI and Gender interaction on growth rate

November 3, 2011

6

8

10 Age

Figure 2. The estimated coefficients with error bars for the covariates and their interaction at each age from the quadratic growth rate model. The error bars are calculated as the 95% confidence intervals.

November 3, 2011

13:16

Journal of Applied Statistics

16

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade Table 6. Results same for both the quadratic growth curve model and the quadratic growth rate model with covariates

Fixed-effects parameters Level on Intercept BPI Gender BPI*Gender Slope on Intercept BPI Gender BPI*Gender Quadratic on Intercept BPI Gender BPI*Gender Random-effects parameters Level – Level Level – Slope Level – Quadratic Slope – Slope Slope – Quadratic Quadratic – Quadratic

Residual variances

6 7 8 9 10 11 12 13 14 15

Estimates

S.E.

4.28 -0.246 -0.04 0.094

0.039 0.038 0.057 0.057

0.568 -0.042 -0.016 0.028

0.007 0.007 0.01 0.01

-0.048 0.001 0.003 -5.00E-04

0.002 0.002 0.003 0.003

0.745 0.068 -0.014 0.011 -1.55E-03 3.72E-04 0.046 0.319 0.438 0.495 0.362 0.324 0.301 0.358 0.329 0.203

0.041 0.005 0.002 0.001 3.21E-04 1.57E-04 0.033 0.029 0.031 0.041 0.034 0.038 0.037 0.048 0.067 0.118

Table 7. The estimated coefficients for the covariates of gender and BPI and their interaction

Age 6 7 8 9 10 11 12 13 14 15

BPI

s.e.

Gender

s.e.

B*G

s.e.

-0.051 -0.049 -0.046 -0.044 -0.042 -0.039 -0.037 -0.035 -0.032 -0.030

0.017 0.013 0.009 0.007 0.007 0.009 0.012 0.016 0.019 0.023

-0.039 -0.034 -0.028 -0.022 -0.016 -0.010 -0.004 0.002 0.008 0.014

0.024 0.019 0.014 0.010 0.010 0.012 0.017 0.022 0.028 0.034

0.032 0.031 0.03 0.029 0.028 0.027 0.026 0.025 0.024 0.023

0.024 0.019 0.014 0.010 0.010 0.012 0.017 0.022 0.028 0.034

analyzed through centering. 5.

Simulation Studies

To evaluate whether parameters in growth rate models can be recovered accurately, we conducted two simulation studies, one for the quadratic growth curve model and another for the exponential growth curve model. Because of the presence of missing data in real data analysis, we also investigated the impact of missing data on growth rate model parameter estimates. For both simulation studies, the population models were set up to mimic the quadratic and exponential growth curve models used in the empirical data analysis. First, two covariates – BPI and gender – were generated and their interaction term was also included in the simulated models. Second, the population parameter values were determined using the parameter estimates from the empirical study. Third, 10 occasions of

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011”

-0.06

-0.04

-0.02

17

-0.08

Effect of BPI on growth rate

Journal of Applied Statistics

6

8

10

12

14

12

14

12

14

-0.02 -0.04 -0.06

Effect of Gender on growth rate

0.00

Age

6

8

10

0.01

0.03

0.05

0.07

Age

Effect of BPI and Gender interaction on growth rate

November 3, 2011

6

8

10 Age

Figure 3. The estimated coefficients with error bars for the covariates and their interaction at each age from the exponential growth rate model. The error bars are calculated as the 95% confidence intervals.

November 3, 2011

13:16

Journal of Applied Statistics

18

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade

data, from age 6 to 15 were simulated. Fourth, the sample size was fixed at N = 1, 233. Finally, the generated missing data had exactly the same missingness patterns as the real data. For each of the two growth rate models, we first generated 1, 000 sets of complete data and then obtained the model parameter estimates for each set of data. Then, missing data for each complete data set were created according to the missingness patterns of the real data. Subsequently, the same growth rate model was used to analyze the incomplete data and obtain the model parameter estimates. In total, there were 1, 000 sets of parameter estimates for the quadratic and exponential growth rate models, respectively, under both complete and missing data conditions. For each parameter in the growth rate models, four statistics including relative bias, mean standard error, empirical standard deviation, and coverage probability were computed. Let ✓ denote the population value of a parameter in the simulation of the growth rate models and let ✓ˆr denote its parameter estimate and sˆr denote the standard error estimate from the rth set of simulated data. First, relative bias was calculated as "P # 1000 ˆ ✓ r r=1 Bias = 100 ⇥ 1 . 1000✓ For a well-estimated parameter, one would expect that the relative bias would be smaller than 5. Second, the mean standard error (S.E.) can be calculated as S.E. =

1000 1 X sˆr . 1000 r=1

Third, the empirical standard deviation was calculated as v u u 1 1000 X S.D. = t (✓ˆr 999 r=1

Fourth, the coverage probability for a 100(1 CVG =

1000 1 Xˆ 2 ✓r ) . 1000 r=1

↵)% confidence interval was obtained by

number of times ✓ 2 [✓ˆr z↵/2 sˆr , ✓ˆr + z1 1000

ˆr ] ↵/2 s

,

where z represents the cut points of a standard normal distribution. In the simulations, the 95% coverage probability was used. The results for the quadratic growth rate model simulation are summarized in Table 8. In order to save space, only the regression coefficients between the rates of growth and the covariates are reported. For complete data analysis, the relative biases were all smaller than 5% indicating that the regression coefficients at each age were very well recovered. The standard error estimates were also very close to the empirical standard deviation and thus the standard error estimates were also accurate. Furthermore, the coverage probability of each parameter was close to the nominal level 0.95. Thus, the quadratic growth rate model parameters were estimated very well. With the presence of missing data, the relative biases remained small for most parameters. Although the relative biases for the gender covariance at ages 12 and 13 were slightly larger than 5%, they were still not large given that the true parameter values were very small with an absolute value of 0.003.1 As with the complete data case, the standard er1 With a small parameter value, the random sample error and rounding error tend to have a big influence on the relative bias.

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” Journal of Applied Statistics

Table 8.

19

Simulation results for the quadratic growth rate model without and with missing data

Bias

S.E.

S.D.

CVG

Bias

S.E.

S.D.

CVG

BPI

Missing data

TRUE

6 7 8 9 10 11 12 13 14 15

-0.052 -0.050 -0.048 -0.046 -0.044 -0.042 -0.040 -0.038 -0.036 -0.034

-0.276 -0.140 0.007 0.167 0.341 0.533 0.743 0.983 1.234 1.522

0.013 0.011 0.009 0.007 0.006 0.006 0.007 0.008 0.010 0.012

0.013 0.011 0.009 0.007 0.006 0.006 0.007 0.008 0.010 0.012

0.948 0.955 0.951 0.953 0.956 0.948 0.939 0.936 0.935 0.940

-0.739 -0.444 -0.348 -0.127 0.114 0.400 0.669 0.990 1.347 1.746

0.020 0.016 0.012 0.009 0.008 0.010 0.014 0.018 0.023 0.027

0.021 0.016 0.012 0.009 0.008 0.010 0.014 0.018 0.023 0.028

0.941 0.933 0.935 0.941 0.947 0.946 0.946 0.950 0.946 0.947

Gender

Complete data Age

6 7 8 9 10 11 12 13 14 15

-0.039 -0.033 -0.027 -0.021 -0.015 -0.009 -0.003 0.003 0.009 0.015

0.173 0.268 0.406 0.622 1.012 1.921 3.465 -4.471 -2.623 -1.715

0.019 0.016 0.013 0.011 0.009 0.008 0.009 0.011 0.014 0.017

0.020 0.016 0.013 0.011 0.009 0.009 0.010 0.012 0.014 0.018

0.945 0.942 0.946 0.946 0.937 0.947 0.946 0.947 0.945 0.945

-3.720 -3.042 -2.854 -2.049 -0.601 2.854 5.334 -7.665 -1.893 -4.737

0.028 0.022 0.017 0.013 0.012 0.015 0.020 0.025 0.032 0.038

0.028 0.022 0.017 0.013 0.012 0.014 0.019 0.025 0.032 0.038

0.944 0.951 0.955 0.953 0.952 0.953 0.949 0.951 0.947 0.945

Interaction

November 3, 2011

6 7 8 9 10 11 12 13 14 15

0.034 0.035 0.036 0.037 0.038 0.039 0.040 0.041 0.042 0.043

-0.537 -0.207 0.105 0.401 0.681 0.946 1.198 1.429 1.667 1.885

0.019 0.016 0.013 0.011 0.009 0.008 0.009 0.011 0.014 0.017

0.020 0.017 0.014 0.011 0.009 0.009 0.010 0.012 0.015 0.018

0.942 0.939 0.935 0.925 0.936 0.942 0.947 0.943 0.946 0.950

-2.774 -2.203 -1.647 -1.129 -0.639 -0.167 0.269 0.681 1.090 1.472

0.028 0.022 0.017 0.013 0.012 0.015 0.020 0.026 0.032 0.038

0.029 0.022 0.017 0.013 0.012 0.015 0.020 0.026 0.032 0.039

0.942 0.946 0.943 0.948 0.943 0.950 0.947 0.943 0.946 0.944

Note. Bias: relative bias in %. S.E.: average standard error. S.D.: standard deviation. CVG: coverage probability.

rors were accurately estimated and the coverage probabilities were close to the nominal level 0.95. Thus, the quadratic growth rate model parameters were still estimated very well in the presence of missing data. However, the standard error estimates were notably larger than those from complete data, which was expected because missing data caused the loss of information. Table 9 presents the results for the exponential growth rate model with and without missing data. The results clearly show that both parameter estimates and their standard errors were accurately estimated regardless of missing data. Furthermore, the coverage probabilities were close to the nominal level 0.95. Therefore, the parameters of both the quadratic growth rate model and the exponential growth rate model were estimated very well under the conditions of complete and missing data. 6.

Discussion

We proposed a systematical method to reparametrize growth curve models for analyzing rates of growth. This method provides a precise formulation on how to construct growth rate models from their corresponding growth curve models and can especially benefit substantive researchers because of its usability. First, the method directly targets the rate of growth that has been the interest of many studies. Second, growth rate models can be estimated conveniently using existing SEM and multilevel software. Third, growth rate models can provide useful information above and beyond the information provided by traditional growth curve analysis as demonstrated by the application and simulations. For example, growth rate models such as the exponential growth rate model can be used

13:16

Journal of Applied Statistics

20

”JAS resubmission Nov 2011”

Zhiyong Zhang, John J. McArdle & John R. Nesselroade Table 9.

Simulation results for the exponential growth rate model without and with missing data

Bias

S.E.

S.D.

CVG

Bias

S.E.

S.D.

CVG

BPI

Missing data

TRUE

6 7 8 9 10 11 12 13 14 15

-0.062 -0.053 -0.046 -0.039 -0.033 -0.029 -0.025 -0.021 -0.018 -0.015

-0.160 -0.187 -0.219 -0.256 0.000 0.000 -0.408 0.000 0.000 0.000

0.009 0.008 0.007 0.006 0.005 0.004 0.004 0.003 0.003 0.002

0.009 0.008 0.007 0.006 0.005 0.004 0.004 0.003 0.003 0.002

0.956 0.953 0.953 0.957 0.957 0.958 0.956 0.949 0.957 0.954

0.481 0.562 0.438 0.512 0.599 0.699 0.408 0.957 1.117 1.307

0.012 0.010 0.009 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.013 0.011 0.009 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.947 0.943 0.939 0.946 0.946 0.948 0.932 0.941 0.944 0.944

Gender

Complete data Age

6 7 8 9 10 11 12 13 14 15

-0.031 -0.027 -0.023 -0.020 -0.017 -0.014 -0.012 -0.011 -0.009 -0.008

0.962 1.124 1.316 1.026 1.198 1.399 1.639 0.952 1.111 0.000

0.013 0.011 0.010 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.013 0.011 0.010 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.952 0.951 0.952 0.949 0.954 0.955 0.955 0.953 0.956 0.956

0.000 0.000 0.439 0.513 0.599 0.000 0.820 0.000 0.000 0.000

0.017 0.015 0.013 0.011 0.009 0.008 0.007 0.006 0.005 0.004

0.017 0.015 0.012 0.011 0.009 0.008 0.007 0.006 0.005 0.004

0.959 0.955 0.959 0.957 0.962 0.964 0.951 0.965 0.961 0.961

Interaction

November 3, 2011

6 7 8 9 10 11 12 13 14 15

0.047 0.040 0.034 0.029 0.025 0.022 0.018 0.016 0.013 0.012

-0.641 -0.750 -0.875 -0.683 -0.797 -0.930 -1.087 -0.637 -0.746 -0.870

0.013 0.011 0.010 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.013 0.011 0.009 0.008 0.007 0.006 0.005 0.004 0.004 0.003

0.947 0.952 0.953 0.953 0.954 0.957 0.952 0.948 0.954 0.953

0.214 0.500 0.292 0.341 0.398 0.000 0.000 0.637 0.746 0.870

0.017 0.015 0.013 0.011 0.009 0.008 0.007 0.006 0.005 0.004

0.017 0.015 0.013 0.011 0.009 0.008 0.007 0.006 0.005 0.004

0.954 0.954 0.955 0.953 0.954 0.951 0.938 0.950 0.950 0.951

Note. Bias: relative bias in %. S.E.: average standard error. S.D.: standard deviation. CVG: coverage probability.

to study dynamic relations between growth rates and covariates over time for nonlinear trajectories. The method of growth rate analysis has many important theoretical implications. The adoption of the formal definition of the rate of growth is important to spur future model development and is likely to stimulate future research in this area. For example, one may take one step further to investigate the analysis of the acceleration of growth or change. The results from the analysis of children’s mathematical performance data also have interesting implications for developmentalists. Overall, the results showed that individual differences in rates of growth occurred only at certain ages. It was also found that BPI and gender showed different relationships with rates of growth at different ages. Boys have been shown to outperform girls in school performance of mathematics. However, the current study uncovered some subtleties that warrant further discussion. Gender was found to be related to the rate of mathematical performance only at age 8 and 9 across the age span 6 to 15 years. It appears that there is a narrow “window” during which boys separate themselves from girls because of significantly greater rates of growth in mathematical performance. However, because the rates of growth do not differ after age 9, even though they keep improving, the girls cannot catch up to the boys and the differences established at that early age tend to persist. There were positive interaction effects between gender and BPI from age 8 to age 11. For both boys and girls, higher BPI corresponded to lower rates of growth of mathematics performance. For the same increase in BPI, girls showed relatively less decrease in rates of growth. After a certain threshold of BPI, girls could show higher rates of growth in mathematics performance. However, the current observations of BPI did not exceed the threshold. It was also found that BPI was negatively related to rates of mathematical

November 3, 2011

13:16

Journal of Applied Statistics

”JAS resubmission Nov 2011” REFERENCES

21

performance growth before age 14. This is consistent with the general conclusion that children with higher levels of behavior problems tended to perform less well in school [28]. Based on the current results, children performed worse seemingly in part because of the slower rates of improvement. 6.1

Future directions

Several perspectives of both the method and the application have implications for future investigations. Conceptually, the idea of analyzing rates of growth can be extended to any research related to change. For example, in cluster analysis, mixture models etc., one can investigate the classification of participants based on the rate of growth as well as the level of growth. For example, two individuals performing at the same level but one slowing down and the other speeding up most likely do not belong in the same cluster if one is studying longitudinal change. Here, we will simply mention several possible extensions of the current work that can be carried out immediately. First, although the covariates used in the current study were time-invariant, growth curve models with time-varying covariates (e.g., different covariate values at different response times) have been developed and used. The inclusion of time-varying covariates in growth rate models is a natural extension of this work in the future. Second, in the construction of growth rate models, it is assumed that the growth functions have to be differentiable at certain time points. However, for some models, such as the latent basis growth curve models, this assumption cannot be met. Thus, other methods, such as nonparametric methods, can be considered to accommodate this problem. Furthermore, we have assumed that measurement errors are independent of time. This assumption can also be relaxed in future work. 6.2

Concluding remarks

Rate of growth or change is often a primary focus of longitudinal research. However, traditional growth curve models generally cannot directly analyze rate of growth. Although rate of growth is constant for linear growth, rate at one time point does not provide a full picture of the rates for the whole time span of a study for nonlinear growth. Growth rate models proposed in this study complement growth curve models in providing information on rate of growth for longitudinal data analysis. It is recommended that growth rate analysis should be carried out when growth curve analysis is conducted. Acknowledgement

The authors would like to thank two anonymous reviewers for their suggestions that have largely improved this study. References [1] P.B. Baltes and J.R. Nesselroade, History and rationale of longitudinal research, in Longitudinal research in the study of behavior and development, J.R. Nesselroade and P.B. Baltes, eds., New York: Academic Press, 1979, pp. 1–39. [2] J.C. Biesanz, N. Deeb-Sossa, A.A. Papadakis, K.A. Bollen, and P.J. Curran, The role of coding time in estimating and interpreting growth curve models, Psychological Methods 9 (2004), pp. 30–52. [3] S.M. Boker and J.R. Nesselroade, A method for modeling the intrinsic dynamics of intraindividual variability: Recovering the parameters of simulated oscillators in multi–wave data, Multivariate Behavioral Research 37 (2002), pp. 127–160. [4] K. Bollen and P. Curran, Latent curve models: A structural equation perspective, New Jersey: Wiley, 2006. [5] M.W. Browne, Structured latent curve models, in Multivariate analysis: Future directions 2, C.M. Cuadras and C.R. Rao, eds., Amsterdam: North-Holland, 1993, pp. 171–198.

November 3, 2011

13:16 22

Journal of Applied Statistics

”JAS resubmission Nov 2011” REFERENCES

[6] M.W. Browne, An overview of analytic rotation in exploratory factor analysis, Multivariate Behavioral Research 36 (2001), pp. 111–150. [7] M.W. Browne and S.H.C. Du Toit, Models for learning data, in Best methods for the analysis of change, L. Collins and J.L. Horn, eds., Washington, D.C.: APA Press, 1991, pp. 47–68. [8] K.L. Bub, K. McCartney, and J.B. Willett, Behavior problem trajectories and first-grade cognitive ability and achievement skills: A latent growth curve analysis., Journal of Educational Psychology 99 (2007), pp. 653–670. [9] R.B. Cattell, Patterns of change: Measurement in relation to state dimension, trait change, lability, and process concepts, in Handbook of Multivariate Experimental Psychology, R.B. Cattell, ed., 1st ed., Rand McNally, Chicago, IL, 1966, pp. 355–402. [10] Center for Human Resource Research, NLSY79 CHILD & YOUNG ADULT DATA USERS GUIDE: A guide to the 19862004 child data [Computer software manual], The Ohio State University, Columbus, Ohio, 2006. [11] R. Cudeck and S.H.C. du Toit, A version of quadratic regression with interpretable parameters, Multivariate Behavioral Research 37 (2002), pp. 501–519. [12] R. Cudeck and J. Harring, Analysis of nonlinear patterns of change with random coefficient models, Annual review of psychology 58 (2007), p. 615. [13] F.N. Freeman and C.D. Flory, Growth in intellectual ability as measured by repeated tests, Monographs of the Society for Research in Child Development 2 (1937), pp. 1–116. [14] F.N. Freeman and C.D. Flory, Growth in intellectual ability, in Child behavior and development, R.G. Barker, J.S. Kounin, and H.F. Wright, eds., New York: McGraw-Hill Book Company, Inc., 1943, pp. 147–160. [15] K.J. Grimm, J.S. Steele, A.J. Mashburn, M. Burchinal, and R.C. Pianta, Early behavioral associations of achievement trajectories, Developmental Psychology 46 (2010), pp. 976–983. [16] D. Hedeker and R.D. Gibbons, Longitudinal data analysis, John Wiley & Sons, Inc., Hoboken, New Jersey, 2006. [17] R.K. Hoa and I. Hu, Flexible modelling of random effects in linear mixed models: A bayesian approach, Computational Statistics and Data Analysis 52 (2008), pp. 1347–1361. [18] J.S. Hyde, E. Fennema, and S.J. Lamon, Gender differences in mathematics performance: A meta-analysis., Psychological Bulletin 107 (1990), pp. 139–155. [19] R.I. Jennrich, Rotation methods, algorithms, and standard errors, in Factor Analysis at 100: Historical Developments and Future Directions, R. Cudeck and R.C. MacCallum, eds., Lawrence Erlbaum Associates, Inc., Mahwah, NJ, 2007, pp. 315–336. [20] N.C. Jordan, D. Kaplan, C. Ramineni, and M.N. Locuniak, Early math matters: Kindergarten number competence and later mathematics outcomes., Developmental Psychology 45 (2009), pp. 850 – 867. [21] D. Kaplan, Methodological advances in the analysis of individual growth with relevance to education policy, Peabody Journal of Education 77 (2002), pp. 189–215. [22] K. Kelly and S.E. Maxwell, Delineating the average rate of change in longitudinal models, Journal of Educational and Behavioral Statistics 33 (2008), pp. 307–332. [23] L. Kowalski-Jones and G.J. Duncan, The structure of achievement and behavior across middle childhood, Child Development 70 (1999), pp. 930–943. [24] N.M. Laird and J.H. Ware, Random-effects models for longitudinal data, Biometrics 38 (1982), pp. 963–974. [25] R.J.A. Little and D.B. Rubin, Statistical analysis with missing data, Wiley, New York, N.Y., 1987. [26] J.J. McArdle and F. Hamagami, Latent difference score structural models for linear dynamic analysis with incomplete longitudinal data, in New methods for the analysis of change, L. Collins and A. Sayer, eds., American Psychological Association, Washington, D.C., 2001, pp. 139–175. [27] J.J. McArdle and J.R. Nesselroade, Growth curve analysis in contemporary psychological research, in Comprehensive Handbook of Psychology: Research Methods in Psychology, J. Schinka and W. Velicer, eds., Vol. 2, New York: Wiley, 2003, pp. 447–480. [28] M.M. McClelland, F.J. Morrison, and D.L. Holmes, Children at risk for early academic problems: The role of learning-related social skills., Early Childhood Research Quarterly 15 (2000), pp. 307–329. [29] M.C. Neale and J.J. McArdle, Structured latent growth curves for twin data, Twin Research 3 (2000), pp. 165–177. [30] J.O. Ramsay and B.W. Silverman, Applied functional data analysis: Methods and case studies, 2nd ed., Springer, 2002. [31] S.W. Raudenbush and A.S. Bryk, Hierarchical linear models: Applications and data analysis methods, Sage, 2002. [32] C.A. Reynolds, D. Finkel, M. Gatz, and N.L. Pedersen, Sources of influence on rate of cognitive change over time in swedish twins: An application of latent growth models., Experimental Aging Research 28 (2002), pp. 407–433. [33] R.L. Sandland and C.A. McGilchrist, Stochastic growth curve analysis, Biometrics 35 (1979), pp. 255–271. [34] R. Scammon, The first seriatim study of human growth, American Journal of Physical Anthropology 10 (1927), pp. 329–336. [35] C. Schuster and A. von Eye, Determining the meaning of parameters in multilevel models for longitudinal data., International Journal of Behavioral Development 22 (1998), pp. 475–491. [36] J.D. Singer and J.B. Willett, Applied longitudinal data analysis: modeling change and event occurrence, Oxford Universal Academy Press, 2003. [37] J.A. Stimson, E.G. Carmines, and R.A. Zeller, Interpreting polynomial regression, Sociological Methods and Research 6 (1978), pp. 515–524. [38] R. Tate, Elaboration of HLM growth modeling results, Florida Journal of Education Research 40 (2000), pp. 53–75. [39] L.R. Tucker, Learning theory and multivariate experiment: Illustration by determination of parameters of generalized learning curves, in The Handbook of Multivariate Experimental Psychology, R.B. Cattell, ed., Rand McNally, New York, 1966, pp. 476–501. [40] G. Verbeke and G. Molenberghs, Linear mixed model for longitudinal data, Springer, 2000. [41] G.L. Williamson, M. Appelbaum, and A. Epanchin, Longitudinal analyses of academic achievement, Journal of Educational Measurement 28 (1991), pp. 61–76.