0.1 Overview

This script works through two basic representations of change: Auto-Regressive Models of Change and Difference-Score Models of Change. The two types of models answer different kinds of research questions: questions about “Change in Interindividual Differences” and questions about “Interindividual Differences and Intraindividual Change”, respectively.

0.2 Outline

Data Preparation
Auto-Regression Model
Difference Score Model
Conclusion

0.2.0.1 Prelim - Loading libraries used in this script.

library(psych)
library(ggplot2)

0.3 Data Preparation

We use two occasions of the multi-occasion WISC data for our examples.

Load the repeated measures data

############################
####### Reading in the Data
############################
#set filepath for data file
filepath <- "https://quantdev.ssri.psu.edu/sites/qdev/files/wisc3raw.csv"
#read in the .csv file using the url() function
wisc3raw <- read.csv(file=url(filepath),header=TRUE)

Subsetting to a dataset with just two occasions. We include the id, verb1 and verb6 variables.

wiscsub <- wisc3raw[,c("id","verb1","verb6")]
head(wiscsub)

id	verb1	verb6
1	24.42	55.64
2	12.44	37.81
3	32.43	50.18
4	22.69	44.72
5	28.23	70.95
6	16.06	39.94

Some basic descriptives …

#descriptives
describe(wiscsub[,c("verb1","verb6")])

	vars	n	mean	sd	median	trimmed	mad	min	max	range	skew	kurtosis	se
verb1	1	204	19.58505	5.807695	19.335	19.49768	5.41149	3.33	35.15	31.82	0.1301071	-0.0528376	0.4066200
verb6	2	204	43.74990	10.665051	42.545	43.45610	11.29741	17.35	72.59	55.24	0.2356459	-0.3605210	0.7467029

corr.test(wiscsub[,c("verb1","verb6")])

## Call:corr.test(x = wiscsub[, c("verb1", "verb6")])
## Correlation matrix 
##       verb1 verb6
## verb1  1.00  0.65
## verb6  0.65  1.00
## Sample Size 
## [1] 204
## Probability values (Entries above the diagonal are adjusted for multiple tests.) 
##       verb1 verb6
## verb1     0     0
## verb6     0     0
## 
##  To see confidence intervals of the correlations, print with the short=FALSE option

And some bivariate plots of the two-occasion relations.

pairs.panels(wiscsub[,c("verb1","verb6")])

We can also plot intraindividual change, by putting time along the x-axis. This requires reshaping the data.

#reshaping wide to long
wiscsublong <- reshape(data=wiscsub[c("id","verb1","verb6")], 
                    varying=c("verb1","verb6"), 
                    timevar="grade", idvar="id", 
                    direction="long", sep="")
#sorting for convenience
wiscsublong <- wiscsublong[order(wiscsublong$id,wiscsublong$grade),]

#making intraindividual change plot
ggplot(data = wiscsublong, aes(x = grade, y = verb, group = id)) +
  geom_point() + 
  geom_line() +
  xlab("Grade") + 
  ylab("WISC Verbal Score") + ylim(0,100) +
  scale_x_continuous(breaks=seq(1,6,by=1))

Notice here that each line indicates how an individual’s Grade 6 score differs from their Grade 1 score - Intraindividual Change.
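
To make the reshaping step concrete, the sketch below applies the same reshape() call to a toy data frame built from the six rows printed by head(wiscsub) above. The object names toy and toylong are illustrative and are not part of the original script.

```r
# Toy data: the six rows shown by head(wiscsub) earlier in this script
toy <- data.frame(id    = 1:6,
                  verb1 = c(24.42, 12.44, 32.43, 22.69, 28.23, 16.06),
                  verb6 = c(55.64, 37.81, 50.18, 44.72, 70.95, 39.94))

# Same wide-to-long call as above: with sep="", "verb1" is split into
# the stem "verb" (new column name) and the suffix 1 (value of grade)
toylong <- reshape(data=toy,
                   varying=c("verb1","verb6"),
                   timevar="grade", idvar="id",
                   direction="long", sep="")
toylong <- toylong[order(toylong$id, toylong$grade),]

# Each person now contributes two rows, one per occasion
head(toylong, 4)
```

The long format has one row per person per occasion, which is what lets ggplot draw one line per id across grade.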

0.3.1 Difference score calculation

Following from the plot, we can create the difference score from verb6 and verb1 by subtraction. We name the new variable “verbD”.

\[ \Delta verb_{i} = verbD_{i} = verb6_{i} - verb1_{i}\]

#calculating difference score
wiscsub$verbD <- wiscsub$verb6-wiscsub$verb1

head(wiscsub)

id	verb1	verb6	verbD
1	24.42	55.64	31.22
2	12.44	37.81	25.37
3	32.43	50.18	17.75
4	22.69	44.72	22.03
5	28.23	70.95	42.72
6	16.06	39.94	23.88

Look at the descriptives with the difference score.

describe(wiscsub[,c("verb1","verb6","verbD")])

	vars	n	mean	sd	median	trimmed	mad	min	max	range	skew	kurtosis	se
verb1	1	204	19.58505	5.807695	19.335	19.49768	5.411490	3.33	35.15	31.82	0.1301071	-0.0528376	0.4066200
verb6	2	204	43.74990	10.665051	42.545	43.45610	11.297412	17.35	72.59	55.24	0.2356459	-0.3605210	0.7467029
verbD	3	204	24.16485	8.151262	23.910	23.85000	8.094996	4.62	50.88	46.26	0.3776055	0.1352728	0.5707025

corr.test(wiscsub[,c("verb1","verb6","verbD")])

## Call:corr.test(x = wiscsub[, c("verb1", "verb6", "verbD")])
## Correlation matrix 
##       verb1 verb6 verbD
## verb1  1.00  0.65  0.14
## verb6  0.65  1.00  0.84
## verbD  0.14  0.84  1.00
## Sample Size 
## [1] 204
## Probability values (Entries above the diagonal are adjusted for multiple tests.) 
##       verb1 verb6 verbD
## verb1  0.00     0  0.04
## verb6  0.00     0  0.00
## verbD  0.04     0  0.00
## 
##  To see confidence intervals of the correlations, print with the short=FALSE option
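
The descriptives of the difference score follow directly from those of verb1 and verb6. As a rough check using the rounded values reported above (a worked illustration, not part of the original script), the mean of the difference equals the difference of the means, and the standard deviation of the difference depends on the two standard deviations and the correlation \(r\) between verb1 and verb6:

\[ \overline{verbD} = \overline{verb6} - \overline{verb1} = 43.750 - 19.585 = 24.165 \]

\[ SD_{verbD} = \sqrt{SD_{verb6}^{2} + SD_{verb1}^{2} - 2\,r\,SD_{verb6}\,SD_{verb1}} \approx \sqrt{113.74 + 33.73 - 2(0.65)(10.665)(5.808)} \approx 8.18 \]

The small gap from the reported SD of 8.15 arises because the correlation is rounded to two decimals in the output.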

Of particular interest in questions about intraindividual change is the relation between the “pre-test” score and the amount of intraindividual change. We can look at the bivariate association.

pairs.panels(wiscsub[,c("verb1","verbD")])

1 Models Models Models

There are two basic models of change …

1.1 Auto-Regression Model

The Auto-Regression (AR) model is useful for examining questions about “Change in Interindividual Differences”. The model is written as

\[ verb6_{i} = \beta_{0} + \beta_{1}verb1_{i} + e_{i}\]

We note that this is a model of relations among between-person differences. It is similar to, but is not, a single-subject time-series model (such models are also called auto-regression models, but they are fit to a different kind of data).

Translating the between-person Auto-Regression Model into code and fitting it to the two-occasion data …

#AR Model
ARfit <- lm(formula= verb6 ~ 1 + verb1,
            data=wiscsub,
            na.action=na.exclude)
summary(ARfit)

## 
## Call:
## lm(formula = verb6 ~ 1 + verb1, data = wiscsub, na.action = na.exclude)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -20.2459  -5.8651   0.1781   4.9048  27.9976 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 20.22485    1.99608   10.13   <2e-16 ***
## verb1        1.20117    0.09773   12.29   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 8.087 on 202 degrees of freedom
## Multiple R-squared:  0.4279, Adjusted R-squared:  0.425 
## F-statistic: 151.1 on 1 and 202 DF,  p-value: < 2.2e-16

The intercept term, \(\beta_{0}\) = 20.22, is the expected value of Verbal Ability at the 2nd occasion for an individual with a Verbal Ability score = 0 at the 1st occasion. Because the lowest observed score at the 1st occasion is 3.33, this intercept is an extrapolation beyond the observed data. The slope term, \(\beta_{1}\) = 1.20, indicates that for every 1 point difference in Verbal Ability at the 1st occasion, we expect a 1.20 point difference at the 2nd occasion.
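
As a worked example of this interpretation, the sketch below plugs the estimates from the summary output into the model equation. The helper name predict_verb6 is hypothetical and is used only for illustration; with the fitted object available, predict(ARfit, newdata=...) would serve the same purpose.

```r
# Estimates copied from the summary(ARfit) output above
b0 <- 20.22485   # intercept
b1 <- 1.20117    # slope for verb1

# Hypothetical helper: expected Grade 6 score for a given Grade 1 score
predict_verb6 <- function(verb1) b0 + b1 * verb1

# A 1-point difference at Grade 1 maps to a b1-point difference at Grade 6
predict_verb6(21) - predict_verb6(20)

# At the sample mean of verb1 (19.58505) the prediction should be close
# to the sample mean of verb6 (43.74990), because a least-squares line
# passes through the point of means
predict_verb6(19.58505)
```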

We can plot the Auto-Regression model prediction with confidence intervals (CI). The function ‘termplot’ takes the fitted lm object. The CI bounds are plotted with the ‘se’ option and residuals with ‘partial.resid’ option.

termplot(ARfit,se=TRUE,partial.resid=TRUE,
         main="Auto-Regression Model",
         xlab="Verbal Score at Grade 1",
         ylab="Verbal Score at Grade 6")

Note that this code makes use of the lm() model object.

We can also do something similar with the raw data using ggplot.

#making interindividual regression plot
ggplot(data = wiscsub, aes(x = verb1, y = verb6)) +
  geom_point() + 
  geom_smooth(method="lm", formula= y ~ 1 + x, 
              se=TRUE, fullrange=TRUE, color="red", size=2) +
  xlab("Verbal Score at Grade 1") + 
  ylab("Verbal Score at Grade 6") +
  ggtitle("Auto-Regression Model")

Note that this code embeds an lm() model within the ggplot function.

1.2 Difference Score Model

The Difference Score model is useful for examining questions about “Interindividual Differences and Intraindividual Change”. The model is written as

\[ verbD_{i} = \beta_{0} + \beta_{1}verb1_{i} + e_{i}\]

#Difference score model
DIFfit <- lm(formula = verbD ~ 1 + verb1,
             data=wiscsub,
             na.action=na.exclude)
summary(DIFfit)

## 
## Call:
## lm(formula = verbD ~ 1 + verb1, data = wiscsub, na.action = na.exclude)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -20.2459  -5.8651   0.1781   4.9048  27.9976 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 20.22485    1.99608  10.132   <2e-16 ***
## verb1        0.20117    0.09773   2.058   0.0408 *  
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 8.087 on 202 degrees of freedom
## Multiple R-squared:  0.02054,    Adjusted R-squared:  0.0157 
## F-statistic: 4.237 on 1 and 202 DF,  p-value: 0.04083

The intercept term, \(\beta_{0}\) = 20.22, is the expected value of the Difference score (Change in Verbal Ability) for an individual with a Verbal Ability score = 0 at the 1st occasion. The slope term, \(\beta_{1}\) = 0.20, indicates that for every 1 point difference in Verbal Ability at the 1st occasion, we expect a 0.20 point difference in the amount of intraindividual change.
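
These estimates are not independent of the Auto-Regression results. Subtracting \(verb1_{i}\) from both sides of the Auto-Regression equation (writing its coefficients with the superscript AR, a notation introduced here only for this derivation) gives

\[ verb6_{i} - verb1_{i} = \beta_{0}^{AR} + (\beta_{1}^{AR} - 1)verb1_{i} + e_{i}\]

so the model for \(verbD_{i}\) has the same intercept, a slope that is smaller by exactly 1 (1.20117 - 1 = 0.20117), and the same residuals \(e_{i}\). This matches the two summary outputs, which share the intercept 20.22485, the residual quantiles, and the residual standard error of 8.087. The R-squared values differ (0.4279 vs. 0.0205) because the outcome variable, and therefore the total variance being explained, differs.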

The same methods as above can be used to plot the results of the Difference Score model.

termplot(DIFfit,se=TRUE,partial.resid=TRUE,
         main="Difference-score Model",
         xlab="Verbal Score at Grade 1",
         ylab="Difference in G1 and G6 Verbal Scores")

We can also do something similar using ggplot.

#making interindividual regression plot
ggplot(data = wiscsub, aes(x = verb1, y = verbD)) +
  geom_point() + 
  geom_smooth(method="lm", formula= y ~ 1 + x, 
              se=TRUE, fullrange=TRUE, color="red", size=2) +
  xlab("Verbal Score at Grade 1") + 
  ylab("Difference Score") +
  ggtitle("Difference Score Model")

Note that each of these plots of model results is a regression plot: outcome on the y-axis, predictor on the x-axis.

2 Conclusion

This session briefly touched on two basic representations of change: Auto-Regressive Models of Change, which are used to examine “Change in Interindividual Differences”, and Difference-Score Models of Change, which are used to examine “Interindividual Differences and Intraindividual Change”. With only two occasions, the two models cannot be distinguished by goodness-of-fit tests: in the output above they produce identical residuals and the same residual standard error (8.087). They can be distinguished when more than two repeated measures (t > 2) are available. The interpretations of change from the two models are fundamentally different, so make sure that the model chosen matches the intended research question.
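
The equivalence of the two models' fit with two occasions can be checked numerically. The sketch below is an illustration only: it fits both models to a toy data frame built from the six rows printed by head(wiscsub) earlier (the names toy, ar_toy, and dif_toy are not part of the original script), so the estimates differ from the full-sample results, but the relations between the two fits hold for any data.

```r
# Toy data: the six rows shown by head(wiscsub) earlier in this script
toy <- data.frame(verb1 = c(24.42, 12.44, 32.43, 22.69, 28.23, 16.06),
                  verb6 = c(55.64, 37.81, 50.18, 44.72, 70.95, 39.94))
toy$verbD <- toy$verb6 - toy$verb1

# Fit both models to the same toy data
ar_toy  <- lm(verb6 ~ 1 + verb1, data=toy)
dif_toy <- lm(verbD ~ 1 + verb1, data=toy)

# Same intercept; the slopes differ by 1
coef(ar_toy)
coef(dif_toy)

# Same residuals, hence the same residual standard error
all.equal(resid(ar_toy), resid(dif_toy))
all.equal(sigma(ar_toy), sigma(dif_toy))
```

Because the residuals coincide, any fit index based on them cannot favor one model over the other; the choice has to come from the research question.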

Two Occasion Change