.b[Serial Correlation]

.title[
# .b[Serial Correlation]
]
.subtitle[
## .b[.green[EC 339]]
]
.author[
### Marcio Santetti
]
.date[
### Fall 2022
]

---

# Motivation

---

# The road so far

Over the past weeks, we have learned:

- That .it[omitting] relevant variables from a model causes .hi[bias];
  
  - That deterministic/strong stochastic .it[linear relationships] between two independent variables harm regression .hi[standard errors], and, therefore, OLS .hi[inference].

This week and the next, we turn our attention to the .hi[residual] term, *u*.

- We begin by investigating what happens when observations within *u* share some sort of .hi[linear relationship].
  
  - This problem is .it[extremely common] in time-series data, given that the .hi[order] of observations matters, which is not true for cross-section data.

---

# Pure serial correlation

---

# Pure serial correlation

Recall .hi[CLRM Assumption IV]:

> *"Observations of the error term are .it[uncorrelated] with each other."*
  
$$
`\begin{align}
\mathbb{E}(r_{u_i,uj}) = 0 \hspace{1cm} \text{with } i \neq j    
\end{align}`
$$

In a .hi[well-specified] model, autocorrelation can be characterized in the following way:

$$
`\begin{align}
u_t = \rho u_{t-1} + e_t  
\end{align}`
$$

where *&rho;* is known as the _.it[autocorrelation coefficient]_.

As *&rho; &rarr;* |1|, the higher the .it[degree] of serial correlation.

If *&rho; >* |1|, we have an .it[explosive] trajectory.

---

# Pure serial correlation

---

# Impure serial correlation

---

# Impure serial correlation

The .hi["impure"] version of serial correlation occurs in .it[misspecified models].

Whenever the error term contains a relevant variable that has been omitted from the model, which in turn is .hi[serially correlated] itself, we have a case of impure serial correlation.

- A simple .it[example]: suppose we are interested in a person's wealth over time. In case we omit their .it[credit score] measure, for instance, it will be part of the error term.

- Do you believe one's credit score .it[today] is dependent on their .it[last year's] credit score?
  
  - If you .it[do], then this omitted variable is affecting the error term, thus causing serial correlation, even if the error term, .it[by itself], is not serially correlated.
  
---

# Impure serial correlation

Recall what happens when we omit a relevant variable from a model:

- Suppose we have the .it["true"] population model:

$$
`\begin{align}
y_t = \beta_0 + \beta_1x_{1t} + \beta_2x_{2t} + u_t  
\end{align}`
$$ 
--

- And instead we estimate:
  
$$
`\begin{align}
y_t = \beta_0 + \beta_1x_{1t} + u_t^* 
\end{align}`
$$

with `$u_t^* = u_t + \beta_2x_{2t}$`.

In case `$x_2$` is .it[serially correlated], it will affect the residual term, which in turn .it[will be] serially correlated.

---

# Consequences of serial correlation

---

# Consequences of serial correlation

Firstly, autocorrelation .hi[does not] cause .hi[bias] to OLS estimates.

However, it affects OLS .hi[standard errors], undermining .hi[inference] from OLS models.

- Since it usually .hi[underestimates] SEs, we end up being .it[more likely] to .it[reject] null hypotheses, increasing the likelihood of .it[Type I error].

This way, OLS is no longer .hi-blue[BLUE].

- Why? Its .hi-blue[B] part is affected.
  
  - .hi-blue["Best"] refers to .hi-blue[minimum variance], which is not achieved with serial correlation.

---

# Dealing with serial correlation

---

# Dealing with serial correlation

In addition to .hi[visualizing] OLS residuals, there are several .hi[tests] for serial correlation.

The most common ones are the .hi[Durbin-Watson] and .hi-orange[Breusch-Godfrey] tests.

Moreover, we can use the .hi-orange[Cochrane-Orcutt] estimator to correct for serial correlation.

We will study these procedures through an .hi[applied example].

---

# Okun's law

.hi[Okun's law] illustrates the relationship between .hi-orange[unemployment] and .hi-orange[growth] in an economy over time.

In a very .it[basic] form, it can be expressed as follows:

$$
`\begin{align}
u_t - u_{t-1} = -\gamma (g_t - g_n)
\end{align}`
$$

- where `$u_t$` and `$u_{t-1}$` are the unemployment rate at time `$t$` and `$t-1$`, respectively;
 - `$g_t$` is the output growth rate at time `$t$`, and `$g_n$` is the "normal" output growth rate, which can be assumed as constant.
 - The `$\gamma$` coefficient measures this relationship. If the growth of output is *above* the normal rate, unemployment falls; a growth rate *below* the normal rate leads to an increase in unemployment.

---

# Okun's law

We can rewrite Okun's law as:

$$
`\begin{align}
\Delta u_t = -\gamma \ g_t
\end{align}`
$$ 
where `$\Delta u_t$` denotes the .it[change] in unemployment from `$t-1$` to `$t$`.

As an .it[econometric model], we can write it as follows:

$$
`\begin{align}
\Delta u_t = \beta_0 + \beta_1g_t + \varepsilon_i
\end{align}`
$$

Let's throw some .it[data] in!

---

# Okun's law: data for Australia (1978Q2&mdash;2016Q2):

---

# Okun's law

A quick check at the model's .hi[residuals]:

Does it look .hi-blue[autocorrelated]?

---

# The Durbin-Watson test

The .hi-orange[Durbin-Watson] test for autocorrelation is used to test for .hi[first-degree] serial correlation.

Provided that the regression model contains an .it[intercept] term `$(\beta_0)$` and has .it[no lagged] independent variable `$(e.g., x_{1,\ t-1})$`, this test can be implemented.

$$
`\begin{align}
d = \sum_{t=2}^{T}(\varepsilon_t-\varepsilon_{t-1})^2 \bigg/  \sum_{t=1}^{T}(\varepsilon_t)^2
\end{align}`
$$

with `$0 \le d \le 4$`.

It can be .it[approximated] by `$2(1-\hat{\rho})$`.

😌

---

# The Durbin-Watson test

The .hi[recipe] 👩‍🍳 👨‍🍳:

1. Estimate the regression model via OLS, storing its residuals;

2. Calculate the `$d$` test statistic;

3. Based on k, the number of slope coefficients, and on n, the sample size, consult the DW table for critical values.

4. The test’s null hypothesis is of no serial correlation in the residuals. In case we reject *H0*, we
have evidence of serial correlation.

]

---

# The Durbin-Watson test

```r
okun_model %>% 
  dwtest()
```

```
#> 
#> 	Durbin-Watson test
#> 
#> data:  .
#> DW = 1.331, p-value = 1.562e-05
#> alternative hypothesis: true autocorrelation is greater than 0
```

---

# The Durbin-Watson test

```{}
. estat dwatson

Durbin–Watson d-statistic(  2,   152) =  1.330972

```

---

# The Breusch-Godfrey test

The .hi[Breusch-Godfrey] test follows a similar procedure as the Durbin-Watson test's.

Its main .it[difference] involves the .hi[auxiliary regression] estimated to find the autocorrelation coefficient, `$\rho$`. It must also include all .hi-orange[independent variables] from the original model.

$$
`\begin{align}
LM = (n-q)R^2_{\hat{\varepsilon}}
\end{align}`
$$

where `$n$` is the sample size from the original regression model;

`$q$` is the order of autocorrelation we wish to test for;

and `$R^2_{\hat{\varepsilon}}$` is the coefficient of determination from the auxiliary regression.

---

# The Breusch-Godfrey test

```r
okun_model %>% 
  bgtest(order = 1, fill = NA)
```

```
#> 
#> 	Breusch-Godfrey test for serial correlation of order up to 1
#> 
#> data:  .
#> LM test = 18.154, df = 1, p-value = 2.037e-05
```

What is our .it[inference]?

---

# The Breusch-Godfrey test

```{}
. estat bgodfrey, lag(1) nomiss0

Breusch–Godfrey LM test for autocorrelation
---------------------------------------------------------------------------
    lags(p)  |          chi2               df                 Prob > chi2
-------------+-------------------------------------------------------------
       1     |         18.154               1                   0.0000
---------------------------------------------------------------------------
                        H0: no serial correlation

```

What is our .it[inference]?

---

# The Cochrane-Orcutt estimator

From the two previous tests, we can infer that our Okun's law model suffers from .it[serial correlation].

So what do we do?

The .hi[Cochrane-Orcutt] procedure allows for the estimation of a .it[modified] version of the original regression model, allowing for serially .it[uncorelated] residuals.

---

# The Cochrane-Orcutt estimator

The .hi[recipe] 👩‍🍳 👨‍🍳:

1. Estimate the regression model via OLS, storing its residuals;

2. Estimate a first-order Markov scheme for `$\hat{u}_t$`, storing `$\hat{\rho}$`;

3. Transform the variables from the original regression into *quasi-differenced* terms, using `$\hat{\rho}$`;

4. Re-estimate the model via OLS using the quasi-differenced variables from step 4.

]

---

# The Cochrane-Orcutt estimator

Step 3: Transform the variables from the original regression into *quasi-differenced* terms, using `$\hat{\rho}$`.

]

For our Okun's law model, we have:

$$
`\begin{align}
\tilde{g}_t = g_t - \hat{\rho}g_{t-1}
\end{align}`
$$

$$
`\begin{align}
\widetilde{\Delta u}_t = \Delta u_t - \hat{\rho}\Delta u_{t-1}
\end{align}`
$$ 
--

Step 4: Re-estimate the model via OLS using the quasi-differenced variables from step 4.

]

$$
`\begin{align}
\widetilde{\Delta u}_t = \tilde{\beta}_0 + \beta_1\tilde{g}_t + e_t
\end{align}`
$$

where `$\tilde{\beta}_0 = (1-\hat{\rho})\beta_0$`.

---

# The Cochrane-Orcutt estimator

```r
library(orcutt)

summary(cochrane.orcutt(okun_model))
```

```
#> Call:
#> lm(formula = du ~ g, data = okun_data)
#> 
#> Estimate Std. Error t value Pr(>|t|)
#> (Intercept) 0.0035386 0.0510639 0.069 0.9448
#> g -0.0107379 0.0245295 -0.438 0.6622
#> 
#> Residual standard error: 0.2554 on 149 degrees of freedom
#> Multiple R-squared: 0.0013 , Adjusted R-squared: -0.0054
#> F-statistic: 0.2 on 1 and 149 DF, p-value: < 6.622e-01
#> 
#> Durbin-Watson statistic 
#> (original): 1.33097 , p-value: 1.562e-05
#> (transformed): 2.27044 , p-value: 9.558e-01
```

So what?

---

# The Cochrane-Orcutt estimator

```{}
. prais du g, corc

Cochrane–Orcutt AR(1) regression with iterated estimates

Source |       SS           df       MS      Number of obs   =       151
-------------+----------------------------------   F(1, 149)       =      0.19
       Model |    .0124954         1    .0124954   Prob > F        =    0.6622
    Residual |  9.71553772       149  .065204951   R-squared       =    0.0013
-------------+----------------------------------   Adj R-squared   =   -0.0054
       Total |  9.72803312       150  .064853554   Root MSE        =    .25535

------------------------------------------------------------------------------
          du | Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
-------------+----------------------------------------------------------------
           g |   -.010738   .0245295    -0.44   0.662    -.0592086    .0377326
       _cons |   .0035386   .0510639     0.07   0.945    -.0973644    .1044416
-------------+----------------------------------------------------------------
         rho |   .5612189
------------------------------------------------------------------------------
Durbin–Watson statistic (original)    = 1.330972
Durbin–Watson statistic (transformed) = 2.270438

```

# The Cochrane-Orcutt estimator

Now, the residuals from the .hi-blue[Cochrane-Orcutt] procedure:

---

# Next time: Serial correlation in practice

---
exclude: true