EC 421, Set 7
Prologue
Asymptotics, probability limits, and consistency
Time series
Problem set and midterm
Asymptotics and consistency
Review
Compare/contrast the concepts expected value and probability limit.
What does it mean if the estimator \(\hat{\theta}\) is consistent for \(\theta\)?
What is required for an omitted variable to bias OLS estimates of \(\beta_j\)?
Does omitted-variable bias affect the consistency of OLS for \(\beta_j\)?
What can we know about the direction of omitted-variable bias?
How does measurement error in an explanatory variable affect the OLS estimate for that variable’s effect on the outcome variable?
How does measurement error in an outcome variable affect OLS?
Time-series data
Up to this point, we focused on cross-sectional data.
Today, we focus on a different type of data: time-series data.
US monthly births, 1933–2015: Classic time-series graph
US monthly births, 1933–2015: Newfangled time-series graph
US monthly births per 30 days, 1933–2015: Newfangled time-series graph
Time-series models
Our model now looks something like
\[ \begin{align} \text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + u_t \end{align} \]
or perhaps
\[ \begin{align} \text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + \beta_2 \textcolor{#e64173}{\text{Income}_{t-1}} + u_t \end{align} \]
maybe even
\[ \begin{align} \text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + \textcolor{#e64173}{\beta_2 \text{Income}_{t-1}} + \beta_3 \textcolor{#6A5ACD}{\text{Births}_{t-1}} + u_t \end{align} \]
where \(t-1\) denotes the time period prior to \(t\) (lagged income or births).
New: Weakly persistent outcomes—essentially, \(x_{t+k}\) in the distant period \(t+k\) is weakly correlated with period \(x_t\) (when \(k\) is “big”).
\(y_t\) is a linear function of its parameters and disturbance.
There is no perfect collinearity in our data.
The \(u_t\) have conditional mean of zero (exogeneity), \(\mathop{\boldsymbol{E}}\left[ u_t \middle| X \right] = 0\).
The \(u_t\) are homoskedastic with zero correlation between \(u_t\) and \(u_s\), i.e., \(\mathop{\text{Var}} \left( u_t | X \right) = \mathop{\text{Var}} \left( u_t \right) = \sigma^2\) and \(\mathop{\text{Cor}} \left( u_t,\,u_s \middle| X \right) = 0\).
Normality of disturbances, i.e., \(u_t\overset{\text{iid}}{\sim}\mathop{N}\left( 0,\,\sigma^2 \right)\).
Time-series modeling boils down to two classes of models.
Static models: Do not allow for persistent effect.
Dynamic models: Allow for persistent effects.
Models with lagged explanatory variables
Autoregressive, distributed-lag (ADL) models
Option 1: Static models
Static models assume the outcome depends upon only the current period.
\[ \begin{align} \text{Births}_{\textcolor{#e64173}{t}} = \beta_0 + \beta_1 \text{Income}_{\textcolor{#e64173}{t}} + u_{\textcolor{#e64173}{t}} \end{align} \]
Here, we must believe that income immediately affects the number of births and does not affect on the numbers of births in the future.
We also need to believe current births do not depend upon previous births.
Can be a very restrictive way to consider time-series data.
Option 2: Dynamic models
Dynamic models allow the outcome to depend upon other periods.
Option 2a: Dynamic models with lagged explanatory variables
These models allow the outcome to depend upon the explanatory variable(s) in other periods.
\[ \begin{align} \text{Births}_{\textcolor{#e64173}{t}} = &\beta_0 + \beta_1 \text{Income}_{\textcolor{#e64173}{t}} + \beta_2 \text{Income}_{\textcolor{#6A5ACD}{t-1}} + \\ &\beta_3 \text{Income}_{\textcolor{#6A5ACD}{t-2}} + \beta_4 \text{Income}_{\textcolor{#6A5ACD}{t-3}} + u_{\textcolor{#e64173}{t}} \end{align} \]
Here, income immediately affects the number of births and affects future numbers of births.
In other words: Births today depend today’s income and lags of income—e.g., last month’s income, last year’s income, …
Estimate total effects by summing lags’ coefficients, e.g., \(\beta_1 + \beta_2 + \beta_3 + \beta_4\).
Note: We still assume current births don’t affect future births.
Option 2b: Autoregressive distributed-lag (ADL) models
These models allow the outcome to depend upon the explanatory variable(s) and/or the outcome variable in prior periods.
\[ \begin{align} \text{Births}_{\textcolor{#e64173}{t}} = \beta_0 + \beta_1 \text{Income}_{\textcolor{#e64173}{t}} + \beta_2 \text{Income}_{\textcolor{#6A5ACD}{t-1}} + \beta_3 \text{Births}_{\textcolor{#6A5ACD}{t-1}} + u_{\textcolor{#e64173}{t}} \end{align} \]
Here, current income affects affects current births and future births.
In addition, current births affect future births—we’re allowing lags of the outcome variable.
ADL models are often specified as \(\text{ADL}(\textcolor{#FFA500}{p},\,\textcolor{#e64173}{q})\), where
\(\textcolor{#FFA500}{p}\) is the (maximum) number of lags for the outcome variable.
\(\textcolor{#e64173}{q}\) is the (maximum) number of lags for explanatory variables.
Example: \(\text{ADL}(\textcolor{#FFA500}{1},\,\textcolor{#e64173}{0})\)
\[ \begin{aligned} \text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Births}_{\textcolor{#FFA500}{t-1}} + u_t \end{aligned} \]
Example: \(\text{ADL}(\textcolor{#FFA500}{2},\,\textcolor{#e64173}{2})\)
\[ \begin{aligned} \text{Births}_t = &\beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Income}_{\textcolor{#e64173}{t-1}} + \beta_3 \text{Income}_{\textcolor{#e64173}{t-2}} \\ & + \beta_4 \text{Births}_{\textcolor{#FFA500}{t-1}} + \beta_5 \text{Births}_{\textcolor{#FFA500}{t-2}} + u_t \end{aligned} \]
Due to their lags, ADL models actually estimate even more complex relationships than you might first guess.
Consider ADL(1, 0): \(\text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Births}_{t-1} + u_t\)
Write out the model for period \(t-1\):
\[ \begin{align} \text{Births}_{t-1} = \beta_0 + \beta_1 \text{Income}_{t-1} + \beta_2 \text{Births}_{t-2} + u_{t-1} \end{align} \]
which we can substitute in for \(\text{Births}_{t-1}\) in the first equation, i.e.,
\[ \begin{align} \text{Births}_t = &\beta_0 + \beta_1 \text{Income}_t + \\ &\beta_2 \underbrace{\left( \beta_0 + \beta_1 \text{Income}_{t-1} + \beta_2 \text{Births}_{t-2} + u_{t-1} \right)}_{\text{Births}_{t-1}} + u_t \end{align} \]
Continuing…
\[ \begin{align} \text{Births}_t = &\beta_0 + \beta_1 \text{Income}_t + \\ &\beta_2 \underbrace{\left( \beta_0 + \beta_1 \text{Income}_{t-1} + \beta_2 \text{Births}_{t-2} + u_{t-1} \right)}_{\text{Births}_{t-1}} + u_t \\ =& \beta_0 \left(1 + \beta_2 \right) + \beta_1 \text{Income}_t + \beta_1 \beta_2 \text{Income}_{t-1} + \\ &\beta_2^2 \text{Births}_{t-2} + u_{t} + \beta_2 u_{t-1} \end{align} \]
We could then substitute in the equation for \(\text{Births}_{t-2}\), \(\text{Births}_{t-3}\), …
Eventually we arrive at
\[ \begin{align} \text{Births}_t = &\beta_0 \left( 1 + \beta_2 + \beta_2^2 + \beta_2^3 + \cdots \right) + \\ &\beta_1 \left( \text{Income}_t + \beta_2 \text{Income}_{t-1} + \beta_2^2 \text{Income}_{t-2} + \cdots \right) +\\ & u_t + \beta_2 u_{t-1} + \beta_2^2 u_{t-2} + \cdots \end{align} \]
The point?
By including just one lag of the dependent variable—as in a ADL(1, 0)—we implicitly include for many lags of the explanatory variables and disturbances.†
1
There are times that actually want to model an individual’s desired amount, rather than her actual amount, but we are unable to observe the desired level.
Partial-adjustment models help us model this situation.
Example
We want to know how the desired number of cigarettes, \(\textcolor{#e64173}{\widetilde{\text{Cig}}_t}\) , changes with the current period’s cigarette tax, e.g.,
\[ \begin{align} \textcolor{#e64173}{\widetilde{\text{Cig}}_t} = \beta_0 + \beta_1 \text{Tax}_t + u_t \tag{A} \end{align} \]
Imagine actual cigarette consumption, \(\textcolor{#6A5ACD}{\text{Cig}_t}\), doesn’t change immediately (e.g., habit persistence). Instead, consumption depends upon current desired level and previous consumption level
\[ \begin{align} \textcolor{#6A5ACD}{\text{Cig}_t} = \lambda \textcolor{#e64173}{\widetilde{\text{Cig}}_t} + \left( 1-\lambda \right) \textcolor{#FFA500}{\text{Cig}_{t-1}} \tag{B} \end{align} \]
Example, continued
\[ \begin{align} \textcolor{#e64173}{\widetilde{\text{Cig}}_t} &= \beta_0 + \beta_1 \text{Tax}_t + u_t \tag{A} \\[0.3em] \textcolor{#6A5ACD}{\text{Cig}_t} &= \lambda \textcolor{#e64173}{\widetilde{\text{Cig}}_t} + \left( 1-\lambda \right) \textcolor{#FFA500}{\text{Cig}_{t-1}} \tag{B} \end{align} \]
Substituting \(\textcolor{#e64173}{\widetilde{\text{Cig}}_t}\) from \((\text{A})\) into \((\text{B})\) yields
\[ \begin{align} \textcolor{#6A5ACD}{\text{Cig}_t} &= \lambda \left( \beta_0 + \beta_1 \text{Tax}_t + u_t \right) + \left( 1-\lambda \right) \textcolor{#FFA500}{\text{Cig}_{t-1}} \\[0.3em] &= \lambda\beta_0 + \lambda\beta_1 \text{Tax}_t + \left( 1-\lambda \right) \textcolor{#FFA500}{\text{Cig}_{t-1}} + \lambda u_t \tag{C} \end{align} \]
The equation in \((\text{C})\) is ADL(1, 0).
We can also estimate/recover the speed-of-adjustment coefficient \(\lambda\).
OLS in time series
As before, the unbiased-ness of OLS is going to depend upon our exogeneity assumption, i.e., \(\mathop{\boldsymbol{E}}\left[ u_t \middle| X \right] = 0\).
We can split this assumption into two parts.
We need both of these parts to be true for OLS to be unbiased.
We need both parts of our exogeneity assumption for OLS to be unbiased:
\[ \begin{align} \mathop{\boldsymbol{E}}\left[ \hat{\beta}_1 \middle| X \right] &= \beta_1 + \mathop{\boldsymbol{E}}\left[ \dfrac{\sum_t \left( x_t - \overline{x} \right) u_t}{\sum_t \left( x_t - \overline{x} \right)^2} \middle| X \right] \end{align} \]
I.e., to guarantee the numerator equals zero, we need \(\mathop{\boldsymbol{E}}\left[ u_t | X \right] = 0\)—for both \(\mathop{\boldsymbol{E}}\left[ u_t | X_t \right] = 0\) and \(\mathop{\boldsymbol{E}}\left[ u_t | X_{s} \right] = 0\) \((s\neq t)\).
The second part of our exogeneity assumption—requiring that \(u_t\) is independent of all regressors in other periods—fails with dynamic models with lagged outcome variables.
Thus, OLS is biased for dynamic models with lagged outcome variables.
To see why dynamic models with lagged outcome variables violate our exogeneity assumption, consider two periods of our simple ADL(1, 0) model.
\[ \begin{align} \textcolor{#e64173}{\text{Births}_t} &= \beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Births}_{t-1} + \textcolor{#e64173}{u_t} \tag{1}\\[0.3em] \text{Births}_{t+1} &= \beta_0 + \beta_1 \text{Income}_{t+1} + \beta_2 \textcolor{#e64173}{\text{Births}_t} + u_{t+1} \tag{2} \end{align} \]
In \((1)\), \(\textcolor{#e64173}{u_t}\) clearly correlates with \(\textcolor{#e64173}{\text{Births}_t}\).
However, \(\textcolor{#e64173}{\text{Births}_t}\) is a regressor in \((2)\) (lagged dependent variable).
∴ The disturbance in \(t\) \(\left(\textcolor{#e64173}{u_t}\right)\) correlates with a regressor in \(t+1\) \(\left(\textcolor{#e64173}{\text{Births}_t}\right)\).
This correlation violates the second part of our exogeneity requirement.
All is not lost.
For OLS to be consistent, we only need contemporaneous exogeneity.
Contemporaneous exogeneity: each disturbance is uncorrelated with the explanatory variables in the same period, i.e.,
\[ \begin{align} \mathop{\boldsymbol{E}}\left[ u_t \middle| X_t \right] = 0 \end{align} \]
With contemporaneous exogeneity, OLS estimates for the coefficients in a time series model are consistent.
To see why OLS is consistent with contemporaneous exogeneity, consider the OLS estimate for \(\beta_1\) in
\[ \begin{align} \text{Births}_t &= \beta_0 + \beta_1 \text{Births}_{t-1} + u_t \end{align} \]
which we’ve shown (a few times) can be written
\[ \begin{align} \hat{\beta}_1 &= \beta_1 + \dfrac{\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)u_t}{\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)^2} \end{align} \]
\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 &= \mathop{\text{plim}} \left( \beta_1 + \dfrac{\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)u_t}{\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)^2} \right) \\[0.3em] &= \beta_1 + \dfrac{\mathop{\text{plim}} \left[\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)u_t/T\right]}{\mathop{\text{plim}} \left[\sum_t \left( \text{Births}_{t-1} - \overline{\text{Births} } \right)^2/T\right]} \\[0.3em] &= \beta_1 + \dfrac{\textcolor{#e64173}{\mathop{\text{Cov}} \left( \text{Births}_{t-1},\, u_t \right)}}{\mathop{\text{Var}} \left( \text{Births}_{t} \right)} \end{align} \]
\(\hspace{8em}=\beta_1\hspace{1em}\) if \(\textcolor{#e64173}{\mathop{\text{Cov}} \left( \text{Births}_{t-1},\, u_t \right)=0}\)
Contemporaneous exogeneity gives us \(\textcolor{#e64173}{\mathop{\text{Cov}} \left( \text{Births}_{t-1},\, u_t \right)=0}\).
Thus, if we assume contemporaneous exogeneity, OLS is consistent for the coefficients, even for models with lagged dependent variables.
The end.
ADL models also offer interesting insights for long-run/equilibrium effects. \[ \begin{aligned} \text{Births}_t = \beta_0 + \textcolor{#e64173}{\beta_1} \text{Income}_t + \beta_2 \text{Births}_{t-1} + u_t \end{aligned} \]
In this ADL(1, 0) model, \(\beta_1\) gives the short-run effect of income on the number of births.
I.e., how income in time \(t\) affects births in time \(t\).
Starting with
\[ \begin{aligned} \text{Births}_t = \beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Births}_{t-1} + u_t \end{aligned} \]
we move into equilibrium, i.e., \(\text{Births}_t=\text{Births}^\star\), i.e.,
\[ \begin{aligned} \text{Births}^\star &= \beta_0 + \beta_1 \text{Income}^\star + \beta_2 \text{Births}^\star \end{aligned} \]
Now rearrange…
\[ \begin{align} \text{Births}^\star - \beta_2 \text{Births}^\star &= \beta_0 + \beta_1 \text{Income}^\star \\ \left(1 - \beta_2\right) \text{Births}^\star &= \beta_0 + \beta_1 \text{Income}^\star \\ \text{Births}^\star &= \dfrac{\beta_0}{\left(1 - \beta_2\right)} + \dfrac{\beta_1}{\left(1 - \beta_2\right)} \text{Income}^\star \end{align} \]
Short-run effect of income on births: \[ \begin{aligned} \text{Births}_t = \beta_0 + \textcolor{#e64173}{\beta_1} \text{Income}_t + \beta_2 \text{Births}_{t-1} + u_t \end{aligned} \]
Long-run effect of income on births: \[ \begin{align} \text{Births}^\star = \dfrac{\beta_0}{\left(1 - \beta_2\right)} + \textcolor{#6A5ACD}{\dfrac{\beta_1}{\left(1 - \beta_2\right)}} \text{Income}^\star \end{align} \]
Another way to see this result:
We already showed \[ \begin{align} \text{Births}_t =& \beta_0 + \beta_1 \text{Income}_t + \beta_2 \text{Births}_{t-1} \end{align} \]
gives us
\[ \begin{align} \text{Births}_t = &\beta_0 \left( 1 + \beta_2 + \beta_2^2 + \beta_2^3 + \cdots \right) + \\ &\beta_1 \left( \text{Income}_t + \beta_2 \text{Income}_{t-1} + \beta_2^2 \text{Income}_{t-2} + \cdots \right) +\\ & u_t + \beta_2 u_{t-1} + \beta_2^2 u_{t-2} + \cdots \end{align} \]
In equilibrium: \(\text{Income}_t=\text{Income}_{t-k}=\text{Income}^\star\) for all \(k\).
Substituting \(\text{Income}_{t}=\text{Income}^\star\) for all \(k\)
(and assuming no disturbances in equilibrium):
\[ \begin{align} \text{Births}_t = &\beta_0 \left( 1 + \beta_2 + \beta_2^2 + \beta_2^3 + \cdots \right) + \\ &\beta_1 \left( \text{Income}^\star + \beta_2 \text{Income}^\star + \beta_2^2 \text{Income}^\star + \cdots \right) +\\ \end{align} \]
count: false
Substituting \(\text{Income}_{t}=\text{Income}^\star\) for all \(k\)
(and assuming no disturbances in equilibrium): \[
\begin{align}
\text{Births}_t =
&\beta_0 \left( 1 + \beta_2 + \beta_2^2 + \beta_2^3 + \cdots \right) + \\
&\beta_1 \left( \text{Income}^\star + \beta_2 \text{Income}^\star + \beta_2^2 \text{Income}^\star + \cdots \right) +\\
= &\beta_0 \left( \dfrac{1}{\beta_2} \right) + \\
&\beta_1 \left( \dfrac{1}{\beta_2} \right) \text{Income}^\star
\end{align}
\]
So long as \(-1<\beta_2<1\).†
1