Asymptotics and consistency

EC 421, Set 6

Prologue

Schedule

Last Time

Living with heteroskedasticity

Today

Asymptotics and consistency

Next

  • New problem set
  • Next topic: Time series
  • Midterm on Feb. 12

R showcase

Need speed? R allows essentially infinite parallelization.

Three popular packages:

And here’s a nice tutorial.

Consistency

Consistency

Welcome to asymptopia

Previously: We examined estimators (e.g., \(\hat{\beta}_j\)) and their properties using

  1. The mean of the estimator’s distribution: \(\mathop{\boldsymbol{E}}\left[ \hat{\beta}_j \right] = ?\)
  2. The variance of the estimator’s distribution: \(\mathop{\text{Var}} \left( \hat{\beta}_j \right) = ?\)

which tell us about the tendency of the estimator if we took ∞ samples, each with sample size \(\textcolor{#e64173}{n}\).

This approach misses something.

Consistency

Welcome to asymptopia

New question:
How does our estimator behave as our sample gets larger (as \(n\rightarrow\infty\))?

This new question forms a new way to think about the properties of estimators: asymptotic properties (or large-sample properties).

A “good” estimator will become indistinguishable from the parameter it estimates when \(n\) is very large (close to \(\infty\)).

Consistency

Probability limits

Just as the expected value helped us characterize the finite-sample distribution of an estimator with sample size \(n\),

the probability limit helps us analyze the asymptotic distribution of an estimator (the distribution of the estimator as \(n\) gets “big”).


Consistency

Probability limits

Let \(B_n\) be our estimator with sample size \(n\).

Then the probability limit of \(B_n\) is \(\alpha\) if

\[ \lim_{n\rightarrow\infty} \mathop{P}\left( \left| B_n - \alpha \right| > \epsilon \right) = 0 \tag{1} \]

for any \(\epsilon > 0\).

The definition in \((1)\) essentially says that as the sample size approaches infinity, the probability that \(B_n\) differs from \(\alpha\) by more than a very small number \((\epsilon)\) is zero.

Practically: \(B\)’s distribution collapses to a spike at \(\alpha\) as \(n\) approaches \(\infty\).
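A quick simulation sketch (assumed setup, not from the slides) makes the “collapse to a spike” concrete: for the sample mean of \(N(\mu,\,1)\) draws, the probability of landing more than \(\epsilon\) away from \(\mu\) shrinks toward zero as \(n\) grows.

```python
# Monte Carlo estimate of P(|sample mean - mu| > eps) at several n.
# Illustrative values: mu = 5, eps = 0.1.
import numpy as np

rng = np.random.default_rng(421)
mu, eps, reps = 5.0, 0.1, 1_000

def prob_outside(n):
    """Estimate P(|mean of n draws - mu| > eps) from `reps` repetitions."""
    means = rng.normal(mu, 1.0, size=(reps, n)).mean(axis=1)
    return np.mean(np.abs(means - mu) > eps)

probs = {n: prob_outside(n) for n in (10, 100, 1_000, 10_000)}
print(probs)  # the probabilities shrink toward zero as n grows
```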

Consistency

Probability limits

Equivalent statements:

  • The probability limit of \(B_n\) is \(\alpha\).

  • \(\text{plim}\: B_n = \alpha\)

  • \(B_n\) converges in probability to \(\alpha\).

Consistency

Probability limits

Probability limits have some nice/important properties:

  • \(\mathop{\text{plim}}\left( X \times Y \right) = \mathop{\text{plim}}\left( X \right) \times \mathop{\text{plim}}\left( Y \right)\)

  • \(\mathop{\text{plim}}\left( X + Y \right) = \mathop{\text{plim}}\left( X \right) + \mathop{\text{plim}}\left( Y \right)\)

  • \(\mathop{\text{plim}}\left( c \right) = c\), where \(c\) is a constant

  • \(\mathop{\text{plim}}\left( \dfrac{X}{Y} \right) = \dfrac{\mathop{\text{plim}}\left( X \right)}{ \mathop{\text{plim}}\left( Y \right)}\), provided \(\mathop{\text{plim}}\left( Y \right) \neq 0\)

  • \(\mathop{\text{plim}}\!\big( f(X) \big) = \mathop{f}\!\big(\mathop{\text{plim}}\left( X \right)\big)\) for continuous \(f\)

Consistency

Consistent estimators

We say that an estimator is consistent if

  1. The estimator has a prob. limit (its distribution collapses to a spike).

  2. This spike is located at the parameter being estimated.

In other words…

An estimator is consistent if its asymptotic distribution collapses to a spike located at the estimated parameter.

In math: The estimator \(B\) is consistent for \(\alpha\) if \(\mathop{\text{plim}} B = \alpha\).

The estimator is inconsistent if \(\mathop{\text{plim}} B \neq \alpha\).

Consistency

Consistent estimators

Example: We want to estimate the population mean \(\mu_x\) (where \(X \sim \text{Normal}\)).

Let’s compare the asymptotic distributions of three competing estimators:

  1. The first observation: \(X_{1}\)
  2. The sample mean: \(\overline{X} = \dfrac{1}{n} \sum_{i=1}^n x_i\)
  3. Some other estimator: \(\widetilde{X} = \dfrac{1}{n+1} \sum_{i=1}^n x_i\)

Note that (1) and (2) are unbiased, but (3) is biased.
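Before looking at the math, a simulation sketch (assumed setup, not the course’s code) previews the punchline: the spread of \(X_1\) never shrinks, while the distributions of \(\overline{X}\) and \(\widetilde{X}\) both collapse as \(n\) grows.

```python
# Compare the sampling spread of X1, the sample mean, and
# X-tilde = sum(x) / (n + 1) at increasing sample sizes.
import numpy as np

rng = np.random.default_rng(421)
mu, reps = 5.0, 5_000

def estimator_sd(n):
    x = rng.normal(mu, 1.0, size=(reps, n))
    x1 = x[:, 0]                      # first observation
    xbar = x.mean(axis=1)             # sample mean
    xtilde = x.sum(axis=1) / (n + 1)  # biased competitor
    return x1.std(), xbar.std(), xtilde.std()

for n in (2, 30, 1_000):
    print(n, estimator_sd(n))
# X1's spread stays near 1; the other two shrink toward a spike
```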

Consistency

Consistent estimators

To see which are unbiased/biased:

\(\mathop{\boldsymbol{E}}\left[ X_1 \right] = \mu_x\)

\[ \mathop{\boldsymbol{E}}\left[ \overline{X} \right] = \mathop{\boldsymbol{E}}\left[ \dfrac{1}{n} \sum_{i=1}^n x_i \right] = \dfrac{1}{n} \sum_{i=1}^n \mathop{\boldsymbol{E}}\left[ x_i \right] = \dfrac{1}{n} \sum_{i=1}^n \mu_x = \mu_x \]

\[ \mathop{\boldsymbol{E}}\left[ \widetilde{X} \right] = \mathop{\boldsymbol{E}}\left[ \dfrac{1}{n+1} \sum_{i=1}^n x_i \right] = \dfrac{1}{n+1} \sum_{i=1}^n \mathop{\boldsymbol{E}}\left[ x_i \right] = \dfrac{n}{n+1}\mu_x \]

Consistency

Distributions of \(\textcolor{#FFA500}{X_1}\), \(\textcolor{#e64173}{\overline{X}}\), and \(\textcolor{#314f4f}{\widetilde{X}}\)

(Figures omitted.) Plots for \(n \in \{2,\, 5,\, 10,\, 30,\, 50,\, 100,\, 500,\, 1000\}\): the distribution of \(\textcolor{#FFA500}{X_1}\) does not change with \(n\), while the distributions of \(\textcolor{#e64173}{\overline{X}}\) and \(\textcolor{#314f4f}{\widetilde{X}}\) collapse into spikes at \(\mu_x\).

Consistency

The distributions of \(\textcolor{#314f4f}{\widetilde{X}}\)
For \(n\) in \(\{\textcolor{#FCCE25}{2},\, \textcolor{#F89441}{5},\, \textcolor{#E16462}{10},\, \textcolor{#BF3984}{50},\, \textcolor{#900DA4}{100},\, \textcolor{#5601A4}{500},\, \textcolor{#0D0887}{1000}\}\) (figure omitted): the spike shifts toward \(\mu_x\) as the bias shrinks.

Consistency

The takeaway?

  • An estimator can be unbiased without being consistent (e.g., \(\textcolor{#FFA500}{X_1}\)).

  • An estimator can be unbiased and consistent (e.g., \(\textcolor{#e64173}{\overline{X}}\)).

  • An estimator can be biased but consistent (e.g., \(\textcolor{#314f4f}{\widetilde{X}}\)).

  • An estimator can be biased and inconsistent (e.g., \(\overline{X} - 50\)).

Best-case scenario: The estimator is unbiased and consistent.

Consistency

Why consistency (asymptotics)?

  1. We cannot always find an unbiased estimator. In these situations, we generally (at least) want consistency.

  2. Expected values can be hard/undefined. Probability limits are less constrained, e.g., \[ \mathop{\boldsymbol{E}}\left[ g(X)h(Y) \right] \text{ vs. } \mathop{\text{plim}}\left( g(X)h(Y) \right) \]

  3. Asymptotics help us move away from assuming the distribution of \(u_i\).


Caution: As we saw, consistent estimators can be biased in small samples.
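For instance, \(\widetilde{X}\)’s bias vanishes as \(n\) grows, even though it is nonzero at every finite \(n\):

\[ \mathop{\text{Bias}}\left( \widetilde{X} \right) = \dfrac{n}{n+1}\mu_x - \mu_x = -\dfrac{\mu_x}{n+1} \rightarrow 0 \quad \text{as } n\rightarrow\infty \]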

OLS in asymptopia

OLS in asymptopia

OLS has two very nice asymptotic properties:

  1. Consistency

  2. Asymptotic Normality

Let’s prove #1 for OLS with simple, linear regression, i.e.,

\[ y_i = \beta_0 + \beta_1 x_i + u_i \]

OLS in asymptopia

Proof of consistency

First, recall our previous derivation of \(\hat{\beta}_1\),

\[ \hat{\beta}_1 = \beta_1 + \dfrac{\sum_i \left( x_i - \overline{x} \right) u_i}{\sum_i \left( x_i - \overline{x} \right)^2} \]

Now multiply the numerator and denominator by \(1/n\)

\[ \hat{\beta}_1 = \beta_1 + \dfrac{\frac{1}{n} \sum_i \left( x_i - \overline{x} \right) u_i}{\frac{1}{n}\sum_i \left( x_i - \overline{x} \right)^2} \]

OLS in asymptopia

Proof of consistency

We actually want to know the probability limit of \(\hat{\beta}_1\), so

\[ \mathop{\text{plim}} \hat{\beta}_1 = \mathop{\text{plim}}\left(\beta_1 + \dfrac{\frac{1}{n} \sum_i \left( x_i - \overline{x} \right) u_i}{\frac{1}{n}\sum_i \left( x_i - \overline{x} \right)^2} \right) \]

which, by the properties of probability limits, gives us

\[ = \beta_1 + \dfrac{\mathop{\text{plim}}\left(\frac{1}{n} \sum_i \left( x_i - \overline{x} \right) u_i \right)}{\mathop{\text{plim}}\left(\frac{1}{n}\sum_i \left( x_i - \overline{x} \right)^2 \right)} \]

By the law of large numbers, the numerator and denominator converge in probability to population quantities

\[ = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x,\, u \right)}{\mathop{\text{Var}} \left( x \right)} \]

OLS in asymptopia

Proof of consistency

So we have

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x,\, u \right)}{\mathop{\text{Var}} \left( x \right)} \]

By our assumption of exogeneity (plus the law of total expectation)

\[ \mathop{\text{Cov}} \left( x,\,u \right) = 0 \]

Combining these two equations yields

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{0}{\mathop{\text{Var}} \left( x \right)} = \beta_1 \quad\text{🤓} \]

so long as \(\mathop{\text{Var}} \left( x \right) \neq 0\) (which we’ve assumed).
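A simulation sketch (with an assumed data-generating process; \(\beta_1 = 2\) is illustrative) shows the result in action: under exogeneity, the OLS slope tightens around the true \(\beta_1\) as \(n\) grows.

```python
# OLS slope from simulated data where the disturbance is exogenous.
import numpy as np

rng = np.random.default_rng(421)
b0, b1 = 1.0, 2.0  # assumed true parameters

def ols_slope(n):
    x = rng.normal(0, 1, n)
    u = rng.normal(0, 1, n)  # independent of x: exogeneity holds
    y = b0 + b1 * x + u
    xd = x - x.mean()
    return (xd @ (y - y.mean())) / (xd @ xd)

for n in (50, 5_000, 500_000):
    print(n, ols_slope(n))  # estimates tighten around the true slope
```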

OLS in asymptopia

Asymptotic normality

Up to this point, we made a very specific assumption about the distribution of \(u_i\)—the \(u_i\) came from a normal distribution.

We can relax this assumption—allowing the \(u_i\) to come from any distribution (still assume exogeneity, independence, and homoskedasticity).

We will focus on the asymptotic distribution of our estimators (how they are distributed as \(n\) gets large), rather than their finite-sample distribution.

As \(n\) approaches \(\infty\), the distribution of the OLS estimator converges to a normal distribution.
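A simulation sketch (assumed DGP with skewed, decidedly non-normal disturbances) illustrates the claim: the standardized OLS slope behaves like a standard normal for large \(n\), so roughly 95% of standardized estimates fall inside \(\pm 1.96\).

```python
# Standardized OLS slope with mean-zero exponential (skewed) errors.
import numpy as np

rng = np.random.default_rng(421)
b1, n, reps = 2.0, 1_000, 2_000  # illustrative values

def standardized_slope():
    x = rng.normal(0, 1, n)
    u = rng.exponential(1.0, n) - 1.0  # skewed, mean zero, variance 1
    y = 1.0 + b1 * x + u
    xd = x - x.mean()
    slope = (xd @ (y - y.mean())) / (xd @ xd)
    se = 1.0 / np.sqrt(xd @ xd)        # using the known error sd of 1
    return (slope - b1) / se

z = np.array([standardized_slope() for _ in range(reps)])
coverage = np.mean(np.abs(z) < 1.96)
print(coverage)  # close to 0.95 despite the non-normal errors
```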

OLS in asymptopia

Recap

With a more limited set of assumptions, OLS is consistent and is asymptotically normally distributed.

Current assumptions

  1. Our data were randomly sampled from the population.
  2. \(y_i\) is a linear function of its parameters and disturbance.
  3. There is no perfect collinearity in our data.
  4. The \(u_i\) have conditional mean of zero (exogeneity), \(\mathop{\boldsymbol{E}}\left[ u_i \middle| X_i \right] = 0\).
  5. The \(u_i\) are homoskedastic with zero correlation between \(u_i\) and \(u_j\).

Omitted-variable bias, redux

Omitted-variable bias, redux

Inconsistency?

Imagine we have a population whose true model is

\[ \begin{align} y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + u_i \tag{2} \end{align} \]

Recall: Omitted-variable bias occurs when we omit a variable from our linear regression model (e.g., leaving out \(x_2\)) such that

  1. \(x_{2}\) affects \(y\), i.e., \(\beta_2 \neq 0\).
  2. \(x_{2}\) correlates with an included explanatory variable, i.e., \(\mathop{\text{Cov}} \left( x_1,\, x_2 \right) \neq 0\).

Omitted-variable bias, redux

Inconsistency?

Imagine we have a population whose true model is

\[ \begin{align} y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + u_i \tag{2} \end{align} \]

Recall: We defined the bias of an estimator \(W\) for parameter \(\theta\) as

\[ \mathop{\text{Bias}}_\theta \left( W \right) = \mathop{\boldsymbol{E}}\left[ W \right] - \theta \]

Omitted-variable bias, redux

Inconsistency?

Imagine we have a population whose true model is

\[ \begin{align} y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + u_i \tag{2} \end{align} \]

We know that omitted-variable bias causes biased estimates.

Question: Do omitted variables also cause inconsistent estimates?

To answer it, we find \(\mathop{\text{plim}} \hat{\beta}_1\) in a regression that omits \(x_2\).

Omitted-variable bias, redux

Inconsistency?

Imagine we have a population whose true model is

\[ \begin{align} y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + u_i \tag{2} \end{align} \]

but we instead specify the model as

\[ \begin{align} y_i = \beta_0 + \beta_1 x_{1i} + w_i \tag{3} \end{align} \]

where \(w_i = \beta_2 x_{2i} + u_i\).

We estimate \((3)\) via OLS

\[ \begin{align} y_i = \hat{\beta}_0 + \hat{\beta}_1 x_{1i} + \hat{w}_i \tag{4} \end{align} \]

Our question: Is \(\hat{\beta}_1\) consistent for \(\beta_1\) when we omit \(x_2\)?

\[ \mathop{\text{plim}}\left( \hat{\beta}_1 \right) \overset{?}{=} \beta_1 \]

Omitted-variable bias, redux

Inconsistency?

Truth: \(y_i = \beta_0 + \beta_1 x_{1i} + \beta_2 x_{2i} + u_i\)

Specified: \(y_i = \beta_0 + \beta_1 x_{1i} + w_i\)

We already showed \(\mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x_1,\, w \right)}{\mathop{\text{Var}} \left( x_1 \right)}\)

where \(w\) is the disturbance.

Here, we know \(w = \beta_2 x_2 + u\). Thus,

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x_1,\, \beta_2 x_2 + u \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

Now, we make use of \(\mathop{\text{Cov}} \left( X,\, Y + Z \right) = \mathop{\text{Cov}} \left( X,\, Y \right) + \mathop{\text{Cov}} \left( X,\, Z \right)\)

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x_1,\, \beta_2 x_2 \right) + \mathop{\text{Cov}} \left( x_1,\, u \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

Omitted-variable bias, redux

Inconsistency?

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x_1,\, \beta_2 x_2 \right) + \mathop{\text{Cov}} \left( x_1,\, u \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

Now we use the fact that \(\mathop{\text{Cov}} \left( X,\, cY \right) = c\mathop{\text{Cov}} \left( X,\,Y \right)\) for a constant \(c\).

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\beta_2 \mathop{\text{Cov}} \left( x_1,\, x_2 \right) + \mathop{\text{Cov}} \left( x_1,\, u \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

As before, our exogeneity (conditional mean zero) assumption implies \(\mathop{\text{Cov}} \left( x_1,\, u \right) = 0\), which gives us

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\beta_2 \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

Omitted-variable bias, redux

Inconsistency?

Thus, we find that

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

In other words, an omitted variable will cause OLS to be inconsistent if both of the following statements are true:

  1. The omitted variable affects our outcome, i.e., \(\beta_2 \neq 0\).

  2. The omitted variable correlates with included explanatory variables, i.e., \(\mathop{\text{Cov}} \left( x_1,\,x_2 \right) \neq 0\).

If both of these statements are true, then the OLS estimate \(\hat{\beta}_1\) will not converge to \(\beta_1\), even as \(n\) approaches \(\infty\).
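A simulation sketch (assumed DGP; the numbers are illustrative) shows the estimator converging to the wrong place when both conditions hold:

```python
# Omit x2 from the regression; the slope on x1 converges to
# beta_1 + beta_2 * Cov(x1, x2) / Var(x1), not to beta_1.
import numpy as np

rng = np.random.default_rng(421)
b1, b2, n = 2.0, 3.0, 1_000_000  # illustrative true parameters

x1 = rng.normal(0, 1, n)
x2 = 0.5 * x1 + rng.normal(0, 1, n)  # Cov(x1, x2) = 0.5; Var(x1) = 1
u = rng.normal(0, 1, n)
y = 1.0 + b1 * x1 + b2 * x2 + u

xd = x1 - x1.mean()
slope = (xd @ (y - y.mean())) / (xd @ xd)
print(slope)  # near b1 + b2 * 0.5 = 3.5, not the true 2.0
```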

Omitted-variable bias, redux

Signing the bias

Sometimes we’re stuck with omitted-variable bias.

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

When this happens, we can often at least know the direction of the inconsistency.

Omitted-variable bias, redux

Signing the bias

Begin with

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

We know \(\textcolor{#20B2AA}{\mathop{\text{Var}} \left( x_1 \right) > 0}\). Suppose \(\textcolor{#e64173}{\beta_2 > 0}\) and \(\textcolor{#FFA500}{\mathop{\text{Cov}} \left( x_1,\,x_2 \right) > 0}\). Then

\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \textcolor{#e64173}{(+)} \dfrac{\textcolor{#FFA500}{(+)}}{\textcolor{#20B2AA}{(+)}} \implies \mathop{\text{plim}} \hat{\beta}_1 > \beta_1 \end{align} \] ∴ In this case, OLS is biased upward (estimates are too large).

\[ \begin{matrix} \enspace & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)> 0} & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)< 0} \\ \textcolor{#e64173}{\beta_2 > 0} & \text{Upward} & \\ \textcolor{#e64173}{\beta_2 < 0} & & \end{matrix} \]

Omitted-variable bias, redux

Signing the bias

Begin with

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

We know \(\textcolor{#20B2AA}{\mathop{\text{Var}} \left( x_1 \right) > 0}\). Suppose \(\textcolor{#e64173}{\beta_2 < 0}\) and \(\textcolor{#FFA500}{\mathop{\text{Cov}} \left( x_1,\,x_2 \right) > 0}\). Then

\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \textcolor{#e64173}{(-)} \dfrac{\textcolor{#FFA500}{(+)}}{\textcolor{#20B2AA}{(+)}} \implies \mathop{\text{plim}} \hat{\beta}_1 < \beta_1 \end{align} \] ∴ In this case, OLS is biased downward (estimates are too small).

\[ \begin{matrix} \enspace & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)> 0} & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)< 0} \\ \textcolor{#e64173}{\beta_2 > 0} & \text{Upward} & \\ \textcolor{#e64173}{\beta_2 < 0} & \text{Downward} & \end{matrix} \]

Omitted-variable bias, redux

Signing the bias

Begin with

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

We know \(\textcolor{#20B2AA}{\mathop{\text{Var}} \left( x_1 \right) > 0}\). Suppose \(\textcolor{#e64173}{\beta_2 > 0}\) and \(\textcolor{#FFA500}{\mathop{\text{Cov}} \left( x_1,\,x_2 \right) < 0}\). Then

\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \textcolor{#e64173}{(+)} \dfrac{\textcolor{#FFA500}{(-)}}{\textcolor{#20B2AA}{(+)}} \implies \mathop{\text{plim}} \hat{\beta}_1 < \beta_1 \end{align} \] ∴ In this case, OLS is biased downward (estimates are too small).

\[ \begin{matrix} \enspace & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)> 0} & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)< 0} \\ \textcolor{#e64173}{\beta_2 > 0} & \text{Upward} & \text{Downward} \\ \textcolor{#e64173}{\beta_2 < 0} & \text{Downward} & \end{matrix} \]

Omitted-variable bias, redux

Signing the bias

Begin with

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \beta_2 \dfrac{ \mathop{\text{Cov}} \left( x_1,\, x_2 \right)}{\mathop{\text{Var}} \left( x_1 \right)} \]

We know \(\textcolor{#20B2AA}{\mathop{\text{Var}} \left( x_1 \right) > 0}\). Suppose \(\textcolor{#e64173}{\beta_2 < 0}\) and \(\textcolor{#FFA500}{\mathop{\text{Cov}} \left( x_1,\,x_2 \right) < 0}\). Then

\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \textcolor{#e64173}{(-)} \dfrac{\textcolor{#FFA500}{(-)}}{\textcolor{#20B2AA}{(+)}} \implies \mathop{\text{plim}} \hat{\beta}_1 > \beta_1 \end{align} \] ∴ In this case, OLS is biased upward (estimates are too large).

\[ \begin{matrix} \enspace & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)> 0} & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)< 0} \\ \textcolor{#e64173}{\beta_2 > 0} & \text{Upward} & \text{Downward} \\ \textcolor{#e64173}{\beta_2 < 0} & \text{Downward} & \text{Upward} \end{matrix} \]

Omitted-variable bias, redux

Signing the bias

Thus, in cases where we have a sense of

  1. the sign of \(\mathop{\text{Cov}} \left( x_1,\,x_2 \right)\)

  2. the sign of \(\beta_2\)

we know in which direction inconsistency pushes our estimates.

Direction of bias

\[ \begin{matrix} \enspace & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)> 0} & \textcolor{#FFA500}{\text{Cov}(x_1,\,x_2)< 0} \\ \textcolor{#e64173}{\beta_2 > 0} & \text{Upward} & \text{Downward} \\ \textcolor{#e64173}{\beta_2 < 0} & \text{Downward} & \text{Upward} \end{matrix} \]
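A quick numeric illustration (the numbers are hypothetical): if \(\beta_2 = 2\), \(\mathop{\text{Cov}} \left( x_1,\,x_2 \right) = 1\), and \(\mathop{\text{Var}} \left( x_1 \right) = 4\), then

\[ \mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + 2 \times \dfrac{1}{4} = \beta_1 + 0.5 \]

so OLS converges to a value 0.5 above \(\beta_1\): upward bias, matching the table’s top-left cell.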

Measurement error

Measurement error in our explanatory variables presents another case in which OLS is inconsistent.

Consider the population model: \(y_i = \beta_0 + \beta_1 z_i + u_i\)

  • We want to observe \(z_i\) but cannot.

  • Instead, we measure the variable \(x_i\), which is \(z_i\) plus some error (noise): \[ x_i = z_i + \omega_i \]

  • Assume \(\mathop{\boldsymbol{E}}\left[ \omega_i \right] = 0\), \(\mathop{\text{Var}} \left( \omega_i \right) = \sigma^2_\omega\), and \(\omega\) is independent of \(z\) and \(u\).


OLS regression of \(y\) on \(x\) will produce inconsistent estimates of \(\beta_1\).

Measurement error

Proof

\[ \begin{aligned} y_i &= \beta_0 + \beta_1 z_i + u_i \\ &= \beta_0 + \beta_1 \left( x_i - \omega_i \right) + u_i \\ &= \beta_0 + \beta_1 x_i + \left( u_i - \beta_1 \omega_i \right) \\ &= \beta_0 + \beta_1 x_i + \varepsilon_i \end{aligned} \]

where \(\varepsilon_i = u_i - \beta_1 \omega_i\)

What happens when we estimate \(y_i = \hat{\beta}_0 + \hat{\beta}_1 x_i + e_i\)?

\(\mathop{\text{plim}} \hat{\beta}_1 = \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x,\,\varepsilon \right)}{\mathop{\text{Var}} \left( x \right)}\)

We will derive the numerator and denominator separately…

Measurement error

Proof

The covariance of our noisy variable \(x\) and the disturbance \(\varepsilon\):

\[ \begin{aligned} \mathop{\text{Cov}} \left( x,\, \varepsilon \right) &= \mathop{\text{Cov}} \left( \left[ z + \omega \right],\, \left[ u - \beta_1 \omega \right] \right) \\ &= \mathop{\text{Cov}} \left( z,\,u \right) - \beta_1 \mathop{\text{Cov}} \left( z,\,\omega \right) + \mathop{\text{Cov}} \left( \omega,\, u \right) - \beta_1 \mathop{\text{Var}} \left( \omega \right) \\ &= 0 + 0 + 0 - \beta_1 \sigma_\omega^2 \\ &= - \beta_1 \sigma_\omega^2 \end{aligned} \]
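A quick numeric check by simulation (assumed values \(\beta_1 = 2\) and \(\sigma_\omega^2 = 0.25\); illustrative only):

```python
# Verify Cov(x, eps) = -beta_1 * sigma_omega^2 numerically.
import numpy as np

rng = np.random.default_rng(421)
b1, n = 2.0, 1_000_000

z = rng.normal(0, 1, n)
omega = rng.normal(0, 0.5, n)  # measurement error with variance 0.25
u = rng.normal(0, 1, n)

x = z + omega                  # what we actually observe
eps = u - b1 * omega           # disturbance in the misspecified model
print(np.cov(x, eps)[0, 1])   # near -2 * 0.25 = -0.5
```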

Measurement error

Proof

Now for the denominator, \(\mathop{\text{Var}} \left( x \right)\).

\[ \begin{aligned} \mathop{\text{Var}} \left( x \right) &= \mathop{\text{Var}} \left( z + \omega \right) \\ &= \mathop{\text{Var}} \left( z \right) + \mathop{\text{Var}} \left( \omega \right) + 2\mathop{\text{Cov}} \left( z,\,\omega \right) \\ &= \sigma_z^2 + \sigma_\omega^2 \end{aligned} \]

(The covariance term drops out because \(\omega\) is independent of \(z\).)

Measurement error

Proof

Putting the numerator and denominator back together,

\[ \begin{align} \mathop{\text{plim}} \hat{\beta}_1 &= \beta_1 + \dfrac{\mathop{\text{Cov}} \left( x,\,\varepsilon \right)}{\mathop{\text{Var}} \left( x \right)} \\ &= \beta_1 + \dfrac{-\beta_1 \sigma_\omega^2}{\sigma_z^2 + \sigma_\omega^2} \\ &= \beta_1 - \beta_1 \dfrac{\sigma_\omega^2}{\sigma_z^2 + \sigma_\omega^2} \\ &= \beta_1 \dfrac{\sigma_z^2 + \sigma_\omega^2}{\sigma_z^2 + \sigma_\omega^2} - \beta_1 \dfrac{\sigma_\omega^2}{\sigma_z^2 + \sigma_\omega^2} \\ &= \beta_1 \dfrac{\sigma_z^2}{\sigma_z^2 + \sigma_\omega^2} \end{align} \]

Measurement error

Summary

\(\mathop{\text{plim}} \hat{\beta}_1 = \beta_1 \dfrac{\sigma_z^2}{\sigma_z^2 + \sigma_\omega^2}\).

What does this equation tell us?

Measurement error in our explanatory variables biases the coefficient estimates toward zero.

  • This type of bias/inconsistency is often called attenuation bias.

  • If the measurement error correlates with the explanatory variables, we have bigger problems with inconsistency/bias.
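A simulation sketch of attenuation bias (assumed values \(\sigma_z^2 = \sigma_\omega^2 = 1\) and \(\beta_1 = 2\); illustrative only): OLS on the noisy regressor recovers only half of \(\beta_1\), exactly as \(\sigma_z^2 / (\sigma_z^2 + \sigma_\omega^2) = 1/2\) predicts.

```python
# Attenuation bias: regress y on a noisy measurement of z.
import numpy as np

rng = np.random.default_rng(421)
b1, n = 2.0, 1_000_000

z = rng.normal(0, 1, n)                # true regressor (unobserved)
x = z + rng.normal(0, 1, n)            # observed: z plus noise
y = 1.0 + b1 * z + rng.normal(0, 1, n)

xd = x - x.mean()
slope = (xd @ (y - y.mean())) / (xd @ xd)
print(slope)  # near b1 * 1 / (1 + 1) = 1.0, pulled toward zero
```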

Measurement error

Summary

What about measurement error in the outcome variable?

It matters much less: classical measurement error in \(y\) does not bias our estimates; it just inflates our standard errors.

Measurement error

It’s everywhere

General cases

  1. We cannot perfectly observe a variable.
  2. We use one variable as a proxy for another.

Specific examples

  • GDP
  • Population
  • Crime/police statistics
  • Air quality
  • Health data
  • Proxy ability with test scores