class: center, middle, inverse, title-slide # Simple Linear Regression: Inference ## EC 320: Introduction to Econometrics ### Winter 2022 --- class: inverse, middle # Prologue --- # Housekeeping - Lab today & Ex05 due today - Midterm 1 solution posted - Extra OH 7pm-8pm on Zoom --- # Last Time We discussed the .hi-green[classical assumptions of OLS:] > 1. **Linearity:** The population relationship is linear in parameters with an additive error term. 2. **Sample Variation:** There is variation in `\(X\)`. 3. **Random Sampling:** We have a random sample from the population of interest. 4. **Exogeneity:** The `\(X\)` variable is exogenous (*i.e.,* `\(\mathop{\mathbb{E}}\left( u|X \right) = 0\)`). 5. **Homoskedasticity:** The error term has the same variance for each value of the independent variable (*i.e.,* `\(\mathop{\text{Var}}(u|X) = \sigma^2\)`). 6. **Normality:** The population error term is normally distributed with mean zero and variance `\(\sigma^2\)` (*i.e.,* `\(u \sim N(0,\sigma^2)\)`) We restricted our attention to the first 5 assumptions. --- count: false # Last Time We discussed the .hi-green[classical assumptions of OLS:] > 1. **Linearity:** The population relationship is linear in parameters with an additive error term. 2. **Sample Variation:** There is variation in `\(X\)`. 3. **Random Sampling:** We have a random sample from the population of interest. 4. **Exogeneity:** The `\(X\)` variable is exogenous (*i.e.,* `\(\mathop{\mathbb{E}}\left( u|X \right) = 0\)`). 5. **Homoskedasticity:** The error term has the same variance for each value of the independent variable (*i.e.,* `\(\mathop{\text{Var}}(u|X) = \sigma^2\)`). 6. .hi[Normality:] .pink[The population error term is normally distributed with mean zero and variance] `\(\color{#e64173}{\sigma^2}\)` .pink[(*i.e.,*] `\(\color{#e64173}{u \sim N(0,\sigma^2)}\)`.pink[)] We restricted our attention to the first 5 assumptions. --- # Classical Assumptions ## Last Time 1. 
We used the first 4 assumptions to show that OLS is unbiased: `\(\mathop{\mathbb{E}}\left[ \hat{\beta} \right] = \beta\)` 2. We used the first 5 assumptions to derive a formula for the __variance__ of the OLS estimator: `\(\mathop{\text{Var}}(\hat{\beta}_1) = \frac{\sigma^2}{\sum_{i=1}^n (X_i - \bar{X})^2}\)`. --- # Classical Assumptions ## Today We will use the sampling distribution of `\(\hat{\beta}_1\)` to conduct hypothesis tests. - Can use all 6 classical assumptions to show that OLS is normally distributed: `$$\hat{\beta}_1 \sim \mathop{N}\left( \beta_1, \frac{\sigma^2}{\sum_{i=1}^n (X_i - \bar{X})^2} \right)$$` - We'll "prove" this using .mono[R]. --- # Simulation .pull-left[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/pop1-1.svg" style="display: block; margin: auto;" /> .center[**Population**] ] -- .pull-right[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/scatter1-1.svg" style="display: block; margin: auto;" /> .center[**Population relationship**] $$ Y_i = 2.53 + 0.57 X_i + u_i $$ $$ Y_i = \beta_0 + \beta_1 X_i + u_i $$ ] --- # Simulation .pull-left[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample1-1.svg" style="display: block; margin: auto;" /> .center[**Sample 1:** 30 random individuals] ] -- .pull-right[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample1 scatter-1.svg" style="display: block; margin: auto;" /> .center[ **Population relationship** <br> `\(Y_i = 2.53 + 0.57 X_i + u_i\)` **Sample relationship** <br> `\(\hat{Y}_i = 2.36 + 0.61 X_i\)` ] ] --- # Simulation .pull-left[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample2-1.svg" style="display: block; margin: auto;" /> .center[**Sample 2:** 30 random individuals] ] .pull-right[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample2 scatter-1.svg" style="display: block; margin: auto;" /> .center[ **Population relationship** <br> `\(Y_i = 2.53 + 0.57 X_i + u_i\)`
**Sample relationship** <br> `\(\hat{Y}_i = 2.79 + 0.56 X_i\)` ] ] --- # Simulation .pull-left[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample3-1.svg" style="display: block; margin: auto;" /> .center[**Sample 3:** 30 random individuals] ] .pull-right[ <img src="10-Simple_Linear_Regression_Inference_files/figure-html/sample3 scatter-1.svg" style="display: block; margin: auto;" /> .center[ **Population relationship** <br> `\(Y_i = 2.53 + 0.57 X_i + u_i\)` **Sample relationship** <br> `\(\hat{Y}_i = 3.21 + 0.45 X_i\)` ] ] --- layout: false class: white-slide, middle Repeat **10,000 times** (Monte Carlo simulation). --- class: white-slide <img src="10-Simple_Linear_Regression_Inference_files/figure-html/simulation scatter-1.png" style="display: block; margin: auto;" /> --- class: white-slide, middle .pull-left[ .center[ **Intercept Estimates** ] <img src="10-Simple_Linear_Regression_Inference_files/figure-html/simulation hist1-1.png" style="display: block; margin: auto;" /> ] .pull-right[ .center[ **Slope Estimates** ] <img src="10-Simple_Linear_Regression_Inference_files/figure-html/simulation hist2-1.png" style="display: block; margin: auto;" /> ] --- # Simulation Can you spot the classical assumptions? ```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? 
```r # Set population and sample sizes *n_p <- 100 *n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? ```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( * x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? ```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), * e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? 
```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), * y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? ```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure *sim_ols <- function(x, size = n_s) { * lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% * tidy() %>% * mutate(iteration = x) *} # Run simulation sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- count: false # Simulation Can you spot the classical assumptions? ```r # Set population and sample sizes n_p <- 100 n_s <- 30 # Generate population data pop_df <- tibble( x = rnorm(n_p, mean = 5, sd = 1.5), e = rnorm(n_p, mean = 0, sd = 1), y = 2.53 + 0.57 * x + e ) # Define simulation procedure sim_ols <- function(x, size = n_s) { lm(y ~ x, data = pop_df %>% sample_n(size = size)) %>% tidy() %>% mutate(iteration = x) } # Run simulation *sim_df <- map_df(1:10000, ~sim_ols(.x, size = n_s)) ``` --- class: inverse, middle # Inference --- # Motivation What does statistical evidence say about existing theories? We want to test hypotheses posed by politicians, economists, scientists, people with foil hats, _etc._ - Does building a giant wall **reduce crime**? - Does shutting down a government **adversely affect the economy**? - Does legal cannabis **reduce drunk driving** or **reduce opioid use**? - Do air quality standards **improve health** or **reduce jobs**? 
-- While uncertainty exists, we can still conduct *reliable* statistical tests (rejecting or failing to reject a hypothesis). --- # Inference We know OLS has some nice properties, and we know how to estimate an intercept and slope coefficient using OLS. Our current workflow: - Get data (points with `\(X\)` and `\(Y\)` values). - Regress `\(Y\)` on `\(X\)`. - Plot the fitted values (*i.e.*, `\(\hat{Y_i} = \hat{\beta}_0 + \hat{\beta}_1X_i\)`) and report the estimates. -- But how do we actually **learn** something from this exercise? -- - Based upon our value of `\(\hat{\beta}_1\)`, can we rule out previously hypothesized values? - How confident should we be in the precision of our estimates? -- We need to be able to deal with uncertainty. Enter: **Inference.** --- # Inference We use the standard error of `\(\hat{\beta}_1\)`, along with `\(\hat{\beta}_1\)` itself, to learn about the parameter `\(\beta_1\)`. After deriving the distribution of `\(\hat{\beta}_1\)`,<sup>.pink[†]</sup> we have two (related) options for formal statistical inference (learning) about our unknown parameter `\(\beta_1\)`: - **Hypothesis tests:** Determine whether there is statistically significant evidence to reject a hypothesized value or range of values. - **Confidence intervals:** Use the estimate and its standard error to create an interval that, when repeated, will generally<sup>.pink[††]</sup> contain the true parameter. .footnote[ .pink[†] *Hint:* It's normal with mean `\(\beta_1\)` and variance `\(\frac{\sigma^2}{\sum_{i=1}^n (X_i - \bar{X})^2}\)`. <br> .pink[††] _E.g._, similarly constructed 95% confidence intervals will contain the true parameter 95% of the time. ] --- # OLS Variance Hypothesis tests and confidence intervals require information about the variance of the OLS estimator: `$$\mathop{\text{Var}}(\hat{\beta}_1) = \frac{\sigma^2}{\sum_{i=1}^n (X_i - \bar{X})^2}.$$` -- **Problem** - The variance formula has a population parameter: `\(\sigma^2\)` (a.k.a. error variance). 
- We can't observe population parameters. -- - **Solution:** Estimate `\(\sigma^2\)`. --- # Estimating Error Variance ## Learning from our (prediction) errors We can estimate the variance of `\(u_i\)` (a.k.a. `\(\sigma^2\)`) using the sum of squared residuals: $$ s^2_u = \dfrac{\sum_i \hat{u}_i^2}{n - k} $$ where `\(k\)` gives the number of regression parameters. - In a simple linear regression, `\(k=2\)`. - `\(s^2_u\)` is an unbiased estimator of `\(\sigma^2\)`. --- # OLS Variance, Take 2 With `\(s^2_u = \dfrac{\sum_i \hat{u}_i^2}{n - k}\)`, we can calculate `$$\mathop{\text{Var}}(\hat{\beta}_1) = \frac{s^2_u}{\sum_{i=1}^n (X_i - \bar{X})^2}.$$` -- Taking the square root, we get the __standard error__ of the OLS estimator: `$$\mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right) = \sqrt{ \frac{s^2_u}{\sum_{i=1}^n (X_i - \bar{X})^2} }.$$` - Standard error .mono[=] standard deviation of an estimator. --- # Standard Errors .mono[R]'s `lm()` function estimates standard errors out of the box: ```r tidy(lm(y ~ x, pop_df)) ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 2.53 0.422 6.00 3.38e- 8 #> 2 x 0.567 0.0793 7.15 1.59e-10 ``` I won't ask you to estimate standard errors by hand! --- class: inverse, middle # Hypothesis Tests --- # Hypothesis Tests __Null hypothesis (H.sub[0]):__ `\(\beta_1 = 0\)` __Alternative hypothesis (H.sub[a]):__ `\(\beta_1 \neq 0\)` -- There are four possible outcomes of our test: 1. We __fail to reject__ the null hypothesis and the null is true. 2. We __reject__ the null hypothesis and the null is false. 3. We __reject__ the null hypothesis, but the null is actually true (**Type I error**). 4. We __fail to reject__ the null hypothesis, but the null is actually false (**Type II error**). --- # Hypothesis Tests **Goal:** Make a statement about `\(\beta_1\)` using information on `\(\hat{\beta}_1\)`. 
-- `\(\hat{\beta}_1\)` is random: it could be anything, even if `\(\beta_1 = 0\)` is true. - But if `\(\beta_1 = 0\)` is true, then `\(\hat{\beta}_1\)` is unlikely to take values far from zero. - As the standard error shrinks, we are even less likely to observe "extreme" values of `\(\hat{\beta}_1\)` (assuming `\(\beta_1 = 0\)`). -- Our test should take .pink[extreme values] of `\(\hat{\beta}_1\)` as .pink[evidence against the null hypothesis], but it should also weight them by what we know about the variance of `\(\hat{\beta}_1\)`. --- # Hypothesis Tests .pull-left[ __Null hypothesis__ H.sub[0]: `\(\beta_1 = 0\)` ] .pull-right[ __Alternative hypothesis__ H.sub[a]: `\(\beta_1 \neq 0\)` ] <br> <br> To conduct the test, we calculate a `\(t\)`-statistic: `$$t = \frac{\hat{\beta}_1 - \beta_1^0}{\mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)}$$` - Distributed according to a `\(t\)`-distribution with `\(n-2\)` _degrees of freedom_. - `\(\beta_1^0\)` is the value of `\(\beta_1\)` in our null hypothesis (*e.g.,* `\(\beta_1^0 = 0\)`). --- # Hypothesis Tests Next, we use the `\(\color{#708090}{t}\)`.hi-slate[-statistic] to calculate a `\(\color{#007935}{p}\)`.hi-green[-value]. <img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-8-1.svg" style="display: block; margin: auto;" /> Describes the probability of seeing a `\(t\)`-statistic as extreme as the one we observe _if the null hypothesis is actually true_. -- But...we still need some benchmark to compare our `\(p\)`-value against. --- # Hypothesis Tests We worry mostly about false positives, so we conduct hypothesis tests based on the probability of making a Type I error. **How?** We select a .hi[significance level] `\(\color{#e64173}{\alpha}\)` that specifies our tolerance for false positives. This is the probability of Type I error we choose to live with. 
<img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-9-1.svg" style="display: block; margin: auto;" /> --- # Hypothesis Tests We then compare `\(\alpha\)` to the `\(p\)`-value of our test. - If the `\(p\)`-value is less than `\(\alpha\)`, then we __reject the null hypothesis__ at the `\(\alpha\cdot100\)` percent level. - If the `\(p\)`-value is greater than `\(\alpha\)`, then we __fail to reject the null hypothesis__. - **Note:** _Fail to reject_ `\(\neq\)` _accept_. --- # Hypothesis Tests **Example:** Are campus police associated with campus crime? ```r lm(crime ~ police, data = campus) %>% tidy() ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 18.4 2.38 7.75 1.06e-11 *#> 2 police 1.76 1.30 1.35 1.81e- 1 ``` H.sub[0]: `\(\beta_\text{Police} = 0\)` *vs.* H.sub[a]: `\(\beta_\text{Police} \neq 0\)` -- Significance level: `\(\alpha = 0.05\)` (*i.e.,* 5 percent test) -- Test Condition: Reject H.sub[0] if `\(p < \alpha\)` -- `\(p = 0.18\)`. **Do we reject the null hypothesis?** --- # Hypothesis Tests `\(p\)`-values are difficult to calculate by hand. __Alternative:__ Compare `\(\color{#708090}{t}\)`.hi-slate[-statistic] to .hi-purple[critical values] from the `\(t\)`-distribution. <img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-11-1.svg" style="display: block; margin: auto;" /> --- # Hypothesis Tests **Notation:** `\(t_{1-\alpha/2, n-2}\)` or `\(t_\text{crit}\)`. - Find in a `\(t\)` table using the significance level `\(\alpha\)` and `\(n-2\)` degrees of freedom. Compare the critical value to your `\(t\)`-statistic: - If `\(|t| > |t_{1-\alpha/2, n-2}|\)`, then __reject the null__. - If `\(|t| < |t_{1-\alpha/2, n-2}|\)`, then __fail to reject the null__. --- # Two-Sided Tests Based on a critical value of `\(t_{1-\alpha/2, n-2} = t_{0.975, 100} =\)` 1.98, we can identify a .hi[rejection region] on the `\(t\)`-distribution.
<img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-12-1.svg" style="display: block; margin: auto;" /> -- If our `\(t\)` statistic is in the rejection region, then we reject the null hypothesis at the 5 percent level. --- # Two-Sided Tests .mono[R] defaults to testing hypotheses against the null hypothesis of zero. ```r lm(y ~ x, data = pop_df) %>% tidy() ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 2.53 0.422 6.00 3.38e- 8 *#> 2 x 0.567 0.0793 7.15 1.59e-10 ``` -- H.sub[0]: `\(\beta_1 = 0\)` *vs.* H.sub[a]: `\(\beta_1 \neq 0\)` -- Significance level: `\(\alpha = 0.05\)` (*i.e.,* 5 percent test) -- `\(t_\text{stat} = 7.15\)` and `\(t_\text{0.975, 98} = 1.98\)` -- , which implies that `\(p < 0.05\)`. -- Therefore, we .hi[reject H.sub[0]] at the 5% level. --- # Two-Sided Tests **Example:** Are campus police associated with campus crime? ```r lm(crime ~ police, data = campus) %>% tidy() ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 18.4 2.38 7.75 1.06e-11 *#> 2 police 1.76 1.30 1.35 1.81e- 1 ``` H.sub[0]: `\(\beta_\text{Police} = 0\)` *vs.* H.sub[a]: `\(\beta_\text{Police} \neq 0\)` -- Significance level: `\(\alpha = 0.1\)` (*i.e.,* 10 percent test) -- Test Condition: Reject H.sub[0] if `\(|t| > t_\text{crit}\)` -- `\(t = 1.35\)` and `\(t_\text{crit} = 1.66\)`. **Do we reject the null hypothesis?** --- # One-Sided Tests Sometimes we are confident that a parameter is non-negative or non-positive. A __one-sided__ test assumes that values on one side of the null hypothesis are impossible. - __Option 1:__ H.sub[0]: `\(\beta_1 = 0\)` *vs.* H.sub[a]: `\(\beta_1 > 0\)` - __Option 2:__ H.sub[0]: `\(\beta_1 = 0\)` *vs.* H.sub[a]: `\(\beta_1 < 0\)` -- If this assumption is reasonable, then our rejection region changes. - Same `\(\alpha\)`.
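--- # Critical Values in .mono[R] Critical values for either kind of test come straight from `qt()`. A quick sketch (the degrees of freedom below are an assumption for illustration; use `\(n-2\)` from your own regression):

```r
# Critical values at alpha = 0.05 with 98 degrees of freedom (n = 100)
alpha <- 0.05
dof   <- 98

qt(1 - alpha / 2, dof)  # two-sided critical value, approx. 1.98
qt(1 - alpha, dof)      # one-sided critical value, approx. 1.66
```

`pt()` goes the other way: `2 * pt(-abs(t_stat), dof)` returns the two-sided `\(p\)`-value for a `\(t\)`-statistic `t_stat`.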
--- # One-Sided Tests __Left-tailed:__ Based on a critical value of `\(t_{1-\alpha, n-2} = t_{0.95, 100} =\)` 1.66, we can identify a .hi[rejection region] on the `\(t\)`-distribution. <img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-14-1.svg" style="display: block; margin: auto;" /> -- If our `\(t\)` statistic is in the rejection region, then we reject the null hypothesis at the 5 percent level. --- # One-Sided Tests __Right-tailed:__ Based on a critical value of `\(t_{1-\alpha, n-2} = t_{0.95, 100} =\)` 1.66, we can identify a .hi[rejection region] on the `\(t\)`-distribution. <img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-15-1.svg" style="display: block; margin: auto;" /> -- If our `\(t\)` statistic is in the rejection region, then we reject the null hypothesis at the 5 percent level. --- # One-Sided Tests **Example:** Do campus police deter campus crime? ```r lm(crime ~ police, data = campus) %>% tidy() ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 18.4 2.38 7.75 1.06e-11 *#> 2 police 1.76 1.30 1.35 1.81e- 1 ``` H.sub[0]: `\(\beta_\text{Police} = 0\)` *vs.* H.sub[a]: `\(\beta_\text{Police} < 0\)` -- Significance level: `\(\alpha = 0.1\)` (*i.e.,* 10 percent test) -- Test Condition: Reject H.sub[0] if `\(t < -t_\text{crit}\)` -- `\(t = 1.35\)` and `\(t_\text{crit} = 1.29\)`. **Do we reject the null hypothesis?** --- class: inverse, middle # Confidence Intervals --- # Confidence Intervals Until now, we have considered __point estimates__ of population parameters. - Sometimes a range of values is more interesting/honest.
-- We can construct `\((1-\alpha)\cdot100\)`-percent level confidence intervals for `\(\beta_1\)` `$$\hat{\beta}_1 \pm t_{1-\alpha/2, n-2} \, \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)$$` -- `\(t_{1-\alpha/2,n-2}\)` denotes the `\(1-\alpha/2\)` quantile of a `\(t\)` distribution with `\(n-2\)` degrees of freedom. --- # Confidence Intervals **Q:** Where does the confidence interval formula come from? -- **A:** The confidence interval formula comes from the rejection condition of a two-sided test. > Reject H.sub[0] if `\(|t| > t_\text{crit}\)` -- The test condition implies > Fail to reject H.sub[0] if `\(|t| \leq t_\text{crit}\)` which is equivalent to > Fail to reject H.sub[0] if `\(-t_\text{crit} \leq t \leq t_\text{crit}\)`. --- # Confidence Intervals Replacing `\(t\)` with its formula gives > Fail to reject H.sub[0] if `\(-t_\text{crit} \leq \frac{\hat{\beta}_1 - \beta_1^0}{\mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)} \leq t_\text{crit}\)`. -- Standard errors are always positive, so the inequalities do not flip when we multiply by `\(\mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)\)`: > Fail to reject H.sub[0] if `\(-t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right) \leq \hat{\beta}_1 - \beta_1^0\leq t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)\)`. -- Subtracting `\(\hat{\beta}_1\)` yields > Fail to reject H.sub[0] if `\(-\hat{\beta}_1 -t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right) \leq - \beta_1^0 \leq - \hat{\beta}_1 + t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)\)`. --- # Confidence Intervals Multiplying by -1 and rearranging gives > Fail to reject H.sub[0] if <br> `\(\hat{\beta}_1 - t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right) \leq \beta_1^0 \leq \hat{\beta}_1 + t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)\)`. 
-- Replacing `\(\beta_1^0\)` with `\(\beta_1\)` and dropping the test condition yields the interval `$$\hat{\beta}_1 - t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right) \leq \beta_1 \leq \hat{\beta}_1 + t_\text{crit} \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right)$$` which is equivalent to `$$\hat{\beta}_1 \pm t_\text{crit} \, \mathop{\hat{\text{SE}}} \left( \hat{\beta}_1 \right).$$` --- # Confidence Intervals **Insight:** A confidence interval is related to a two-sided hypothesis test. - If a 95 percent confidence interval contains zero, then we fail to reject the null hypothesis at the 5 percent level. - If a 95 percent confidence interval does not contain zero, then we reject the null hypothesis at the 5 percent level. - **Generally:** A `\((1- \alpha) \cdot 100\)` percent confidence interval embeds a two-sided test at the `\(\alpha \cdot 100\)` level. --- # Confidence Intervals ## Example ```r lm(y ~ x, data = pop_df) %>% tidy() ``` ``` #> # A tibble: 2 × 5 #> term estimate std.error statistic p.value #> <chr> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 2.53 0.422 6.00 3.38e- 8 *#> 2 x 0.567 0.0793 7.15 1.59e-10 ``` -- ```r # find degrees of freedom dof <- summary(lm(y ~ x, data = pop_df))$df[2] # return critical value qt(0.975, dof) ``` ``` #> [1] 1.984467 ``` -- **95% confidence interval** for `\(\beta_1\)` is `\(0.567 \pm 1.98 \times 0.0793 = \left[ 0.410,\, 0.724 \right]\)` --- # Confidence Intervals We have a confidence interval for `\(\beta_1\)`, *i.e.,* `\(\left[ 0.410,\, 0.724 \right]\)`. .hi[What does it mean?] -- **Informally:** The confidence interval gives us a region (interval) in which we can place some trust (confidence) for containing the parameter. -- **More formally:** If we repeatedly sample from our population and construct confidence intervals for each of these samples, then `\((1-\alpha) \cdot100\)` percent of our intervals (*e.g.,* 95%) will contain the population parameter *somewhere in the interval*. 
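In .mono[R], `confint()` assembles this interval for you. A sketch, assuming `pop_df` from the earlier simulation is still in memory:

```r
# confint() computes estimate ± t_crit × SE for each coefficient
fit <- lm(y ~ x, data = pop_df)
confint(fit, parm = "x", level = 0.95)  # should reproduce the by-hand interval [0.410, 0.724]
```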
-- Now back to our simulation... --- # Confidence Intervals We drew 10,000 samples (each of size `\(n = 30\)`) from our population and estimated our regression model for each sample: $$ Y_i = \hat{\beta}_0 + \hat{\beta}_1 X_i + \hat{u}_i $$ <center>(repeated 10,000 times)</center> Now, let's estimate 95% confidence intervals for each of these samples... --- # Confidence Intervals **From our previous simulation:** 97.9% of 95% confidence intervals contain the true parameter value of `\(\beta_1\)`. <img src="10-Simple_Linear_Regression_Inference_files/figure-html/simulation ci-1.svg" style="display: block; margin: auto;" /> --- # Confidence Intervals ## Example: Association of police with crime You can instruct `tidy` to return a 95 percent confidence interval for the association of campus police with campus crime: ```r lm(crime ~ police, data = campus) %>% tidy(conf.int = TRUE, conf.level = 0.95) ``` ``` #> # A tibble: 2 × 7 #> term estimate std.error statistic p.value conf.low conf.high #> <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> #> 1 (Intercept) 18.4 2.38 7.75 1.06e-11 13.7 23.1 *#> 2 police 1.76 1.30 1.35 1.81e- 1 -0.830 4.34 ``` --- # Confidence Intervals ## Example: Association of police with crime <img src="10-Simple_Linear_Regression_Inference_files/figure-html/unnamed-chunk-19-1.svg" style="display: block; margin: auto;" /> Four confidence intervals for the same coefficient.
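--- # Confidence Intervals The coverage rate can be computed directly from the simulation output. A sketch, assuming `sim_df` from the earlier Monte Carlo (with the `tidy()` columns `term`, `estimate`, and `std.error`):

```r
# For each simulated sample, build a 95% CI and ask whether it covers beta_1 = 0.57
sim_df %>%
  filter(term == "x") %>%
  mutate(
    lower = estimate - qt(0.975, df = 28) * std.error,  # n_s = 30, so df = 28
    upper = estimate + qt(0.975, df = 28) * std.error
  ) %>%
  summarize(coverage = mean(lower <= 0.57 & 0.57 <= upper))
```

Coverage above the nominal 95 percent is plausible here: each sample of 30 is drawn without replacement from a population of only 100, which likely makes the usual standard errors conservative.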