EC421, Set 01
Prologue
Let’s start with a few basic, general questions:
What is the goal of econometrics?
Why do economists (or other people) study or use econometrics?
One simple answer: Learn about the world using data.
Learn about the world = Raise, answer, and challenge questions, theories, assumptions.
data = Plural of datum.
One might (reasonably) guess a company’s sales are a function of its advertising spending, price, and intesity of competitors.
So, one might hypothesize a model \(\textcolor{#6A5ACD}{\text{Sales}} = f(\textcolor{#e64173}{\text{Ad}, \text{Price}, \text{Comp}})\)
where
We expect that sales \(\uparrow\) with advertising and \(\downarrow\) with price and competition.
But who needs to expect?
We can test these hypotheses using regression.
More importantly: Regression estimates the size of these effects
These (causal) questions are central to efficient decision-making
and are the bread and butter of econometrics.
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
With this basic regression model, we can test/estimate/quantify the (linear) relationship between sales and advertising, price, and competition.
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
Population parameters are averages; individuals are rarely average.
Regression model:
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \beta_0 + \beta_1 \textcolor{#e64173}{\text{Ad}}_i + \beta_2 \textcolor{#e64173}{\text{Price}}_i + \beta_3 \textcolor{#e64173}{\text{Comp}}_i + \varepsilon_i \]
You’ve learned how powerful and flexible ordinary least squares (OLS) regression can be.
However, the results you learned required assumptions.
Real life often violates these assumptions.
EC421 asks “What happens when we violate these assumptions?”
OLS still does some amazing things, but you need to know when to be cautious, confident, or dubious.
Suppose you estimate our sales model for your boss.
\[ \textcolor{#6A5ACD}{\text{Sales}}_i = \hat{\beta}_0 + \hat{\beta}_1 \textcolor{#e64173}{\text{Ad}}_i + \hat{\beta}_2 \textcolor{#e64173}{\text{Price}}_i + \hat{\beta}_3 \textcolor{#e64173}{\text{Comp}}_i + e_i \]
Can you trust that \(\hat{\beta}_2\) gives you the actual effect of price on sales?
You should be asking several questions…
Applied econometrics, data science, analytics require:
Intuition for the theory behind statistics/econometrics
(assumptions, results, strengths, weaknesses).
Practical knowledge of how to apply theoretical methods to data.
Efficient methods for working with data
(cleaning, aggregating, joining, visualizing).
This course aims to deepen your knowledge in each of these three areas.
My “big-picture takeaways” (the intuition that I hope you form)
Next: R basics + (More) Metrics review(s)