-
Question
The daily expenses of summer tourists in Turin are analyzed. A
survey with tourists is conducted. This shows that the
tourists spend on average EUR. The sample variance
is equal to .
You are asked to determine a confidence interval for the average daily
expenses (in EUR) of a tourist. Using your computations, tell me which of the
following statements is correct:
-
The lower bound of the confidence interval is 113.959
-
The upper bound of the confidence interval is 119.008
-
None of the statements are correct.
Solution
The confidence interval for the average expenses is
given by:
-
True
-
False
-
False
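The general computation behind such an interval can be sketched as follows. This is a minimal sketch, assuming a large-sample normal approximation and a 95% confidence level; the function name `mean_confidence_interval` and all numeric inputs are hypothetical and are not the values from the question.

```python
from math import sqrt
from statistics import NormalDist


def mean_confidence_interval(n, sample_mean, sample_variance, level=0.95):
    """Normal-approximation confidence interval for a population mean."""
    # Two-sided critical value, roughly 1.96 when level=0.95.
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)
    # Standard error of the sample mean.
    half_width = z * sqrt(sample_variance / n)
    return sample_mean - half_width, sample_mean + half_width


# Hypothetical inputs for illustration only (NOT the survey values).
lower, upper = mean_confidence_interval(n=100, sample_mean=50.0, sample_variance=400.0)
print(round(lower, 3), round(upper, 3))
```

Plugging the survey's sample size, mean and variance into the same function gives the bounds to compare against the answer options.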
-
Question
Which of the following statements about the regression model and its associated predicted values are correct?
-
On average, if is greater than , then the corresponding is smaller than its mean
-
On average, if is smaller than , then the corresponding is also smaller than its mean
-
On average, if is smaller than , then the corresponding is greater than its mean
-
On average, if is greater than , then the corresponding is also greater than its mean
-
None of the provided answers is correct.
Solution
We talked about this solution in class with a picture and some algebra. You can consult the solution here. You can see it under Task 4 here.
-
False. We have seen that and are orthogonal, i.e. .
-
False. We have seen that and are orthogonal, i.e. .
-
False. We have seen that and are orthogonal, i.e. .
-
False. We have seen that and are orthogonal, i.e. .
-
True. We have seen that and are orthogonal, i.e. .
-
Question
Which of the following statements about the regression model and its associated residual values are correct?
-
The mean of the residuals from a linear regression is zero only if we include a slope.
-
None of the provided answers is correct.
-
The mean of the residuals from a linear regression is always zero, regardless of whether there is an intercept or not.
-
The mean of the residuals from a linear regression is zero only if we include an intercept.
Solution
The solution to this question is on slide 32/40 of lecture 5.
-
False.
-
False.
-
False.
-
True.
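The role of the intercept can be checked numerically. The following is a minimal sketch on made-up data (not taken from the lecture): it fits a simple regression once with and once without an intercept and compares the mean of the residuals. The data and variable names are hypothetical.

```python
# Hypothetical data for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [3.1, 4.9, 7.2, 8.8, 11.0]
n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# OLS with an intercept: slope = S_xy / S_xx, intercept = y_bar - slope * x_bar.
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar
resid_with = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

# OLS through the origin (no intercept): slope = sum(x*y) / sum(x^2).
slope_origin = sum(xi * yi for xi, yi in zip(x, y)) / sum(xi ** 2 for xi in x)
resid_without = [yi - slope_origin * xi for xi, yi in zip(x, y)]

mean_with = sum(resid_with) / n
mean_without = sum(resid_without) / n
print(mean_with, mean_without)
```

With an intercept, the first-order condition for the intercept forces the residuals to sum to zero (up to floating-point error). Without it, nothing guarantees this, and on these data the mean residual is clearly non-zero.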
-
Question
Consider the following dataset and the fitted line:
[Figure: plot of chunk nonlinearplot]
-
the equation for the OLS fitted line looks like
-
OLS cannot represent nonlinear data. As the name states, it is ordinary linear squares.
-
None of the provided answers is correct.
-
the equation for the OLS fitted line looks like
-
the equation for the OLS fitted line looks like
Solution
-
False.
-
False.
-
False.
-
True.
-
False.
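OLS only needs the model to be linear in the parameters, not in the regressors, so a curved relationship can be fitted by regressing on a transformed variable. The sketch below assumes a purely quadratic relationship on made-up, noise-free data; it is not the dataset or the equation from the question, and the helper `simple_ols` is a hypothetical name.

```python
def simple_ols(x, y):
    """Return (intercept, slope) of the least-squares line of y on x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    return y_bar - slope * x_bar, slope


# Hypothetical data generated from y = 2 + 3 * x^2 (no noise).
x = [-2.0, -1.0, 0.0, 1.0, 2.0]
y = [2.0 + 3.0 * xi ** 2 for xi in x]

# Regress y on z = x^2: the model is nonlinear in x but linear in the coefficients.
z = [xi ** 2 for xi in x]
intercept, slope = simple_ols(z, y)
print(intercept, slope)
```

The fitted line in terms of `z` is a parabola in terms of `x`, which is why a squared regressor still counts as a linear regression.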
-
Question
You are analysing the determinants of pay, using 420 observations. Each of the columns displays the OLS regression associated with one of the following models, where male is a binary indicator equal to 1 if an individual is male. All models were run on the same input dataset. Which of the following statements is correct?
[Figure: plot of chunk interaction-plot]
| | (1) | (2) |
|---|---|---|
| (Intercept) | 34.012*** | 22.040*** |
| | (0.854) | (1.205) |
| education | -1.134*** | 0.373+ |
| | (0.173) | (0.191) |
| maleTRUE | | 10.656*** |
| | | (0.854) |
| Num.Obs. | 420 | 420 |
| R2 | 0.094 | 0.340 |

+ p < 0.1, * p < 0.05, ** p < 0.01, *** p < 0.001
-
The plot corresponds to model (2)
-
The estimate for (Intercept) in model (2) is statistically significant at least at the 5% level
-
None of the statements is correct.
Solution
-
TRUE
-
TRUE
-
FALSE
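The significance of the intercept in model (2) can be checked by hand from the table: divide the estimate by its standard error (shown in parentheses) and compare with the critical value. This is a minimal sketch that assumes the normal approximation is adequate with 420 observations; the variable names are illustrative.

```python
from statistics import NormalDist

# Values read from column (2) of the regression table.
estimate = 22.040
std_error = 1.205

# t statistic for H0: intercept = 0.
t_stat = estimate / std_error

# Two-sided 5% critical value under the normal approximation (an assumption).
critical_value = NormalDist().inv_cdf(0.975)
significant_at_5pct = abs(t_stat) > critical_value
print(round(t_stat, 2), round(critical_value, 2), significant_at_5pct)
```

The three stars next to 22.040 in the table say the same thing more strongly: the p-value is below 0.001, so the estimate is significant at the 5% level as well.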
-
Question
For 64 firms the number of employees and the amount of
expenses for continuing education (in EUR) were recorded. The
statistical summary of the data set is given by:
| | Employees | Expenses (EUR) |
|---|---|---|
| Mean | 53.38 | 235.8 |
| Variance | 109.86 | 2826.75 |

The covariance between the number of employees and the expenses is equal to 481.65.
Estimate the expected amount of money spent for continuing education
by a firm with 47 employees using least squares regression. Your solution should be rounded to 2 digits. Which of the following statements are correct?
-
The expected amount of money spent is 209.81
-
The intercept of your regression is 1.77
-
None of the above statements is correct.
Solution
First, the regression line is determined. The regression coefficients are given by:
The estimated amount of money spent by a firm with
47 employees is then given by:
-
False
-
True
-
False
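The computation can be reproduced from the summary statistics alone. The sketch below assumes the first column of the table refers to the number of employees (the regressor) and the second to the expenses (the response), which is the reading consistent with the answer key; the variable names are illustrative.

```python
# Summary statistics from the question.
mean_x, mean_y = 53.38, 235.8   # employees, expenses (EUR)
var_x = 109.86                  # variance of employees
cov_xy = 481.65                 # covariance of employees and expenses

# Least-squares coefficients from moments.
slope = cov_xy / var_x
intercept = mean_y - slope * mean_x

# Predicted expenses for a firm with 47 employees.
prediction = intercept + slope * 47
print(round(slope, 2), round(intercept, 2), round(prediction, 2))
```

Rounded to two digits, the intercept is 1.77 and the prediction is 207.83, so the statement about the intercept is correct and the value 209.81 is not.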
-
Question
The following figure shows a scatterplot. Notice that you can visually estimate the standard deviation of a normally distributed random variable by dividing its range by 6. Both and are normally distributed in this example. Which of the following statements are correct?
[Figure: plot of chunk scatterplot]
-
The standard deviation of is at least .
-
For , can be expected to be about .
-
The mean of is at least .
-
The absolute value of the correlation coefficient is at most .
-
The scatterplot is standardized.
Solution
-
True. The standard deviation of is about equal to and is therefore larger than .
-
True. The regression line at implies a value of about .
-
True. The mean of is about equal to and hence is larger than .
-
False. A strong association between the variables is given in the scatterplot. Hence the absolute value of the correlation coefficient is close to and therefore larger than .
-
False. The scatterplot is not standardized, because and do not both have mean and variance .
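The range-divided-by-6 rule used above rests on the fact that almost all draws from a normal distribution fall within three standard deviations of the mean. A minimal sketch of that reasoning follows; the helper `sd_from_range` and the example range are hypothetical and are not read off the scatterplot.

```python
from statistics import NormalDist

# Share of a normal distribution lying within +/- 3 standard deviations.
std_normal = NormalDist()
coverage = std_normal.cdf(3) - std_normal.cdf(-3)


def sd_from_range(lowest, highest):
    """Rule-of-thumb standard deviation: observed range divided by 6."""
    return (highest - lowest) / 6


# Hypothetical axis range for illustration only.
approx_sd = sd_from_range(-3.0, 3.0)
print(round(coverage, 4), approx_sd)
```

Because roughly 99.7% of the mass lies in that band, the visible range of a moderately sized sample spans about six standard deviations.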
-
Question
Below we show the summary of running 2 models (A and B) for 300 times. Each time we generate a dataset containing where and we run the regression
Both models A and B differ in how strongly and are correlated with each other. The true population parameters are and .
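| | Model A | Model B |
|---|---|---|
| Mean of first slope estimate | 4 | 3.99 |
| Mean of second slope estimate | 1 | 1.01 |
| Mean SE of first slope estimate | 0.04 | 0.29 |
| Mean SE of second slope estimate | 0.04 | 0.29 |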
Which of the following statements are true? When I say below that 2 numbers differ significantly I mean that they should differ by several multiples, i.e. and would differ if for .
-
The average point estimates from both models for both slope coefficients do not differ significantly.
-
The averages of the standard error of estimates from both models for both slope coefficients do not differ significantly.
-
Given this evidence, Model B can be described as a situation of multicollinearity.
-
Given this evidence, under multicollinearity, OLS is biased.
Solution
-
True
-
False
-
True
-
False
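The pattern in the table (similar average point estimates, much larger standard errors) is what the variance inflation factor predicts. In a regression with two regressors whose correlation is r, the sampling variance of each slope is inflated by 1 / (1 - r^2) relative to the uncorrelated case, while the estimator stays unbiased. The sketch below uses hypothetical correlation values; the correlations used to generate Models A and B are not stated in the question.

```python
from math import sqrt


def variance_inflation_factor(r):
    """VIF for a slope in a two-regressor model with regressor correlation r."""
    return 1.0 / (1.0 - r ** 2)


# Hypothetical correlations for illustration only.
for r in (0.0, 0.5, 0.9, 0.99):
    vif = variance_inflation_factor(r)
    # The standard error scales with the square root of the VIF.
    print(r, round(vif, 2), round(sqrt(vif), 2))
```

Standard errors that are several times larger with unchanged average estimates therefore point to strongly correlated regressors, not to bias.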
-
Question
The university introduces mandatory Moodle quizzes in 2024 for one course (treated), while another similar course (control) continues without them. The table below reports the average exam scores recorded in each of the cases:
Compute the Difference-in-Differences (DiD) estimate of the effect of introducing Moodle quizzes. Which of the following answers is correct?
-
The DiD estimate is equal to: -3.6
-
The DiD estimate is equal to: 3.2
-
The DiD estimate is equal to: 3.6
-
None of the above statements is correct.
Solution
-
False
-
True
-
False
-
False
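The DiD estimate is the change over time in the treated course minus the change over time in the control course. The sketch below shows the arithmetic with hypothetical average scores, not the values from the question's table; the function name `did_estimate` is illustrative.

```python
def did_estimate(treated_before, treated_after, control_before, control_after):
    """Difference-in-differences: treated change minus control change."""
    treated_change = treated_after - treated_before
    control_change = control_after - control_before
    return treated_change - control_change


# Hypothetical average exam scores for illustration only (NOT the quiz data).
effect = did_estimate(
    treated_before=70.0,
    treated_after=78.0,
    control_before=68.0,
    control_after=72.0,
)
print(effect)
```

Subtracting the control group's change removes any common time trend, under the assumption that both courses would have evolved in parallel without the quizzes.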
-
Question
Which of the following statements about the abbreviation of BLUE in the context of the OLS estimator is correct? BLUE means…
-
Best Linear Unbiased Exception.
-
Biased Linear Unconditional Estimator.
-
Best Linear Unbiased Estimator.
-
Best Linear Unconditional Estimator.
-
Binary Linear Unbiased Estimator.
Solution
-
False.
-
False.
-
True.
-
False.
-
False.