Documentos de Académico
Documentos de Profesional
Documentos de Cultura
2 2
Goodness-of-Fit
We can think of each observation as being made
up of an explained part, and an unexplained part,
yi y i ui We then define the following :
2
i
Goodness-of-Fit (continued)
How do we think about how well our
sample regression line fits our sample data?
Can compute the fraction of the total sum
of squares (SST) that is explained by the
model, call this the R-squared of regression
R2 = SSE/SST = 1 SSR/SST
6
Goodness-of-Fit (continued)
We can also think of R 2 as being equal to
the squared correlation coefficient between
the actual yi and the values y i
y y y y
y y y y
2
x
x
i1
i1
x1 yi
x1
1 1
11
x x
x x
i1
i1
0
2
1 xi1 2 xi 2 ui
2 xi1 x1 xi 2 xi1 x1 ui
12
x x x x
x x x
i1
i1
i2
2
i1
i1
x1 ui
x1
~
E 1 1 2
x x x
x x
i1
i1
i2
2
13
~
~
so E 1 1 2 1
x x x
x x
i1
i1
i2
2
14
Positive bias
Negative bias
2 < 0
Negative bias
Positive bias
15
Var j
, where
2
SST j 1 R j
2
2
j
Misspecified Models
Consider again the misspecified model
~ ~
~
~
y 0 1 x1 , so that Var 1
SST1
~
Thus, Var Var unless x and
n k 1 SSR df
thus, se SST 1 R
2
2
i
j
2 12
j
df = n (k + 1), or df = n k 1
df (i.e. degrees of freedom) is the (number
of observations) (number of estimated
parameters)
25