
AP Statistics - Chapter 14 Review

Name __________________________________________________________
Part I - Multiple Choice (Questions 1-7) - Circle the answer of your choice.
1. A bivariate set of data relates the amount of annual salary raise and previous performance rating. The least-squares regression equation is ŷ = 1,400 + 2,000x, where ŷ is the estimated raise and x is the performance rating. Which of the following statements is not correct?

(a) For each increase of one point in performance rating, the raise will increase on average by $2,000.
(b) This equation produces predicted raises with an average error of 0.
(c) A rating of 0 will yield a predicted raise of $1,400.
(d) The correlation coefficient of the data is positive.
(e) All of the above are true.

2. A coefficient of determination is found to be 0.81. Which of the following is true?

(a) 81% of the variation between the variables is accounted for in the linear relationship.
(b) 81% of the data points lie on a line.
(c) The correlation coefficient is approximately 0.9.
(d) 19% of the variation between the variables is accounted for in the linear relationship.
(e) All of these are true.

3. If the 90% confidence interval for the slope of a regression line does not contain 0, then which of the following is a valid conclusion?

(a) The confidence interval is not valid.
(b) A significance test will not be significant at the 10% level.
(c) There is insufficient evidence to believe that the slope of the true regression line is 0.
(d) There is sufficient evidence to believe that the slope of the true regression line is 0.
(e) None of these is valid.

In questions 4-7, use the following printout of the linear regression relating the SAT Math scores of 200 randomly chosen college freshmen and their first semester GPAs.

The regression equation is
GPA = 1.53 + 0.00170 Math

Predictor   Coeff       StDev       T      P
Constant    1.5264      0.3981      3.83   0.000
Math        0.0016990   0.0006098   2.79   0.006

4. The value of SEb for this regression is:

(a) .0006098
(b) .0016990
(c) .006
(d) .3981
(e) 1.5264

5. The test statistic for a test of significance for a non-zero slope is:

(a) .0006098
(b) .3981
(c) 2.79
(d) 3.83
(e) None of these.

6. Which of the following is a valid conclusion that could be drawn from this regression analysis?

(a) There is not sufficient evidence to reject the hypothesis that β = 0.
(b) There is sufficient evidence to reject the hypothesis that β = 0.
(c) This test is not significant at the 1% level.
(d) Significance cannot be determined from this printout.
(e) None of these is a valid conclusion.

7. Which of the following is the 95% confidence interval for the population slope?

(a) (.0005, .0029)
(b) (.0129, .0211)
(c) (-.0170, .0340)
(d) (.0008, .0026)
(e) None of these.

Part II - Free Response (Questions 8-9) - Show your work and explain your results clearly.
8. A new process designed to increase the temperature inside steel girders shows great promise. In a test of 90 randomly selected girders, the following regression was performed; a partial computer printout is displayed:

Predictor   Coeff     StDev     T      P
Constant    0.2074    0.2318    0.89   0.373
Temp 1      1.05651   0.02221   ?      ?

S = 0.6009   R-Sq = 96.3%   R-Sq(adj) = 96.2%

Temp 1 is the initial temperature and Temp 2 is the temperature after the process has terminated.
(a) State the regression equation.
Predicted Temp 2 = 0.2074 + 1.05651(Temp 1)

(b) Interpret the slope of the regression in the context of the problem.
For each increase of one degree in Temp 1, we expect an increase of 1.05651 degrees in Temp 2 on average.

(c) Interpret the value of R-Sq in words.
96.3% of the variation in Temp 2 can be explained by the linear relationship with Temp 1.

(d) Find the values of T and P indicated by the question marks in the printout.

t = b/SEb = 1.05651/0.02221 = 47.5691
p = 2tcdf(47.5691, 1e99, 88) = 1.4459 × 10^-64, with df = 90 - 2 = 88
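
The t statistic in part (d) is the slope estimate divided by its standard error, with n - 2 degrees of freedom. A minimal Python sketch of that arithmetic, using the values from the printout (the variable names are illustrative and not part of the original problem):

```python
# Values read from the Temp 1 row of the printout (n = 90 girders).
slope = 1.05651      # Coeff for Temp 1
se_slope = 0.02221   # StDev (standard error of the slope) for Temp 1
n = 90               # number of girders in the sample

t_stat = slope / se_slope   # test statistic for Ho: slope = 0
df = n - 2                  # degrees of freedom in simple linear regression

print(f"t = {t_stat:.4f}, df = {df}")
```

The two-sided P-value then comes from the t distribution with these degrees of freedom, as in the calculator call above.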

9. A midterm exam in Applied Mathematics consists of problems in 8 topical areas. One of the teachers believes that the most important of these, and the best indicator of overall performance, is the section on problem solving. She analyses the scores of 36 randomly chosen students using computer software and produces the following printout relating the total score to the problem solving subscore, ProbSolv:

Predictor   Coeff     StDev     T      P
Constant    12.960    6.228     2.08   0.045
ProbSolv    4.0162    0.5393    7.45   0.000

S = 11.09   R-Sq = 62.0%   R-Sq(adj) = 60.9%

(a) What is the predicted Total score if the ProbSolv score was 20?

ŷ = 12.960 + 4.0162x; when x = 20, ŷ = 93.284


(b) What is the residual for the data point (10, 55) ?

residual = observed - predicted
55 - 53.122 = 1.878
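
The prediction and residual calculations for this problem can be checked with a short Python sketch. The function name `predict_total` is a hypothetical helper introduced here for illustration; the coefficients are the ones in the printout.

```python
def predict_total(prob_solv):
    """Predicted Total score from the fitted line in the printout."""
    intercept = 12.960   # Coeff for Constant
    slope = 4.0162       # Coeff for ProbSolv
    return intercept + slope * prob_solv

# Predicted Total score for a ProbSolv score of 20.
predicted_at_20 = predict_total(20)

# Residual for the data point (10, 55): observed minus predicted.
residual = 55 - predict_total(10)

print(round(predicted_at_20, 3), round(residual, 3))
```

A positive residual means the observed total score sits above the regression line at that ProbSolv value.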

(c) Calculate the 95% confidence interval of the slope of the regression line for all Applied Mathematics students. We may assume conditions for inference are met.

b ± t* SEb = 4.0162 ± 2.042(0.5393) = (2.91495, 5.11745)

Here t* = 2.042 is the table value for df = 30, the nearest row below df = 36 - 2 = 34.

We are 95% confident that the true slope, the average change in Total score per one-point increase in the problem-solving score, falls in the interval from 2.91495 to 5.11745. Since zero isn't in the interval, we have evidence of a positive linear relationship.
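
The interval in part (c) is the slope estimate plus or minus t* times its standard error. A minimal Python sketch of that calculation, assuming the critical value t* = 2.042 used in the worked answer (variable names are illustrative):

```python
# Values from the ProbSolv row of the printout.
slope = 4.0162       # Coeff for ProbSolv
se_slope = 0.5393    # StDev (standard error of the slope)
t_star = 2.042       # critical value taken from the worked answer (assumed, not computed here)

margin = t_star * se_slope   # margin of error for the slope
lower = slope - margin
upper = slope + margin

# If 0 lies outside the interval, the data are inconsistent with a zero slope.
contains_zero = lower <= 0 <= upper

print(f"({lower:.5f}, {upper:.5f}), contains zero: {contains_zero}")
```

A different critical value (for example, one computed for exactly 34 degrees of freedom) would change the endpoints slightly but not the conclusion.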

(d) Use the information provided to test whether there is a significant relationship between the problem solving subsection and the total score
at the 5% level. We may assume conditions necessary for inference are met.
Ho: β = 0
Ha: β ≠ 0
Results for a LinReg t-test: t = 7.45, p ≈ 0 (0.000 on the printout), df = 34, α = .05

Conclusion: Since p < α, the results are significant and we reject Ho. There is sufficient evidence to believe that the section on problem solving predicts the total score on average.

(e) Are the decisions reached through the construction of the confidence interval and through the use of a significance test consistent? Explain the reasons for your answer.
These two forms of inference are consistent. The confidence interval does not contain zero, indicating a linear relationship. In the t-test, we reject Ho: β = 0, also indicating a linear relationship.
