AP Stat Quiz on Chapter 2


Name:

1.  A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. What is the explanatory variable in this study?

A.   yield of the crop
B.   amount of yearly rainfall
C.   the experimenter
D.   either bushels or inches of water

2.  When are the variables said to be positively associated?

A.   When above average values of one variable tend to accompany below average values of the other.
B.   When above average values of one variable tend to accompany above average values of the other.
C.   When below average values of one variable tend to accompany above average values of the other.
D.   When below average values of one variable can be accompanied by either above or below average values of the other.

3.  A researcher is interested in determining if one could predict the score on a statistics exam from the amount of time spent studying for the exam. In this study, what is the explanatory variable?

A.   the researcher
B.   the amount of time spent studying for the exam
C.   the score on the exam
D.   the fact that this is a statistics exam

4.  When water flows across farm land, some of the soil is washed away, resulting in erosion. An experiment was conducted to investigate the effect of the rate of water flow on the amount of soil washed away. Flow is measured in liters per second and the eroded soil is measured in kilograms. The data are given in the following table:

Flow rate .31 .85 1.26 2.47 3.75
Eroded Soil .82 1.95 2.18 3.01 6.07

What is the association between flow rate and amount of eroded soil?

A.   positive.
B.   negative.
C.   neither positive nor negative.
D.   impossible to determine since both variables are categorical.

5.  A student wonders if people of similar heights tend to date each other. She measures herself, her dormitory roommate, and the women in the adjoining rooms; then she measures the next man each woman dates. Here are the data (heights in inches):

Women 66 64 66 65 70 65
Men 72 68 70 68 74 69
Which of the following statements is true?

A.   The variables measured are all categorical.
B.   There is a strong negative association between the heights of men and women, since the women are always smaller than the men they date.
C.   There is a positive association between the heights of men and women given.
D.   Any height above 70 inches must be considered an outlier.

6.  In baseball, power hitters are sometimes thought of as being large and slow. Base stealers are often viewed as smaller and faster players. If this is true, one might expect teams that hit many home runs to have relatively few stolen bases and teams that steal many bases to hit relatively few home runs. Suppose we wish to investigate how well the number of home runs hit by a team predicts the number of stolen bases by the team. What is the explanatory variable?

A.   number of home runs hit by a team
B.   the number of stolen bases by a team
C.   a categorical variable
D.   the same as the response variable

7.  Here is a scatterplot of the number of home runs versus the number of stolen bases for each of the teams in the major leagues in 1993.

Which statement best describes what this scatterplot shows?

A.   There is a perfect association between number of home runs and number of stolen bases.
B.   There is a strong positive association between number of home runs and number of stolen bases.
C.   There is a weak negative association between number of home runs and number of stolen bases.
D.   Home runs is a categorical variable.

8.  Below is a scatterplot of number of home runs vs. number of stolen bases for Major League teams in 1993. American League teams are represented by x's and National League teams by circles.

Which statement best describes what this scatterplot shows?

A.   There is a positive association for American League teams, but a negative association for National League teams.
B.   There is a negative association for American League teams, but a positive association for National League teams.
C.   There is a weak negative association for both leagues.
D.   All American League teams hit more home runs than National League teams, while all National League teams stole more bases than American League teams.

9.  Which option is a scatterplot of the data in the following small, artificial data set?

Y
(response variable)
X
(explanatory variable)
9
0
4
1
1
2
0
3
1
4
4
5
9
6

A.  

B.  

C.  

D.  

10.  What does the correlation coefficient measure?

A.   whether there is a relation between two variables
B.   whether or not a scatterplot shows an interesting pattern
C.   whether a cause and effect relation exists between two variables
D.   the strength of any straight line relation between two variables

11.  Which option is a plausible value for the correlation coefficient between weight and MPG in the following scatterplot?

A.   +0.2
B.   -0.9
C.   +0.7
D.   -1.0

12.  Which of the following statements is true?

A.   The correlation coefficient equals the proportion of times two variables lie on a straight line.
B.   The correlation coefficient will be +1.0 only if all the data lie on a perfectly horizontal straight line.
C.   The correlation coefficient measures the fraction of outliers that appear in a scatterplot.
D.   The correlation coefficient is a unitless number and must always lie between -1.0 and +1.0

13.  What is the correlation r for the data in the following small, artificial data set?

Y
(response variable)
X
(explanatory variable)
9
0
4
1
1
2
0
3
1
4
4
5
9
6

A.   0.0
B.   1.0
C.   -1.0
D.   It is outside the usual range of -1.0 to 1.0.

14.  For which of the following small data sets is the correlation r between the x and y values equal to -1.0?

A.  
x: 1 2 3
y: 2 4 6
B.  
x: 1 2 3
y: 3 2 1
C.  
x: 1 2 3
y: -3 -2 -1
D.  
x: 1 2 3
y: 1 2 3

15.  Which of the following statements is correct?

A.   Changing the units of measurements of x or y does not change the value of the correlation r.
B.   A negative value for the correlation r indicates the data are strongly unassociated.
C.   The correlation always has the same units as the x variable, but not the y variable.
D.   The correlation always has the same units as the y variable, but not the x variable.

16.  Which of the following statements is correct?

A.   Faculty who are good researchers tend to be poor teachers and vice-versa, so the correlation between teaching and research is 0.
B.   Women tend to be, on average, about 3.5 inches shorter than the men they marry, so the correlation between the heights of spouses must be negative.
C.   A researcher finds the correlation between the shoe size of children and their score on a reading test to be 0.22. The researcher must have made a mistake since these two variables are clearly unrelated and must have correlation 0.
D.   If people with larger heads tend to be more intelligent then we would expect the correlation between head size and intelligence to be positive.

17.  Two Olympic gymnasts tie on every event, each always getting the exact same score as the other. What is the correlation between the scores of these contestants?

A.   1.0
B.   0.0
C.   -1.0
D.   It is impossible to compute without knowing the precise values of the scores.

18.  Consider the following scatterplot of amounts of CO (carbon monoxide) and NOX (nitrogen oxide) in grams per mile driven, in the exhausts of cars. The least squares regression line has been drawn in the plot.

A.   -1.0
B.   -0.73
C.   0.49
D.   0.0

19.  In the above scatterplot, the slope of the least squares line fitted to these data would be which of the following?

A.   negative
B.   positive
C.   influential
D.   both positive and negative

20.  In the above scatterplot, the least squares line would predict that a car which emits 10 grams of CO per mile driven would emit how many grams of NOX per mile driven?

A.   10.0
B.   0.7
C.   2.2
D.   1.1

21.  If removing an observation from a data set would have a marked change on the position of the least-squares regression line fit to the data, what is the point called?

A.   robust
B.   a residual
C.   influential
D.   a response

22.  In regression, the residuals are which of the following?

A.   those factors unexplained by the data
B.   the difference between the observed responses and the values predicted by the regression line
C.   those data points which were recorded after the formal investigation was completed
D.   possible models unexplored by the experimenter

23.  What does the squared correlation coefficient, r2, measure?

A.   the slope of the least squares regression line
B.   the intercept of the least squares regression line
C.   the extent to which cause and effect is present in the data
D.   the fraction of the variation of one variable that is explained by least squares regression on the other

24.  Below are the Olympic gold medal performances in the men's high jump from 1948 to 1984. High jump heights are in inches.

High jump Year
78.00
1948
80.32
1952
83.25
1956
85.00
1960
85.75
1964
88.25
1968
87.75
1972
88.50
1976
92.75
1980
92.50
1984

If we compute the least squares regression line of high jump height (y) on year (x), we find it has equation:

= -671.924 + 0.3856x

Referring to the above data, predicting the winning high jump height in the year 2200 using the regression equation given is an example of what?

A.   an influential observation
B.   extrapolation
C.   a lurking variable
D.   causation

25.  Suppose the correlation between two variables x and y is due to the fact that both are responding to changes in some unobserved third variable. What is this due to?

A.   cause and effect between x and y
B.   the effect of a lurking variable
C.   extrapolation
D.   common sense


Made with with Fr. Chris' HTML Quiz Maker