Questions AP4.38 and AP4.39 refer to the following situation. Do children’s fear levels change over time and, if so, in what ways? Little research has been done on the prevalence and persistence of fears in children. Several years ago, two researchers surveyed a randomly selected group of 94 third- and fourth-grade children, asking them to rate their level of fearfulness about a variety of situations. Two years later, the children again completed the same survey. The researchers computed the overall fear rating for each child in both years and were interested in the relationship between these ratings. They then assumed that the true regression line was μlater rating=β0+β1(initial rating) and that the assumptions for regression inference were satisfied. This model was fitted to the data using least-squares regression. The following results were obtained from statistical software.

Here is a scatterplot of the later ratings versus the initial ratings and a plot of the residuals versus the initial ratings:

AP4.38 Which of the following statements is supported by these plots?
a. The abundance of outliers and influential observations in the plots means that the assumptions for regression are clearly violated.
b. These plots contain dramatic evidence that the standard deviation of the response about the true regression line is not approximately the same for each x -value.
c. These plots call into question the validity of the assumption that the later ratings vary Normally about the least-squares line for each value of the initial ratings.
d. A linear model isn’t appropriate here because the residual plot shows no association.
e. There is no striking evidence that the assumptions for regression inference are violated

Short Answer

Expert verified

The correct answer is option (a) The abundance of outliers and influential observations in the plots means that the assumptions for regression are clearly violated.

Step by step solution

01

Given information

To determine which of the statements is supported by these plots.

02

Explanation

The distribution between the actual and fitted response values is depicted in a residual plot. A random dispersion of points forms an approximately constant width band around the identity line in the ideal residual plot, also known as the null residual plot.
The researchers are interested in learning if and how the children's terror levels vary over time. In the question, there is a scatterplot.
There is no evident pattern in the residual plot, and neither the scatterplot nor the residual plot appear to have any outliers.
As a result, answer (a) is accurate, because the regression inference assumptions are not violated.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Lamb’s quarters is a common weed that interferes with the growth of corn. An agriculture researcher planted corn at the same rate in 16small plots of ground and then weeded the plots by hand to allow a fixed number of lamb’s quarters plants to grow in each meter of cornrow. The decision on how many of these plants to leave in each plot was made at random. No other weeds were allowed to grow. Here are the yields of corn (bushels per acre) in each of the plots:


Here is some computer output from a least-squares regression analysis of these data. Do these data provide convincing evidence at the α=0.05level that more lamb’s quarters reduce corn yield?


PredictorCoefSECoefTPConstant166.4832.72561.110.000Weedsper1.09870.57121.920.075meterS=7.97665R-Sq=20.9%R-Sq(adj)=15.3%

Beer and BAC How well does the number of beers a person drinks predict his or her blood alcohol content (BAC)? Sixteen volunteers aged 21or older with an initial BAC of 0took part in a study to find out. Each volunteer drank a randomly assigned number of cans of beer. Thirty minutes later, a police officer measured their BAC. A least-squares regression analysis was performed on the data using x=number of beers and y=BAC. Here is a residual plot and a histogram of the residuals. Check whether the conditions for performing inference about the regression model are met.

a. Find the critical value for a 99%confidence interval for the slope of the true regression line. Then calculate the confidence interval.

b. Interpret the interval from part (a).

c. Explain the meaning of “localid="1654184305701" 99%confident” in this context

Here is computer output from the least-squares regression analysis of the beer and blood alcohol dat

T12.9 Which of the following would provide evidence that a power model of the form y=axp, wherep0and p1, describes the relationship between a response variable y and an explanatory variable x?
a. A scatterplot of y versus x looks approximately linear.
b. A scatterplot of Iny versus x looks approximately linear.
c. A scatterplot of y versus lnx looks approximately linear.
d. A scatterplot of Iny versus lnx looks approximately linear.
e. None of these

Of the 98teachers who responded, 23.5%said that they had one or more tattoos.

a. Construct and interpret a 95%confidence interval for the true proportion of all teachers at the AP institute who would say they have tattoos.

b. Does the interval in part (a) provide convincing evidence that the proportion of all teachers at the institute who would say they have tattoos is different from 0.29. (the value cited in the Harris Poll report)? Justify your answer.

c. Two of the selected teachers refused to respond to the survey. If both of these teachers had responded, could your answer to part (b) have changed? Justify your answer.

Exercises T12.4–T12.8 refer to the following setting. An old saying in golf is “You drive for show and you putt for dough.” The point is that good putting is more important than long driving for shooting low scores and hence winning money. To see if this is the case, data from a random sample of 69 of the nearly 1000 players on the PGA Tour’s world money list are examined. The average number of putts per hole (fewer is better) and the player’s total winnings for the previous season are recorded and a least-squares regression line was fitted to the data. Assume the conditions for
inference about the slope are met. Here is computer output from the regression analysis:

T12.6 The P -value for the test in Exercise T12.5 is 0.0087. Which of the following is a correct interpretation of this result?
a. The probability there is no linear relationship between average number of putts per hole and total winnings for these 69 players is 0.0087.
b. The probability there is no linear relationship between average number of putts per hole and total winnings for all players on the PGA Tour’s world money list is 0.0087.
c. If there is no linear relationship between average number of putts per hole and total winnings for the players in the sample, the probability of getting a random sample of 69 players that yields a least-squares regression line with a slope of −4,139,198 or less is 0.0087.
d. If there is no linear relationship between average number of putts per hole and total winnings for the players on the PGA Tour’s world money list, the probability of getting a random sample of 69 players that yields a least-squares regression line with a slope of −4,139,198 or less is 0.0087.
e. The probability of making a Type I error is 0.0087.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free