Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed duration and interval after times, find the best predicted “interval after” time for an eruption with a duration of 253 seconds. How does it compare to an actual eruption with a duration of 253 seconds and an interval after time of 83 minutes?

Short Answer

Expert verified

The regression equation is\(\hat y = 90.190 + 0.007X\)

The best predicted ‘interval after’ time for an eruption with a duration of 253 seconds will be approximately 92 (minutes).

There is an error of approximately 9 minutes in prediction. This is because the actual eruption with a duration of 253 seconds and an ‘interval after’ time is 83 minutes.

Step by step solution

01

Given information

The given data render the ‘interval after’ (in minutes) and eruption with the duration as follows.

The actual eruption with a duration of 253 seconds has an ‘interval after’ value of 83 minutes.

02

State the estimated regression line

The formula for the estimated regression line is

\(y = {b_0} + {b_1}x\),

where

\({b_0}\)is the Y-intercept,

\({b_1}\)is the slope,

\(x\)is the explanatory variable, and

\(\hat y\)is the response variable (predicted value).

Let X denote the duration (in seconds), and Y denote the ‘interval after’ (in minutes).

03

Compute the slope and intercept

The calculations required to compute the slope and intercept are as follows.

The sample size \(\left( n \right) = 7\).

The slope is computed as

\(\begin{array}{c}{b_1} = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{7 \times 145348 - 1584 \times 642}}{{7 \times 369212 - {{1584}^2}}}\\ = 0.006735\\ \approx 0.00673\end{array}\).

The intercept is computed as

\(\begin{array}{c}{b_0} = \frac{{\left( {\sum y } \right)\left( {\sum {{x^2}} } \right) - \left( {\sum x } \right)\left( {\sum {xy} } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{642 \times 369212 - 1584 \times 145348}}{{7 \times 369212 - {{1584}^2}}}\\ = 90.19027\\ \approx 90.190\end{array}\).

So, the estimated regression equation is

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 90.2 + 0.00673x\end{array}\)

04

Check the model

Refer to exercise 21 of section 10-1 for the following result.

1) The scatter plot does not show an approximate linear relationship between the variables.

2)The P-value is 0.921.

As theP-value is greater than the level of significance (0.05), the null hypothesis fails to be rejected.

Therefore, the correlation is not significant.

Referring to figure 10-5, the criteria for a good regression model are not satisfied.

As the model is bad, the best-predicted value of a variable is its sample mean.

05

Compute the predicted value

The best-predicted interval after times for an eruption with a duration of 253 seconds is required to be obtained.

As this is a bad model, the sample mean of the response variable will be used to predict the value.

The sample meanfor the response variable is

\(\begin{array}{c}\bar y = \frac{{\sum {{y_i}} }}{n}\\ = \frac{{91 + 81 + ... + 91}}{7}\\ = 91.7143\end{array}\).

Therefore, the best-predicted interval after time for an eruption with a duration of 253 seconds will be approximately 91.7 minutes.

Thus, the actual value of 83 minutes differs significantly from 91.7 minutes.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Tips Listed below are amounts of bills for dinner and the amounts of the tips that were left. The data were collected by students of the author. Is there sufficient evidence to conclude that there is a linear correlation between the bill amounts and the tip amounts? If everyone were to tip with the same percentage, what should be the value of r?

Bill(dollars)

33.46

50.68

87.92

98.84

63.6

107.34

Tip(dollars)

5.5

5

8.08

17

12

16

Explore! Exercises 9 and 10 provide two data sets from “Graphs in Statistical Analysis,” by F. J. Anscombe, the American Statistician, Vol. 27. For each exercise,

a. Construct a scatterplot.

b. Find the value of the linear correlation coefficient r, then determine whether there is sufficient evidence to support the claim of a linear correlation between the two variables.

c. Identify the feature of the data that would be missed if part (b) was completed without constructing the scatterplot.

x

10

8

13

9

11

14

6

4

12

7

5

y

9.14

8.14

8.74

8.77

9.26

8.10

6.13

3.10

9.13

7.26

4.74

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Use the CPI/subway fare data from the preceding exercise and find

the best predicted subway fare for a time when the CPI reaches 500. What is wrong with this prediction?

Super Bowl and\({R^2}\)Let x represent years coded as 1, 2, 3, . . . for years starting in 1980, and let y represent the numbers of points scored in each Super Bowl from 1980. Using the data from 1980 to the last Super Bowl at the time of this writing, we obtain the following values of\({R^2}\)for the different models: linear: 0.147; quadratic: 0.255; logarithmic: 0.176; exponential: 0.175; power: 0.203. Based on these results, which model is best? Is the best model a good model? What do the results suggest about predicting the number of points scored in a future Super Bowl game?

Outlier Refer to the accompanying Minitab-generated scatterplot. a. Examine the pattern of all 10 points and subjectively determine whether there appears to be a correlation between x and y. b. After identifying the 10 pairs of coordinates corresponding to the 10 points, find the value of the correlation coefficient r and determine whether there is a linear correlation. c. Now remove the point with coordinates (10, 10) and repeat parts (a) and (b). d. What do you conclude about the possible effect from a single pair of values?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free