Exercises 13–28 use the same data sets as Exercises 13–28

in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed width/weight data, find the best predicted weight of a seal if the overhead width measured from a photograph is 2 cm. Can the prediction be correct? If not, what is wrong?

Overhead Width

7.2

7.4

9.8

9.4

8.8

8.4

Weight

116

154

245

202

200

191

Short Answer

Expert verified

The regression equation is\(\hat y = - 157 + 40.2x\).

The best predicted weight of a seal if the overhead width measured from a photograph is 2 cm is -76.6.

The predicted weight is negative which is not possible in general. Also, the prediction is exptrapolated.

Step by step solution

01

Given informationa

Values are given on two variables namely, overhead width and weight.

02

Calculate the mean values

Let x represent the overhead width.

Let y represent the weight

Themean value of xis given as,

\(\begin{array}{c}\bar x = \frac{{\sum\limits_{i = 1}^n {{x_i}} }}{n}\\ = \frac{{7.2 + 7.4 + .... + 8.4}}{6}\\ = 8.5\end{array}\)

Therefore, the mean value of x is 8.5.

Themean value of yis given as,

\(\begin{array}{c}\bar y = \frac{{\sum\limits_{i = 1}^n {{y_i}} }}{n}\\ = \frac{{116 + 154 + .... + 191}}{6}\\ = 184.667\end{array}\)

Therefore, the mean value of y is 184.67.

03

Calculate the standard deviation of x and y

The standard deviation of xis given as,

\(\begin{array}{c}{s_x} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({x_i} - \bar x)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {7.2 - 8.5} \right)}^2} + {{\left( {7.4 - 8.5} \right)}^2} + ..... + {{\left( {8.4 - 8.5} \right)}^2}}}{{6 - 1}}} \\ = 1.049\end{array}\)

Therefore, the standard deviation of x is 1.05.

The standard deviation of y is given as,

\(\begin{array}{c}{s_y} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({y_i} - \bar y)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {116 - 184.667} \right)}^2} + {{\left( {154 - 184.667} \right)}^2} + ..... + {{\left( {191 - 184.667} \right)}^2}}}{{6 - 1}}} \\ = 44.433\end{array}\)

Therefore, the standard deviation of y is 44.43.

04

Calculate the correlation coefficient

The correlation coefficient is given as,

\(r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\)

The calculations required to compute the correlation coefficient are as follows:

The correlation coefficient is given as,

\(\begin{array}{l}r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\\ = \frac{{6\left( {9639} \right) - \left( {51} \right)\left( {1108} \right)}}{{\sqrt {\left( {\left( {6 \times 439} \right) - {{\left( {51} \right)}^2}} \right)\left( {\left( {6 \times 214482} \right) - {{\left( {1108} \right)}^2}} \right)} }}\\ = 0.9485\end{array}\)

Therefore, the correlation coefficient is 0.9485.

05

Calculate the slope of the regression line

The slopeof the regression line is given as,

\(\begin{array}{c}{b_1} = r\frac{{{s_Y}}}{{{s_X}}}\\ = 0.9485 \times \frac{{44.433}}{{1.049}}\\ = 40.2\end{array}\)

Therefore, the value of slope is 40.2.

06

Calculate the intercept of the regression line

The interceptis computed as,

\(\begin{array}{c}{b_0} = \bar y - {b_1}\bar x\\ = 184.667 - \left( {40.176 \times 8.5} \right)\\ = - 156.829\\ \approx - 157\end{array}\)

Therefore, the value of intercept is -157.

07

Form a regression equation

Theregression equationis given as,

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = - 157 + 40.2x\end{array}\)

Thus, the regression equation is \(\hat y = - 157 + 40.2x\).

08

Analyze the model

Referring to exercise 23 of section 10-1,

1)The scatter plot shows a linear relationship between the variables.

2)The P-value is 0.004.

As the P-value is less than the level of significance (0.05), this implies the null hypothesis is rejected.

Therefore, the correlation is significant.

Referring to figure 10-5, the criteria for a good regression model are satisfied.

Therefore, the regression equation can be used to predict the value of y.

09

Predict the value

The best predicted weight of a seal if the overhead width measured from a photograph is 2 cm is computed as,

\(\begin{array}{c}\hat y = - 157 + \left( {40.2 \times 2} \right)\\ = - 76.6\end{array}\)

Therefore, the best predicted weight of a seal if the overhead width measured from a photograph is 2 cm is -76.6 kg.

10

State if the prediction is correct

Thepredicted weight is negativewhich is not possible in general.

Also, the measure 2 cm is extreme as compared to observations of the sampled responses. Thus, the prediction is not suggested to be made as it would lead to extrapolation resulting in inaccurate prediction.

Therefore, the prediction is not correct.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Confidence Intervals for a Regression Coefficients A confidence interval for the regression coefficient b1 is expressed

\(\begin{array}{l}{b_1} - E < {\beta _1} < {b_1} + E\\\end{array}\)

Where

\(E = {t_{\frac{\alpha }{2}}}{s_{{b_1}}}\)

The critical t score is found using n –(k+1) degrees of freedom, where k, n, and sb1 are described in Exercise 17. Using the sample data from Example 1, n = 153 and k = 2, so df = 150 and the critical t scores are \( \pm \)1.976 for a 95% confidence level. Use the sample data for Example 1, the Stat diskdisplay in Example 1 on page 513, and the Stat Crunchdisplay in Exercise 17 to construct 95% confidence interval estimates of \({\beta _1}\) (the coefficient for the variable representing height) and\({\beta _2}\) (the coefficient for the variable representing waist circumference). Does either confidence interval include 0, suggesting that the variable be eliminated from the regression equation?

Outlier Refer to the accompanying Minitab-generated scatterplot. a. Examine the pattern of all 10 points and subjectively determine whether there appears to be a correlation between x and y. b. After identifying the 10 pairs of coordinates corresponding to the 10 points, find the value of the correlation coefficient r and determine whether there is a linear correlation. c. Now remove the point with coordinates (10, 10) and repeat parts (a) and (b). d. What do you conclude about the possible effect from a single pair of values?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CPI and the Subway Use CPI>subway data from the preceding exercise to determine whether there is a significant linear correlation between the CPI (Consumer Price Index) and the subway fare.

Effects of Clusters Refer to the Minitab-generated scatterplot given in Exercise 12 of Section 10-1 on page 485.

a. Using the pairs of values for all 8 points, find the equation of the regression line.

b. Using only the pairs of values for the 4 points in the lower left corner, find the equation of the regression line.

c. Using only the pairs of values for the 4 points in the upper right corner, find the equation of the regression line.

d. Compare the results from parts (a), (b), and (c).

Interpreting a Computer Display. In Exercises 9–12, refer to the display obtained by using the paired data consisting of Florida registered boats (tens of thousands) and numbers of manatee deaths from encounters with boats in Florida for different recent years (from Data Set 10 in Appendix B). Along with the paired boat, manatee sample data, StatCrunch was also given the value of 85 (tens of thousands) boats to be used for predicting manatee fatalities.

Testing for Correlation Use the information provided in the display to determine the value of the linear correlation coefficient. Is there sufficient evidence to support a claim of a linear correlation between numbers of registered boats and numbers of manatee deaths from encounters with boats?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free