Regression and Predictions. Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed actress/actor ages, find the best predicted age of the Best Actor given that the age of the Best Actress is 54 years. Is the result reasonably close to the Best Actor’s (Eddie Redmayne) actual age of 33 years, which happened in 2015, when the Best Actress was Julianne Moore, who was 54 years of age?

Short Answer

Expert verified

The regression equation is\(\hat y = 51.6 - 0.165x\).

The best predicted age of the Best Actor given that the age of the Best Actress is 54 years is 45 years.

Step by step solution

01

Given information

Values are given on two variables namely, actress age and actor age.

02

Calculate the mean values

Let x represent the age of Best Actress.

Let y represent the age of Best Actor.

Themean value of xis given as,

\(\begin{array}{c}\bar x = \frac{{\sum\limits_{i = 1}^n {{x_i}} }}{n}\\ = \frac{{28 + 30 + .... + 54}}{{12}}\\ = 39.083\end{array}\)

Therefore, the mean value of x is 39 years.

Themean value of yis given as,

\(\begin{array}{c}\bar y = \frac{{\sum\limits_{i = 1}^n {{y_i}} }}{n}\\ = \frac{{43 + 37 + .... + 33}}{{12}}\\ = 45.167\end{array}\)

Therefore, the mean value of y is 45 years.

03

Calculate the standard deviation of x and y

The standard deviation of x is given as,

\(\begin{array}{c}{s_x} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({x_i} - \bar x)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {28 - 39.083} \right)}^2} + {{\left( {30 - 39.083} \right)}^2} + ..... + {{\left( {54 - 39.083} \right)}^2}}}{{12 - 1}}} \\ = 13.734\end{array}\)

Therefore, the standard deviation of x is 13.734.

The standard deviation of yis given as,

\(\begin{array}{c}{s_y} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({y_i} - \bar y)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {43 - 45.167} \right)}^2} + {{\left( {37 - 45.167} \right)}^2} + ..... + {{\left( {33 - 45.167} \right)}^2}}}{{12 - 1}}} \\ = 7.872\end{array}\)

Therefore, the standard deviation of y is 7.872.

04

Calculate the correlation coefficient

Thecorrelation coefficientis given as,

\(r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\)

The calculations required to compute the correlation coefficient are as follows:

The correlation coefficient is given as,

\(\begin{array}{l}r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\\ = \frac{{12\left( {20841} \right) - \left( {469} \right)\left( {542} \right)}}{{\sqrt {\left( {\left( {12 \times 20405} \right) - {{\left( {469} \right)}^2}} \right)\left( {\left( {12 \times 25162} \right) - {{\left( {542} \right)}^2}} \right)} }}\\ = - 0.288\end{array}\)

Therefore, the correlation coefficient is -0.288.

05

Calculate the slope of the regression line

The slopeof the regression line is given as,

\(\begin{array}{c}{b_1} = r\frac{{{s_Y}}}{{{s_X}}}\\ = - 0.288 \times \frac{{7.872}}{{13.734}}\\ = - 0.165\end{array}\)

Therefore, the value of slope is - 0.165.

06

Calculate the intercept of the regression line

The interceptis computed as,

\(\begin{array}{c}{b_0} = \bar y - {b_1}\bar x\\ = 45.167 - \left( { - 0.165 \times 39.083} \right)\\ = 51.6\end{array}\)

Therefore, the value of intercept is 51.6.

07

Form a regression equation

Theregression equationis given as,

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 51.6 - 0.165x\end{array}\)

Thus, the regression equation is \(\hat y = 51.6 - 0.165x\).

08

Analyze the regression model

Referring to exercise 21 of section 10-1 for the following result

1)The scatter plot shows approximately a linear relationship between the variables.

2)The P-value is 0.364.

As the P-value is greater than the level of significance (0.05), this implies the null hypothesis fails to reject.

Therefore, the correlation is not significant.

Referring to Figure 10-5, thecriteria for a good regression model are not satisfied.

Therefore, the regression equation cannot be used to predict the value of y.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CSI Statistics Use the paired foot length and height data from the preceding exercise. Is there sufficient evidence to conclude that there is a linear correlation between foot lengths and heights of males? Based on these results, does it appear that police can use foot length to estimate the height of a male?

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

What is the difference between the regression equation\(\hat y = {b_0} + {b_1}x\)and the regression equation\(y = {\beta _0} + {\beta _1}x\)?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Sports Diameters (cm), circumferences (cm), and volumes (cm3) from balls used in different sports are listed in the table below. Is there sufficient evidence to conclude that there is a linear correlation between diameters and circumferences? Does the scatterplot confirm a linear association?


Diameter

Circumference

Volume

Baseball

7.4

23.2

212.2

Basketball

23.9

75.1

7148.1

Golf

4.3

13.5

41.6

Soccer

21.8

68.5

5424.6

Tennis

7

22

179.6

Ping-Pong

4

12.6

33.5

Volleyball

20.9

65.7

4780.1

Softball

9.7

30.5

477.9

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fentanyl) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain Intensity Before Duragesic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6










Pain Intensity After Duragesic Treatment

0.4

1.4

1.8

2.9

6

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8

6.8

2.3

0.4

0.7

1.2

4.5

2

1.6

2

2

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1










Two Independent Samples The methods of Section 9-2 can be used to test the claim that two populations have the same mean. Identify the specific claim that the treatment is effective, then use the methods of Section 9-2 to test that claim. The methods of Section 9-2 are based on the requirement that the samples are independent. Are they independent in this case?

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Internet and Nobel Laureates Find the best predicted Nobel Laureate rate for Japan, which has 79.1 Internet users per 100 people. How does it compare to Japan’s Nobel Laureate rate of 1.5 per 10 million people?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free