In exercise 10-1 12. Clusters Refer to the following Minitab-generated scatterplot. The four points in the lower left corner are measurements from women, and the four points in the upper right corner are from men.

a. Examine the pattern of the four points in the lower left corner (from women) only, and subjectively determine whether there appears to be a correlation between x and y for women.

b. Examine the pattern of the four points in the upper right corner (from men) only, and subjectively determine whether there appears to be a correlation between x and y for men.

c. Find the linear correlation coefficient using only the four points in the lower left corner (for women). Will the four points in the upper left corner (for men) have the same linear correlation coefficient?

d. Find the value of the linear correlation coefficient using all eight points. What does that value suggest about the relationship between x and y?

e. Based on the preceding results, what do you conclude? Should the data from women and the data from men be considered together, or do they appear to represent two different and distinct populations that should be analyzed separately?

Short Answer

Expert verified

a.There seems to be no linear correlation among x and y for women.

b. There seems to be no linear correlation among x and y for men.

c. The correlation coefficient between x and y for women is 0. The value of the correlation coefficient between x and y is the same for men.

d.The correlation between x and y when all the 8 points are taken into account is equal to 0.985.

e. The points corresponding to women and men appear to represent two different and distinct populations and should be analyzed separately.

Step by step solution

01

Given Information

A scatterplot is constructed for the pairs of measurements for women and men.

02

Examination of  linear correlation

a.

Since the points on the lower-left corner do not seem to follow an increasing straight-line trend or a decreasing straight-line trend, there seems to be no linear correlation among x and y for women.

b.

Similarly, the points on the upper right corner neither follow an increasing straight-line trend or a decreasing straight-line trend, there seems to be no linear correlation among x and y for men.

03

Computation of correlation coefficient

c.

To calculate the correlation coefficient between x and y for women, the following computations are made:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

1

1

1

1

1

1

2

1

4

2

2

1

4

1

2

2

2

4

4

4

\(\sum x = 6\)\(\)

\(\sum y = 6\)

\(\sum {x^2} = 10\)

\(\sum {y^2} = 10\)

\(\sum xy = 9\)

The formula to calculate correlation coefficient r is given by

\(\begin{aligned} r &= \frac{{n\sum xy - \left( {\sum x} \right)\left( {\sum y} \right)}}{{\sqrt {n\left( {\sum {x^2}} \right) - {{\left( {\sum x} \right)}^2}} \sqrt {n\left( {\sum {y^2}} \right) - {{\left( {\sum y} \right)}^2}} }}\\ &= \frac{{4\left( 9 \right) - \left( 6 \right)\left( 6 \right)}}{{\sqrt {4\left( {10} \right) - {{\left( 6 \right)}^2}} \sqrt {4\left( {10} \right) - {{\left( 6 \right)}^2}} }}\\ &= 0\end{aligned}\)

Thus, the correlation coefficient between x and y for women is 0.

To calculate the correlation coefficient between x and y for men, consider the following calculations:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

9

9

81

81

81

9

10

81

100

90

10

9

100

81

90

10

10

100

100

100

\(\sum x = 38\)\(\)

\(\sum y = 38\)

\(\sum {x^2} = 362\)

\(\sum {y^2} = 362\)

\(\sum xy = 361\)

The formula to calculate correlation coefficient r is given by

\(\begin{aligned} r &= \frac{{n\sum xy - \left( {\sum x} \right)\left( {\sum y} \right)}}{{\sqrt {n\left( {\sum {x^2}} \right) - {{\left( {\sum x} \right)}^2}} \sqrt {n\left( {\sum {y^2}} \right) - {{\left( {\sum y} \right)}^2}} }}\\ &= \frac{{4\left( {361} \right) - \left( {38} \right)\left( {38} \right)}}{{\sqrt {4\left( {362} \right) - {{\left( {38} \right)}^2}} \sqrt {4\left( {362} \right) - {{\left( {38} \right)}^2}} }}\\ &= 0\end{aligned}\)

Thus, the correlation coefficient between x and y for men is 0.

04

Computation of the correlation coefficient with all 8 points

d.

To compute the correlation between all the 8 points, consider the following calculations:

x

Y

\({x^2}\)

\({y^2}\)

\(xy\)

1

1

1

1

1

1

2

1

4

2

2

1

4

1

2

2

2

4

4

4

9

9

81

81

81

9

10

81

100

90

107

9

100

81

90

10

10

100

100

100

\(\sum x = 44\)\(\)

\(\sum y = 44\)

\(\sum {x^2} = 372\)

\(\sum {y^2} = 372\)

\(\sum xy = 370\)

The correlation coefficient is equal to:

\(\begin{aligned} r &= \frac{{n\sum xy - \left( {\sum x} \right)\left( {\sum y} \right)}}{{\sqrt {n\left( {\sum {x^2}} \right) - {{\left( {\sum x} \right)}^2}} \sqrt {n\left( {\sum {y^2}} \right) - {{\left( {\sum y} \right)}^2}} }}\\ &= \frac{{8\left( {370} \right) - \left( {44} \right)\left( {44} \right)}}{{\sqrt {8\left( {372} \right) - {{\left( {44} \right)}^2}} \sqrt {8\left( {372} \right) - {{\left( {44} \right)}^2}} }}\\ &= 0.985\end{aligned}\)

Thus, the correlation between x and y when all the 8 points are taken into account is equal to 0.985.

05

Analysis of the correlation

e.

All of the points corresponding to women lie on the lower-left corner while all of the points corresponding to men lie on the upper-right corner.

There seems to be no association/mix-up of the two sets of values.

Therefore, the points corresponding to women and menappear to represent two different and distinct populations and should be analyzed separately.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed duration and interval after times, find the best predicted “interval after” time for an eruption with a duration of 253 seconds. How does it compare to an actual eruption with a duration of 253 seconds and an interval after time of 83 minutes?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Tips Listed below are amounts of bills for dinner and the amounts of the tips that were left. The data were collected by students of the author. Is there sufficient evidence to conclude that there is a linear correlation between the bill amounts and the tip amounts? If everyone were to tip with the same percentage, what should be the value of r?

Bill(dollars)

33.46

50.68

87.92

98.84

63.6

107.34

Tip(dollars)

5.5

5

8.08

17

12

16

Cigarette Tar and Nicotine The table below lists measured amounts (mg) of tar, carbonmonoxide (CO), and nicotine in king size cigarettes of different brands (from Data Set 13“Cigarette Contents” in Appendix B).

a. Is there is sufficient evidence to support a claim of a linear correlation between tar and nicotine?

b. What percentage of the variation in nicotine can be explained by the linear correlation between nicotine and tar?

c. Letting yrepresent the amount of nicotine and letting xrepresent the amount of tar, identify the regression equation.

d. The Raleigh brand king size cigarette is not included in the table, and it has 23 mg of tar. What is the best predicted amount of nicotine? How does the predicted amount compare to the actual amount of 1.3 mg of nicotine?

Tar

25

27

20

24

20

20

21

24

CO

18

16

16

16

16

16

14

17

Nicotine

1.5

1.7

1.1

1.6

1.1

1.0

1.2

1.4

Effects of an Outlier Refer to the Minitab-generated scatterplot given in Exercise 11 of

Section 10-1 on page 485.

a. Using the pairs of values for all 10 points, find the equation of the regression line.

b. After removing the point with coordinates (10, 10), use the pairs of values for the remaining 9 points and find the equation of the regression line.

c. Compare the results from parts (a) and (b).

In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the

StatCrunch display and answer the given questions or identify the indicated items.

The display is based on Data Set 5 “Family Heights” in Appendix B.

Identify the following:

a. The P-value corresponding to the overall significance of the multiple regression equation

b. The value of the multiple coefficient of determination\({R^2}\).

c. The adjusted value of \({R^2}\)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free