Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Weighing Seals with a Camera Listed below are the overhead widths (cm) of seals

measured from photographs and the weights (kg) of the seals (based on “Mass Estimation of Weddell Seals Using Techniques of Photogrammetry,” by R. Garrott of Montana State University). The purpose of the study was to determine if weights of seals could be determined from overhead photographs. Is there sufficient evidence to conclude that there is a linear correlation between overhead widths of seals from photographs and the weights of the seals?

Overhead Width

7.2

7.4

9.8

9.4

8.8

8.4

Weight

116

154

245

202

200

191

Short Answer

Expert verified

The scatterplot is shown below:

The value of the correlation coefficient is 0.948.

The p-value is 0.004.

There is enough evidence to support the claim that there is linear correlation between overhead width and weight.

Step by step solution

01

Given information

The data for overhead width and weights are recorded as shown below:

Overhead Width

Weight

7.2

116

7.4

154

9.8

245

9.4

202

8.8

200

8.4

191

02

Sketch a scatterplot

A graph thatdenotes a paired set of observations in a plotcan be used to analyze the trend between two variables.

Steps to sketch a scatterplot:

  1. Formthe x and y axes for overhead width and weight, respectively.
  2. Mark the points as coordinates on the graph.

The graph formed is shown below.

03

Compute the measure of the correlation coefficient

The correlation coefficient formula is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let x be the overhead width and y be the weight.

The valuesare listed in the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

7.2

116

51.84

13456

835.2

7.4

154

54.76

23716

1139.6

9.8

245

96.04

60025

2401

9.4

202

88.36

40804

1898.8

8.8

200

77.44

40000

1760

8.4

191

70.56

36481

1604.4

\(\sum x = 51\)

\(\sum y = 1108\)

\(\sum {{x^2}} = 439\)

\(\sum {{y^2} = } \;214482\)

\(\sum {xy\; = \;} 9639\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{6\left( {9639} \right) - \left( {51} \right)\left( {1108} \right)}}{{\sqrt {6\left( {439} \right) - {{\left( {51} \right)}^2}} \sqrt {6\left( {214482} \right) - {{\left( {1108} \right)}^2}} }}\\ &= 0.948\end{aligned}\)

Thus, the correlation coefficient is 0.948.

04

Step 4:Conduct a hypothesis test for correlation

Definethe true measure of the correlation coefficientas\(\rho \).

For testing the claim, form the hypotheses.

\(\begin{array}{l}{H_o}:\rho = 0\\{H_a}:\rho \ne 0\end{array}\)

The samplesize is6(n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.948}}{{\sqrt {\frac{{1 - {{\left( {0.948} \right)}^2}}}{{6 - 2}}} }}\\ &= 5.957\end{aligned}\)

Thus, the test statistic is 5.957.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 6 - 2\\ &= 4.\end{aligned}\)

Thep-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > 5.957} \right)\\ &= 0.0039\\ &\approx 0.004\end{aligned}\)

Thus, the p-value is 0.004.

Since thep-value is lesser than 0.05, the null hypothesis is rejected.

Therefore, there is enough evidence to conclude the existence of a linear correlation between the two variables.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Oscars Listed below are ages of Oscar winners matched by the years in which the awards were won (from Data Set 14 “Oscar Winner Age” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between the ages of Best Actresses and Best Actors? Should we expect that there would be a correlation?

Actress

28

30

29

61

32

33

45

29

62

22

44

54

Actor

43

37

38

45

50

48

60

50

39

55

44

33

Interpreting the Coefficient of Determination. In Exercises 5–8, use the value of the linear correlation coefficient r to find the coefficient of determination and the percentage of the total variation that can be explained by the linear relationship between the two variables.

Weight , Waist r = 0.885 (x = weight of male, y = waist size of male)

In Exercises 9 and 10, use the given data to find the equation of the regression line. Examine the scatterplot and identify a characteristic of the data that is ignored by the regression line.

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fontanels) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain intensity before Duragestic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6

Pain intensity after Duragestic Treatment

0.4

1.4

1.8

2.9

6.0

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8.0

6.8

2.3

0.4

0.7

1.2

4.5

2.0

1.6

2.0

2.0

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1

Regression:Use the given data to find the equation of the regression line. Let the response (y) variable be the pain intensity after treatment. What would be the equation of the regression line for a treatment having absolutely no effect?

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

Enrollment (thousands)

53

28

27

36

42

Burglaries

86

57

32

131

157

True or false: If the sample data lead us to the conclusion that there is sufficient evidence to support the claim of a linear correlation between enrollment and number of burglaries, then we could also conclude that higher enrollments cause increases in numbers of burglaries.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free