Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Manatees Listed below are numbers of registered pleasure boats in Florida (tens of thousands) and the numbers of manatee fatalities from encounters with boats in Florida for each of several recent years. The values are from Data Set 10 “Manatee Deaths” in Appendix B. Is there sufficient evidence to conclude that there is a linear correlation between numbers of registered pleasure boats and numbers of manatee boat fatalities?

Pleasure Boats

99

99

97

95

90

90

87

90

90

Manatee Fatalities

92

73

90

97

83

88

81

73

68

Short Answer

Expert verified

The scatterplot is shown below:

The value of the correlation coefficient is 0.341.

The p-value is 0.369.

There is not enough evidence to support the claim that there existsa linear correlation betweenthe counts of pleasure boats and manatee fatalities.

Step by step solution

01

Given information

The data is listed for the number of registered pleasure boats and manatee fatalities in Florida.

Pleasure Boats

Manatee Fatalities

99

92

99

73

97

90

95

97

90

83

90

88

87

81

90

73

90

68

02

Sketch a scatterplot

A scatterplot determines the pattern in any paired set of data.

Steps to sketch a scatterplot:

  1. Mark the two variables as pleasure boats and manatee fatalities on x and y-axis, respectively.
  2. Mark the observations in a paired form on the graph.

The scatterplotis shown below.

03

Compute the measure of the correlation coefficient

The correlation coefficient formula is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let variable x be pleasure boats counts and y be the count of manatee fatalities.

The valuesare listedin the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

99

92

9801

8464

9108

99

73

9801

5329

7227

97

90

9409

8100

8730

95

97

9025

9409

9215

90

83

8100

6889

7470

90

88

8100

7744

7920

87

81

7569

6561

7047

90

73

8100

5329

6570

90

68

8100

4624

6120

\(\sum x = 837\)

\(\sum y = 745\)

\(\sum {{x^2}} = 78005\)

\(\sum {{y^2} = } \;62449\)

\(\sum {xy\; = \;} 69407\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{9\left( {69407} \right) - \left( {837} \right)\left( {745} \right)}}{{\sqrt {9\left( {78005} \right) - {{\left( {837} \right)}^2}} \sqrt {9\left( {62449} \right) - {{\left( {745} \right)}^2}} }}\\ &= 0.341\end{aligned}\)

Thus, the correlation coefficient is 0.341.

04

Step 4:Conduct a hypothesis test for correlation

Definemeasure\(\rho \)as the true correlation measure for the two variables.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{H_o}:\rho = 0\\{H_a}:\rho \ne 0\end{array}\)

The samplesize is9(n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.341}}{{\sqrt {\frac{{1 - {{\left( {0.341} \right)}^2}}}{{9 - 2}}} }}\\ &= 0.9597\\ &\approx 0.960\end{aligned}\)

Thus, the test statistic is 0.960.

The degree of freedom is calculated below:

\(\begin{aligned} df &= n - 2\\ &= 9 - 2\\ &= 7\end{aligned}\)

The p-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > 0.960} \right)\\ &= 0.369\end{aligned}\)

Thus, the p-value is 0.369.

Since thep-value is greater than 0.05, the null hypothesis fails to be rejected.

Therefore, there is not sufficient evidence to conclude the existence of a linear correlation between the counts of pleasure boats and manatee boat fatalities.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Stocks and Sunspots. Listed below are annual high values of the Dow Jones Industrial Average (DJIA) and annual mean sunspot numbers for eight recent years. Use the data for Exercises 1–5. A sunspot number is a measure of sunspots or groups of sunspots on the surface of the sun. The DJIA is a commonly used index that is a weighted mean calculated from different stock values.

DJIA

14,198

13,338

10,606

11,625

12,929

13,589

16,577

18,054

Sunspot

Number

7.5

2.9

3.1

16.5

55.7

57.6

64.7

79.3

Confidence Interval Construct a 95% confidence interval estimate of the mean sunspot number. Write a brief statement interpreting the confidence interval.

Ages of MoviegoersThe table below shows the distribution of the ages of moviegoers(based on data from the Motion Picture Association of America). Use the data to estimate themean, standard deviation, and variance of ages of moviegoers.Hint:For the open-ended categoryof “60 and older,” assume that the category is actually 60–80.

Age

2-11

12-17

18-24

25-39

40-49

50-59

60 and older

Percent

7

15

19

19

15

11

14

In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the

StatCrunch display and answer the given questions or identify the indicated items.

The display is based on Data Set 5 “Family Heights” in Appendix B.

Should the multiple regression equation be used for predicting the height of a son based on the height of his father and mother? Why or why not?

Cell Phones and Driving In the author’s home town of Madison, CT, there were 2733 police traffic stops in a recent year, and 7% of them were attributable to improper use of cell phones. Use a 0.05 significance level to test the claim that the sample is from a population in which fewer than 10% of police traffic stops are attributable to improper cell phone use.

In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the

StatCrunch display and answer the given questions or identify the indicated items.

The display is based on Data Set 5 “Family Heights” in Appendix B.

Identify the following:

a. The P-value corresponding to the overall significance of the multiple regression equation

b. The value of the multiple coefficient of determination\({R^2}\).

c. The adjusted value of \({R^2}\)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free