Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Old Faithful Listed below are duration times (seconds) and time intervals (min) to the next eruption for randomly selected eruptions of the Old Faithful geyser in Yellowstone National Park. Is there sufficient evidence to conclude that there is a linear correlation between duration times and interval after times?

Duration

242

255

227

251

262

207

140

Interval After

91

81

91

92

102

94

91

Short Answer

Expert verified

The scatter plot is:

The value of the correlation coefficient is 0.046.

The p-value is 0.921.

There is not enough evidence to support the claim that there is a linear correlation between the two variables.

Step by step solution

01

Given information

The data is recorded for two variables: duration in seconds and time intervals in minutes for the next eruption of a geyser.

Duration

Interval After

242

91

255

81

227

91

251

92

262

102

207

94

140

91

02

Sketch a scatterplot

A scatterplot is a graph oftwo variables that havepaired values. Each variable is scaled on one axis.

Steps to sketch a scatterplot:

  1. Mark two axes, xand y,for duration and interval after, respectively.
  2. Mark the paired data values on the graph corresponding to the axes.

The resultant graph is shown below.

03

Compute the measure of correlation coefficient

The formula for the correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let the duration be variable x and theinterval after be variable y.

The valuesare listedin the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

242

91

58564

8281

22022

255

81

65025

6561

20655

227

91

51529

8281

20657

251

92

63001

8464

23092

262

102

68644

10404

26724

207

94

42849

8836

19458

140

91

19600

8281

12740

\(\sum x = 1584\)

\(\sum y = 642\)

\(\sum {{x^2}} = 369212\)

\(\sum {{y^2} = } \;59108\)

\(\sum {xy\; = \;} 145348\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{7\left( {145348} \right) - \left( {1584} \right)\left( {642} \right)}}{{\sqrt {7\left( {369212} \right) - {{\left( {1584} \right)}^2}} \sqrt {7\left( {59108} \right) - {{\left( {642} \right)}^2}} }}\\ &= 0.046\end{aligned}\)

Thus, the correlation coefficient is 0.046.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)as the true measure ofthe correlation coefficient for the two variables.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{{\rm{H}}_{\rm{o}}}:\rho = 0\\{{\rm{{\rm H}}}_{\rm{a}}}:\rho \ne 0\end{array}\)

The samplesize is7 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.046}}{{\sqrt {\frac{{1 - {{0.046}^2}}}{{7 - 2}}} }}\\ &= 0.103\end{aligned}\)

Thus, the test statistic is 0.103.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 7 - 2\\ &= 5.\end{aligned}\)

The p-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > t} \right)\\ &= 2P\left( {T > 0.103} \right)\\ &= 2\left( {1 - P\left( {T < 0.103} \right)} \right)\\ &= 0.921\end{aligned}\)

Thus, the p-value is 0.921.

Since the p-value is greater than 0.05, the null hypothesis fails to be rejected.

Therefore, there is not enough evidence to conclude that variables x(duration) and y (interval after) have a linear correlation between them.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fontanels) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain intensity before Duragestic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6

Pain intensity after Duragestic Treatment

0.4

1.4

1.8

2.9

6.0

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8.0

6.8

2.3

0.4

0.7

1.2

4.5

2.0

1.6

2.0

2.0

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1

Regression:Use the given data to find the equation of the regression line. Let the response (y) variable be the pain intensity after treatment. What would be the equation of the regression line for a treatment having absolutely no effect?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Weighing Seals with a Camera Listed below are the overhead widths (cm) of seals

measured from photographs and the weights (kg) of the seals (based on “Mass Estimation of Weddell Seals Using Techniques of Photogrammetry,” by R. Garrott of Montana State University). The purpose of the study was to determine if weights of seals could be determined from overhead photographs. Is there sufficient evidence to conclude that there is a linear correlation between overhead widths of seals from photographs and the weights of the seals?

Overhead Width

7.2

7.4

9.8

9.4

8.8

8.4

Weight

116

154

245

202

200

191

Interpreting a Computer Display. In Exercises 9–12, refer to the display obtained by using the paired data consisting of Florida registered boats (tens of thousands) and numbers of manatee deaths from encounters with boats in Florida for different recent years (from Data Set 10 in Appendix B). Along with the paired boat, manatee sample data, StatCrunch was also given the value of 85 (tens of thousands) boats to be used for predicting manatee fatalities.

Predicting Manatee Fatalities Using x = 85 (for 850,000 registered boats), what is the single value that is the best predicted number of manatee fatalities resulting from encounters with boats?

Ages of Moviegoers Based on the data from Cumulative Review Exercise 7, assume that ages of moviegoers are normally distributed with a mean of 35 years and a standard deviation of 20 years.

a. What is the percentage of moviegoers who are younger than 30 years of age?

b. Find\({P_{25}}\), which is the 25th percentile.

c. Find the probability that a simple random sample of 25 moviegoers has a mean age that is less than 30 years.

d. Find the probability that for a simple random sample of 25 moviegoers, each of the moviegoers is younger than 30 years of age. For a particular movie and showtime, why might it not be unusual to have 25 moviegoers all under the age of 30?

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

Repeat the preceding exercise, assuming that the linear correlation coefficient is r= 0.997.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free