Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Sports Diameters (cm), circumferences (cm), and volumes (cm3) from balls used in different sports are listed in the table below. Is there sufficient evidence to conclude that there is a linear correlation between diameters and circumferences? Does the scatterplot confirm a linear association?


Diameter

Circumference

Volume

Baseball

7.4

23.2

212.2

Basketball

23.9

75.1

7148.1

Golf

4.3

13.5

41.6

Soccer

21.8

68.5

5424.6

Tennis

7

22

179.6

Ping-Pong

4

12.6

33.5

Volleyball

20.9

65.7

4780.1

Softball

9.7

30.5

477.9

Short Answer

Expert verified

The scatterplot is drawn below:

The value ofthe correlation coefficient is approximately1.000.

The p-value is 0.000.

There is sufficient evidence to support the existence of a linear correlation between the diameter and circumference.

Yes, the scatterplot reveals a straight-line pattern, which implies a strong linear association between the variables.

Step by step solution

01

Given information

The data for diameter and circumference is given.

Diameter

Circumference

7.4

23.2

23.9

75.1

4.3

13.5

21.8

68.5

7

22

4

12.6

20.9

65.7

9.7

30.5

02

Sketch a scatterplot

A scatterplot is used for determining a relationship between a pair of variables by tracing the paired values onto a graph.

Steps to sketch a scatterplot:

  1. Mark diameter on the x-axisand circumference on they-axis.
  2. Draw dots for each paired observation.

The resultantscatterplotis shown below.

03

Compute the measure of the correlation coefficient

The formula for correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Define x as the diameter and y as the circumference.

The valuesare tabulatedbelow:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

7.4

23.2

54.76

538.24

171.68

23.9

75.1

571.21

5640.01

1794.89

4.3

13.5

18.49

182.25

58.05

21.8

68.5

475.24

4692.25

1493.3

7

22

49

484

154

4

12.6

16

158.76

50.4

20.9

65.7

436.81

4316.49

1373.13

9.7

30.5

94.09

930.25

295.85

\(\sum x = 99\)

\(\sum y = 311.1\)

\(\sum {{x^2}} = 1715.6\)

\(\sum {{y^2} = } \;16942.25\)

\(\sum {xy\; = \;} 5391.3\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{8\left( {5391.3} \right) - \left( {99} \right)\left( {311.1} \right)}}{{\sqrt {8\left( {1715.6} \right) - {{\left( {99} \right)}^2}} \sqrt {8\left( {16942.25} \right) - {{\left( {311.1} \right)}^2}} }}\\ &= 0.999999\\ &\approx 1.000\end{aligned}\)

Thus, the correlation coefficient is approximately 1.000.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)asbe the measure of the linear correlation coefficient for diameter and circumference.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{H_o}:\rho = 0\\{H_a}:\rho \ne 0\end{array}\)

The samplesize is8(n).

The test statistic is calculated below:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.999999}}{{\sqrt {\frac{{1 - {{\left( {0.999999} \right)}^2}}}{{8 - 2}}} }}\\ &= 1732.05\end{aligned}\)

Thus, the test statistic is1732.05.

The degree of freedom iscalculated below:

\(\begin{aligned} df &= n - 2\\ &= 8 - 2\\ &= 6\end{aligned}\)

The p-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > 21732.05} \right)\\ &= 0.000\end{aligned}\)

Thus, the p-value is 0.000.

Since the p-value is lesser than 0.05, the null hypothesis is rejected.

Therefore, there is sufficient evidence to support the existence of a linear correlation between diameter and circumference.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed duration and interval after times, find the best predicted “interval after” time for an eruption with a duration of 253 seconds. How does it compare to an actual eruption with a duration of 253 seconds and an interval after time of 83 minutes?

let the predictor variable x be the first variable given. Use the given data to find the regression equation and the best predicted value of the response variable. Be sure to follow the prediction procedure summarized in Figure 10-5 on page 493. Use a 0.05 significance level.

For 50 randomly selected speed dates, attractiveness ratings by males of their

female date partners (x) are recorded along with the attractiveness ratings by females of their male date partners (y); the ratings are from Data Set 18 “Speed Dating” in Appendix B. The 50 paired ratings yield\(\bar x = 6.5\),\(\bar y = 5.9\), r= -0.277, P-value = 0.051, and\(\hat y = 8.18 - 0.345x\). Find the best predicted value of\(\hat y\)(attractiveness rating by female of male) for a date in which the attractiveness rating by the male of the female is x= 8.

Explore! Exercises 9 and 10 provide two data sets from “Graphs in Statistical Analysis,” by F. J. Anscombe, the American Statistician, Vol. 27. For each exercise,

a. Construct a scatterplot.

b. Find the value of the linear correlation coefficient r, then determine whether there is sufficient evidence to support the claim of a linear correlation between the two variables.

c. Identify the feature of the data that would be missed if part (b) was completed without constructing the scatterplot.

x

10

8

13

9

11

14

6

4

12

7

5

y

9.14

8.14

8.74

8.77

9.26

8.10

6.13

3.10

9.13

7.26

4.74

Interpreting\({R^2}\)In Exercise 2, the quadratic model results in = 0.255. Identify the percentage of the variation in Super Bowl points that can be explained by the quadratic model relating the variable of year and the variable of points scored. (Hint: See Example 2.) What does the result suggest about the usefulness of the quadratic model?

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fentanyl) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain Intensity Before Duragesic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6










Pain Intensity After Duragesic Treatment

0.4

1.4

1.8

2.9

6

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8

6.8

2.3

0.4

0.7

1.2

4.5

2

1.6

2

2

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1










Two Independent Samples The methods of Section 9-2 can be used to test the claim that two populations have the same mean. Identify the specific claim that the treatment is effective, then use the methods of Section 9-2 to test that claim. The methods of Section 9-2 are based on the requirement that the samples are independent. Are they independent in this case?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free