Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CSI Statistics Police sometimes measure shoe prints at crime scenes so that they can learn something about criminals. Listed below are shoe print lengths, foot lengths, and heights of males (from Data Set 2 “Foot and Height” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between shoe print lengths and heights of males? Based on these results, does it appear that police can use a shoe print length to estimate the height of a male?

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

Short Answer

Expert verified

The scatterplot is shown below:

The value of the correlation coefficient is 0.591.

The p-value is 0.294.

There is not enough evidence to support the claim that there is a linear correlation between the two variables.

As the scatterplot only reveals an upward trend and no specific association, the shoe print length cannot be used for estimating the height ofmales.

Step by step solution

01

Given information

The data for shoe print lengths, foot lengths, and heights of males is recorded.

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

02

Sketch a scatterplot

A two-dimensional plot based on a paired data plotted with reference to two axes is called a scatterplot.

Steps to sketch a scatterplot:

  1. Mark horizontal axis for shoe print lengthand vertical axis for the height of males.
  2. Mark points for the paired observations corresponding to both axes.

The resultant graph is the scatterplot.

03

Compute the measure of the correlation coefficient

The formula for the correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let shoe print be defined by variable x and the height of males be defined by variable y.

The valuesare listed in the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

29.7

175.3

882.09

30730.09

5206.41

29.7

177.8

882.09

31612.84

5280.66

31.4

185.4

985.96

34373.16

5821.56

31.8

175.3

1011.24

30730.09

5574.54

27.6

172.7

761.76

29825.29

4766.52

\(\sum x = 150.2\)

\(\sum y = 886.5\)

\(\sum {{x^2}} = 4523.14\)

\(\sum {{y^2} = } \;157271.5\)

\(\sum {xy\; = \;} 26649.69\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{5\left( {26649.69} \right) - \left( {150.2} \right)\left( {886.5} \right)}}{{\sqrt {5\left( {157271.5} \right) - {{\left( {150.2} \right)}^2}} \sqrt {5\left( {4523.14} \right) - {{\left( {886.5} \right)}^2}} }}\\ &= 0.591\end{aligned}\)

Thus, the correlation coefficient is 0.591.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)as the actual value of the correlation coefficient for shoe print length and the height of males.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{{\rm{H}}_{\rm{o}}}:\rho = 0\\{{\rm{{\rm H}}}_{\rm{a}}}:\rho \ne 0\end{array}\)

The samplesize is 5 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ = \frac{{0.591}}{{\sqrt {\frac{{1 - {{0.591}^2}}}{{5 - 2}}} }}\\ &= 1.27\end{aligned}\)

Thus, the test statistic is 1.270.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 5 - 2\\ &= 3.\end{aligned}\)

Thep-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > t} \right)\\ &= 2P\left( {T > 1.27} \right)\\ &= 2\left( {1 - P\left( {T < 1.27} \right)} \right)\\ &= 0.294\end{aligned}\)

Thus, the p-value is 0.294.

Since the p-value is greater than 0.05, the null hypothesis fails to berejected.

Therefore, there is not enough evidence to conclude that the variables shoe print length and height have a linear correlation between them.

05

Analyze if the shoe print length can help predict the height of males

Since the two variables are not linearly correlated to one other, one variable cannot be used to estimate the other.

Also, the scatterplot reveals no specific pattern between shoe print length and height of men (linear or non-linear).Thus, the two variables are not associated with one other.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Finding a Prediction Interval. In Exercises 13–16, use the paired data consisting of registered Florida boats (tens of thousands) and manatee fatalities from boat encounters listed in Data Set 10 “Manatee Deaths” in Appendix B. Let x represent number of registered boats and let y represent the corresponding number of manatee deaths. Use the given number of registered boats and the given confidence level to construct a prediction interval estimate of manatee deaths.

Boats Use x = 85 (for 850,000 registered boats) with a 99% confidence level.

In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi , gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi , gal).

If exactly two predictor (x) variables are to be used to predict the city fuel consumption, which two variables should be chosen? Why?

In Exercises 5–8, use a significance level of A = 0.05 and refer to the

accompanying displays.

Casino Size and Revenue The New York Times published the sizes (square feet) and revenues (dollars) of seven different casinos in Atlantic City. Is there sufficient evidence to support the claim that there is a linear correlation between size and revenue? Do the results suggest that a casino can increase its revenue by expanding its size?

Ages of Moviegoers Based on the data from Cumulative Review Exercise 7, assume that ages of moviegoers are normally distributed with a mean of 35 years and a standard deviation of 20 years.

a. What is the percentage of moviegoers who are younger than 30 years of age?

b. Find\({P_{25}}\), which is the 25th percentile.

c. Find the probability that a simple random sample of 25 moviegoers has a mean age that is less than 30 years.

d. Find the probability that for a simple random sample of 25 moviegoers, each of the moviegoers is younger than 30 years of age. For a particular movie and showtime, why might it not be unusual to have 25 moviegoers all under the age of 30?

Stocks and Sunspots. Listed below are annual high values of the Dow Jones Industrial Average (DJIA) and annual mean sunspot numbers for eight recent years. Use the data for Exercises 1–5. A sunspot number is a measure of sunspots or groups of sunspots on the surface of the sun. The DJIA is a commonly used index that is a weighted mean calculated from different stock values.

DJIA

14,198

13,338

10,606

11,625

12,929

13,589

16,577

18,054

Sunspot

Number

7.5

2.9

3.1

16.5

55.7

57.6

64.7

79.3

Confidence Interval Construct a 95% confidence interval estimate of the mean sunspot number. Write a brief statement interpreting the confidence interval.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free