Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CPI and the Subway Use CPI>subway data from the preceding exercise to determine whether there is a significant linear correlation between the CPI (Consumer Price Index) and the subway fare.

Short Answer

Expert verified

The scatter plot is shown below:

The value of the correlation coefficient is 0.973.

The p-value is 0.000.

There is enough evidence to support the claim that there is a linear correlation between the two variables (CPI and subway fare).

Step by step solution

01

Given information

Refer to Exercise 15 for the data.

Year

1960

1973

1986

1995

2002

2003

2009

2013

2015

Pizza Cost

0.15

0.35

1

1.25

1.75

2

2.25

2.3

2.75

Subway Fare

0.15

0.35

1

1.35

1.5

2

2.25

2.5

2.75

CPI

30.2

48.3

112.3

162.2

191.9

197.8

214.5

233

237.2

02

Sketch a scatterplot

A scatterplot hasdots torepresent paired observations of a dataset projected corresponding to theaxes scaled for two variables.

Steps to sketch a scatterplot:

  1. Mark horizontal axis for CPI and vertical axis for subway fare.
  2. Mark the points ofobservations corresponding to each axis.
  3. The resultant graph is the scatterplot.

03

Compute the measure of the correlation coefficient

The formula for correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let CPI be defined by variable x and subway fare be defined by variable y.

The valuesare listedin the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

30.2

0.15

912.04

0.0225

4.53

48.3

0.35

2332.9

0.1225

16.905

112.3

1

12611

1

112.3

162.2

1.35

26309

1.8225

218.97

191.9

1.5

36826

2.25

287.85

197.8

2

39125

4

395.6

214.5

2.25

46010

5.0625

482.63

233

2.5

54289

6.25

582.5

237.2

2.75

56264

7.5625

652.3

\(\sum x = 1427.4\)

\(\sum y = 13.85\)

\(\sum {{x^2}} = 274678.6\)

\(\sum {{y^2} = } 28.0925\)

\(\sum {xy\; = \;} 2753.58\)

Substitute the values in the formula:

\(\begin{aligned}{c}r &= \frac{{9\left( {2753.58} \right) - \left( {1427.4} \right)\left( {13.85} \right)}}{{\sqrt {9\left( {274678.6} \right) - {{\left( {1427.4} \right)}^2}} \sqrt {9\left( {28.0925} \right) - {{\left( {13.85} \right)}^2}} }}\\ &= 0.973\end{aligned}\)

Thus, the correlation coefficient is 0.973.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)as the actual value of thecorrelation coefficient for pizza cost and subway fare.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{{\rm{H}}_{\rm{o}}}:\rho = 0\\{{\rm{{\rm H}}}_{\rm{a}}}:\rho \ne 0\end{array}\)

The samplesize is9 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.973}}{{\sqrt {\frac{{1 - {{0.973}^2}}}{{9 - 2}}} }}\\ &= 11.154\end{aligned}\)

Thus, the test statistic is 11.154

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 9 - 2\\ &= 7.\end{aligned}\)

The p-value is computed from the t-distribution table.

\(\begin{aligned}{c}p{\rm{ - value}} &= 2P\left( {T > t} \right)\\ &= 2P\left( {T > 11.154} \right)\\ &= 2\left( {1 - P\left( {T < 11.154} \right)} \right)\\ &= 0.000\end{aligned}\)

Thus, the p-value is 0.000.

Since the p-value is less than 0.05, the null hypothesis is rejected.

Therefore, there is enough evidence to conclude that the variables CPI and subway fare have a linear correlation between them.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

If you had computed the value of the linear correlation coefficient to be 1.500, what should you conclude?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Oscars Listed below are ages of Oscar winners matched by the years in which the awards were won (from Data Set 14 “Oscar Winner Age” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between the ages of Best Actresses and Best Actors? Should we expect that there would be a correlation?

Actress

28

30

29

61

32

33

45

29

62

22

44

54

Actor

43

37

38

45

50

48

60

50

39

55

44

33

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

Repeat the preceding exercise, assuming that the linear correlation coefficient is r= 0.997.

In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi, gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi, gal).

A Honda Civic weighs 2740 lb, it has an engine displacement of 1.8 L, and its highway fuel consumption is 36 mi/gal. What is the best predicted value of the city fuel consumption? Is that predicted value likely to be a good estimate? Is that predicted value likely to be very accurate?

\({s_e}\)Notation Using Data Set 1 “Body Data” in Appendix B, if we let the predictor variable x represent heights of males and let the response variable y represent weights of males, the sample of 153 heights and weights results in\({s_e}\)= 16.27555 cm. In your own words, describe what that value of \({s_e}\)represents.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free