Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Oscars Listed below are ages of Oscar winners matched by the years in which the awards were won (from Data Set 14 “Oscar Winner Age” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between the ages of Best Actresses and Best Actors? Should we expect that there would be a correlation?

Actress

28

30

29

61

32

33

45

29

62

22

44

54

Actor

43

37

38

45

50

48

60

50

39

55

44

33

Short Answer

Expert verified

The scatter plot is shown below:

The value of the correlation coefficient is –0.288.

The p-value is 0.364.

There is not enough evidence to support the claim that there existsa linear correlation between the ages of actresses and actors.

No correlation is expected between the ages of best actressesand actors in Oscars.

Step by step solution

01

Given information

The data is recorded for the ages of best actresses and actorsin Oscars.

Best Actress

Best Actor

28

43

30

37

29

38

61

45

32

50

33

48

45

60

29

50

62

39

22

55

44

44

54

33

02

Sketch a scatterplot

A scatterplot is used to reveal the pattern of association between two variables.

Steps to sketch a scatterplot:

  1. Define two axesfor the ages of actors and actresseseach year.
  2. Mark each paired set of values on the graph.

The resultant scatterplot is shown below.

03

Compute the measure of the correlation coefficient

The correlation coefficient formula is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

For the variable of age for best actress (x) and age of best actor (y), compute the correlation coefficient.

The valuesare listedin the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

28

43

784

1849

1204

30

37

900

1369

1110

29

38

841

1444

1102

61

45

3721

2025

2745

32

50

1024

2500

1600

33

48

1089

2304

1584

45

60

2025

3600

2700

29

50

841

2500

1450

62

39

3844

1521

2418

22

55

484

3025

1210

44

44

1936

1936

1936

54

33

2916

1089

1782

\(\sum x = 469\)

\(\sum y = 542\)

\(\sum {{x^2}} = 20405\)

\(\sum {{y^2} = } \;25162\)

\(\sum {xy\; = \;} 20841\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{12\left( {20841} \right) - \left( {469} \right)\left( {542} \right)}}{{\sqrt {12\left( {20405} \right) - {{\left( {469} \right)}^2}} \sqrt {12\left( {25162} \right) - {{\left( {542} \right)}^2}} }}\\ &= - 0.288\end{aligned}\)

Thus, the correlation coefficient is –0.288.

04

Step 4:Conduct a hypothesis test for correlation

Definethe actual measure of the correlation coefficient between the age of best actress and actor in Oscars as\(\rho \).

For testing the claim, form the hypotheses:

\(\begin{array}{l}{H_o}:\rho = 0\\{H_a}:\rho \ne 0\end{array}\)

The samplesize is 12 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{ - 0.288}}{{\sqrt {\frac{{1 - {{\left( { - 0.288} \right)}^2}}}{{12 - 2}}} }}\\ &= - 0.951\end{aligned}\)

Thus, the test statistic is–0.951.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 12 - 2\\ &= 10.\end{aligned}\)

The p-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T < - 0.951} \right)\\ &= 0.364\end{aligned}\)

Thus, the p-value is 0.364.

Since the p-value is greater than 0.05, the null hypothes is fails to berejected.

Therefore, there is not enough evidence to conclude that there is no linear correlation between the age of actors and actresses.

05

Discuss if there exists a correlation between the ages of actors and actresses

It was not expected that actors and actresses have acorrelation between their ages.They can beparts of different movies.There is no clear logic for any association between their ages.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Interpreting\({R^2}\)In Exercise 2, the quadratic model results in = 0.255. Identify the percentage of the variation in Super Bowl points that can be explained by the quadratic model relating the variable of year and the variable of points scored. (Hint: See Example 2.) What does the result suggest about the usefulness of the quadratic model?

In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the

StatCrunch display and answer the given questions or identify the indicated items.

The display is based on Data Set 5 “Family Heights” in Appendix B.

Identify the following:

a. The P-value corresponding to the overall significance of the multiple regression equation

b. The value of the multiple coefficient of determination\({R^2}\).

c. The adjusted value of \({R^2}\)

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Internet and Nobel Laureates Find the best predicted Nobel Laureate rate for Japan, which has 79.1 Internet users per 100 people. How does it compare to Japan’s Nobel Laureate rate of 1.5 per 10 million people?

Confidence Intervals for a Regression Coefficients A confidence interval for the regression coefficient b1 is expressed

\(\begin{array}{l}{b_1} - E < {\beta _1} < {b_1} + E\\\end{array}\)

Where

\(E = {t_{\frac{\alpha }{2}}}{s_{{b_1}}}\)

The critical t score is found using n –(k+1) degrees of freedom, where k, n, and sb1 are described in Exercise 17. Using the sample data from Example 1, n = 153 and k = 2, so df = 150 and the critical t scores are \( \pm \)1.976 for a 95% confidence level. Use the sample data for Example 1, the Stat diskdisplay in Example 1 on page 513, and the Stat Crunchdisplay in Exercise 17 to construct 95% confidence interval estimates of \({\beta _1}\) (the coefficient for the variable representing height) and\({\beta _2}\) (the coefficient for the variable representing waist circumference). Does either confidence interval include 0, suggesting that the variable be eliminated from the regression equation?

Interpreting a Graph The accompanying graph plots the numbers of points scored in each Super Bowl to the last Super Bowl at the time of this writing. The graph of the quadratic equation that best fits the data is also shown in red. What feature of the graph justifies the value of\({R^2}\)= 0.255 for the quadratic model?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free