Estimating r For each of the following, estimate the value of the linear correlation coefficient r for the given paired data obtained from 50 randomly selected adults.

a. Their heights are measured in inches (x) and those same heights are recorded in centimetres (y).

b. Their IQ scores (x) are measured and their heights (y) are measured in centimetres.

c. Their pulse rates (x) are measured and their IQ scores are measured (y).

d. Their heights (x) are measured in centimetres and those same heights are listed again, but with negative signs (y) preceding each of these second listings.

Short Answer

Expert verified

a. The estimated value of the linear correlation coefficient (r) between heights measured in inches and heights measured in centimeters is approximately 1.

b. The estimated value of the linear correlation coefficient (r) between the IQ scores and heights measured in centimeters is approximately 0.

c. The estimated value of the linear correlation coefficient (r) between the pulse rates and the IQ scores is approximately 0.

d. The estimated value of the linear correlation coefficient (r) between the heights measured in centimeters and the same heights with a negative sign is approximately -1.

Step by step solution

01

Given information

Few variables are listed for a set of 50 randomly selected adults.

02

Define a linear correlation coefficient

An estimate of the linear correlation coefficient for any set of variables is a value in the range of -1 to 1, which can be guessed (or computed if observations are known) using the prior knowledge of the relationship between the two variables.

For example:

  • Correlation 0 implies there is no linear relationship.
  • Correlation -1 implies there is a negative linear relationship.
  • Correlation 1 implies there is a positive linear relationship.
03

Estimate the correlation coefficient for heights in two units

a.

Let x denote the heights of the 50 adults measured in inches.

Let y denote the heights of the 50 adults measured in centimeters.

It is known that one inch of measurement is equivalent to 2.54 centimeters.

Thus, each observation for the x variable increases by a multiple of 2.54 units.

Since the variables on both the axes represent the same set of observations measured in different units, the scatterplot between x and y will be a near-perfect straight line sloping upward (since when x increases, y will also increase).

A straight-line pattern sloping upward on a scatterplot implies a correlation coefficient equal to 1.

04

Estimate the correlation coefficient for heights and IQ

b.

Let x denote the IQ scores of the 50 adults.

Let y denote the heights of the 50 adults measured in centimeters.

Intuitively, there is no relationship between the height of an individual and IQ scores. Thus, it is expected that the scatterplot constructed between the variables will show the points randomly scattered over the graph. It is unlikely that the variables form a close to linear pattern.

Since randomly scattered points (no pattern) on a scatterplot imply a correlation coefficient equal to 0, the estimated correlation coefficient value is 0.

05

Estimate the correlation coefficient for pulse rates and IQ

c.

Let x denote the pulse rates of the 50 adults.

Let y denote the IQ scores of the 50 adults.

Intuitively, there is no relationship between the pulse rate of an individual and IQ scores. Thus, it is expected that the scatterplot constructed between the variables will be a randomly scattered set of observations with no specific pattern. It is unlikely that variables form a linear pattern as there is no association between the pulse rate of an adult and his/her IQ score

Since randomly scattered points (no pattern) on a scatterplot imply a correlation coefficient equal to 0, the estimated correlation coefficient value is 0.

06

Estimate the correlation coefficient for heights with opposite sign 

d.

Let x denote the heights of the 50 adults measured in centimeters.

Let y denote the heights of the 50 adults measured in centimeters with a negative sign.

From the definition of variables, each observation of y corresponds to a negative measure of the corresponding x observation.

Since the variables on both axes represent the same thing, the scatterplot between x and y will be a near-perfect straight line. However, here, the values have an opposite sign, which indicates that the straight line between x and y will be sloping downward (since when x increases, y will decrease).

Astraight-line pattern sloping downward on a scatterplot implies a correlation coefficient equal to -1.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Linear Correlation In this section we use r to denote the value of the linear correlation coefficient. Why do we refer to this correlation coefficient as being linear?

Frequency Polygon. In Exercises 15 and 16, construct the frequency polygons.

Old Faithful Use the frequency distribution from Exercise 11 in Section 2-1 on page 49 to construct a frequency polygon. Does the graph suggest that the distribution is skewed? If so, how?

Linear Correlation Coefficient In Exercises 9–12, the linear correlation coefficient r is provided. Use Table 2-11 on page 71 to find the critical values of r. Based on a comparison of the linear correlation coefficient r and the critical values, what do you conclude about a linear correlation?

Using the data from Exercise 8 “Heights of Fathers and Sons,” the linear correlation coefficient is r = -0.017.

In Exercises 1–6, refer to the data below, which are total home game playing times (hours) for all Major League Baseball teams in a recent year (based on data from Baseball Prospectus).

236 237 238 239 241 241 242 245 245 245 246 247 247 248 248 249 250 250 250 251 252 252 253 253 258 258 258 260 262 264

Data Type

a. The listed playing times are all rounded to the nearest whole number. Before rounding, are the exact playing times discrete data or continuous data?

b. For the listed times, are the data categorical or quantitative?

c. Identify the level of measurement of the listed times: nominal, ordinal, interval, or ratio.

d. Which of the following best describes the sample data: voluntary response sample, random sample, convenience sample, simple sample?

e. The listed total game times are from one recent year, and the data are available for all years back to 1950. Given that the listed times are part of a larger collection of times, do the data constitute a sample or a population?

Pareto Charts. In Exercises 11 and 12 construct the Pareto chart. Getting a Job In a survey, subjects seeking a job were asked to whom they should send a thank-you note after having a job interview. Results were as follows: 40 said only the person they spent the most time with, 396 said everyone they met, 40 said only the most senior-level person, 15 said the person that they had the best conversation with, and 10 said that they don’t send thank-you notes (based on data from TheLadders.com). Comment on the results.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free