a. compute the three sums of squares, SST,SSR,SSE, using the defining formulas

b. verify the regression identity,SST=SSR+SSE

c. compute the coefficient of determination.

d. determine the percentage of variation in the observed values of the response variable that is required by the regression

e. State how useful the regression equation appears to be for making predictions.

y^=9-2x

Short Answer

Expert verified

(a) SST=38SSR=24SSE=14

(b) SST=38

(c) 0.6316

(d) 63.16%

(e) Utilising the regression equation to generate predictions is not practical, and the regression can only explain around 63%of the variation.

Step by step solution

01

Part (a) Step 1: Given information

The given data is

y^=9-2x

02

Part (a) Step 2: Explanation

The given regression equation is

y^=9-2x

Formulas to calculate the sum of squares is

SST=yi-y¯2SST=y^i-y¯2SST=yi-y^2

As shown in the table below, the relevant sums can be determined.

SST=38SSR=24SSE=14

03

Part (b) Step 1: Given information

The given data is

y^=9-2x

04

Part (b) Step 2: Explanation

From the above answer

SST=SSR+SSE

=24+14=38

05

Part (c) Step 1: Given information

The given data is

y^=9-2x

06

Part (c) Step 2: Explanation

The formula for the coefficient of determination is

r2=SSRSST

=2438=0.6316

07

Part (d) Step 1: Given information

The given data is

y^=9-2x

08

Part (d) Step 2: Explanation

The coefficient of determination restated as a percentage is the percentage of variation:

0.6316=63.16%

09

Part (e) Step 1: Given information 

The given data is

y^=9-2x

10

Part (e) Step 2: Explanation

The regression equation can be used to generate predictions if the estimated r2is near to 1.

The computed r2is 0.6316, which is a long way from 1.

As a result, utilising the regression equation to generate predictions is not practical, and the regression can only explain around 63% of the variation.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Shortleaf Pines. The data from Exercise 4.80for volume, in cubic feet, and diameter at breast height, in inches, for 70shortleaf pines are on the Weiss Stats site.

a) Decide whether finding a regression line for the data is reasonable. If so, then also do puts (b)-(d).

For each exercise, determine the linear correlation coefficient by using

a. Define 4on page 183,

b. Formula 4.3an page 185.

Compare your answer an para (a) and (b

A Knight-Ridder News Service article in an issue of the Wichita Eagle discussed a study on the relationship between country music and suicide. The results of the study. coauthored by S. Stack and J. Gundlach, appeared as the paper "The Effect of Country Music on Suicide" (Social Forces, Vol. 71, Issue 1. Pp. 211-218). According to the article, " analysis of 49 metropolitan areas shows that the greater the airtime devoted to country music. the greater the white suicide rate," (Suicide rates in the black population were found to be uncorrelated with the amount of country music airtime.)

(a). Use the terminology introduced in this section to describe the statement quoted above.

(b). One of the conclusions stated in the journal article was that country music "nurtures a suicidal mood" by dwelling on marital status and alienation from work. Is this conclusion warranted solely on the basis of the positive correlation found between airtime devoted to country music and white suicide rate? Explain your answer.

Answer true or false to the following statement and provide a reason for your answer: If there is a very strong positive correlation between two variables, a causal relationship exists between the two variables.

The data for shell thickness and concentration of PCBs for 60Anacapa pelican eggs from Exercise4.76are on the Weiss Stats site.

(a) Decide whether finding a regression line for the data is reasonable. If so, then also do puts (b)-(d).

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free