As we noted, because of the regression identity, we can express the coefficient of determination in terms of the total sum of squares and the error sum of squares as r2=1-SSE/SST

a. Explain why this formula shows that the coefficient of determination can also be interpreted as the percentage reduction obtained in the total squared error by using the regression equation instead of the mean. Y¯. to predict the observed values of the response variable.

b.

x
6
6
6
2
2
5
4
5
1
4
y
290
280
295
425
384
315
355
325
425
325

What percentage reduction is obtained in the total squared error by using the regression equation instead of the mean of the observed prices to predict the observed prices?

Short Answer

Expert verified

(a) r2=1-SSESSTis proved.

(b)r2=93.80%

Step by step solution

01

Part (a) Step 1: Given Information

Given In the question that,r2=1-SSE/SST

we have to Explain why this formula shows that the coefficient of determination can also be interpreted as the percentage reduction obtained in the total squared error by using the regression equation instead of the mean. Y. to predict the observed values of the response variable.

02

Part (a) Step 2: Explanation

The coefficient of determination given by the formula:

r2=SSRSST

It can also be interpreted as the coefficient of variation, denoted by r2;which is equal to the proportion of variation in the observed values of the response variable explained by the regression.

Now, from the concept of regression identity, total sum of squares is equal to the sum of regression sum of squares and the error sum of squares, that is,

SSR=SST+SSE

Therefore, the coefficient of determination can be written as

r2=SSRSST=SST-SSESST=1-SSESST

Hence,proved.

03

Part (b) Step 1: Given Information

Given in the question that,

x
6
6
6
2
2
5
4
5
1
4
y
290
280
295
425
384
315
355
328
425
325

we have to identify percentage reduction is obtained in the total squared error by using the regression equation instead of the mean of the observed prices to predict the observed prices.

04

Part (b) Step 2: Explanation

The percentage reduction in the total squared error using the MINITAB can be obtained as

Step 1: Store the given data in columns named xandyrespectively

Step 2: Choose Stat >Regression >Fitted Line Plot......

Step 3: Specify C2 in Response (y) and C1 in Predictor (x) text boxes

Step 4: Click OK.

Hence, the answer will ber2=93.80%

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Tax Efficiency. In Exercise 4.58, you determined a regression equation that relates the variables percentage of investments in energy securities and tax efficiency for mutual fund portfolios.

a. Should that regression equation be used to predict the tax efficiency of a mutual fund portfolio with 6.4%of its investments in energy securities? with 15%of its investments in energy securities? Explain your answers.

b. For which percentages of investments in energy securities is use of the regression equation to predict tax efficiency reasonable?

Home Size and Value. The data from Exercise 4.74 for home size (in square feet) and assessed value (in thousands of dollars) for the same homes as in Exercise 4.157 are on the WeissStats site.

a. decide whether use of the linear correlation coefficient as a descriptive measure for the data is appropriate. If so, then also do parts (b) and (c).

b. obtain the linear correlation coefficient.

c. interpret the value of r in terms of the linear relationship between the mo variables in question.

9. Based on the least-squares criterion, the line that best fits a set of data points is the one with the ________possible sum of squared errors.

Birdies and Score. The data from Exercise 4.70 for number of birdies during a tournament and final score for 63women golfers are on the WeissStats site.

a. decide whether use of the linear correlation coefficient as a descriptive measure for the data is appropriate, If so, then also do parts (b) and (c).

b. obtain the linear correlation coefficient.

c. interpret the value of rin terms of the linear relationship between the two variables in question.

In Exercise 4.6, we give linear equations. For each equation,

a. find the y-intercept and slope.

b. determine whether the line slopes upward, slopes downward, or is horizontal, without graphing the equation.

c. use two points to graph the equation.

Given equation is,

y=2x-1

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free