We repeat the data and provide the regression equations.

Part (a): Compute the three sums of squares, SST, SSR and SSE using the defining formulas.

Part (b): Verify the regression identity,SST=SSR+SSE.

Part (c): Compute the coefficient of determination.

Part (d): Determine the percentage of variation in the observed values of the response variable that is explained by the regression.

Part (e): State how useful the regression equation appears to be for making predictions.

Short Answer

Expert verified

Part (a): The three sums of squares, SST, SSR and SSE are 26,20and 6respectively.

Part (b): Substitute the values of SSR and SSE in the formula of SST, to verify the given regression identity.

Part (c): The coefficient of determination is 0.769.

Part (d): The percentage of variation in the observed values of the response variable that is explained by the regression is 76.9%.

Part (e): The regression is useful for making predictions.

Step by step solution

01

Part (a) Step 1. Given information.

Consider the given question,

02

Part (a) Step 2. Write the formulas of three sums of squares, SST, SSR and SSE.

We know,

SST=iyi-y2,wherey=iyinSSR=iyi-y2,SSE=SST-SSR

Then,y=4+5+0-14=2

03

Part (a) Step 3. Construct the table.

On constructing the table,

For i=1,2,3 by putting the values of xin equation y^=-3+2x.

Using the table,

SST=iyi-y2=26SSR=iyi-y2=20SSE=SST-SSR=26-20=6

04

Part (b) Step 1. Verify the equation of SST.

In regression, the equation,

SST=SSR+SSE=20+6=26

05

Part (c) Step 1. Compute the coefficient of determination.

Consider the coefficient of determination,

r2=SSRSST=2026=0.769

06

Part (d) Step 1. Determine the percentage of variation.

As the coefficient of determination is 0.769.

Then we can say that 76.9%of the variation in the observed value is explained by the regression.

07

Part (e) Step 1. State usefulness of the regression equation.

On stating the usefulness of the regression equation,

We can say that here the regression is useful for making predictions.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

PCBs and Pelicans. The data on shell thickness and concentration of PCBs for 60Anacapa pelican eggs from Exercise 4.76 are on the WeissStats site.

a. decide whether use of the linear correlation coefficient as a descriptive measure for the data is appropriate. If so, then also do parts (b) and (c).

b. obtain the linear correlation coefficient.

c. interpret the value of r in terms of the linear relationship between the two variables in question.

In Exercise 4.9, we give linear equations. For each equation,

a. find the y-intercept and slope.

b. determine whether the line slopes upward, slopes downward, or is horizontal, without graphing the equation.

c. use two points to graph the equation.

Given equation is,

y=0.5x-2

13. Identify a use of the coefficient of determination as a descriptive measure.

4.76 PCBs and Pelicans, Polychlorinated biphenyls (PCBs), industrial pollutants, are known to be carcinogens and a great danger to natural ecosystems. As a result of several studies, PCB production was banned in the United States in 1979and by the Stockholm Convention on Persistent Organic Pollutants in 2001. One study, published in 1972 by R. Risebrough, is titled "Effects of Environmental Pollutants Upon Animals Other Than Man" (Proceedings of the 6th Berkeley Symposium on Mathematics and Statistics, Vl, University of California Press, Pp. 443-463). In that study, 60Anacapa pelican eggs were collected and measured for their shell thickness, in millimeters (mm), and concentration of PCBs, in parts per million (ppm). The data are on the Weiss Stats site.

a. Obtain a scatterplot for the data.

b. Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f).

As we noted, because of the regression identity, we can express the coefficient of determination in terms of the total sum of squares and the error sum of squares as r2=1-SSE/SST

a. Explain why this formula shows that the coefficient of determination can also be interpreted as the percentage reduction obtained in the total squared error by using the regression equation instead of the mean. Y¯. to predict the observed values of the response variable.

b.

x
6
6
6
2
2
5
4
5
1
4
y
290
280
295
425
384
315
355
325
425
325

What percentage reduction is obtained in the total squared error by using the regression equation instead of the mean of the observed prices to predict the observed prices?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free