On the WeissStats site are data on home size (in square feet) and assessed value (in thousands of dollars) for the same homes as in Exercise \(4.73\).

a. Obtain a scatterplot for the data.

b. Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f)

c. Determine the interpret the regression equation for the data.

d. Identify potential outliers and influential observations.

e. In case a potential outlier is present, remove it and discuss the effect.

f. In case a potential influential observation is present, remove it and discuss the effect.

Short Answer

Expert verified

Part a.

Part b. It can be noted that, there is no strong curvature present in the scatterplot, therefore it is reasonable to find the regression line to the data

Part c. \(\hat{y}=111.5233+0.1116x\)

Part d. There is no any potential outliers and influential observations.

Part e. Not applicable

Part f. Not applicable

Step by step solution

01

Part a. Step 1. Given information

The below table gives the size of home (in square feet) and the assessed value (in thousands of dollars) for a sample of \(44\) homes.

02

Part a. Step 2. Graph

The below graph represents the given points and home size is on the horizontal axis and the value is on the vertical axis.

03

Part b. Step 1. Explanation

It is reasonable to find the regression line for the data if there is no strong curvature present in the scatterplot.

It can be noted that, there is no strong curvature present in the scatterplot, therefore it is reasonable to find the regression line to the data

04

Part c. Step 1. Calculation

The homes are valued by their sizes. Therefore the response variable is value and the predictor variable is the home size.

The sample size \(n=44\).

Below are the necessary sums.

\(\sum x_{i}=132455\)

\(\sum y_{i}=19689\)

\(\sum x_{i}^{2}=430481773\)

\(\sum x_{i}y_{i}=62813175\)

To find \(s_{xy}\) and \(s_{xx}\):

\(s_{xx}=430481773-\frac{132455^{2}}{44}=31747067.8864\)

\(s_{xy}=62813175-\frac{(132455)(19689)}{63}=3542572.8409\)

To find the averages:

\(\bar{x}=\frac{132455}{44}=3010.3409\)

\(\bar{y}=\frac{19689}{44}=447.4773\)

Hence the parameters are:

\(b_{1}=\frac{3542572.8409}{31747067.8864}\)

\(=0.1116\)

\(b_{0}=447.4773-(0.1116)\times 3010.3409\)

\(=111.5233\)

The regression equation to predict the value \((y)\) from the home size \((x)\) is,

\(\hat{y}=111.5233+0.1116x\)

From the regression equation, the value of a house is increase on average by \(0.1116\) thousand dollars if the home size is increase by \(1\) square feet.

05

Part d. Step 1. Concept Used

If a data point lies far from the regression line then it is an outlier.

If the removal of a point causes a considerable change in the regression equation then the point is called an influential observation. That is, the removal of a point causes a considerable change in the direction of the regression line.

06

Part d. Step 2. Explanation

The predicted values for the given data are summarized in the below table.

The below graph represents the given points and the fitted regression line.

From the plotted graph,

  • All the points are closed to the regression line therefore there are no potential outliers in the dataset.
  • The removal of a point does not cause any considerable change in the direction of the regression line therefore there are no potential influential observations.
07

Part e. Step 1. Explanation

Not applicable, because it was concluded that there are no outliers in part (d).

08

Part f. Step 1. Explanation

Not applicable, because it was concluded that there are no potential influential observation in part (d).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

In Exercise 4.8, we give linear equations. For each equation,

a. find the y-intercept and slope.

b. determine whether the line slopes upward, slopes downward, or is horizontal, without graphing the equation.

c. use two points to graph the equation.

Given equation is,

y=-4x-8

Acreage and Value. The data from Exercise 4.73 for lot size (in acres) and assessed value (in thousands of dollars) for a sample of homes in a particular area are on the WeissStats site.

Movie Grosses. The data from Exercise 4.72 on domestic and overseas grosses for a random sample of 50movies are on the Weiss Stats site.

a. decide whether use of the linear correlation coefficient as a descriptive measure for the data is appropriate. If so, then also do parts (b) and (CK.

b. obtain the linear correlation coefficient.

c. interpret the value of rin terms of the linear relationship between the two variables in question.

Sample Covariance. For a set of n data points, the sample covariance, sxy+is given by

The sample covariance can be used as an alternative method for tinding the slope and y-intercept of a regression line. The formulas are

b1=sv/xk2andb0=y^-b1i^n

where sidenotes the sample standard deviation of the x-values.

a. Use Equation (4.1) to determine the sample covariance of the data points in Exercise 4,45.

b. Use Equation (4.2) and your answer from part (a) to find the regression equation. Compare your result to that found in Exercise 4.57.

Answer true or false to the following statement and provide a reason for your answer: If there is a very strong positive correlation between two variables, a causal relationship exists between the two variables.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free