In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi/gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi/gal).

Which regression equation is best for predicting city fuel consumption? Why?

Short Answer

Expert verified

The best regression equation is \({\rm{CITY}} = - 3.15 + 0.819{\rm{HWY}}\)to predict the city fuel consumption.

Step by step solution

01

Given information

The table representing the predictor variables, P-value, \({R^2}\) , Adjusted \({R^2}\)and the regression equations are provided.

02

Criteria for selecting the best model

The model with the highest measure of R-square and adjusted R-square is a good fit. Also, the number of predictors in the model should not be large to avoid overfitting. Thus, a two-predictor model is better if there is a significant increase in the measures of R-squared measure from the one-predictor model.

03

Determine the regression equation for the best model

It is alwaysbetter to use one predictor variable instead of twoin a regression equation.

It can be observed that all models have a P-value of 0.0000, which indicates a significant model.

The highest adjusted\({R^2}\)value in one predictor model is 0.920 for the HWY predictor variable. As the WT or DISP variable is added in the analysis, the adjusted\({R^2}\)measure increases to 0.935 and 0.928, which is not significant increase.

Therefore, the best predictor variable to predict the city’s fuel consumption is HWY, and the best regression equation is \({\rm{CITY}} = - 3.15 + 0.819{\rm{HWY}}\)

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

a. What is a residual?

b. In what sense is the regression line the straight line that “best” fits the points in a scatterplot?

In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi , gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi , gal).

If exactly two predictor (x) variables are to be used to predict the city fuel consumption, which two variables should be chosen? Why?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CSI Statistics Police sometimes measure shoe prints at crime scenes so that they can learn something about criminals. Listed below are shoe print lengths, foot lengths, and heights of males (from Data Set 2 “Foot and Height” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between shoe print lengths and heights of males? Based on these results, does it appear that police can use a shoe print length to estimate the height of a male?

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

Confidence Intervals for a Regression Coefficients A confidence interval for the regression coefficient b1 is expressed

\(\begin{array}{l}{b_1} - E < {\beta _1} < {b_1} + E\\\end{array}\)

Where

\(E = {t_{\frac{\alpha }{2}}}{s_{{b_1}}}\)

The critical t score is found using n –(k+1) degrees of freedom, where k, n, and sb1 are described in Exercise 17. Using the sample data from Example 1, n = 153 and k = 2, so df = 150 and the critical t scores are \( \pm \)1.976 for a 95% confidence level. Use the sample data for Example 1, the Stat diskdisplay in Example 1 on page 513, and the Stat Crunchdisplay in Exercise 17 to construct 95% confidence interval estimates of \({\beta _1}\) (the coefficient for the variable representing height) and\({\beta _2}\) (the coefficient for the variable representing waist circumference). Does either confidence interval include 0, suggesting that the variable be eliminated from the regression equation?

Super Bowl and\({R^2}\)Let x represent years coded as 1, 2, 3, . . . for years starting in 1980, and let y represent the numbers of points scored in each Super Bowl from 1980. Using the data from 1980 to the last Super Bowl at the time of this writing, we obtain the following values of\({R^2}\)for the different models: linear: 0.147; quadratic: 0.255; logarithmic: 0.176; exponential: 0.175; power: 0.203. Based on these results, which model is best? Is the best model a good model? What do the results suggest about predicting the number of points scored in a future Super Bowl game?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free