Going for it on fourth down in the NFL. Refer to the Chance (Winter 2009) study of fourth-down decisions by coaches in the National Football League (NFL), Exercise 11.69 (p. 679). Recall that statisticians at California State University, Northridge, fit a straight-line model for predicting the number of points scored (y) by a team that has a first-down with a given number of yards (x) from the opposing goal line. A second model fit to data collected on five NFL teams from a recent season was the quadratic regression model, E(y)=β0+β1x+β2x2.The regression yielded the following results: y=6.13+0.141x-0.0009x2,R2=0.226.

a) If possible, give a practical interpretation of each of the b estimates in the model.

b) Give a practical interpretation of the coefficient of determination,R2.

c) In Exercise 11.63, the coefficient of correlation for the straight-line model was reported asR2=0.18. Does this statistic alone indicate that the quadratic model is a better fit than the straight-line model? Explain.

d) What test of hypothesis would you conduct to determine if the quadratic model is a better fit than the straight-line model?

Short Answer

Expert verified

a.β0 indicates the y-intercept term of the curve. It means it gives the value of E(y) whenx1=0

β1indicates the magnitude of the shift in parabola due to changes in the value of x (shift parameter)

β2indicates the rate of curvature of the parabola. (shape parameter).

b. Here, 23% is a very low value for R2meaning the model is not a good fit for the data.

c. When a straight-line model was fitted to the data, the value of R2was 18% while when a quadratic model is fitted to the data, the value of R2increases to 23%. This means that the quadratic model is a better fit for the data than a straight-line model. However, 23% is still a lower value meaning a better quadratic model can be used to fit the data.

d. To test whether a quadratic model is a good fit for the data, F-test needs to be done.

Step by step solution

01

Interpretation of beta estimates

β0indicates the y-intercept term of the curve. It means it gives the value of E(y) whenx1=0

β1indicates the magnitude of the shift in parabola due to changes in the value of x (shift parameter)

β2indicates the rate of curvature of the parabola. (Shape parameter).

02

Simplification of R2

The value ofR2given here is 0.226 which denotes that about 23% of the variation in the variables can be explained by the model. A higher value ofR2means that the model is a good fit for the data while a lower value suggests otherwise.

Here, 23% is a very low value forR2meaning the model is not a good fit for the data.

03

Analysis of R2

When a straight-line model was fitted to the data, the value of R2was 18% while when a quadratic model is fitted to the data, the value of R2increases to 23%. This means that the quadratic model is a better fit for the data than a straight-line model. However, 23% is still a lower value meaning a better quadratic model can be used to fit the data.

04

Significance of the model

To test whether a quadratic model is a good fit for the data, F-test needs to be done wherethe null hypothesis is whether the model parameters are explaining the model where the beta values are zero and the alternate hypothesis is whether the beta values are non-zero.

Mathematically,

H0:β1=β2=0

Ha:At least one of the parametersβ1orβ2is non zero

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: The complete modelE(y)=β0+β1x1+β2x2+β3x3+β4x4+εwas fit to n = 20 data points, with SSE = 152.66. The reduced model,E(y)=β0+β1x1+β2x2+ε, was also fit, with

SSE = 160.44.

a. How many β parameters are in the complete model? The reduced model?

b. Specify the null and alternative hypotheses you would use to investigate whether the complete model contributes more information for the prediction of y than the reduced model.

c. Conduct the hypothesis test of part b. Use α = .05.

Question: Failure times of silicon wafer microchips. Refer to the National Semiconductor study of manufactured silicon wafer integrated circuit chips, Exercise 12.63 (p. 749). Recall that the failure times of the microchips (in hours) was determined at different solder temperatures (degrees Celsius). The data are repeated in the table below.

  1. Fit the straight-line modelEy=β0+β1xto the data, where y = failure time and x = solder temperature.

  2. Compute the residual for a microchip manufactured at a temperature of 149°C.

  3. Plot the residuals against solder temperature (x). Do you detect a trend?

  4. In Exercise 12.63c, you determined that failure time (y) and solder temperature (x) were curvilinearly related. Does the residual plot, part c, support this conclusion?

Question:Consider the first-order model equation in three quantitative independent variables E(Y)=2-3x1+5x2-x3

  1. Graph the relationship between Y and x3for x1=2 and x2=1
  2. Repeat part a for x1=1and x2=-2
  3. How do the graphed lines in parts a and b relate to each other? What is the slope of each line?
  4. If a linear model is first-order in three independent variables, what type of geometric relationship will you obtain when is graphed as a function of one of the independent variables for various combinations of the other independent variables?

Consider fitting the multiple regression model

Ey=β0+β1x1+β2x2+β3x3+β4x4+β5x5

A matrix of correlations for all pairs of independent variables is given below. Do you detect a multicollinearity problem? Explain.


Suppose you used Minitab to fit the model y=β0+β1x1+β2x2+ε

to n = 15 data points and obtained the printout shown below.

  1. What is the least squares prediction equation?

  2. Find R2and interpret its value.

  3. Is there sufficient evidence to indicate that the model is useful for predicting y? Conduct an F-test using α = .05.

  4. Test the null hypothesis H0: β1= 0 against the alternative hypothesis Ha: β1≠ 0. Test using α = .05. Draw the appropriate conclusions.

  5. Find the standard deviation of the regression model and interpret it.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free