Question: Estimating repair and replacement costs of water pipes. Refer to the IHS Journal of Hydraulic Engineering (September, 2012) study of the repair and replacement of water pipes, Exercise 11.21 (p. 655). Recall that a team of civil engineers used regression analysis to model y = the ratio of repair to replacement cost of commercial pipe as a function of x = the diameter (in millimeters) of the pipe. Data for a sample of 13 different pipe sizes are reproduced in the accompanying table. In Exercise 11.21, you fit a straight-line model to the data. Now consider the quadratic model,E(y)=β0+β1x+β2x2. A Minitab printout of the analysis follows (next column).

  1. Give the least squares prediction equation relating ratio of repair to replacement cost (y) to pipe diameter (x).
  2. Conduct a global F-test for the model usingα=0.01. What do you conclude about overall model adequacy?
  3. Evaluate the adjusted coefficient of determination,Ra2, for the model.
  4. Give the null and alternative hypotheses for testing if the rate of increase of ratio (y) with diameter (x) is slower for larger pipe sizes.
  5. Carry out the test, part d, using α=0.01.
  6. Locate, on the printout, a 95% prediction interval for the ratio of repair to replacement cost for a pipe with a diameter of 240 millimeters. Interpret the result.

Short Answer

Expert verified

Answers:

  1. Therefore, the least squares prediction equation isy^=6.265955+0.007914x+0.00000425x2
  2. At 99% significance level, it can be concluded thatβ1β20
  3. The value ofRa2in the excel output is 0.972166. This means that around 97% of the variation in the variables is explained by the model. The higher the value, the better the model is for fitting to the data. 97% indicates the model is an ideal fit for the data.
  4. The null hypothesis is whether β2=0while the alternate checks if the value ofβ2<0 .
  5. Mathematically,H0:β2=0 ,while Ha:β2<0.
  6. At 99% confidence level,β2=0.This means that the parabola doesn’t have a curvature and it essentially is a straight line.
  7. The 95% prediction interval for the ratio of repair to replacement cost for a pipe with a diameter of 250 mm is given as (7.59472, 8.36237). This means that the ratio of repair to replacement can be between 7.59472 and 8.36237 for a given value of the diameter of the pipe (here 240 mm)

Step by step solution

01

Least Squares prediction equation

Ratio of repair to replacement cost (y)

Pipe diameter (x)

x2

6.58

80

6400

6.97

100

10000

7.39

125

15625

7.61

150

22500

7.78

200

40000

7.92

250

62500

8.2

300

90000

8.42

350

122500

8.6

400

160000

8.97

450

202500

9.31

500

250000

9.47

600

360000

9.72

700

490000

Excel summary output

SUMMARY OUTPUT

















Regression Statistics








Multiple R

0.988335








R Square

0.976805








Adjusted R Square

0.972166








Standard Error

0.162211








Observations

13

















ANOVA









df

SS

MS

F

Significance F




Regression

2

11.08098

5.540491

210.5646

6.71E-09




Residual

10

0.263125

0.026313






Total

12

11.34411













Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept

6.265955

0.155438

40.31149

2.11E-12

5.919617

6.612294

5.919617

6.612294

Pipe diameter (x)

0.007914

0.000997

7.934744

1.26E-05

0.005692

0.010137

0.005692

0.010137

x2

-4.3E-06

1.32E-06

-3.22867

0.009041

-7.2E-06

-1.3E-06

-7.2E-06

-1.3E-06

Therefore, the least-squares prediction equation isy^=6.265955+0.007914x+0.00000425x2

02

Overall goodness of fit

H0:β1=β2=0

Ha:At least one of the parametersβ1 orβ2 is non zero

Here, F test statistic =SSEn-k+1=210.5646.Here, p-value = 6.71E-09

H0is rejected if p-value < α. Forα=0.01 , Since p-value <0.01

Sufficient evidence to rejectH0 at 99% confidence interval.

Hence, β1β20

03

Interpretation of Ra2

The value of Ra2in the excel output is 0.972166. This means that around 97% of the variation in the variables is explained by the model. The higher the value, the better the model is for fitting to the data. 97% indicates the model is an ideal fit for the data.

04

Null and alternate hypothesis

To test whether the rate of increase of ratio (y) with a diameter (x) is slower for larger diameter pipe sizes, we want to check whether the curvature of the parabola is the negative meaning if the parabola is downward sloping.

The null hypothesis is whetherβ2=0 while the alternate checks if the value ofβ2<0 .

Mathematically, H0:β2=0whileHa:β2<0 .

05

Significance of β2

H0:β2=0

Ha:β2<0

Here, t-test statistic=β2^sβ2=-0.0000040.000001=-4

Value oft0.01,12 is 2.681

H0is rejected if t statistic > t0.01,12. Forα=0.01 , since t <t0.01,12

Not sufficient evidence to reject H0at a 99% confidence interval.

Thus,β2=0.

This means that the parabola doesn’t have a curvature and it essentially is a straight line.

06

Prediction value

The 95% prediction interval for the ratio of repair to replacement cost for a pipe with a diameter of 250 mm is given as (7.59472, 8.36237). This means that the ratio of repair to replacement can be between 7.59472 and 8.36237 for a given value of the diameter of the pipe (here 240 mm).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: Study of supervisor-targeted aggression. “Moonlighters” are workers who hold two jobs at the same time. What are the factors that impact the likelihood of a moonlighting worker becoming aggressive toward his/her supervisor? This was the research question of interest in the Journal of Applied Psychology (July 2005). Completed questionnaires were obtained from n = 105 moonlighters, and the data were used to fit several multiple regression models for supervisor-directed aggression score 1y2. Two of the models (with R2-values in parentheses) are given below:

a. Interpret the R2-values for the models.

b. Give the null and alternative hypotheses for comparing the fits of models 1 and 2.

c. Are the two models nested? Explain.

d. The nested F-test for comparing the two models resulted in F = 42.13 and p-value < .001. What can you conclude from these results?

e. A third model was fit, one that hypothesizes all possible pairs of interactions between self-esteem, history of aggression, interactional injustice at primary job, and abusive supervisor at primary job. Give the equation of this model (model 3).

f. A nested F-test to compare models 2 and 3 resulted in a p-value > .10. What can you conclude from this result?

Suppose the mean value E(y) of a response y is related to the quantitative independent variables x1and x2

E(y)=2+x1-3x2-x1x2

a) Identify and interpret the slope forx2

b) Plot the linear relationship between E(y) andx2for role="math" localid="1649796003444" x1=0,1,2, whererole="math" localid="1649796025582" 1x23

c) How would you interpret the estimated slopes?

d) Use the lines you plotted in part b to determine the changes in E(y) for eachrole="math" localid="1649796051071" x1=0,1,2.

e) Use your graph from part b to determine how much E(y) changes whenrole="math" localid="1649796075921" 3x15androle="math" localid="1649796084395" 1x23.

Question: Consumer behavior while waiting in line. The Journal of Consumer Research (November 2003) published a study of consumer behavior while waiting in a queue. A sample of n = 148 college students was asked to imagine that they were waiting in line at a post office to mail a package and that the estimated waiting time is 10 minutes or less. After a 10-minute wait, students were asked about their level of negative feelings (annoyed, anxious) on a scale of 1 (strongly disagree) to 9 (strongly agree). Before answering, however, the students were informed about how many people were ahead of them and behind them in the line. The researchers used regression to relate negative feelings score (y) to number ahead in line (x1) and number behind in line (x2).

a.The researchers fit an interaction model to the data. Write the hypothesized equation of this model.

b. In the words of the problem, explain what it means to say that “x1 and x2 interact to affect y.”

c. A t-test for the interaction β in the model resulted in a p-value greater than 0.25. Interpret this result.

d. From their analysis, the researchers concluded that “the greater the number of people ahead, the higher the negative feeling score” and “the greater the number of people behind, the lower the negative feeling score.” Use this information to determine the signs of β1 and β2 in the model.

Question: Tilting in online poker. In poker, making bad decisions due to negative emotions is known as tilting. A study in the Journal of Gambling Studies (March, 2014) investigated the factors that affect the severity of tilting for online poker players. A survey of 214 online poker players produced data on the dependent variable, severity of tilting (y), measured on a 30-point scale (where higher values indicate a higher severity of tilting). Two independent variables measured were poker experience (x1, measured on a 30-point scale) and perceived effect of experience on tilting (x2, measured on a 28-point scale). The researchers fit the interaction model, . The results are shown below (p-values in parentheses).

  1. Evaluate the overall adequacy of the model using α = .01.

b. The researchers hypothesize that the rate of change of severity of tilting (y) with perceived effect of experience on tilting (x2) depends on poker experience (x1). Do you agree? Test using α = .01.

Question: Job performance under time pressure. Refer to the Academy of Management Journal (October 2015) study of how time pressure affects team job performance, Exercise 12.89 (p. 765). Recall that the researchers hypothesized a complete second-order model relating team performance (y) to perceived time pressure (x1), and whether or not the team had an effective leader (x2 = 1 if yes, 0 if no):

E(Y)=β0+β1x1+β2x22+β3x2+β4x1x2+β5x12x2

a) How would you determine whether the rate of increase of team performance with time pressure depends on effectiveness of the team leader?

b) For fixed time pressure, how would you determine whether the mean team performance differs for teams with effective and non-effective team leaders?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free