Question: Comparing two orange juice extractors. The Florida Citrus Commission is interested in evaluating the performance of two orange juice extractors, brand A and brand B. It is believed that the size of the fruit used in the test may influence the juice yield (amount of juice per pound of oranges) obtained by the extractors. The commission wants to develop a regression model relating the mean juice yield E(y) to the type of orange juice extractor (brand A or brand B) and the size of orange (diameter), .

a. Identify the independent variables as qualitative or quantitative.

b. Write a model that describes the relationship between E(y) and size of orange as two parallel lines, one for each brand of extractor.

c. Modify the model of part b to permit the slopes of the two lines to differ.

d. Sketch typical response lines for the model of part b. Do the same for the model of part c. Carefully label your graphs.

e. Specify the null and alternative hypotheses you would use to determine whether the model in part c provides more information for predicting yield than does the model in part b.

f. Explain how you would obtain the quantities necessary to compute the F-statistic that would be used in testing the hypotheses you described in part e.

Short Answer

Expert verified

Answer

a. The variable type of orange juice extractor (brand A or brand B) is a qualitative variable and the variable size of an orange is a quantitative variable.

b. A model equation to describe the relationship between E(y) and size of orange and types of orange juice extractor used can be written asEy=β0+β1x1+β2x2, wherex1=sizeoforangeandx2=typeoforangejuiceextractorused.

c. A model equation to describe the relationship between E(y) and size of orange and types of orange juice extractor used the slopes are different where can be written asEy=β0+β1x1+β2x2+β3x3, wherex1=sizeoforangeandx2=typeoforangejuiceextractorused.

d. Graph

e. To check whether the models described in part b and part c are able to explain the variation in the data, F-test or goodness of fit test is conducted. Mathematically,H0:β1=β2=β3=0whileHa:At least one of the parametersβ1,β2,andβ3is nonzero.

f. To do a goodness of fit test is conducted to check the overall goodness of the model where F-statistic=R2/k1-R2/n-k+1, Where, SSE=yi-y^2, N=no.ofobservationsand K=no.ofvariablesusedinthemodelincludingβ0.

Step by step solution

01

Variable identification

The variable type of orange juice extractor (brand A or brand B) is a qualitative variable and the variable size of orange is a quantitative variable.

02

Model equation 

A model equation to describe the relationship between E(y) and size of orange and types of orange juice extractor used can be written as, Ey=β0+β1x1+β2x2 where x1=sizeoforange and x2=typeoforangejuiceextractorused.

03

Model equation with interaction terms

A model equation to describe the relationship between E(y) and size of orange and types of orange juice extractor used the slopes are different where can be written as Ey=β0+β1x1+β2x2+β3x3, where x1=sizeoforangeand x2=typeoforangejuiceextractorused.

04

Model graph

A model graph without interaction can be drawn as two straight parallel lines where one line represents the relation between y and x1 and the other line represents the relation between y and x2. Since there is no relation amongst the variables, the lines do not interact.

A model graph with interaction can be drawn as two straight lines with interaction where one line represents the relation between y and x1 and the other line represents the relation between y and x2.

05

Hypotheses

To check whether the models described in part b and part c are able to explain the variation in the data, F-test or goodness of fit test is conducted

H0:β1=β2=β3=0 whileHa: At least one of the parameters β1,β2and β3is nonzero.

06

F-statistic

To do goodness of fit testis conducted to check the overall goodness of the model.

F-statistic=R2/k1-R2/n-k+1

Where,SSE=yi-y^2

.N=no.ofobservationsK=no.ofvariablesusedinthemodelincludingβ0

.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: Reality TV and cosmetic surgery. Refer to the Body Image: An International Journal of Research (March 2010) study of the impact of reality TV shows on one’s desire to undergo cosmetic surgery, Exercise 12.17 (p. 725). Recall that psychologists used multiple regression to model desire to have cosmetic surgery (y) as a function of gender(x1) , self-esteem(x2) , body satisfaction(x3) , and impression of reality TV (x4). The SPSS printout below shows a confidence interval for E(y) for each of the first five students in the study.

  1. Interpret the confidence interval for E(y) for student 1.
  2. Interpret the confidence interval for E(y) for student 4

Question: Shared leadership in airplane crews. Refer to the Human Factors (March 2014) study of shared leadership by the cockpit and cabin crews of a commercial airplane, Exercise 8.14 (p. 466). Recall that simulated flights were taken by 84 six-person crews, where each crew consisted of a 2-person cockpit (captain and first officer) and a 4-person cabin team (three flight attendants and a purser.) During the simulation, smoke appeared in the cabin and the reactions of the crew were monitored for teamwork. One key variable in the study was the team goal attainment score, measured on a 0 to 60-point scale. Multiple regression analysis was used to model team goal attainment (y) as a function of the independent variables job experience of purser (x1), job experience of head flight attendant (x2), gender of purser (x3), gender of head flight attendant (x4), leadership score of purser (x5), and leadership score of head flight attendant (x6).

a. Write a complete, first-order model for E(y) as a function of the six independent variables.

b. Consider a test of whether the leadership score of either the purser or the head flight attendant (or both) is statistically useful for predicting team goal attainment. Give the null and alternative hypotheses as well as the reduced model for this test.

c. The two models were fit to the data for the n = 60 successful cabin crews with the following results: R2 = .02 for reduced model, R2 = .25 for complete model. On the basis of this information only, give your opinion regarding the null hypothesis for successful cabin crews.

d. The p-value of the subset F-test for comparing the two models for successful cabin crews was reported in the article as p 6 .05. Formally test the null hypothesis using α = .05. What do you conclude?

e. The two models were also fit to the data for the n = 24 unsuccessful cabin crews with the following results: R2 = .14 for reduced model, R2 = .15 for complete model. On the basis of this information only, give your opinion regarding the null hypothesis for unsuccessful cabin crews.

f. The p-value of the subset F-test for comparing the two models for unsuccessful cabin crews was reported in the article as p < .10. Formally test the null hypothesis using α = .05. What do you conclude?

Question: Suppose the mean value E(y) of a response y is related to the quantitative independent variables x1and x2

E(y)=2+x1-3x2-x1x2

a. Identify and interpret the slope forx2.

b. Plot the linear relationship between E(y) andx2forx1=0,1,2, where.

c. How would you interpret the estimated slopes?

d. Use the lines you plotted in part b to determine the changes in E(y) for each x1=0,1,2.

e. Use your graph from part b to determine how much E(y) changes when3x15and1x23.

Production technologies, terroir, and quality of Bordeaux wine. In addition to state-of-the-art technologies, the production of quality wine is strongly influenced by the natural endowments of the grape-growing region—called the “terroir.” The Economic Journal (May 2008) published an empirical study of the factors that yield a quality Bordeaux wine. A quantitative measure of wine quality (y) was modeled as a function of several qualitative independent variables, including grape-picking method (manual or automated), soil type (clay, gravel, or sand), and slope orientation (east, south, west, southeast, or southwest).

  1. Create the appropriate dummy variables for each of the qualitative independent variables.
  2. Write a model for wine quality (y) as a function of grape-picking method. Interpret theβ’s in the model.
  3. Write a model for wine quality (y) as a function of soil type. Interpret theβ’s in the model.
  4. Write a model for wine quality (y) as a function of slope orientation. Interpret theβ’s in the model.

Question:If the analysis of variance F-test leads to the conclusion that at least one of the model parameters is nonzero, can you conclude that the model is the best predictor for the dependent variable ? Can you conclude that all of the terms in the model are important for predicting ? What is the appropriate conclusion?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free