Question: Comparing mosquito repellents. Which insect repellents protect best against mosquitoes? Consumer Reports (June 2000) tested 14 products that all claim to be an effective mosquito repellent. Each product was classified as either lotion/cream or aerosol/spray. The cost of the product (in dollars) was divided by the amount of the repellent needed to cover exposed areas of the skin (about 1>3 ounce) to obtain a cost-per-use value. Effectiveness was measured as the maximum number of hours of protection (in half-hour increments) provided when human testers exposed their arms to 200 mosquitoes. The data from the report are listed in the table.

  1. Suppose you want to use repellent type to model the cost per use (y). Create the appropriate number of dummy variables for repellent type and write the model.
  2. Fit the model, part a, to the data.
  3. Give the null hypothesis for testing whether repellent type is a useful predictor of cost per use (y).
  4. Conduct the test, part c, and give the appropriate conclusion. Use α = .10.
  5. Repeat parts a–d if the dependent variable is the maximum number of hours of protection (y).

Short Answer

Expert verified

Answer

  1. For qualitative regression model, variables are introduced in the model where, . Since in this question there are only 2 levels, 1 qualitative variable is introduced in the model. Where when repellent is lotion based, 0 otherwise.
  2. From the excel output, the regression equation becomesE(y)=0.7775+0.109167x1
  3. The null and alternate hypothesis for whether repellent type is a useful predictor of cost per use (y) can be written as H0:β1=0against.Ha:β10
  4. At 95%significance level,β3=0 meaning that is statistically significant for the model
  5. The regression equation for maximum hours of protection as a function of repellent type is.E(y)=7.5625-1.64583x1 At 95% significance levelβ3=0, meaning that β3is statistically significant for the model.

Step by step solution

01

Given Information and definition

It is given that y is the dependent variable (here, cost per use). The test to be conducted at α=.10.

Dummy variables indicate the presence or absence of the categorical effect which may shift the outcome. Dummy variable can take only the value 0 or 1.

02

Dummy variables 

a.

For qualitative regression model,(k-1) variables are introduced in the model where k = no. of levels. Since in this question there are only 2 levels, 1 qualitative variable x1 is introduced in the model where x1 = 1 when repellent lotion is based, 0 otherwise.

03

Regression model

b. From the excel output, the regression equation becomes E(y)=0.7775+0.109167x1.

For the anova table one need to calculate the mean of the independent variable and then calculate the SSR, SSE AND SST, after that one need to calculate the degrees of freedom and the mean squares and then F.

The SSR is calculated by usingnΣXj--x¯j..2, and the SSE is calculated by squaring the each term and adding them all. The SST is the sum of SSR and SSE. The MS regression is calculated by dividing SST by degrees of regression and similarly the MS residual is calculated by dividing SSE by degrees of residual and F is calculated by dividing MS regression by MS residual.

The coefficients of x is calculated by using this formula:nxy-xynx2-x2whereas the coefficient of intercept is calculated by yx2-xxynx2-x2.

The standard error is calculated by dividing the standard deviation by the sample size's square root.

04

Hypothesis statement

c.

The null and alternate hypothesis for whether repellent type is a useful predictor of cost per use (y) can be written asH0:β1=0 against Ha:β10.

05

Significance of β1 

d.

H0:β1=0Ha:β10

Here, t-test statistic

.t=β1Sβ1=0.1091670.454457=0.24021

Value oft0.05,13is 1.771.

H0 is rejected if t statistic>t0.05,13.

Forα=0.05,since<t0.05,13.

Not sufficient evidence to reject H0 at 95% confidence interval.

Therefore,β3=0meaning thatrole="math" localid="1660794217559" β3is statistically significant for the model.

06

Regression model when dependent variable is maximum number of hours of protection

e.

For the anova table one need to calculate the mean if the independent variable and then calculate the SSR, SSE AND SST, after that onee need to calculate the degrees of freedom and the mean squares and the F.

The SSR is calculated by usingnΣ(Xi-Xj), and the SSE is calculated by squaring the each term and adding them all. The SST is the sum of SSR and SSE. The MS regression is calculated by dividing SST by degrees of regression and similarly the MS residual is calculated by dividing SSE by degrees of residual and F is calculated by dividing MS regression by MS residual.

The coefficients of x is calculated by using this formula: nxy-xynx2-x2 whereas the coefficient of intercept is calculated by yx2-xxynx2-x2.

The standard error is calculated bydividing the standard deviation by the sample size's square root.

The R-squared is calculated asR2=SSyy--SSESSyy.

The excel output is

The regression equation for maximum hours of protection as a function of repellent type isE(y)=7.5625-1.64583x1

To test the significance ofβ1

H0:β1=0Ha:β10

Here, t-test statistic

t=β1Sβ1=-1.645833.573625=-0.46054

Value of t0.05,13is 1.771

H0 is rejected if t statistic >t0.05,13

For,since>t0.05,13.

Not sufficient evidence to reject H0 at95%confidence interval.

Therefore,β3=0meaning thatβ3 is statistically significant for the model.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: Shopping on Black Friday. Refer to the International Journal of Retail and Distribution Management (Vol. 39, 2011) study of shopping on Black Friday (the day after Thanksgiving), Exercise 6.16 (p. 340). Recall that researchers conducted interviews with a sample of 38 women shopping on Black Friday to gauge their shopping habits. Two of the variables measured for each shopper were age (x) and number of years shopping on Black Friday (y). Data on these two variables for the 38 shoppers are listed in the accompanying table.

  1. Fit the quadratic model, E(y)=β0+β1x+β2x2, to the data using statistical software. Give the prediction equation.
  2. Conduct a test of the overall adequacy of the model. Use α=0.01.
  3. Conduct a test to determine if the relationship between age (x) and number of years shopping on Black Friday (y) is best represented by a linear or quadratic function. Use α=0.01.

Commercial refrigeration systems. The role of maintenance in energy saving in commercial refrigeration was the topic of an article in the Journal of Quality in Maintenance Engineering (Vol. 18, 2012). The authors provided the following illustration of data relating the efficiency (relative performance) of a refrigeration system to the fraction of total charges for cooling the system required for optimal performance. Based on the data shown in the graph (next page), hypothesize an appropriate model for relative performance (y) as a function of fraction of charge (x). What is the hypothesized sign (positive or negative) of the β2parameter in the model?

State casket sales restrictions. Some states permit only licensed firms to sell funeral goods (e.g., caskets, urns) to the consumer, while other states have no restrictions. States with casket sales restrictions are being challenged in court to lift these monopolistic restrictions. A paper in the Journal of Law and Economics (February 2008) used multiple regression to investigate the impact of lifting casket sales restrictions on the cost of a funeral. Data collected for a sample of 1,437 funerals were used to fit the model. A simpler version of the model estimated by the researchers is E(y)=β0+β1x1+β2x2+β3x1x2, where y is the price (in dollars) of a direct burial, x1 = {1 if funeral home is in a restricted state, 0 if not}, and x2 = {1 if price includes a basic wooden casket, 0 if no casket}. The estimated equation (with standard errors in parentheses) is:

y^=1432 + 793x1- 252x2+ 261x1x2, R2= 0.78

(70) (134) (109)

  1. Calculate the predicted price of a direct burial with a basic wooden casket at a funeral home in a restricted state.

  2. The data include a direct burial funeral with a basic wooden casket at a funeral home in a restricted state that costs \(2,200. Assuming the standard deviation of the model is \)50, is this data value an outlier?

  3. The data also include a direct burial funeral with a basic wooden casket at a funeral home in a restricted state that costs \(2,500. Again, assume that the standard deviation of the model is \)50. Is this data value an outlier?

Question: The Excel printout below resulted from fitting the following model to n = 15 data points: y=β0+β1x1+β2x2+ε

Where,

x1=(1iflevel20ifnot)x2=(1iflevel30ifnot)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free