Question: Forecasting a job applicant’s merit rating. A large research and development firm rates the performance of each member of its technical staff on a scale of 0 to 100, and this merit rating is used to determine the size of the person’s pay raise for the coming year. The firm’s personnel department is interested in developing a regression model to help forecast the merit rating that an applicant for a technical position will receive after being employed for 3 years. The firm proposes to use the following second-order model to forecast the merit ratings of applicants who have just completed their graduate studies and have no prior related job experience:

E(y)=β0+β1x1+β2x2+β3x1x2+β4x12+β5x22, where

y = Applicant’s merit rating after 3 years

x1= Applicant’s GPA in graduate school

x2= Applicant’s total score (verbal plus quantitative) on the Graduate Record Examination (GRE)

The model, fit to data collected for a random sample of n=40employees, resulted inSSE=1830.44andSSmodel=4911.5. The reduced modelE(y)=β0+β1x1+β2x2 is also fit to the same data, resulting inSSE=3197.16.

  1. Identify the appropriate null and alternative hypotheses to test whether the complete (second-order) model contributes information for predicting y.
  2. Conduct the test of the hypothesis given in part a. Test using. Interpret the results in the context of this problem.
  3. Identify the appropriate null and alternative hypotheses to test whether the complete model contributes more information than the reduced (first-order) model for predicting y.
  4. Conduct the test of the hypothesis given in part c. Test using. Interpret the results in the context of this problem.
  5. Which model, if either, would you use to predict y? Explain.

Short Answer

Expert verified

Answer

  1. The null and alternate hypothesis for testing whether the second order model contributes information for the prediction of y can be written as
    H0:β4=β5=0while At least one of the parameters β3,β4orβ5or is non-zero.
  2. At a 95% significance levelβ4β50 , the second-order model doesn’t contribute much to the model.
  3. The null and alternate hypotheses for testing whether the complete model contributes more information than the reduced model for the prediction of y can be written asH0:β3=β4=β5=0 while

At least one of the parametersβ3,β4orβ5.

4. At the 95% significance level, at least one of the parameters β3,β4orβ5is non-zero.

5. Since the hypothesis testing concludes the complete model is a better fit for the data, the complete model would be more useful for predicting y.

Step by step solution

01

Given information 

The second order model used by the firm is given as Ey=β0+β1x1+β2x2+β3x1x2+β4x12+β5x22,there are 40 observations and the,SSE=1830.44,SSmodel=4911.5 .

The reduced model is given where and the test needs to be conducted .

02

(a) Hypothesis statement

The Research and development firm is doing a regression analysis to find if certain variables affect their employees’ merit scores or not. The firm first fits a second-order equation and then fits a reduced equation with only one variable.

Here, the null and alternate hypotheses for testing whether the second order model contributes information for the prediction of y can be written asH0:β4=β5=0while

At least one of the parametersβ4orβ5 is non-zero.

03

(b) Hypothesis testing

H0:β4=β5=0

Ha :At least one of the parametersβ4orβ5 is non-zero.

Here,

F-teststatistic=SSEn-k+1=1830.4440-6=53.836

Value of F0.05,39,39is 1.509.

H0 :is rejected if F-statistic>F0.05,39,39.

There is sufficient evidence to reject H0at 95% confidence interval.

Therefore, β4β50meaning that the second order model contributes much to the model.

04

(c) Hypothesis statement

The null and alternate hypothesis for testing whether the complete model contributes more information than the reduced model for the prediction of y can be written asH0:β3=β4=β5=0
while At least one of the parameters β3,β4orβ5is non-zero.

05

(d) Hypothesis testing

H0:β3=β4=β5=0

Ha:At least one of the parameters β3,β4orβ5is non-zero.

Teststatistic=SSER-SSECk-gSSECn-k+1=3197.16-1830.443-11830.4440-3+1=13.4401

Forα=.05,F-Test statistic = 13.4401.

The numerator degree of freedom is k – g =2, and the denominator degree of freedom is n – (k+1) = 36.

We know that it is rejected if

The critical value of F0.05 = 3.37, since F-statistic>F0.05, therefore, at95%confidence interval we reject H0.

Therefore, at least one of the parametersβ3,β4orβ5 is non-zero.

06

(e) Interpretation

The results of the hypothesis testing show that the complete model fits the data better. It would be more useful to predict y.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: The Excel printout below resulted from fitting the following model to n = 15 data points: y=β0+β1x1+β2x2+ε

Where,

x1=(1iflevel20ifnot)x2=(1iflevel30ifnot)

Suppose you fit the quadratic model E(y)=β0+β1x+β2x2to a set of n = 20 data points and found R2=0.91, SSyy=29.94, and SSE = 2.63.

a. Is there sufficient evidence to indicate that the model contributes information for predicting y? Test using a = .05.

b. What null and alternative hypotheses would you test to determine whether upward curvature exists?

c. What null and alternative hypotheses would you test to determine whether downward curvature exists?

Question: Women in top management. Refer to the Journal of Organizational Culture, Communications and Conflict (July 2007) study on women in upper management positions at U.S. firms, Exercise 11.73 (p. 679). Monthly data (n = 252 months) were collected for several variables in an attempt to model the number of females in managerial positions (y). The independent variables included the number of females with a college degree (x1), the number of female high school graduates with no college degree (x2), the number of males in managerial positions (x3), the number of males with a college degree (x4), and the number of male high school graduates with no college degree (x5). The correlations provided in Exercise 11.67 are given in each part. Determine which of the correlations results in a potential multicollinearity problem for the regression analysis.

  1. The correlation relating number of females in managerial positions and number of females with a college degree: r =0.983.

  2. The correlation relating number of females in managerial positions and number of female high school graduates with no college degree: r =0.074.

  3. The correlation relating number of males in managerial positions and number of males with a college degree: r =0.722.

  4. The correlation relating number of males in managerial positions and number of male high school graduates with no college degree: r =0.528.

Question:Suppose you fit the first-order model y=β0+β1x1+β2x2+β3x3+β4x4+β5x5+εto n=30 data points and obtain SSE = 0.33 and R2=0.92

(A) Do the values of SSE and R2suggest that the model provides a good fit to the data? Explain.

(B) Is the model of any use in predicting Y ? Test the null hypothesis H0:β1=β2=β3=β4=β5=0 against the alternative hypothesis {H}at least one of the parameters β1,β2,...,β5 is non zero.Useα=0.05 .

Question: Tipping behaviour in restaurants. Can food servers increase their tips by complimenting the customers they are waiting on? To answer this question, researchers collected data on the customer tipping behaviour for a sample of 348 dining parties and reported their findings in the Journal of Applied Social Psychology (Vol. 40, 2010). Tip size (y, measured as a percentage of the total food bill) was modelled as a function of size of the dining party(x1)and whether or not the server complimented the customers’ choice of menu items (x2). One theory states that the effect of the size of the dining party on tip size is independent of whether or not the server compliments the customers’ menu choices. A second theory hypothesizes that the effect of size of the dining party on tip size is greater when the server compliments the customers’ menu choices as opposed to when the server refrains from complimenting menu choices.

a. Write a model for E(y) as a function of x1 and x2 that corresponds to Theory 1.

b. Write a model for E(y) as a function of x1and x2that corresponds to Theory 2.

c. The researchers summarized the results of their analysis with the following graph. Based on the graph, which of the two models would you expect to fit the data better? Explain.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free