Question: Revenues of popular movies. The Internet Movie Database (www.imdb.com) monitors the gross revenues for all major motion pictures. The table on the next page gives both the domestic (United States and Canada) and international gross revenues for a sample of 25 popular movies.

  1. Write a first-order model for foreign gross revenues (y) as a function of domestic gross revenues (x).
  2. Write a second-order model for international gross revenues y as a function of domestic gross revenues x.
  3. Construct a scatterplot for these data. Which of the models from parts a and b appears to be the better choice for explaining the variation in foreign gross revenues?
  4. Fit the model of part b to the data and investigate its usefulness. Is there evidence of a curvilinear relationship between international and domestic gross revenues? Try usingα=0.05.
  5. Based on your analysis in part d, which of the models from parts a and b better explains the variation in international gross revenues? Compare your answer with your preliminary conclusion from part c.

Short Answer

Expert verified

Answer:

  1. The first-order model equation for foreign gross revenues (y) as a function of domestic gross revenues (x) can be written asy=β0+β1x+ε.
  2. The second-order model equation for international gross revenues as a function of domestic gross revenue (x) can be written asy=β0+β1x+β1x2+ε.
  3. Looking at the graph, it is visible that there is a curvilinear relation between foreign gross revenues (y) and domestic gross revenues (x). Hence, it would be appropriate to fit a second-order model equation to the data.
  4. The model equation here becomesy=-322.95+2.8874x-0.000988x2and at 95% confidence level,β2=0this means that the parabola doesn’t have a curvature and it essentially is a straight line.
  5. From part d, it is concluded that the curvilinear relation between x and y does not exist. Hence, the model the part a where a first-order model equation is fitted to the data is better to explain the variation in the international gross revenue. In part c, it was concluded from the graph that I curvilinear relationship between x and y might exist however, a linear relationship is a better option.

Step by step solution

01

First-order model equation

The first-order model equation for foreign gross revenues (y) as a function of domestic gross revenues (x) can be written asy=β0+β1x+ε .

02

Second-order model equation

The second-order model equation for international gross revenues as a function of domestic gross revenue (x) can be written as y=β0+β1x+β1x2+ε.

03

Scatterplot and model fit for the data

Movie Title (year)

International Gross

Domestic Gross

x2

Star Wars VII (2015)

1122

936.4

876845

Avatar (2009)

2023.4

760.5

578360.3

Jurassic World (2015)

1018.1

652.2

425364.8

Titanic (1997)

1548.9

658.7

433885.7

The Dark Knight (2008)

469.5

533.3

284408.9

Pirates of the Caribbean (2006)

642.9

423.3

179182.9

E.T.(1982)

357.9

434.9

189138

Spider-Man (2002)

418

403.7

162973.7

Frozen (2013)

873.5

400.7

160560.5

Jurassic Park (1993)

643.1

395.7

156578.5

Furious 7 (2015)

1163

351

123201

Lion King (1994)

564.7

422.8

178759.8

Harry Potter and the Sorcerer's Stone (2001)

657.2

317.6

100869.8

Inception (2010)

540

292.6

85614.76

Sixth Sense (1999)

379.3

293.5

86142.25

The jungle book (2016)

491.2

292.6

85614.76

The Hangover (2009)

201.6

277.3

76895.29

Jaws (1975)

210.6

260

67600

Ghost (1990)

300

217.6

47349.76

Saving Private Ryan (1998)

263.2

216.1

46699.21

Gladiator (2000)

268.6

187.7

35231.29

Dances with wolves (1990)

240

184.2

33929.64

The Exorcist (1973)

153

204.6

41861.16

My Big Fat Greek Wedding (2002)

115.1

241.4

58273.96

Rocky IV (1985)

172.6

127.9

16358.41

Looking at the graph, it is visible that there is a curvilinear relation between foreign gross revenues (y) and domestic gross revenues (x). Hence, it would be appropriate to fit a second-order model equation to the data.

04

Model equation and significance of β2

The excel output is attached here

SUMMARY OUTPUT

















Regression Statistics








Multiple R

0.802142835








R Square

0.643433127








Adjusted R Square

0.611017957








Standard Error

293.541447








Observations

25

















ANOVA









df

SS

MS

F

Significance F




Regression

2

3420771

1710385

19.84975

1.18E-05




Residual

22

1895665

86166.58






Total

24

5316436













Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept

-322.9500969

285.3873

-1.13162

0.269979

-914.807

268.907

-914.807

268.907

Domestic Gross

2.887488271

1.321995

2.184189

0.039893

0.145838

5.629139

0.145838

5.629139


-0.000988686

0.00129

-0.76639

0.451591

-0.00366

0.001687

-0.00366

0.001687

The model equation here becomesy=-322.95+2.8874x-0.000988x2

To check for the curvilinear relationship between variable x and y

H0:β2=0Ha:β20

Here, t-test statistic=β2^sβ2=-0.0009880.00129=-0.7658

Value oft0.05,385is 2.069

is rejected if t statistic > . Forα=0.05, since t <t0.05,385

Not sufficient evidence to reject at 95% confidence interval.

Therefore, β2=0

This means that the parabola doesn’t have a curvature and it essentially is a straight line.

05

Interpretation of variations in international gross revenue

From part d, it is concluded that the curvilinear relation between x and y does not exist. Hence, the model the part a where a first-order model equation is fitted to the data is better to explain the variation in the international gross revenue. In part c, it was concluded from the graph that I curvilinear relationship between x and y might exist however, a linear relationship is a better option.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: Suppose you fit the first-order multiple regression model y=β0+β1x1+β2x2+ε to n=25 data points and obtain the prediction equationy^=6.4+3.1x1+0.92x2 . The estimated standard deviations of the sampling distributions of β1 and β2are 2.3 and .27, respectively

Commercial refrigeration systems. The role of maintenance in energy saving in commercial refrigeration was the topic of an article in the Journal of Quality in Maintenance Engineering (Vol. 18, 2012). The authors provided the following illustration of data relating the efficiency (relative performance) of a refrigeration system to the fraction of total charges for cooling the system required for optimal performance. Based on the data shown in the graph (next page), hypothesize an appropriate model for relative performance (y) as a function of fraction of charge (x). What is the hypothesized sign (positive or negative) of the β2parameter in the model?

Assertiveness and leadership. Management professors at Columbia University examined the relationship between assertiveness and leadership (Journal of Personality and Social Psychology, February 2007). The sample represented 388 people enrolled in a full-time MBA program. Based on answers to a questionnaire, the researchers measured two variables for each subject: assertiveness score (x) and leadership ability score (y). A quadratic regression model was fit to the data, with the following results:

a. Conduct a test of overall model utility. Useα=0.05 .

b. The researchers hypothesized that leadership ability increases at a decreasing rate with assertiveness. Set up the null and alternative hypotheses to test this theory.

  1. Use the reported results to conduct the test, part b. Give your conclusion(atα=0.05 )in the words of the problem.

Consider a multiple regression model for a response y, with one quantitative independent variable x1 and one qualitative variable at three levels.

a. Write a first-order model that relates the mean response E(y) to the quantitative independent variable.

b. Add the main effect terms for the qualitative independent variable to the model of part a. Specify the coding scheme you use.

c. Add terms to the model of part b to allow for interaction between the quantitative and qualitative independent variables.

d. Under what circumstances will the response lines of the model in part c be parallel?

e. Under what circumstances will the model in part c have only one response line?

Suppose you used Minitab to fit the model y=β0+β1x1+β2x2+ε

to n = 15 data points and obtained the printout shown below.

  1. What is the least squares prediction equation?

  2. Find R2and interpret its value.

  3. Is there sufficient evidence to indicate that the model is useful for predicting y? Conduct an F-test using α = .05.

  4. Test the null hypothesis H0: β1= 0 against the alternative hypothesis Ha: β1≠ 0. Test using α = .05. Draw the appropriate conclusions.

  5. Find the standard deviation of the regression model and interpret it.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free