In his book Outliers,author Malcolm Gladwell argues that more

American-born baseball players have birth dates in the months immediately following July 31 because that was the age cutoff date for nonschool baseball leagues. The table below lists months of births for a sample of American-born baseball players and foreign-born baseball players. Using a 0.05 significance level, is there sufficient evidence to warrant rejection of the claim that months of births of baseball players are independent of whether they are born in America? Do the data appear to support Gladwell’s claim?


Born in America

Foreign Born

Jan.

387

101

Feb.

329

82

March

366

85

April

344

82

May

336

94

June

313

83

July

313

59

Aug.

503

91

Sept.

421

70

Oct.

434

100

Nov.

398

103

Dec.

371

82

Short Answer

Expert verified

There is sufficient evidence to warrant rejection of the claim that months of births of baseball players are independent of whether they are born in America.

Step by step solution

01

Given information

The data for the months of births for a sample of American-born baseball players and foreign-born baseball players is provided.

The level of significance is 0.05.

02

Compute the expected frequencies

Theexpected frequencyis computed as,

\(E = \frac{{\left( {{\rm{row}}\;{\rm{total}}} \right)\left( {{\rm{column}}\;{\rm{total}}} \right)}}{{\left( {{\rm{grand}}\;{\rm{total}}} \right)}}\)

The table with row and column total is represented as,


Born in America

Foreign Born

Column total

Jan.

387

101

488

Feb.

329

82

411

March

366

85

451

April

344

82

426

May

336

94

430

June

313

83

396

July

313

59

372

Aug.

503

91

594

Sept.

421

70

491

Oct.

434

100

534

Nov.

398

103

501

Dec.

371

82

453

Row Total

4515

1032

5547

Theexpected frequency tableis represented as,


Born in America

Foreign Born

Jan.

397.2093

90.7907

Feb.

334.5349

76.46512

March

367.093

83.90698

April

346.7442

79.25581

May

350

80

June

322.3256

73.67442

July

302.7907

69.2093

Aug.

483.4884

110.5116

Sept.

399.6512

91.3488

Oct.

434.6512

99.3488

Nov.

407.7907

93.2093

Dec.

368.7209

84.27907

The expected value is larger than 5.

Assuming that the subjects are randomly selected, the requirements of the test are satisfied.

03

State the null and alternate hypothesis

The hypotheses are stated below,

\({H_0}:\)The months of births of baseball players are independent of whether they were born in America.

\({H_1}:\)The months of births of baseball players are dependent on whether they are born in America.

04

Compute the test statistic

The value of the test statistic is computed as,

\[\begin{aligned}{c}{\chi ^2} = \sum {\frac{{{{\left( {O - E} \right)}^2}}}{E}} \\ = \frac{{{{\left( {387 - 397.2093} \right)}^2}}}{{397.2093}} + \frac{{{{\left( {329 - 334.5349} \right)}^2}}}{{334.5349}} + ... + \frac{{{{\left( {82 - 84.2790} \right)}^2}}}{{84.2790}}\\ = 20.0539\\ \approx 20.054\end{aligned}\]

Therefore, the value of the test statistic is 20.054.

05

Compute the degrees of freedom

The degrees of freedom are computed as,

\(\begin{aligned}{c}\left( {r - 1} \right)\left( {c - 1} \right) = \left( {2 - 1} \right)\left( {12 - 1} \right)\\ = 11\end{aligned}\)

Therefore, the degrees of freedom are 11.

06

Compute the critical value

From the chi-square table, the critical value corresponding to 11 degrees of freedom and at 0.05 level of significance is 19.675.

Therefore, the critical value is 19.675.

From the table, the p-value is obtained as 0.045.

07

State the decision

Since the critical value (19.675) is less than the value of the test statistic (20.054). In this case, the null hypothesis is rejected.

Therefore, the decision is to reject the null hypothesis.

08

State the conclusion

There isinsufficient evidence to favor the claim of the studythat months of births of baseball players are independent of whether they are born in America.

Thus, the players’ birth months are dependent on their places of birth (America or not).

Thus, the data support the claim of the author.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Benford’s Law. According to Benford’s law, a variety of different data sets include numbers with leading (first) digits that follow the distribution shown in the table below. In Exercises 21–24, test for goodness-of-fit with the distribution described by Benford’s law.

Leading Digits

Benford's Law: Distributuon of leading digits

1

30.10%

2

17.60%

3

12.50%

4

9.70%

5

7.90%

6

6.70%

7

5.80%

8

5.10%

9

4.60%

Author’s Computer Files The author recorded the leading digits of the sizes of the electronic document files for the current edition of this book. The leading digits have frequencies of 55, 25, 17, 24, 18, 12, 12, 3, and 4 (corresponding to the leading digits of 1, 2, 3, 4, 5, 6, 7, 8, and 9, respectively). Using a 0.05 significance level, test for goodness-of-fit with Benford’s law.

A study of people who refused to answer survey questions provided the randomly selected sample data shown in the table below (based on data from “I Hear You Knocking But You Can’t Come In,” by Fitzgerald and Fuller, Sociological Methods and Research,Vol. 11, No. 1). At the 0.01 significance level, test the claim that the cooperation of

the subject (response or refusal) is independent of the age category. Does any particular age group appear to be particularly uncooperative?

Age


18-21

22-29

30-39

40-49

50-59

60 and over

Responded

73

255

245

136

138

202

Refused

11

20

33

16

27

49

Motor Vehicle Fatalities The table below lists motor vehicle fatalities by day of the week for a recent year (based on data from the Insurance Institute for Highway Safety). Use a 0.01 significance level to test the claim that auto fatalities occur on the different days of the week with the same frequency. Provide an explanation for the results.

Day

Sun.

Mon.

Tues.

Wed.

Thurs.

Fri.

Sat.

Frequency

5304

4002

4082

4010

4268

5068

5985

Do World War II Bomb Hits Fit a Poisson Distribution? In analyzing hits by V-1 buzz bombs in World War II, South London was subdivided into regions, each with an area of 0.25\(k{m^2}\). Shown below is a table of actual frequencies of hits and the frequencies expected with the Poisson distribution. (The Poisson distribution is described in Section 5-3.) Use the values listed and a 0.05 significance level to test the claim that the actual frequencies fit a Poisson distribution. Does the result prove that the data conform to the Poisson distribution?

Number of Bomb Hits

0

1

2

3

4

Actual Number of Regions

229

211

93

35

8

Expected Number of Regions

(from Poisson Distribution)

227.5

211.4

97.9

30.5

8.7

In a clinical trial of the effectiveness of echinacea for preventing

colds, the results in the table below were obtained (based on data from “An Evaluation of Echinacea Angustifoliain Experimental Rhinovirus Infections,” by Turner et al., NewEngland Journal of Medicine,Vol. 353, No. 4). Use a 0.05 significance level to test the claim that getting a cold is independent of the treatment group. What do the results suggest about the

effectiveness of echinacea as a prevention against colds?

Treatment Group


Placebo

Echinacea:

20% Extract

Echinacea:

60% Extract

Got a Cold

88

48

42

Did Not Get a Cold

15

4

10

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free