Win 4 Lottery Shown below is a histogram of digits selected in California’s Win 4 lottery. Each drawing involves the random selection (with replacement) of four digits between 0 and 9 inclusive.

a. What is fundamentally wrong with the graph?

b. Does the display depict a normal distribution? Why or why not? What should be the shape of the histogram?

c. Identify the frequencies, then test the claim that the digits are selected from a population in which the digits are all equally likely. Is there a problem with the lottery?

Short Answer

Expert verified

a. The vertical axis does not start from 0, which makes the graph deceptive.

b. The graph is not normally distributed, and the shape is expected to be rectangular.

c. The frequencies are tabulated below. Also, the test proves that the digits are taken from a population where each digit is equally likely.

Digits

Frequency

0

19

1

21

2

22

3

21

4

18

5

23

6

16

7

16

8

22

9

22

No, there is no problem with the lottery as the distribution of digits is even.

Step by step solution

01

Given information

The provided histogram shows the digits drawn in California’s win fourlotteries.

02

Identify the flaw in the graph

a.

A deceptive graph is a visual representation of data thatcreates a disparity from the true set of observations.

The graph does not start from 0, but from 15.

The resultant histogram creates a deception that there is a large disparity between the counts of the categories.

Therefore, it can be concluded that the scaling on the vertical axis is wrong in the graph.

03

Analyze the shape of the histogram

b.

The shape of the histogram shows the pattern followed by the observations in the data. The shape can be indicative of the appropriate distribution that underlies the observations.

For example:

  • If the shape is almost bell-like,the underlying distribution is normal for the observations.
  • If the shape is constant, forming a rectangle parallel to the x-axis,the underlying distribution is uniformfor the observations.

As per the description, the shape of the histogram is not bell-like.Thus, the distribution is non-normal distribution.

As the distribution of digits from 0 to 9 is expected to be constant (same probability for each digit), the shape of the histogram is expected to be a flat curve (rectangular shape of the same height).

04

Determine the frequencies

c.

The frequencies are equivalent to the heights of the columns corresponding to each digit.

Observing the histogram, you get the frequencies as listed below:

Digits

Frequency

0

19

1

21

2

22

3

21

4

18

5

23

6

16

7

16

8

22

9

22

Sum

200

05

Test for the fit of the values

The hypothesis to test the claim that that the digits are selected from a population in which the digits are all equally likely are given below:

\(\begin{array}{l}{H_o}:{p_0} = {p_1} = {p_2} = ... = {p_9}\\{H_a}:{\rm{at}}\;{\rm{least}}\;{\rm{one}}\;{\rm{proportion}}\;{\rm{is}}\;{\rm{different}}\end{array}\)


Here,\({p_i}\)is the proportion of frequencies for the i-th digit.

Assume the significance level is 0.05.

The test statistic is

\({\chi ^2} = \sum {\frac{{{{\left( {O - E} \right)}^2}}}{E}} \).

Here, O is the observed frequency and E is the expected frequency.

The sum of frequencies is 200.

For uniform distribution, the expected frequencies are the same in number at\(\frac{{200}}{{10}} = 20\).

Digits

O

E

\(\frac{{{{\left( {O - E} \right)}^2}}}{E}\)

0

19

20

0.05

1

21

20

0.05

2

22

20

0.2

3

21

20

0.05

4

18

20

0.2

5

23

20

0.45

6

16

20

0.8

7

16

20

0.8

8

22

20

0.2

9

22

20

0.2

Total

3.000

Thus, the test statistic is\({\chi ^2} = 3.000\).

The degree of freedom is calculated below:

\(\begin{array}{c}df = n - 1\\ = 10 - 1\\ = 9\end{array}\)

The p-value is computed using the chi-square table as shown below:

\(\begin{array}{c}p - value = P\left( {{\chi ^2} > 3.000} \right)\\ = 0.965\end{array}\)

As the p-value exceeds the significance measure of 0.05, the null hypothesis fails to be rejected.

Thus, there is enough evidence to prove that the digits have the same frequencies and the distribution of digits is uniform.

There appears to be no problem with the lottery as there is enough evidence to prove that there are even chances of getting any digit since they are uniformly distributed.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

In Exercises 5–8, use the following two control charts that result from testing batches of newly manufactured aircraft altimeters, with 100 in each batch. The original sample values are errors (in feet) obtained when the altimeters are tested in a pressure chamber that simulates an altitude of 6000 ft. The Federal Aviation Administration requires an error of no more than 40 ft at that altitude.

Is the process variation within statistical control? Why or why not?

In Exercises 5–16, use analysis of variance for the indicated test.

Triathlon Times Jeff Parent is a statistics instructor who participates in triathlons. Listed below are times (in minutes and seconds) he recorded while riding a bicycle for five stages through each mile of a 3-mile loop. Use a 0.05 significance level to test the claim that it takes the same time to ride each of the miles. Does one of the miles appear to have a hill?

Mile 1

3:15

3:24

3:23

3:22

3:21

Mile 2

3:19

3:22

3:21

3:17

3:19

Mile 3

3:34

3:31

3:29

3:31

3:29

Pulse Rates If we use the data given in Exercise 1 with two-way analysis of variance, we get the accompanying display. What do you conclude?

In Exercises 1–4, use the following listed arrival delay times (minutes) for American Airline flights from New York to Los Angeles. Negative values correspond to flights that arrived early. Also shown are the SPSS results for analysis of variance. Assume that we plan to use a 0.05 significance level to test the claim that the different flights have the same mean arrival delay time.

Flight 1

-32

-25

-26

-6

5

-15

-17

-36

Flight 19

-5

-32

-13

-9

-19

49

-30

-23

Flight 21

-23

28

103

-19

-5

-46

13

-3

ANOVA

a. What characteristic of the data above indicates that we should use one-way analysis of variance?

b. If the objective is to test the claim that the three flights have the same mean arrival delay time, why is the method referred to as analysis of variance?

Lead and Full IQ Scores Example 1 used measured performance IQ scores for three different blood lead levels. If we use the same three categories of blood lead levels with the fullIQ scores, we get the accompanying Excel display. (The data are listed in Data Set 7 “IQ and Lead” in Appendix B.) Using a 0.05 significance level, test the claim that the three categories of blood lead level have the same mean full IQ score. Does it appear that exposure to lead has an effect on full IQ scores?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free