Box plots and the standard normal distribution. What relationship exists between the standard normal distribution and the box-plot methodology (Section 2.8) for describing distributions of data using quartiles? The answer depends on the true underlying probability distribution of the data. Assume for the remainder of this exercise that the distribution is normal.

a. Calculate the values of the standard normal random variable z, call them zL and zU, that correspond to the hinges of the box plot—that is, the lower and upper quartiles, QL and QU—of the probability distribution.

b. Calculate the zvalues that correspond to the inner fences of the box plot for a normal probability distribution.

c. Calculate the zvalues that correspond to the outer fences of the box plot for a normal probability distribution.

d. What is the probability that observation lies beyond the inner fences of a normal probability distribution? The outer fences?

e. Can you better understand why the inner and outer fences of a box plot are used to detect outliers in a distribution? Explain.

Short Answer

Expert verified

a. The lower and upper quartile is -0.67449 and 0.67449

b. The zvalues that correspond to the inner fences of the box plot are -2.697959 and 2.697959

c. The zvalues that correspond to the outer fences of the box plot are -4.72143 and 4.72143

d. The probability of an observation falling outside of inner fences is 0.006977 and outer fences is 0

e. The probability is very low for an observation to fall outside of these fences

Step by step solution

01

Given information

The given distribution is a normal distribution

02

 Calculating the lower and upper quantile

a.

The lower quartile is 25th percentile

Let zL be the standard normal random variable that corresponds to QL

i.e,Pz<zL=0.25ΦzL=0.25zL=Φ-10.25zL=-0.67449

So the lower quantile is -0.67449

The upper quantile is 75th percentile

Let zQ be the standard normal random variable corresponds to QU

i.e,Pz<zU=0.75ΦzU=0.75zU=Φ-10.75zU=0.67449

So the upper quartile is 0.67449

03

 Calculating the inner fences of the box

b.IQR=QU-QL=0.67449--0.67449=1.34898ThelowerinnerfenceisLIF=QL-1.5xIQRLIF=QL-1.5xIQR=QL-1.5xQU-QL=-0.67449-1.5x1.34898=-2.697959TheupperinnerfenceisUIF=QU+1.5xIQRUIF=QU+1.5xIQR=QU+1.5QU-QL=0.67449+1.5x1.34898=2.697959Sothelowerinnerfenceis-2.697959andtheupperinnerfenceis2.697959

04

 Calculating the outer fences of the box

C.IQR=QU-QL=0.67449--0.67449=1.34898

The lower outer fence is LOF=Q1-3xIQR

LOF=QL-3xIQR=QL-3xQU-QL=-0.67449-3x1.34898=-4.72143

The upper outer fence isUOF=QU+3xIQR

UOF=QL+3xIQR=QU+3xQU-QL=0.67449+3x1.34898=4.72143

So the lower outer fence is -4.72143 and upper outer fence is 4.72143

05

 Calculating the probabilities

d.

The probability that observation lies beyond the inner fences of a normal probability distribution is,

I.E,Pz<-2.697959+Pz<-2.697959=1-Pz<-2.697959+1-z<-2.697959=2-2Pz<-2.697959=2-2Φ2.697959=2-2X0.996512=0.006977

So, the probability is 0.006977

The probability that an observation lies beyond the outer fences of a normal probability distribution is,

I.E,Pz<-2.697959+Pz>4.72143=1-Pz<4.72143+1-z<4.72143=2-2Pz<4.72143=2-2Φ4.72143=2-2X0.999999𝆏0

So the probability is 0

06

Explanation

The inner and outer fences of box plot are used to detect outliers in a distribution in the following ways:

Values that are beyond the inner fences are deemed potential outliers because they are extreme values that represent relatively rare occurrences. In fact, for a normal probability distribution, less than 1% of the observations are expected to fall outside of inner fences.

Measurements that fall beyond the outer fences are very extreme measurements that require special analysis. Since less than one-hundredth of 1% (0.1% or 0.001) of the measurements from a normal distribution are expected to fall beyond the outer fences, these measurements are considered to be outliers.

From part(d) we clearly understand why the inner and outer fences of the box plot are used to detect outliers in a distribution.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Suppose x is a normally distributed random variable with μ= 11 and σ= 2. Find each of the following:

a)P(10χ12)

b) P(6χ10)

c)P(13χ16)

d)P(7.8χ12.6)

e)P(χ13.24)

f)P(χ7.62)


Do social robots walk or roll? Refer to the International Conference on Social Robotics (Vol. 6414, 2010) study of the trend in the design of social robots, Exercise 2.3 (p. 72). Recall that in a random sample of 106 social (or service) robots designed to entertain, educate, and care for human users, 63 were built with legs only, 20 with wheels only, 8 with both legs and wheels, and 15 with neither legs nor wheels. Assume the following: Of the 63 robots with legs only, 50 have two legs, 5 have three legs, and 8 have four legs; of the 8 robots with both legs and wheels, all 8 have two legs. Suppose one of the 106 social robots is randomly selected. Let x equal the number of legs on the robot.

  1. List the possible values of x.
  2. Find the probability distribution of x.
  3. Find E(x)and give a practical interpretation of its value.

Consider the probability distributions shown here:

  1. Use your intuition to find the mean for each distribution. How did you arrive at your choice?
  2. Which distribution appears to be more variable? Why?
  3. Calculateμ and σ2 for each distribution. Compare these answers with your answers in parts a and b.

Waiting for a car wash. An automatic car wash takes exactly 5 minutes to wash a car. On average, 10 cars per hour arrive at the car wash. Suppose that 30 minutes before closing time, 5 cars are in line. If the car wash is in continuous use until closing time, will anyone likely be in line at closing time?

USDA chicken inspection. In Exercise 3.19 (p. 170), you learned that one in every 100 slaughtered chickens passes USDA inspection with fecal contamination. Consider a random sample of three slaughtered chickens that all pass USDA inspection. Let x equal the number of chickens in the sample that has fecal contamination.

  1. Find p(x)for x = 0, 1, 2, 3.
  2. Graph p(x).
  3. Find P(x1).
See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free