Best-paid CEOs.Refer to Glassdoor Economic Research firm’s 2015 ranking of the 40 best-paid CEOs in Table 2.1 (p. 65). Recall that data were collected on a CEO’s current salary, age, and the ratio of salary to a typical worker’s pay at the firm.

a.Create a scatterplot to relate a CEO’s ratio of salary to worker pay to the CEO’s age. Comment on the strength of the association between these two variables.

b.Conduct an outlier analysis of the ratio variable. Identify the highly suspect outlier in the data.

c.Remove the highly suspect outlier from the data and recreate the scatterplot of part a. What do you observe?

Short Answer

Expert verified

a. The graph is given below:


No strong association

b.Highly suspect outlier = 1951

c. No major change

Step by step solution

01

Creating a Scatterplot

The graph is given below:

There is no visible strong association between the CEO’s age and the ratio of the CEO’s salary to that of a typical worker. The data is clustered to the left of the graph except for a few scattered points.

02

Conducting an outlier analysis and identifying the highly suspect outliers

We will use the box plot method to conduct an outlier analysis on the ratio variable.

We will first calculate the quartiles, IQR, Inner, and Outer fences.

Arranging the data in ascending order,

Q1=N+14=40+14=414=10.25th

Term=483

Q2=N+12=40+12=412=20.25th

Term=536.5

Q3=3N+14=340+14=1234=30.75th

Term=644.25

IQR=QU–QL=644.25–483=161.25

LowerInnerFence=QL1.5IQR=4831.5161.25=483241.875=241.125

UpperInnerFence=QU+1.5IQR=644.25+1.5161.25=644.25+241.875=886.125

Now we will plot these,

The graph shows thatthe outliers are 1951, 1522, 1192, 1133, and 939.

To find which of these are highly suspect outliers (with a z-score > 3), we will check the z-score of these outliers.

First, we will calculate the mean and standard deviation of the data.

Mean=SumofallobservationsNo.ofobservations=2567040=641.75

Variance=χ2n1=386417039=99081.28

StandardDeviation=Variance=99081.28=314.77

Mean = 641.75 and Standard Deviation = 314.77

z-scoreof1951=1951641.75314.77=1309.25314.77=4.159

z-scoreof1522=1522641.75314.77=880.25314.77=2.79

Based on the values of the z-score, 1522 is not a highly suspect outlier. Therefore, none of the values lower than 1522 are outliers.

So, the only highly suspect outlier is 1951.

03

Creating a scatterplot without the highly suspect outlier.

1951 is a highly suspect outlier. Therefore, we will remove that from the data and create the scatterplot.

There is not much difference in the scatterplot after removing the highly suspect outlier.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Answer the following questions about the variability of data sets:

a.What is the primary disadvantage of using the range to compare the variability of data sets?

b.Describe the sample variance using words rather than a formula. Do the same with the population variance.

c.Can the variance of a data set ever be negative? Explain. Can the variance ever be smaller than the standard deviation? Explain.

Describe how the mean compares to the median for distribution as follows:

a.Skewed to the left

b.Skewed to the right

c.Symmetric

Refer to Exercise 2.141. Assuming all populations are approximately mound-shaped, for parts a–d, determine whether the values 0, 4, and 12 are outliers.

Voltage sags and swells. Refer to the Electrical Engineering (Vol. 95, 2013) study of transformer voltage sags and swells, Exercise 2.76 (p. 110). Recall that for a sample of 103 transformers built for heavy industry, the mean number of sags per week was 353 and the mean number of swells per week was 184. Assume the standard deviation of the sag distribution is 30 sags per week and the standard deviation of the swell distribution is 25 swells per week. Suppose one of the transformers is randomly selected and found to have 400 sags and 100 swells in a week.

a. Find the z-score for the number of sags for this transformer. Interpret this value.

b. Find the z-score for the number of swells for this transformer. Interpret this value.

Calculate the mode, mean, and median of the following data:

18 10 15 13 17 15 12 15 18 16 11

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free