Have a ball! Can students throw a baseball farther than a softball? To find out, researchers conducted a study involving 24randomly selected students from a large high school. After warming up, each student threw a baseball as far as he or she could and threw a softball as far as he she could, in a random order. The distance in yards for each throw was recorded. Here are the data, along with the difference (Baseball – Softball) in distance thrown, for each student:

a. Explain why these are paired data.

b. A boxplot of the differences is shown. Explain how the graph gives some evidence that students like these can throw a baseball farther than a softball.

c. State appropriate hypotheses for performing a test about the true mean difference. Be sure to define any parameter(s) you use.

d. Explain why the Normal/Large Sample condition is not met in this case. The mean difference (Baseball−Softball) in distance thrown for these 24students is xdiff = 6.54yards. Is this a surprisingly large result if the null hypothesis is true? To find out, we can perform a simulation assuming that students have the same ability to throw a baseball and a softball. For each student, write the two distances thrown on different note cards. Shuffle the two cards and designate one distance to baseball and one distance to softball. Then subtract the two distances (Baseball−Softball) . Do this for all the students and find the simulated mean difference. Repeat many times. Here are the results of 100trials of this simulation

e. Use the results of the simulation to estimate the P-value. What conclusion would you draw ?

Short Answer

Expert verified

Part(a) These are to utilise paired t methods because the two samples contain the identical participants.

Part(b) The boxplot is to the right of zero, indicating that the majority of the distances are positive, and so the distance in baseball is higher than in softball.

Part(c) The appropriate hypotheses is :

H0:μD=0Ha:μD>0

Part(d) Because the distribution of the differences is considerably skewed to the right due to the outlier to the right in the boxplot, the distribution is not approximately normal.

Part(e) There is convincing evidence that student like these can throw baseball farther than soft ball.

Step by step solution

01

Part(a) Step 1 : Given information

We need to explain the paired data.

02

Part(a) Step 2 : Simplify

If the two samples contain the same individuals or if the subjects in one sample are connected to the subjects in the other sample, we must employ paired t methods.
If the subjects in the two samples are fully unrelated, we must employ two sample t methods.
In this scenario, we have the baseball distance thrown and the softball distance thrown for 24students each.
As a result, the first sample is the baseball distance thrown, whereas the second sample is the softball distance thrown.
We should utilise paired t methods because the two samples contain the identical participants.

03

Part(b) Step 1 : Given information

We need to explain given boxplot.

04

Part(b) Step 2 : Simplify

Students like them, it is believed, can throw a baseball farther than a softball.
The data in the boxplot reflects the difference in distance between the 24students' baseball and softball distances.
We can see that the majority of the boxplot is to the right of zero, indicating that the majority of the distances are positive, and so the distance in baseball is higher than in softball.
This supports the allegation, thus there is some evidence to back it up.

05

Part(c) Step 1 : Given information

We need to state hypotheses for performing a test about the true mean difference.

06

Part(c) Step 2 : Simplify

As a result, the claim that the mean difference is positive is correct.
Now we must determine the most relevant hypotheses for a significance test.
As a result, either the null hypothesis or the alternative hypothesis is the claim.
According to the null hypothesis, the population proportions are equal.
If the claim is the null hypothesis, the alternative hypothesis is the polar opposite of the null hypothesis.
As a result, the following assumptions are appropriate:

H0:μD=0Ha:μD>0
Here, μDis the mean difference in thrown distance between the baseball and the softball.

07

Part(d) Step 1 : Given information

We need to explain why Normal/Large Sample condition is not met in this case.

08

Part(d) Step 2 : Simplify

The normal or big condition necessitates either a large sample or a distribution of differences that is roughly normal.
The sample isn't big enough because the sample size is only 24, which isn't even close to 30.
Because the distribution of the differences is considerably skewed to the right due to the outlier to the right in the boxplot, the distribution is not approximately normal.
This indicates that the condition has not been met.

09

Part(e) Step 1 : Given information

We need to estimate the P-value to draw conclusion.

10

Part(e) Step 2 : Simplify

As given in the question , n=24

Mean : x=i=1nxin=8+32+9+12+14+12+5+0+12+5+7+2+18-3+9+6+2+1+3+1+3+1+3-524=15724=6.5417

P-value is probability of obtaining sample results.

P-value =010=0

As P-value is less than or equal to significance level then null hypothesis is rejected.

P<0.05RejectH0

Therefore, At the α=0.05 level , students like these can throw baseball faster than softball.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

School A has 400students and School B has 2700students. A local newspaper wants to compare the distributions of SAT scores for the two schools. Which of the following would be the most useful for making this comparison?

a. Back-to-back stem plots for A and B

b. A scatterplot of A versus B

c. Two dot plots for A and B drawn on the same scale

d. Two relative frequency histograms of A and B drawn on the same scale

e. Two bar graphs for A and B drawn on the same scale

Broken crackers We don’t like to find broken crackers when we open the package. How can makers reduce breaking? One idea is to microwave the crackers for 30seconds right after baking them. Randomly assign 65newly baked crackers to the microwave and another 65to a control group that is not microwaved. After 1day, none of the microwave group were broken and 16of the control group were broken. Let p1be the true proportions of crackers like these that would break if baked in the microwave and p2be the true proportions of crackers like these that would break if not microwaved. Check if the conditions for calculating a confidence interval forp1-p2met.

Better barley Does drying barley seeds in a kiln increase the yield of barley? A famous experiment by William S. Gosset (who discovered the t distributions) investigated this question. Eleven pairs of adjacent plots were marked out in a large field. For each pair, regular barley seeds were planted in one plot and kiln-dried seeds were planted in the other. A coin flip was used to determine which plot in each pair got the regular barley seed and which got the kiln-dried seed. The following table displays the data on barley yield (pound per acre) for each plot.

Do these data provide convincing evidence at the α=0.05level that drying barley seeds in a kiln increases the yield of barley, on average?

Shortly before the 2012presidential election, a survey was taken by the school newspaper at a very large state university. Randomly selected students were asked, “Whom do you plan to vote for in the upcoming presidential election?” Here is a two-way table of the responses by political persuasion for 1850students:

Candidate of

choice


Political persuasion

Democrat
Republican
Independent
Total
Obama
925
78
26
1029
Romney
78
598
19
695
Other
2
8
11
21
Undecided
32
28
45
105
Total
1037
712
101
1850

Which of the following statements about these data is true?

a. The percent of Republicans among the respondents is 41%.

b. The marginal relative frequencies for the variable choice of candidate are given by

Obama: 55.6%; Romney: 37.6%; Other: 1.1%; Undecided: 5.7%.

c. About 11.2%of Democrats reported that they planned to vote for Romney.

d. About 44.6%of those who are undecided are Independents.

e. The distribution of political persuasion among those for whom Romney is the

candidate of choice is Democrat: 7.5%; Republican: 84.0%; Independent: 18.8%.

A beef rancher randomly sampled 42 cattle from her large herd to obtain a 95%confidence interval for the mean weight (in pounds) of the cattle in the herd. The interval obtained was (1010,1321). If the rancher had used a 98%confidence interval instead, the interval would have been

a. wider with less precision than the original estimate.

b. wider with more precision than the original estimate.

c. wider with the same precision as the original estimate.

d. narrower with less precision than the original estimate.

e. narrower with more precision than the original estimate.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free