Describe the sampling distribution of based on large samples of size n—that is, give the mean, the standard deviation, and the (approximate) shape of the distribution of when large samples of size n are (repeatedly) selected fromthe binomial distribution with probability of success p.

Short Answer

Expert verified

Mean p

Standard deviationp1-pn

The approximate shape is the normal bell-curved.

Step by step solution

01

Given information

It is required to state the mean, standard deviation, and a rough idea of the distribution's shape for the sample size is huge (n), and the sampling proportion of the parameter p.

02

Determination of mean

We know that the value of sample proportion remains the same as population proportion as it is an unbiased estimator of the population proportion

Hence an unbiased estimate ofp^ is p

03

Determination of standard deviation

The standard deviation of the sampling distribution of p^isp1-pn

04

Size of the sampling distribution

For large samples, the sampling distribution follows the normal distribution. The sample size is considered large if bothnp^15,n1-p^15

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

USGA golf ball tests. The United States Golf Association (USGA) tests all new brands of golf balls to ensure that they meet USGA specifications. One test conducted is intended to measure the average distance traveled when the ball is hit by a machine called "Iron Byron," a name inspired by the swing of the famous golfer Byron Nelson. Suppose the USGA wishes to estimate the mean distance for a new brand to within 1 yard with 90% confidence. Assume that

past tests have indicated that the standard deviation of the distances Iron Byron hits golf balls is approximately 10 yards. How many golf balls should be hit by Iron Byron to achieve the desired accuracy in estimating the mean?

What is the confidence level of each of the following confidence intervals μ?

  1. χ¯±1.96σn
  2. χ¯±1.645σn
  3. χ¯±2,575σn
  4. χ¯±1.282σn
  5. χ¯±.99σn

Question: Heart rate variability of police officers. Are police officers susceptible to higher-than-normal heart rates? The heart rate variability (HRV) of police officers was the subject of research published in the American Journal of Human Biology (January 2014). HRV is defined as the variation in time intervals between heartbeats. A measure of HRV was obtained for each in a sample of 355 Buffalo, N.Y., police officers. (The lower the measure of HRV, the more susceptible the officer is to cardiovascular disease.) For the 73 officers diagnosed with hypertension, a 95% confidence interval for the mean HRV was (4.1, 124.5). For the 282 officers who are not hypertensive, a 95% confidence interval for the mean HRV was (148.0, 192.6).

a. What confidence coefficient was used to generate the confidence intervals?

b. Give a practical interpretation of both 95% confidence intervals. Use the phrase “95% confident” in your answer.

c. When you say you are “95% confident,” what do you mean?

d. If you want to reduce the width of each confidence interval, should you use a smaller or larger confidence coefficient? Explain.

A sampling dispute goes to court. Sampling of Medicare and Medicaid claims by the federal and state agencies who administer those programs has become common practice to determine whether providers of those services are submitting valid claims. (See the Statistics in Action for this chapter.) The reliability of inferences based on those samples depends on the methodology used to collect the sample of claims. Consider estimating the true proportion, p, of the population of claims that are invalid. (Invalid claims should not have been reimbursed by the agency.) Of course, to estimate a binomial parameter, p, within a given level of precision we use the formula provided in Section 6.5 to determine the necessary sample size. In a recent actual case, the statistician determined a sample size large enough to ensure that the bound on the error of the estimate would not exceed 0.05, using a 95% confidence interval. He did so by assuming that the true error rate was, which, as discussed in Section 6.5, provides the maximum sample size needed to achieve the desired bound on the error.

a. Determine the sample size necessary to estimate p to within .05 of the true value using a 95% confidence interval.

b. After the sample was selected and the sampled claims were audited, it was determined that the estimated error rate was and a 95% confidence interval for p was (0.15, 0.25). Was the desired bound on the error of the estimate met?

c. An economist hired by the Medicare provider noted that, since the desired bound on the error of .05 is equal to 25% of the estimated invalid claim rate, the “true” bound on the error was .25, not .05. He argued that a significantly larger sample would be necessary to meet the “relative error” (the bound on the error divided by the error rate) goal of .05, and that the statistician’s use of the “absolute error” of .05 was inappropriate, and more sampling was required. The statistician argued that the relative error was a moving target, since it depends on the sample estimate of the invalid claim rate, which cannot be known prior to selecting the sample. He noted that if the estimated invalid claim rate turned out to be larger than .5, the relative error would then be lower than the absolute error bound. As a consequence, the case went to trial over the relative vs. absolute error dispute. Give your opinion on the matter. [Note: The Court concluded that “absolute error was the fair and accurate measure of the margin of error.” As a result, a specified absolute bound on the error continues to be the accepted method for determining the sample size necessary to provide a reliable estimate of Medicare and Medicaid providers’ claim submission error rates.]

Crude oil biodegradation. Refer to the Journal of Petroleum Geology (April 2010) study of the environmental factors associated with biodegradation in crude oil reservoirs, Exercise 2.29 (p. 85). One indicator of biodegradation is the level of dioxide in the water. Recall that 16 water specimens were randomly selected from various locations in a reservoir on the floor of a mine and the amount of dioxide (milligrams/liter) as well as presence of oil was determined for each specimen. These data are reproduced in the next table.

a. Estimate the true mean amount of dioxide present in water specimens that contain oil using a 95% confidence interval. Give a practical interpretation of the interval.

b. Repeat part a for water specimens that do not contain oil.

c. Based on the results, parts a and b, make an inference about biodegradation at the mine reservoir.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free