The following table gives data on the mean number of seeds produced in a year by several common tree species and the mean weight (in milligrams) of the seeds produced. Two species appear twice because their seeds were counted in two locations. We might expect that trees with heavy seeds produce fewer of them, but what mathematical model best describes the relationship?

a. Describe the association between seed count and seed weight shown in the scatterplot.

b. Two alternative models based on transforming the original data are proposed to predict the seed weight from the seed count. Here are graphs and computer output from a least-squares regression analysis of the transformed data.

Model A:

Model B:

Which model, A or B, is more appropriate for predicting seed weight from seed count? Justify your answer.

c. Using the model you chose in part (b), predict the seed weight if the seed count is 3700.

Short Answer

Expert verified

a. The direction of the scatterplot is negative because the pattern in the scatterplot slopes downward.

b. Model B is appropriate.

c. The predicted seed weight is19.7766mg.

Step by step solution

01

Part a. Step 1. Explanation

It is given the data and the scatterplot on the mean number of seeds produced in a year by several common tree species and the mean weight of the seeds produced. By looking at the scatterplot we can say that,

The direction of the scatterplot is negative because the pattern in the scatterplot slopes downward. And the form of the scatterplot is curved because there is a strong curvature present in the scatterplot. Also the strength of the scatterplot is strong because the points in the scatterplot do not deviate much from the general pattern in the points. The unusual features in the scatterplot: There appear to be one outlier because the right most point in the scatterplot lies far from the other points in the scatterplot.

02

Part b. Step 1. Explanation

It is given the data and the scatterplot on the mean number of seeds produced in a year by several common tree species and the mean weight of the seeds produced. The two alternative models are proposed to predict the seed weight from the seed count. Thus, the scatterplot of model A contains strong curvature and the residual plot of the model A contains strong curvature as well which indicates that model A is not appropriate. Whereas, the scatterplot B contains no strong curvature and the residual plot of model B contains no strong curvature as well. Moreover, the residuals in the residual plot appear to be randomly scattered about the horizontal line at zero and thus model B appears to be appropriate for predicting seed weight from the seed count.

03

Part c. Step 1. Explanation

It is given the data and the scatterplot on the mean number of seeds produced in a year by several common tree species and the mean weight of the seeds produced. The two alternative models are proposed to predict the seed weight from the seed count. In part (b), we find out the model B is more appropriate. Then we will use the model B. Thus, the general equation of the least square regression line is:

y^=b0+b1x

Thus, form the computer output, we have that the estimate of the constant is given in the row “Constant” and in the column “Coef” as:

b0=15.491

The slope b1is given in the row “Mentos” and in the column “Coef” of the given computer output as:

b1=-1.5222

Now replacing the values in the equation we have,

y^=b0+b1x=15.491-1.5222x

Now, take the logarithm in the equation and solve it as:

lny^=15.491-1.5222x

Replace xby 3700,

lny^=15.491-1.5222x=15.491-1.5222(3700)=2.9845

Now taking exponential on both sides we have,

y^=elny^=e2.9845=19.7766

Thus, the predicted seed weight is19.7766mg.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Click-through rates Companies work hard to have their website listed at the top of an Internet search. Is there a relationship between a website’s position in the results of an Internet search (1=top position,2=2nd position, etc.) and the percentage of people who click on the link for the website? Here are click-through rates for the top 10 positions in searches on a mobile device:

a. Make an appropriate scatterplot for predicting click-through rate from the position. Describe what you see.

b. Use transformations to linearize the relationship. Does the relationship between click-through rate and position seem to follow an exponential model or a power model? Justify your answer.

c. Perform least-squares regression on the transformed data. Give the equation of your regression line. Define any variables you use.

d. Use your model from part (c) to predict the click-through rate for a website in the 11th position.

R12.4 Long legs Construct and interpret a 95% confidence interval for the slope of the population regression line. Assume that the conditions for inference are met. Explain how the interval provides more information than the test in R12.3.

Determining tree biomass It is easy to measure the diameter at breast height (in centimeters) of a tree. It’s hard to measure the total aboveground biomass (in kilograms) of a tree because to do this, you must cut and weigh the tree. Biomass is important for studies of ecology, so ecologists commonly estimate it using a power model. The following figure is a scatterplot of the natural logarithm of biomass against the natural logarithm of diameter at breast height (DBH) for 378trees in tropical rain forests. The least-squares regression line for the transformed data is lny=-2.00+2.42lnxlny^=-2.00+2.42lnx^

Use this model to estimate the biomass of a tropical tree 30cm in diameter.

Recycle and Review Exercises 29-31 refer to the following setting. Does the color in which words are printed affect your ability to read them? Do the words themselves affect your ability to name the color in which they are printed? Mr. Starnes designed a study to investigate these questions using the 16 students in his AP Statistics class as subjects. Each student performed the following two tasks in random order while a partner timed his or her performance: (1) Read 32words aloud as quickly as possible, and (2) say the color in which each of32 words is printed as quickly as possible. Try both tasks for yourself using the word list given.

Color words (4.2) Let's review the design of the study-

a. Explain why this was an experiment and not an observational study.

b. Did Mr. Stames use a completely randomized design a: randomizéd black design? Why do you think he choose this experimental design?

c. Explain the purpose of the ramdom assignment in the context of the study.

Here are the data from Mr. Stames's experiment. For each subject, the time to perform the Iwo tasks is given to the nearest second.

A scatterplot of yversus xshows a positive, nonlinear association. Two different transformations are attempted to try to linearize the association: using the logarithm of the y-values and using the square root of the y-values. Two least-squares regression lines are calculated, one that uses x to predict log(y) and the other that uses x to predict y. Which of the following would be the best reason to prefer the least-squares regression line that uses x to predict log(y)?

a. The value of r2is smaller.

b. The standard deviation of the residuals is smaller.

c. The slope is greater.

d. The residual plot has more random scatter.

e. The distribution of residuals is more Normal.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free