Using the Kruskal-Wallis Test. In Exercises 5–8, use the Kruskal-Wallis test.

Correcting the H Test Statistic for Ties In using the Kruskal-Wallis test, there is a correction factor that should be applied whenever there are many ties: Divide H by

\(1 - \frac{{\sum T }}{{{N^3} - N}}\)

First combine all of the sample data into one list, and then, in that combined list, identify the different groups of sample values that are tied. For each individual group of tied observations, identify the number of sample values that are tied and designate that number as t, then calculate\(T = {t^3} - t\). Next, add the T values to get\(\sum T \). The value of N is the total number of observations in all samples combined. Use this procedure to find the corrected value of H for Example 1 in this section on page 628. Does the corrected value of H differ substantially from the value found in Example 1?

Short Answer

Expert verified

The corrected value of H is equal to 0.703.

No, the corrected value of H does not differ significantly from the ordinary value.

Step by step solution

01

Given information

Three samples showing the performance IQ scores of subjects with different blood lead levels are given.

Refer to Example 1 of Section 13-5.The following table shows the ranks:

IQ score

Sample Name

Ranks

85

Low Lead Level

6.5

90

Low Lead Level

8.5

107

Low Lead Level

18.5

85

Low Lead Level

6.5

100

Low Lead Level

15.5

97

Low Lead Level

12.5

101

Low Lead Level

17

64

Low Lead Level

1

78

Medium Lead Level

2

97

Medium Lead Level

12.5

107

Medium Lead Level

18.5

80

Medium Lead Level

4

90

Medium Lead Level

8.5

83

Medium Lead Level

5

93

High Lead Level

10

100

High Lead Level

15.5

97

High Lead Level

12.5

79

High Lead Level

3

97

High Lead Level

12.5

02

Obtain the correction factor

Under the Kruskal-Wallis test, the value of the test statistic (H) is computed without accounting for the number of ties.

Ties represent sample values that occur multiple times in three samples.

If there are ties present, a correction is applied to the value of H.

The correction factor has the following formula:

\(1 - \frac{{\sum T }}{{{N^3} - N}}\)

where

\(T = {t^3} - t\)

Here, t is the number of sample values tied to each tied group.

Referring to Example 1 of Section 13-5, the given table shows the sample values that are tied, the corresponding value of t, and the corresponding value of T.

Sample values that are tied

t

\(T = {t^3} - t\)

85

2

6

90

2

6

107

2

6

97

4

60

100

2

6



Sum=84

Here, the sample size of the three groups is given as:

\(\begin{array}{c}{n_1} = 8\\{n_2} = 6\\{n_3} = 5\\\end{array}\)

The total number of observations is given as:

\(\begin{array}{c}N = 8 + 6 + 5\\ = 19\end{array}\)

The correction factor is computed as shown:

\(\begin{array}{c}1 - \frac{{\sum T }}{{{N^3} - N}} = 1 - \frac{{84}}{{6859 - 19}}\\ = 0.987719\end{array}\)

Thus, the correction factor is equal to 0.987719.

03

Obtain the sum of the ranks

The sum of the ranks corresponding to the low blood lead level is computed as follows:

\(\begin{array}{c}{R_1} = 6.5 + 8.5 + 18.5 + .... + 1\\ = 86\end{array}\)

The sum of the ranks corresponding tothe medium blood lead levelis computed as follows:

\(\begin{array}{c}{R_2} = 2 + 12.5 + 18.5 + .... + 5\\ = 50.5\end{array}\)

The sum of the ranks corresponding to the high blood lead level is computed as follows:

\(\begin{array}{c}{R_3} = 10 + 15.5 + 12.5 + .... + 12.5\\ = 53.5\end{array}\)

04

Calculate the test statistic

The value of the test statistic is computed as shown below:

\)(\begin{array}{c}H = \frac{{12}}{{N\left( {N + 1} \right)}}\left( {\frac{{{R_1}^2}}{{{n_1}}} + \frac{{{R_2}^2}}{{{n_2}}} + \frac{{{R_3}^2}}{{{n_3}}}} \right) - 3\left( {N + 1} \right)\\ = \frac{{12}}{{19\left( {20} \right)}}\left( {\frac{{{{86}^2}}}{8} + \frac{{{{50.5}^2}}}{6} + \frac{{{{53.5}^2}}}{5}} \right) - 3\left( {20} \right)\\ = 0.694\end{array}\)

The ordinary value of H is equal to 0.694.

05

Determine the value of the corrected H

The corrected H can be computed as:

\(\begin{array}{c}{\rm{Corrected}}\;H = \frac{H}{{{\rm{Correction}}\;{\rm{factor}}}}\\ = \frac{{0.694}}{{0.987719}}\\ = 0.703\end{array}\)

Thus, the corrected value of H is equal to 0.703.

06

Compare the H value and the corrected H value

The corrected value of H(0.703) does not differ significantly from the H value (0.694). Thus, it can be said that a large number of ties does not seem to have a considerable amount of effect on the H value.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

In Exercises 1–4, use the following sequence of political party affiliations of recent presidents of the United States, where R represents Republican and D represents Democrat.

R RRR D R D R RR D R RR D D R D D R R D R R D R D

Runs Test If we use a 0.05 significance level to test for randomness, what are the critical values from Table A-10? Based on those values and the number of runs from Exercise 2, what should be concluded about randomness?

Notation What do r, \({r_s}\) , \(\rho \), and \({\rho _s}\) denote? Why is the subscript s used? Does the subscript s represent the same standard deviation s introduced in Section 3-2?

Nonparametric Tests

a. Which of the following terms is sometimes used instead of “nonparametric test”: normality test; abnormality test; distribution-free test; last testament; test of patience?

b. Why is the term that is the answer to part (a) better than “nonparametric test”?

Student Evaluations of Professors Use the sample data given in Exercise 1 and test the claim that evaluation ratings of female professors have the same median as evaluation ratings of male professors. Use a 0.05 significance level.

Finding Critical Values An alternative to using Table A–9 to find critical values for rank correlation is to compute them using this approximation:

\({r_s} = \pm \sqrt {\frac{{{t^2}}}{{{t^2} + n - 2}}} \)

Here, t is the critical t value from Table A-3 corresponding to the desired significance level and n - 2 degrees of freedom. Use this approximation to find critical values of\({r_s}\)for Exercise 15 “Blood Pressure.” How do the resulting critical values compare to the critical values that would be found by using Formula 13-1 on page 633?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free