11-2. Chi-Square One-Sample Goodness-of-Fit Tests
Suppose we wish to determine if an ordinary-looking six-sided die is fair, or balanced, meaning that every face has probability 1/6 of landing on top when the die is tossed. We could toss the die dozens, maybe hundreds, of times and compare the actual number of times each face landed on top to the expected number, which would be 1/6 of the total number of tosses. We wouldn’t expect each number to be exactly 1/6 of the total, but it should be close. To be specific, suppose the die is tossed n = 60 times with the results summarized in Table 11.8 "Die Contingency Table". For ease of reference we add a column of expected frequencies, which in this simple example is simply a column of 10s. The result is shown as Table 11.9 "Updated Die Contingency Table". In analogy with the previous section we call this an “updated” table. A measure of how much the data deviate from what we would expect to see if the die really were fair is the sum of the squares of the differences between the observed frequency O and the expected frequency E in each row, or, standardizing by dividing each square by the expected number, the sum . If we formulate the investigation as a test of hypotheses, the test is
vs.
Table 11.8 Die Contingency Table
Die Value
Assumed Distribution
Observed Frequency
1
1/6
9
2
1/6
15
3
1/6
9
4
1/6
8
5
1/6
6
6
1/6
13
Table 11.9 Updated Die Contingency Table
Die Value
Assumed Distribution
Observed Freq.
Expected Freq.
1
1/6
9
10
2
1/6
15
10
3
1/6
9
10
4
1/6
8
10
5
1/6
6
10
6
1/6
13
10
We would reject the null hypothesis that the die is fair only if the number is large, so the test is right-tailed. In this example the random variable has the chi-square distribution with five degrees of freedom.
If we had decided at the outset to test at the 10% level of significance, the critical value defining the rejection region would be, reading from Figure 12.4 "Critical Values of Chi-Square Distributions", , so that the rejection region would be the interval .
When we compute the value of the standardized test statistic using the numbers in the last two columns of Table 11.9 "Updated Die Contingency Table", we obtain
Since 5.6 < 9.236 the decision is not to reject . See Figure 11.5 "Balanced Die". The data do not provide sufficient evidence, at the 10% level of significance, to conclude that the die is loaded.
Figure 11.5 Balanced Die
In the general situation we consider a discrete random variable that can take I different values, for which the default assumption is that the probability distribution is
We wish to test the hypotheses
vs.
We take a sample of size n and obtain a list of observed frequencies. This is shown in Table 11.10 "General Contingency Table". Based on the assumed probability distribution we also have a list of assumed frequencies, each of which is defined and computed by the formula
Table 11.10 General Contingency Table
Factor Levels
Assumed Distribution
Observed Frequency
1
p1
O1
2
p2
O2
⋮
⋮
⋮
I
pI
OI
Table 11.10 "General Contingency Table" is updated to Table 11.11 "Updated General Contingency Table" by adding the expected frequency for each value of X. To simplify the notation we drop indices for the observed and expected frequencies and represent Table 11.11 "Updated General Contingency Table" by Table 11.12 "Simplified Updated General Contingency Table".
Table 11.11 Updated General Contingency Table
Factor Levels
Assumed Distribution
Observed Freq.
Expected Freq.
1
p1
O1
E1
2
p2
O2
E2
⋮
⋮
⋮
⋮
I
pI
OI
EI
Table 11.12 Simplified Updated General Contingency Table
Factor Levels
Assumed Distribution
Observed Freq.
Expected Freq.
1
p1
O
E
2
p2
O
E
⋮
⋮
⋮
⋮
I
pI
O
E
Here is the test statistic for the general hypothesis based on Table 11.12 "Simplified Updated General Contingency Table", together with the conditions that it follow a chi-square distribution.
Test Statistic for Testing Goodness of Fit to a Discrete Probability Distribution
where the sum is over all the rows of the table (one for each value of X).
If
the true probability distribution of is as assumed, and
the observed count O of each cell in Table 11.12 "Simplified Updated General Contingency Table" is at least 5,
then approximately follows a chi-square distribution with degrees of freedom.
The test is known as a goodness-of-fit test since it tests the null hypothesis that the sample fits the assumed probability distribution well. It is always right-tailed, since deviation from the assumed probability distribution corresponds to large values of .
Testing is done using either of the usual five-step procedures.
EXAMPLE 2. Table 11.13 "Ethnic Groups in the Census Year" shows the distribution of various ethnic groups in the population of a particular state based on a decennial U.S. census. Five years later a random sample of 2,500 residents of the state was taken, with the results given in Table 11.14 "Sample Data Five Years After the Census Year" (along with the probability distribution from the census year). Test, at the 1% level of significance, whether there is sufficient evidence in the sample to conclude that the distribution of ethnic groups in this state five years after the census had changed from that in the census year.
TABLE 11.13 ETHNIC GROUPS IN THE CENSUS YEAR
Ethnicity
White
Black
Amer.-Indian
Hispanic
Asian
Others
Proportion
0.743
0.216
0.012
0.012
0.008
0.009
TABLE 11.14 SAMPLE DATA FIVE YEARS AFTER THE CENSUS YEAR
Ethnicity
Assumed Distribution
Observed Frequency
White
0.743
1732
Black
0.216
538
American-Indian
0.012
32
Hispanic
0.012
42
Asian
0.008
133
Others
0.009
23
[ Solution ]
We test using the critical value approach.
Step 1. The hypotheses of interest in this case can be expressed as
vs.
Step 2. The distribution is chi-square.
Step 3. To compute the value of the test statistic we must first compute the expected number for each row of Table 11.14 "Sample Data Five Years After the Census Year". Since n = 2500, using the formula and the values of pi from either Table 11.13 "Ethnic Groups in the Census Year" or Table 11.14 "Sample Data Five Years After the Census Year",
Table 11.14 "Sample Data Five Years After the Census Year" is updated to Table 11.15 "Observed and Expected Frequencies Five Years After the Census Year".
TABLE 11.15 OBSERVED AND EXPECTED FREQUENCIES FIVE YEARS AFTER THE CENSUS YEAR
Ethnicity
Assumed Dist.
Observed Freq.
Expected Freq.
White
0.743
1732
1857.5
Black
0.216
538
540
American-Indian
0.012
32
30
Hispanic
0.012
42
30
Asian
0.008
133
20
Others
0.009
23
22.5
The value of the test statistic is
Since the random variable takes six values, I = 6. Thus the test statistic follows the chi-square distribution with degrees of freedom.
Since the test is right-tailed, the critical value is . Reading from Figure 12.4 "Critical Values of Chi-Square Distributions", , so the rejection region is .
Since the decision is to reject the null hypothesis. See Figure 11.6. The data provide sufficient evidence, at the 1% level of significance, to conclude that the ethnic distribution in this state has changed in the five years since the U.S. census.
Figure 11.6 Note 11.15 "Example 2"
Last updated