9-4. Comparison of Two Population Proportions

Suppose we wish to compare the proportions of two populations that have a specific characteristic, such as the proportion of men who are left-handed compared to the proportion of women who are left-handed.

Figure 9.7 "Independent Sampling from Two Populations In Order to Compare Proportions" illustrates the conceptual framework of our investigation. Each population is divided into two groups, the group of elements that have the characteristic of interest (for example, being left-handed) and the group of elements that do not. We arbitrarily label one population as Population 1 and the other as Population 2, and subscript the proportion of each population that possesses the characteristic with the number 1 or 2 to tell them apart.

We draw a random sample from Population 1 and label the sample statistic it yields with the subscript 1. Without reference to the first sample we draw a sample from Population 2 and label its sample statistic with the subscript 2.

Figure 9.7 Independent Sampling from Two Populations In Order to Compare Proportions

1. Confidence Intervals

and

[ Solution ]

Because the β€œNo public web access” population was labeled as Population 1 and the β€œPublic web access” population was labeled as Population 2, in words this means that we estimate that the proportion of projects that passed on the first inspection increased by 13 percentage points after records were posted on the web.

so that

and for sample 2:

so that

Thus the desired confidence interval is

Taking into account the labeling of the two populations, this means that we are 90% confident that the proportion of projects that pass on the first inspection is between 6 and 20 percentage points higher after public access to the records than before.

2. Hypothesis Testing

The three forms of the alternative hypothesis, with the terminology for each case, are:

As long as the samples are independent and both are large the following formula for the standardized test statistic is valid, and it has the standard normal distribution.

Standardized Test Statistic for Hypothesis Tests Concerning the Difference Between Two Population Proportions

The test statistic has the standard normal distribution.

The samples must be independent, and each sample must be large: each of the intervals

and

EXAMPLE 11. Using the data of Note 9.25 "Example 10", test whether there is sufficient evidence to conclude that public web access to the inspection records has increased the proportion of projects that passed on the first inspection by more than 5 percentage points. Use the critical value approach at the 10% level of significance.

[ Solution ]

  • In the context of the problem our conclusion is:

    The data provide sufficient evidence, at the 10% level of significance, to conclude that the rate of passing on the first inspection has increased by more than 5 percentage points since records were publicly posted on the web.

Figure 9.8 Rejection Region and Test Statistic for Note 9.27 "Example 11"

EXAMPLE 12. Perform the test of Note 9.27 "Example 11" using the p-value approach.

[ Solution ]

Last updated