8-2. Large Sample Tests for a Population Mean

In this section we describe and demonstrate the procedure for conducting a test of hypotheses about the mean of a population in the case that the sample size n is at least 30. The Central Limit Theorem states that Xˉ\bar{X} is approximately normally distributed, and has mean μ_\bar{X}=μ and standard deviation σ_\bar{X}=σ/\sqrt{n} , where μμ and σσ are the mean and the standard deviation of the population. This implies that the statistic

has the standard normal distribution, which means that probabilities related to it are given in Figure 12.2 "Cumulative Normal Probability" and the last line in Figure 12.3 "Critical Values of ".

If we know σσ then the statistic in the display is our test statistic. If, as is typically the case, we do not know σσ, then we replace it by the sample standard deviation ss. Since the sample is large the resulting test statistic still has a distribution that is approximately standard normal.

Standardized Test Statistics for Large Sample Hypothesis Tests Concerning a Single Population Mean

 If σσ is known:  zo=xˉμ0σ/nz_o=\frac {\bar{x}−μ_0}{σ/\sqrt{n} }

If σσ is unknown: to=xˉμ0s/nt_o=\frac {\bar{x}−μ_0}{s/\sqrt{n} }

The test statistic has the standard normal distribution.

The distribution of the standardized test statistic and the corresponding rejection region for each form of the alternative hypothesis (left-tailed, right-tailed, or two-tailed), is shown in Figure 8.4 "Distribution of the Standardized Test Statistic and the Rejection Region".

Figure 8.4

EXAMPLE 4. It is hoped that a newly developed pain reliever will more quickly produce perceptible reduction in pain to patients after minor surgeries than a standard pain reliever. The standard pain reliever is known to bring relief in an average of 3.5 minutes with standard deviation 2.1 minutes. To test whether the new pain reliever works more quickly than the standard one, 50 patients with minor surgeries were given the new pain reliever and their times to relief were recorded. The experiment yielded sample mean xˉ=3.1\bar{x} = 3.1 minutes and sample standard deviation s=1.5s = 1.5 minutes. Is there sufficient evidence in the sample to indicate, at the 5% level of significance, that the newly developed pain reliever does deliver perceptible relief more quickly?

[ Solution ]

  • Step 1. H0:μ=3.5H_0:μ=3.5 vs. Ha:μ=<3.5, α=0.05Ha:μ=<3.5 , \space α=0.05

  • Step 2. the test statistic is zo=xˉμ0s/nz_o=\frac {\bar{x} - μ_0}{s/\sqrt{n}}

    and has the standard normal distribution.

  • Step 3. Compute the test statistic zo=xˉμ0s/n=(3.13.5)1.5/50=1.886z_o=\frac{\bar{x}−μ_0}{s/\sqrt{n}} = \frac {(3.1−3.5)} { 1.5/ \sqrt{50}}=−1.886

  • Step 4. Since the symbol in HaH_a is “ << ” this is a left-tailed test, so there is a single critical value, zα=z0.05=1.645−z_α=−z_{0.05} = −1.645 rejection region : (,1.645]. (−∞,−1.645].

  • Step 5. the test statistic falls in the rejection region. The decision is to reject H0H_0 .

  • Conclusion: The data provide sufficient evidence, at the 5% level of significance, to conclude that the average time until patients experience perceptible relief from pain using the new pain reliever is smaller than the average time for the standard pain reliever.

Figure 8.5 Rejection Region and Test Statistic for Note 8.27 "Example 4"

n <- 50
x <- 3.1
mu <- 3.5
sd <- 1.5
alpha <- 0.05

# in case of left-tail test
se <- sd / sqrt(n)

z_c <- qnorm(1-alpha); z_c
z_o <- (x-mu)/se; z_o

if (z_o <= - z_c) cat("Reject Ho.\n") else 
                cat ("Can not reject Ho.\n")
  • Using Rstat Package : meantest1.plot()

library(Rstat)

meantest1.plot(xb=3.1, mu0=3.5, sig=1.5, n=50, side="low")

EXAMPLE 5. A cosmetics company fills its best-selling 8-ounce jars of facial cream by an automatic dispensing machine. The machine is set to dispense a mean of 8.1 ounces per jar. Uncontrollable factors in the process can shift the mean away from 8.1 and cause either underfill or overfill, both of which are undesirable. In such a case the dispensing machine is stopped and recalibrated. Regardless of the mean amount dispensed, the standard deviation of the amount dispensed always has value 0.22 ounce. A quality control engineer routinely selects 30 jars from the assembly line to check the amounts filled. On one occasion, the sample mean is xˉ=8.2\bar{x} = 8.2 ounces and the sample standard deviation is s=0.25s = 0.25 ounce. Determine if there is sufficient evidence in the sample to indicate, at the 1% level of significance, that the machine should be recalibrated.

[ Solution ]

  • Step 1. the hypothesis test is H0:μ=8.1vs.Ha:μ8.1,@α=0.01H_0:μ=8.1 vs.  H_a:μ≠8.1, @ α=0.01

  • Step 2. The sample is large and the population standard deviation is known. Thus the test statistic zo=xˉμ0σ/nz_o=\frac {\bar{x}−μ_0}{σ/\sqrt{n} }

    and has the standard normal distribution.

  • Step 3. Inserting the data into the formula for the test statistic gives zo=xˉμ0σ/n=(8.28.0)0.22/30=2.490z_o=\frac {\bar{x}−μ_0}{σ/\sqrt{n} } = \frac {(8.2 -8.0)}{ 0.22 / \sqrt{30} } = 2.490

  • Step 4. Since the symbol in HaH_a is “ ” this is a two-tailed test, so there are two critical values, ±zα2=±z0.005=2.576±z_{α∕2}=±z_{0.005} = 2.576 . The rejection region is (,2.576][2.576,). (−∞,−2.576]∪[2.576,∞). .

  • Step 5. the test statistic does not fall in the rejection region. The decision is not to reject H0H_0.

  • conclusion is:

    The data do not provide sufficient evidence, at the 1% level of significance, to conclude that the average amount of product dispensed is different from 8.1 ounce. We conclude that the machine does not need to be recalibrated.

Figure 8.6

n <- 30
x <- 8.2
mu <- 8.1
sd <- 0.22
alpha <- 0.01

# in case of two test
se <- sd / sqrt(n)

z_c <- qnorm(1-alpha/2); z_c
z_o <- (x-mu)/se; z_o


if (z_o <= -z_c || z_o >= z_c ) cat("Reject Ho.\n") else 
                cat ("Can not reject Ho.\n")
  • Using Rstat Package : meantest1.plot()

library(Rstat)

meantest1.plot(xb=8.2, mu0=8.1, sig=0.22, n=30, side="two")

Last updated