8-4. Small Sample Tests for a Population Mean

In the previous section hypotheses testing for population means was described in the case of large samples. The statistical validity of the tests was insured by the Central Limit Theorem, with essentially no assumptions on the distribution of the population.

When sample sizes are small, as is often the case in practice, the Central Limit Theorem does not apply. One must then impose stricter assumptions on the population to give statistical validity to the test procedure.

One common assumption is that the population from which the sample is taken has a normal probability distribution to begin with. Under such circumstances, if the population standard deviation σ\sigma is known, then the test statistic (xˉμ0)/(σ/n)(\bar{x}−μ_0) /(σ/\sqrt{n}) still has the standard normal distribution, as in the previous two sections. If σ\sigma is unknown and is approximated by the sample standard deviation s, then the resulting test statistic (xˉμ0)/(s/n)(\bar{x}−μ_0) /(s/\sqrt{n}) follows Student’s t-distribution with (n1)( n−1) degrees of freedom.

Standardized Test Statistics for Small Sample Hypothesis Tests Concerning a Single Population Mean

  • If  σσ  is known: zo=(xˉμ0)(σ/n)z_o = \frac {(\bar{x}−μ_0)}{(σ/\sqrt{n})}

  • If  σσ  is unknown: to=(xˉμ0)(s/n)t_o = \frac {(\bar{x}−μ_0)}{(s/\sqrt{n})}

The first test statistic ( σσ known) has the standard normal distribution.

The second test statistic (σσ unknown) has Student’s t-distribution with (n1)(n−1) degrees of freedom.

The population must be normally distributed.

The distribution of the second standardized test statistic (the one containing s) and the corresponding rejection region for each form of the alternative hypothesis (left-tailed, right-tailed, or two-tailed), is shown in Figure 8.11 "Distribution of the Standardized Test Statistic and the Rejection Region". This is just like Figure 8.4 "Distribution of the Standardized Test Statistic and the Rejection Region", except that now the critical values are from the t-distribution. Figure 8.4 "Distribution of the Standardized Test Statistic and the Rejection Region" still applies to the first standardized test statistic (the one containing σ) since it follows the standard normal distribution.

Figure 8.11 Distribution of the Standardized Test Statistic and the Rejection Region

The p-value of a test of hypotheses for which the test statistic has Student’s t-distribution can be computed using statistical software, but it is impractical to do so using tables, since that would require 30 tables analogous to Figure 12.2 "Cumulative Normal Probability", one for each degree of freedom from 1 to 30. Figure 12.3 "Critical Values of " can be used to approximate the p-value of such a test, and this is typically adequate for making a decision using the p-value approach to hypothesis testing, although not always.

For this reason the tests in the two examples in this section will be made following the critical value approach to hypothesis testing summarized at the end of Section 8.1 "The Elements of Hypothesis Testing", but after each one we will show how the p-value approach could have been used.

EXAMPLE 10. The price of a popular tennis racket at a national chain store is $179. Portia bought five of the same racket at an online auction site for the following prices:

155 179 175 175 161

Assuming that the auction prices of rackets are normally distributed, determine whether there is sufficient evidence in the sample, at the 5% level of significance, to conclude that the average price of the racket is less than $179 if purchased at an online auction.

[ Solution ]

  • Step 1. The hypothesis test is H0:μ=179vs.Ha:μ<179,@α=0.05H0:μ=179  vs. Ha:μ<179, @ α=0.05

  • Step 2. The sample is small and the population standard deviation is unknown. Thus the test statistic is to=(xˉμ0)(s/n)t_o = \frac {(\bar{x}−μ_0)}{(s/\sqrt{n})}

    and has the Student t-distribution with (n1)=(51)=4(n−1)=(5−1)=4 degrees of freedom.

  • Step 3. From the data we compute xˉ=169\bar{x}=169 and s=10.39s = 10.39 . Inserting these values into the formula for the test statistic gives to=(xˉμ0)(s/n)=(169179)10.39/5=2.152t_o = \frac {(\bar{x}−μ_0)}{(s/\sqrt{n})} = \frac {(169−179)}{10.39/\sqrt{5}} = -2.152

  • Step 4. Since the symbol in Ha is “<” this is a left-tailed test, so there is a single critical value, tα=t0.05[df=4]−t_α=−t_{0.05}[df=4] . Reading from the row labeled df=4df=4 in Figure 12.3 "Critical Values of " its value is −2.132. The rejection region is (,2.132].(−∞,−2.132].

  • Step 5. As shown in Figure 8.12 "Rejection Region and Test Statistic for " the test statistic falls in the rejection region. The decision is to reject H0H_0 .

  • In the context of the problem our conclusion is:

    The data provide sufficient evidence, at the 5% level of significance, to conclude that the average price of such rackets purchased at online auctions is less than $179.

Figure 8.12 Rejection Region and Test Statistic for Note 8.42 "Example 10"

n <- 5
xb <- 169
m0 <- 179
sd <- 10.39
alpha <- 0.05

# in case of left-tail test
se <- sd / sqrt(n)

t_c <- qt(1-alpha, n-1); t_c
t_o <- (xb-m0)/se; t_o

if (t_o <= -t_c) cat("Reject Ho.\n") else 
                 cat ("Can not reject Ho.\n")

To perform the test in Note 8.42 "Example 10" using the p-value approach, look in the row in Figure 12.3 "Critical Values of " with the heading df=4df=4 and search for the two t-values that bracket the unsigned value 2.152 of the test statistic. They are 2.132 and 2.776, in the columns with headings t0.050t_{0.050} and t0.025t_{0.025} . They cut off right tails of area 0.050 and 0.025, so because 2.152 is between them it must cut off a tail of area between 0.050 and 0.025.

By symmetry −2.152 cuts off a left tail of area between 0.050 and 0.025, hence the p-value corresponding to t=2.152t=−2.152 is between 0.025 and 0.05. Although its precise value is unknown, it must be less than α=0.05α=0.05 , so the decision is to reject H0H_0 .

  • Using Rstat Package : meantest2.plot()

library(Rstat)

meantest2.plot(xb=169, mu0=179, sig=10.39, n=5, side="low")

EXAMPLE 11. A small component in an electronic device has two small holes where another tiny part is fitted. In the manufacturing process the average distance between the two holes must be tightly controlled at 0.02 mm, else many units would be defective and wasted. Many times throughout the day quality control engineers take a small sample of the components from the production line, measure the distance between the two holes, and make adjustments if needed. Suppose at one time four units are taken and the distances are measured as

0.021 0.019 0.023 0.020

Determine, at the 1% level of significance, if there is sufficient evidence in the sample to conclude that an adjustment is needed. Assume the distances of interest are normally distributed.

[ Solution ]

  • Step 1. The relevant test is H0:μ=0.02vs.Ha:μ=0.02,@α=0.01H_0:μ=0.02 vs. H_a:μ=≠0.02, @ α=0.01 where μ denotes the mean distance between the holes.

  • Step 2. The sample is small and the population standard deviation is unknown. Thus the test statistic is to=(xˉμ0)(s/n)t_o = \frac {(\bar{x}−μ_0)}{(s/\sqrt{n})}

    and has the Student t-distribution with (n1)=(41)=3(n−1)=(4−1)=3 degrees of freedom.

  • Step 3. From the data we compute xˉ=0.02075\bar{x}=0.02075 and s=0.00171.s = 0.00171. Inserting these values into the formula for the test statistic gives to=(xˉμ0)(s/n)=(0.020750.02)0.00171/4=0.877t_o = \frac {(\bar{x}−μ_0)}{(s/\sqrt{n})} = \frac {(0.02075−0.02)}{0.00171/\sqrt{4}} = 0.877

  • Step 4. Since the symbol in HaH_a is “ ” this is a two-tailed test, so there are two critical values, ±tα2=±t0.005[df=3].±t_{α∕2}=±t_{0.005}[df=3]. Reading from the row in Figure 12.3 "Critical Values of " labeled df=3df=3 their values are ±5.841.±5.841. The rejection region is (,5.841][5.841,).(−∞,−5.841]∪[5.841,∞).

  • Step 5. As shown in Figure 8.13 "Rejection Region and Test Statistic for " the test statistic does not fall in the rejection region. The decision is not to reject H0H_0 .

  • In the context of the problem our conclusion is:

    The data do not provide sufficient evidence, at the 1% level of significance, to conclude that the mean distance between the holes in the component differs from 0.02 mm.

Figure 8.13 Rejection Region and Test Statistic for Note 8.43 "Example 11"

n <- 4
xb <- 0.02075
m0 <- 0.02
sd <- 0.00171
alpha <- 0.01

# in case of two-tail test
se <- sd / sqrt(n)

t_c <- qt(1-alpha/2, n-1); t_c
t_o <- (xb-m0)/se; t_o

if (t_o <= -t_c || t_o >= t_c ) cat("Reject Ho.\n") else 
                                cat ("Can not reject Ho.\n")

To perform the test in Note 8.43 "Example 11" using the p-value approach, look in the row in Figure 12.3 "Critical Values of " with the heading df=3df=3 and search for the two t-values that bracket the value 0.877 of the test statistic. Actually 0.877 is smaller than the smallest number in the row, which is 0.978, in the column with heading t0.200t_{0.200} . The value 0.978 cuts off a right tail of area 0.200, so because 0.877 is to its left it must cut off a tail of area greater than 0.200.

Thus the p-value, which is the double of the area cut off (since the test is two-tailed), is greater than 0.400. Although its precise value is unknown, it must be greater than α=0.01α=0.01 , so the decision is not to reject H0H_0 .

  • Using Rstat Package : meantest2.plot()

library(Rstat)

meantest2.plot(xb=0.02075, mu0=0.02, sig=0.00171, n=4, side="two")

Last updated