5-2. The Standard Normal Distribution

标准正态分布

1. A Standard Normal Random Variable

A standard Normal Random Variable is a normally distributed random variable with mean μ=0μ = 0 and standard deviation σ=1σ = 1 . It will always be denoted by the letter ZZ : ZN(0,1)Z \sim N(0, 1) .

XN(μ,σ2)X \sim N(\mu, \sigma^2 )일 때, Z=(Xμ)σZ = \frac {(X-\mu )}{\sigma } 라 하면, ZN(0,1)Z \sim N(0, 1)이다. 즉, 정규분포를 따르는 확률변수를 표준화하면, 그 확률변수는 표준정규분포를 따른다.

2. Cumulative Probability

To compute probabilities for ZZ we will not work with its density function directly but instead read probabilities out of Figure "Cumulative Normal Probability" in Chapter 12 "Appendix". The tables are tables of cumulative probabilities; their entries are probabilities of the form P(Z<z)P(Z<z) . The use of the tables will be explained by the following series of examples.

EXAMPLE 4. Find the probabilities indicated, where as always ZZ denotes a standard normal random variable.

  1. P(Z<1.48)P(Z < 1.48) .

  2. P(Z<0.25)P(Z< −0.25) .

[ Solution ]

  1. Figure 5.10 "Computing Probabilities Using the Cumulative Table" shows how this probability is read directly from the table without any computation required. The digits in the ones and tenths places of 1.48, namely 1.4, are used to select the appropriate row of the table; the hundredths part of 1.48, namely 0.08, is used to select the appropriate column of the table. The four decimal place number in the interior of the table that lies in the intersection of the row and column selected, 0.9306, is the probability sought: P(Z<1.48)=0.9306.P(Z<1.48)=0.9306.

  2. The minus sign in −0.25 makes no difference in the procedure; the table is used in exactly the same way as in part (a): the probability sought is the number that is in the intersection of the row with heading −0.2 and the column with heading 0.05, the number 0.4013. Thus P(Z<0.25)=0.4013P(Z < −0.25) = 0.4013 .

Figure 5.10 Computing Probabilities Using the Cumulative Table

library(Rstat)

# 1. Compute P(Z<1.48)
pnorm(1.48)

# 2. Plot
norm.trans(0, 1, a=-4, b=1.48)

EXAMPLE 5. Find the probabilities indicated.

  1. P(Z>1.60)P(Z > 1.60) .

  2. P(Z>1.02)P(Z > −1.02) .

[ Solution ]

  1. Because the events Z>1.60Z > 1.60 and Z1.60Z ≤ 1.60 are complements, the Probability Rule for Complements implies that P(Z>1.60)=1P(Z1.60)P(Z>1.60)=1−P(Z≤1.60). Since inclusion of the endpoint makes no difference for the continuous random variable ZZ , P(Z1.60)=P(Z<1.60)P(Z≤1.60)=P(Z<1.60) , which we know how to find from the table. The number in the row with heading 1.6 and in the column with heading 0.00 is 0.9452. Thus P(Z<1.60)=0.9452.P(Z<1.60)=0.9452. so P(Z>1.60)=1P(Z1.60)=10.9452=0.0548.P(Z>1.60)=1−P(Z≤1.60)=1−0.9452=0.0548. Figure 5.11 "Computing a Probability for a Right Half-Line" illustrates the ideas geometrically. Since the total area under the curve is 1 and the area of the region to the left of 1.60 is (from the table) 0.9452, the area of the region to the right of 1.60 must be 1−0.9452=0.0548.

2. The minus sign in −1.02 makes no difference in the procedure; the table is used in exactly the same way as in part (a). The number in the intersection of the row with heading −1.0 and the column with heading 0.02 is 0.1539. This means that P(Z<1.02)=P(Z1.02)=0.1539,P(Z<−1.02)=P(Z≤−1.02)=0.1539, hence P(Z>1.02)=1P(Z1.02)=10.1539=0.8461P(Z>−1.02)=1−P(Z≤−1.02)=1−0.1539=0.8461

library(Rstat)

# 1-1. Compute P(Z>1.60)
1-pnorm(1.60)

# 1-2. Plot
norm.trans(0, 1, a=1.60, b=5)

# 2-1. Compute P(Z>-1.02)
1-pnorm(-1.02)

# 2-2. Plot
norm.trans(0, 1, a=-1.02, b=5)

EXAMPLE 6. Find the probabilities indicated.

  1. P(0.5<Z<1.57).P(0.5<Z<1.57).

  2. P(2.55<Z<0.09).P(−2.55<Z<0.09).

[ Solution ]

  1. P(0.5<Z<1.57)=P(Z<1.57)P(Z<0.50)=0.94180.6915=0.2503P(0.5<Z<1.57)=P(Z<1.57)−P(Z<0.50)=0.9418−0.6915=0.2503

2. P(2.55<Z<0.09)==P(Z<0.09)P(Z<2.55)=0.53590.0054=0.5305P(−2.55<Z<0.09)==P(Z<0.09)−P(Z<−2.55)=0.5359−0.0054=0.5305

library(Rstat)

# 1-1. Compute P(0.5<Z<1.57)
a <- 0.5; b <- 1.57
pnorm(b) - pnorm(a)

# 1-2. Plot
norm.trans(0, 1, a=a, b=b)

# 2-1. Compute P(-2.55<Z<0.09)
a <- -2.55; b <- 0.09
pnorm(b) - pnorm(a)

# 2-2. Plot
norm.trans(0, 1, a=a, b=b)

EXAMPLE 7. Find the probabilities indicated.

  1. P(1.13<Z<4.16).P(1.13<Z<4.16).

  2. P(5.22<Z<2.15).P(−5.22<Z<2.15).

[ Solution ]

1.P(1.13<Z<4.16)==P(Z<4.16)P(Z<1.13)=1.00000.8708=0.12921. P(1.13<Z<4.16)==P(Z<4.16)−P(Z<1.13) =1.0000−0.8708=0.1292

2.P(5.22<Z<2.15)=0.98420.0000=0.98422. P(−5.22<Z<2.15)=0.9842−0.0000=0.9842

library(Rstat)

# 1-1. Compute P(1.13<Z<4.16)
a <- 1.13; b<- 4.16
pnorm(b) - pnorm(a)

# 1-2. Plot
norm.trans(0, 1, a=a, b=b)

# 2-1. Compute P(-5.22<Z<2.15)
a <- -5.22; b <- 2.15
pnorm(b) - pnorm(a)

# 2-2. Plot
norm.trans(0, 1, a=a, b=b)

EXAMPLE 8. Find the probabilities indicated.

  1. P(1<Z<1).P(−1<Z<1).

  2. P(2<Z<2).P(−2<Z<2).

  3. P(3<Z<3).P(−3<Z<3).

[ Solution ]

1.P(1<Z<1)=0.84130.1587=0.68261. P(−1<Z<1)=0.8413−0.1587=0.6826

2.P(2<Z<2)=0.97720.0228=0.95442. P(−2<Z<2)=0.9772−0.0228=0.9544

3.P(3<Z<3)=0.99870.0013=0.99743. P(−3<Z<3)=0.9987−0.0013=0.9974

library(Rstat)

# 1-1. Compute P(-1<Z<1)
a <- -1; b <- 1
pnorm(b) - pnorm(a)

# 1-2. Plot
norm.trans(0, 1, a=a, b=b)

# 2-1. Compute P(-2<Z<2)
a <- -2; b <- 2
pnorm(b) - pnorm(a)

# 2-2. Plot
norm.trans(0, 1, a=a, b=b)

# 3-1. Compute P(-3<Z<3)
a <- -3; b <- 3
pnorm(b) - pnorm(a)

# 3-2. Plot
norm.trans(0, 1, a=a, b=b)

3. Plotting Probability Curves using R

You may, on some occasion, want to plot a curve or probability distribution. Here are a couple of approaches to plotting a standard normal curve but, they can be used for other probability distributions. The first comes from Tomas Aragon, the second from John Fox.

3-1. using the sequence operator

This approach uses the normal probability formula…

f(x)=12πσ2e(xμ)22σ2f(x)=\frac{1}{\sqrt{2πσ^2}} e^ \frac{−(x−μ)^2} {2σ^2}

… over a sequence of values.

mu <- 0; sigma <- 1 
x <- seq(-4, 4, .01) 
fx <- (1/sqrt(2*pi*sigma^2))*exp(-(x-mu)^2/(2*sigma^2))
plot(x, fx, type = "l", lwd = 2)  
# options type="l"  produces line,  lwd=2 doubles line width

3-2. another approach

Here, we use the dnorm()function and add a few bells and whistles.

oldpar <- par(mar = c(5, 6, 4, 2) + 0.1)  # room on left
z <- seq(-4, 4, length=1000) 
p <- dnorm(z)
plot(z, p, type="l", lwd=2, 
     main=expression("The Standard Normal Density Function"
    ~~ phi(z)), ylab=expression(phi(z) == frac(1, sqrt(2*pi)) *
     ~~ e^- ~~ frac(z^2, 2)))
abline(h=0, col="gray") ; abline(v=0, col="gray")
z0 <- z[z >= 1.96]    # define region to fill
z0 <- c(z0[1], z0) ;  p0 <- p[z >= 1.96]
p0 <- c(0, p0)
polygon(z0, p0, col="gray")
coords <- locator(2)    # locate head and tail of arrow using your mouse...
arrows(coords$x[1], coords$y[1], coords$x[2], coords$y[2], 
     code=1, length=0.125)
text(coords$x[2], coords$y[2], pos=3, # text above tail arrow
expression(integral(phi(z)*dz, 1.96, infinity) == .025))

3-3. Cumulative Normal Distribution, X ~ N(0,1)

# Cumulative normal distribution plot, X~N(0,1) 
x <- seq(-3, 3, length=200) 
plot(x, pnorm(x, mean=0, sd=1), 
        type='l', 
        main="Cumulative normal distribution, X~N(0,1)")
abline(h=0, col="gray")

标准正态分布 Standard normal distribution 累积分布函数 Cumulative Distribution function(c.d.f)

Last updated