3-1. Sample Spaces, Events, and Their Probabilities
样本空间和事件和概率
样本空间和事件 (概率论) - 文氏图(Venn Diagram) - 树形图(Tree Diagram)
概率(Probability)
1. Sample Spaces and Events
A random experiment is a mechanism that produces a definite outcome that cannot be predicted with certainty. The sample space associated with a random experiment is the set of all possible outcomes. An event is a subset of the sample space.
An event E is said to occur on a particular trial of the experiment if the outcome observed is an element of the set E.
EXAMPLE 1. Construct a sample space for the experiment that consists of tossing a single coin.
[ Solution ]
# install.packages("prob")
library(prob)
tosscoin(1)
EXAMPLE 2. Construct a sample space for the experiment that consists of rolling a single die. Find the events that correspond to the phrases “an even number is rolled” and “a number greater than two is rolled.”
[ Solution ] , ,
library(prob)
## 1. Sample Space : rolling a single die
rolldie(1)
# 2. Event 1 : an even number is rolled
S <- rolldie(1)
E1 <- subset(S, X1 %% 2 ==0); E1
# 3. Event 2 : a number greater than two is rolled.
S <- rolldie(1)
E2 <- subset(S, X1 > 2); E2
1-1. Venn Diagram
A graphical representation of a sample space and events is a Venn diagram

EXAMPLE 3. A random experiment consists of tossing two coins.
Construct a sample space for the situation that the coins are indistinguishable, such as two brand new pennies.
Construct a sample space for the situation that the coins are distinguishable, such as one a penny and the other a nickel.
[ Solution ]
two same coins : two head -> 2h, two tails -> 2t, 2 different faces : d =>
two different coins (penny, nickel) :
1-2. Venn Diagram Plot in R
type of count data.
A 450
B 1800
A and B both 230
I want to develop a colorful (possibly semi-transparency at intersections) like the following Venn diagram.
require(venneuler)
v <- venneuler(c(A=450, B=1800, "A&B"=230))
plot(v)
# install package
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("VennDiagram")
library("VennDiagram")
# 2-set diagram
venn.plot <- draw.pairwise.venn(30, 20, 10, c("A-up", "B-up"), scaled = FALSE);
venn.plot <- draw.triple.venn(
area1 = 60,
area2 = 70,
area3 = 80,
n12 = 30,
n23 = 20,
n13 = 10,
n123 = 5,
category = c("A_up-regulation", "B_up-regulation", "C_up- regulation"),
);
# 3-set diagram
venn.plot <- draw.triple.venn(
area1 = 60,
area2 = 70,
area3 = 80,
n12 = 30,
n23 = 20,
n13 = 10,
n123 = 5,
category = c("A_up-regulation", "B_up-regulation", "C_up- regulation"),
);
# 4-set diagram
venn.plot <- draw.quad.venn(
area1 = 90,
area2 = 80,
area3 = 75,
area4 = 49,
n12 = 37,
n13 = 25,
n14 = 26,
n23 = 34,
n24 = 30,
n34 = 22,
n123 = 16,
n124 = 15,
n134 = 10,
n234 = 12,
n1234 = 3,
category = c("First", "Second", "Third", "Fourth"),
fill = c("orange", "red", "green", "blue"),
lty = "dashed",
cex = 2,
cat.cex=2,
cat.col = c("orange", "red", "green", "blue")
);
[ 참고자료 - Venn Diagram ]
1-3. tree diagram
A device that can be helpful in identifying all possible outcomes of a random experiment, particularly one that can be viewed as proceeding in stages, is what is called a tree diagram.
EXAMPLE 4. Construct a sample space that describes all three-child families according to the genders of the children with respect to birth order.
[ Solution ] , g=girl ; b=boy

The line segments are called branches of the tree. The right ending point of each branch is called a node. The nodes on the extreme right are the final nodes; to each one there corresponds an outcome, as shown in the figure.
1-4. Tree Diagram in R






2. Probability
The probability of an outcome e in a sample space S is a number p between 0 and 1 that measures the likelihood that e will occur on a single trial of the corresponding random experiment. The value corresponds to the outcome e being impossible and the value corresponds to the outcome e being certain.
The probability of an event A is the sum of the probabilities of the individual outcomes of which it is composed. It is denoted .
If an event is , then

EXAMPLE 5. A coin is called “balanced” or “fair” if each side is equally likely to land up. Assign a probability to each outcome in the sample space for the experiment that consists of tossing a single fair coin.
[ Solution ] ,
# install.packages("prob")
library(prob)
tosscoin(1, makespace = TRUE)
EXAMPLE 6. A die is called “balanced” or “fair” if each side is equally likely to land on top. Assign a probability to each outcome in the sample space for the experiment that consists of tossing a single fair die. Find the probabilities of the events : “an even number is rolled” and : “a number greater than two is rolled.”
[ Solution ]
,
,
library(prob)
# 1. Sample Space
S <- rolldie(1, makespace = TRUE); S
# 2. P(E)
E <- subset(S, X1 %% 2 == 0); E
Prob(S, X1 %% 2 == 0) # or Prob(E)
# 3. P(T)
T <- subset(S, X1 > 2); T
Prob(S, X1 > 2) # or Prob(T)
EXAMPLE 7. Two fair coins are tossed. Find the probability that the coins match, i.e., either both land heads or both land tails.
[ Solution ]
identical coins : , =>
two different coins : , =>
[ Solution 1 ]
library(prob)
a <- tosscoin(2, makespace = TRUE); a
S1 <- subset(a, toss1 == toss2); S1
S2 <- subset(a, toss1 != toss2); S2
S2[,1] <- "D" ; S2
S2[,2] <- "D" ; S2
# 1) Sample Space
S <- union(S1, S2)
S$probs <- 1/3; S
# 2) Probability that the coins match..
Prob(S, toss1 == "H" | toss2 =="T")
[ Solution 2 ]
library(prob)
# 1. Sample Space
S <- tosscoin(2, makespace = TRUE); S
# 2. P(E)
Prob(S, toss1 == toss2)
EXAMPLE 8. The breakdown of the student body in a local high school according to race and ethnicity is 51% white, 27% black, 11% Hispanic, 6% Asian, and 5% for all others. A student is randomly selected from this high school. (To select “randomly” means that every student has the same chance of being selected.) Find the probabilities of the following events:

: the student is black,
: the student is minority (that is, not white),
: the student is not black.
[ Solution ]
library(prob)
X1 <- c("w", "b", "h", "a", "o")
probs <- c(0.51, 0.27, 0.11, 0.06, 0.05)
# 1. Sample Space
S <- data.frame(X1, probs)
# 2. P(B)
Prob(S, X1 == "b")
# 3. P(M) = 1 - P(w)
1 - Prob(S, X1 == "w")
# 4. P(N) = 1 - P(b)
1 - Prob(S, X1 == "b")
EXAMPLE 9. The student body in the high school considered in "Example 8" may be broken down into ten categories as follows: 25% white male, 26% white female, 12% black male, 15% black female, 6% Hispanic male, 5% Hispanic female, 3% Asian male, 3% Asian female, 1% male of other minorities combined, and 4% female of other minorities combined. A student is randomly selected from this high school. Find the probabilities of the following events:

: the student is black,
: the student is minority female,
: the student is female and is not black.
[ Solution ]
sex <- c("Male", "Female")
race <- c("w", "b", "h", "a", "o")
probs <- c(0.25, 0.12, 0.06, 0.03, 0.01, 0.26, 0.15, 0.05, 0.03, 0.04)
prob <- matrix( probs, ncol=5, byrow=TRUE)
rownames(prob) <- sex
colnames(prob) <- race
Prob <- as.table(prob); Prob
addmargins(Prob)
# 1. P(B)
sum(Prob[,"b"])
# 2. P(MF) = P(F) - P(wf)
sum(Prob["Female",]) - Prob["Female", "w"]
# 3. P(FN) = P(F) - P(bf)
sum(Prob["Female",]) - Prob["Female", "b"]
样本空间和事件 (概率论) - 文氏图(Venn Diagram) - 树形图(Tree Diagram)
概率(Probability)
Last updated
Was this helpful?