Previously, we looked at evaluating and summarising data. Data is mostly randomly generated, though not necessarily in a purely unpredictable manner, by chance. It is of our interest, therefore, to now turn to games of chance.
Consider a fair 6-sided die with possible values (plural: dice). We collect these outcomes into a set, defined
, and collect sub-collections of these outcomes also as sets.
Example 1. Write down the sub-collection of even-numbered outcomes of the die.
Solution. Since the even-numbered outcomes of the die are , the required subset is
.
Definition 1. Let be sets. We write:
if the two sets have exactly the same elements,
and call
a subset of
if
is a sub-collection of
,
if
is not a subset of
.
For instance, in Example 1, we have . We can illustrate this relationship using a Venn diagram.

Therefore, we will use sets in order to model chance. However, sets alone don’t get at the full picture. We also need to quantify certainty. Intuitively, since there are 3 even numbers in the set , then the probability that we roll an even number on the die should be
. We formalise this idea using sets.
For any (finite) set , let
denote the number of elements in the set. For instance,
and
.
Definition 2. Let be a set, which we usually call the universal set of discourse. Given
, define the uniform probability of
by
Using the language of Definition 2, the probability of rolling an even number on the die is displayed as
Denote for simplicity, because we are lazy. Recall that
.
Remark 1. Observe that , and
. Furthermore, some education systems denote the universal set by the following alternate notation and use them in their assessments:
. In the spirit of learning set-formulated probability theory, we will not follow such practice in these blog posts.
Example 2. What is the probability that we would roll a multiple of ?
Solution. The multiples of are described by the subset
. Therefore, the required probability is
Example 3. What is the probability that we would roll an even multiple of ?
Solution. The even numbers are given by the subset , and the multiples of
are given by the subset
. The common number is
, and therefore the subset of
that contains all even multiples of
is
. Therefore, the required probability is
To capture the idea of “common elements”, we use the notion of the intersection. We can illustrate this common-ness using another Venn diagram.

In order to do that, we need to introduce the idea of “membership”.
Definition 3. Let be a set. We write
to mean that
belongs to
. In this case, we say that
is an element of
. We write
to mean that
does not belong to
.
For instance, if , then
and
. Furthermore, we can write
in set-builder notation:
Definition 4. Let be subsets of
. We call the sub-collection
of common elements the intersection of
and
. Formally, we define this intersection by
For example .
Example 4. What is the probability that we would roll a number that is both odd and even?
Solution. The subset of odd numbers is and the subset of even numbers is
. There…are no numbers in
that belong to both subsets. The required subset is empty:
. In the language of Definition 4,
Therefore, the required probability is
Remark 2. We denote , motivated by the observation
. Furthermore, we say that
are mutually disjoint since
.

Example 5. What is the probability that we would roll a number that is either even or a multiple of ?
Solution. If we require a number to be at least one of these criterion, we allow it to be taken from either of the subsets or
, then the desired subset would be
. We can illustrate this “collaboration” using another Venn diagram.

Therefore, the required probability is
Definition 5. Let be subsets of
. We call the sub-collection
of “collaborated” elements the union of
and
. Formally, we define this union by
For example .
Example 6. What is the probability that we would roll a number that is not a multiple of ?
Solution. By accepting all elements of that are not multiples of
, the desired subset is
. We can visualise this subset, once again, using a Venn diagram.

Therefore, the required probability is
Definition 6. For any , define the complement of
by
For example, .
At this point, alarm bells should ring, since by Remark 1 and Example 2,
Furthermore, we notice that
That is, we can add probabilities of unions of mutually exclusive subsets.
Theorem 1. Let be mutually disjoint subsets of
. Then
Proof. Since are mutually disjoint, every element in
belongs either to
and not
, or
and not
.

Therefore, must equal
, and hence,
Remark 3. This property holds for any number of mutually disjoint subsets:
whenever each whenever
. This result is called the (finite or countable) additivity property of probability.
Corollary 1. Given ,
Proof. By definition, and
. By Remark 1 and Theorem 1,
Therefore, .
Remark 4. In particular, so that
Example 7. Given subsets , not necessarily disjoint, show that
Solution. Consider the Venn diagram below for illustrative purposes.

Given , there are two non-overlapping cases:
and
.
Denoting for brevity,
By Theorem 1,
On the other hand, we observe that given , either
or
. Refer to the zoomed-in Venn diagram below.

Therefore,
By Theorem 1 again,
Making the subject of the equation,
Therefore, we use set notation to describe our intuitive notions of probability. We can formalise these ideas with far more advanced tools, but we shall relegate that rabbit hole as an exercise for the keen reader. We keep these ideas simple for now.
Next time, we solve some simple problems involving probability.
—Joel Kindiak, 18 Mar 26, 1509H
Leave a comment