Discrete Random Variables

A type of variable that takes distinct, countable values, often used to model finite outcomes. 📊

Discrete Random Variable

A discrete random variable is a mapping X of all of the outcomes of a sample space to numerical values x \in \mathbb{Z}.

X: Outcome \in Sample \ Space \to x \in \mathbb{Z}

Probability Mass Function (PMF)

A probability mass function (PMF) is a mapping of values x of discrete random variables X to probabilities [0,1].

p_{X}(x) = P(X = x)

PMF has properties: p_{X}(x) \geq 0, \quad \sum_{x} p_{X}(x) = 1

Cumulative Distribution Function (CDF)

The cumulative distribution function (CDF) for discrete random variables is defined as:

F_{X}(x) = P(X \leq x) = \sum_{k \leq x} p_{X}(k)

Relation to PMF: p_{X}(x) = \frac{dF_{X}(x)}{dx}

F_{X}(x) = \sum_{k \leq x}p_{X}(k)

Illustration: PMF and CDF

Expectation and Variance of PMFs

Discrete expectation, or mean, is the average numerical value that the discrete random variable takes over the PMF. E[X] = \sum_{x} x \ p_{X}(x)

Discrete variance is the expected squared difference from the mean of a PMF. Var[X] = E[(X - E[X])^{2}] = \sum_{x}(x - E[X])^2 p_{X}(x)

Joint and Marginal PMFs

The joint PMF calculates the intersection of two discrete random variables: p_{X,Y}(x_{1},x_{2}) = P(X = x_{1}, Y = x_{2})

From the previous definition we can also compute the marginal PMF for a particular random variable: p_{X}(x_{1}) = \sum_{x_{2}} p_{X,Y}(X = x_{1},Y = x_{2})

Conditional Expectation of PMFs

The conditional PMF gives the probability distribution of a discrete random variable given that another variable has a specific value. p_{X|Y}(x_{1}| x_{2}) = \frac{p_{X,Y}(x_{1}, x_{2})}{p_{Y}(x_{2})}

The conditional expectation is the expected value of a discrete random variable given that another variable is fixed at a specific value. E[X| Y = x_{2}] = \sum_{x_{1}} x_{1} \frac{p_{X,Y}(x_{1}, x_{2})}{p_{Y}(x_{2})}

Independence

Two discrete random variables are independent if:

p_{X,Y}(x_{1}, x_{2}) = p_{X}(x_{1}) p_{Y}(x_{2})

For multiple discrete random variables: p_{X_{1},...,X_{n}}(x_{1}, ..., x_{n}) = p_{X_{1}}(x_{1}) ... \ p_{X_{n}}(x_{n})