Welcome to Discrete Random Variables!

In this chapter, we are going to explore Discrete Random Variables (DRVs). Don't let the name intimidate you! It’s essentially a fancy way of using math to predict what might happen in situations involving chance—like rolling dice, playing a board game, or even predicting how many customers will walk into a shop. By the end of these notes, you’ll be able to calculate averages and "spread" for these scenarios like a pro.

1. Understanding Discrete Random Variables (DRVs)

A Random Variable is a quantity whose value depends on the outcome of a random event. We call it Discrete because it can only take specific, distinct values (like 1, 2, or 3), rather than any value in a range (like 1.572...).

How we represent DRVs

We usually represent the variable with a capital letter, like \(X\), and the specific values it can take with a lowercase letter, \(x\). There are two main ways to show a distribution:

1. Probability Distributions in Tables: This is a simple table showing every possible value of \(X\) and its matching probability \(P(X = x)\).

2. Probability Mass Functions (PMF): This is a formula used to calculate the probability for any given \(x\), often written as \(P(X = x) = f(x)\).

The Golden Rule: For any valid DRV, the sum of all probabilities must equal 1. Mathematically, this is written as: \(\sum P(X = x) = 1\).

Example: If you flip a coin and get £2 for heads and £0 for tails, your DRV \(X\) takes values {0, 2} each with a probability of 0.5.

Quick Review:
- Discrete: Countable values only.
- Random: Based on chance.
- Variable: Can change.
- Total Probability: Always adds up to 1!

Key Takeaway: A DRV is just a map that links outcomes of an experiment to numbers and their probabilities.

2. Average and Spread: Mean, Mode, and Median

Just like in GCSE stats, we want to find the "center" and the "spread" of our data. However, since we are dealing with probabilities, the methods look slightly different.

The Mode

The Mode is simply the value of \(x\) that has the highest probability. Look at your table or formula and find the "winner" with the biggest \(P(X = x)\).

The Median

The Median is the middle value. To find it, you add up the probabilities (cumulative probability) until you reach or cross 0.5. The value of \(x\) where this happens is your median.

The Mean (Expectation)

In Further Maths, we call the mean the Expectation, written as \(E(X)\). Think of this as the average value you would get if you ran the experiment thousands of times.

The formula is: \(E(X) = \sum x_i p_i\)
Translation: Multiply every value of \(x\) by its probability, then add them all up.

Key Takeaway: \(E(X)\) isn't necessarily a value that \(X\) can actually be (e.g., the average of a die roll is 3.5), but it represents the long-term average.

3. Variance and Standard Deviation

The Variance, written as \(Var(X)\), tells us how much the values of \(X\) vary from the mean. A high variance means the outcomes are very spread out; a low variance means they are consistent.

The Calculation Steps

To find the variance, we first need \(E(X^2)\). This is the "Expectation of \(X\) squared."
Formula: \(E(X^2) = \sum x_i^2 p_i\)
(Square each \(x\) value first, then multiply by its probability, then add them up.)

Now, use the Variance Formula:
\(Var(X) = E(X^2) - [E(X)]^2\)

Memory Aid: A common rhyme to remember this is: "The mean of the squares minus the square of the mean."

Standard Deviation

The Standard Deviation is simply the square root of the variance: \(\sigma = \sqrt{Var(X)}\).

Common Mistake to Avoid: Don't forget to square the mean (\(E(X)\)) before subtracting it from \(E(X^2)\)! This is the most frequent error students make.

Key Takeaway: Variance measures risk or spread. The formula \(E(X^2) - [E(X)]^2\) is your best friend in this chapter.

4. Linear Functions of DRVs

Sometimes, we might change our DRV. For example, if you are playing a game where you win \(X\) amount of money, but the host adds a £5 bonus and then doubles your total winnings. This is a linear transformation, usually written as \(aX + b\).

The Rules for Expectation and Variance

1. For Expectation: It follows the rules exactly.
\(E(aX + b) = aE(X) + b\)
(If you double the scores and add 5, the average doubles and increases by 5.)

2. For Variance: It’s a bit more sensitive!
\(Var(aX + b) = a^2 Var(X)\)
(Adding a constant \(b\) doesn't change the spread at all, but multiplying by \(a\) increases the variance by \(a^2\).)

Analogy: Imagine a group of people standing in a line. If everyone takes 5 steps to the right (adding \(b\)), the "average" position moves, but the distance between the people (the spread) stays exactly the same. That's why \(+b\) disappears in the variance formula!

Key Takeaway: When transforming variance, always square the multiplier (\(a\)) and ignore the added constant (\(b\)).

5. The Discrete Uniform Distribution

A Discrete Uniform Distribution is a special case where every outcome has the exact same probability. The most common example is a fair, six-sided die where every number from 1 to 6 has a probability of \(1/6\).

If \(X\) is defined on the set \(\{1, 2, ..., n\}\), then \(P(X = x) = 1/n\) for every value.

Shortcut Formulae

Instead of doing long tables, we can use these "cheat codes" (which you need to be able to prove):
- Mean: \(E(X) = \frac{n + 1}{2}\)
- Variance: \(Var(X) = \frac{n^2 - 1}{12}\)

Did you know? These formulae come from the sum of the first \(n\) integers. For example, the sum of numbers 1 to \(n\) is \(\frac{n(n+1)}{2}\). Dividing this by \(n\) (the number of values) gives you the mean: \(\frac{n+1}{2}\).

Key Takeaway: Use these shortcuts only when you are certain every outcome has the same probability and the values are the integers 1 to \(n\).

Final Chapter Summary

- DRVs: Use tables or functions; total probability must be 1.
- Expectation \(E(X)\): The average outcome (\(\sum xp\)).
- Variance \(Var(X)\): The spread of outcomes (\(E(X^2) - [E(X)]^2\)).
- Coding: \(E(aX+b)\) stays linear; \(Var(aX+b)\) squares the multiplier and drops the constant.
- Uniform DRVs: Use the \(\frac{n+1}{2}\) and \(\frac{n^2-1}{12}\) shortcuts for fair, sequential integer outcomes.

Don't worry if the variance formula feels a bit clunky at first. With a few practice problems, it will become second nature!