Welcome to Discrete Random Variables!
In your previous maths studies, you’ve likely dealt with variables that have a single, fixed value. In Statistics, we step into a world where values are uncertain—they depend on chance. A Discrete Random Variable (DRV) is essentially a way of mapping the outcomes of a random process (like rolling a die or counting hits on a website) to numbers.
Don't worry if this seems a bit abstract at first! Think of a DRV as a "mathematical menu" that tells you what could happen and how likely each course is. By the end of these notes, you’ll be able to calculate averages, measure "spread," and use special models to predict everything from sports results to the number of chocolate chips in a cookie.
1. The Basics: What is a Discrete Random Variable?
A Random Variable (usually denoted by a capital letter like \( X \)) is a quantity whose value is determined by the outcome of a random event. It is Discrete if it can only take certain distinct values (like 0, 1, 2...), rather than any value in a range.
Probability Distributions
A probability distribution is just a complete list of all possible values for a DRV and their associated probabilities. You will usually see these represented in a table or as a function, \( P(X = x) = f(x) \).
Example: Let \( X \) be the number of heads when flipping a fair coin twice.The possible values are \( x = 0, 1, 2 \). The table would look like this:
- \( P(X=0) = 0.25 \)
- \( P(X=1) = 0.50 \)
- \( P(X=2) = 0.25 \)
Quick Review Box: The Golden Rule
The sum of all probabilities in a distribution must equal 1.
\( \sum P(X = x) = 1 \)
Common Mistake to Avoid: Forgetting that \( x \) represents the value (e.g., "3 goals") while \( P(X=x) \) represents the chance (e.g., "0.1"). Don't mix them up in your calculations!
2. Expectation and Variance
Once we have a distribution, we want to know two things: What is the "average" outcome, and how much do the outcomes vary?
Expectation (The Mean)
The Expectation, \( E(X) \), is the long-term average value if you ran the experiment many, many times. It is also called the mean, \( \mu \).
Formula: \( E(X) = \mu = \sum x_i p_i \)
Analogy: Imagine a seesaw. The Expectation is the "balance point" of the distribution where all the probabilities on either side would perfectly balance out.
Variance
The Variance, \( Var(X) \), measures the spread. A high variance means the outcomes are very spread out; a low variance means they are clustered near the mean.
Formula: \( Var(X) = \sigma^2 = \sum x_i^2 p_i - \mu^2 \)
Memory Aid: "The mean of the squares minus the square of the mean."
1. Calculate \( E(X^2) \) by squaring each \( x \) and multiplying by its probability.
2. Subtract the square of the mean you found earlier.
Linear Coding (Changing the Scale)
What happens if you double all the values and add 5? We use these handy rules:
1. \( E(aX + b) = aE(X) + b \) (The mean changes exactly as you'd expect).
2. \( Var(aX + b) = a^2 Var(X) \) (Adding a constant doesn't change the spread, but multiplying by \( a \) increases variance by \( a^2 \)).
Key Takeaway: Expectation is the "where is it?" (location), and Variance is the "how wide is it?" (spread).
3. The Discrete Uniform Distribution
This is the simplest distribution. It occurs when every outcome is equally likely.
Notation: \( X \sim U(n) \) where \( X \) takes values \( 1, 2, ..., n \).
Real-world example: Rolling a fair six-sided die. Every number from 1 to 6 has a probability of \( \frac{1}{6} \).
Did you know? Because every outcome has the same chance, the mean is always exactly in the middle of the range!
4. The Binomial Distribution
You’ve seen this in A Level Maths, but in Further Maths, we focus on its "summary" properties.
Conditions: Fixed number of trials (\( n \)), two outcomes (success/failure), and a constant probability of success (\( p \)).
Formulae for \( X \sim B(n, p) \):
1. Mean: \( E(X) = np \)
2. Variance: \( Var(X) = np(1 - p) \)
Step-by-step: If you shoot 10 basketball hoops (\( n=10 \)) and your success rate is 70% (\( p=0.7 \)), your expected number of hoops is \( 10 \times 0.7 = 7 \). Your variance is \( 10 \times 0.7 \times 0.3 = 2.1 \).
5. The Geometric Distribution
This distribution is used when we are counting the number of trials until the first success occurs.
Notation: \( X \sim Geo(p) \)
Key Formulae:
- Probability of first success on trial \( x \): \( P(X=x) = (1-p)^{x-1}p \)
- Probability it takes more than \( x \) trials: \( P(X > x) = (1-p)^x \)
- Mean: \( E(X) = \frac{1}{p} \)
- Variance: \( Var(X) = \frac{1-p}{p^2} \)
Analogy: Think of trying to get a "6" on a die. The probability is \( \frac{1}{6} \). The expected number of rolls to get your first "6" is \( \frac{1}{1/6} = 6 \) rolls.
Common Mistake: Students often confuse \( P(X=x) \) and \( P(X > x) \). Remember, \( P(X > x) \) is just the probability that you failed \( x \) times in a row!
6. The Poisson Distribution
The Poisson distribution models the number of times an event occurs in a fixed interval of time or space.
Notation: \( X \sim Po(\lambda) \) where \( \lambda \) (lambda) is the average rate of occurrence.
Conditions (The "SIM" Rule):
- Singly: Events occur one at a time.
- Independently: One event doesn't affect the next.
- Maintain a constant rate: The average rate (\( \lambda \)) stays the same.
The Math:
- Probability formula: \( P(X = x) = \frac{e^{-\lambda} \lambda^x}{x!} \)
- The Magic Property: For a Poisson distribution, Mean = Variance.
\( E(X) = \lambda \) and \( Var(X) = \lambda \).
Summing Poisson Variables:
If you have two independent Poisson variables, \( X \sim Po(\lambda_1) \) and \( Y \sim Po(\lambda_2) \), their sum is also Poisson:
\( X + Y \sim Po(\lambda_1 + \lambda_2) \)
Example: If a shop gets 3 customers per hour in the morning and 5 in the afternoon, the total daily customers (assuming 1 hour of each) follows \( Po(3 + 5) = Po(8) \).
Final Quick-Tips for Success
- Calculator Skills: Ensure you know how to use your calculator's distribution menu (Binomial, Poisson) to find probabilities quickly.
- Check the Context: Are you counting successes in fixed trials (Binomial)? Trials until success (Geometric)? Or events in a time period (Poisson)? Identifying the model is 80% of the battle!
- Rounding: Keep your mean (\( \mu \)) exact when calculating variance to avoid "rounding creep."
Summary: You now have the tools to describe any discrete random event. You can find its average (Expectation), its consistency (Variance), and apply specialized models (Uniform, Binomial, Geometric, Poisson) to solve complex real-world problems. Keep practicing, and these patterns will become second nature!