Welcome to the Geometric Distribution!

Ever wondered how many times you would have to flip a coin before you finally saw "Heads"? Or how many packs of football cards you’d need to buy to find that one rare player? In Statistics, when we are waiting for the first success to happen, we use the Geometric Distribution.

Don’t worry if statistics sometimes feels like a mountain of formulas. We are going to break this down into small, manageable steps. By the end of these notes, you’ll see that the Geometric Distribution is just a pattern of "fails" followed by one "win." Let’s get started!

1. When Do We Use It? (Conditions for Application)

Before we start calculating, we need to know if the Geometric Distribution is the right tool for the job. For a situation to be "Geometric," it must follow these rules:

1. Two Outcomes: Each trial has only two possibilities: Success or Failure. (Example: Passing or failing a test).
2. Independent Trials: One attempt doesn't affect the next. (Example: If you miss a trick shot once, your chance of hitting it the next time stays the same).
3. Constant Probability: The probability of success, which we call \(p\), must stay exactly the same for every trial.
4. The "Until" Rule: We keep going until the first success happens. Once we succeed, we stop.

Memory Aid: Think of "The Persistent Toddler"
Imagine a toddler trying to stack a block. They fail, fail, fail... and the moment they succeed, they clap and stop. That’s a Geometric Distribution!

Quick Review: We use the notation \(X \sim Geo(p)\), where \(X\) is the number of trials required to get the first success, and \(p\) is the probability of success in a single trial.

2. Calculating Probabilities

Let's say the probability of success is \(p\). This means the probability of failure is \(q = 1 - p\).

Finding the probability of success on exactly the \(x^{th}\) trial

If the success happens on the \(x^{th}\) attempt, it means you must have failed \(x-1\) times in a row, and then succeeded on the very last try.

The Formula:
\(P(X = x) = q^{x-1} \times p\)

Example: If you have a 0.2 chance of hitting a target, what is the probability you hit it for the first time on your 4th try?
1. Success (\(p\)) = 0.2
2. Failure (\(q\)) = 0.8
3. We need 3 failures then 1 success: \(0.8 \times 0.8 \times 0.8 \times 0.2\)
4. Calculation: \(0.8^3 \times 0.2 = 0.1024\)

Finding the probability of success within a certain number of trials

Sometimes we want to know the probability that success happens on or before the \(x^{th}\) trial. This is written as \(P(X \le x)\).

The Formula:
\(P(X \le x) = 1 - q^x\)

Why does this work?
Think of it logically: The only way you don't succeed within \(x\) trials is if you fail every single time for \(x\) trials. The probability of failing \(x\) times is \(q^x\). So, the probability of not failing every time is \(1 - q^x\). This is a great shortcut to save you time!

Key Takeaway: Use \(P(X > x) = q^x\) when you want to find the probability that the first success takes more than \(x\) trials.

3. Mean and Variance

In the exam, you need to know how to find the "average" (Mean) and the "spread" (Variance) of a Geometric Distribution.

The Mean: \(E(X) = \frac{1}{p}\)
This makes total sense! If you have a 1 in 10 chance of winning (\(p=0.1\)), you would expect to play 10 times to win once (\(1 / 0.1 = 10\)).

The Variance: \(Var(X) = \frac{1-p}{p^2} = \frac{q}{p^2}\)
This tells us how much the number of trials might vary from the average.

Did you know?
The Geometric Distribution is memoryless. This means if you've already failed 10 times, the probability that you succeed on the 11th trial is still just \(p\). The "universe" doesn't owe you a win just because you've been losing!

4. Derivations (For the Brave!)

The syllabus requires you to understand where these formulas come from. Don't worry if this seems tricky at first; it's mostly about using your knowledge of geometric series from Pure Maths.

Deriving the Mean \(E(X)\)

By definition, the expected value is the sum of (value \(\times\) probability):
\(E(X) = \sum x \cdot P(X=x) = 1p + 2qp + 3q^2p + 4q^3p + ...\)
Factor out the \(p\):
\(E(X) = p(1 + 2q + 3q^2 + 4q^3 + ...)\)
The part in the brackets is the derivative of the geometric series \(1 + q + q^2 + q^3...\) which is \(\frac{1}{1-q}\).
Through calculus/algebraic manipulation, the bracketed sum becomes \(\frac{1}{(1-q)^2}\).
Since \(1-q = p\), we get:
\(E(X) = p \times \frac{1}{p^2} = \frac{1}{p}\)

Quick Review Box:
Expected Value: \(1/p\)
Variance: \(q/p^2\)
\(P(X \le x)\): \(1 - q^x\)
Always remember: \(q = 1 - p\)

5. Common Mistakes to Avoid

1. Using the wrong "x": In the formula \(q^{x-1}p\), the power is \(x-1\), not \(x\). If you want the success on the 5th trial, you only have 4 failures.
2. Forgetting the "Until" rule: Students often confuse Geometric with Binomial. Remember: Binomial has a fixed number of trials (e.g., flip 10 times). Geometric has a fixed number of successes (e.g., flip until you get 1 head).
3. Misinterpreting "More than": If a question asks for \(P(X > 5)\), it means you failed the first 5 trials. The formula is simply \(q^5\). You don't need to do a massive summation!

Summary of Geometric Distribution

• It models the number of trials to get the first success.
• Success probability \(p\) must be constant and trials must be independent.
• The average number of trials is \(1/p\).
• The probability of failing until trial \(x\) is \(q^x\).
• Always double-check if the question is asking for "exactly," "more than," or "at most."

You've got this! Practice a few questions where you calculate \(E(X)\) and \(P(X=x)\) to make these formulas feel like second nature.