Introduction: The Shape of the World

Welcome to one of the most famous topics in all of Statistics! Have you ever noticed that most people are "average" height, and very few are exceptionally tall or short? Or that most apples in a bag weigh about the same, with only a few being tiny or massive? This "natural" pattern is what we call the Normal Distribution.

In this chapter, we will learn how to model these patterns using math. For your OCR MEI H640 exam, this is a vital tool because it allows us to calculate probabilities for things that happen in real life. Don't worry if this seems tricky at first—once you see the "Bell Curve" shape, it all starts to click!

1. What is the Normal Distribution?

The Normal Distribution is a continuous probability distribution. This means it deals with data that can take any value (like time, weight, or height), rather than "counting" data (like the number of goals scored).

Key Characteristics

  • Bell-Shaped: It is perfectly symmetrical around the middle.
  • The Mean (\(\mu\)): The peak of the curve is exactly at the mean. This is also the median and the mode!
  • Points of Inflection: The curve changes from "curving down" to "curving up" exactly one standard deviation (\(\sigma\)) away from the mean.
  • Total Area: The total area under the curve is exactly 1 (representing a total probability of 100%).

The Notation

We write it like this: \(X \sim N(\mu, \sigma^2)\)
Where:
- \(X\) is our random variable.
- \(\mu\) is the mean.
- \(\sigma^2\) is the variance (remember: \(\sigma\) is the standard deviation).

Quick Review Box:
If you see \(X \sim N(50, 16)\), the mean is 50 and the standard deviation is \(\sqrt{16} = 4\). Common Mistake: Students often forget to square root the second number to get the standard deviation!

2. The "Bell Curve" Shape and Symmetry

Because the curve is symmetrical, we know that 50% of the data is below the mean and 50% is above the mean. This makes calculations much easier!

Did you know?

In a Normal Distribution:
- About 68% of the data lies within 1 standard deviation of the mean (\(\mu \pm \sigma\)).
- About 95% lies within 2 standard deviations (\(\mu \pm 2\sigma\)).
- About 99.7% lies within 3 standard deviations (\(\mu \pm 3\sigma\)).

3. Calculating Probabilities

In your exam, you won't have to do the complicated calculus behind this curve. Instead, you will use your statistical calculator.

How to use your calculator:

1. Find the Distribution menu.
2. Select Normal CD (Cumulative Distribution).
3. Enter your Lower bound, Upper bound, \(\sigma\) (standard deviation), and \(\mu\) (mean).
4. Example: To find \(P(X < 60)\) when \(X \sim N(50, 16)\), your Lower would be a very small number (like -9999) and your Upper would be 60.

Analogy: Think of the probability as the amount of paint needed to cover the area under the curve between two points. The more area you cover, the higher the probability!

4. Standardizing: The Z-score

Sometimes we want to compare two different normal distributions (like comparing a score in a hard Math test to a score in an easy English test). To do this, we "Standardize" them into a Standard Normal Distribution, which we call \(Z\).

\(Z \sim N(0, 1)\) (Mean is 0, Variance is 1).

The Formula:

\(Z = \frac{X - \mu}{\sigma}\)

This tells you how many standard deviations a value is away from the mean.

5. The Continuity Correction

The syllabus mentions that we sometimes use the Normal distribution to model discrete data (like the number of people who pass a test). Because the Normal distribution is continuous, we have to make a small adjustment.

If you want to find the probability of "at least 10" in discrete data, you would look at the area from 9.5 upwards in the Normal model. Imagine each whole number occupies a "box" that stretches 0.5 units on either side.

Key Takeaway: Use the continuity correction only when you are using a continuous curve (Normal) to estimate discrete counts (like Binomial data).

6. Hypothesis Testing for the Mean

This is a big part of the MEI syllabus. We want to test if the average of a sample suggests that the whole population's mean has changed.

The Distribution of the Sample Mean

If we take a sample of size \(n\), the mean of that sample (\(\bar{X}\)) follows its own Normal distribution:
\(\bar{X} \sim N(\mu, \frac{\sigma^2}{n})\)

Crucial Point: The variance of the sample mean is much smaller than the original variance! It is the original variance divided by the sample size (\(n\)).

Step-by-Step Hypothesis Test:

1. State Hypotheses: \(H_0: \mu = \text{value}\) and \(H_1: \mu \neq, <, \text{ or } > \text{value}\).
2. Find the Distribution: Write down \(\bar{X} \sim N(\mu, \frac{\sigma^2}{n})\).
3. Calculate the p-value: Use your calculator to find the probability of getting your sample mean (or something more extreme).
4. Compare: If the p-value is less than the significance level (e.g., 0.05), Reject \(H_0\).
5. Context: Always write your conclusion in words related to the question (e.g., "There is sufficient evidence to suggest the mean weight of the apples has decreased.").

Common Mistake: Forgetting to divide the variance by \(n\) when doing a hypothesis test. Always check your sample size!

Summary: What you must remember

  • The Model: \(X \sim N(\mu, \sigma^2)\). The curve is symmetrical around \(\mu\).
  • Standardizing: \(Z = \frac{X - \mu}{\sigma}\) converts any normal distribution to the \(Z\) distribution.
  • Probability: This is the area under the curve. Use Normal CD on your calculator.
  • Sample Means: When testing a sample of size \(n\), use the standard deviation \(\frac{\sigma}{\sqrt{n}}\).
  • Symmetry: Use the fact that the curve is mirrored to solve problems where you are given a probability and need to find a value (Inverse Normal).

Keep practicing those calculator steps! Once you are comfortable with the buttons, the Normal Distribution becomes one of the most predictable and high-scoring sections of your Statistics paper.