Welcome to the Heart of Statistics!

Ever wondered what a "typical" score is in a video game, or what the "average" person earns? In Statistics, we don't just guess; we use Measures of Central Tendency. These are clever ways to find the "center" of a data set. Think of it like finding the "typical" or "representative" value that sums up a whole pile of numbers.

In this chapter, we’ll look at the "Big Three" (Mode, Median, and Mean), see how to handle grouped data, and explore some special averages used by experts. Don’t worry if it seems a bit "mathsy" at first—we’ll take it one step at a time!

1. The "Big Three" Averages

There isn't just one way to find an average. Depending on what you are looking at, one might be better than the others.

The Mode (The "Most Common")

The Mode is the value that appears most often in a data set.
Memory Aid: MOde = MOst common.
Example: In the set {2, 3, 3, 5, 8}, the mode is 3 because it appears twice.

Quick Review:
• If no number repeats, there is no mode.
• If two numbers repeat the same number of times, it is bimodal.
• For grouped data, we talk about the Modal Class (the group with the highest frequency).

The Median (The "Middle Value")

The Median is the middle value when the data is put in order.
Memory Aid: The "median" is the concrete strip in the middle of a road!
Step-by-Step:
1. Put the numbers in order from smallest to largest.
2. Find the position using the formula: \( \frac{n+1}{2} \), where \( n \) is how many numbers you have.
3. If you have an even number of values, the median is the mean of the two middle values.

The Mean (The "Arithmetic Average")

This is what most people mean when they say "average." You add everything up and share it out equally.
The Formula: \( \bar{x} = \frac{\sum x}{n} \)
(In plain English: Mean = Total of all values ÷ Number of values)
Example: For 2, 4, 6, the mean is \( (2+4+6) \div 3 = 4 \).

Key Takeaway: The Mode is about popularity, the Median is about the middle, and the Mean uses every single piece of data.

2. Choosing the Best Average

Why do we need three? Because data can be "messy"! Selecting the right one is a key skill for your exam.

Use the Mode when:
• The data is non-numeric (qualitative). Example: Finding the most popular car color. You can't calculate a "mean color"!
• You are a shopkeeper. Example: You need to know the most common shoe size to stock up.

Use the Median when:
• There are outliers (extreme values). Example: If 9 people earn £20k and one CEO earns £10 million, the mean would look huge! The median gives a better "typical" salary.
• The data is skewed (bunched up at one end).

Use the Mean when:
• You want to use all the data available.
• The data is fairly symmetrical with no crazy outliers.

Common Mistake to Avoid: Don't forget to order the data before finding the median! This is the most common way students lose marks.

3. Averages from Frequency Tables

Sometimes, data is given in a table because there is too much to list out.
Example: A table showing the number of goals scored in 20 matches.

Finding the Mean:
1. Create a new column: Frequency × Value (often called \( f \times x \)).
2. Find the Total Frequency (\( \sum f \)).
3. Find the Total of the \( fx \) column (\( \sum fx \)).
4. Mean = \( \frac{\sum fx}{\sum f} \).

Finding the Median: Look for the \( \frac{n+1}{2} \) position in the cumulative frequency.

4. Grouped Data (Estimating the Mean)

When data is grouped (e.g., "Weight: 10kg to 20kg"), we lose the exact values. Because of this, we can only find an estimate of the mean.

The Step-by-Step Process:
1. Find the Midpoint of each class. (Add the start and end of the group and divide by 2).
2. Multiply each Midpoint by its Frequency.
3. Add those totals up.
4. Divide by the Total Frequency.

Higher Tier Tip - Linear Interpolation: For the median of grouped data, you might be asked to use interpolation. This is just a fancy way of saying "estimating where the middle value sits inside its group."
\( \text{Median} = L + \left( \frac{\frac{n}{2} - F}{f} \right) \times w \)
Where \( L \) is the lower bound of the class, \( F \) is the cumulative frequency before the class, \( f \) is the frequency of the class, and \( w \) is the class width. Don't panic! Just think of it as finding how far "through" the group you need to go.

5. Higher Tier: Special Averages

If you are taking the Higher Tier, you need to know these three extra tools:

Weighted Mean

Used when some numbers are more important than others.
Formula: \( \frac{\sum (value \times weight)}{\sum weights} \)
Example: Your final grade might be 30% from a test and 70% from an exam. The exam "weighs" more!

Geometric Mean

Used for finding the average growth rate or percentage changes over time.
Formula: \( \text{Geometric Mean} = \sqrt[n]{x_1 \times x_2 \times ... \times x_n} \)
(The \( n^{th} \) root of the product of all values).
Did you know? We use this for money and population growth because they multiply, rather than add!

Mean Seasonal Variation

Used in Time Series. It helps us find the average "extra" or "less" we expect during a specific season (like higher ice cream sales in summer) once the general trend is removed.

6. Changing the Data (Transformations)

What happens to the averages if we change every piece of data?
Addition/Subtraction: If you add 5 to every score, the Mean, Median, and Mode all increase by 5.
Multiplication (Scaling): If you double every score, the Mean, Median, and Mode all double.

What if we add one more person?
• If you add a value higher than the current mean, the mean will go up.
• If you add a value equal to the mean, the mean stays the same.

Final Summary Checklist

Key Takeaways:
1. Mode is for popularity/categories.
2. Median is the middle (great for skewed data with outliers).
3. Mean is the total shared out (uses all data).
4. For Grouped Data, always use the Midpoint to estimate the mean.
5. Transformations: Averages follow the same math you apply to the data (add 10 to data = add 10 to average).

Don't worry if this seems tricky at first! Practice with some simple lists of numbers, then move on to tables. You've got this!