Welcome to the World of Geographical Data!
In Geography, we aren't just looking at pretty pictures of mountains or maps of cities. We are also "data detectives." We collect information (data) and use statistical skills to find patterns and tell the story of what is happening in our world. Don't worry if you aren't a "maths person"—these skills are easy to master with a bit of practice, and they will help you get top marks in all three of your AQA GCSE papers!
1. Finding the "Middle": Measures of Central Tendency
Sometimes we have a huge list of numbers, like the amount of rainfall in a city over 30 days. To understand the data, we look for the "average" or the center.
The Mean (The Average)
This is what most people mean when they say "the average." You add everything up and divide by how many numbers you have.
Formula: \(\text{Mean} = \frac{\text{Total Sum of All Values}}{\text{Number of Values}}\)
The Median (The Middle)
If you lined up all your data points from smallest to largest, the Median is the one right in the middle.
Quick Trick: Think of the "Median" as the concrete strip in the middle of a motorway. It’s always in the center!
Note: If you have an even number of values, the median is the halfway point between the two middle numbers.
The Mode and Modal Class
The Mode is the value that appears most often. If your data is grouped into categories (like "0-10mm of rain" and "11-20mm of rain"), the group with the most entries is called the Modal Class.
Quick Review:
- Mean: Add and divide.
- Median: Line them up, find the middle.
- Mode: The most popular one.
2. Measuring the "Spread": Range and Quartiles
Knowing the middle is great, but we also need to know how "spread out" our data is. For example, two cities might both have a mean temperature of \(15^\circ C\), but one might be steady all year, while the other has freezing winters and boiling summers.
The Range
The Range is the difference between the highest and lowest value.
Formula: \(\text{Range} = \text{Highest Value} - \text{Lowest Value}\)
Quartiles and the Inter-quartile Range (IQR)
Sometimes the Range is misleading because one "weird" result (an outlier) can make the spread look much bigger than it really is. To fix this, we use Quartiles.
Imagine your ordered data is a chocolate bar. You snap it in half (the Median), then snap those halves in half again. Now you have four parts (Quarters):
- Lower Quartile (LQ): The 25% mark.
- Upper Quartile (UQ): The 75% mark.
- Inter-quartile Range (IQR): The distance between the LQ and UQ.
Formula: \(\text{IQR} = \text{UQ} - \text{LQ}\)
Why use it? It only looks at the middle 50% of the data, so it ignores those "weird" outliers!
Key Takeaway: The Range shows the total spread; the IQR shows the "typical" spread.
3. Percentages and Percentiles
Geographers use percentages to compare things of different sizes, like comparing population growth in a small village versus a huge city.
Percentage Increase and Decrease
This is a very common exam question. Use this simple formula:
Formula: \(\text{Percentage Change} = \frac{\text{New Value} - \text{Old Value}}{\text{Old Value}} \times 100\)
Example: If a forest was 50 \(km^2\) and is now 40 \(km^2\), the change is \( -10 \). So: \(\frac{-10}{50} \times 100 = -20\%\). That is a 20% decrease!
Percentiles
A Percentile tells you where a value stands compared to others. If a city is in the 90th percentile for pollution, it means it is more polluted than 90% of all other cities. The 50th percentile is the same as the Median.
4. Cumulative Frequency
Cumulative Frequency is just a fancy way of saying a "running total."
Imagine you are counting pebbles on a beach. In the first meter, you find 5. In the second meter, you find 8.
- The frequency for the second meter is 8.
- The cumulative frequency is 13 (the 5 from before + the new 8).
We plot this on a graph that usually looks like a long "S" shape. It helps us easily find the Median and Quartiles by looking at the 50%, 25%, and 75% points on the vertical axis.
5. Bivariate Data: Seeing Relationships
Bivariate data is just a fancy term for data with two variables (like "Temperature" and "Ice Cream Sales"). We use Scatter Plots to see if they are linked.
Trend Lines and Correlation
When you look at a scatter plot, do the dots seem to go in a certain direction?
- Positive Correlation: As one goes up, the other goes up (e.g., more rainfall = higher river level).
- Negative Correlation: As one goes up, the other goes down (e.g., higher altitude = lower temperature).
- No Correlation: The dots are just a mess! No link.
Line of Best Fit
This is a straight line drawn through the middle of the dots on a scatter graph. Don't worry if it doesn't touch every dot! It should have roughly the same number of dots above it as below it.
Interpolation vs. Extrapolation
- Interpolation: Predicting a value inside the range of data you already have. This is usually quite accurate.
- Extrapolation: Extending your line of best fit to predict values outside your data range. Be careful! This is risky because the pattern might change in the future.
Quick Review Box:
- Bivariate: Two sets of data compared.
- Line of Best Fit: Shows the general trend.
- Extrapolation: Predicting the unknown (risky!).
6. Being a Critical Geographer: Selective Data
Did you know? Statistics can be used to trick people! Sometimes, people only show the data that supports their argument. This is called selective presentation.
When you look at a graph in an exam, ask yourself:
- Does the scale start at zero? (If not, it might make small changes look huge!)
- Is there data missing?
- Is the person presenting the data biased?
Key Takeaway: Always look closely at the axes and the source of the data before you believe what a graph is telling you!