【Math: 8th Grade】 Comparing Data (Quartiles and Box-and-Whisker Plots)

Hello! In this session, we’re going to study "Comparing Data," a key topic in 8th-grade math.
You might feel a bit overwhelmed by all the graphs and numbers, but don't worry! This unit is all about learning the "magical" tools that help us organize data and see the "spread" (variation) at a single glance.
These concepts are super useful in everyday life, whether you're looking at sports statistics or test results, so let's take it one step at a time!

1. Dividing Data into Four: The "Quartiles"

When you have a large set of data, if you arrange it in ascending order and divide it into four equal groups, the boundary values are called quartiles. There are three main boundary points in total.

Names and Roles of Quartiles

① First Quartile (\(Q_1\)): The value located at the 25% mark (the first quarter) of the data when arranged from smallest to largest.
② Second Quartile (\(Q_2\)): The middle value. This is the same as the "median" you've learned about before!
③ Third Quartile (\(Q_3\)): The value located at the 75% mark (the third quarter) of the data when arranged from smallest to largest.

【Key Point】 How to find quartiles

To find the quartiles, start by arranging the data in ascending order, then follow these steps:
1. First, find the overall median (\(Q_2\)).
2. Find the median of the "left side" (the first half) of the data — this is \(Q_1\)!
3. Find the median of the "right side" (the second half) of the data — this is \(Q_3\)!

Example: With 7 data points: 1, 2, 3, 4, 5, 6, 7
・The middle is 4, so \(Q_2 = 4\)
・The middle of the first half (1, 2, 3) is 2, so \(Q_1 = 2\)
・The middle of the second half (5, 6, 7) is 6, so \(Q_3 = 6\)

【Common Mistake】
Many students forget to arrange the data first. Always sort your data from smallest to largest before you start!

★ Summary:
Quartiles are the "dividers" that split data into four equal parts!

2. Understanding Spread: The "Interquartile Range"

We've previously calculated the "range" as "maximum value minus minimum value," but this can be heavily affected by extreme high or low values (outliers).
To get a better picture of the "spread of the middle portion of the data," we use the interquartile range.

Interquartile Range

\( (\text{Third Quartile } Q_3) - (\text{First Quartile } Q_1) \)
This range contains exactly the middle 50% of the data. The smaller this value is, the more the data is clustered toward the center (meaning less variability).

Quartile Deviation

This is simply half of the interquartile range.
\( (\text{Interquartile Range}) \div 2 \)

【Pro Tip】
"Deviation" refers to how much something shifts or differs. It acts as a hint to see how far the data is spread from the center.

3. Visualize at a Glance: The "Box-and-Whisker Plot"

A graph that displays these "quartiles" is called a box-and-whisker plot. It gets its name because it looks like a "box" with "whiskers" sticking out of it.

How to Draw a Box-and-Whisker Plot

1. Identify the 5 key values: minimum value, first quartile (\(Q_1\)), median (\(Q_2\)), third quartile (\(Q_3\)), and maximum value.
2. Draw a "box" from \(Q_1\) to \(Q_3\).
3. Draw a line inside the box at the position of the median (\(Q_2\)).
4. Extend the "whiskers" from both ends of the box to the minimum and maximum values.

【Key Point】
When looking at a box-and-whisker plot, keep in mind that "each section contains roughly the same number of data points"!
・From whisker tip to left side of box (Min to \(Q_1\)): About 25% of the total data
・Left half of box (\(Q_1\) to \(Q_2\)): About 25% of the total data
・Right half of box (\(Q_2\) to \(Q_3\)): About 25% of the total data
・Right side of box to whisker tip (\(Q_3\) to Max): About 25% of the total data

In other words, "longer boxes or longer whiskers mean the data is more widely spread out" in that section!

4. Comparing Data Sets

When you line up box-and-whisker plots for things like test results from two different classes, the differences become obvious.

Checklist for Comparison

① Position of the median: Which class has higher scores overall?
② Length of the box (interquartile range): Which class has more stable (consistent) scores?
③ Total length (Min to Max): Which data set has a wider spread from end to end?

Example: Comparing Class A and Class B
"Class A has a shorter box, so the middle 50% of students have fairly similar scores."
"Class B has very long whiskers, showing a big difference between the highest and lowest scoring students."
This is the kind of insight you can gain!

★ Summary:
Using box-and-whisker plots, you can easily compare distributions even if the number of data points differs!

Final Review: Key Takeaways

Quartiles are values that divide data into four parts (\(Q_1, Q_2, Q_3\)).
Interquartile Range is \(Q_3 - Q_1\), representing the spread of the middle 50%.
Box-and-Whisker Plots are drawn using 5 points: Min, \(Q_1, Q_2, Q_3\), and Max.
・Remember that each section of the plot represents about 25% of the data!

At first, you might get confused about which is the first or third quartile, but once you draw a few plots yourself, you'll get the hang of it quickly.
Enjoy the process of "turning data characteristics into a visual shape." Great job today!