Introduction: Making Decisions with Data

Welcome to the world of Hypothesis Testing! Have you ever heard a claim and wondered if it was actually true? For example, a chocolate company might claim their bars weigh exactly 50g, but you suspect they might be smaller. Or a tutor claims their new method improves scores by 10 points.

In this chapter, we learn the mathematical way to decide whether these claims are backed up by evidence. It’s like being a judge in a courtroom: we assume someone is "innocent" (the status quo) until we have enough "evidence" (data) to prove otherwise. Don't worry if this seems a bit abstract at first—once you see the step-by-step process, it will click!

1. The Core Concepts: The "Cast" of Hypothesis Testing

To perform a test, we need to define our two competing ideas:

The Null Hypothesis \( (H_0) \)

Think of \( H_0 \) as the "status quo" or the "boring" version. It assumes that nothing has changed or that the manufacturer's claim is true.
Example: "The mean weight of the chocolate bar is 50g." (\( H_0: \mu = 50 \))

The Alternative Hypothesis \( (H_1) \)

This is the "exciting" version—the claim you are actually trying to test or find evidence for.
Example: "The mean weight is actually less than 50g." (\( H_1: \mu < 50 \))

Level of Significance \( (\alpha) \)

This is the threshold for our "evidence." Usually, it is 5% (\( 0.05 \)) or 1% (\( 0.01 \)). It represents how much risk we are willing to take of being wrong. If the probability of our result happening by pure chance is lower than this level, we reject the "boring" \( H_0 \).

Quick Review:
- \( H_0 \): Always uses an equal sign (\( = \)).
- \( H_1 \): Uses \( < \), \( > \), or \( \neq \).
- Evidence: We only reject \( H_0 \) if our data is very unlikely to have happened by chance.

2. One-Tailed vs. Two-Tailed Tests

How do we know which sign to use for \( H_1 \)? It depends on what we are looking for!

1-Tailed Test (Directional): We are specifically looking for an increase or a decrease.
Example: "Has the new medicine increased reaction time?" (\( H_1: \mu > \text{value} \)) or "Has the diet reduced weight?" (\( H_1: \mu < \text{value} \)).

2-Tailed Test (Non-directional): We just want to know if the value has changed or is different, regardless of whether it went up or down.
Example: "Is the machine still calibrated correctly, or is the mean weight different from 50g?" (\( H_1: \mu \neq 50 \)).

Memory Aid:
If the question says "increased," "decreased," "more than," or "less than" → 1-tail.
If the question says "changed," "differ," or "is the claim still valid" → 2-tails.

3. The Test Statistic and the p-value

To make a decision, we need to turn our sample data into a standardized score called a Test Statistic (\( Z \)). For H1 Math, we focus on the population mean (\( \mu \)).

The formula for the Test Statistic is:
\( Z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}} \)

Where:
- \( \bar{x} \) is the sample mean.
- \( \mu \) is the population mean from \( H_0 \).
- \( \sigma \) is the population standard deviation.
- \( n \) is the sample size.

The p-value

This is the "magic number" your Graphing Calculator (GC) will give you. It represents the probability of getting your sample result if \( H_0 \) were true.

The Golden Rule:
- If p-value \( < \alpha \): The result is "significant." Reject \( H_0 \).
- If p-value \( \geq \alpha \): Not enough evidence. Do not reject \( H_0 \).

Did you know? In the real world, many scientific discoveries only count if the p-value is less than 0.05. It's the "gold standard" for proof!

4. When Can We Use This Test?

We can use the \( Z \)-test in two main scenarios based on the 8865 syllabus:

Scenario A: Normal Population with Known Variance
If the question says the population is "normally distributed" and gives you the population variance (\( \sigma^2 \)), you are good to go!

Scenario B: Large Sample (Central Limit Theorem)
If the population is not normal (or we don't know), but the sample size is large (\( n \geq 30 \)), the Central Limit Theorem (CLT) allows us to treat the sample mean as normally distributed anyway!

Pro-tip: If you don't know the population variance (\( \sigma^2 \)), you must use the unbiased estimate of the population variance (\( s^2 \)) calculated from your sample.

5. Step-by-Step Guide to Solving Questions

Don't worry if this seems like a lot. Just follow these 5 steps every time:

Step 1: State the Hypotheses
Write down \( H_0: \mu = \text{something} \) and \( H_1: \mu (<, >, \text{ or } \neq) \text{ something} \). Define \( \mu \) clearly in words!

Step 2: State the Significance Level
Example: "Test at the 5% level of significance."

Step 3: State the Distribution and Test Statistic
Identify if you are using a Normal distribution or the CLT (\( n \geq 30 \)). Calculate your \( Z \) value (or let your GC do it).

Step 4: Find the p-value (or Critical Region)
Use your Graphing Calculator's "Z-Test" function. It will ask for \( \mu_0 \), \( \sigma \), \( \bar{x} \), and \( n \).

Step 5: Make a Conclusion
Compare the p-value to \( \alpha \).
"Since p-value = 0.03 < 0.05, we reject \( H_0 \). There is sufficient evidence at the 5% level to suggest that the mean weight has decreased."

6. Critical Regions and Critical Values

Instead of using a p-value, sometimes we look at the Critical Region. This is the "rejection zone" on the graph.

If your calculated \( Z \)-statistic falls inside the Critical Region, you reject \( H_0 \).

Example for a 5% 1-tail test (upper tail):
The Critical Value is \( Z = 1.645 \). If your calculated \( Z \) is \( > 1.645 \), you are in the "rejection zone."

Common Mistake to Avoid:
When doing a 2-tailed test at a 5% significance level, you must split the 5% into two halves! You look for 2.5% (\( 0.025 \)) in each tail. Your GC usually handles this if you select the \( \neq \) option, but keep it in mind for conceptual questions!

Summary: Key Takeaways

1. Hypotheses: \( H_0 \) is the default; \( H_1 \) is what you're testing for.
2. Decision Rule: Reject \( H_0 \) if p-value < level of significance.
3. Large Samples: If \( n \geq 30 \), we can use the Z-test even if the population isn't normal (thanks to the CLT).
4. Context is King: Always write your final answer in terms of the original problem (e.g., "The mean height of plants...").
5. Never say "Accept \( H_0 \)": We either "Reject \( H_0 \)" or "Do not reject \( H_0 \)." We haven't proven \( H_0 \) is 100% true; we just don't have enough evidence to throw it out yet!