Module 1: Why Does Normality Matter?

Discovery Lab - Statistical Assumptions

☁️ Working Guidelines

0% Complete

🎯 Learning Objectives

By the end of this module, you will:

  1. Understand why checking normality assumptions matters for statistical inference
  2. Discover how violations affect Type I error rates
  3. Recognize the consequences of ignoring assumptions
  4. Apply this knowledge to real research scenarios

πŸ” The Puzzle

Here's a mystery:

Two researchers analyze the SAME research question using Ξ± = .05 (meaning they accept a 5% chance of false positives).

Both used t-tests. Both set Ξ± = .05. Why are the results so different?

Your job: Investigate what's happening and discover WHY assumptions matter.

Part 1: Run the Simulation

Let's see this problem in action. Click the buttons below to run the simulations.

Researcher A: Normal Data

Researcher B: Skewed Data

πŸ“ Observe & Record

Question 1: Record the false positive rates you observed:


Question 2: DISCUSS WITH PARTNER: Why do you think these rates are different? What's the key difference between the two simulations?

Our initial hypothesis:

πŸ€” Stuck? Click for a hint

Think about:

  • What type of distribution did Researcher A use? (normal)
  • What type did Researcher B use? (exponential/skewed)
  • Which assumption of the t-test might be violated?

Part 2: Investigate the Data

Let's visualize what the data from each researcher actually look like.

πŸ“ Analyze the Visualizations

Question 3: Describe the SHAPE of each histogram:

Researcher A (blue histogram):

Researcher B (red histogram):

Question 4: Look at the Q-Q plots (where points should follow the diagonal line if data are normal).

Researcher A's Q-Q plot:

Researcher B's Q-Q plot:

Question 5: When data are symmetric, how do mean and median compare? When data are skewed?

Part 3: Test Your Understanding

Question 6: PREDICTION TIME!

Imagine you're about to analyze reaction time data. You create histograms and Q-Q plots, and the data look like Researcher B's data (right-skewed).

If you proceed with a t-test anyway, what do you predict will happen?



Explain your reasoning (2-3 sentences):

Question 7: Based on everything you've discovered so far, why might Researcher B's inflated false positive rate be a problem in real research?

Part 4: The Power Demonstration

Let's see another consequence: violations can also reduce statistical power (ability to detect real effects).

Part 5: Real-World Application

Question 8: You're reviewing a manuscript for a journal. The Methods section says:

"We compared anxiety scores between the treatment and control groups using an independent samples t-test (Ξ± = .05). The treatment group showed significantly lower anxiety (p = .042)."

The authors did NOT report checking normality assumptions.

Part A: What would you want to see in the paper to evaluate whether their analysis was appropriate?

Part B: Why is it important that they checked (or didn't check) assumptions, given that they found p = .042 (just barely significant)?

Part C: If you were the reviewer, what would you recommend the authors do?

Part 6: Synthesis & Reflection

Question 9: THE BIG IDEA

Complete this sentence with your partner (3-4 sentences total):

"Checking for normality before running parametric tests matters because..."

Question 10: METACOGNITIVE REFLECTION

Before this module, did you know that violating assumptions could inflate false positive rates? How does this change the way you'll approach your own data analysis in the future?

🎯 Key Discoveries

What We Discovered Today:

πŸ“‹ Before You Submit

βœ… Submission Checklist

πŸ“€ How to Submit

  1. Click "Save Progress" to ensure everything is stored
  2. Use your browser's Print function: Ctrl+P (Windows) or Cmd+P (Mac)
  3. Choose "Save as PDF" as the destination
  4. Save as: module1_lastname1_lastname2.pdf
  5. Upload the PDF to your course management system

πŸŽ‰ You've completed Module 1! πŸŽ‰

Great work discovering why normality matters.