Discovery Lab - Statistical Assumptions
By the end of this module, you will:
Here's a mystery:
Two researchers analyze the SAME research question using Ξ± = .05 (meaning they accept a 5% chance of false positives).
Both used t-tests. Both set Ξ± = .05. Why are the results so different?
Your job: Investigate what's happening and discover WHY assumptions matter.
Let's see this problem in action. Click the buttons below to run the simulations.
Question 1: Record the false positive rates you observed:
Question 2: DISCUSS WITH PARTNER: Why do you think these rates are different? What's the key difference between the two simulations?
Our initial hypothesis:
Think about:
Let's visualize what the data from each researcher actually look like.
Question 3: Describe the SHAPE of each histogram:
Researcher A (blue histogram):
Researcher B (red histogram):
Question 4: Look at the Q-Q plots (where points should follow the diagonal line if data are normal).
Researcher A's Q-Q plot:
Researcher B's Q-Q plot:
Question 5: When data are symmetric, how do mean and median compare? When data are skewed?
Question 6: PREDICTION TIME!
Imagine you're about to analyze reaction time data. You create histograms and Q-Q plots, and the data look like Researcher B's data (right-skewed).
If you proceed with a t-test anyway, what do you predict will happen?
Explain your reasoning (2-3 sentences):
Question 7: Based on everything you've discovered so far, why might Researcher B's inflated false positive rate be a problem in real research?
Let's see another consequence: violations can also reduce statistical power (ability to detect real effects).
Question 8: You're reviewing a manuscript for a journal. The Methods section says:
"We compared anxiety scores between the treatment and control groups using an independent samples t-test (Ξ± = .05). The treatment group showed significantly lower anxiety (p = .042)."
The authors did NOT report checking normality assumptions.
Part A: What would you want to see in the paper to evaluate whether their analysis was appropriate?
Part B: Why is it important that they checked (or didn't check) assumptions, given that they found p = .042 (just barely significant)?
Part C: If you were the reviewer, what would you recommend the authors do?
Question 9: THE BIG IDEA
Complete this sentence with your partner (3-4 sentences total):
"Checking for normality before running parametric tests matters because..."
Question 10: METACOGNITIVE REFLECTION
Before this module, did you know that violating assumptions could inflate false positive rates? How does this change the way you'll approach your own data analysis in the future?
module1_lastname1_lastname2.pdfπ You've completed Module 1! π
Great work discovering why normality matters.