When Transformation Isn't the Answer
By the end of this module, you will:
Parametric tests (t-test, ANOVA) make assumptions:
- Data are approximately normally distributed
- Variances are roughly equal across groups (homogeneity of variance)
- Data are measured on an interval or ratio scale
Non-parametric tests are "assumption-free":
- No normality assumption, no equal-variance assumption
- They still assume independent observations (the quotes are there for a reason: "assumption-free" is an overstatement)
The Trade-off:
Non-parametric tests are ~95% as powerful as parametric tests when data ARE normal, but can be MORE powerful when data are skewed or have outliers!
| Research Question | Parametric Test | Non-Parametric Alternative |
|---|---|---|
| Compare 2 independent groups | Independent t-test | Mann-Whitney U test (also called Wilcoxon rank-sum) |
| Compare 2 paired/matched samples | Paired t-test | Wilcoxon signed-rank test |
| Compare 3+ independent groups | One-way ANOVA | Kruskal-Wallis test |
| Compare 3+ related groups | Repeated-measures ANOVA | Friedman test |
| Correlation between 2 variables | Pearson's r | Spearman's rho (ρ) |
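To illustrate the last row of the table, here is a minimal sketch (with invented data) of how Pearson's r and Spearman's ρ diverge when a relationship is monotonic but not linear:

```python
import numpy as np
from scipy import stats

# A monotonic but non-linear relationship: y grows exponentially with x.
x = np.arange(1, 11)
y = np.exp(x / 2.0)

# Pearson measures *linear* association; Spearman correlates the *ranks*,
# so any perfectly monotonic relationship yields rho = 1.0 exactly.
pearson_r, _ = stats.pearsonr(x, y)
spearman_rho, _ = stats.spearmanr(x, y)

print(f"Pearson r    = {pearson_r:.3f}")    # less than 1: not linear
print(f"Spearman rho = {spearman_rho:.3f}")  # 1.000: perfectly monotonic
```

Because Spearman works on ranks, it answers "does y consistently increase with x?" rather than "does y increase with x at a constant rate?"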
Non-parametric tests convert your data to ranks before analysis:
Example:
Original data: 50, 80, 85, 200, 220
Ranks: 1, 2, 3, 4, 5
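You can verify the rank conversion with scipy's `rankdata` (a sketch; the second dataset is a hypothetical variant showing that an outlier's magnitude is irrelevant once ranked):

```python
from scipy.stats import rankdata

# The example values from the text.
data = [50, 80, 85, 200, 220]
ranks = rankdata(data)
print(ranks)  # [1. 2. 3. 4. 5.]

# A hypothetical extreme version: 220 becomes 22,000,
# but its rank is still just 5. The ranks are identical.
extreme = [50, 80, 85, 200, 22000]
print(rankdata(extreme))  # [1. 2. 3. 4. 5.]
```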
Why this matters: after ranking, the extreme value (220) no longer dominates the analysis. It is simply rank 5, one step above rank 4.
What are you testing?
Non-parametric tests ask: "Do the distributions differ?" rather than "Do the means differ?"
Let's generate some data and run BOTH tests to see how they compare!
When assumptions are met, both tests should agree.
When data violate assumptions, non-parametric tests often perform better.
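One way to set up that side-by-side comparison (a sketch; the seed, sample sizes, and distributions are arbitrary choices, not prescribed by the exercise):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)

# Scenario 1: normal data with a modest mean shift.
a_norm = rng.normal(loc=50, scale=10, size=30)
b_norm = rng.normal(loc=56, scale=10, size=30)

# Scenario 2: right-skewed (lognormal) data with a similar shift.
a_skew = rng.lognormal(mean=3.0, sigma=0.8, size=30)
b_skew = rng.lognormal(mean=3.4, sigma=0.8, size=30)

for label, a, b in [("normal", a_norm, b_norm), ("skewed", a_skew, b_skew)]:
    t_stat, t_p = stats.ttest_ind(a, b)
    u_stat, u_p = stats.mannwhitneyu(a, b, alternative="two-sided")
    print(f"{label:7s}: t-test p = {t_p:.4f}, Mann-Whitney p = {u_p:.4f}")
```

With normal data the two p-values tend to be close; with skewed data they can diverge, which is exactly what the questions below ask you to examine.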
Question 1: Compare the results from Scenario 1 (normal data). Did both tests reach the same conclusion? Were the p-values similar?
Question 2: Now look at Scenario 2 (skewed data). How did the results differ? Which test seems more appropriate for these data?
Question 3: Based on these demonstrations, when would you choose Mann-Whitney over a t-test?
1. Data are clearly non-normal AND small sample
n < 30 and severe skewness or outliers → Use non-parametric
2. Data are ordinal
Likert scales (1-5 ratings), rankings → Non-parametric is appropriate
3. Transformation doesn't work or is too complex
Tried log, sqrt, etc., but still not normal → Use non-parametric
4. You want robustness
Worried about outliers influencing results → Non-parametric is more robust
5. Your research question is about distributions, not just means
"Do groups differ?" is broader than "Do means differ?" → Non-parametric
1. You have large samples (n > 50-100) with mild violations
Parametric tests are robust; transformation may be easier
2. Your field expects parametric tests
Consider using parametric + reporting assumption checks + sensitivity analysis
3. You need specific comparisons (post-hocs)
Parametric post-hoc tests are more developed than non-parametric equivalents
4. Interpretation is important
Means are easier to interpret than "sum of ranks"
Step 1: Check normality
Step 2: Decide based on results + sample size
Step 3: Report clearly
Always report which test you used and why!
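The three steps can be wrapped in a small helper function (a sketch; the n < 30 cutoff and the α = .05 threshold follow the rules of thumb above, and the example data are made up):

```python
from scipy import stats

def choose_test(group_a, group_b, alpha=0.05, small_n=30):
    """Step 1: check normality. Step 2: decide. Step 3: return a reportable label."""
    # Step 1: Shapiro-Wilk on each group.
    _, p_a = stats.shapiro(group_a)
    _, p_b = stats.shapiro(group_b)
    non_normal = (p_a < alpha) or (p_b < alpha)
    small = min(len(group_a), len(group_b)) < small_n

    # Step 2: non-normal AND small sample -> non-parametric.
    if non_normal and small:
        stat, p = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")
        return ("Mann-Whitney U (non-normal, small n)", stat, p)
    stat, p = stats.ttest_ind(group_a, group_b)
    return ("Independent t-test", stat, p)

# Step 3: the returned label documents which test was used and why.
name, stat, p = choose_test([50, 80, 85, 200, 220, 61, 73],
                            [40, 45, 52, 48, 55, 300, 62])
print(name, round(stat, 2), round(p, 4))
```

In a real analysis you would also inspect Q-Q plots rather than rely on a single automated rule, as Module 3's large-sample paradox warns.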
Scenario A: You're comparing pain ratings (1-10 scale) between treatment and control groups (n=25 each). Data are skewed right. What do you do?
Scenario B: You have reaction time data (n=80 per group). Right-skewed. Shapiro-Wilk p = 0.001. Q-Q plot shows moderate deviation. What do you do?
Scenario C: Survey with satisfaction ratings (Very Dissatisfied to Very Satisfied, n=200). Compare 3 departments. What test?
Scenario D: Income data from 40 people, comparing two cities. Heavily right-skewed with outliers (a few millionaires). What do you do?
Bad example:
"We used a Mann-Whitney test. p = 0.03."
Good example:
"Data were not normally distributed (Shapiro-Wilk: W = 0.89, p = 0.003) and transformation did not improve normality. We therefore used the Mann-Whitney U test to compare groups. The treatment group (Mdn = 85) scored significantly higher than the control group (Mdn = 72), U = 245, p = .031, r = .34."
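scipy does not return the effect size r directly. A common approach (sketched here with invented data) is to convert U to a z-score via the normal approximation and take r = |z| / √N:

```python
import numpy as np
from scipy import stats

# Hypothetical scores: every treatment value exceeds every control value.
treatment = np.array([85, 90, 78, 92, 88, 95, 81, 84, 89, 91])
control   = np.array([72, 70, 75, 68, 74, 71, 77, 69, 73, 76])

u_stat, p = stats.mannwhitneyu(treatment, control, alternative="two-sided")
n1, n2 = len(treatment), len(control)

# Normal approximation of U: mean and SD under the null, then z and r.
mu_u = n1 * n2 / 2
sigma_u = np.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
z = (u_stat - mu_u) / sigma_u
r = abs(z) / np.sqrt(n1 + n2)

print(f"Mdn(treatment) = {np.median(treatment)}, Mdn(control) = {np.median(control)}")
print(f"U = {u_stat}, p = {p:.4f}, r = {r:.2f}")
```

This gives you every number the good example reports: both medians, U, p, and r.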
Key elements to include:
✅ Non-parametric tests are not "inferior"
They're just different! ~95% power when assumptions hold, MORE power when violated.
✅ They test distributions, not means
The question changes from "Are means different?" to "Are distributions different?"
✅ Use ranks, not raw values
This makes them robust to outliers and skewness.
✅ Main alternatives:
✅ When to use them:
✅ Always report why you chose non-parametric
Document your assumption checking and decision-making process!
You've now completed the entire normality testing curriculum!
Module 1: Why normality matters (Type I error inflation)
Module 2: Visual detection (histograms, Q-Q plots)
Module 3: Statistical tests (Shapiro-Wilk, large sample paradox)
Module 4: Transformations (log, sqrt, when to transform)
Module 5: Non-parametric alternatives (Mann-Whitney, Kruskal-Wallis)
You now have a complete toolkit for handling normality in your research!
Submission filename: module5_lastname1_lastname2.pdf

🎉 You've completed all 5 modules! 🎉
You're now equipped to handle normality testing like a pro!
Next: Apply these skills to your own research data!