Confidence intervals for non-parametric tests

Confidence intervals for non-parametric tests

Confidence intervals for non-parametric tests

On this page

Non-Parametric CIs - Beyond the Bell Curve

  • Used when data is skewed or does not follow a normal distribution, violating assumptions of parametric tests (like t-tests).
  • Confidence intervals are constructed for the median, not the mean.
  • Common Methods:
    • Wilcoxon Rank-Sum Test: The CI for the difference between two medians is based on the Hodges-Lehmann estimator, which uses the median of all possible pairwise differences between the two samples.
    • Sign Test: The CI for a single sample median is derived from the binomial distribution.
    • Bootstrapping: A resampling technique to estimate CIs for any statistic without distribution assumptions.

High-Yield: Bootstrapping is a versatile, computer-intensive method. It involves repeatedly sampling with replacement from the data to create thousands of new samples, generating a distribution of the statistic (e.g., median) to find the confidence interval.

Normal vs. Skewed Distributions: Mean, Median, Mode

Wilcoxon & Mann-Whitney - Sizing Up Medians

  • Use Case: Essential for skewed data or when assumptions for t-tests (like normal distribution) are not met. These tests analyze medians, not means.

  • Mann-Whitney U Test (Wilcoxon Rank-Sum Test)

    • Compares medians of two independent groups.
    • Analogy: Non-parametric version of the independent samples t-test.
    • Example: Comparing the median symptom score in a treatment group vs. a placebo group.
  • Wilcoxon Signed-Rank Test

    • Compares medians of two paired/matched samples.
    • Analogy: Non-parametric version of the paired t-test.
    • Example: Evaluating median cholesterol levels in the same patients before and after a new diet.

Power Consideration: While robust, non-parametric tests are less powerful than their parametric counterparts (t-tests) if the data actually follows a normal distribution. This means a higher risk of a Type II error (failing to detect a real effect).

Parametric vs. Non-Parametric Tests Comparison

Wilcoxon Signed-Rank - Paired Data Power

  • Analogue: Non-parametric equivalent to the paired t-test.
  • Use When:
    • Data is from paired samples (e.g., before-and-after measurements).
    • Data is not normally distributed.
    • Outcome is ordinal or continuous.
  • Mechanism:
    • Calculates the difference for each pair: $d_i = \text{after}_i - \text{before}_i$.
    • Ranks the absolute values of non-zero differences.
    • Sums the ranks for positive differences ($W^+$) and negative differences ($W^-$).
    • The test statistic, $W$, is the smaller of $W^+$ and $W^-$.
  • Power: More powerful than the Sign Test; less powerful than the paired t-test if data is normally distributed.

High-Yield Pearl: The Wilcoxon Signed-Rank test retains approximately 95% of the statistical power of the paired t-test even on data that perfectly fits the t-test's normal distribution assumptions, making it an excellent and safe alternative.

High‑Yield Points - ⚡ Biggest Takeaways

  • Non-parametric CIs estimate the population median, not the mean, as they are based on ranks.
  • The CI for a Wilcoxon test estimates the median of the difference between groups (independent or paired).
  • Essential for skewed data or samples with outliers, where the mean is not a reliable measure.
  • Interpretation is key: if a 95% CI for a difference in medians contains 0, the result is not significant.
  • Generally wider (less precise) than parametric CIs because they use less data information.
Rezzy AI Tutor

Have doubts about this lesson?

Ask Rezzy, our AI tutor, to explain anything you didn't understand

Practice Questions: Confidence intervals for non-parametric tests

Test your understanding with these related questions

Group of 100 medical students took an end of the year exam. The mean score on the exam was 70%, with a standard deviation of 25%. The professor states that a student's score must be within the 95% confidence interval of the mean to pass the exam. Which of the following is the minimum score a student can have to pass the exam?

1 of 5

Flashcards: Confidence intervals for non-parametric tests

1/8

What is the relationship between the mean, median, and mode in a negatively skewed distribution?_____

TAP TO REVEAL ANSWER

What is the relationship between the mean, median, and mode in a negatively skewed distribution?_____

Mean < median < mode

browseSpaceflip

Enjoying this lesson?

Get full access to all lessons, practice questions, and more.

Start For Free