The mean, median, and mode of the weights of 37 newborns in a hospital nursery are all 7 lbs 2 oz. In fact, there are 7 infants in the nursery that weigh exactly 7 lbs 2 oz. The standard deviation of the weights is 2 oz. The weights follow a normal distribution. A newborn delivered at 10 lbs 2 oz is added to the data set. What is most likely to happen to the mean, median, and mode with the addition of this new data point?

The mean will increase; the median will stay the same; the mode will stay the same

The mean will increase; the median will increase; the mode will stay the same

The mean will stay the same; the median will increase; the mode will stay the same

The mean will increase; the median will increase; the mode will increase

The mean will stay the same; the median will increase; the mode will increase

A health system implements a new sepsis protocol across 20 hospitals. A researcher plans to evaluate effectiveness using a stepped-wedge cluster randomized design where hospitals sequentially adopt the protocol every 3 months. She calculates sample size based on individual patient outcomes (mortality) needing 2,000 patients total. The biostatistician identifies a critical error. Evaluate what modification is needed.

Account for intra-cluster correlation coefficient (ICC) requiring substantial sample size inflation

Adjust for multiple time periods using Bonferroni correction

Use hospital-level outcomes instead of patient-level outcomes as unit of analysis

Increase alpha to 0.10 to account for cluster randomization reducing power

Include random effects for both hospital and time period in power calculation

A 41-year-old research fellow designs a non-inferiority trial comparing oral to IV antibiotics for osteomyelitis. She sets the non-inferiority margin at 10% (cure rate difference), expects 85% cure in both groups, and calculates 300 patients per arm for 80% power with α=0.025 (one-sided). Her mentor suggests this underestimates required sample size. Evaluate the mentor's concern.

Correct; non-inferiority trials require larger samples than superiority trials for equivalent power

Incorrect; non-inferiority trials actually require smaller samples due to less stringent hypotheses

Correct; dropout rates in antibiotic trials necessitate 20% inflation of calculated sample size

Incorrect; the calculation appropriately uses one-sided alpha for non-inferiority testing

Correct; the margin should be set at 5% requiring doubling of sample size

Sample size for non-inferiority trials — USMLE Step 1 Lesson

Non-Inferiority Trials - Not Worse, Just Different

Goal: To show a new treatment is not unacceptably worse than the standard. Used when new options offer other benefits (e.g., ↑safety, ↓cost).
Non-Inferiority Margin (δ): The pre-specified, largest clinically acceptable loss of efficacy at which the new treatment still counts as "good enough."
Hypotheses (Difference = standard - new, so positive values mean the new treatment is worse):
- H₀ (Null): The new treatment is inferior (Difference ≥ δ).
- H₁ (Alternative): The new treatment is non-inferior (Difference < δ).
Sample Size: Influenced by α, β (power = 1-β), variance, and δ. A smaller, stricter margin (↓δ) requires a ↑ sample size.

Non-inferiority trial confidence interval interpretation

⭐ For non-inferiority to be claimed, the entire confidence interval for the treatment difference (standard - new) must lie below the non-inferiority margin (δ); equivalently, its upper bound must be less than δ.
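
The confidence-interval rule above can be sketched in a few lines of Python. This is a minimal illustration, not a validated analysis routine: the function name `assess_non_inferiority` and the example numbers are hypothetical, and it assumes the difference is defined as standard minus new (so larger values mean the new treatment is worse) and that the margin is a positive number. The "inferior" branch follows the common reading that an interval lying wholly beyond the margin indicates inferiority.

```python
def assess_non_inferiority(ci_lower: float, ci_upper: float, margin: float) -> str:
    """Classify a trial result from the CI of (standard - new).

    Assumes margin > 0 and that positive differences mean the new
    treatment performs worse than the standard.
    """
    if margin <= 0:
        raise ValueError("margin must be positive")
    if ci_lower > ci_upper:
        raise ValueError("ci_lower must not exceed ci_upper")
    if ci_upper < margin:
        # Whole interval sits below the margin: non-inferiority shown.
        return "non-inferior"
    if ci_lower > margin:
        # Whole interval sits beyond the margin: new treatment is inferior.
        return "inferior"
    # Interval straddles the margin: the trial cannot decide either way.
    return "inconclusive"


# Hypothetical cure-rate differences with a 10-percentage-point margin.
print(assess_non_inferiority(-0.03, 0.06, 0.10))
print(assess_non_inferiority(0.02, 0.14, 0.10))
```

The first call has an upper bound below the margin, so it is classified as non-inferior; the second interval crosses the margin, so it is inconclusive.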

NI Sample Size - The Secret Sauce

Goal: Prove a new treatment is not unacceptably worse than the standard. The sample size hinges on the non-inferiority margin (δ).
Core Formula (per group):
- $n = \frac{(Z_{\alpha} + Z_{\beta})^2 \times (2\sigma^2)}{(\Delta - \delta)^2}$
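- $\Delta$: Assumed true difference between treatments (standard - new, so positive values mean the new treatment is worse; often taken as 0).
- $\delta$: The pre-defined non-inferiority margin (a positive number, with $\delta > \Delta$).
Key Relationship: The required sample size is highly sensitive to the gap between the assumed true difference ($\Delta$) and the NI margin ($\delta$), because that gap is squared in the denominator.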
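
As a rough worked example of the formula above, take illustrative inputs that are assumptions rather than values from any trial: $\sigma = 10$, $\delta = 5$, $\Delta = 0$, one-sided $\alpha = 0.025$ (so $Z_{\alpha} \approx 1.96$), and 80% power (so $Z_{\beta} \approx 0.84$).

$$ n = \frac{(1.96 + 0.84)^2 \times (2 \times 10^2)}{(0 - 5)^2} = \frac{7.84 \times 200}{25} \approx 62.7 $$

Rounding up gives about 63 subjects per group. Halving the margin to $\delta = 2.5$ shrinks the denominator from 25 to 6.25, which roughly quadruples the requirement (about 251 per group with these rounded Z values).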

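⭐ Exam Pearl: Counterintuitively, non-inferiority trials often require a larger sample size than superiority trials, because the margin (δ) is usually smaller than the effect size a superiority trial is powered to detect. When the two treatments are expected to perform alike (Δ ≈ 0), the denominator is just δ², so a tight margin drives n up sharply.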

The Formula - Cranking the Numbers

The formula gives the number of subjects per group needed to show that a new treatment is not unacceptably worse than the standard treatment.
Formula for continuous outcomes (per group): $$ n = \frac{2 \sigma^2 (Z_{\alpha} + Z_{\beta})^2}{(\Delta - \delta)^2} $$
- Key Inputs:
  - $Z_{\alpha}$: Critical value for the significance level (e.g., 1.96 for one-sided α=0.025)
  - $Z_{\beta}$: Critical value for the desired power (e.g., 0.84 for 80% power)
  - $\sigma^2$: Data variability (variance)
  - $\delta$: The non-inferiority margin
  - $\Delta$: Expected difference in effect (often assumed to be 0)
Sample Size Drivers:
- Sample size ↑ as power ↑, α ↓ (stricter significance level), or variance ↑.
- Crucially, sample size ↑ dramatically as the margin (δ) ↓ (becomes stricter): with Δ = 0, halving δ quadruples n.
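
The continuous-outcome formula can be tried out with a short Python sketch. The function name `ni_sample_size`, its parameter names, and the example inputs (σ = 10, margins of 5 and 2.5) are hypothetical choices made for illustration. The sketch assumes a one-sided α, equal group sizes, and a difference defined so that the margin must exceed the expected difference. It uses only the standard library and takes unrounded Z values from `statistics.NormalDist`, so results can differ slightly from hand calculations with 1.96 and 0.84.

```python
import math
from statistics import NormalDist


def ni_sample_size(sigma: float, margin: float, expected_diff: float = 0.0,
                   alpha: float = 0.025, power: float = 0.80) -> int:
    """Per-group n for a non-inferiority trial with a continuous outcome.

    Implements n = 2 * sigma^2 * (Z_alpha + Z_beta)^2 / (Delta - delta)^2,
    where alpha is one-sided, margin is delta, and expected_diff is Delta.
    """
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    if margin <= expected_diff:
        raise ValueError("margin must exceed the expected difference")
    z_alpha = NormalDist().inv_cdf(1 - alpha)  # about 1.96 for alpha = 0.025
    z_beta = NormalDist().inv_cdf(power)       # about 0.84 for 80% power
    n = 2 * sigma**2 * (z_alpha + z_beta) ** 2 / (expected_diff - margin) ** 2
    return math.ceil(n)  # round up to whole subjects


# Illustrative inputs only: see how each driver moves the requirement.
print("baseline      :", ni_sample_size(sigma=10, margin=5))
print("stricter delta:", ni_sample_size(sigma=10, margin=2.5))
print("90% power     :", ni_sample_size(sigma=10, margin=5, power=0.90))
print("more variance :", ni_sample_size(sigma=15, margin=5))
```

With the assumed σ = 10 and δ = 5, the baseline call returns 63 per group; tightening the margin, raising power, or increasing variance each pushes the requirement upward, with the margin having the largest effect.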

⭐ The non-inferiority margin (δ) is the most critical choice. It must be smaller than the active control's established benefit over placebo, ensuring the new drug preserves a clinically meaningful effect.

Sample Size Levers - Dialing It In

Non-Inferiority Margin (δ): The most critical lever.
- Smaller (stricter) margin → ↑ sample size.
- Larger (lenient) margin → ↓ sample size.
Power (1-β):
- Higher power (e.g., 90% vs 80%) → ↑ sample size. Reduces Type II error risk.
Significance Level (α):
- Lower α (e.g., 0.01) → ↑ sample size. Reduces Type I error risk.
Outcome Variability (σ²):
- Higher data variability → ↑ sample size for precise estimates.

⭐ The non-inferiority margin (δ) isn't arbitrary. It's set based on historical data of the active control's effect over a placebo, ensuring the new drug preserves a clinically meaningful effect.

High‑Yield Points - ⚡ Biggest Takeaways

The goal is to show a new treatment is not unacceptably worse than the standard one.

A pre-specified non-inferiority margin (δ) sets the boundary of acceptable difference.

Success requires the entire confidence interval of the effect difference (new - standard) to lie above -δ.

Sample size is driven by the margin (δ), power (1-β), and significance (α).

A smaller (stricter) margin demands a larger sample size to achieve adequate power.

If the CI of the effect difference crosses -δ, the result is inconclusive, not a confirmation of inferiority.
