A randomized double-blind controlled trial is conducted on the efficacy of 2 different ACE-inhibitors. The null hypothesis is that both drugs will be equivalent in their blood-pressure-lowering abilities. The study concluded, however, that Medication 1 was more efficacious in lowering blood pressure than medication 2 as determined by a p-value < 0.01 (with significance defined as p ≤ 0.05). Which of the following statements is correct?

We can reject the null hypothesis.

We can accept the null hypothesis.

This trial did not reach statistical significance.

There is a 0.1% chance that medication 2 is superior.

There is a 10% chance that medication 1 is superior.

You are conducting a study comparing the efficacy of two different statin medications. Two groups are placed on different statin medications, statin A and statin B. Baseline LDL levels are drawn for each group and are subsequently measured every 3 months for 1 year. Average baseline LDL levels for each group were identical. The group receiving statin A exhibited an 11 mg/dL greater reduction in LDL in comparison to the statin B group. Your statistical analysis reports a p-value of 0.052. Which of the following best describes the meaning of this p-value?

There is a 5.2% chance of observing a difference in reduction of LDL of 11 mg/dL or greater even if the two medications have identical effects

There is a 95% chance that the difference in reduction of LDL observed reflects a real difference between the two groups

Though A is more effective than B, there is a 5% chance the difference in reduction of LDL between the two groups is due to chance

If 100 permutations of this experiment were conducted, 5 of them would show similar results to those described above

This is a statistically significant result

A randomized control double-blind study is conducted on the efficacy of 2 sulfonylureas. The study concluded that medication 1 was more efficacious in lowering fasting blood glucose than medication 2 (p ≤ 0.05; 95% CI: 14 [10-21]). Which of the following is true regarding a 95% confidence interval (CI)?

If the same study were repeated multiple times, approximately 95% of the calculated confidence intervals would contain the true population parameter.

The 95% confidence interval is the probability chosen by the researcher to be the threshold of statistical significance.

When a 95% CI for the estimated difference between groups contains the value ‘0’, the results are significant.

It represents the probability that chance would not produce the difference shown, 95% of the time.

The study is adequately powered at the 95% confidence interval.

A researcher is conducting a study to compare fracture risk in male patients above the age of 65 who received annual DEXA screening to peers who did not receive screening. He conducts a randomized controlled trial in 900 patients, with half of participants assigned to each experimental group. The researcher ultimately finds similar rates of fractures in the two groups. He then notices that he had forgotten to include 400 patients in his analysis. Including the additional participants in his analysis would most likely affect the study's results in which of the following ways?

Increased probability of rejecting the null hypothesis when it is truly false

Wider confidence intervals of results

Increased probability of committing a type II error

Decreased significance level of results

Increased external validity of results

Common misinterpretations of p-values — USMLE Step 1 Lesson

Common misinterpretations of p-values - The P-Value Fallacy Files

A p-value is the probability of obtaining the observed study results (or more extreme results) assuming the null hypothesis (H₀) is true.
Fallacy 1: The p-value is the probability that the null hypothesis is true.
- ⚠️ This is the most common error. The p-value is calculated assuming H₀ is true. It is $P(\text{data} | H₀)$, not $P(H₀ | \text{data})$.

⭐ A p-value of 0.05 does not mean there is a 5% chance the null hypothesis is true. It means there is a 5% chance of observing the data (or more extreme data) if the null hypothesis were true.

Fallacy 2: A non-significant p-value (e.g., p > 0.05) means the null hypothesis is true.
- This wrongly equates "no evidence of an effect" with "evidence of no effect."
- The study may be underpowered, leading to a Type II error (false negative).
Fallacy 3: A small p-value indicates a large effect size.
- P-values are confounded by sample size. A huge sample can yield a tiny p-value for a trivial, clinically irrelevant effect.
- Always evaluate effect size (e.g., relative risk, odds ratio) and confidence intervals to determine the magnitude of the effect.

T-distribution with shaded p-value region

Fallacy 4: Statistical significance equals clinical significance.
- A result can be statistically significant (p < 0.05) but not clinically meaningful.
- Example: A new drug lowers blood pressure by a statistically significant 1 mmHg, which is not a clinically relevant improvement.

Confidence Intervals - The P-Value's Big Brother

Definition: A Confidence Interval (CI) is a range of values calculated from sample data that is likely to contain the true population parameter (e.g., mean difference, relative risk).
The 95% CI is standard, implying that if a study were repeated many times, 95% of the calculated CIs would contain the true value.

CI & P-Value Relationship (for α = 0.05):

A CI provides more information than a p-value; it presents a range of plausible values for the true effect.
Statistical Significance:
- If the 95% CI does not contain the null value, the result is statistically significant (p < 0.05).
- If the 95% CI does contain the null value, the result is not statistically significant (p ≥ 0.05).
Key Null Values:
- For differences (e.g., mean difference): Null value is 0.
- For ratios (e.g., Odds Ratio [OR], Relative Risk [RR]): Null value is 1.

Confidence intervals for odds ratios and another entirely above 1 (significant))

Why CIs are Superior:

Precision: The width of the CI indicates the precision of the point estimate.
- Narrow CI → High precision (less random error).
- Wide CI → Low precision (more random error).
Effect Size: The CI provides the magnitude and direction of the effect, which a p-value alone cannot do.

⭐ When evaluating a study's OR or RR, check if the 95% CI includes 1. If it does (e.g., CI: 0.8 to 2.1), the association is not statistically significant. If it does not (e.g., CI: 1.5 to 3.0), the association is significant.

A p-value is NOT the probability that the null hypothesis is true. It is the probability of the observed data (or more extreme) assuming the null hypothesis is true.

A non-significant p-value does not prove the null hypothesis. It simply reflects insufficient evidence to reject it (absence of evidence ≠ evidence of absence).

Statistical significance does not imply clinical significance. A tiny p-value can be associated with a clinically trivial effect, especially in large studies.

Unlock the full lesson and continue reading

Signup to continue reading this lesson and unlimited access questions, flashcards, AI notes, and more

Scan to download app

UNLOCK FREE ACCESS