You are conducting a study comparing the efficacy of two different statin medications. Two groups are placed on different statin medications, statin A and statin B. Baseline LDL levels are drawn for each group and are subsequently measured every 3 months for 1 year. Average baseline LDL levels for each group were identical. The group receiving statin A exhibited an 11 mg/dL greater reduction in LDL in comparison to the statin B group. Your statistical analysis reports a p-value of 0.052. Which of the following best describes the meaning of this p-value?

There is a 5.2% chance of observing a difference in reduction of LDL of 11 mg/dL or greater even if the two medications have identical effects

There is a 95% chance that the difference in reduction of LDL observed reflects a real difference between the two groups

Though A is more effective than B, there is a 5% chance the difference in reduction of LDL between the two groups is due to chance

If 100 permutations of this experiment were conducted, 5 of them would show similar results to those described above

This is a statistically significant result

A researcher is examining the relationship between socioeconomic status and IQ scores. The IQ scores of young American adults have historically been reported to be distributed normally with a mean of 100 and a standard deviation of 15. Initially, the researcher obtains a random sampling of 300 high school students from public schools nationwide and conducts IQ tests on all participants. Recently, the researcher received additional funding to enable an increase in sample size to 2,000 participants. Assuming that all other study conditions are held constant, which of the following is most likely to occur as a result of this additional funding?

Decrease in standard error of the mean

Increase in risk of systematic error

Increase in range of the confidence interval

Increase in probability of type II error

You submit a paper to a prestigious journal about the effects of coffee consumption on mesothelioma risk. The first reviewer lauds your clinical and scientific acumen, but expresses concern that your study does not have adequate statistical power. Statistical power refers to which of the following?

The probability of detecting an association when an association does exist.

The probability of detecting an association when no association exists.

The probability of not detecting an association when an association does exist.

The square root of the variance.

A 28-year-old male presents to his primary care physician with complaints of intermittent abdominal pain and alternating bouts of constipation and diarrhea. His medical chart is not significant for any past medical problems or prior surgeries. He is not prescribed any current medications. Which of the following questions would be the most useful next question in eliciting further history from this patient?

"Can you tell me more about the symptoms you have been experiencing?"

"Does the diarrhea typically precede the constipation, or vice-versa?"

"Is the diarrhea foul-smelling?"

"Please rate your abdominal pain on a scale of 1-10, with 10 being the worst pain of your life"

"Are the symptoms worse in the morning or at night?"

A 54-year-old man is brought to the emergency department 30 minutes after being hit by a car while crossing the street. He had a left-sided tonic-clonic seizure and one episode of vomiting while being transported to the hospital. On arrival, he is not oriented to person, place, or time. Physical examination shows flaccid paralysis of all extremities. A CT scan of the head is shown. This patient's symptoms are most likely the result of a hemorrhage in which of the following structures?

Between the arachnoid mater and the pia mater

Between the dura mater and the arachnoid mater

Between the skull and the dura mater

Effect sizes and confidence intervals — USMLE Lesson

Effect Size - Beyond P-Hacking

Quantifies the magnitude of an intervention's effect, moving beyond the simple "significant vs. not significant" of p-values.
Helps counter p-hacking; a large sample can make a clinically trivial effect statistically significant (p < 0.05), but the effect size remains small.

Cohen’s d and Overlap Between Distributions

Common measures:
- Cohen's d: Standardized difference between means.
- Odds Ratio (OR) / Relative Risk (RR): For categorical outcomes.
- Correlation coefficient (r): For linear relationships.
The 95% CI of an effect size indicates the precision of the estimate.

⭐ A statistically significant result (p < 0.05) with a small effect size may be clinically meaningless. Always assess both the p-value and the effect size to determine clinical importance.

Confidence Intervals - Range of Reality

A Confidence Interval (CI) provides a range of plausible values for an unknown population parameter (e.g., true mean difference or odds ratio), based on sample data.
A 95% CI means that if a study were repeated infinitely, 95% of the calculated CIs would contain the true population value. It is a measure of precision, not statistical significance alone.

Interpretation:
- Narrow CI → High precision (more certain about the true value).
- Wide CI → Low precision (less certain).

⭐ The width of the confidence interval is inversely related to the sample size. A larger sample size leads to a narrower, more precise CI.

Clinical Significance - Stats vs. Reality

Statistical significance (p-value) ≠ Clinical significance (real-world importance).
- A p-value < 0.05 simply indicates that an observed effect is unlikely to be due to chance. It does not quantify the size or practical importance of the effect.
Effect Size: The primary measure of an intervention's impact magnitude, indicating its clinical relevance.
Confidence Intervals (CIs) are vital for assessing clinical significance.
- A narrow CI implies a precise estimate of the effect.
- Key question: Does the CI range include effects that are clinically trivial? If so, the finding may be unimportant, even if statistically significant.

⭐ A large study might find a new drug lowers blood pressure by 0.1 mmHg (p < 0.001). While statistically significant, this effect size is clinically meaningless. Always scrutinize the confidence interval.

Normal distribution curves showing effect sizes

High‑Yield Points - ⚡ Biggest Takeaways

A 95% Confidence Interval (CI) that excludes the null value (e.g., 0 for mean difference, 1 for OR/RR) implies statistical significance (p < 0.05).

Narrow CIs indicate high precision and are often the result of larger sample sizes.

Wider CIs suggest lower precision and may be due to smaller sample sizes.

CIs provide the range of plausible effect sizes, which a p-value alone does not.

Always assess if the effect size is clinically meaningful, not just statistically significant.

Increasing the confidence level (e.g., from 95% to 99%) widens the CI.

Unlock the full lesson and continue reading

Signup to continue reading this lesson and unlimited access questions, flashcards, AI notes, and more

Scan to download app

UNLOCK FREE ACCESS