You are conducting a study comparing the efficacy of two different statin medications. Two groups are placed on different statin medications, statin A and statin B. Baseline LDL levels are drawn for each group and are subsequently measured every 3 months for 1 year. Average baseline LDL levels for each group were identical. The group receiving statin A exhibited an 11 mg/dL greater reduction in LDL in comparison to the statin B group. Your statistical analysis reports a p-value of 0.052. Which of the following best describes the meaning of this p-value?

There is a 5.2% chance of observing a difference in reduction of LDL of 11 mg/dL or greater even if the two medications have identical effects

There is a 95% chance that the difference in reduction of LDL observed reflects a real difference between the two groups

Though A is more effective than B, there is a 5% chance the difference in reduction of LDL between the two groups is due to chance

If 100 permutations of this experiment were conducted, 5 of them would show similar results to those described above

This is a statistically significant result

A group of researchers is looking to study the effect of body weight on blood pressure in the elderly. Previous work measuring body weight and blood pressure at 2 time points in a large group of healthy individuals revealed that a 10% increase in body weight was accompanied by a 7 mm Hg increase in blood pressure. If the researchers want to determine if there is a linear relationship between body weight and blood pressure in a subgroup of elderly individuals in this study, which of the following statistical methods would best be employed to answer this question?

One-way analysis of variance (ANOVA)

Two-way analysis of variance (ANOVA)

A neuro-oncology investigator has recently conducted a randomized controlled trial in which the addition of a novel alkylating agent to radiotherapy was found to prolong survival in comparison to radiotherapy alone (HR = 0.7, p < 0.01). A number of surviving participants who took the alkylating agent reported that they had experienced significant nausea from the medication. The investigator surveyed all participants in both the treatment and the control group on their nausea symptoms by self-report rated mild, moderate, or severe. The investigator subsequently compared the two treatment groups with regards to nausea level.

| | Mild nausea | Moderate nausea | Severe nausea |
|---|---|---|---|
| Treatment group (%) | 20 | 30 | 50 |
| Control group (%) | 35 | 35 | 30 |

Which of the following statistical methods would be most appropriate to assess the statistical significance of these results?

Pearson correlation coefficient

Inferential statistics - Free USMLE High-Yield Review

Inferential Statistics - The Hypothesis Game

Hypothesis Testing: Aims to determine if there's enough evidence to reject a null hypothesis ($H_0$).
- Null Hypothesis ($H_0$): States no association or difference (e.g., new drug = placebo).
- Alternative Hypothesis ($H_a$): States an association or difference exists.
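p-value: Probability of obtaining results at least as extreme as those observed, assuming $H_0$ is true. It is not the probability that $H_0$ is true. If p ≤ $\alpha$ (usually 0.05), results are statistically significant.
Confidence Interval (CI): Range of plausible values for a population parameter, estimated from the sample. A 95% CI not containing the null value (1 for RR/OR, 0 for a difference in means) is significant at $\alpha$ = 0.05.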

⭐ A p-value < 0.05 is significant, but a CI provides more info: the effect size's range and precision.
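The link between a p-value and a CI can be sketched with a simple z-test. This is a minimal illustration, not the analysis used in any study above: the 11 mg/dL difference echoes the statin question, but the standard error (5.66) is a hypothetical value chosen only so the numbers are easy to follow, and a normal approximation is assumed.

```python
from statistics import NormalDist

def two_sided_p(z):
    """Two-sided p-value for a z statistic, assuming H0 (standard normal)."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

def ci95(estimate, se):
    """95% confidence interval using the normal critical value (about 1.96)."""
    z_crit = NormalDist().inv_cdf(0.975)
    return estimate - z_crit * se, estimate + z_crit * se

# Hypothetical inputs: observed difference in LDL reduction and its standard error.
diff, se = 11.0, 5.66

p = two_sided_p(diff / se)   # about 0.052, just above alpha = 0.05
low, high = ci95(diff, se)   # interval narrowly includes 0

print(f"p = {p:.3f}, 95% CI = ({low:.2f}, {high:.2f})")
```

Both outputs tell the same story: p is slightly above 0.05 and the 95% CI narrowly includes the null value of 0, but the CI also shows how wide the plausible range of the effect is.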

Type I and Type II Errors in Hypothesis Testing

Errors & Power:
- Type I Error ($\alpha$): False positive. Rejecting a true $H_0$. 📌 Stating a difference exists when it doesn't.
- Type II Error ($\beta$): False negative. Failing to reject a false $H_0$. 📌 Missing a real difference.
- Power ($1-\beta$): Probability of detecting a true effect. Increased by ↑ sample size.

Errors & Power - Dodging Statistical Traps

Type I Error (α): False positive. Rejecting a true null hypothesis (H₀). Probability = α, the significance level (not the p-value).
- α is the risk of a Type I error you're willing to accept (e.g., 0.05).
Type II Error (β): False negative. Failing to reject a false H₀.
Power: Probability of detecting a true effect. Power = $1 - \beta$. Conventional target is 0.80.

⭐ Increasing sample size (n) is the most common method to increase a study's power.
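How power grows with n can be sketched for a two-sided, two-sample z-test with equal group sizes. This is an approximation under stated assumptions: a known common standard deviation, a normal test statistic, and the negligible opposite tail ignored. The function name `approx_power` and the inputs (true difference 11, SD 30) are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def approx_power(delta, sigma, n_per_group, alpha=0.05):
    """Approximate power (1 - beta) to detect a true mean difference `delta`."""
    nd = NormalDist()
    se = sigma * sqrt(2 / n_per_group)     # SE of the difference in means
    z_crit = nd.inv_cdf(1 - alpha / 2)     # about 1.96 for alpha = 0.05
    return nd.cdf(abs(delta) / se - z_crit)

# Hypothetical scenario: true difference of 11 mg/dL, SD of 30 mg/dL.
for n in (50, 120, 200):
    print(n, round(approx_power(11, 30, n), 2))
```

Under these assumptions, power is below 0.5 with 50 participants per group and passes the conventional 0.80 threshold at around 120 per group.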

Statistical Power, Alpha, Beta, and Effect Size

Statistical Tests - The Right Tool for the Job

t-test: Compares means of 2 groups.
ANOVA: Compares means of ≥3 groups.
Chi-square ($χ^2$): Tests for an association between categorical variables by comparing proportions across ≥2 groups.
Pearson Correlation (r): Measures linear association between 2 continuous variables.
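A chi-square test on a 2 × 3 table can be sketched as follows. The counts are hypothetical: they assume 100 participants per arm so that the nausea percentages from the question above can be read as counts, which the source does not state. The p-value shortcut `exp(-stat / 2)` is exact only for 2 degrees of freedom; other table sizes need a chi-square distribution function from a statistics library.

```python
from math import exp

def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df

# Hypothetical counts: rows = treatment, control; columns = mild, moderate, severe.
counts = [[20, 30, 50], [35, 35, 30]]
stat, df = chi_square(counts)
p = exp(-stat / 2)  # upper-tail probability, valid because df == 2 here

print(f"chi2 = {stat:.2f}, df = {df}, p = {p:.4f}")
```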

⭐ ANOVA is preferred over multiple t-tests for comparing >2 groups because it reduces the cumulative risk of a Type I (α) error.
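The inflation of Type I error from multiple t-tests is simple arithmetic. A minimal sketch, assuming each pairwise test is independent and run at the same α (an approximation, since pairwise comparisons share data); the function name is hypothetical.

```python
from math import comb

def familywise_error(k_groups, alpha=0.05):
    """Number of pairwise tests and chance of at least one false positive."""
    m = comb(k_groups, 2)               # pairwise comparisons among k groups
    return m, 1 - (1 - alpha) ** m      # 1 - P(no false positive in m tests)

for k in (3, 4, 5):
    m, fwer = familywise_error(k)
    print(f"{k} groups: {m} t-tests, familywise error = {fwer:.3f}")
```

With 3 groups, 3 pairwise t-tests raise the overall false-positive risk from 5% to roughly 14%; a single ANOVA keeps it at α.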

Correlation: Measures the strength and direction of a linear relationship between two continuous variables.
- Pearson Coefficient ($r$): Value is between -1 and +1.
  - +1: Perfect positive correlation.
  - -1: Perfect negative correlation.
  - 0: No linear correlation.
- Coefficient of Determination ($r^2$): Proportion of variance in one variable explained by the other. An $r$ of 0.7 means $r^2 = 0.49$, so 49% of the variance is shared.
Linear Regression: Predicts a dependent variable's value based on an independent variable.
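Pearson's $r$, $r^2$, and a least-squares regression line can all be computed from the same sums of squares. A minimal sketch with made-up data (body weight in kg, systolic blood pressure in mm Hg); the values and the function name are hypothetical and for illustration only.

```python
from math import sqrt

def pearson_and_fit(x, y):
    """Return (r, r_squared, slope, intercept) for paired continuous data."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    r = sxy / sqrt(sxx * syy)              # strength and direction, -1 to +1
    slope = sxy / sxx                      # change in y per unit change in x
    intercept = mean_y - slope * mean_x
    return r, r * r, slope, intercept

# Hypothetical data for five individuals.
weight_kg = [60, 70, 80, 90, 100]
sbp_mmhg = [118, 125, 129, 138, 140]

r, r2, slope, intercept = pearson_and_fit(weight_kg, sbp_mmhg)
print(f"r = {r:.3f}, r^2 = {r2:.3f}, SBP = {intercept:.1f} + {slope:.2f} * weight")
```

Correlation answers "how strongly are the two variables linearly related?", while the regression line answers "what value of y is predicted for a given x?".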

⭐ Correlation does not imply causation. A significant p-value (< 0.05) suggests the observed relationship is unlikely to be due to chance alone, but doesn't prove one variable causes the other.

Scatter plots: negative, no, and positive correlation

High‑Yield Points - ⚡ Biggest Takeaways

A p-value < 0.05 means the results are statistically significant and the null hypothesis is rejected.

Confidence intervals are significant if they exclude 0 for mean differences or 1 for ratios (OR/RR).

Type I error (α) is a false positive: rejecting a true null hypothesis.

Type II error (β) is a false negative: failing to reject a false null hypothesis.

Power (1-β) is the probability of detecting a true difference; it increases with sample size.

T-tests compare means of 2 groups; ANOVA for ≥3 groups; chi-square for categorical data.