The Basics - Looking Backwards
- Retrospective & Observational: Starts with the outcome and looks back in time for exposure.
- Groups: Compares a group with the disease (cases) to a group without the disease (controls).
- Goal: To determine if a past exposure is associated with the current outcome.
📌 Mnemonic: Case-Control looks back in time to find the cause.
⭐ The primary measure of association calculated in a case-control study is the Odds Ratio (OR).
- Odds Ratio (OR): $OR = (a/c) / (b/d) = ad/bc$
Pros & Cons - A Quick Trade-off
Strengths:
- Quick & Inexpensive: Ideal for initial investigations as they are faster and more economical than cohort studies.
- Rare Diseases: Excellent for studying conditions with low prevalence, as you start with known cases.
- Long Latency Periods: Efficient for diseases that take many years to develop following an exposure.
Weaknesses:
- Bias Prone: Highly susceptible to several biases, most notably:
- Recall Bias: Cases may remember past exposures differently than controls.
- Selection Bias: Controls may not be representative of the population that produced the cases.
- Cannot Calculate Incidence/Prevalence: This design does not follow a population over time, so true rates cannot be determined.
⭐ Case-control studies are particularly well-suited for investigating outbreaks and rare diseases.
Bias Traps - Spotting the Flaws
- Selection Bias: Controls are not representative of the population that produced the cases.
- Berkson's Bias: Hospitalized controls are more likely to have other health issues than the general population.
- Neyman Bias (Prevalence-Incidence): Misses fatal, mild, or silent cases, skewing the data.
- Recall Bias: Cases, aware of their disease, may recall exposures differently than controls.
⭐ Recall bias is a major limitation, where cases (who have the disease) may remember past exposures differently than controls (who do not).
- Observer Bias: Investigator's knowledge of case/control status influences data collection.
- Confounding: A third variable is associated with both exposure and outcome, distorting the true relationship.
- Mitigation: Use matching (e.g., for age, sex) or statistical adjustment.
Data Analysis - The Odds Ratio
- Calculated in case-control studies to estimate the strength of association between an exposure and a disease.
- Uses a 2x2 table:
| Cases (Disease) | Controls (No Disease) | |
|---|---|---|
| Exposed | a | b |
| Unexposed | c | d |
- Interpretation:
- OR > 1: ↑ Odds of exposure in cases (risk factor).
- OR < 1: ↓ Odds of exposure in cases (protective factor).
- OR = 1: No association.

⭐ If the 95% confidence interval for the odds ratio includes 1.0, the association is not statistically significant.
High‑Yield Points - ⚡ Biggest Takeaways
- Retrospective design that starts with the outcome (disease) and looks backward for associated exposures.
- Best for studying rare diseases or those with long latency periods.
- The primary measure of association is the Odds Ratio (OR).
- Cannot be used to calculate incidence or prevalence.
- Highly susceptible to recall bias and selection bias (especially in choosing controls).
- Groups are defined by disease status (cases vs. controls).
Continue reading on Oncourse
Sign up for free to access the full lesson, plus unlimited questions, flashcards, AI-powered notes, and more.
CONTINUE READING — FREEor get the app