Core Concept - Hindsight is 20/20
- Starts with the outcome and looks back in time to ascertain exposure status. The investigator uses historical records to select a cohort and trace their history forward to the present.
- Direction: Retrospective (Past → Present).
- Both exposure and outcome have already occurred at the start of the study.
- Relies heavily on the quality of existing medical records or databases.
- Measure of Association: Relative Risk (RR).
⭐ High-Yield: Exceptionally efficient for studying outcomes with long latency periods. Since data is historical, you can assess decades of follow-up instantly.

Design & Measures - The Data Detective
- The investigator identifies a cohort from past records and determines their exposure status at that time.
- The "follow-up" occurs by tracking these individuals forward (via records) to the present to see who developed the outcome.
- 📌 Retrospective = Review of Records.
- The primary measure of association is Relative Risk (RR), the ratio of incidence in the exposed to the unexposed.
- $RR = [a/(a+b)] / [c/(c+d)]$
⭐ A key strength is the ability to study risk factors for diseases with long latency periods quickly and efficiently.
Strengths vs. Weaknesses - The Trade-Offs
| Strengths (Advantages) | Weaknesses (Disadvantages) |
|---|---|
| * Time & Cost Efficient: Faster and less expensive as data has already been collected. Ideal for initial investigations. | * Data Limitations: Heavily reliant on existing records, which may be inaccurate, incomplete, or lack key variables. |
| * Rare Exposures: A key advantage is the ability to effectively study exposures that are uncommon in the general population. | * Bias Risk: Prone to selection bias (if survival is needed for inclusion) and information bias (misclassification of exposure/outcome). |
| * Long Latency Periods: Excellent for diseases that take many years to develop following an exposure, like cancer. | * Confounding: Often impossible to control for confounders that were not recorded in the original data source. |
| * Multiple Outcomes: Allows for the examination of multiple potential outcomes stemming from a single exposure. | * Temporal Ambiguity: While better than case-control, it can sometimes be hard to confirm exposure preceded the outcome. |
Study Showdown - Cohort vs. Case-Control
| Feature | Cohort Study | Case-Control Study |
|---|---|---|
| Starting Point | Exposure status | Disease (outcome) status |
| Directionality | Forward: Exposure → Outcome | Backward: Outcome → Exposure |
| Key Question | "What will happen?" (Prospective) "What has happened?" (Retrospective) | "What happened?" |
| Measure | Relative Risk (RR) | Odds Ratio (OR) |
| 📌 Mnemonic | Cohort = Coming events | Case-Control = Causes past |
High-Yield Points - ⚡ Biggest Takeaways
- A retrospective cohort study works backward, identifying a cohort based on past exposure status from existing records and tracking forward to the present for outcomes.
- It is faster and more economical than a prospective study, relying on pre-existing data like medical charts.
- The primary weakness is the dependency on data quality; incomplete or inconsistently collected records can introduce significant bias.
- Calculates Relative Risk (RR).
Continue reading on Oncourse
Sign up for free to access the full lesson, plus unlimited questions, flashcards, AI-powered notes, and more.
CONTINUE READING — FREEor get the app