Case-Control Studies
USMLE Step 1 trap: Incorrectly calculates relative risk instead of odds ratio from case-control data. Because disease prevalence in the sample is set by the investigator, true disease incidence is unknown, so only the odds ratio can be calculated from case-control data.
Case-control studies are one of the highest-yield observational designs on USMLE Step 1, and they're also one of the most consistently misapplied. The core structure: you start with people who already have the disease (cases) and people who don't (controls), then look backward to compare how often each group was exposed to a risk factor. That backward directionality — outcome first, then exposure — is what defines the design and explains everything downstream, including why you can only calculate an odds ratio, not a relative risk.
The exam tests this concept from multiple angles. Straightforward recall questions ask you to identify the design from a vignette. Application questions give you a 2x2 table and ask which statistic to compute. Passage-based questions describe a scenario — say, a researcher studying a rare cancer or a disease with a 20-year latency — and ask you to choose the most appropriate study design. Each angle requires a different layer of understanding, and students who only memorize 'case-control = odds ratio' routinely get the application and scenario questions wrong.
What makes this tricky is that the misconceptions are intuitive. It feels like you should be able to calculate relative risk from any 2x2 table. And it seems like recall bias would affect everyone equally. USMLE Step 1 specifically exploits these intuitions. The fix is understanding the mechanism behind each rule, not just the rule itself.
A gap in most decks — fewer than half of students in our cohort have cards covering this topic.
Common misconceptions
What the exam tests
- Identify the directionality of a case-control study: it starts with disease status (cases vs. controls) and looks backward at past exposure — not forward from exposure to outcome.
- Explain why relative risk cannot be calculated from case-control data and why the odds ratio is the correct effect measure instead.
- Recognize which research scenarios favor a case-control design, specifically rare diseases and conditions with long latency periods where prospective follow-up would be impractical.
- Identify the biases that disproportionately threaten case-control studies, especially recall bias and selection bias, and understand the mechanism by which they distort results.
Can you avoid these mistakes?
Related topics
See how your Anki deck covers this topic.
Upload your deck for a free audit →