Common misconceptions

Common mistake
Gap: Cannot enumerate all three required conditions for a variable to qualify as a confounder
A confounder must be associated with the exposure, independently associated with the outcome, and not be an intermediate step in the causal pathway between exposure and outcome.
Students often remember that a confounder is 'related to both exposure and outcome' but forget the third condition: it must not be an intermediate variable on the causal pathway. If the variable sits between the exposure and outcome in the causal chain, adjusting for it would block the very pathway you're trying to study — that's overadjustment, not confounder control. All three conditions must be satisfied simultaneously, and the exam will include distractors that meet only two of them.
Common mistake
Wrong: Confounding and effect modification are both problems to be eliminated from analysis.
Right: Confounding is a bias to be controlled and removed, whereas effect modification is a true biological phenomenon that should be reported by presenting stratum-specific estimates.
Confounding is an artifact — a distortion of the true exposure-outcome relationship caused by a third variable — so the goal is to remove it and report a single adjusted estimate. Effect modification means the exposure has genuinely different effects in different subgroups, which is real biology worth knowing. You handle effect modification by reporting stratum-specific results, not by pooling or adjusting them away. Collapsing across strata when effect modification is present would hide clinically important information.
Common mistake
Wrong: Matching in a case-control study eliminates confounding without any further analytic steps.
Right: Matching controls confounding at the design stage but requires matched analysis (e.g., conditional logistic regression) to avoid introducing bias by over-matching.
Matching in a case-control study controls confounding at the design stage by ensuring cases and controls are similar on the matched variable — but it doesn't eliminate the statistical influence of that variable. If you then run an unmatched analysis, you can actually introduce bias (overmatching). The correct approach requires a matched analytic method, such as conditional logistic regression or McNemar's test, to properly account for the matched pairs.
Common mistake
Gap: Cannot walk through a concrete example illustrating how a third variable meets all three criteria for confounding
The classic confounding example is alcohol and lung cancer: smoking is a confounder because it is associated with alcohol use, independently causes lung cancer, and is not on the causal pathway from alcohol to lung cancer.
In the classic example, crude data suggest alcohol use increases lung cancer risk. But smoking meets all three criteria for confounding: smokers drink more (associated with exposure), smoking independently causes lung cancer (associated with outcome), and smoking is not caused by alcohol on the way to causing cancer (not an intermediate). Once you stratify by or adjust for smoking, the alcohol-lung cancer association disappears or shrinks substantially — that's the hallmark of confounding.
Free Deck audit

See if your Anki deck covers this topic.

Upload your deck →
Guided session

Stuck on this? An AI tutor that probes your understanding.

Start a session →

What the exam tests

  1. Know all three required conditions a variable must meet to qualify as a confounder: it must be associated with the exposure, independently associated with the outcome, and not be an intermediate step on the causal pathway from exposure to outcome.
  2. Be able to walk through a classic example — like alcohol, smoking, and lung cancer — and explicitly map each criterion onto the variables to explain why smoking is the confounder.
  3. Know the design-stage strategies to control confounding (randomization, restriction, matching) and understand what each one does and does not accomplish on its own.
  4. Know the analysis-stage strategies to control confounding (stratification using Mantel-Haenszel, multivariable regression) and when each is used.
  5. Distinguish confounding from effect modification: confounding is a bias to be removed, while effect modification is a real phenomenon that should be reported using stratum-specific estimates rather than a single pooled estimate.

Can you avoid these mistakes?

A study finds that coffee drinkers have higher rates of pancreatic cancer. A critic argues that smoking is a confounder. Walk through all three required conditions and determine whether smoking qualifies as a confounder in this relationship.
You stratify a cohort study by sex and find that the RR for the exposure-outcome relationship is 3.2 in men and 3.1 in women, but the crude RR is 1.4. What does this pattern suggest — confounding or effect modification — and what should you report?
A researcher wants to control for age as a potential confounder in a case-control study. She decides to match each case to a control of the same age. Is confounding now fully controlled? What additional step is required, and why?
A researcher designs a case-control study on the association between red wine intake and cardiovascular disease. She is concerned that socioeconomic status (SES) may confound the relationship. She decides to restrict enrollment to patients with high SES and also matches each case to a control with identical SES. Name the two design-stage strategies she used. Then name two analysis-stage alternatives she could have used instead. For one of her design-stage strategies, identify a specific limitation that the other design-stage strategy does not share.

Related topics

See how your Anki deck covers this topic.

Upload your deck for a free audit →