Common misconceptions

Common mistake
Wrong: Any variable associated with the outcome is a confounder.
Right: A confounder must be associated with both the exposure and the outcome, and must not be on the causal pathway between them.
A variable that only correlates with the outcome but has nothing to do with the exposure is not a confounder — it's just a correlate of the outcome. For something to confound the exposure-outcome relationship, it has to be linked to both sides: it influences who receives the exposure AND independently affects the outcome. If you only check one association, you'll misclassify variables and choose wrong answer choices on passage questions.
Common mistake
Wrong: A control variable and a confounding variable are the same thing.
Right: A control variable is deliberately held constant by the experimenter to isolate the independent variable's effect; a confounding variable is an uncontrolled third variable that distorts the exposure-outcome relationship.
These are conceptually opposite, not synonymous. A control variable is a deliberate design choice — the researcher fixes it to prevent it from varying and muddying the results. A confounding variable is a design failure — it varies freely across groups and creates a spurious or distorted association between exposure and outcome. The same variable (say, age) could be a control variable in one study and a confounder in another, depending on how the study was designed.
Common mistake
Wrong: Randomization eliminates confounders only if the sample size is very large.
Right: Randomization distributes both known and unknown confounders equally across groups regardless of whether researchers can identify them, and is effective even at moderate sample sizes.
Randomization works by distributing all characteristics — known and unknown — roughly equally across groups, so those characteristics can't systematically favor one group over another. This mechanism doesn't depend on sample size to 'kick in'; it works by the logic of random assignment itself. Large samples make the distribution more precise, but randomization is the reason confounders are controlled, not sample size alone.
Common mistake
Wrong: A variable on the causal pathway between exposure and outcome is a confounder that should be controlled.
Right: A variable on the causal pathway is a mediator, not a confounder; controlling for a mediator blocks the causal effect and distorts the analysis.
A mediator sits on the causal chain between the exposure and the outcome — it's how the exposure produces its effect. If you statistically control for a mediator, you're blocking the very pathway you're trying to study, which will make a real causal effect disappear or look smaller than it is. This is the opposite of what controlling a confounder does. Always ask: 'Is this variable caused by the exposure?' If yes, it's likely a mediator — don't control for it.
Free Deck audit

See if your Anki deck covers this topic.

Upload your deck →
Guided session

Stuck on this? An AI tutor that probes your understanding.

Start a session →

What the exam tests

  1. Know the four variable types cold: independent (manipulated by researcher), dependent (measured as outcome), control (held constant to isolate the IV), and confounding (uncontrolled variable that distorts the exposure-outcome relationship).
  2. Understand the two-part rule for confounders: a variable only qualifies as a confounder if it is associated with both the exposure and the outcome, AND it is not on the causal pathway between them — both conditions must hold.
  3. Given a passage describing a flawed study, identify which variable is acting as a confounder and explain how it could be controlled — through randomization, matching, restriction, or statistical stratification.
  4. Critique how variables are operationally defined in a study design and evaluate whether control variables adequately isolate the independent variable's effect from other explanations.

Can you avoid these mistakes?

A study examines whether coffee consumption (exposure) causes hypertension (outcome). Researchers notice that smokers tend to drink more coffee and smoking independently raises blood pressure. Is smoking a confounder, a mediator, or a control variable here — and what would you need to verify before concluding it's a confounder?
A researcher studying a new drug randomly assigns 60 participants to drug vs. placebo groups. A critic argues the randomization can't control for age as a confounder because the sample is too small. Is this criticism valid? Explain the mechanism by which randomization controls confounders.
In a study on exercise and depression, researchers statistically adjust for 'serotonin levels' when calculating the effect of exercise on depression scores. A classmate says this is good practice because serotonin is associated with both exercise and depression. What's the flaw in this reasoning?
A lab experiment tests whether light intensity affects plant growth rate. The researcher keeps temperature, soil type, and water volume the same across all pots. What type of variables are temperature, soil type, and water volume — and how are they conceptually different from a confounding variable that might appear in an observational version of this same question?

Related topics

See how your Anki deck covers this topic.

Upload your deck for a free audit →