Statistical Tests (t-test, ANOVA, Chi-Square, Correlation)
USMLE Step 1 trap: Applies chi-square to continuous outcomes instead of t-test or ANOVA. Chi-square tests are for categorical (nominal) data; continuous data comparing two groups requires a t-test, and more than two groups requires ANOVA.
Statistical tests are the decision-making tools of biostatistics — they tell you whether the difference you see between groups is real or just noise. USMLE Step 1 doesn't ask you to calculate these tests. It asks you to pick the right one given a study setup, recognize when an assumption is violated, and interpret what the output actually means. The three angles that show up repeatedly are: matching the test to the data type and number of groups, knowing when to switch from a parametric to a non-parametric test, and distinguishing what linear versus logistic regression actually produce.
The core skill is a two-step lookup: first, is the outcome categorical or continuous? Second, how many groups are being compared? Continuous outcome, two groups → t-test. Continuous outcome, three or more groups → ANOVA. Categorical outcome → chi-square (or Fisher's exact when sample sizes are small). Students get into trouble because they pick chi-square out of habit whenever they see a comparison, even when the outcome is something like blood pressure or serum creatinine — both of which are continuous and need a t-test or ANOVA.
The parametric vs. non-parametric distinction trips up students who think 'more powerful = always better.' Parametric tests (t-test, ANOVA, Pearson correlation) assume the data are normally distributed. When that assumption breaks down — small samples, skewed distributions — you swap to the non-parametric equivalent (Mann-Whitney U instead of t-test, Kruskal-Wallis instead of ANOVA, Spearman instead of Pearson). USMLE Step 1 loves giving you a vignette with a small, skewed sample and asking which test is appropriate. Defaulting to the parametric version without reading for normality is a reliable way to lose that point.
Common misconceptions
What the exam tests
- Given a study outcome type (categorical vs. continuous) and number of comparison groups, select the correct statistical test from: t-test, paired t-test, ANOVA, chi-square, or Fisher's exact test.
- Recognize when the assumptions for a parametric test (normality) are violated and identify the appropriate non-parametric substitute (e.g., Mann-Whitney U, Kruskal-Wallis, Spearman correlation).
- Distinguish between linear and logistic regression by their input requirements and outputs — linear regression predicts a continuous value and reports a coefficient/slope; logistic regression handles a binary outcome and reports an odds ratio.
Can you avoid these mistakes?
Related topics
See how your Anki deck covers this topic.
Upload your deck for a free audit →