Statistical Significance and p-Values
MCAT trap: Misinterprets the p-value as the probability that the null hypothesis is true. The p-value is the probability of observing data at least as extreme as the results obtained, assuming the null hypothesis is true.
Statistical significance and p-values are tested heavily on the MCAT — and the dominant misconception is that the p-value tells you how likely the null hypothesis is to be true. It does not. The p-value assumes the null is true and asks how surprising your data would be in that world: P(data this extreme | null is true). Flipping that to P(null is true | your data) is a logical error the MCAT directly tests. If p = 0.03, that is not a 3% chance the null is true — it is the probability of seeing your results if the null were true. Run your experiment, collect data, and if the p-value falls below your significance threshold (α, usually 0.05), you reject the null.
The MCAT tests this from multiple angles. At the definition level, it checks whether you know what a p-value actually represents. In passage-based questions, it hands you a results table and asks you to interpret which comparisons are significant — and then whether that significance actually matters clinically. These are different questions, and conflating them is one of the most common errors. You'll also see questions that require you to reason through the null vs. alternative hypothesis setup and decide what a given p-value tells you about rejecting or retaining the null.
The three big traps: thinking the p-value tells you the probability the null is true (it doesn't — it assumes the null is true and asks about your data), thinking a significant result means a meaningful one (a huge sample can make a 0.1 mmHg blood pressure difference 'significant'), and thinking that failing to reject the null proves there's no effect. The MCAT loves all three of these. Know the distinctions cold.
Common misconceptions
What the exam tests
- Know the precise definition of a p-value: it's the probability of observing data at least as extreme as yours, given that the null hypothesis is true — not the probability that the null hypothesis is true.
- Understand the logic of hypothesis testing: what the null and alternative hypotheses represent, and how comparing p to α (typically 0.05) determines whether you reject or fail to reject the null.
- Apply hypothesis testing mechanically: given a test statistic or p-value, decide whether the result crosses the significance threshold and what conclusion that supports.
- Interpret a data table from a research passage: identify which results are statistically significant and separately evaluate whether those results are clinically meaningful or practically important.
Can you avoid these mistakes?
Related topics
See how your Anki deck covers this topic.
Upload your deck for a free audit →