Question 1

Misdefines p-value as the probability the null hypothesis is true rather than a conditional probability

Accepted Answer

The p-value is not a probability about the null hypothesis itself — it's a probability about your data given the null. Specifically, it answers: 'If the null were true, how likely is it I'd see results this extreme or more extreme?' A small p-value means your data are unlikely under the null, which is evidence against it — but it never tells you the probability that the null is actually true. Confusing these two framings leads to wildly wrong interpretations on vignettes that ask what the p-value means.

Question 2

Swaps definitions of Type I (false positive) and Type II (false negative) errors

Accepted Answer

Type I error is a false positive — you cried wolf when there was none. You rejected the null hypothesis, but it was actually true. Type II error is a false negative — you missed a real effect. The null was false, but you failed to reject it. A memory anchor: Type I = you made a positive claim that was wrong (false positive); Type II = you failed to make a claim when you should have (false negative). Alpha is the acceptable rate of Type I errors you set before the study.

Question 3

Uses CI-crosses-zero rule for ratio measures instead of CI-crosses-one

Accepted Answer

The null value in statistics is the value that represents no effect — and what 'no effect' looks like depends on whether you're measuring a ratio or a difference. For ratios like RR, OR, and HR, a value of 1 means the groups are identical (event rates cancel out in the ratio). For differences like ARR or mean difference, a value of 0 means no difference. So when you check if a CI suggests statistical significance, ask: does this CI include the null value? For ratios, that's 1. For differences, that's 0. Applying the wrong rule to a ratio measure is a classic Step 1 trap.

Question 4

Incorrectly believes raising alpha reduces power rather than increases it

Accepted Answer

Raising alpha means you're willing to accept more false positives — you've lowered the bar for declaring a result significant. Because it's easier to reject the null, you're also less likely to miss a true effect, which means power goes up. This is counterintuitive because alpha and beta feel like they should move together, but they don't — they trade off against each other. The full list of power-increasing factors: larger n, larger true effect size, smaller variance in measurements, and larger alpha. Know all four.

Question 5

Misinterprets 95% CI as a probability statement about a fixed interval rather than a frequentist coverage statement

Accepted Answer

Once a confidence interval is calculated, the true parameter is either in it or it isn't — there's no probability involved for that specific interval. The 95% refers to the long-run performance of the method: if you ran this study 100 times and built a CI each time, about 95 of those intervals would capture the true value. This is a frequentist coverage statement, not a Bayesian probability about a fixed interval. On the exam, answers that say 'there is a 95% chance the true mean lies between X and Y' are technically wrong — the correct framing is about the procedure, not the specific interval.

Hypothesis Testing, Power, and Confidence Intervals

Common misconceptions

What the exam tests

Can you avoid these mistakes?

Related topics