Distributions — Normal, Skewed, Mean vs Median
USMLE Step 1 trap: Incorrectly places the mean toward the mode rather than toward the tail in skewed distributions. In a skewed distribution, the mean is pulled toward the tail — above the median in right skew and below it in left skew.
Distributions are a foundational concept in biostatistics that shows up on USMLE Step 1 in both direct recall questions and passage-based vignettes. You need to know the properties of the normal (Gaussian) distribution, what happens to mean/median/mode when data are skewed, and when to choose one summary statistic over another. These ideas sound simple but students lose points here because they memorize definitions without building a working model of what skew actually does to summary statistics.
The USMLE Step 1 tests this in a few specific ways: identifying whether a distribution is normal or skewed from a graph or description, applying the 68-95-99.7 rule to calculate probabilities or reference ranges, and interpreting which summary statistic a researcher should use or report. Vignette questions might describe a dataset of income or lab values and ask you to identify the most appropriate measure of central tendency — that's a direct test of your skew intuition.
The biggest trap here is the skew-mean relationship. Students instinctively feel like the mean should sit near the 'center' of the data or near the peak — but that's the mode. The mean gets dragged toward the tail, not the peak. Get that direction right and the rest of the concept clicks into place.
Common misconceptions
What the exam tests
- Know the properties of the normal distribution: it is symmetric, bell-shaped, and described entirely by its mean and standard deviation — and know that 68% of values fall within 1 SD, 95% within 2 SD, and 99.7% within 3 SD of the mean.
- Given a skewed distribution (or a graph), correctly order mean, median, and mode — specifically, that the mean is pulled toward the tail: above the median in right (positive) skew and below the median in left (negative) skew.
- Determine whether mean or median is the more appropriate summary statistic for a given dataset — median is preferred when data are skewed or contain outliers; mean is appropriate when data are normally distributed.
Can you avoid these mistakes?
Related topics
See how your Anki deck covers this topic.
Upload your deck for a free audit →