Common misconceptions

Common mistake
Wrong: Negative reinforcement decreases behavior because it involves something negative happening.
Right: Negative reinforcement increases behavior by removing an aversive stimulus; only punishment (positive or negative) decreases behavior.
The word 'negative' does not mean bad outcome — it means removal, like a minus sign. Negative reinforcement always increases the target behavior because you're taking away something aversive (pain, noise, nagging) when the behavior occurs. If behavior goes up, it's reinforcement; if behavior goes down, it's punishment. Use that outcome-based rule first, then decide positive (adding a stimulus) or negative (removing a stimulus).
Common mistake
Wrong: Fixed ratio schedules produce the greatest resistance to extinction because they deliver the most frequent rewards.
Right: Variable ratio schedules produce the greatest resistance to extinction because the unpredictable reward timing prevents the organism from detecting omission of reinforcement.
Fixed ratio schedules actually create a predictable pattern — the organism knows a reward is coming after a set number of responses, so when it stops coming, the absence is noticeable relatively quickly. Variable ratio schedules are unpredictable by design, so the organism can't distinguish 'reward is delayed' from 'reward is gone,' which is exactly why slot machines and social media feeds are so hard to quit. The unpredictability is the mechanism driving resistance to extinction.
Common mistake
Wrong: The Premack principle states that primary reinforcers are more effective than secondary reinforcers.
Right: The Premack principle states that a more probable (preferred) behavior can reinforce a less probable behavior (e.g., 'eat your vegetables, then you can play').
Primary vs. secondary reinforcers is about whether the reinforcer is innately rewarding (food, water) or learned (money, praise). The Premack principle is completely different — it's about using a high-probability behavior as a reinforcer for a low-probability behavior. 'Clean your room and then you can play video games' is Premack: playing games (preferred activity) reinforces cleaning (less preferred activity). The distinction is about behavior probability, not reinforcer type.
Common mistake
Wrong: Shaping and chaining are the same process because both build complex behaviors gradually.
Right: Shaping reinforces successive approximations toward a single target behavior; chaining links a sequence of already-learned behaviors into a behavioral chain.
Think of shaping as sculpting one behavior out of raw material — you reward closer and closer approximations until the animal does exactly what you want. Chaining is more like assembling a pipeline — each behavior in the sequence is already learned, and you're linking them together so the completion of one behavior cues the start of the next. The key test: if it's about getting one new behavior via incremental steps, it's shaping; if it's about connecting a series of already-trained behaviors, it's chaining.
Free Deck audit

See if your Anki deck covers this topic.

Upload your deck →
Guided session

Stuck on this? An AI tutor that probes your understanding.

Start a session →

What the exam tests

  1. Given a scenario (e.g., a dog sits to avoid a shock being turned off, or a child loses TV time for bad behavior), correctly classify it into the 2x2 grid: positive reinforcement, negative reinforcement, positive punishment, or negative punishment.
  2. Distinguish among the four reinforcement schedules (fixed ratio, variable ratio, fixed interval, variable interval) and predict their characteristic response patterns — especially why variable ratio produces the highest response rate and greatest resistance to extinction.
  3. Read a passage describing an animal training or behavior modification protocol and identify whether shaping, chaining, primary reinforcers, secondary reinforcers, or the Premack principle is being used.
  4. Describe Thorndike's puzzle box experiments and Skinner's operant chamber, understand what each was designed to measure, and recognize how this methodology established the behavioral approach to studying learning.

Can you avoid these mistakes?

A student studies harder to make a headache go away (the quiet of studying reduces stress-induced pain). Is this positive reinforcement, negative reinforcement, positive punishment, or negative punishment — and how do you know?
A casino slot machine pays out on a variable ratio schedule. A coworker gives you a bonus after every 10 sales calls (fixed ratio). Which behavior will be harder to extinguish if rewards stop, and mechanistically why?
A passage describes a trainer who first rewards a dog for looking at a hoop, then for approaching it, then for putting its nose through, and finally for jumping through completely. What technique is being described, and how would your answer change if the dog already knew each sub-step and the trainer was now connecting them in sequence?
Your friend says 'I use the Premack principle by giving my kid stickers (secondary reinforcers) instead of candy (primary reinforcers).' What two errors is your friend making, and what would a correct application of the Premack principle actually look like?

Related topics

See how your Anki deck covers this topic.

Upload your deck for a free audit →