BackLecture 8: Learning – Classical and Operant Conditioning
Study Guide - Smart Notes
Tailored notes based on your materials, expanded with key definitions, examples, and context.
Learning
Introduction to Learning
Learning is the process by which behaviour or knowledge changes as a result of experience. In psychology, learning is often studied through the relationship between a stimulus (S) and a response (R). Key questions include whether learning is automatic, how it can be acquired, and what factors influence the increase or decrease of a response.
Stimulus (S): Any event or object in the environment that elicits a response.
Response (R): The behaviour or reaction to a stimulus.
Learning: Change in behaviour or knowledge due to experience.
Classical Conditioning
Definition and Key Concepts
Classical conditioning (also known as Pavlovian conditioning) is a type of learning in which an association is acquired between a neutral stimulus (NS) and an unconditioned stimulus (US) to produce a change in response (R).
Unconditioned Stimulus (US): A stimulus that naturally elicits a response (e.g., food).
Unconditioned Response (UR): The natural reaction to the US (e.g., salivation).
Neutral Stimulus (NS): A stimulus that does not initially elicit the UR (e.g., metronome).
Conditioned Stimulus (CS): The NS after association with the US (e.g., metronome after pairing).
Conditioned Response (CR): The learned response to the CS (e.g., salivation to metronome).
Pavlov's Dogs Example
US: Food
UR: Salivation
NS: Metronome
After repeated pairing of NS (metronome) with US (food), the NS becomes a CS.
CS: Metronome
CR: Salivation to metronome
Example: Pavlov's experiment demonstrated that dogs could learn to salivate to a previously neutral stimulus (metronome) after it was repeatedly paired with food.
Little Albert Experiment
US: Loud noise
UR: Fear response
NS: White rat
After pairing the NS (white rat) with the US (loud noise), the NS becomes a CS.
CS: White rat
CR: Fear response to white rat
Example: The Little Albert study showed that emotional responses (like fear) could be conditioned in humans.
Processes of Classical Conditioning
Acquisition: The initial phase of learning when a response is first established.
Extinction: Reduction in the conditioned response (CR) when the conditioned stimulus (CS) and unconditioned stimulus (US) are no longer paired.
Spontaneous Recovery: The reappearance of a previously extinguished CR after a period of non-exposure to the CS.
Example: If a dog stops salivating to a metronome after it is no longer paired with food (extinction), but later salivates again when hearing the metronome (spontaneous recovery).
Operant Conditioning
Definition and Key Concepts
Operant conditioning is a type of learning in which the relationship between stimulus and response is influenced by the consequences of the behavior. Behaviors are strengthened or weakened depending on whether they are followed by reinforcement or punishment.
Reinforcement: Any consequence that increases the likelihood of a behavior.
Punishment: Any consequence that decreases the likelihood of a behavior.
Positive: Adding a stimulus.
Negative: Removing a stimulus.
Principles of Operant Conditioning
Positive Reinforcement: Adding a pleasant stimulus to increase behavior (e.g., giving a treat).
Negative Reinforcement: Removing an unpleasant stimulus to increase behavior (e.g., turning off a loud noise).
Positive Punishment: Adding an unpleasant stimulus to decrease behavior (e.g., scolding).
Negative Punishment: Removing a pleasant stimulus to decrease behavior (e.g., taking away a toy).
Avoidance Learning: Behavior prevents the possibility that a stimulus will occur.
Escape Learning: Behavior removes a stimulus that is already present.
Example: In Skinner's box, a rat learns to press a lever to receive food (positive reinforcement) or to avoid a shock (negative reinforcement).
Processes of Operant Conditioning
Primary Reinforcer: Naturally reinforcing (e.g., food, water).
Secondary Reinforcer: Learned reinforcer (e.g., money, praise).
Discriminative Stimulus: A cue indicating whether a response will be rewarded.
Discrimination: Learning to respond to the original stimulus but not to similar ones.
Generalization: Response occurs to new, similar stimuli.
Example: A rat presses a lever only when a light is on (discriminative stimulus), but not when the light is off.
Reinforcement Schedules
Reinforcement schedules determine how and when a behavior is followed by a reinforcer. These schedules affect the rate and stability of learning.
Schedule Type | Description | Example |
|---|---|---|
Fixed Ratio (FR) | Reinforcement after a set number of responses | Factory worker paid for every 10 items produced |
Variable Ratio (VR) | Reinforcement after a variable number of responses | Slot machine payout |
Fixed Interval (FI) | Reinforcement after a set period of time | Weekly paycheck |
Variable Interval (VI) | Reinforcement after a variable period of time | Checking for a randomly arriving email |
Example: Gambling is often maintained by a variable ratio schedule, which is highly resistant to extinction.
Relevant Equations
Learning Rate Equation: Additional info: This equation is a simplified representation; actual learning rates depend on schedule and context.
Additional info: Dopamine pathways in the brain (e.g., nucleus accumbens and ventral tegmental area) are involved in reinforcement learning, influencing motivation and reward.