PSYC 2081 -无代写|学霸联盟

PSYC 2081 -无代写

时间：2025-08-21

Human associative learning
Lecture 2
Intro to human associative learning pt 2
Prof Mike Le Pelley
m.lepelley@unsw.edu.au
PSYC 2081
The Hats of Power
• Post thoughts, insights, jokes, memes,
observations etc to the Moodle forum
• Best entries will win one of four Hats of Power
Summary of Lecture 1
• Contingency: How strongly are events related?
• Learning is influenced by contingency (as you would hope!)
• But that’s not the whole story – for example
– Learning is gradual even though ΔP is constant; “learning curves”
– When we learn about multiple cues at the same time, we don’t learn
about each one individually based on its contingency
– Instead cues compete with each other
 Blocking as an example of “cue competition”
• Learning is not the same as calculating contingency. So what
are people doing when they learn?
• We can understand learning as the gradual formation of an
association (a link) between representations
An alternative view: Association formation
CS
cue
US
outcome
• What determines whether (and how fast) this association
strengthens?
• Proposed by Rescorla & Wagner (1972)
• Surprise is the key to learning
– We only learn when something happens that we don’t expect
 We learn when we are surprised by something
– If everything occurs as expected, we don’t learn anything new
 We don’t learn if we are not surprised
• Rescorla & Wagner captured this idea in an equation, providing a
simple but powerful model of the learning process
The Rescorla-Wagner model
Robert
Rescorla
Allan
Wagner
∆V = α β ( λ – ΣV )
Salience
of cue
Salience
of
outcome
Observed
magnitude
of outcome
Expected
magnitude
of outcome
Σ means “sum”
So ΣV means “summed
associative strength of all cues”
∆ means “change”
So ∆V means “change in
associative strength”
i.e., learning
cue outcome
V
V = predictive Value
(or associative strength)
Amount of
learning
The Rescorla-Wagner model
∆V = α β ( λ – ΣV )
Prediction error:
“Surprisingness” of
outcome
• When prediction error is zero, there is no learning
• Learning = Reduction in PE
• Learning = Things becoming more expected, and less surprising
Salience
of cue
Salience
of
outcome
Observed
magnitude
of outcome
Expected
magnitude
of outcome
Amount of
learning
The Rescorla-Wagner model
cue outcome
V
V = predictive Value
(or associative strength)
V = 0.4
∆V = α β ( λ – ΣV )
Associative
strength
Salience
of cue
Salience of
outcome
Observed
magnitude of
outcome
Expected
magnitude of
outcome
∆Vvege = 0.5 x 0.8 ( 1 – 0 ) = 0.4
Vvege = 0
Vvege = 0.4
0.5 0.8 1Let’s assume
Vege illness
Large
prediction
error
V = 0.64
∆V = α β ( λ – ΣV )
∆Vvege = 0.5 x 0.8 ( 1 – 0.4 ) = 0.24
Vvege = 0.4
Vvege = 0.64
Vege illness
Smaller
prediction
error
Associative
strength
Salience
of cue
Salience of
outcome
Observed
magnitude of
outcome
Expected
magnitude of
outcome
0.5 0.8 1Let’s assume
Trial 0: Vvege = 0
Trial 1: Vvege = 0.40
Trial 2: Vvege = 0.64
Trial 3: Vvege = 0.78
Trial 4: Vvege = 0.87
Trial 5: Vvege = 0.92
0
0.5
1
0 5 10 15
As
so
cia
tiv
e
st
re
ng
th
Trial
∆V = α β ( λ – ΣV )
SURPRISING
NOT
SURPRISING
Associative
strength
Salience
of cue
Salience of
outcome
Observed
magnitude of
outcome
Expected
magnitude of
outcome
0.5 0.8 1Let’s assume
V = 0.99
Vege illness
0
0.5
1
0 5 10 15
As
so
cia
tiv
e
st
re
ng
th
Trial
∆V = α β ( λ – ΣV )
Associative
strength
Salience
of cue
Salience of
outcome
Observed
magnitude of
outcome
Expected
magnitude of
outcome
∆Vvege = 0.5 x 0.8 ( 0 – 0.99 ) = –0.40
Vvege = 0.99
Vvege = 0.59
0.5 0.8 0Let’s assume
Negative
prediction
error
V = 0.59
Vege illness
V = 0.35
∆V = α β ( λ – ΣV )
∆Vvege = 0.5 x 0.8 ( 0 – 0.59 ) = –0.24
Vvege = 0.59
Vvege = 0.35
Vege illness
Smaller
negative
prediction
error
Associative
strength
Salience
of cue
Salience of
outcome
Observed
magnitude of
outcome
Expected
magnitude of
outcome
0.5 0.8 0Let’s assume
00.5
1
0 5 10 15 20 25
As
so
cia
tiv
e
st
re
ng
th
Trial
Acquisition
Extinction
Acquisition and extinction
SURPRISING
NOT
SURPRISING
NOT
SURPRISINGSURPRISING
∆V = α β ( λ – ΣV )
Salience
of cue
Salience of
outcome
Observed
magnitude
of outcome
Expected
magnitude
of outcome
0
0.5
1
0 5 10 15
As
so
cia
tiv
e
st
re
ng
th
Trial
Associative
strength
α = 0.5, β = 0.8
α = 0.5, β = 0.2
α = 0.2, β = 0.8
-80
-60
-40
-20
0
20
40
60
80
0 5 10 15 20 25 30 35 40
Ju
dg
m
en
t r
at
in
g
Trial
Lopez & Shanks (unpublished)
See Shanks 1995, ch2, p32
Rescorla-Wagner and contingency
People
• Learning is gradual, even though contingency (ΔP) does not change
– “Learning curves”
• The Rescorla-Wagner model also shows this pattern
∆P = 0.5
∆P = -0.5
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0 5 10 15 20 25 30 35 40
As
so
cia
tiv
e
st
re
ng
th
Trial
RW
∆P = 0.5
∆P = -0.5
01
2
3
4
Bread Dates
Ju
dg
m
en
t r
at
in
g
Aitken, Larkin & Dickinson
(2000, Expt 1)
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Rescorla-Wagner and blocking
• Previous learning about apples blocks
subsequent learning about bread
• Cues are not learned about
independently
• Instead, cues compete with each other
00.5
1
0 5 10 15 20
As
so
cia
tiv
e
st
re
ng
th
Trial
∆Vbread : (λ – ΣV)
Vapple
Rescorla-Wagner and blocking
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
00.5
1
0 5 10 15 20
As
so
cia
tiv
e
st
re
ng
th
Trial
∆Vbread : (1 – ΣV)
Vapple
Rescorla-Wagner and blocking
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
00.5
1
0 5 10 15 20
As
so
cia
tiv
e
st
re
ng
th
Trial
∆Vbread : (1 – [Vapple + Vbread])
Vapple
Rescorla-Wagner and blocking
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
00.5
1
0 5 10 15 20
As
so
cia
tiv
e
st
re
ng
th
Trial
Vapple
Rescorla-Wagner and blocking
Outcome not
surprising
∆Vbread : (1 – [0.85 + Vbread])
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
00.5
1
0 5 10 15 20
As
so
cia
tiv
e
st
re
ng
th
Trial
Vbread
Vapple
Outcome not
surprising
Rescorla-Wagner and blocking
= 0.15
Small
prediction
error
∆Vbread : (1 – [0.85 + 0])
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Allergy
Apples
Bread
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Rescorla-Wagner and blocking
Allergy
Carrot
Dates
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Rescorla-Wagner and blocking
01
2
3
4
Bread Dates
Ju
dg
m
en
t r
at
in
g
Aitken, Larkin & Dickinson
(2000, Expt 1)
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
• People learn a
weaker
association for
bread than dates
• And so does the
RW model!
0
0.2
0.4
0.6
0.8
Bread Dates
As
so
cia
tiv
e
st
re
ng
th
People RW
Rescorla-Wagner and blocking
Summary so far
• Lecture 1: Learning = Calculating contingency?
• A different view: Learning = Forming association between
representations
• The Rescorla-Wagner model
– Surprise drives association formation (learning)
– RW is one model of association formation – there are many others
 RW is probably the most influential model of associative learning
 A simple, intuitive account – but very powerful
• E.g., explains effects of contingency, learning curves, and cue
competition (blocking)
 Still often used as a “default” view of learning
An alternative view
“Many theorists maintain a belief in a learning mechanism in
which links between mental representations are formed
automatically… We conclude that learning is the consequence
of propositional reasoning processes that cooperate with the
unconscious processes involved in memory retrieval”
An alternative view
The link-based approach
(e.g., Rescorla-Wagner)
ILLV
The propositional approach
Last time I ate Vegemite I was ill.
Perhaps Vegemite makes me ill.
The propositional approach
Last 2 times I ate Vegemite I was ill.
Fairly sure Vegemite makes me ill.
0
0.5
1
0 5 10 15
St
re
ng
th
o
f b
el
ie
f
Trial
The propositional approach
• The “thinking about it” approach
• Remember what went with what in the past, try and
work out most likely explanation
Blocking
0
1
2
3
4
Bread Dates
Ju
dg
m
en
t r
at
in
g • Previous learning about effectiveness of apples
blocks subsequent learning
about bread
• Cues compete with each
other
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Aitken, Larkin & Dickinson
(2000, Expt 1)
Propositions and blocking
• Remember what went with what in the past, try and
work out most likely explanation
“If apple and bread together cause the outcome to occur with
the same probability as apple alone, this implies that bread is
not a cause of the outcome.”
Phase 1 Phase 2 Test
Apple → allergy Apple & Bread → allergy Bread?
Carrot & Dates → allergy Dates?
Blocking and trial order
• According to the propositional account, order of the trials doesn’t matter
• Should get blocking in the standard (forward) procedure, and in a backward
procedure
Phase 1 Phase 2 Test
A → outcome A + B → outcome B?
C + D → outcome D?
“If A and B together cause the outcome to occur with the same probability and
magnitude as A alone, this implies that B is not a cause of the outcome.”
FO
RW
AR
D Proposition
Blocking
BA
CK
W
AR
D
Phase 1 Phase 2 Test
A + B → outcome A → outcome B?
C + D → outcome D?
Blocking
• In the Forward case, RW model says that blocking occurs because the outcome
is less surprising on “A+B” trials than “C+D” trials
• In the backward case, the outcome is equally surprising on “A+B” trials and
“C+D” trials
• So RW model predicts no blocking in backward condition
Blocking and trial order
Phase 1 Phase 2 Test
A → outcome A + B → outcome B?
C + D → outcome D?FO
RW
AR
D Proposition
Blocking
BA
CK
W
AR
D
Phase 1 Phase 2 Test
A + B → outcome A → outcome B?
C + D → outcome D?
Blocking
RW
Blocking
No
blocking
B D
0
2
4
6
8
Ju
dg
m
en
t r
at
in
g ✱• We’ve already seen
blocking in the
“Forward” case
– e.g., Aitken, Larkin &
Dickinson (2000), see
Lecture 1
• And it also occurs in the
“Backward” case!
– e.g., Wasserman & Berglan (1998)
• Consistent with propositional
account, but not with RW
Blocking and trial order
Phase 1 Phase 2 Test
A → outcome A + B → outcome B?
C + D → outcome D?FO
RW
AR
D Proposition
BA
CK
W
AR
D
Phase 1 Phase 2 Test
A + B → outcome A → outcome B?
C + D → outcome D?
Blocking
Blocking
RW
Blocking
No
blocking
“If A and B together cause the outcome to occur with the same
probability and magnitude as A alone, this implies that B is not
a cause of the outcome.”
• In order to draw this inference, you need to know that B has
been paired with the outcome
– Inferences based on memories of A→outcome and AB→outcome
– B is paired with the outcome, but does not cause the outcome
Blocking of memory
Phase 1 Phase 2 Test
A → outcome AB → outcome B?
Causality Memory
Propositional
account
Blocking of causal
judgement
Good memory of
outcome
Link-based account
(e.g. RW)
Blocking of causal
judgement
Blocking of memory
of outcome
Phase 1 Phase 2 Test
A → outcome AB → outcome B?
Blocking of memory
Phase 1 Phase 2 Test
A → outcome AB → outcome B?
A
B
outcome
• RW: Blocked cue does not form a
strong link with the outcome
– People don’t encode that B was ever
paired with the outcome
Blocking of memory
Phase 1 Phase 2 Test
A → outcome AB → outcome B?
Blocking of memory
Causality Memory
Propositional
account
Blocking of causal
judgement
Good memory of
outcome
Link-based account
(e.g. RW)
Blocking of causal
judgement
Blocking of memory
of outcome
Mitchell, Lovibond, Minard & Lavis (2006)
Mr X ate
Apple
He suffered
Xianethis
Mitchell, Lovibond, Minard & Lavis (2006)
Mr X ate
Peach
He suffered
Daryosis
Mitchell, Lovibond, Minard & Lavis (2006)
Mr X ate
Apple Breadand
He suffered
Xianethis
Which illness followed
Bread
during training?
Daryosis Xianethis
Plonthema Eneuritis
To what extent did the food cause
this illness?
0 1 2 3 4 5 6 7 8
definitely
not causal
possibly definitely
causal
Blocking in ratings
of causality…
… and in memory
of outcomes
A → outcome AB → outcome B = Blocked
CD → outcome D = Control
Mitchell, Lovibond, Minard & Lavis (2006)
Phase 1 Phase 2 Test
A → outcome AB → outcome B?
Blocking of memory
Causality Memory
Propositional
account
Blocking of causal
judgement
Good memory of
outcome
Link-based account
(e.g. RW)
Blocking of causal
judgement
Blocking of memory
of outcome
Summary
Propositions Links
Backward
blocking
Blocking of
memory of cue-
outcome pairings
Summary
• It’s still not clear: More research needed!
• Seems likely that both types of process operate
• Simpler experiments encourage “thinking about it”
• More complex experiments discourage thinking about it, so more
automatic link-formation mechanism dominates
Wasserman & Berglan (1998)
Backward blocking
Supports propositional
Mitchell et al (2006): Blocking of cue-outcome memory
Supports link-based
Summary
• “Associative learning” may be due to more
than one process

学霸联盟