Task 6 - Instrumental Conditioning Flashcards

Question

Response consequence delay

Answer 1

the longer one waits to punish someone or something, the weaker the association between the punishment and the ...

Answer 2

an organism’s willingness to forego a small immediate reward in favor of a larger future reward

Answer 3

positive does not mean good → instead it means added

Answer 4

the desired response causes the reinforcer to be added to the environment

Answer 5

an undesired response causes a punisher to be added to the environment

Answer 6

behaviour is encouraged (reinforced) because it causes something to be subtracted from the environment -- over time the response becomes more frequent -- sometimes called avoidance training

Answer 7

something is subtracted (negative) from the environment, and this subtraction punishes the behavior -- sometimes called omission training

Answer 8

negative does not mean bad, it means subtraction in a mathematical sense

Answer 9

the terms reinforcement and punishment describe whether the response increases (reinforcement) or decreases (punishment) as a result of training. the terms positive and negative describe whether the outcome is added (positive) or taken away (negative)
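The two distinctions on this card form a two-by-two scheme, which can be sketched as a small lookup. This is only an illustration: the function name `classify_procedure` and its two boolean arguments are hypothetical and not part of the course material.

```python
def classify_procedure(outcome_added: bool, response_increases: bool) -> str:
    """Name the operant procedure from the two distinctions on this card.

    outcome_added: True if the outcome is added (positive),
                   False if it is taken away (negative).
    response_increases: True if the response becomes more frequent
                        (reinforcement), False if it becomes less
                        frequent (punishment).
    """
    sign = "positive" if outcome_added else "negative"
    effect = "reinforcement" if response_increases else "punishment"
    return f"{sign} {effect}"


# Something is taken away and the behaviour becomes more frequent:
print(classify_procedure(outcome_added=False, response_increases=True))
```

The last line prints the label for the avoidance-training case described in the earlier cards.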

Answer 10

patterns in which an outcome follows a response less than 100 percent of the time -- Example: Becky has to clean her room seven days in a row to obtain her weekly allowance (seven responses for one reinforcement)

Answer 11

1. Fixed-ratio (FR) schedule
2. Fixed-interval (FI) schedule
3. Variable-ratio (VR) schedule
4. Variable-interval (VI) schedule

Answer 12

In operant conditioning, a reinforcement schedule in which a specific number of responses is required before a reinforcer is delivered; for example, FR 5 means that reinforcement arrives after every fifth response

Answer 13

In operant conditioning with a fixed-ratio (FR) schedule of reinforcement, a brief pause following a period of fast responding leading to reinforcement. It just happens: the animal takes a break -- the longer the organism has been responding, the longer the pause will be

Answer 14

an FI schedule reinforces the first response after a fixed amount of time
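A minimal sketch of the FI rule, assuming responses are given as a sorted list of times and that the interval timer restarts when a reinforcer is delivered (real procedures may time the interval differently). The name `fixed_interval` is hypothetical.

```python
from typing import List


def fixed_interval(response_times: List[float], interval: float) -> List[bool]:
    """Mark which responses are reinforced under an FI schedule.

    response_times must be sorted in ascending order.
    Assumption: the timer restarts at the moment of reinforcement.
    """
    reinforced = []
    available_at = interval  # first reinforcer becomes available after one interval
    for t in response_times:
        if t >= available_at:
            reinforced.append(True)       # first response after the interval elapsed
            available_at = t + interval   # restart the timer
        else:
            reinforced.append(False)      # responding early earns nothing
    return reinforced


# FI 10: only the responses at t=10 and t=25 are reinforced.
print(fixed_interval([1, 5, 10, 11, 12, 25], interval=10))
```

Responses made before the interval has elapsed are simply wasted, which is why responding early in the interval tends to be low.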

Answer 15

a VR schedule provides reinforcement after a certain average number of responses --> as a result, there is a steady, high rate of responding even immediately after a reinforcement is delivered, because the very next response just might result in another reinforcement
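The two ratio schedules can be compared with a short sketch. The function names are hypothetical, and the VR version is an assumption: it reinforces each response with probability 1/mean, which is only one simple way to get the required average.

```python
import random
from typing import List


def fixed_ratio(n_responses: int, ratio: int) -> List[bool]:
    """FR schedule: every `ratio`-th response is reinforced."""
    return [i % ratio == 0 for i in range(1, n_responses + 1)]


def variable_ratio(n_responses: int, mean_ratio: int, seed: int = 0) -> List[bool]:
    """VR schedule (assumed implementation): each response is reinforced
    with probability 1/mean_ratio, so reinforcement arrives after
    mean_ratio responses on average but is unpredictable on any trial."""
    rng = random.Random(seed)
    return [rng.random() < 1 / mean_ratio for _ in range(n_responses)]


# FR 5: reinforcement arrives exactly on responses 5 and 10.
print(fixed_ratio(10, 5))
# VR 5: roughly one response in five is reinforced, at irregular positions.
print(sum(variable_ratio(1000, 5)))
```

Under VR the very next response always has the same chance of paying off, which fits the steady responding described on the card.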

Answer 16

a VI schedule reinforces the first response after an interval that averages a particular length of time -- VI schedules tend to produce higher and steadier rates of responding than FI schedules, because the organism cannot predict when the next reinforcer will become available. Ratio schedules, in turn, generally produce higher rates of responding than interval schedules

Answer 17

in which the organism can make any of several possible responses, each leading to a different outcome -- linked to behavioural economics --> how organisms use their time and resources

Answer 18

the principle that an organism, given a choice between multiple responses, will make a particular response at a rate proportional to how often that response is reinforced relative to the other choices
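As a rough sketch of this principle, the predicted share of responding for each option equals that option's share of the total reinforcement. The function name `matching_law` and the example rates are hypothetical.

```python
from typing import List


def matching_law(reinforcement_rates: List[float]) -> List[float]:
    """Predicted proportion of responses allocated to each option.

    Each option's share of responding matches its share of the total
    reinforcement: B_i / sum(B) = R_i / sum(R).
    """
    total = sum(reinforcement_rates)
    if total <= 0:
        raise ValueError("at least one option must be reinforced")
    return [rate / total for rate in reinforcement_rates]


# Option A pays off 40 times per hour, option B 20 times per hour:
# the prediction is about two thirds of responses on A and one third on B.
print(matching_law([40, 20]))
```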

Answer 19

the study of how organisms allocate their time and resources among possible options -- economic theory predicts that each consumer will allocate resources in a way that maximizes her “subjective value,” or relative satisfaction (in microeconomics, the word utility is used instead of subjective value). The value is subjective because it differs from person to person -- Example: a pigeon could get either one reinforcer after 1 minute or two pellets after 2 minutes

Answer 20

the particular allocation of resources that provides maximal subjective value to an individual - Changes depending on context

Answer 21

The theory that the opportunity to perform a highly frequent behavior can reinforce a less frequent behavior; later refined as the response deprivation hypothesis -- Example: if you have been studying for several hours straight, the idea of “taking a break” to clean your room or do the laundry can begin to look downright attractive -- Rats want to run on their wheel

Answer 22

a refinement of the Premack principle stating that the opportunity to perform any behaviour can be reinforcing if access to that behaviour is restricted → want something because you can’t have it

Answer 23

a collection of ganglia (clusters of neurons) -- information from the sensory cortex to the motor cortex can also travel via this indirect route -- one part of the basal ganglia is the dorsal striatum, which can be further subdivided into the caudate nucleus and the putamen

Answer 24

receives highly processed stimulus information from sensory cortical areas and projects to the motor cortex, which produces a behavioral response -- plays a critical role in operant conditioning, particularly if discriminative stimuli are involved. Rats with lesions of the dorsal striatum can learn operant responses (e.g., when placed in a Skinner box, lever-press R to obtain food O). But if discriminative stimuli are added (e.g., lever-press R is reinforced only in the presence of a light SD), then the lesioned rats are markedly impaired -- similar to people who have a disruption to the striatum due to Parkinson’s disease or Huntington’s disease → the dorsal striatum appears necessary for learning SD → R associations based on feedback about reinforcement and punishment

Answer 25

appears to contribute to goal-directed behavior by representing predicted outcomes -- receives inputs conveying the full range of sensory modalities (sight, touch, sound, etc.) and also visceral sensations (including hunger and thirst), allowing this brain area to integrate many types of information -- outputs from the orbitofrontal cortex travel to the striatum, where they can help determine which motor responses are executed

Answer 26

projects first from the sensory cortex (stimulus) → to the orbitofrontal cortex (prediction) → then to the basal ganglia (SD → R association) → then to the striatum (motor learning) → then to the motor cortex (reaction)

Answer 27

later studies identified that rats would work for electrical stimulation in several brain areas, including the ventral tegmental area (VTA)

Answer 28

a small region in the midbrain of rats, humans, and other mammals -- produces dopamine (wanting something) -- stimulating the VTA can have the same effect as a reinforcer

Answer 29

some researchers inferred that the rats “liked” the stimulation, and the VTA and other areas of the brain where electrical stimulation was effective became informally known as “pleasure centers.”

Answer 30

the incentive salience hypothesis challenges this view, namely the idea that wanting and liking are the same thing and that dopamine underlies both

Answer 31

the subjective “goodness” of a reinforcer, or how much we like it

Answer 32

meaning how much we “want” a reinforcer and how hard we are willing to work to obtain it

Answer 33

The hypothesis that dopamine helps provide organisms with the motivation to work for reinforcement -- states that the role of dopamine in operant conditioning is to signal how much the animal “wants” a particular outcome—how motivated it is to work for it

Answer 34

brain chemicals that are naturally occurring neurotransmitter-like substances (peptides) with many of the same effects as opiate drugs

Answer 35

Possible way that the two brain systems (of liking and wanting) interact: differences in the amount of endogenous opioid released, and in the specific opiate receptors they activate, may help determine an organism’s preference for one reinforcer over another

Answer 36

a strong habit that is maintained despite harmful consequences -- addiction may involve not only seeking the “high” but also avoiding the adverse effects of withdrawal from the drug. In a sense, the high provides a positive reinforcement, and the avoidance of withdrawal symptoms provides a negative reinforcement, and both processes reinforce the drug-taking responses

Answer 37

are addictions to behaviour, rather than drugs, that produce reinforcements or highs, as well as cravings and withdrawal symptoms when the behaviour is prevented -- Perhaps the most widely agreed-upon example of a behavioral addiction is compulsive gambling

Answer 38

taking a different drug instead -- like drinking alcohol-free beer

Answer 39

if response R stops producing outcome O, the frequency of R should decline

Answer 40

avoiding the stimuli that trigger the unwanted response

Answer 41

reinforce yourself, for example with a spa day, if you didn’t use the drug, or punish yourself if you did use the drug

Answer 42

whenever the smoker gets the urge to light up, she can impose a fixed delay (e.g., an hour) before giving in to it

Answer 43

combine cognitive therapy (including counseling and support groups) with behavioral therapy based on conditioning principles—and medication for the most extreme cases

Answer 44

delusional -- would rather work for their food than get it freely -- you think that you do something for an effect -- vs habit slip

Answer 45

the firing of dopamine neurons

Answer 46

the firing of dopamine neurons -- the phasic activity of dopaminergic neurons in the midbrain signals a discrepancy between the predicted and currently experienced reward of a particular event
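The discrepancy described here can be written as a simple difference between received and predicted reward. The sketch below is an assumption for illustration only: the function names and the learning-rate update are a standard delta-rule idea, not something stated on the card.

```python
def prediction_error(predicted: float, received: float) -> float:
    """Discrepancy between the experienced and the predicted reward.

    Positive: reward better than expected.
    Zero:     reward exactly as expected.
    Negative: reward worse than expected (or omitted).
    """
    return received - predicted


def update_prediction(predicted: float, received: float,
                      learning_rate: float = 0.1) -> float:
    """Assumed delta-rule update: move the prediction a fraction of the
    way toward the reward that was actually experienced."""
    return predicted + learning_rate * prediction_error(predicted, received)


# An unexpected reward gives a large error; as the same reward keeps
# arriving, the prediction catches up and the error shrinks toward zero.
value = 0.0
for trial in range(5):
    print(trial, round(prediction_error(value, 1.0), 3))
    value = update_prediction(value, 1.0)
```

With these assumed values, a fully predicted reward produces no error, while omitting an expected reward produces a negative one.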

(70 cards)