Lesson 8 Flashcards

(60 cards)

1
Q

In Lesson 2 we noted the four ways that we can arrange an operant and an outcome: the operant can either increase or decrease the probability of the outcome, and that outcome can be either good or bad. We called these positive reinforcement, punishment, omission training, and avoidance learning. Which of those is the superstition about breaking mirrors based on? Refresh your memory of these terms from Lesson 2, and then press the button to see my answer.

A

The mirror-breaking superstition can be seen in two ways. We can consider it a belief that breaking a mirror (the operant) causes (increases the probability of) bad luck (an aversive reinforcer); this would be an example of punishment (the operant causes something bad to happen). Alternatively, we could see it as a belief that not breaking mirrors prevents bad things from happening, which would be an example of avoidance learning. Note that, in this case, the operant is the absence of an action (not breaking mirrors), which makes the second explanation a bit of a stretch.

2
Q

A few weeks ago, Jimmy walked under a ladder and, shortly after that, someone stole his car. Jimmy now believes that walking under ladders is dangerous (so is not locking your car, but he isn’t focused on that). Which of the following events is most likely to strengthen Jimmy’s superstitious belief?

Jimmy walks under another ladder and the following day he wins the lottery

Jimmy avoids walking under a ladder and the following day his pet hamster dies

Jimmy walks under another ladder and the following day his pet hamster dies

Jimmy avoids walking under a ladder and the following day nothing happens

A

Jimmy walks under another ladder and the following day his pet hamster dies

3
Q

Jimmy is a polite Canadian and whenever he sneezes he says “excuse me”. One day, he sneezes while alone in his apartment. He still says “excuse me”, though. This is evidence that:

Jimmy has learned the rule that you must always apologize after sneezing, whether anyone is present to hear it or not

Jimmy has been shaped to apologize when sneezing, without understanding the communicative value of the response

Jimmy is hallucinating that there are other people in the room with him

Saying “excuse me” is a fixed action pattern that always happens after sneezing, innately

A

Jimmy has been shaped to apologize when sneezing, without understanding the communicative value of the response

4
Q

Operant Conditioning (Instrumental Conditioning)

A

A type of learning where animals (and humans) learn associations between their own actions (responses) and the consequences (outcomes) of those actions.

5
Q

Law of Effect (Thorndike)

A

Any behavior that is followed by reinforcement (satisfaction) will be performed more often in the future. (Some also include that behaviors followed by lack of reinforcement will be performed less often).

6
Q

Shaping

A

A process used in operant conditioning to train complex behaviors by rewarding successive approximations of the desired response.

7
Q

Skinner’s Superstitious Pigeons (1948) - Experiment

A

Setup: Pigeons in operant boxes received food at random intervals, regardless of their behavior.
Observation: Pigeons developed specific, idiosyncratic behaviors that they repeated, as if these behaviors caused the food delivery.

8
Q

Skinner’s Superstitious Pigeons - Interpretation

A

Skinner’s Claim: The random reinforcement accidentally strengthened whatever behavior the pigeon was engaged in at the moment of food delivery, leading to a “superstitious” belief in a non-existent contingency.

9
Q

Replication Issues with Superstitious Pigeons

A

Subsequent attempts to replicate Skinner’s findings have often failed, suggesting the effect might be less robust than initially claimed or dependent on specific conditions.

10
Q

Skinner’s Attempt to Explain Language via Operant Conditioning

A

Complex behaviors like language could arise from the reinforcement of successive approximations of verbal responses.

11
Q

“Jack and Jill” Pigeon Conversation Experiment (Epstein, Lanza & Skinner, 1980)

A

Setup: Two pigeons in adjacent boxes were trained to perform a sequence of pecks on text-labeled buttons to “communicate” the color of a light and receive rewards.
Skinner’s Point: To demonstrate that complex-looking behavior could be built through operant conditioning without the need for understanding.

12
Q

Key Difference Between Operant and Pavlovian Conditioning

A

Locus of Control: In operant conditioning, the animal’s own behavior controls the outcome. In Pavlovian conditioning, the experimenter controls the presentation of both the CS and the US, independent of the animal’s actions.

13
Q

Free Operant Paradigm

A

Definition: An operant conditioning procedure where the animal can perform the response repeatedly at its own rate without discrete trials imposed by the experimenter.

14
Q

Importance of Trying Behaviors for Learning (Wittgenstein Quote)

A

Concept: Exploring different actions is necessary to discover their consequences and learn new contingencies.

15
Q

Choice in Operant Conditioning

A

Reason: Because the animal controls its behavior, even with a single operant, it always has the choice to perform it or do something else, making operant procedures useful for studying decision-making.

16
Q

Role of Stimuli in Operant Conditioning

A

Function: External stimuli can act as occasion setters, indicating what types of response-outcome contingencies are currently in effect.

17
Q

Term: Contingency (in Operant Conditioning)

A

Definition: The probability of a particular outcome occurring given a specific response by the subject.

18
Q

Stimulus Control

A

When an operant behavior is more likely to occur in the presence of a specific stimulus because that stimulus signals the availability of reinforcement for that behavior.

19
Q

Terminology Difference (Operant vs. Pavlovian)

A

US (Pavlovian) ↔ Reinforcer/Outcome (Operant)
Appetitive/Aversive (Pavlovian) ↔ Positive/Negative (Operant)

20
Q

Positive Reinforcer

A

An appetitive event that, when presented after a behavior, increases the likelihood of that behavior occurring again.

21
Q

Negative Reinforcer

A

An aversive event that, when removed or avoided after a behavior, increases the likelihood of that behavior occurring again.

22
Q

Reflection Point: Similarities and Differences Between Pavlovian and Operant Conditioning

A

Similarities:

Both involve learning associations.
Both are influenced by factors like contingency, contiguity, and salience.
Both can involve appetitive and aversive outcomes.
Extinction occurs in both.
Stimuli can play a role (CSs in Pavlovian, occasion setters in Operant).
Differences:

Locus of Control: Outcome is independent of behavior in Pavlovian; dependent on behavior in Operant.
Type of Association Learned: CS-US in Pavlovian; Response-Outcome in Operant.
Behavior Elicited: Reflexive/involuntary responses (CR) in Pavlovian; voluntary/emitted behaviors (operant response) in Operant.
Procedure: Experimenter controls trial progression in Pavlovian; subject often has more control in Operant (especially in free operant).
Terminology: Different terms for the events and their valence.

23
Q

Trial-and-Error Learning (Thorndike)

A

Learning occurs through random attempts at behaviors; successful behaviors (those followed by reinforcement) are strengthened, while unsuccessful ones are gradually eliminated. No conscious understanding of the solution is required.

24
Q

Insight Learning (Köhler)

A

A sudden understanding or “aha!” moment where the solution to a problem appears all at once, not gradually through trial and error. Often characterized by a sudden drop in errors.

25
Q

Latent Learning (Tolman & Honzik, 1930)

A

Learning that occurs without explicit reinforcement and is not immediately expressed in behavior. It becomes evident when reinforcement is introduced later. Demonstrated in Tolman's maze experiment.

26
Q

Tolman's View on Reinforcement

A

Role: Reinforcement provides motivation for performance but is not strictly necessary for learning to occur. Animals can acquire knowledge about their environment even without reward.

27
Q

Reinforcer (Operant Conditioning Definition)

A

Anything that causes an animal to perform the corresponding operant behavior more or less often; anything an animal will work for (or work to avoid).

28
Q

Conditioned Reinforcer (Secondary Reinforcer)

A

A stimulus that has acquired reinforcing properties through its association with a primary reinforcer (e.g., points in a game associated with eventual reward).

29
Q

Behavior Chain

A

A sequence of learned behaviors where each behavior acts as a conditioned reinforcer for the previous behavior and a discriminative stimulus for the next, leading to a primary reinforcer at the end.

30
Q

Primary Reinforcer

A

Traditional View: A stimulus that has innate reinforcing properties (appetitive or aversive) without prior learning (similar to a US).
Revised View (Premack): Can be a high-probability behavior that an animal is restricted from performing.

31
Q

Premack Principle (1962)

A

A higher-probability behavior can reinforce a lower-probability behavior. Reinforcers are not just stimuli but can be activities that an animal is motivated to engage in. Reinforcement depends on relative preferences.

32
Q

Behavioral Regulation Theory

A

Core Idea: Animals have a preferred level (bliss-point) for all behaviors. When a behavior is restricted below its preferred level, access to it becomes reinforcing.

33
Q

Bliss-Point

A

Definition: An individual's ideal distribution of behaviors in the absence of constraints.

34
Q

How Constraints Affect Behavior (Behavioral Regulation)

A

Outcome: When prevented from reaching their bliss-point, individuals will adjust their behavior to get as close as possible to it within the imposed restrictions, even if it means increasing a less preferred behavior to gain access to a more preferred one.

35
Q

Test Your Understanding: Bliss-Point Theory and Premack Principle

A

Bliss-Point in terms of Premack: The bliss-point represents the baseline probabilities of various behaviors. The Premack principle suggests that a behavior with a higher baseline probability (closer to its bliss-point if unrestricted) can reinforce a behavior with a lower baseline probability (further from its bliss-point due to restriction).
Common Idea: Both are based on the idea that a reinforcer is fundamentally something the animal "wants" to do more of than it is currently able to, whether due to external restrictions or the need to perform another behavior to gain access. Reinforcement involves moving behavior closer to the individual's preferred distribution.

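To make the bliss-point idea concrete, here is a minimal numerical sketch in Python. All numbers, and the use of Euclidean distance as the measure of "closeness" to the bliss point, are illustrative assumptions rather than anything specified in the lesson.

```python
# Behavioral regulation sketch: the animal's bliss point is 60 min of
# activity A (say, wheel running) and 10 min of activity B (lever
# pressing) per session, but a schedule forces equal time on A and B.
# We look for the allocation on the constraint line that lies closest
# to the bliss point (assuming simple Euclidean distance).
import numpy as np

bliss = np.array([60.0, 10.0])            # preferred minutes of (A, B)

t = np.linspace(0, 120, 1201)             # candidate minutes of A (= minutes of B)
points = np.stack([t, t], axis=1)         # allocations allowed by the 1:1 schedule
dists = np.linalg.norm(points - bliss, axis=1)
best = points[np.argmin(dists)]

print(f"Closest allocation to bliss point: A = {best[0]:.1f} min, B = {best[1]:.1f} min")
# -> A = 35.0, B = 35.0: B rises above its preferred 10 min because it now
#    buys access to A, exactly the adjustment described in card 34.
```

The same picture captures the Premack principle: restricting the high-probability behavior (A) below its bliss-point level is what makes access to it reinforcing.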
36
Q

Jimmy has learned that working leads to getting paid, and that he can then use the money he makes to buy rewarding things, like beer and pizza. In this scenario, what would we call the beer and pizza, the money, and the work?

A

The beer and pizza are primary reinforcers; the money is a conditioned reinforcer; and the work is the operant. Note that money is always a conditioned reinforcer – it has no value in itself, it’s just a way of getting other things. This shows us that conditioned reinforcers can be every bit as powerful in driving behavior as primary reinforcers are.

37
Q

Jimmy likes ice cream and, if given the choice, would just eat ice cream all the time. How could his parents, who are concerned that he should have a healthy diet, use the Premack principle to help them control Jimmy’s sweet tooth?

A

Jimmy’s parents could deprive Jimmy of ice cream and make him work for it by, for example, eating vegetables. Eating vegetables is not as desirable for Jimmy, but because it causes ice cream to happen, he will do it [there are, obviously, other correct answers to this question].

38
Q

Reinforcement Schedules

A

Rules that determine how and when a behavior will be reinforced. Can be based on the number of responses (ratio) or the passage of time (interval), and can be fixed or variable.

39
Q

Ratio Schedule

A

Reinforcement is delivered after a specific number of responses.

40
Q

Interval Schedule

A

Reinforcement is delivered for the first response after a specific amount of time has elapsed since the last reinforcement (or the start of the interval).

41
Q

Fixed Schedule (FR & FI)

A

The number of responses (FR) or the time interval (FI) required for reinforcement remains constant.

42
Q

Variable Schedule (VR & VI)

A

The number of responses (VR) or the time interval (VI) required for reinforcement varies around a fixed average.

43
Q

Fixed Ratio (FR) Schedule

A

Reinforcement occurs after a fixed number of responses (e.g., FR10 = every 10 responses). Often produces a high rate of responding with a post-reinforcement pause.

44
Q

Fixed Interval (FI) Schedule

A

Reinforcement occurs for the first response after a fixed time interval has elapsed (e.g., FI 30s = first response after 30 seconds). Produces a scalloped pattern of responding (increasing rate as the interval ends).

45
Q

Variable Ratio (VR) Schedule

A

Reinforcement occurs after a variable number of responses, averaged around a specific number (e.g., VR10 = on average every 10 responses). Produces a high and steady rate of responding.

46
Q

Variable Interval (VI) Schedule

A

Reinforcement occurs for the first response after a variable time interval, averaged around a specific duration (e.g., VI 30s = on average after 30 seconds). Produces a moderate and steady rate of responding.

47
Q

Continuous Reinforcement (CRF or FR1)

A

Reinforcement is delivered after every response.

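As a worked illustration of how ratio schedules decide when reinforcement is delivered, here is a small Python sketch. The function names and parameters are hypothetical, for illustration only; interval schedules would additionally need a clock, so only the two ratio types are shown.

```python
import random

def fixed_ratio(n):
    """FRn: reinforce every n-th response (n = 1 gives continuous reinforcement)."""
    count = 0
    def respond():
        nonlocal count
        count += 1
        if count >= n:
            count = 0
            return True            # reinforcer delivered
        return False
    return respond

def variable_ratio(mean_n):
    """VRn: reinforce after a number of responses that varies around mean_n."""
    count, target = 0, random.randint(1, 2 * mean_n - 1)
    def respond():
        nonlocal count, target
        count += 1
        if count >= target:
            count, target = 0, random.randint(1, 2 * mean_n - 1)
            return True
        return False
    return respond

fr5 = fixed_ratio(5)
print([fr5() for _ in range(15)])   # True at responses 5, 10, and 15
# vr10 = variable_ratio(10) would reinforce, on average, every 10th response.
```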
48
Q

Concurrent Schedules

A

Two or more reinforcement schedules operate simultaneously for different responses, allowing the subject to choose how to allocate its behavior.

49
Q

Matching Law

A

In concurrent interval schedules, the proportion of responses directed toward one alternative will match the proportion of reinforcers obtained from that alternative.
Equation: $B_1 / (B_1 + B_2) = R_1 / (R_1 + R_2)$, where $B$ = rate of behavior and $R$ = rate of reinforcement for alternatives 1 and 2.

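The matching law turns into a one-line calculation. A minimal sketch (the function name is mine; the numbers come from the Jimmy examples in cards 56 and 60 below):

```python
def matching_share(r1, r2):
    """Predicted proportion of behavior allocated to alternative 1."""
    return r1 / (r1 + r2)

# Card 56: Johnny posts ~1 per 10 min (rate 1/10), Jennifer ~1 per 20 min (1/20).
print(matching_share(1/10, 1/20))   # 0.666... -> about two-thirds on Johnny's feed

# Card 60: Avenue A ~1 crossing per 2 min, Boulevard B ~1 per 6 min.
print(matching_share(1/2, 1/6))     # 0.75 -> 75% of the time near Avenue A
```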
50
Q

Maximizing (in Ratio Schedules)

A

In concurrent ratio schedules, the optimal strategy to maximize reinforcement is to respond exclusively on the alternative with the lower ratio requirement. However, animals don't always strictly adhere to this.

51
Q

Delayed Reinforcement

A

A situation where the reinforcer is not delivered immediately after the response. Often leads to a preference for smaller, immediate rewards over larger, delayed ones.

52
Q

Delay Discounting

A

The phenomenon where the perceived value or effectiveness of a reinforcer decreases as the delay to its delivery increases.

53
Q

Ainslie-Rachlin Rule

A

The subjective value of a reinforcer increases as the time to its delivery approaches. Choices between smaller/sooner and larger/later rewards can shift as the smaller reward becomes more imminent.

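To see the Ainslie-Rachlin reversal numerically, here is a short sketch that assumes the common hyperbolic discounting form V = A / (1 + kD) (e.g., Mazur's model). The functional form, the reward amounts, and the value of k are assumptions for illustration; the lesson itself only states the qualitative rule.

```python
def value(amount, delay, k=0.5):
    """Hyperbolically discounted value of a reward `delay` time units away."""
    return amount / (1 + k * delay)

small, large = 10, 20               # smaller-sooner vs larger-later reward
for t in [10, 5, 1, 0]:             # time remaining until the small reward arrives
    v_small = value(small, t)
    v_large = value(large, t + 5)   # the large reward arrives 5 units later
    choice = "larger-later" if v_large > v_small else "smaller-sooner"
    print(f"t = {t:2d}: V(small) = {v_small:5.2f}, V(large) = {v_large:5.2f} -> {choice}")
```

Far from both rewards (t = 10), the larger-later reward is preferred; as the smaller reward becomes imminent (t = 1 or 0), preference flips, which is exactly the shift the rule describes.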
54
Q

Reflection Point: Overcoming Impulsivity Related to Delayed Reinforcement

A

Potential Methods (see textbook pp. 269-272 for more):

Precommitment: Making a decision for the larger, later reward in advance, when both options are still distant.
Making the delayed reward more immediate/salient: Visualizing the future reward, breaking it into smaller, closer milestones.
Reducing the value of the immediate reward: Avoiding cues that trigger impulsive choices.
Increasing the value of the delayed reward: Associating it with other positive outcomes.
Self-monitoring: Tracking progress towards the long-term goal.
Rule-governed behavior: Establishing and following rules that prioritize long-term rewards.

Personal Application: (This will vary based on individual circumstances; think about specific examples from your own life.) For example, to prioritize studying (larger, later reward of good grades) over immediate entertainment (smaller, sooner reward), one might use precommitment by scheduling study time, make the future reward more salient by visualizing career goals, and reduce the value of immediate distractions by turning off notifications.

55
Q

So, we can construct four kinds of reinforcement schedules:

A

Fixed ratio (FR): the subject is reinforced after a fixed number of responses.
Fixed interval (FI): the subject is reinforced for the first response after a fixed interval.
Variable ratio (VR): the subject is reinforced after a number of responses that varies around a fixed mean.
Variable interval (VI): the subject is reinforced for the first response after an interval that varies around a fixed mean.

56
Q

Jimmy is browsing the Instagram feeds of his enemies, as you do. He has two enemies, Johnny and Jennifer. Johnny posts on Instagram about once every 10 minutes; Jennifer about once every 20 minutes. Question 1: what would we call these two schedules? Question 2: In order to make sure that he sees each post as soon as possible, what proportion of his time should Jimmy spend on each enemy’s feed?

A

Jimmy’s enemies post at random times, with a fixed average, so we would call these variable interval schedules. Johnny is on a VI10 and Jennifer on a VI20. Since Jimmy doesn’t know when the next image of an annoyingly perfect plate of ceviche (with a beach backdrop, of course) will drop, he has to divide his time between the two feeds. Since Johnny posts twice as often, Jimmy should spend twice as much time on Johnny’s feed (about 67% of his time) as on Jennifer’s (about 33%).

57
Q

One day, Jimmy helps an old man carry his groceries and, later that day, he learns that he got a job he was applying for. According to the Law of Effect, Jimmy will:

Help more old men with their bags

Never help anyone with their bags again

Try doing other random behaviors in the hope of getting more jobs

Find the same old man and steal his groceries

A

Help more old men with their bags

58
Q

Now, whenever Jimmy sees an old man, he has an impulse to help. We could say that Jimmy’s behavior is:

Pavlovian

Under stimulus control

Driven by an occasion setter

Random

A

Under stimulus control

59
Q

God has been watching Jimmy help people and has decided to reward him. For every 5 people that Jimmy helps with their groceries, God makes something good happen in Jimmy’s life. What kind of schedule is Jimmy on?

VI5

FR6

FR5

VI6

A

FR5

60
Q

Jimmy is hanging out by the supermarket, looking for unsuspecting old people to help. At the corner are two crosswalks across two different streets: Avenue A and Boulevard B. An old person crosses Avenue A, on average, once every 2 minutes; Boulevard B is less busy and only gets crossed by someone that might need help once every 6 minutes. According to the matching law, what percentage of his time should Jimmy spend near the Avenue A crosswalk, to maximize his karma?

50%

33%

75%

100%

A

75%