Lecture 1 Flashcards

(55 cards)

1
Q

Behavioral data science, what is that

A

a multidisciplinary scientific field that aims to facilitate understanding, prediction, and change of human behavior through the analysis of behaviourally defined variables as they arise in large datasets, typically gathered using modern digital technology and analyzed with techniques for detecting patterns from high-dimensional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

it is highly multidimensional, but what fields are close important

A
  • psychology
  • mathematical modeling
  • statistics etc

With psychology: mathematical psychology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

understanding

A

construction of psychological theories to explain behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

prediction

A

application of statistical models to predict behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Change

A

development of interventions to change behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

the topic is quite …. but the models and theories quite…

A

difficult
simple

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

welk woordje in een zin kan verwijzen naar dat het om data gaat?

A

“which”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

drift rate quantifies

A

Task difficulty or subject ability

how steep that line is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Boundary separation (a) quantifies

A

response caution and is responsible for the speed-accuracy tradeoff.

If you are told to make fewer errors, you set your boundaries wider, you will make fewer errors but it will take you longer. Because this process is noisy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Starting point z reflects?

A

a priori biases due to say, proportion or payoff manipulations that favor one response over the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

behavioral data science is about understanding, prediction and change of human behavior - explain these

A
  • understanding: construction of psychological theories to explain behavior
  • predicition: application of statistical models to predict behaviour
  • Change: development of interventions to change behavior
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When they say that the vocabulary that is being used in data science has a sort of ‘lingua franca’, what do they mean with that?

A

certain different fields may have difficulty talking to eachother about their different sciences and the jargon belonging to it, but data science and its vocabulary is shared across sciences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data:

A

representations of observations.;
pete correctly solved IQ test item 36 (observation)
The row that represents Pete has a 1 in the column that represents the IQ item (representation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do rows represent

A

cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What do columns represent

A

features/properties/attributions.
The values in the colums represent a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Vanuit data analyseer je ‘phenomena’ wat houdt dat in?

A

Robust features of the world
- the positive manifold of intelligence, the robust correlation between insomnia and depression, the effect of time pressure on accuracy.

note: phenomena are not themselves data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Is phenomena, data?

A

No, phenomena are evidenced by patterns in the data. Data are representations of things that happened at a very specific time and place and situation: phenomena are more general!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are theories

A

sets of principles or ideas of how the world works.
- there are many kinds of theories, but we are often interested in explanatory theories: a set of principles that aims to explain phenomena. It describes a world in which the phenomena would follow as a matter of course.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do data and theories relate to phenomena

A

data establish phenomena, and theories explain phenomena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Wat is de taak in de lexical decision task?

A

Deelnemers beoordelen of een letterreeks een bestaand woord is of niet-woord door een toets in te drukken.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Welke twee metingen zijn belangrijk in de lexical decision task?

A

Reactietijd (RT) en nauwkeurigheid (aantal correcte antwoorden).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Waarom wordt de lexical decision task gebruikt?

A

Om te meten hoe gemakkelijk lexicale representaties uit het geheugen worden geactiveerd.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Wat is een bekend fenomeen dat uit de lexical decision task naar voren komt?

A

Woorden met hoge frequentie worden sneller herkend dan woorden met lage frequentie.

24
Q

Hoe worden deelnemers geïnstrueerd bij de lexical decision task?

A

Ze moeten zo snel en accuraat mogelijk reageren.

25
Wat is het effect van leeftijd op de prestaties in de lexical decision task?
Oudere volwassenen reageren langzamer dan jongere volwassenen.
26
Wat stelt de “general slowing” hypothese?
Dat alle cognitieve processen langzamer verlopen bij oudere volwassenen.
27
In het fictieve voorbeeld van vier deelnemers, hoe presteren ouderen in vergelijking met jongeren?
Ouderen zijn 50-100 ms langzamer, maar iets accurater dan jongeren.
28
Wat is een probleem met standaardanalyses van de lexical decision task?
Ze negeren vaak nauwkeurigheid en focussen alleen op reactietijd.
29
Waarom is het problematisch om alleen RT te analyseren? Noem 4 redenen.
Deelnemers moeten én snel én accuraat reageren. Er is een trade-off tussen snelheid en nauwkeurigheid. Geen procesmodel: het blijft speculatief. Geen decompositie van onderliggende processen.
30
Wat is een oplossing voor het analyseren van data uit de lexical decision task?
Het gebruik van een procesmodel, zoals het diffusion model van Ratcliff.
31
Wat doet het diffusion model van Ratcliff?
- describes how noisy evidence is accumulated over time - describes data from simple decision-making experiments - allows manifest behaviour to be decomposed into latent psychological processes.
32
Wat maakt het diffusion model nuttig in psychologie?
Het kan observeerbaar gedrag opdelen in latente psychologische processen.
33
Welke historische figuren hielden zich bezig met ideeën verwant aan het diffusion model?
Norbert Wiener (Wiener-proces) en Robert Brown (Brownse beweging).
34
What is the core idea of the Ratcliff Diffusion Model?
It models decision-making as a process where noisy evidence is accumulated over time until a threshold is reached
35
What type of process is used in the Ratcliff Diffusion Model to describe decision-making?
Sequential sampling of noisy information.
36
What does "diffusion of evidence" mean in the Ratcliff model?
It refers to how evidence is gradually and noisily accumulated over time during decision-making.
37
What is the "drift rate" in the Ratcliff Diffusion Model?
The average rate at which evidence accumulates toward one decision; reflects signal strength or clarity.
38
What does a higher drift rate indicate in the diffusion model?
Faster and more accurate decision-making, as the evidence is clearer or stronger.
39
In a lexical decision task, what does the x-axis represent in a diffusion model plot?
The amount of accumulated evidence over time.
40
What do the decision boundaries in the diffusion model represent?
Thresholds for making a decision (e.g., "word" vs. "non-word").
41
What does it mean when the evidence hits the decision threshold?
A decision is made and a response is initiated.
42
Why can errors occur in the diffusion model, even if evidence is accumulated?
Because the evidence is noisy and the distributions for word vs. non-word may overlap.
43
What is "non-decision time" in the Ratcliff model?
The time needed for processes like encoding the stimulus and executing the motor response (e.g., pressing a key).
44
What does the overlap between word and non-word evidence curves indicate?
That decisions are probabilistic and errors can occur due to noise in the evidence.
45
What drives the evidence accumulation in the lexical decision task, according to the model?
Repeated draws from the underlying lexical dimension (wordness vs. non-wordness).
46
drift rate
quantifies task difficulty or subject ability
47
boundary seperation (a)
quantifies response caution and is responsible for the speed-accuracy trade of (distance between word and non word threshold)
48
Starting point (z)
reflects a priori biases. For example when proportion of word and non words are being manipulated, say 80% of the letter strings you already had where words, you are more likely to expect that the next one is also 'word'. Which is why the starting point in the diagram above is also a bit leaning towards the word threshold
49
Non-decision time (Ter)
measures the time peope need to encode the simulus and execute the response
50
When everyone has the same drift rate, what does that mean
that everyone has the same ability to process/task difficulty
51
When some people have wide boundary separation, what does that mean
they are more cautious and the younger participants have narrower boundary separation, indicating their willingness to take more risks
52
Example of a theory construction methodology
1. identify a set of phenomena that you want to explain 2. come up with a proto-theory - formalise both the proto-theory and the phenomena - evaluate how well the resulting formal theory actually explains the phenomena - overall evaluation of the theory
53
The main take-aways of the acrophobia affects vr ding
- the model acts as a thinking tool: it allows us to see where we're wrong - the model is precise: detailed model predicitons can be connected to detailed vr data - the model allows us to simulate interventions in the system and the model therapy - non of this is achievable with purely verbal theory (this does not mean verbal theories are unimportant!!)
54
What is the speed accuracy trade off and how does the diffusion model account for it?
The SAT is the general ability of people to increase accuracy at the cost of taking more time. The diffusion model accounts for it by being able to increase accuracy by widening the distance between the two response thresholds (the boundary separation: dicisions will take longer to terminate at a boundary, but those that do are less likely to do so in error because you are increasing response caution
55