Evaluating Interactive Systems Flashcards

(32 cards)

1
Q

What is Formative Evaluation?

A

Used in the early stages of a project to compare, assess and refine design ideas. Involves open research questions so that the researcher can learn more information to inform the design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Summative Evaluation?

A

Used later in the stages of a project. Involves closed research questions to test and evaluate systems according to predefined criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Analytical Evaluation?

A

Based on applying a theory to analyse and discuss the design. Analysing your design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Empirical Evaluation?

A

Making observations and measurement of users. Collecting data for analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Quantitative Data?

A

Numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Qualitative Data?

A

Words, pictures, audio or video

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Are analytical methods used for formative or summative evaluation?

A

Formative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

List the qualitative analytical evaluation methods

A
  1. Cognitive walkthrough
  2. Cognitive dimensions of notations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is cognitive walkthrough useful for evaluating?

A

Closed research questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is cognitive dimensions of notations useful for evaluating?

A

Open research questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Keystroke Level Model useful for evaluating?

A

To create numerical comparisons of closed research questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

List the quantitative analytical evaluation methods

A
  1. Keystroke Level Model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

List the quantitative empirical evaluation methods

A
  1. A/B experiments
  2. Controlled laboratory trials
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

List the qualitative empirical evaluation methods

A
  1. Think-aloud / ethnography
  2. Interviews
  3. Field observation
  4. Surveys
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Are qualitative empirical methods used for formative or summative evaluation?

A

Formative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Are quantitative empirical methods used for formative or summative evaluation?

17
Q

What 3 things do you need to run a Randomised Control Trial?

A
  1. A performance measure
  2. A representative sample of your target population
  3. An experimental task that can be used to collect performance data
18
Q

What is Internal Validity? What factors does it include?

A

Asks “was the study done right?”
Includes factors: reproducibility, scientific integrity, refutability

19
Q

What is External Validity? What factors does it include?

A

Asks “does the study tell us useful things?” and focuses on if results can be generalisable to real world situations
Includes factors: representativeness of sample population, experimental task, application context

20
Q

How are the results of a randomised control trial measured?

A

In terms of effect size, possibly including correlation with factors that might affect performance

21
Q

What is reported as the results of a randomised control trial?

A

Significance measures are reported to check whether the observed effects might have resulted from random variation or other factors rather than the treatment

22
Q

Give 2 disadvantages of RCTs

A
  1. Overcoming natural variation needs large samples
  2. They do not naturally provide understanding of why a change occurred so it is hard to know if the effect will generalise. If there are many relevant variables that are orthogonal, many separate experiments might be required to distinguish between their effects and interactions
23
Q

What do companies tend to use instead of RCTs?

A

Proxy measures such as the number of days that customers continue actively using the product

24
Q

What must all controlled experiments be assessed according to?

A

Their internal and external validity

25
Why is qualitative data often recorded and transcribed?
So it can be analysed using a reproducible scientific method
26
Which qualitative data analysis method is used to answer closed questions? Give an example of a closed question
Categorical coding Eg. comparing different groups of people or users of different products
27
Which qualitative data analysis method is used to answer open questions? What is an open question?
Grounded theory Used when there is no prior expectation of the insights the researcher is looking for
28
What are the steps of categorical coding?
1. Create a coding frame of expected categories of interest 2. Text data is segmented (eg. on phrase boundaries) 3. Each segment is assigned to one category so that frequency and correspondence can be compared
29
What is inter-rater reliability?
Two or more people make the coding decisions independently to avoid systematic bias or misinterpretation. They then compare how many decisions agree relative to chance using a statistical measure such as Cohen's Kappa (2 people) or Fleiss' Kappa (more). It may involve refining the coding frame to resolve decision criteria if there are still significant disagreements. This is used to 'prototype' the coding frame before proceeding to the main corpus
30
Which qualitative data analysis method should incorporate inter-rater reliability?
Categorical coding
31
What are the steps of grounded theory?
1. Open coding - read the data closely, looking for interesting categories 2. Collect fragments, write memos to capture insights as they occur 3. Emerging themes are organised using axial coding across different sources of evidence 4. Memos, themes and findings are constantly compared to the original data so they can be objectively justified Ends when the theoretical description has reached saturation in relation to the original data
32