Evaluating Interactive Systems Flashcards by Tabby Black

What is Formative Evaluation?

Used in the early stages of a project to compare, assess and refine design ideas. Involves open research questions so that the researcher can learn more information to inform the design

How well did you know this?

Not at all

Perfectly

What is Summative Evaluation?

Used later in the stages of a project. Involves closed research questions to test and evaluate systems according to predefined criteria

How well did you know this?

Not at all

Perfectly

What is Analytical Evaluation?

Based on applying a theory to analyse and discuss the design. Analysing your design

How well did you know this?

Not at all

Perfectly

What is Empirical Evaluation?

Making observations and measurement of users. Collecting data for analysis

How well did you know this?

Not at all

Perfectly

What is Quantitative Data?

Numbers

How well did you know this?

Not at all

Perfectly

What is Qualitative Data?

Words, pictures, audio or video

How well did you know this?

Not at all

Perfectly

Are analytical methods used for formative or summative evaluation?

Formative

How well did you know this?

Not at all

Perfectly

List the qualitative analytical evaluation methods

Cognitive walkthrough
Cognitive dimensions of notations

How well did you know this?

Not at all

Perfectly

What is cognitive walkthrough useful for evaluating?

Closed research questions

How well did you know this?

Not at all

Perfectly

What is cognitive dimensions of notations useful for evaluating?

Open research questions

How well did you know this?

Not at all

Perfectly

What is Keystroke Level Model useful for evaluating?

To create numerical comparisons of closed research questions

How well did you know this?

Not at all

Perfectly

List the quantitative analytical evaluation methods

Keystroke Level Model

How well did you know this?

Not at all

Perfectly

List the quantitative empirical evaluation methods

A/B experiments
Controlled laboratory trials

How well did you know this?

Not at all

Perfectly

List the qualitative empirical evaluation methods

Think-aloud / ethnography
Interviews
Field observation
Surveys

How well did you know this?

Not at all

Perfectly

Are qualitative empirical methods used for formative or summative evaluation?

Formative

How well did you know this?

Not at all

Perfectly

Are quantitative empirical methods used for formative or summative evaluation?

Study These Flashcards

Summative

What 3 things do you need to run a Randomised Control Trial?

Study These Flashcards

A performance measure
A representative sample of your target population
An experimental task that can be used to collect performance data

What is Internal Validity? What factors does it include?

Study These Flashcards

Asks “was the study done right?”
Includes factors: reproducibility, scientific integrity, refutability

What is External Validity? What factors does it include?

Study These Flashcards

Asks “does the study tell us useful things?” and focuses on if results can be generalisable to real world situations
Includes factors: representativeness of sample population, experimental task, application context

How are the results of a randomised control trial measured?

Study These Flashcards

In terms of effect size, possibly including correlation with factors that might affect performance

What is reported as the results of a randomised control trial?

Study These Flashcards

Significance measures are reported to check whether the observed effects might have resulted from random variation or other factors rather than the treatment

Give 2 disadvantages of RCTs

Study These Flashcards

Overcoming natural variation needs large samples
They do not naturally provide understanding of why a change occurred so it is hard to know if the effect will generalise. If there are many relevant variables that are orthogonal, many separate experiments might be required to distinguish between their effects and interactions

What do companies tend to use instead of RCTs?

Study These Flashcards

Proxy measures such as the number of days that customers continue actively using the product

What must all controlled experiments be assessed according to?

Study These Flashcards

Their internal and external validity

Why is qualitative data often recorded and transcribed?

So it can be analysed using a reproducible scientific method

Which qualitative data analysis method is used to answer closed questions? Give an example of a closed question

Categorical coding Eg. comparing different groups of people or users of different products

Which qualitative data analysis method is used to answer open questions? What is an open question?

Grounded theory Used when there is no prior expectation of the insights the researcher is looking for

What are the steps of categorical coding?

1. Create a coding frame of expected categories of interest 2. Text data is segmented (eg. on phrase boundaries) 3. Each segment is assigned to one category so that frequency and correspondence can be compared

What is inter-rater reliability?

Two or more people make the coding decisions independently to avoid systematic bias or misinterpretation. They then compare how many decisions agree relative to chance using a statistical measure such as Cohen's Kappa (2 people) or Fleiss' Kappa (more). It may involve refining the coding frame to resolve decision criteria if there are still significant disagreements. This is used to 'prototype' the coding frame before proceeding to the main corpus

Which qualitative data analysis method should incorporate inter-rater reliability?

Categorical coding

What are the steps of grounded theory?

1. Open coding - read the data closely, looking for interesting categories 2. Collect fragments, write memos to capture insights as they occur 3. Emerging themes are organised using axial coding across different sources of evidence 4. Memos, themes and findings are constantly compared to the original data so they can be objectively justified Ends when the theoretical description has reached saturation in relation to the original data

Evaluating Interactive Systems Flashcards

(32 cards)