Evaluating Interactive Systems Flashcards
(32 cards)
What is Formative Evaluation?
Used in the early stages of a project to compare, assess and refine design ideas. Involves open research questions so that the researcher can learn more information to inform the design
What is Summative Evaluation?
Used later in the stages of a project. Involves closed research questions to test and evaluate systems according to predefined criteria
What is Analytical Evaluation?
Based on applying a theory to analyse and discuss the design. Analysing your design
What is Empirical Evaluation?
Making observations and measurement of users. Collecting data for analysis
What is Quantitative Data?
Numbers
What is Qualitative Data?
Words, pictures, audio or video
Are analytical methods used for formative or summative evaluation?
Formative
List the qualitative analytical evaluation methods
- Cognitive walkthrough
- Cognitive dimensions of notations
What is cognitive walkthrough useful for evaluating?
Closed research questions
What is cognitive dimensions of notations useful for evaluating?
Open research questions
What is Keystroke Level Model useful for evaluating?
To create numerical comparisons of closed research questions
List the quantitative analytical evaluation methods
- Keystroke Level Model
List the quantitative empirical evaluation methods
- A/B experiments
- Controlled laboratory trials
List the qualitative empirical evaluation methods
- Think-aloud / ethnography
- Interviews
- Field observation
- Surveys
Are qualitative empirical methods used for formative or summative evaluation?
Formative
Are quantitative empirical methods used for formative or summative evaluation?
Summative
What 3 things do you need to run a Randomised Control Trial?
- A performance measure
- A representative sample of your target population
- An experimental task that can be used to collect performance data
What is Internal Validity? What factors does it include?
Asks “was the study done right?”
Includes factors: reproducibility, scientific integrity, refutability
What is External Validity? What factors does it include?
Asks “does the study tell us useful things?” and focuses on if results can be generalisable to real world situations
Includes factors: representativeness of sample population, experimental task, application context
How are the results of a randomised control trial measured?
In terms of effect size, possibly including correlation with factors that might affect performance
What is reported as the results of a randomised control trial?
Significance measures are reported to check whether the observed effects might have resulted from random variation or other factors rather than the treatment
Give 2 disadvantages of RCTs
- Overcoming natural variation needs large samples
- They do not naturally provide understanding of why a change occurred so it is hard to know if the effect will generalise. If there are many relevant variables that are orthogonal, many separate experiments might be required to distinguish between their effects and interactions
What do companies tend to use instead of RCTs?
Proxy measures such as the number of days that customers continue actively using the product
What must all controlled experiments be assessed according to?
Their internal and external validity