Mod 2E Truth & Feedback Flashcards
What is biased data?
Data that inaccurately represents the intended population due to a non-random sample.
What is selection bias?
A bias introduced when data is drawn from a non-random sample, so that some groups are over- or underrepresented.
Example: Polling via landlines may overrepresent older individuals.
What is Simpson’s paradox?
A phenomenon where a trend that appears in each subset of the data disappears or reverses when the subsets are aggregated.
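Worked example (a minimal sketch in Python; the group names and counts are invented purely for illustration and are not real data): option A has the higher success rate inside every group, yet option B has the higher rate once the groups are pooled, because the two options are spread very unevenly across the groups.

```python
# Invented counts for illustration: (successes, trials) for each option per group.
groups = {
    "group_x": {"A": (8, 10), "B": (70, 100)},
    "group_y": {"A": (30, 100), "B": (2, 10)},
}


def rate(successes, trials):
    """Success rate as a fraction between 0 and 1."""
    return successes / trials


# Within each group, compare the two options.
for name, arms in groups.items():
    per_arm = {arm: round(rate(*counts), 3) for arm, counts in arms.items()}
    print(name, per_arm)

# Aggregate by summing successes and trials across groups.
totals = {}
for arms in groups.values():
    for arm, (successes, trials) in arms.items():
        prev_s, prev_n = totals.get(arm, (0, 0))
        totals[arm] = (prev_s + successes, prev_n + trials)

overall = {arm: rate(*counts) for arm, counts in totals.items()}
print("overall", {arm: round(r, 3) for arm, r in overall.items()})
```

A leads in both groups (0.8 vs 0.7 and 0.3 vs 0.2), but pooled the rates are 38/110 for A and 72/110 for B, so the direction of the trend reverses.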
What is survivor bias?
A bias where data includes only subjects that “survived” a process, excluding those that did not.
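Worked example (a rough sketch in Python; the fund returns below are made up for illustration): averaging only the subjects that survived gives a much rosier figure than averaging everyone who started.

```python
from statistics import mean

# Invented annual returns (%) for eight hypothetical funds.
# The second field records whether the fund still exists at the end of the period.
funds = [
    (12, True),
    (9, True),
    (7, True),
    (-15, False),
    (-8, False),
    (5, True),
    (-20, False),
    (10, True),
]

# Average over every fund that started the period.
all_mean = mean(ret for ret, _ in funds)

# Average over survivors only, which is what a dataset of "current funds" would show.
survivor_mean = mean(ret for ret, survived in funds if survived)

print("all funds:", all_mean)
print("survivors only:", survivor_mean)
```

With these numbers the full set averages 0%, while the survivors alone average 8.6%.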
How can data visualisations be misleading?
Through design choices that a designer makes intentionally to mislead the audience or influence its interpretation of the data.
How can we identify missing data in a dataset?
Using the COUNTBLANK function to count the empty cells in a range
Applying conditional formatting to highlight blank cells
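Outside a spreadsheet, the same check can be approximated in a few lines of Python. This is a sketch only: the helper name count_blank, the sample rows, and the rule that None or an empty string counts as blank are all assumptions made for illustration, not a reproduction of COUNTBLANK's exact behaviour.

```python
def count_blank(cells):
    """Count cells that are missing (None) or an empty string."""
    return sum(1 for cell in cells if cell is None or cell == "")


# Invented sample table: each dict is one row.
rows = [
    {"id": 1, "age": 34, "city": "Leeds"},
    {"id": 2, "age": None, "city": ""},
    {"id": 3, "age": 29, "city": None},
    {"id": 4, "age": None, "city": "York"},
]
columns = ["id", "age", "city"]

# Blank count per column, similar in spirit to running COUNTBLANK on each column.
blank_counts = {}
for col in columns:
    blank_counts[col] = count_blank([row.get(col) for row in rows])

print(blank_counts)
```

Here the id column has no blanks, while age and city each have two.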
How can we detect data errors?
Summary statistics
Frequency distributions
Column/bar charts
Histograms
Scatter charts
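The first two techniques can be sketched in Python. The values, the plausible age range of 0 to 120, and the "appears only once" rule are assumptions chosen for illustration: summary statistics expose impossible values, and a frequency distribution exposes misspelt or inconsistently capitalised categories.

```python
from collections import Counter
from statistics import mean, median

# Invented numeric column containing two entry errors (290 and -4).
ages = [34, 29, 41, 38, 290, 33, -4, 36]

summary = {
    "count": len(ages),
    "min": min(ages),
    "max": max(ages),
    "mean": mean(ages),
    "median": median(ages),
}
print(summary)

# Flag values outside an assumed plausible range for a person's age.
suspect_ages = [a for a in ages if not 0 <= a <= 120]
print("suspect ages:", suspect_ages)

# Invented categorical column with a typo and a capitalisation mismatch.
regions = ["North", "South", "north", "South", "Nroth", "North"]
frequency = Counter(regions)
print(frequency)

# Categories that occur only once are worth a manual look.
rare_regions = [value for value, n in frequency.items() if n == 1]
print("rare regions:", rare_regions)
```

The minimum and maximum immediately show that -4 and 290 cannot be valid ages, and the frequency table surfaces "north" and "Nroth" as likely variants of "North".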
What are some deceptive chart design techniques?
Manipulating chart axes
Using dual-axis charts
Selecting only favourable data or time intervals
Misleading geographic/choropleth maps
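Worked example for axis manipulation (a small sketch in Python; the function name drawn_height_ratio and the values are hypothetical): when a bar chart's value axis starts above zero, the drawn bar heights no longer match the ratio of the underlying values.

```python
def drawn_height_ratio(a, b, axis_min=0.0):
    """Ratio of the drawn bar heights for values a and b
    when the value axis starts at axis_min instead of zero."""
    if min(a, b) <= axis_min:
        raise ValueError("axis_min must be below both values")
    return (a - axis_min) / (b - axis_min)


# Two invented values that differ by 4%.
value_a, value_b = 52, 50

# Axis starting at zero: bar heights reflect the real ratio.
honest_ratio = drawn_height_ratio(value_a, value_b)

# Axis truncated to start at 48: the taller bar is drawn much taller.
truncated_ratio = drawn_height_ratio(value_a, value_b, axis_min=48)

print("axis from 0:", round(honest_ratio, 2))
print("axis from 48:", round(truncated_ratio, 2))
```

With the axis starting at 48, a 4% difference is drawn as one bar being twice the height of the other.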