summarising and displaying data Flashcards
—- are scales w underlying defined
unit.
example:
– A count (number of children)
– An accepted unit
* Years
* Metres
* Euros
these scales can be —- or —-
numeric scales
continuous or discrete
true or false:
-Many things cannot have a defined unit
as :Depression, satisfaction, pain
-We recognise that people can be satisfied, or in pain, to a
greater or lesser extent
-The problem is measuring these concepts without a defined
unit
true
—– Used to measure relative quantity
ordinal scales
age measured in years, unit of days are examples of
defined units
– Severity of pain: mild, moderate, severe
–Alcohol consumption: none, low, high
–Quality of life score: 0, 1, 2,….,10
are examples of:
ordinal scales ( check slide 12)
Numeric and ordinal scales are labels that tell us —- and the more basic example is — by which —- is the basis of measurement
how much
what
classification
Labelling schemes that classify people or things or events are —-
examples are:
nominal measurement scales
– Disease classification schemes e.g ICD 10 (International Classification
of Diseases)
– Eye color: Blue, green, brown, hazel, gray
– Types of activity: sitting, walking, cycling, swimming, other
nominal measurement scales tells us— of thing something is and its based on ——
what kind
agreed classification
Some scales have only two labels these are called —-
dichotomous scales
– Eye color: Blue, green, brown, hazel, gray
– Types of activity: sitting, walking, cycling, swimming, other
are examples of
nominal measurement as blood groups types
– Disease status: Presence or absence of disease
– Lab test result: Positive or negative
– Mortality : alive or dead status
– Exam result: Pass or fail
are examples of
dichotomous scales - simplest sort
types of variables summary:
1- —- variables
– Defined units, tell us how much in an absolute sense
– Can be continuous or discrete
Categorical variables
*—– scales
– Tell us how much, but in a relative rather than absolute sense
*—– scales
– Classify. Tell us what rather than how much
– Called —- scale when only two values
numeric
ordinal
nominal
dichotomous
Knowing the measurement scale of data informs us as to how we should —- and — it
display and summarise it
Summaries are — than the original because of what they leave out
* So any summary is a —- of the original
things can go wrong by:
1- We present aspects of the data that lead to the wrong conclusion
2- We leave out some important aspect of the data, leading to the reader drawing the wrong conclusion
- In practice, data analysts will examine the data in —– ways to make sure to avoid these pitfalls when reporting on them
smaller
simplification
different wats
The most basic summary statistic is a —-
frequency as count or percent ( check the graph of stacked histogram ) and we can use a frequency table
rule of thumbs:
—- for precise information
—- for patterns and understanding
numbers
graphs
A simple graph displaying
frequencies of categories is —-
– —- is preferable but often they
presented —-
bar graph
horizontal
vertical
When the data are measured on a
continuous scale but we have
relatively small amounts of data, we
can display the data as —
dots aka a dot plot this can be used for heights of women and men from a small study
– For men or women with the
same height, the dots are shown
beside each other
With —- amounts of data, we don’t need to rely on the summaries, we can simply show all the data in a plot
* But with —- datasets, the dots become too numerous and we rely more and more on summaries
small
larger
death in intensive care unit:
Patients had their risk of death calculated using —– scores
* These scores combines — to produce an —- of the—- of death
* The study also looked at length of stay
* These two variables - length of stay and APACHE-II scores
- the dots show will be —–
APACHE-II
risk indicators
overal prediction
chance
predicted risk of
death (APACHE-II scores)
( check slide 27 pls , 28)
Summarising the risk scores using % cut-offs :
- These summaries don’t show us — the data, but they give us a good idea of —
- they show —-
- and give some idea of how scores – around that
all
key marker
middle point/halfway
vary
—– is a value representing a cut off of a specified percentage of the data
percentiles but also called quantiles ( check graph 29 plsss)
—– is the half-way point of the data values.
– Strictly speaking, half of the values lie —— the median
– The — percentile!
median
lie at or below
50th
( check slide 31 PLSSSS)
—- is the average and it indicates approcimaently where the data is located on the number line.
and its calculated as:
mean
“Sum up the individual values then
divide by the number of them”
mean can be misleading tho ( check the bar graph 35 )