Data analysis Flashcards

1
Q

Nominal

A

data categorical, independent and coded as numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

example of nominal

A
diagnosis eg 
1 = stable angina 
2 = unstable angina 
3 = acute coronary syndrome STEMI
4 = acute coronary artery syndrome NSTEMI
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

ordinal

A

data are categorical and have a relative direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

ordinal eg

A
BMI: 
1 = underweight
2 = normal weight 
3 = overweight 
4 = obese
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

interval

A

data relative to each other

no true zero (absence) exists

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

interval eg

A

temp:
degrees
kelvin
F

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ratio

A

data relative to each other
true 0 (absence) exists
eg currency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what type of variable is colour

A

nominal - different primary colours
ordinal and interval - shades
ratio - wave lengths

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a statistic

A

describes a characteristic of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a parameter

A

describes a characteristic of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

samples

A

study small gp (sample) to infer what would happen in the pop
want to be representative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

intention to treat

A

all data from randomised trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

per protocol analysis

A

based on treatment received/completed

risk of adding bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

quantitive data

A

interval and ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

qualitative

A

nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

axis labels for frequency distribution

A

y - frequency

x - variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

mean

A

add all values and divide by number of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

median

A

order all values in ascending order and choose the middle value

19
Q

mode

A

identify the most frequent value measured

20
Q

effect of mean=mode=median

A

normally distributed

symmetrical

21
Q

effect of meanΒ»median>mode

A

shift to lower end

22
Q

averages

A

different indications of true averages
depend on data interpretation
non-parametric need different statistical tests

23
Q

normal distribution

A

bell shaped curve

or Gaussian distribution

24
Q

mean =

25
variance =
E(x - mean)squared/(n-1)
26
sd =
𝑠= βˆšπ‘‰π‘Žπ‘Ÿ
27
SEM =
𝑆𝐸𝑀= 𝑠/βˆšπ‘›
28
confidence interval =
mean +- z xSEM
29
z score for 95% ci
1.96
30
excel mean
=average(XX:XX)
31
excel variance
=var.s(XX:XX)
32
excel sd
=stdev(XX:XX)
33
excel SEM
=stdev(XX:XX)/sqrt(count(XX:XX))
34
comparision between SEM and sd
SEM
35
legend
remember n numbers
36
excel formulae for T test
ttest(array1,array2,tails,type)
37
t test tails
1 - see difference in 1 dirn | 2 - not assuming anything, T test in both directions
38
type of T test
``` paired - same sample unpaired - completely independent 2 - unpaired, equal variance 3 - unpaired, unequal variance better to do 3 if unsure ```
39
what does the appropriate index of average and variability depend on
distribution
40
normality
key assumption | informs how data is managed, analysed and reported
41
when to use mean, median or mode
not mean when outliers median - when data skewed - mean loses ability to show central value as data is dragging it away, median less stringly affected mode - not with continuous, unlikely to get more than 1 people with exactly same value mean - when normally distributed
42
when to use mean, median and mode with data typesn
nominal - mode ordinal - median interval/ratio not skewed - mean interval/ratio data skewed - median
43
when to use different types of variation
SD - shows general variability, descriptive, assess overall variation, estimate percentiles of normally distributed data SE - variability between samples, technical and inferential, assess precision of estimate CI - indicates likely range of values - assess the certainty of an estimate and compare to bench marks