STATS/CA Flashcards

(48 cards)

1
Q

what does association mean in statistics

A

two variables associated with one another in presence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is spurious

A

relationship exists but is misleading as its due to the influence of confounding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

indirect and direct association

A

Indirect: impact of a confounder
direct: associated with one another one impacts other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is labbe graph

A

the event rate in the experimental (intervention) group against the event rate in the control group, as an aid to exploring the heterogeneity of effect estimates within a meta-analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is shown below and above the line in the graph

A

above -favours experimental Y

below -favours control X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are the two criteria used for causation

A

Susser criteria

Bradford Hill

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are the key terms for both?

A

Time order
Association
Direction

TAD

SSTCC

Strength
Specificity
Temporality
Consistency
Coherence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what does direction and coherence refer to

A

plausible
based on knowledge /theory too

Temporality and time order - cause precedes the effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is bias ?

A

systematic error - conclusions that are incorrect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a confounding factor

A

may influence the independent or dependent variabl

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ecological bias

A

conclusions on individuals drawn from group studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

lead time bias

A

time between early detection and onset of disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the pygamalion effect

A

subconciously meaure data which favours outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

late look bias

A

gather info at inappropriate time
such as studying disease when patient has died

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

phases of a trial
I
II
III
IV

A

i -healthy /pharmacological
ii -lager group/efficacy
— even larger/confirmatory
— after licence/post marketing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how can u reduce confounding

A

randomisation
restriction
matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what can be done at analysis stage to reduce confounding

A

stratification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is regression?

A

to predict values of other dependent variables from indpdendent
i.e. linear , non linear + or - regiresison
weak, moderate, strong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what can be used for variables correlation coefficient

A

pearsons
parametrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

what if it is non parametric

A

then you’d use spearmann
RS- sample, p for population

21
Q

parametric vs non parametric

A

The basic idea behind the Parametric method is that there is a set of fixed parameters that are used to determine a probability model that is used in Machine Learning as well. Parametric methods are those methods for which we priory know that the population is normal, or if not then we can easily approximate it using a Normal Distribution which is possible by invoking the Central Limit Theorem.

Parameters for using the normal distribution are as follows:

Mean
Standard Deviation

Non-parametric methods are statistical techniques that do not rely on specific assumptions about the underlying distribution of the population being studied. These methods are often referred to as “distribution-free” methods because they make no assumptions about the shape of the distribution.

22
Q

what is discrete data

A

certain values i.e. no of exercebations within the 1 year
vs continous e.g. weight

23
Q

intervAL or ratio data

A

ratio like weight:height -BMI meanginful

interval- diff in two values -meaningful

24
Q

berkson bias

A

hospital
patient selected
case control with those as controls

25
what is the delphi method
two rounds of questions 1st one is open ended second round is items summarised by investigators based on info from first round 3rd round -panelist have items to rate and revise
26
issue with delphi method
low response rate time consuming expert opinion needed
27
examples of inferitial statistics
t test chi squared to make generlisations and draw conclusions about a population based on data
28
what is the formula for disease rate
dx in exposed in cohort/dx in non exposed a/a+b / (c/c+d)
29
what does PICO stand for
patient intervention comparison outcome
30
what is associated with QALYS
cost utility
31
cost benefit example
live years saved
32
cost effectiveness
observation easy to mearsure i.e. symptom reduced
33
cost minimisation
indirect cost saved i.e. staff, work proudcitivity, equipement
34
precision
the ability of a test to produce the same measurement with repeated tests
35
what does a funnel plot show
bias in meta analysis triangle one
36
what does rejecting the null hypothesis do
accept alternate hypothesis
37
what does bonferroni correction do
adjust the significance level to avoid type 1 errors
38
kappa cofficent signifiance
the agreemeent between two sets of data/testers 1 =agreement 0=poor
39
internal and external validity
intwneal - confidence that we can place in the cause and effect in a study external - conclusion of the study can be used for other person ability to generalise
40
convergent and divergent validity
convergent validity is a subtype of construct validity. Construct validity is an indication of how well a test measures the concept it was designed to measure. high or low correlation
41
what CONSORT
conslidating standard of reporting triAL -RCT
42
What is STROBE
for observational studies strenghtening the report of OS in epidemology
43
PRISMA
statistical regression and meta analysis
44
QUORM
qyality of for meta ablysis
45
STARD
diagnostic accuracy Sequenced Treatment Alternatives to Relieve Depression
46
two sets of data related not normally distributed i.e. Before and after
wilcoxon r test
47
chi squared can be used for
2 variables association
48
mann whitney test
2 independent groups not normally distributed