Statistics Flashcards

Question

what is quota sampling?

Answer 1

the population is divided into subgroups (e.g. by age, gender or ethnicity) and a quota is set for each subgroup. Participants are then selected non-randomly until each quota is filled (e.g. by convenience, first come first served)

Answer 2

participants are chosen based on specific criteria, e.g. you would specifically contact patients with a disease because you want to study it, rather than randomly sampling the general population

Answer 3

participants recruit other participants - good for hard to reach or isolated groups

Answer 4

take a cohort of people and study them over time - this is observational and prospective. Two or more groups are selected based on their exposure to a particular agent (e.g. toxin, smoking) and are followed over time to see how many develop a disease or other outcome.

Answer 5

relative risk - as you compare the two groups

Answer 6

participants with a particular condition (cases) are matched with controls who do not have it. This is observational and retrospective. Data are then collected about the past to identify a possible causal agent for the condition.

Answer 7

odds ratio

Answer 8

inexpensive
produces quick results
useful for rare conditions

Answer 9

usually prone to confounding factors

Answer 10

provides simply a snapshot in time - sometimes called prevalence studies
provides weak evidence of cause and effect

Answer 11

participants experience both the experimental arm and the placebo arm - useful if it is unethical to deprive patients of a particular treatment (e.g. cancer treatments)

Answer 12

participants are chosen who have ALREADY been exposed, where it would be unethical to expose them deliberately, e.g. children playing violent video games

Answer 13

helpful when the exposure is unethical, as participants already have the exposure
can measure multiple outcomes
cheap
can analyse risk

Answer 14

participants can be lost to follow-up
can be affected by recall bias if retrospective
confounding variables

Answer 15

rate at which new cases occur in a population over time, e.g. 10 new cases per 1000 people per year

Answer 16

total number of cases of a disease that exist at a given time, e.g. currently 50,000 people with asthma
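
The two definitions above (incidence and prevalence) can be sketched numerically. This is a minimal illustration, not a standard library API: the function names are made up here, and the population of 1,000,000 used for the asthma example is a hypothetical figure, since the card gives only the case count.

```python
def incidence_rate(new_cases, population_at_risk, years=1.0):
    """New cases per person per year over the observation period."""
    return new_cases / (population_at_risk * years)


def prevalence(existing_cases, population):
    """Proportion of the population that has the disease at one point in time."""
    return existing_cases / population


# 10 new cases in a population of 1000 over one year (figures from the card)
inc = incidence_rate(new_cases=10, population_at_risk=1000, years=1.0)
print(f"incidence: {inc * 1000:.0f} per 1000 per year")

# 50,000 existing cases (from the card) in a HYPOTHETICAL population of 1,000,000
prev = prevalence(existing_cases=50_000, population=1_000_000)
print(f"prevalence: {prev:.1%}")
```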

Answer 17

cross sectional study

Answer 18

cohort study

Answer 19

meta-analysis / systematic review

Answer 20

in qualitative research - it is a method used to generate a new theory about a phenomenon of interest from the collection of new data. The new theory needs to be grounded or rooted in the observations made - hence the name. It is a complex process, which begins by raising questions that help guide the research but are not static or confining; over time, core theoretical concepts are identified.

Answer 21

the aim is to study an ENTIRE culture, through the researcher becoming immersed in the culture as an active participant and recording field notes.

Answer 22

the goal of phenomenology is to describe the real "lived experience" of a phenomenon

Answer 23

1 - convenience
2 - purposive
3 - snowballing
4 - case study - select a single individual

Answer 24

1 - triangulation
2 - respondent validation (aka member checking)
3 - bracketing
4 - reflexivity

Answer 25

comparing the results of two or more different methods of data collection (for example - interviews and observation)

Answer 26

techniques where the investigator's account is compared with the participants' accounts in order to check how closely they correspond

Answer 27

deliberately putting aside one's own beliefs about the phenomenon under investigation

Answer 28

sensitivity to the ways in which the researcher and the research process have shaped the collected data

Answer 29

the way in which the researchers aim to gain a general agreement around a topic

Answer 30

Delphi method
nominal group technique

Answer 31

aims to gather opinions from experts in a particular area. Occurs in 3 stages:
stage 1 - open-ended questionnaires are sent to participants to generate statements about the topic
stage 2 - participants are asked to rank all of the statements produced in stage 1
stage 3 - statements are further refined and re-ranked to achieve consensus
if consensus is not achieved in stage 3, that stage can be repeated

Answer 32

group of highly structured meetings with a controlled discussion
members independently record ideas and opinions, which are then re-presented to the group and used to clarify and categorise ideas
group members are then asked at the end to rank the ideas to achieve consensus

Answer 33

nominal data - data are placed into named categories - there is no hierarchy among these categories, so you can count but not order them (e.g. birthplace)
ordinal data - observed values can be put into categories which can be ordered (e.g. NYHA classification of heart failure symptoms)

Answer 34

discrete - values are whole numbers, e.g. number of asthma exacerbations per year
continuous - data can take any value, e.g. weight
binomial - data can take only two values (e.g. biological sex)
interval - the difference between two values is meaningful, but there is no true zero, e.g. temperature in degrees Celsius (0 does not mean an absence of temperature)

Answer 35

prediction of no relationship between the two variables being tested

Answer 36

predicts a relationship does exist between the two variables being tested

Answer 37

the null hypothesis is rejected when it is true (i.e. showing that there is a difference between groups when actually there is not - a false positive)
this is determined against a preset significance level, alpha

Answer 38

the null hypothesis is accepted when it is false (i.e. saying there is no difference between groups when actually there is - a false negative)
this is termed a beta error

Answer 39

the power is the probability of correctly rejecting the null hypothesis when it is false

Answer 40

power = 1 - beta, where beta is the probability of a type II error

Answer 41

by increasing the sample size

Answer 42

measure of whether the independent variable (cause) has an impact on the dependent variable (effect)

Answer 43

situation where two phenomena occur together - these could either be related or by chance

Answer 44

spurious - relationship between the variables occurs purely due to chance
indirect - relationship between the two variables is due to a confounding factor
direct - there is a true association between the two variables

Answer 45

bradford hill criteria

Answer 46

research method used to measure the relationship between two variables - measured as a correlation coefficient "r", which ranges from -1 to +1, where r = 0 is no correlation and r = +1 or -1 is perfect (positive or negative) correlation

Answer 47

data that follows a normal distribution

Answer 48

data that does not follow a normal distribution

Answer 49

Student's t-test
Pearson's correlation coefficient

Answer 50

is the consistency of the data - can it be replicated consistently to produce similar results

Answer 51

whether a test accurately measures what it is supposed to measure

Answer 52

sum of all values / total number of values

Answer 53

sort all the values into order and select the middle value

Answer 54

most common value appearing in the data set
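
The three averages described in the last three cards (mean, median, mode) can be checked with Python's standard `statistics` module. The data set below is made up purely for illustration.

```python
import statistics

# Hypothetical data set, chosen only to illustrate the three definitions
values = [2, 3, 3, 5, 7, 10]

mean = statistics.mean(values)      # sum of all values / number of values = 30 / 6
median = statistics.median(values)  # middle of the sorted values; with an even count,
                                    # the average of the two middle values (3 and 5)
mode = statistics.mode(values)      # most common value (3 appears twice)

print(mean, median, mode)
```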

Answer 55

positive skew

Answer 56

negative skew

Answer 57

normally distributed

Answer 58

68.3% lies within 1 SD of the mean
95.4% lies within 2 SD of the mean
99.7% lies within 3 SD of the mean
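
These percentages can be reproduced from the normal distribution itself. A minimal sketch using only the standard library: for a normal distribution, the fraction of values within k standard deviations of the mean is erf(k / sqrt(2)). The helper name `fraction_within` is made up for this example.

```python
import math


def fraction_within(k):
    """Fraction of a normal distribution lying within k SD of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} SD: {fraction_within(k):.1%}")
```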

Answer 59

cohort study

Answer 60

no of events/total no in the group

Answer 61

no of events in experimental group / total no in the experimental group

Answer 62

no of events in the control group / total no of participants in the control group

Answer 63

(CER - EER) / CER
or 1 - RR
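
A short worked example may help tie the risk formulas in the cards above together. The trial counts are hypothetical, and it assumes that RR here means relative risk (EER / CER), which is what makes the two forms of the formula above agree.

```python
# HYPOTHETICAL trial: 15 events in 100 treated patients, 20 events in 100 controls
events_experimental, n_experimental = 15, 100
events_control, n_control = 20, 100

eer = events_experimental / n_experimental  # experimental event rate
cer = events_control / n_control            # control event rate
rr = eer / cer                              # relative risk (assumed meaning of RR)
rrr = (cer - eer) / cer                     # relative risk reduction

print(f"EER={eer:.2f} CER={cer:.2f} RR={rr:.2f} RRR={rrr:.2f}")
```

Both routes give the same relative risk reduction, because (CER - EER) / CER = 1 - EER / CER.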

Answer 64

(EER - CER) / EER

Answer 65

case control studies

Answer 66

no of people with event / no of people without the event

Answer 67

odds of exposure in the cases / odds of exposure in the controls
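
The odds and odds ratio definitions above can be sketched for a case-control study. The 2x2 counts are hypothetical, and the sketch assumes the ratio is read as the odds of exposure among cases divided by the odds of exposure among controls.

```python
# HYPOTHETICAL case-control counts
cases_exposed, cases_unexposed = 30, 70
controls_exposed, controls_unexposed = 10, 90

# odds = number with the feature / number without the feature
odds_cases = cases_exposed / cases_unexposed
odds_controls = controls_exposed / controls_unexposed

odds_ratio = odds_cases / odds_controls

# The same value via the cross-product of the 2x2 table
cross_product = (cases_exposed * controls_unexposed) / (cases_unexposed * controls_exposed)

print(f"odds ratio: {odds_ratio:.2f}")
```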

Answer 68

range or interval of values in which the "true" value is expected to lie - e.g. with a 95% confidence interval you are 95% confident that the true result lies in the range, with a 5% chance that it lies outside of this range

Answer 69

t-test - compares the means of two samples only
ANOVA - compares the means of two or more samples, typically three or more (e.g. if you had groups of 20-30yrs, 30-40yrs, 40-50yrs, ANOVA would be used to compare the means across these different groups)

Answer 70

ordinal, interval, or ratio scales or unpaired data

Answer 71

compares two sets of observations on a single sample, e.g. a before-and-after test on the same sample following an intervention

Answer 72

used to compare proportions or percentages across patients following two different interventions

Answer 73

correlation between two variables

Answer 74

forest plot

Answer 75

funnel plot

Answer 76

y = a + bx
a = intercept, the point at which the line crosses the y axis (where x = 0)
b = regression coefficient (the slope of the line)
x = chosen value on the x axis
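
To make the formula concrete, here is a minimal sketch that estimates a and b from data and then predicts y for a new x. It assumes ordinary least squares as the fitting method, which the card does not specify; the function name `fit_line` and the data points are made up for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares estimates of a (intercept) and b (slope) in y = a + bx."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# HYPOTHETICAL data lying exactly on the line y = 1 + 2x
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

a, b = fit_line(xs, ys)
predicted = a + b * 6  # predicted y at the chosen value x = 6
print(a, b, predicted)
```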

Answer 77

phase 0 - exploratory studies - very small number of participants, to explore the effect of the drug in the human body
phase I - safety assessment - determines side effects prior to larger studies, conducted on healthy volunteers
phase II - assesses efficacy - involves a small number of people affected by the disease
phase III - assesses effectiveness - thousands of participants, RCT
phase IV - monitoring for long-term side effects and effectiveness

Answer 78

correlation is a calculation of how closely one variable relates to another variable. Linear regression is then used to predict how much one variable may change when a second variable is changed. This is when you use the formula y = a + bx

Answer 79

parametric data - Pearson's
non-parametric data - Spearman's
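
The difference between the two coefficients can be seen on data that rise steadily but not in a straight line. The sketch below is a plain-Python illustration, not a library API: the helper names are made up, and Spearman's coefficient is computed here as Pearson's coefficient applied to the ranks of the data.

```python
import math


def pearson(xs, ys):
    """Pearson's correlation coefficient r."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson(ranks(xs), ranks(ys))


# HYPOTHETICAL data: y always increases with x, but not linearly
xs = [1, 2, 3, 4, 5]
ys = [1, 4, 9, 16, 25]

r_pearson = pearson(xs, ys)
r_spearman = spearman(xs, ys)
print(f"Pearson: {r_pearson:.3f}  Spearman: {r_spearman:.3f}")
```

Spearman's coefficient is exactly 1 here because the ordering of the two variables matches perfectly, while Pearson's is slightly below 1 because the relationship is curved.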

Answer 80

screening tool correctly identifies the patient as having the disease

Answer 81

screening tool correctly identifies the patient as not having the disease

Answer 82

the screening tool incorrectly identifies the patient as having the disease, when in fact they do not

Answer 83

the screening tool incorrectly identifies the patient as not having the disease, when in fact they do

Answer 84

proportion of patients with the disease who have a POSITIVE result

Answer 85

positive results in people with the disease (TP) / all people with the disease (TP + FN)

Answer 86

proportion of patients without the disease who have a negative result

Answer 87

negative results in people without the disease (TN) / all people without the disease (TN + FP)

Answer 88

the probability that a person with a positive test result actually has the disease

Answer 89

TP / (TP + FP)

Answer 90

the probability that a person with a negative test result actually does not have the disease

Answer 91

TN / (TN+FN)
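
The four screening measures in the cards above can be computed from one 2x2 table. The counts below are hypothetical; the sketch takes sensitivity as TP / (TP + FN) and specificity as TN / (TN + FP), with the predictive values as given in the cards.

```python
# HYPOTHETICAL screening results
tp, fn = 90, 10   # people WITH the disease: test positive / test negative
fp, tn = 50, 850  # people WITHOUT the disease: test positive / test negative

sensitivity = tp / (tp + fn)  # proportion of diseased people who test positive
specificity = tn / (tn + fp)  # proportion of disease-free people who test negative
ppv = tp / (tp + fp)          # probability a positive result is a true positive
npv = tn / (tn + fn)          # probability a negative result is a true negative

print(f"sensitivity={sensitivity:.3f} specificity={specificity:.3f}")
print(f"PPV={ppv:.3f} NPV={npv:.3f}")
```

Note that the predictive values depend on how common the disease is in the tested group, whereas sensitivity and specificity are calculated within the diseased and disease-free groups separately.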

Answer 92

CEA compares a number of interventions by relating costs to a single clinical measure of effectiveness (e.g. symptom reduction, improvement in activities of daily living).

Answer 93

total cost / unit of effectiveness

Answer 94

CBA is a technique in which all the costs and benefits of an intervention are measured in terms of money. A CBA is used to establish which of the alternatives has the greatest net benefit.

Answer 95

CUA is a special form of CEA in which health benefits / outcomes are measured in broader, more generic ways, enabling comparisons between treatments for different diseases and conditions - e.g. using QALYs.

Answer 96

QALYs are a composite measure of gains in life expectancy and health-related quality of life. One QALY is equal to 1 year of life in perfect health.
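
A small numerical sketch of a cost per QALY may make this concrete. It assumes the usual convention that QALYs are calculated as years of life multiplied by a quality-of-life weight between 0 (death) and 1 (perfect health); the weight, costs and durations below are hypothetical.

```python
def qalys(years, quality_weight):
    """Quality-adjusted life years: years lived x quality-of-life weight (0 to 1)."""
    return years * quality_weight


def cost_per_qaly(total_cost, qalys_gained):
    """Total cost divided by the units of effectiveness (here, QALYs)."""
    return total_cost / qalys_gained


# One year in perfect health is one QALY
one_year_perfect = qalys(years=1, quality_weight=1.0)

# HYPOTHETICAL treatment: 4 extra years at a quality weight of 0.75, costing 20,000
gained = qalys(years=4, quality_weight=0.75)
ratio = cost_per_qaly(total_cost=20_000, qalys_gained=gained)
print(f"{gained} QALYs gained, {ratio:.2f} per QALY")
```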

Answer 97

CUA offers something that CEA cannot, which is to compare across treatments for different conditions. In principle, it is possible to compare treatments for, say, cancer with, say, schizophrenia to determine which is the most efficient at producing health gain in the form of QALYs.

Answer 98

Direct - those associated directly with the healthcare intervention (e.g. staff time, medical supplies, cost of travel for the patient, childcare costs for the patient, costs falling on other social sectors such as domestic help from social services)
Indirect - those incurred through the reduced productivity of the patient (e.g. time off work, reduced work productivity, time spent caring for the patient by relatives)
Intangible - those that are difficult to measure (e.g. pain or suffering on the part of the patient)

Answer 99

how much the odds of the disease increase when a test is positive

Answer 100

sensitivity / (1-specificity)

Answer 101

how much the odds of a disease decrease when a test is negative

Answer 102

(1-sensitivity) / specificity
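
The two likelihood ratio formulas above can be applied as follows. The sensitivity, specificity and pre-test probability are hypothetical, and the helper names are made up; the step from odds to probability uses the standard relationships odds = p / (1 - p) and post-test odds = pre-test odds x likelihood ratio.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a test."""
    lr_positive = sensitivity / (1 - specificity)
    lr_negative = (1 - sensitivity) / specificity
    return lr_positive, lr_negative


def post_test_probability(pre_test_probability, likelihood_ratio):
    """Convert probability to odds, apply the likelihood ratio, convert back."""
    pre_odds = pre_test_probability / (1 - pre_test_probability)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1 + post_odds)


# HYPOTHETICAL test characteristics
lr_pos, lr_neg = likelihood_ratios(sensitivity=0.9, specificity=0.8)

# HYPOTHETICAL pre-test probability of disease of 10%
after_positive = post_test_probability(0.10, lr_pos)
after_negative = post_test_probability(0.10, lr_neg)
print(f"LR+={lr_pos:.2f} LR-={lr_neg:.3f}")
print(f"after positive test: {after_positive:.1%}, after negative test: {after_negative:.1%}")
```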

Answer 103

a placebo that produces prominent side effects

Answer 104

P value - is the probability of obtaining a result by chance at least as extreme as the one that was actually observed, assuming that the null hypothesis is true
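
As an illustration of this definition, the sketch below computes an exact one-sided p value for a simple made-up experiment: 15 heads in 20 coin flips, with the null hypothesis that the coin is fair. The function name and the numbers are hypothetical; the calculation simply adds up the binomial probabilities of every result at least as extreme as the one observed.

```python
import math


def one_sided_p_value(successes, trials, p_null=0.5):
    """P(X >= successes) for X ~ Binomial(trials, p_null), i.e. the probability of a
    result at least as extreme as the observed one if the null hypothesis is true."""
    return sum(
        math.comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# HYPOTHETICAL experiment: 15 heads observed in 20 flips of a coin assumed fair
p_value = one_sided_p_value(successes=15, trials=20)
alpha = 0.05  # preset significance level
print(f"p = {p_value:.4f}; reject null at alpha={alpha}: {p_value < alpha}")
```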

(135 cards)