lecture 4 Flashcards
(37 cards)
research process steps
Development of research question
Design of study
Collection of data Description/analysis of data Interpretation of results
Study design refers to
the methods used to select the study participants, control any experimental conditions, and collect the data.
study design
Interpretation of results depends on the study design.
The study design should be tailored to the research question.
Methods of statistical analysis and information produced will depend on the study design.
Good research requires good study design.
- Often given too little attention.
Broadly identify two main types of study design
descriptive studies
analytic studies
descriptive studies
studies which describe things such as surveys
nerve seek to intervene or split groups, only collecting data, no intervention
analytic studies
studies which test hypotheses
have a particular claim/value we want to test against
experimenting on humans or entities
two types = experimental studies and observational studies
experimental studies
type of analytic study
e.g. randomised controlled trial (e.g. testing a drug)
observational studies
type of analytic study
e.g. cohort studies, case-control studies
aim of descriptive studies
Aim: To describe, for example,
the characteristics of people with a disease (person; place; time);
lifestyle patterns in a population;
attitudes to health care.
Descriptive studies are often simply referred to as surveys.
Generally use a sample from the population of interest.
do not go into it with any preconceived hypothesis (can develop hypotheses through the results of these studies)
population
Complete set of entities or elements or units or people that we wish to describe or make inference about.
should be well defined such as all patient diagnosed with colorectal cancer in NZ in 2015 rather than the population of NZ (because is this right not? past? future?)
census
whole population investigated
sample
a subset of population
what does a sample need to be to provide a good description of a population? (exam)
To provide a good description of the population, samples need to be ‘representative’ of the population they were drawn from; this is why random sampling (also known as random selection) of study units is important.
sample must be representative of the population and randomisation is necessary to achieve a representative sample
What if random sampling is impossible
Need to consider carefully how likely it is that our sampling procedure
will produce a representative sample
- Sometimes we can compare the characteristics of the sample to known
facts about the population; e.g. are the age distributions the same?
Sampling frame
List of items in a population from which a sample is drawn.
A good one will encompass the population of interest
Rarely coincides with the entire population of interest:
- Telephone numbers (in the past this was good, the present not so good as less and less people have landlines therefore not as good to get a random sample from this)
- Electoral roll
Often doesn’t exist:
- All people with depression
- All potential users of a new drug
Even without a list we can ensure an ‘unbiased’ sample if every individual has the same chance of being drawn.
Method
Select a subgroup (sample) of people and measure (this is generally the process)
Random sampling
Choose the sample in such a way that every individual in the population has a known chance of being selected.
In a simple random sample, everyone has an equal chance of being chosen.
This method is the best way of obtaining a sample which is representative of the population.
randomisation is good because it aim to get a representative sample
simple random sample
In a simple random sample, everyone has an equal chance of being chosen. - probability of choosing people is known and equal
This method is the best way of obtaining a sample which is representative of the population.
equation of sample mean =
population (true) mean + error
error can be systematic error or random error
error can be positive or negative
error
is how far from the truth the estimate is
sample mean
is the estimate of the true corresponding parameter value
you hope that it is close to the true population mean
random error also known as
random variability or uncertainty or chance
this is good as it can be dealt with
Random error (chance)
Due to natural variability.
Increasing the sample size will reduce the random fluctuations in the sample mean.
Statistical methods allow us to quantify the influence of random error on our estimate.
Systematic error is descriptive study =
bias