WEEK 3 Flashcards
(30 cards)
Sampling distribution definition
A sampling distribution is the distribution of a statistic (like a mean or proportion) you get by taking many random samples from the same population.
What is bias
a term used to describe statistics that don’t provide an accurate representation of the population
What is precision
the consistency or reproducibility of measurements or estimates
How to improve precision
Increasing the sample size
How to reduce bias
Using random sampling
What is sampling error
a statistical error that occurs when an analyst does not select a sample that represents the entire population of data
What is a non-sampling error
an error that occurs during data collection, causing the data to differ from the true values
Difference between sampling and non-sampling errors
Sampling error is an error that occurs from taking a sample from a larger population, whilst non-sampling error comes from other means such as errors in data collection or data entry
What is an observational study
A type of study in which individuals are observed or certain outcomes are measured. No attempt is made to affect the outcome (for example, no treatment is given).
Used when an experiment is thought to be unethical
What is an experiment
a scientific test in which you perform a series of actions and carefully observe their effects to learn about something.
Preferred over observational studies as we’re able to test different groups of subjects to the measured response variables
Randomisation
A situation where individual outcomes are unpredictable, but there is a consistent overall pattern in the results that can be described using probability — for example, flipping a coin or rolling a die.
Replication
Repeating the data collection on many subjects to reduce chance variation in
results.
Blocking
splitting subjects into blocks (subsets) based on a variable. believed to have an impact on the results
Probability distribution meaning
a mathematical function that describes the probability of different possible values of a variable
It can take any number between 0 and 1, but the sum of the probabilities of all outcomes must equal 1 (1 being 100% and it ranges from 0% - 100%)
what does P(A) = 1 mean
the probability of an event happening is 100%
What does P(A) = 0 mean
the probability of an event happening is 0%
What does P(A) = 0.75 mean
the probability of an event happening is 75%
Sample space
the set of all possible outcomes
mutually exclusive
two or more events that cannot happen at the same time
e.g. rolling a 2 and a 3 in a single dice roll
Collectively exhaustive
A set of events that encompasses all possible outcomes
Relationship between collectively exhaustive events and sample space
The union of exhaustive events of an experiment forms the sample space
Union
If event A, event B or both event A & B can occur, it is expressed as ‘A or B’ or A∪B
Intersection
If event A & event B can occur at the same time, it is expressed as ‘A and B’ or A∩B
Marginal probability
the probability of a single event occurring, independent of other events
(e.g. P(A))