Deck 1 Flashcards
(58 cards)
Distribution skewed to the left
A distribution is skewed to the left (negatively skewed) if the left tail is longer than the right tail (e.g. lifespan)
Distribution skewed to the right
A distribution is skewed to the right (positively skewed) if the right tail is longer than the left tail (e.g. wealth)
cross sectional data
where observations are made at a single point in time. The unit of observation is, for example, individuals, industries, regions, countries.
time series data
where observations on the same variables are taken repeatedly over time. The unit of observation is a period of time, for example, annual, monthly, weekly, each minute.
what is an experiment
an activity such as tossing a coin, which has a range of possible outcomes
what is a trial
a single performance of the experiment
what is a sample space
all possible outcomes of the experiment. For a single tossof a coin the sample space is {Heads, Tails}.
mutually exclusive outcomes
onlyone can arise from a single trial. You cannot get bothheads and tails by tossing a coin!
event
a combination of outcomes, for example, rolling a die andobtaining an even number; a roulette wheel ball landingon one of the red numbers.
rules of probability
- A probability always lies between zero and one
- The sum of the probabilities of all outcomes in the sample space must equal one
- The probability of an outcome not occurring must equal one minus the probability of an outcome occurring
compound events
Compound events are when we combine events and try to find a joint probability
- and/or events
non-disjoint event
events that can both occur at the same time. In this case, their set of outcomes within the sample space overlap.
what are ‘or’ events
compound events where event a or event b can take place, we need to add the probabilities
what are ‘and’ events
events are not mutually exclusive
random variables
A random variable is a variable whose outcome (value it takes) is, at least to some extent, a result of chance, and therefore unpredictable (not perfectly predictable).
expected value
the mean of a probability distribution.
binomial distribution
- When the underlying probability experiment has only two possible outcomes
- Used for repeated trials
normal distribution
- When many small independent factors influence a variable
- Use the Normal Distribution to answer questions about the probability of a random variable falling within a range
poisson distribution
- For rare events, when the probability of occurrence is low
- It is used for binary outcomes (event happens or does not), like the Binomial
- Use in place of the Normal Approximation to the Binomial when nP < 5
central limit theorem
If the sample size is large (n > 30) the population does not have to be Normally distributed, the sample mean is (approximately) Normal whatever the shape of the population distribution
Different kinds of sampling
- Simple random sampling
- Stratified sampling
- Cluster sampling
- Multi-stage sampling.
the sampling frame
The sampling frame is the list of subjects in the population from which the sample is taken, ideally it lists the entire population of interest
simple random sampling
- Need a list of the population, then randomly select sample observations from that list
- Every member of the population has an equal chance of being selected.
stratified sampling
- Simple random sampling can lead to unrepresentative samples, if we are unlucky.
- If there is an important variable that affects the main outcome of interest, then we want to see that variable represented appropriately in our sample.