Stats Flashcards
(29 cards)
Variance
Standard deviation squared
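A quick check of this card, as a minimal sketch using Python's built-in statistics module. The data values are made up for illustration, and the population versions of the formulas are assumed.

```python
import math
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up example values

sd = statistics.pstdev(data)           # population standard deviation
variance = statistics.pvariance(data)  # population variance

# Variance equals the standard deviation squared (up to floating-point rounding).
same = math.isclose(variance, sd ** 2)
print(sd, variance, same)
```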
No Correlation
r=0
Weak correlation
0 < |r| < 0.5
Moderate correlation
0.5 < |r| < 0.75
Strong correlation
0.75 < |r| < 1
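The correlation cards above can be sketched as one small function. This is only an illustration: the function name is hypothetical, it assumes strength depends on the size of r and not its sign, it places the boundary values 0.5 and 0.75 in the higher band, and the "perfect" label for |r| = 1 is an added assumption, not from the cards.

```python
def correlation_strength(r: float) -> str:
    """Label r using the cut-offs from the cards (hypothetical helper)."""
    if not -1 <= r <= 1:
        raise ValueError("r must lie between -1 and 1")
    size = abs(r)  # assumption: the sign only gives direction, not strength
    if size == 0:
        return "none"
    if size < 0.5:
        return "weak"
    if size < 0.75:  # assumption: exactly 0.5 counts as moderate
        return "moderate"
    if size < 1:     # assumption: exactly 0.75 counts as strong
        return "strong"
    return "perfect"  # assumption: label for |r| = 1, not on the cards
```

For example, correlation_strength(-0.6) gives "moderate", because only the size of r is used.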
Lower outlier
X < LQ - ( 1.5 x IQR )
Upper Outlier
X > UQ + ( 1.5 x IQR )
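The two outlier rules above can be sketched like this. The function name is hypothetical. Quartiles are taken from statistics.quantiles with its default method, which may differ slightly from the quartile method used in class, so values close to the boundary could be classified differently.

```python
import statistics

def find_outliers(data):
    """Return values below LQ - 1.5 x IQR or above UQ + 1.5 x IQR."""
    # With n=4, quantiles() returns the three quartile cut points: LQ, median, UQ.
    lq, _, uq = statistics.quantiles(data, n=4)
    iqr = uq - lq
    lower_limit = lq - 1.5 * iqr
    upper_limit = uq + 1.5 * iqr
    return [x for x in data if x < lower_limit or x > upper_limit]

print(find_outliers([1, 2, 3, 4, 5, 6, 7, 8, 9, 100]))
```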
Convenience Sampling
- selecting most easily accessible members
Pros: fast, cheap, easy
Cons: does not represent whole population, biased sample, not replicable
Simple Random Sampling
- each member of population has equal chance of being selected
Pros: no sampling bias, generalisability
Cons: time consuming
Systematic Sampling
- when participants are selected using a fixed periodic interval
Pros: better population coverage, simple & easy
Cons: could be biased if there is a periodic pattern in the population
Stratified Sampling
- dividing the population into smaller groups (strata) based on shared characteristics, then randomly selecting members from each group
Pros: ensures proper representation, more precise population estimates
Cons: if groups are not mutually exclusive there could still be bias
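A rough sketch of the three random-based sampling methods above, using Python's random module. All function names are hypothetical. The stratified version takes the same number from every group, which is a simplifying assumption (in practice the number is often proportional to group size).

```python
import random

def simple_random_sample(population, k, rng):
    # Every member has an equal chance of being chosen.
    return rng.sample(population, k)

def systematic_sample(population, step, rng):
    # Random starting point, then every step-th member after it.
    start = rng.randrange(step)
    return population[start::step]

def stratified_sample(population, key, per_group, rng):
    # Split into groups by a characteristic, then sample randomly within each group.
    groups = {}
    for member in population:
        groups.setdefault(key(member), []).append(member)
    sample = []
    for members in groups.values():
        sample.extend(rng.sample(members, min(per_group, len(members))))
    return sample

rng = random.Random(0)       # fixed seed so the example is repeatable
people = list(range(100))    # made-up population of 100 ID numbers
print(simple_random_sample(people, 5, rng))
print(systematic_sample(people, 20, rng))
print(stratified_sample(people, lambda x: x % 2, 3, rng))
```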
mode
the value that appears most often in the data
median
middle value when the numbers are listed in order
mean
the average: sum of all the values divided by the number of values
modal class
the class interval that contains the most data points
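The averages above can be checked with Python's statistics module. This is a small sketch: the data values and the grouped frequency table are made up for illustration.

```python
import statistics

data = [4, 2, 9, 1, 7, 2, 3]  # made-up values; in order: 1, 2, 2, 3, 4, 7, 9

mode_value = statistics.mode(data)      # most frequent value
median_value = statistics.median(data)  # middle value once sorted
mean_value = statistics.mean(data)      # sum of values / number of values

# Modal class: the interval with the highest frequency (made-up table).
frequencies = {"0-9": 3, "10-19": 8, "20-29": 5}
modal_class = max(frequencies, key=frequencies.get)

print(mode_value, median_value, mean_value, modal_class)
```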
1 standard deviation
about 68% of data lies within 1 standard deviation of the mean (normal distribution)
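2 standard deviations
about 95% of data lies within 2 standard deviations of the mean (normal distribution)
3 standard deviations
about 99.7% of data lies within 3 standard deviations of the mean (normal distribution)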
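The percentages on these cards can be checked with statistics.NormalDist. This is a sketch that assumes the data follows a normal distribution; the helper name is hypothetical.

```python
from statistics import NormalDist

def share_within(k, dist=NormalDist()):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    upper = dist.mean + k * dist.stdev
    lower = dist.mean - k * dist.stdev
    return dist.cdf(upper) - dist.cdf(lower)

for k in (1, 2, 3):
    print(k, round(share_within(k) * 100, 1))
```

The results are close to 68%, 95% and 99.7%, which is why the cards use those rounded figures.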
Union
- u
- P(AuB)
- ‘or’
Intersection
- n
- P(AnB)
- ‘and’
Dash or ' meaning
- complement: 'not' (the event does not happen)
- if ' is on the outside of the brackets, apply it to every term inside and swap the operation
e.g. P((AnB)')
= P(A'uB')
the n (intersection) turns to u (union) with '
& vice versa: P((AuB)') = P(A'nB')
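The union, intersection and dash cards can be tried out with Python sets. This is a sketch on a made-up example (one roll of a fair die, so all outcomes are assumed equally likely); the events A and B and the helper names are chosen only for illustration.

```python
from fractions import Fraction

sample_space = set(range(1, 7))  # outcomes of one fair die roll
A = {2, 4, 6}                    # made-up event: roll is even
B = {4, 5, 6}                    # made-up event: roll is greater than 3

def prob(event):
    # Assumes every outcome in the sample space is equally likely.
    return Fraction(len(event), len(sample_space))

def complement(event):
    # The dash: everything in the sample space that is NOT in the event.
    return sample_space - event

p_union = prob(A | B)         # P(AuB), 'or'
p_intersection = prob(A & B)  # P(AnB), 'and'

# Distributing the dash: n turns to u, and u turns to n.
rule_1 = complement(A & B) == complement(A) | complement(B)
rule_2 = complement(A | B) == complement(A) & complement(B)
print(p_union, p_intersection, rule_1, rule_2)
```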
Binomial distribution formula to memorise: the probability of exactly r successes in n trials, P(X=r)
P(X=r) = nCr x p^r x (1-p)^(n-r), where p is the probability of success on each trial
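The formula above can be written as a short function, using math.comb for nCr. This is a minimal sketch: the function name is hypothetical, and it assumes n independent trials with the same probability of success p on each.

```python
from math import comb

def binomial_pmf(r, n, p):
    """P(X = r): probability of exactly r successes in n trials."""
    return comb(n, r) * p**r * (1 - p)**(n - r)

# Example: exactly 2 heads in 4 tosses of a fair coin.
print(binomial_pmf(2, 4, 0.5))
```

Adding up binomial_pmf(r, n, p) for every r from 0 to n should give 1, which is a useful check on an answer.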