statistics Flashcards

(59 cards)

1
Q

what is a census

A

measures every member of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

advantage of a census

A

accurate results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

disadvantage of a census

A

expensive / testing may destroy the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a sampling unit

A

individuals of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is a sampling frame

A

a list of sampling units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is simple random sampling

A

same chance of being selected.
random number generator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

advantage of simple random sampling

A

bias free

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

disadvantage of simple random sampling

A

need a sampling frame

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is systematic sampling

A

take every k’th unit. k=pop/sample
pick random number between 1 and k to start

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

advantage of systematic sampling

A

quick to use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

disadvantage of systematic sampling

A

need a sampling frame

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is stratified sampling

A

sample represents the groups (strata) of a population.
sample/population x strata
to find out how many people you need in each group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

advantage of stratified sampling

A

reflects population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

disadvantage of stratified sampling

A

population must be classified in strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what are strata

A

groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is quota sampling

A

like stratified but strata is filled by interviewer/researcher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

advantages of quota sampling

A

no sampling frame

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

disadvantages of quota sampling

A

non random, potential bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what is opportunity sampling

A

quota filled by those available at the time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

advantages of opportunity sampling

A

easy/cheap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

disadvantages of opportunity sampling

A

unlikely to be representative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

how to calculate the variance

A

(sum of x^2 / n) - mean ^2
mean of the squares minus the square of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

go to get from variance to standard deviation

A

square root variance = standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

why are histograms used

A

for continuous data

25
what to compare on histograms
measure of location and measure of spread
26
what does PMCC mean
product moment correlation coefficient
27
what does PMCC measure
strength and +/- of correlation
28
what is a regressions line
the best line of best fit
29
what is interpolation
estimating inside the data range more reliable
30
what is extrapolation
estimating outside the data range less reliable
31
what is U on venn diagrams
shade all of both
32
what is n on venn diagrams
shade overlap
33
what does B| A mean
probability of B given that A has been picked
34
what is a mutually exclusive event and its properties
the venn diagram does not overlap P(AnB) = 0 P (AUB) = P(A) + P(B)
35
probabilities of independant events on venn diagrams
P(AnB)= P(A) x P(B) P(A|B) = P(A)
36
conditional probability formula P(B|A) =
P(AnB)/P(A)
37
Addition law for probability P(AUB) =
P(A) + P(B) - P(AnB)
38
what is discrete uniform distributions
probabilities of all outcomes are equal
39
Binomial distribution notation
X~B(n,p) X is distributed binomials n = number of trials p = probability of success
40
when to use a binomial distribution
F - fixed number of trials F - fixed probability of success I - independent T - Two outcomes
41
cumulative probability of P(X<5)
P(X<_ 4)
42
cumulative probability of P(X>3)
1 - P(X<_3)
43
cumulative probability of P(6
P(X<_10) - P(X<_6)
44
what is normal distribution
used for continuous random variables
45
normal distribution notation
Y ~ N (mean , variance) y is distributed normally
46
what are the points of inflection on normal distribution
mean + /- standard deviation
47
what is the notation of standard normal distribution
Z~N(0, 1^2)
48
coding Z~ Z=
Y-mean/standard deviation
49
when can you approximate binomial distributions as normal distributions
if N is large if p is approximately 0.5 mean = np variance = np(1-p)
50
if you are approximating binomial as normal you are going from discrete to continuous values so you must change the values of the probability. e.g. P(X>5)
= P(Y> 4.5)
51
what is the null hypothesis
H0 what we assume to be true
52
what is alternative hypothesis
H1 what would be true if H0 is wrong
53
what is significance level
the given threshold of likeliness
54
what is a one tailed test
H1 : p>k or p
55
what is a two tailed test
when H1 : p does not equal k half significance level for each end
56
for correlation testing what is H0 and H1 compare r with table for more extreme less extreme
H0: r=0 H1= r>0 or r<0 or r is not 0 if more extreme reject H0 is less extreme no evidence to reject H0
57
for binomial testing what is test statistic H1 H0 assume H0 is
test statistic : number of successes observed H0: p=k H1 = p>k p_ value in question) if p< significance level - reject H0 if p> significance level - no evidence to reject H0
58
for normal testing what is sample mean H0 H1 assume H0 is
sample mean ~ N ( mean, variance/n) H0 : mean = k H1: mean > k mean < k or mean is not K assume sample mean ~N ( k, variance/n) find P(sample mean > or < mean of sample taken) if p significance level - no evidence to reject H0
59