Definitions Flashcards

1
Q
CRV E(X)=
Var(X)=
A

0.5(b+a) or integrate between boundaries and * x

1/12(b-a)^2 or integrate and *x^2 - mean^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Conditions of binomial

A

fixed number of trials

constant probability, independent trial, two outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Conditions of poisson

A

singly, constant rate, independantly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Census =

A

investigation of every member or population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sampling unit =

A

individual member/element of population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sampling frame =

A

list of all the populations/sakmpling units e.g. name or unique ID

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sampling distribution =

A

set of all possible values of the statistic together with their individual probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are samples better than a census

A

quicker and a census would use up all elements of sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

mode =

A

value of x at which maximum occurs, dy/dx = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Median =

A

F(median) =0.25

integrate pdf

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

-ve skew means

A

mean less than median less than mode
Q3-Q2 less than Q2-Q1
quadratic up straight line down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When doing a pdf

A

draw points when y=0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Population =

A

collection of all items

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sample =

A

subset of population intended to represent the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

B —> Po

A

n large > 50 p small <0.2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

B—-> N

A

Continuity correction (as discrete to continuous)
n large (n>50)
p close to 0.5
(np>5, nq>5)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Po—-N

A

lambda large (>20)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Po—-B

A

if only 3ish marks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

X-Po(3)
P(X=2) = 0.5
X-Po(6)
P(X=2) =

A

0.5^2 = 0.25

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Poisson P(1<=X<=4) =

A

P(X<=4) - P(X<1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Statisitic =

A

a r.v. which is some function of a sample and not dependant on any parameters e.g. not mu or sigma but x bar is fine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Sampling distribution =

A

Probability distribution of all values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Hypothesis test =

A

mathematical procedure to examine a value of a population parameter proposed by the null hypothesis compared with an alternative hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Critical region =

A

range of values of a test statistic which would provide enough evidence to reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
If for top tail P(X<=9) > 0.95 then
X>=10
26
Actual significance level means
add values up and should be nearish original significance level
27
When drawing a 'suitable pdf'
think about skewness and logic not always exact normal distribution bell curve for example
28
In hypothesis testing can be more than or less than significance level but should be
as close as possible to it
29
Why do a CCC
Due to going from discrete to continuous so making up for gaps
30
P(|X|<1.5) =
P(-1.5 less than x less than 1.5)
31
2 tail bottom or top
Np < value top | Np > value bottom
32
Significance level in two tail is
Half of original
33
+skew means
mean>median>mode Q3-Q2>Q2-Q1 Majority of data on left Right tail longer
34
Finite population
A population is one in which each individual member can be given a number (a population might be so large that it is difficult or impossible to give each member a number – e.g. grains of sand on the beach).
35
Infinite population
A population is one in which each individual member cannot be given a number.
36
Simple random sample
A simple random sample of size n, is one taken so that every possible sample of size n has an equal chance of being selected. The members of the sample are independent random variables, X1, X2, … , Xn , and each Xi has the same distribution as the population
37
Sample
A selection of sampling units from the sampling frame
38
Sample survey
An investigation using a sample
39
Advantages of a census
Every member of the population is used. It is unbiased. It gives an accurate answer.
40
Disadvantages of a census
It takes a long time. It is costly. It is often difficult to ensure that the whole population is surveyed.
41
Advantages of sampling
Sample will be representative if population large and well mixed. Usually cheaper. Essential if testing involves destruction (life of a light bulb, etc.). Data usually more easily available
42
Disadvantages of sampling
Uncertainty, due to the natural variation – two samples are unlikely to give the same result. Uncertainty due to bias prevents the sample from giving a representative picture of the population
43
Bias comes from
subjective choice incomplete sampling frame *bias cannot be removed by increasing the size of the sample
44
Always remember
Define random variable Give in context +C Continuity correction as discrete to continuous Square root variance in normal distribution
45
Conditions of normal
``` Median = mean = mode Area under curve = 1 Bell shaped Symmetrical Mean and standard deviation parameters ```
46
mean = | variance =
x * P(X=x) or n*p | x^2 * P(X=x) - mean^2 or n^2p - (np)^2
47
P(5<=X<7)
P(X<=6) - P(X<=4)
48
Var(X+Y) | E(X+Y)
Var(X) + Var(Y) | E(X) + E(Y)
49
conditions for Binomial to Normal
n large p close to 0.5 np > 5 n(1-p) = npq > 5
50
Critical regions and Critical values are
Critical values are two specific values while the region is everything inside as well
51
Countably infinite population =
infinite size but each member can be given an individual member
52
Sample mean is.... the population mean | Sample variance is... the population variance
equal to | less than
53
Random sample =
every possibility has equal chance | Randint#(1,50)
54
Coin = Binomial =
(n, 0.5)
55
What is the most accurate p for B-->N
0,,5 so the binomial distribution is reasonably symmetrical
56
Y = Xbar =
sumx/n = statistic (X bar is allowed but mu is not)
57
Population does not always equal the sampling frame because
it is not always possible to keep this list up to date
58
Most accurate approximation is one which
most closely meets the requirements of approximation e.g. how close p is to 0.5
59
CRV P(X=a) =
0 as not a probability density function
60
Poisson events are independent so will occur again at the same rate in the same time
So just square the value or cube etc...
61
n! =
number of ways of ordering a collection of n objects
62
nCr =
n!/(n-r)!r! = number of ways selecting r objects from n
63
Why is Poisson to normal when lambda is large
because distribution would be fairly symmetrical
64
``` CRV = pdf = DRV = pd = ```
probability density function | probability distribution
65
Always show that x = 0 on a
pdf
66
Var(X) =
E(X^2) - E(X)^2
67
E(X^2) =
Var(X) + E(X)^2
68
P(X=4)
P(X<=4) - P(X<=3) = F(4) - F(3)
69
P(X>E(X)) > 0.5 therefore
mean less than median therefore negative skew
70
For two tailed test when not finding critical region make an educated guess on whether > than or < than or just do critical region for both sides
does not equal | half if doing both sides but if just one side then dont
71
E(X) =
``` integrate f(x)*x NOT integrate F(X) must be f(x) so differentiate if need be ```
72
E(X^2) =
``` integrate f(x)*x^2 NOT F(x) ```
73
E(-X^2 + 9X) =
E(-X^2) + E(9X) = 9E(X) - E(X^2)
74
P(X>k) = P(Y
P(Y>n-k)
75
P(X>k) =
P(Y>n-k)
76
P(4<=X<=8) =
P(X<=8) - P(X<=3) draw number line with squares at midpoints of numbers rather than lines
77
J-U[a, b]
Uniform
78
Discrete to continuous P(X=5) =
P(4.5<=X<=5.5)
79
X-B(100,0.975) approximation
Poisson
80
Significance level =
probability of incorrectly rejecting H0
81
Read question if as close as possible to
significance level
82
Sampling frame =
identifier
83
Statistic does not equal
mu or sigma
84
P(mu - ksigma less than X less than mu + ksigma) = 0.5
``` P(Q1 < X < Q3) = 0.5 therefore mu+ ksigma = Q3 etc or P(X = k) = 0.75 ```
85
key words for poisson
rate and randomly scattered
86
CRV: P(X=5)= P(X>=10) =
0 | 1-P(X<=10)
87
DRV P(x>=10) =
1-P(X<=9)
88
P(X...) is very low then
unlikely that parameter probability (p) is correct
89
P(X')^2 =
1 - P(X)^2 | NOT (1-P(X))^2
90
Always check F(X) =
0, 1, previous | and remember +C
91
F(x) put (0,1) at
maximum of sketch
92
2 minutes added to all values find new median and variance
``` E(X+2) = E(X) + 2 Var(X+2) = Var(X) ```
93
F(min x value) =
0
94
Mode =
minimum x value
95
Justify mode is maximum via
second derivative
96
E(5x^2) =
5E(X^2)
97
Given that X=4 over 2 days | Find X=1 on day 1 and X=3 on day 2
X-Po(1 per day) P(X=1) * P(X=4) when X-Po(1) divided by P(X=4) when X-Po(2)
98
Find the probability that 1 car after another car in less than 2 days
X-Po(2 days) | P(more than 1 cars) = P(X>=1)
99
Try not to have an unrounded value when going from discrete to normal distribution for continuity correction
Round to 0 decimal places
100
Diagram for binomial or Poisson is
Prob y axis number of events x axis vertical lines with spaces