02 Random Variables Flashcards

1
Q

What is a random variable?

A

A numerical measure of the outcomes of a random phenomenon.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

There are two kinds of random variables. What are they?

A

Discrete and continuous.

  • Discrete random variables can take a finite number of outcomes (eg shoe sizes)
  • Continuous random variables can take an infinite number of values (eg heights)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a probability distribution?

A

A list or graph showing the probabilities of the different possible values the random variable can take

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A bar chart showing the probabilities associated with a discrete random variable is called what?

A

A probability histogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does a probability distribution for a continuous random variable represent the probabilities?

A

As areas under a probability density curve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The mean of a probability distribution is also referred to as the…

A

Expected value, E(X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you calculate the mean of a discrete random variable?

A

μX = Σxi⋅pi = x1⋅p1 + x2⋅p2 + x3⋅p3 + … + xn⋅pn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the variance of a probability distribution measure?

A

How far the data is from the mean.

  • The larger the variance, the more data points that are far from the mean.
  • The smaller the variance, the more data points that are close to the mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is the variance of a probability distribution calculated?

A

Var(X) = σ2X = Σ(XiX)2⋅pi = (X1-μ)2⋅p1 + (X2-μ)2⋅p2 + …+ (Xn-μ)2⋅pn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the standard deviation?

A

Standard deviation, σ, is the square root of the variance:

  • σ = √Var(X)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a population?

A
  • Properties of the probability distribution are population properties.
  • The population is the total number of people/trees/chocolates/objects…. described by the probability distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a sample?

A
  • The data actually measured or observed
  • The sample is the group of people/chocolates/trees whose height/size/age you measure/count
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the Law of Large Numbers?

A

The mean of many trials approaches the true mean of the distribution as the number of trials increases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

μ and σ are the symbols for the population or sample mean and standard deviation?

A

Population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

x̄ and s are the symbols for the population or sample mean and standard deviation?

A

Sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Symbolically, the Law of Large Numbers says what?

A

x̄ → μ as n → ∞

where n = number of trials

17
Q

If X is a random variable and a, b are constants, how are μa+bX and σ2a+bX related to μX and σ2X?

A
  • μa+bX = a + b⋅μX
  • σ2a+bX = a + b2⋅σ2X
18
Q

If X, Y and Z are random variables such that Z = X+Y, then how are the means and variances of X, Y and Z related?

A
  • μZ = μX + μY
  • σ2Z = σ2X + σ2X if X, Y are independent
19
Q

If X, Y and W are random variables such that W = X-Y, then how are the means and variances of X, Y and W related?

A
  • μW = μX - μY
  • σ2W = σ2X + σ2X if X, Y are independent
20
Q

If an event has only two possible outcomes, what distribution can we use if we want to know the probability of getting k successes out of a total number of n trials?

A

The binomial distribution.

21
Q

What are the 4 criteria for the use of a binomial model?

A
  1. There are only two possible outcomes of each observation (success or failure)
  2. There is a fixed number of observations, n
  3. The n observations are independent
  4. The probability of success, p, is the same for each observation
22
Q

What are the parameters of a distribution?

A

Parameters are the numbers that completely characterise a distribution. This means that only they are needed to calculate probabilities; nothing else.

23
Q

What are the parameters of the binomial distribution?

A
  • n* and p; B(n, p)
  • n* = the total number of trials
  • p* = the probability of obtaining a success
24
Q

What is the formula for calculating a probability with the binomial distribution?

A

The probability that there are k successes in n trials is given by P(X=k) = nCk⋅pk⋅(1-p)n-k

25
Q

What is the mean and standard deviation of a binomial distribution?

A
  • μ = np
  • σ = √np(1-p)
26
Q

What are the 4 features of a geometric model?

A
  1. Each observation can be one of only two outcomes - success or failure
  2. Each observation is independent
  3. The probability of success, p, is the same for each observation
  4. We want to find the probability that we need n trials in order to obtain the first success
27
Q

What are the parameters of a geometric distribution?

A

p, the probability of success

28
Q

How do you calculate a probability with the geometric distribution?

A

The probability that the first success is on trial number n is P(X=n) = (1-p)n-1p

29
Q

What is the expected value of a geometric random variable?

A

E(X) = μX = 1/p

30
Q

What is the variance and standard deviation of a geometric random variable?

A

σ2X = 1-p/p2 σ2X = √(1-p)/p

31
Q

What are the 4 key things to know about random variables?

A
  1. a random variable is a numerical measure of the outcome of a random phenomenon
  2. if X is a r.v. and a,b are fixed numbers, then
    E(a+bX) = a+b⋅E(X)
    Var(a+bX) = b2⋅Var(X)
  3. if X and Y are r.v.’s, then
    E(X±Y) = E(X)±E(Y)
  4. if X and Y are independent r.v.’s, then
    Var(X+Y) = Var(X-Y) = Var(X)+Var(Y)