Random Variables and Distributions Flashcards by Joanna Li

Sample vs population - which letters to denote?

sample is a subgroup of population
Greek letters for properties of POPULATION
roman letters for properties of SAMPLE

How well did you know this?

Not at all

Perfectly

define probability

predicting properties of a SAMPLE

How well did you know this?

Not at all

Perfectly

define statistics

deducing information about the POPULATION from the sample

How well did you know this?

Not at all

Perfectly

define cumulative probability (both discrete and continuous)

F(x) = P(X≤x)
sum of all probabilities up to X=x for discrete

integration from -inf to x for continuous
CDF F(x) obtained by integrating PDF f(x), vice versa by differentiating

How well did you know this?

Not at all

Perfectly

manipulating expectations:
E[aX+b] =

a E[X] + b

How well did you know this?

Not at all

Perfectly

expectation of a sum E[A+B] is equal to…

the sum of expectations
= E[A] + E[B]

How well did you know this?

Not at all

Perfectly

Manipulating variances: Var[aX+b] =

a^2 Var[X]

proof by expanding Var[X] = E[X^2] - (E[X])^2 definition

How well did you know this?

Not at all

Perfectly

under what conditions is it true that Var[X+Y] = Var[X] + Var[Y] = Var[X-Y] ?

only for INDEPENDENT X and Y

How well did you know this?

Not at all

Perfectly

standard deviation quantifies the…

width of a probability distribution

How well did you know this?

Not at all

Perfectly

Define skewness. How is it calculated? What does a -ve, +ve and skewness of 0 mean?

skewness is a measure of the asymmetry of a distribution.

Skew[X] = E[[X-µ)^3] / σ^3

negative skew = tail to the left
positive skew = tail to the right
0 skew usually symmetric

How well did you know this?

Not at all

Perfectly

define kurtosis. how is it calculated?

kurtosis is a measurement of how much WEIGHT of a distribution lies in its TAILS
(ie. how likely it is to observe extreme values)

Kurt[X] = E[(X-µ)^4] / σ^4

How well did you know this?

Not at all

Perfectly

define excess kurtosis - how is it calculated?

comparing kurtosis to that of a normal distribution, ie. how much more weight is in the tails

XS Kurt = Kurt[X] - 3

-ve XS kurt means less weight is in the tails compared to normal

How well did you know this?

Not at all

Perfectly

define moment of a distribution. how is the nth moment calculated?

nth moment of a distribution:
M(n) = E[X^n]

How well did you know this?

Not at all

Perfectly

what is a moment generating function?

some function m(t) such that the limit as t–>0 of the nth derivative wrt t gives the nth moment, M(n).

How well did you know this?

Not at all

Perfectly

how to find the moment generating function m(t)

m(t) = E[exp(tX)] = integral of exp(tx) f(x) for some continuous PDF f(x)

How well did you know this?

Not at all

Perfectly

what does the random variable for the Binomial distribution describe?

Study These Flashcards

the number of times, m, that a particular event happens in n independent measurements
constant probability of success, p
only two outcomes

the random variable in the Poisson distribution describes…

Study These Flashcards

the number of times, m, a random event occurs in a specified time interval.
Constant chance of occurrence.

probability of event occurring is proportional to length of time period.

will often be given the avg number of occurrences per interval (ie. lambda)

examples of exponential distributions

Study These Flashcards

residence time of molecules in a CSTR
lifetime of reactant molecules in batch reactor for first order reaction

moment generating function of exponential distributions

Study These Flashcards

m(t) = λ/λ-t

how to change variables for normal distribution to standardise to 𝛷(z)

Study These Flashcards

z = (x-µ) / σ

new random variable: Z = (X-µ)/σ
such that Z~N(0,1) which has a CDF of 𝛷(z)

where X~N(µ,σ)

a binomial dist can be approximated as normal if…

Study These Flashcards

np > 5 and
nq = n(1-p) >5

a poisson dist can be approximated as normal if…

Study These Flashcards

lamda > 15

when approximating a discrete distribution as continuous (normal), one must…

Study These Flashcards

APPLY A CONTINUITY CORRECTION

less than –> 0.5 higher
more than –> 0.5 lower

Bayes’ Theorem

Study These Flashcards

think probability tree branches
P(A|B) = P(B|A) * P(A)/P(B)

what is a joint probability distribution?

probability of X and Y having particular outcomes fXY(x, y) Cumulative FXY(x,y) calculated by a double sum or double integral

define marginal probability distribution

fX(x): probability that X takes a particular value, irrespective of Y value ie. fXY(x,y) integrated / summed over all y values

definition of conditional probability P(A|B)

P(A|B) = P(A∩B) / P(B) analogous for conditional joint probabilities: fX|y(x) = fXY(x,y) / fY(y) where fX|y(x) is prob dist of X given Y has outcome y

Pearson Correlation coefficient

𝜌 = Cov[X,Y] / √Var[X]Var[Y] = covariance / product of sd's

covariance of two independent variables

covariance = 0 so that E[XY] = E[X]E[Y] (note that cov=o does not necessarily mean independent)

Var[X+Y] = general case

Var[X+Y] = Var[X] + Var[Y] + 2Cov[X,Y] if indep, Cov=0

Random Variables and Distributions Flashcards

(30 cards)