Data Science Flashcards by Mahsa Zamanifard

How is the mean and variance of two 🎃INDEPENDENT, NORMALLY DISTRIBUTED🎃 variables calculated? calculate for both addition and subtraction

X+Y=T mean(T)= mean(X)+mean(Y) var(T)^2=var(X)^2+var(Y)^2

X-Y=Z mean(Z)= mean(X)-mean(Y) var(Z)^2=var(X)2+var(Y)^2

How well did you know this?

Not at all

Perfectly

Write the formula of mean and variance, using expected value:

Expected value is basically the same as mean, now:
Here’s how it’s calculated:
mean=E(x) = x*p(x) for all values of x (if they are discrete we use sigma, if they are continuous, we use integral)

For variance: variance =E ((x-E(x))^ 2) (mean of this variable: (x- mean (x))^2 )

How well did you know this?

Not at all

Perfectly

What does central limit theorem say?

The central limit theorem (CLT) states that the distribution of sample means approximates a normal distribution as the sample size gets larger,
🧨 regardless of the population’s distribution. 🧨
Sample sizes equal to or greater than 30 are often considered sufficient for the CLT to hold.

How well did you know this?

Not at all

Perfectly

What is a latent variable?

In statistics, latent variables are variables that are not directly observed but are rather inferred through a mathematical model from other variables that are observed

How well did you know this?

Not at all

Perfectly

What do we mean when we say the sample mean targets the population mean?

Means that it’s a good estimate of the population mean

How well did you know this?

Not at all

Perfectly

What is the variance of the sampling distribution of means telling us? (Considering CLT)

The formula for variance of the sampling distribution of means is: population variance/n, so the larger the sample size, the smaller the variance and the closer the means of samples to the population mean. In extreme form, n is the whole population, and whatever number of samples we take, we’ll have one mean only (variance really small).

How well did you know this?

Not at all

Perfectly

How the sampling distribution would look, if the original distribution is not normal?

It approaches a normal distribution.

How well did you know this?

Not at all

Perfectly

What is the relation between non-normal original distribution and the suitable sample size?

The further the original distribution is from normal, the larger the sample size should be so that the sampling distribution of means approaches normal distribution (typically sample size>=30 will approach normal distribution)

How well did you know this?

Not at all

Perfectly

If the original distribution is normal, the sampling distribution of the means will also be normal. True or False?

True

How well did you know this?

Not at all

Perfectly

Where do we use CLT?

When sample is not normally distributed, we can still use some tools designed for normal distribution using CLT.

How well did you know this?

Not at all

Perfectly

Summarize the 3 parts of CLT; which parts hold true for any sample size?

1-The mean of the sampling distribution of means is a fair estimate of the mean of the population from which the samples were drawn

2-The variance of the sampling distribution of means is a fair estimate to the variance of the population from which the samples were drawn, divided by n

3-If the original distribution is normal, the sampling distribution of the means will also be normal. Otherwise, if n>=30, we can still safely assume normal.

Part 1 and 2, are true for any sample size

How well did you know this?

Not at all

Perfectly

What is the sampling distribution of the mean?

The distribution of the means of the samples taken from the original data. (each sample has a mean, sampling distribution of the means, is the distribution patterns of the means of all the samples taken)

How well did you know this?

Not at all

Perfectly

What is the formula for calculating variance for continuous and discrete variables?

f(x) is the probability distribution function
p(x) is the probability of each discrete value the variable can take
🤓
Continuous Variables:
∫ (x-mean)^2*f(x) dx
🤓
Discrete Variables:
∑ (xi-mean)^2*pi(xi)

How well did you know this?

Not at all

Perfectly

How does adding/subtracting a constant to/from a variable change its variance?

It doesn’t change its variance

How well did you know this?

Not at all

Perfectly

How does multiplying a variable by a constant change its variance?

The new variance= the old variance* constant^ 2

How well did you know this?

Not at all

Perfectly

How does the variance change when we have a set of INDEPENDENT variables added together?

Study These Flashcards

When they are INDEPENDENT we add the variance of each variable:
var (x±y) =var(x) + var(y)

How does the variance change when we have a set of DEPENDENT variables added/subtracted?

Study These Flashcards

var (x+y) =var(x) + var(y) + 2 cov(x, y) (cov=covariance)

var (x-y) =var(x) + var(y) - 2 cov(x, y) (cov=covariance)

How is the STD of x+y calculated?

Study These Flashcards

1-Calculate x+y variance: var(x) + var(y)

2- Take the square root of the x+y variance: √ (var(x) + var(y))

What is The Most Important Probability Distribution for Discrete Random Variables?

Study These Flashcards

When a random variable follows a binomial distribution

What conditions should be met before being certain a distribution is binomial?

Study These Flashcards

1- Probability of success in each separate trial is the same
2- Trials are independent: the result of one trial doesn’t depend on others
3- Fixed number of trials
4- Each trial can be classified as either fail or success

What is the formula for getting x number of successes with n trials in a binomial distribution? And the shorthand of it?

Study These Flashcards

P(x) = [n!/x!(n-x)!] p^{x} q^{n-x}
X ~ B(N,P)
X is a binomial random variable with N trials and success probability of P

What’s the equation for the cumulative probability distribution for binomial distributions?

Study These Flashcards

P(X<=x) =∑ [n!/k!(n-k)!] p^{k} q^{n-k} 0

On which parameter does the shape of Binomial probability distribution (probability vs number of trials) depend? How does it change in relation to the change of this parameter?

Study These Flashcards

P, the probability of success the higher the P(closer to 1), more left skewed the probability distribution is. When P is around .5, it’s almost symmertic.
Explanation: If P is close to one, then let’s say we have 10 trials and we start by 1, meaning that: the probability of just having 1 success in 10 trials, it’s going to be really small since there’s a high chance we succeed more than 1 time. So it’s going to get bigger as we increase the number of successes. Therefore it’s going to be left skewed.

Binomial probability distribution is discrete. True or False?

Study These Flashcards

True

Explain how the binomial probability distribution is plotted

binomial distribution is made of p( number of successes) on y axis and number of trials on the x axis. for example if the x axis has 20 ticks, it means we have 20 trials, if we want to plot the probability of having one success among 20 trials, it would be y=p(X=1) , x=1

How is the mean and std of a binomial variable is calculated? What do they depend on?

``` They depend on the probability of success n= number of trials p= probability of success mean=n*p std=√(np(1-p)) ```

What is the 10% rule for assuming independence between trials?

We can make inferences based on things being close to a binomial distribution or a normal distribution. In case of binomial distribution, if the sample is less than or equal to 10% of the population, then it’s ok to assume approximate independence.

What does binompdf and binomcdf functions take as input and what’s their output?

Binompdf and Binomcdf: both take n: number of trials, p: probability of success, x: number of success. Binompdf output: the probability of exactly x times of success happening out of n trials Binomcdf output: the cumulative probability of up to and including x times of success happening out of n trials.

What is Bernoulli distribution?

The Bernoulli distribution describes events having exactly two outcomes, which are ubiquitous in real life. Some examples of such events are as follows: a team will win a championship or not, a student will pass or fail an exam, and a rolled dice will either show a 6 or any other number.

Data Science Flashcards

(29 cards)