Probability Flashcards

Question

What is the probability of rolling a 4 on a fair six-sided die?

Answer 1

The probability that an event does not occur. P(Not A) = 1 - P(A)

Answer 2

The probability of one or another occurring is the sum of their individual probabilities.

Answer 3

The probability of either event A or B occurring is P(A) + P(B) - P(A and B).

Answer 4

The occurrence of one event does not affect the probability of the other.

Answer 5

A probability puzzle that finds the chance that, in a group of people, at least two will share the same birthday.

Answer 6

The probability of one event occurring given that another has already occurred.

Answer 7

P(A|B) = P(A and B) / P(B)

Answer 8

A formula that describes how to update the probabilities of hypotheses when given evidence.

Answer 9

P(A|B) = [P(B|A) * P(A)] / P(B)

Answer 10

The initial probability of an event before new evidence is considered.

Answer 11

The updated probability of an event after taking new evidence into account.

Answer 12

A probabilistic classifier based on Bayes' Theorem assuming independence between features.

Answer 13

By computing the probability that an email is spam given the presence of certain words or phrases.

Answer 14

A probability puzzle involving choosing one of three doors, with a twist that affects your winning chances if you switch your choice.

Answer 15

In modeling uncertainty, predictions, classifiers like Naive Bayes, and in algorithms involving randomness.

Answer 16

The long-run average value of repetitions of a random experiment. For a discrete random variable, it is the weighted average of all possible outcomes.

Answer 17

The middle value in a sorted list of numbers, separating the higher half from the lower half.

Answer 18

The value that appears most frequently in a dataset.

Answer 19

For a random variable X and function g, E[g(X)] = ∑ g(x) * P(X=x) (discrete) or ∫ g(x) * f(x) dx (continuous).

Answer 20

For any random variables X and Y, E[X + Y] = E[X] + E[Y], regardless of independence.

Answer 21

A measure of how far a set of numbers are spread out from their mean: Var(X) = E[(X - μ)²].

Answer 22

The square root of the variance, providing a measure of dispersion in the same units as the data.

Answer 23

The sum of independent Gaussian random variables is also Gaussian.

Answer 24

Transforming data to have a mean of 0 and a standard deviation of 1: Z = (X - μ)/σ.

Answer 25

Skewness measures asymmetry; Kurtosis measures tail heaviness compared to a normal distribution.

Answer 26

Positive skew = right-tailed; Negative skew = left-tailed. Measures lack of symmetry.

Answer 27

High kurtosis = heavy tails/sharp peak; Low kurtosis = light tails/flat peak.

Answer 28

Values dividing the data into equal-sized intervals (e.g., quartiles divide into quarters).

Answer 29

A graphical display showing data distribution via quartiles, median, and outliers.

Answer 30

A non-parametric way to estimate the probability density function of a random variable.

Answer 31

A combination of a box plot and kernel density plot, showing distribution shape.

Answer 32

Quantile-Quantile plot, comparing two distributions by plotting their quantiles against each other.

Answer 33

The probability distribution of two or more discrete random variables.

Answer 34

The probability distribution of two or more continuous random variables.

Answer 35

Marginal: distribution of a subset ignoring others. Conditional: distribution given another variable's value.

Answer 36

A measure of how much two random variables change together: Cov(X,Y) = E[(X-μₓ)(Y-μᵧ)].

Answer 37

The expected value of the product of deviations from the mean for two random variables.

Answer 38

A matrix where each element (i,j) is the covariance between the i-th and j-th random variables.

Answer 39

A standardized measure of linear dependence: ρ = Cov(X,Y)/(σₓσᵧ), ranging from -1 to 1.

Answer 40

A range of values, derived from sample data, that is likely to contain the true population parameter with a specified confidence level (e.g., 95%).

Answer 41

'We are 95% confident that the true parameter lies within this interval.' (Note: The parameter is fixed; the interval is random.)

Answer 42

1. Sample size (↑n → ↓width), 2. Confidence level (↑confidence → ↑width), 3. Variability (↑σ → ↑width).

Answer 43

The radius of the CI: MoE = Critical Value × Standard Error. For 95% CI (Z=1.96): MoE = 1.96 × (σ/√n).

Answer 44

CI = X̄ ± Z*(σ/√n), where Z is the critical value (e.g., 1.96 for 95% CI).

Answer 45

Use the t-distribution: CI = X̄ ± t*(s/√n), where t depends on degrees of freedom (n-1).

Answer 46

Confidence refers to long-run frequency (e.g., 95% of CIs will contain the true parameter). Probability is about a single event.

Answer 47

CI = p̂ ± Z*√(p̂(1-p̂)/n), where p̂ is the sample proportion.

Answer 48

n = (Z² × σ²) / MoE² (for means) or n = (Z² × p(1-p)) / MoE² (for proportions).

Answer 49

Type I (False Positive): Rejecting a true H₀. Type II (False Negative): Failing to reject a false H₀.

Answer 50

The probability of observing data as extreme as the sample, assuming H₀ is true. Small p-value → reject H₀.

Answer 51

Thresholds in a test statistic's distribution that define rejection regions (e.g., Z=1.96 for α=0.05 two-tailed).

Answer 52

Probability of correctly rejecting H₀ (1 - Type II error). Increases with effect size, sample size, and α.

Answer 53

A test comparing means using the t-distribution (used when σ is unknown or n < 30). Types: One-sample, two-sample, paired.

Answer 54

To compare means of two independent groups (e.g., treatment vs. control). Assumes equal variances (or use Welch’s test).

Answer 55

Compares means of the same group under two conditions (e.g., before/after). Uses differences between pairs.

Answer 56

A statistical method to compare two versions (A/B) to determine which performs better (e.g., webpage conversions).

Answer 57

Use a z-test: z = (p̂₁ - p̂₂) / √(p̂(1-p̂)(1/n₁ + 1/n₂)), where p̂ is the pooled proportion.

Answer 58

A symmetric, bell-shaped distribution with heavier tails than Normal. Approaches Normal as n → ∞.

Answer 59

Right-tailed: H₁ > H₀. Left-tailed: H₁ < H₀. Two-tailed: H₁ ≠ H₀. Determines rejection region direction.

Probability Flashcards

(84 cards)