Recap Statistics 2 Flashcards
Inferential statistics
drawing conclusions or inferences about populations based on sample data (statistical inference)
What is a parameter?
It is a measure of a characteristic of an entire population (a mass of all units under consideration that share common characteristics) based on all the elements within that population. For example, all people living in one city, all-male teenagers in the world, all elements in a shopping trolley, or all students in a classroom.
What is a statistic?
It’s a measure of characteristic saying something about a fraction (a sample) of the population under study. A sample in statistics is a part or portion of a population. The goal is to estimate a certain population parameter.
notation of population parameter
In population parameter, population proportion is represented by P, mean is represented by µ (Greek letter mu), σ2 represents variance, N represents population size, σ (Greek letter sigma) represents standard deviation, σx̄ represents Standard error of mean, σ/µ represents Coefficient of variation, (X-µ)/σ represents standardized variate (z), and σp represents standard error of proportion.
notation sample statistic
In sample statistics, mean is represented by x̄ (x-bar), sample proportion is represented by p̂ (p-hat), s represents standard deviation, s2 represents variance, sample size is represented by n, sx̄ represents Standard error of mean, sp represents standard error of proportion, s/(x̄) represents Coefficient of variation, and (x-x̄)/s represents standardized variate (z).
Probability theory
Probability theory, a branch of mathematics concerned with the analysis of random phenomena. The outcome of a random event cannot be determined before it occurs, but it may be any one of several possible outcomes. The actual outcome is considered to be determined by chance.
- -> the mathematical tool for making inference from a sample to a population in statistics
- -> describes the probability distribution of a random variable; first step to make inference about the population
random variable
A random variable is a numerical description of the outcome of a statistical experiment (random process)
- a statistic based on a sample is treated as a random variable
- written in upper case letter X,Y
Example of a random variable
- the height of a randomly chosen Dutch male
Discrete random variable
values are countable
Discrete Random Variables. … A discrete variable is a variable which can only take a countable number of values. In this example, the number of heads can only take 4 values (0, 1, 2, 3) and so the variable is discrete. The variable is said to be random if the sum of the probabilities is one.
continuous random variable
values are uncountable
180.1,180.01…
Probability distribution
a function or rule that assigns a probability to each possible value in the sample space
P(X=x), or simply P(x)
Example of a discrete probability distribution
Binomial
Binomial Distribution
- n independent trails
- each trial with success rate p (failure rate 1-p)
- the # of successes among n trails is a binomial random variable with binomial distribution
What is special about continuous probability distribution?
- impossible to list all values
- zero probability associated with each single value
- -> focus on the probabilities for a range of values: P(a
What is a continuous probability distribution?defined?
by a probability density function (PDF)