Topic 13 Statistics Flashcards
Revision (45 cards)
What is the definition of statistics?
Statistics is the branch of mathematics dealing with data collection, analysis, interpretation, presentation, and organization.
True or False: A population includes all members of a specified group.
True
What is a sample in statistics?
A sample is a subset of a population used to represent the whole population.
What is the difference between descriptive and inferential statistics?
Descriptive statistics summarize and describe the characteristics of a dataset, while inferential statistics use a sample to make predictions or inferences about a population.
Fill in the blank: The _____ is the average of a set of numbers.
mean
What is the median?
The median is the middle value in a list of numbers ordered from least to greatest.
What is the mode?
The mode is the value that appears most frequently in a data set.
True or False: The range is the difference between the highest and lowest values in a data set.
True
What is a frequency distribution?
A frequency distribution is a summary of how often each different value occurs in a dataset.
Define ‘outlier’ in statistics.
An outlier is a data point that differs significantly from other observations in a dataset.
What is a histogram?
A histogram is a graphical representation of the distribution of numerical data, using bars to show the frequency of data within certain intervals.
What does a box plot display?
A box plot displays the median, quartiles, and potential outliers of a dataset.
What is variance?
Variance is a measure of how much values in a dataset differ from the mean.
What is standard deviation?
Standard deviation is the square root of the variance and measures the dispersion of a dataset.
True or False: A lower standard deviation indicates that data points tend to be close to the mean.
True
What is a probability?
Probability is a measure of the likelihood that an event will occur, expressed as a number between 0 and 1.
Fill in the blank: The _____ of an event is the number of favorable outcomes divided by the total number of possible outcomes.
probability
What is a random variable?
A random variable is a variable whose possible values are numerical outcomes of a random phenomenon.
Define ‘normal distribution’.
Normal distribution is a probability distribution that is symmetric about the mean, showing that data near the mean are more frequent in occurrence.
What is the Central Limit Theorem?
The Central Limit Theorem states that the distribution of sample means approaches a normal distribution as the sample size increases, regardless of the population’s distribution.
What is a null hypothesis?
A null hypothesis is a statement that there is no effect or no difference, and it is the hypothesis that researchers aim to test.
What is a p-value?
A p-value is the probability of obtaining results at least as extreme as the observed results, assuming that the null hypothesis is true.
True or False: A smaller p-value indicates stronger evidence against the null hypothesis.
True
What is a confidence interval?
A confidence interval is a range of values derived from a data set that is likely to contain the value of an unknown population parameter.