Statistics Flashcards
(61 cards)
Which measure of central tendency represents the most frequently occurring value in a dataset?
A) Mean
B) Median
C) Mode
D) Range
C) Mode
Which measure of central tendency is best used when the dataset contains outliers?
A) Mean
B) Median
C) Mode
D) Range
B) Median
- difference between highest and lowest observation in a data
Range
True or False
The median is less affected by outliers compared to the mean, making it a better measure of central tendency when outliers are present
True
Which measure of variability indicates the average distance of each data point from the mean?
A) Range
B) Interquartile range
C) Variance
D) Standard deviation
D) Standard deviation
used to measure how far the data values are dispersed from the mean
variance
True or False
Standard deviation measures the average distance of each data point from the mean, providing insight into the spread of the data.
true
True or False
In a positively skewed distribution, the mean is lower than the median, which is lower than the mode, due to the tail on the right side.
False
In a positively skewed distribution, the mean is greater than the median, which is greater than the mode, due to the tail on the right side
Which of the following is a measure of the central location of a dataset?
A) Standard deviation
B) Variance
C) Median
D) Interquartile range
C) Median
Which measure of spread is defined as the difference between the first quartile and the third quartile?
A) Range
B) Standard deviation
C) Variance
D) Interquartile range
D) Interquartile range
In hypothesis testing, what is the p-value?
A) The probability of accepting the null hypothesis
B) The probability of rejecting the null hypothesis when it is true C) The probability of observing the test results under the null hypothesis
D) The level of significance
C) The probability of observing the test results under the null hypothesis
Which of the following is a Type I error?
A) Rejecting the null hypothesis when it is true
B) Accepting the null hypothesis when it is false
C) Failing to reject the null hypothesis when it is false
D) Failing to accept the null hypothesis when it is true
A) Rejecting the null hypothesis when it is true
If you are going to describe the findings of a survey about what annual income is for the people of Makati City, in which you have both extremely wealthy and extremely poor people, which two measures would you use?
A) Mean and Mode
B) Mean and Range
C) Mean and Standard Deviation
D) Mode and Standard Deviation
C) Mean and Standard Deviation
In a confidence interval, what does the margin of error represent?
A) The range of values within which the population parameter lies B) The standard deviation of the sample
C) The maximum error allowed in the estimate
D) The sample mean
C) The maximum error allowed in the estimate
indicates the range within which we expect the true population parameter to lie
margin of error
What does a confidence level of 95% mean?
A) There is a 95% probability that the sample mean is within the confidence interval
B) 95% of the population data lies within the confidence interval C) 95% of the time, the true population parameter lies within the confidence interval
D) There is a 5% chance that the sample mean lies outside the confidence interval
C) 95% of the time, the true population parameter lies within the confidence interval
What is the purpose of a t-test?
A) To compare the variances of two populations
B) To compare the means of two populations
C) To test the independence of two variables
D) To test the relationship between two variables
B) To compare the means of two populations
values wanted to explain or forecast; values depend on something else; denote it as y
Dependent Variable
explains the other one; denote it as x
Independent Variable
point where the regression line crosses the Y-axis, representing the value of Y when X is zero
intercept
What does the coefficient of determination (R2) indicate?
A) The strength of the linear relationship between two variables
B) The percentage of variation in the dependent variable explained by the independent variable
C) The slope of the regression line
D) The correlation between two variables
B) The percentage of variation in the dependent variable explained by the independent variable
explaining or predicting a single Y variable from two or more X variables
Multiple Regression Analysis
occurs when independent variables in a regression model are highly correlated
Multicollinearity
Which of the following best describes heteroscedasticity in regression analysis?
A) The error terms have constant variance
B) The error terms have increasing or decreasing variance
C) The error terms are normally distributed
D) The error terms are autocorrelated
B) The error terms have increasing or decreasing variance