lecture 2 mcqs part 2 Flashcards
(40 cards)
1
Q
- Which of the following is not a characteristic of a normal distribution?
A) Symmetrical
B) Mean equals median equals mode
C) Discrete outcomes
D) Most data falls close to the mean
A
c
2
Q
- In a dataset, if the mean = 50 and standard deviation = 5, approximately 95% of values lie between:
A) 40 and 60
B) 45 and 55
C) 35 and 65
D) 30 and 70
A
C
3
Q
- A histogram is bell-shaped and symmetrical. Which distribution is most likely?
A) Poisson
B) Normal
C) Binomial
D) Exponential
A
B
4
Q
- A dataset has mean = 20, median = 15, mode = 12. What is the likely shape?
A) Symmetric
B) Right-skewed
C) Left-skewed
D) Uniform
A
B
5
Q
- Which summary statistic is least affected by outliers?
A) Mean
B) Mode
C) Standard deviation
D) Median
A
D
6
Q
- What is the range of a theoretical normal distribution?
A) Between 0 and 100
B) Between -3 and +3
C) Infinite
D) Between 1st and 3rd quartile
A
C
7
Q
- What is the first step in assessing normality of your data?
A) Run a t-test
B) Plot a histogram
C) Calculate variance
D) Apply a linear regression
A
B
8
Q
- Which of the following would suggest data are NOT normally distributed?
A) Bell-shaped histogram
B) Mean ≈ Median
C) Kolmogorov-Smirnov test p < 0.05
D) Q-Q plot points forming a straight line
A
C
9
Q
- In a Q-Q plot, a strong deviation from the diagonal line suggests:
A) Normality
B) Non-normality
C) Constant variance
D) Equal means
A
B
10
Q
- What does the standard error quantify?
A) Spread of the population
B) Bias in sampling
C) Variability of the sample mean across samples
D) Deviation of a single data point from the mean
A
C
11
Q
- Which of the following reduces standard error?
A) Increasing variability
B) Decreasing sample size
C) Increasing sample size
D) Increasing standard deviation
A
C
12
Q
- If 75% of observations lie within 1 standard deviation of the mean, what does that suggest?
A) More variable than a normal distribution
B) Less variable than normal
C) Normal
D) Uniform
A
A
13
Q
- The standard error (SE) of the mean is calculated as:
A) SD / n
B) SD × √n
C) SD / √n
D) √(SD)
A
C
14
Q
- In statistical inference, what is the main role of confidence intervals?
A) To identify outliers
B) To estimate the p-value
C) To express uncertainty in an estimate
D) To test causality
A
C
15
Q
- A small sample has mean = 10, SD = 2, n = 4. What is the standard error?
A) 0.5
B) 1.0
C) 1.41
D) 2.0
A
B
16
Q
- Which test checks if a dataset is normally distributed?
A) Paired t-test
B) Chi-squared test
C) Kolmogorov-Smirnov test
D) ANOVA
A
C
17
Q
- What is sampling variation?
A) Data entry error
B) Change in population mean
C) Differences in sample statistics due to chance
D) Variability from bias
A
C
18
Q
- In Lecture 2, coffee consumption showed mean ± SD = 8.1 ± 9.7. What % of observations fell within 1 SD?
A) 68%
B) 75%
C) 90%
D) 95%
A
C
19
Q
- What is a statistic, in the context of inference?
A) A known population parameter
B) A number describing a sample
C) A p-value
D) A model assumption
A
B
20
Q
- If a dataset’s mean = median = mode, the distribution is:
A) Right-skewed
B) Left-skewed
C) Uniform
D) Symmetric
A
D
21
Q
- Which of the following statements about the standard deviation (SD) is true?
A) It describes the spread of individual values in a sample
B) It decreases as sample size increases
C) It measures variability of sample means
D) It estimates the population mean
A
a
22
Q
- What is the difference between standard deviation and standard error?
A) SD describes population spread; SE describes sample spread
B) SD measures spread of data; SE measures spread of sample means
C) SE is always larger than SD
D) They are mathematically identical
A
B
23
Q
- What does the central limit theorem state?
A) Larger samples produce higher standard deviations
B) The population distribution must be normal
C) Sampling distributions of the mean approach normality as n increases
D) Mean = mode = median in all datasets
A
C
24
Q
- Which of the following would not indicate non-normality?
A) A symmetric histogram
B) Skewed boxplot
C) Shapiro-Wilk p < 0.05
D) Q-Q plot points curving away from diagonal
A
A
25
25. Which tool is used to visualise normality most directly?
A) Boxplot
B) Scatterplot
C) Q-Q plot
D) Bar chart
C
26
26. What is a common rule of thumb for the proportion of observations within 2 SDs in a normal distribution?
A) 50%
B) 68%
C) 95%
D) 99.7%
C
27
27. If the standard deviation is large, the data are:
A) Closely clustered around the mean
B) More spread out
C) Uniform
D) Normally distributed
B
28
28. The mean is most appropriate as a summary measure when:
A) The data are skewed
B) There are many outliers
C) The data are symmetrical and continuous
D) The variable is categorical
C
29
29. What is the purpose of statistical inference?
A) To describe the population using sample data
B) To calculate the median
C) To plot the data
D) To reduce sample size
A
30
30. If a sample mean is 12 and SE is 2, what is the 95% confidence interval approximately?
A) 8 to 16
B) 10 to 14
C) 11 to 13
D) 6 to 18
B
31
31. The mean ± 2 SE rule helps approximate:
A) Confidence intervals
B) Medians
C) P-values
D) Sample size
A
32
32. A boxplot displays which features of data?
A) Mean, mode, and SD
B) Minimum, quartiles, and maximum
C) Histogram frequency
D) Normality
B
33
33. In a sample of 100, which will be smaller:
A) Standard error
B) Standard deviation
C) The mean
D) The range
A
34
34. Why do researchers use samples instead of entire populations?
A) Populations are always biased
B) Statistical tests don’t work on populations
C) Sampling is more practical and efficient
D) Samples are more accurate
A
35
35. Which of the following describes sampling distribution?
A) Distribution of data in the population
B) Distribution of a statistic (like the mean) from repeated samples
C) Distribution of the confidence interval
D) Distribution of errors in measurement
B
36
36. What is the main assumption behind using a sample to estimate the population?
A) The sample is biased
B) The sample is randomly selected and representative
C) The population is always normal
D) The standard deviation is known
B
37
37. The mean of sample means from repeated sampling equals:
A) 0
B) Median of population
C) Sample standard error
D) Population mean
D
38
38. A symmetrical boxplot suggests:
A) A categorical variable
B) A uniform distribution
C) A roughly normal distribution
D) A skewed distribution
C
39
39. Which method would be least useful in assessing normality?
A) Q-Q plot
B) Boxplot
C) Histogram
D) Bar chart
D
40
40. A small standard error implies:
A) Greater variability across samples
B) A biased estimator
C) More precise estimate of the population mean
D) The data are skewed
C