Statistics (science) Flashcards

(34 cards)

1
Q

What is the definition of statistics?

A

Statistics is the science of collecting, analyzing, interpreting, presenting, and organizing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: Descriptive statistics summarize data without making conclusions about the population.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the purpose of inferential statistics?

A

Inferential statistics allow us to make predictions or inferences about a population based on a sample of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: The measure of the center of a dataset is called the _______.

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the standard deviation measure?

A

The standard deviation measures the amount of variation or dispersion in a set of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Multiple choice: Which of the following is a measure of central tendency? A) Variance B) Mode C) Standard deviation

A

B) Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a population in statistics?

A

A population is the entire group of individuals or instances about whom we hope to learn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or False: A sample is a subset of a population.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between qualitative and quantitative data?

A

Qualitative data describes characteristics or qualities, while quantitative data represents numerical values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fill in the blank: The _______ is the value that appears most frequently in a data set.

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range of a data set?

A

The range is the difference between the highest and lowest values in the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Multiple choice: Which of the following is not a type of bias? A) Selection bias B) Confirmation bias C) Normal bias

A

C) Normal bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a hypothesis in statistics?

A

A hypothesis is a testable statement about the relationship between two or more variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or False: A p-value less than 0.05 typically indicates statistical significance.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Fill in the blank: In a normal distribution, approximately _______% of the data falls within one standard deviation of the mean.

A

68

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a confidence interval?

A

A confidence interval is a range of values that is likely to contain the population parameter with a certain level of confidence.

17
Q

Multiple choice: What does a correlation coefficient of -1 indicate? A) No correlation B) Positive correlation C) Perfect negative correlation

A

C) Perfect negative correlation

18
Q

What is the purpose of a regression analysis?

A

Regression analysis estimates the relationships among variables.

19
Q

True or False: Outliers are data points that differ significantly from other observations.

20
Q

Fill in the blank: The _______ is the middle value when the data is ordered from least to greatest.

21
Q

What is the null hypothesis?

A

The null hypothesis is a statement that there is no effect or no difference, and it is what we seek to test against.

22
Q

Multiple choice: Which of the following is a type of data visualization? A) Histogram B) Pie chart C) Both A and B

A

C) Both A and B

23
Q

What does a box plot represent?

A

A box plot represents the distribution of a dataset based on a five-number summary: minimum, first quartile, median, third quartile, and maximum.

24
Q

True or False: The mean is always greater than the median in a skewed distribution.

25
Fill in the blank: A _______ variable is one that can take on a range of values.
continuous
26
What is the significance of the Central Limit Theorem?
The Central Limit Theorem states that the distribution of sample means approaches a normal distribution as the sample size increases, regardless of the population's distribution.
27
Multiple choice: Which test is used to compare the means of two groups? A) T-test B) ANOVA C) Chi-square test
A) T-test
28
What is skewness in statistics?
Skewness measures the asymmetry of the probability distribution of a real-valued random variable.
29
True or False: A larger sample size generally leads to more reliable results.
True
30
Fill in the blank: The _______ is a statistical method used to determine if there is a significant association between two categorical variables.
Chi-square test
31
What does the term 'type I error' refer to?
A type I error occurs when the null hypothesis is incorrectly rejected when it is actually true.
32
What is the purpose of a scatter plot?
A scatter plot is used to determine the relationship between two quantitative variables.
33
Multiple choice: What is the significance level (alpha) in hypothesis testing? A) The probability of rejecting the null hypothesis when it is true B) The probability of accepting the null hypothesis when it is false C) The probability of a type II error
A) The probability of rejecting the null hypothesis when it is true
34
What is the difference between a one-tailed and a two-tailed test?
A one-tailed test looks for an effect in one direction, while a two-tailed test looks for effects in both directions.