descriptive statistical Flashcards

1
Q

What is descriptive statistics?

A

Descriptive statistics is a branch of statistics that deals with summarizing and organizing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: Descriptive statistics can infer conclusions about a population from a sample.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the two main types of descriptive statistics?

A

Measures of central tendency and measures of variability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: The three measures of central tendency are mean, median, and _____.

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the mean?

A

The mean is the average of a set of numbers, calculated by dividing the sum of the values by the number of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is the median determined?

A

The median is the middle value when the data set is ordered from least to greatest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the mode?

A

The mode is the value that appears most frequently in a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or False: A data set can have more than one mode.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a frequency distribution?

A

A frequency distribution is a summary of how often each value occurs in a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a histogram represent?

A

A histogram represents the frequency distribution of numeric data using bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the range of a data set?

A

The range is the difference between the maximum and minimum values in a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are quartiles?

A

Quartiles are values that divide a data set into four equal parts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the interquartile range (IQR)?

A

The interquartile range is the difference between the first and third quartiles (Q3 - Q1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or False: The standard deviation is a measure of how spread out the values in a data set are.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is variance?

A

Variance is the average of the squared differences from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does a box plot represent?

A

A box plot represents the distribution of a data set based on five summary statistics: minimum, first quartile, median, third quartile, and maximum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Fill in the blank: In a box plot, the line inside the box represents the _____.

A

median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a percentile?

A

A percentile is a measure indicating the value below which a given percentage of observations in a group of observations falls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the purpose of descriptive statistics?

A

The purpose of descriptive statistics is to summarize and describe the main features of a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

True or False: Descriptive statistics can be used for inferential purposes.

21
Q

What type of data is best suited for a pie chart?

A

Categorical data.

22
Q

What is a scatter plot used for?

A

A scatter plot is used to determine the relationship between two quantitative variables.

23
Q

Fill in the blank: The ____ is the sum of all data points divided by the number of points.

24
Q

What does the term ‘skewness’ refer to?

A

Skewness refers to the asymmetry of the distribution of values in a data set.

25
True or False: A positively skewed distribution has a longer tail on the right side.
True
26
What is kurtosis?
Kurtosis is a measure of the 'tailedness' of the probability distribution of a real-valued random variable.
27
What is a cumulative frequency distribution?
A cumulative frequency distribution shows the cumulative total of frequencies up to a certain point.
28
Fill in the blank: The ____ of a data set is the value that occurs most frequently.
mode
29
What is the difference between population and sample in statistics?
A population includes all members of a specified group, while a sample is a subset of the population.
30
What is a dot plot?
A dot plot is a simple graphical display that uses dots to represent the frequency of values in a data set.
31
True or False: Descriptive statistics can help identify outliers in data.
True
32
What is the purpose of using measures of central tendency?
To summarize a data set with a single representative value.
33
What is a stem-and-leaf plot?
A stem-and-leaf plot displays quantitative data while preserving the original data values.
34
Fill in the blank: The ____ is a measure that provides an idea of the average distance of data points from the mean.
standard deviation
35
What is the relationship between variance and standard deviation?
Standard deviation is the square root of variance.
36
What does a normal distribution look like?
A normal distribution is bell-shaped and symmetric about the mean.
37
True or False: In a normal distribution, approximately 68% of the data falls within one standard deviation of the mean.
True
38
What is a frequency polygon?
A frequency polygon is a graphical representation of the frequency distribution of a dataset using line segments.
39
Fill in the blank: The ____ of a data set is calculated as the square root of the variance.
standard deviation
40
What is a bar graph used for?
A bar graph is used to compare different categories of data.
41
What does a negative skew indicate?
A negative skew indicates that the left tail of the distribution is longer or fatter than the right tail.
42
What is a z-score?
A z-score indicates how many standard deviations a data point is from the mean.
43
Fill in the blank: A ____ is a visual display of the distribution of data points in a data set.
histogram
44
What is a two-way table?
A two-way table is used to summarize data that involves two categorical variables.
45
True or False: The mean is always a better measure of central tendency than the median.
False
46
What is a potential drawback of using the mean?
The mean can be heavily influenced by outliers.
47
What type of data is best suited for a bar graph?
Categorical data.
48
What is a scatter plot used for?
To show the relationship between two quantitative variables.