PAD-L7-intro to Stats-2025-LJ Flashcards

(24 cards)

1
Q

What is the main question addressed in the bullet hole pattern analysis during WWII?

A

How to minimize damage to bombers by identifying sensitive areas

The analysis focused on reinforcing areas that were not shot at on returning planes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is survivorship bias?

A

The error of focusing on the survivors or successful cases while ignoring those that did not survive or succeed

This can lead to incorrect conclusions about the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three key take-home messages from the introduction?

A
  • Understand what question you want to address
  • Understand what data you need to answer a question
  • Understand what data you have and how reliable it is
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a variable?

A

Something that takes on different values that can be measured or counted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two main categories of variables?

A
  • Numerical (quantitative)
  • Categorical (qualitative)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the five types of variables?

A
  • Binary
  • Nominal
  • Ordinal
  • Discrete
  • Continuous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is descriptive statistics?

A

Describing and summarizing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the mean calculated?

A

Add the values of a set of observations together and divide by the number of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the median?

A

The exact middle value in a sorted list of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is the median preferred over the mean?

A

When dealing with skewed distributions or data with outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mode?

A

The value that occurs most frequently in a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the methods to measure variability?

A
  • Variance
  • Standard deviation
  • Range
  • Interquartile range (IQR)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does variance measure?

A

The extent to which each observation deviates from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the standard deviation?

A

The square root of the variance, representing the average of the deviations of observations from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is the range defined?

A

The difference between the largest and smallest observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a percentile?

A

A value indicating the percent of a distribution that is equal to or below it

17
Q

What is the Interquartile Range (IQR)?

A

The difference between the 75th percentile and the 25th percentile

18
Q

What is the purpose of visualizing data with plots?

A

To provide summary pictures that spot patterns, trends, and anomalies in data

19
Q

What is a histogram used for?

A

To plot the distribution of a numeric variable

20
Q

What characterizes a normal distribution?

A

It is symmetric and evenly distributed about the mean

21
Q

What are boxplots used for?

A

To compare groups of continuous data

22
Q

What do scatter diagrams illustrate?

A

The relationship between two continuous variables

23
Q

What is the significance of the second quartile in a boxplot?

A

It represents the median of the data

24
Q

What is the next topic to be covered after descriptive statistics?

A

Formulating a hypothesis