stats other stuff Flashcards

(34 cards)

1
Q

how do you calculate a marginal distribution

A

add up every value in one row or column of a two-way table, then divide that total by the table’s grand total

2
Q

how do you calculate a conditional distribution

A

take the count for one specific cell and divide it by the total for the category you are conditioning on (ex: girls with blue eyes divided by all girls, even when men with different colored eyes are also included in the table)
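
The two calculations above (marginal and conditional) can be sketched in a few lines of Python. The table counts and the names `table`, `marginal_gender`, and `eye_color_given_girls` are made up for illustration; they are not from the deck.

```python
# Hypothetical two-way table of counts: rows are gender, columns are eye color.
table = {
    "girls": {"blue": 12, "brown": 18},
    "boys": {"blue": 8, "brown": 22},
}

# Grand total of the whole table (60 for these made-up counts).
grand_total = sum(sum(row.values()) for row in table.values())

# Marginal distribution of gender: each row total divided by the grand total.
marginal_gender = {
    gender: sum(row.values()) / grand_total for gender, row in table.items()
}

# Conditional distribution of eye color given "girls":
# each cell in the girls row divided by the girls row total.
girls_total = sum(table["girls"].values())
eye_color_given_girls = {
    color: count / girls_total for color, count in table["girls"].items()
}

print(marginal_gender)        # {'girls': 0.5, 'boys': 0.5}
print(eye_color_given_girls)  # {'blue': 0.4, 'brown': 0.6}
```

Note that the marginal uses the grand total as the denominator, while the conditional uses only the total of the group being conditioned on.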

3
Q

what is a confounding variable

A

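an outside variable that is related to both the explanatory and response variables, so its effect on the response can’t be separated from the effect of the explanatory variable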

4
Q

what type of graph is good for categorical data

A

bar graphs and pie charts

5
Q

what type of graph is good for quantitative data

A

dot plots (histograms and stem and leaf plots also work)

6
Q

why is it important to start the y axis of a graph on 0

A

starting the y axis at a higher number can exaggerate the differences between values

7
Q

what acronym should you use to describe a distribution’s properties

A

SOCS (shape, outliers, center, and spread)

8
Q

how do you interpret a stem and leaf plot

A

Digits in the stem represent the number’s first digit(s), and each leaf represents the number’s final digit
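
As a rough illustration of the stem/leaf split, here is a minimal sketch that assumes non-negative whole numbers and uses the last digit as the leaf. The function name `stem_and_leaf` and the `scores` data are hypothetical.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Group whole numbers by stem (all digits but the last) and leaf (last digit)."""
    stems = defaultdict(list)
    for v in sorted(values):
        stems[v // 10].append(v % 10)  # stem = leading digit(s), leaf = final digit
    return dict(stems)


scores = [52, 47, 61, 58, 49, 63, 55]  # made-up data
plot = stem_and_leaf(scores)
for stem in sorted(plot):
    print(stem, "|", " ".join(str(leaf) for leaf in plot[stem]))
# 4 | 7 9
# 5 | 2 5 8
# 6 | 1 3
```

Reading the first row back: stem 4 with leaves 7 and 9 stands for the values 47 and 49.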

9
Q

what type of data uses histograms

A

quantitative

10
Q

what is the difference between bar graphs and histograms

A

histograms are for quantitative data and each bar represents a range of values, but bar graphs are for categorical data and each bar represents one category

11
Q

what is the difference between X bar and Mu (symbols for the mean)

A

X bar represents the sample mean, while Mu represents the population mean

12
Q

which type of distribution is described using mean and standard deviation

A

A roughly symmetric distribution with no outliers, such as a Normal distribution

13
Q

what type of distribution is described using median and IQR

A

A skewed distribution, or one with outliers

14
Q

what percentile is Q1

A

25th

15
Q

what percentile is Q2

A

50th

16
Q

what percentile is Q3

A

75th (meaning 75% of the data values fall below the value of Q3)

17
Q

how do you calculate the high outliers in a data set

A

high outliers are points greater than what you get when you calculate Q3 + (IQR x 1.5)

18
Q

how do you calculate the low outliers in a data set

A

low outliers are points less than what you get when you calculate Q1 - (IQR x 1.5)
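
The 1.5 x IQR rule for both high and low outliers can be sketched as below. Quartile conventions differ between textbooks and software; this sketch assumes the "median of each half" convention (the overall median is left out of both halves when the count is odd). The helper names and the data are hypothetical.

```python
from statistics import median


def quartiles(values):
    """Return (Q1, median, Q3) using the median-of-halves convention (an assumption)."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    # Skip the middle value when the number of points is odd.
    upper = data[half + 1:] if len(data) % 2 else data[half:]
    return median(lower), median(data), median(upper)


def outlier_fences(values):
    """Return (low fence, high fence) from the 1.5 x IQR rule."""
    q1, _, q3 = quartiles(values)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


data = [2, 4, 5, 6, 7, 8, 9, 30]  # made-up data
low_fence, high_fence = outlier_fences(data)
outliers = [x for x in data if x < low_fence or x > high_fence]

print(low_fence, high_fence)  # -1.5 14.5
print(outliers)               # [30]
```

Here Q1 = 4.5 and Q3 = 8.5, so IQR = 4 and the fences sit 6 units beyond each quartile.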

19
Q

what does adding or subtracting a constant change in a distribution

A

measures of center (mean and median) shift by that constant, but measures of spread (IQR or standard deviation) do not change
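
A quick check of this with made-up numbers: adding 5 to every value moves the mean and median up by 5 and leaves the standard deviation alone. The `scores` list is hypothetical, and `pstdev` (population standard deviation) is used only for convenience.

```python
from statistics import mean, median, pstdev

scores = [70, 75, 80, 85, 100]     # made-up data
shifted = [x + 5 for x in scores]  # add the same constant to every value

# Center moves by exactly the constant that was added.
mean_change = mean(shifted) - mean(scores)        # 5
median_change = median(shifted) - median(scores)  # 5

# Spread is unchanged, because every point moved by the same amount.
spread_change = pstdev(shifted) - pstdev(scores)  # 0 (up to rounding)

print(mean_change, median_change)
print(abs(spread_change) < 1e-9)  # True
```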

20
Q

what acronym should you use to describe correlation and scatterplots

A

DUFS (direction, unusual points, form: whether it’s linear or not, and strength)

21
Q

what is the meaning of r^2

A

the fraction of the variation in the response variable that can be accounted for or explained by its linear relationship with the explanatory (independent) variable
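
As a small worked example, r can be computed from the sums of squares and then squared. The `correlation` helper and the `hours` and `scores` data are hypothetical, written out by hand so the arithmetic is visible.

```python
from math import sqrt
from statistics import mean


def correlation(xs, ys):
    """Pearson correlation r for two equal-length lists of quantitative data."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


hours = [1, 2, 3, 4, 5]   # made-up explanatory values
scores = [2, 4, 5, 4, 5]  # made-up response values

r = correlation(hours, scores)
r_squared = r ** 2

print(round(r, 4))          # 0.7746
print(round(r_squared, 4))  # 0.6
```

For this data, about 60% of the variation in `scores` is accounted for by the linear relationship with `hours`.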

22
Q

what does correlation require

A

that both variables are quantitative

23
Q

how do you calculate residuals

A

Actual value - predicted value
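
A minimal sketch of the subtraction, assuming a prediction line of y-hat = 2.2 + 0.6x. The line, the `predict` helper, and the data are all made up for illustration.

```python
def predict(x):
    """Hypothetical prediction line: y-hat = 2.2 + 0.6x."""
    return 2.2 + 0.6 * x


xs = [1, 2, 3, 4, 5]      # made-up explanatory values
actual = [2, 4, 5, 4, 5]  # made-up observed responses

# residual = actual value - predicted value
residuals = [y - predict(x) for x, y in zip(xs, actual)]

print([round(res, 2) for res in residuals])  # [-0.8, 0.6, 1.0, -0.6, -0.2]
```

A positive residual means the point sits above the line (the prediction was too low); a negative residual means it sits below.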

24
Q

a census…

A

collects data from every individual in a population

25
Q

what is a simple random sample

A

a sample taken from a population where every individual, and every possible group of the chosen size, has an equal chance of being chosen
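
One way to draw such a sample in Python is `random.sample`, which picks without replacement so that every group of the requested size is equally likely. The population of 100 labeled students and the seed value are assumptions made only for this sketch.

```python
import random

# Hypothetical population: 100 labeled individuals.
population = [f"student_{i}" for i in range(1, 101)]

# Seeded only so the example is repeatable; drop the seed for a real draw.
rng = random.Random(42)

# Simple random sample of 10, chosen without replacement.
sample = rng.sample(population, k=10)

print(len(sample))       # 10
print(len(set(sample)))  # 10 (no individual is chosen twice)
```
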
26
Q

blocking helps to

A

reduce the influence of possible confounding variables by separating individuals into groups (blocks) that share a characteristic before treatments are randomly assigned

27
Q

undercoverage bias

A

some members of the population can’t be chosen for the sample

28
Q

nonresponse bias

A

a chosen individual can’t be contacted or refuses to participate

29
Q

response bias

A

individuals give inaccurate answers (ex: when individuals lie)

30
Q

wording of a question bias

A

when the wording of the question pushes people toward certain answers

31
Q

what is a control in statistics

A

when other variables that may affect the response are kept the same for all groups

32
Q

replication

A

use enough individuals in each group that real effects can be told apart from chance variation

33
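Q

statistically significant

A

when an observed effect is so large that it would rarely happen by chance alone; on its own this does not prove causation, but in a well-designed randomized experiment it supports a cause-and-effect conclusion
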
34