EXAM REVIEW Flashcards

(45 cards)

1
Q

quantitative variable

A

one that takes on a numerical value for a measured or counted quantity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

categorical/qualitative variable

A

one of a limited, usually fixed number of possible values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

discrete variable

A

a quantitative variable that takes a fixed set of possible values with gaps between

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

continuous variable

A

a quantitative variable that can take any value in a interval on the number line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

relative frequency

A

the proportion or % of times a value occurs out of the total # of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

basic parameters for describing distribution

A

Context, shape(skew, uni vs bimodal), center(mean/median), variability(range, IQR or stdev) and outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The mean is ___ than the median if the distribution has a right skew, The mean is __ than the median if the distribution has a left skew.

A

greater than, less than

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

IQR

A

the distance between the 1st and 3rd quartiles of a distribution(Q3-Q1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

1.5 x 1QR rule for outliers

A

outliers < Q1-(1.5IQR); outliers > Q3 + (1.5IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

z-score equation

A

x-mean/stdev

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

No pattern in a residual plot indicates that:

A

a linear model is appropriate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

least squares regression line equation

A

y= a + bx

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

coefficient of determination(r^2)

A

proportion of the response variable variation the can be explained by the explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

correlation coefficient(r)

A

measure direction and strength of association; always between -1 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

r equation(using least square regression line equation)

A

b=r(stdev y/stdev x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

census

A

A study that collects data from every individual in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

cluster sample

A

The population is split into groups (clusters), and entire clusters are randomly chosen to be surveyed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

simple random sample

A

A sample where every individual and group has an equal chance of being selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

stratified sample

A

The population is split into strata (groups) based on a shared trait, and a random sample is taken from each group.

20
Q

systematic random sample

A

Choose a random starting point, then select every nth person from a list.

21
Q

What are the 3 main types of bias?

A

voluntary, under-coverage and nonresponse bias

22
Q

matched pairs design

A

Subjects are paired based on similarity; each pair gets different treatments, or one subject gets both treatments in random order.

23
Q

experimental units

A

The individuals or items that receive treatments in an experiment.

24
Q

completely randomized design

A

All experimental units are randomly assigned to treatment groups, with no grouping beforehand.

25
what is ‘blocking’ in a study?
Subjects are grouped by a variable (like age), and random assignment to treatments is done within each group.
26
sample space
list of all possible outcomes
27
joint probability
P(A and B) if = 0; events are disjoint
28
probability addition rule(can be used to test if two events are mutually exclusive)
P(A U B): P(A) + P(B) - P(A and B) if = 0; exclusive
29
conditional probability
probability of an event occurring, given that something has already happened P(A|B)= P(A and B)/P(B)
30
the expected probability is found by:
adding the products of the amounts and their probabilities
31
binomial variable
count successes in a fixed number of trials
32
geometric variable
model the number of trials until the first success
33
critical value used a 90%, 95% and 99% CI
1.645, 1.96, and 2.576 respectively
34
3 "checks" for a proportion
1) Random 2) Normality(LC): np >= 10/ n(1-p) >= 10 3) 10% condition
35
As sample size increases...
confidence interval decreases, but center remains constant
36
Margin of error x 2 =
width of confidence interval
37
Confidence interval statement
We are ___% confident that the proportion/mean of ___ is between ___ and ___.
38
z-test equation for proportions
sample proportion-population/ stdev
39
The null hypothesis is accepted when...
The statistic is greater than the alpha level
40
Type I vs Type II error
Type 1: false positive Type 2: false negative
41
power
1- Probability of making a Type II error
42
interpreting p-value
If the null hypothesis were true, the probability of observing a proportion/mean/chi-squared value of at least ____ is ____(p-value).
43
What is the difference between a matched pairs vs two-sample t-test?
A two-sample t-test compares the means of two independent groups, while a matched pairs t-test compares the means of two related groups
44
different types of chi^2 tests
Gof: 1 sample, 1 variable Homogeneity: 2 samples, 1 variable Independence: 1 sample, 2 variables acronym: HIG H is the oldest(21); I is the middle(12); G is the youngest(11)
45
what contributes most to the chi-square test statistic?
the observed count farthest away from the expected count