Midterm 1 Flashcards

(47 cards)

1
Q

What are the four warnings in creating a histogram?

A

Choice of bin size has a big effect
Changing axis range
Burying explanatory factors
How data is scaled
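
As a rough illustration of the first warning, the sketch below bins the same values with two different bin widths. The data and the helper name `bin_counts` are made up for this example and are not part of the course material.

```python
from collections import Counter

def bin_counts(values, width):
    """Count how many values fall in each bin [k*width, (k+1)*width)."""
    counts = Counter(int(v // width) * width for v in values)
    return dict(sorted(counts.items()))

# Made-up measurements, for illustration only
data = [1, 2, 2, 3, 7, 8, 8, 9]

narrow = bin_counts(data, 2)   # bin width 2: two separate clusters show up
wide = bin_counts(data, 10)    # bin width 10: everything lands in one bin
print(narrow)
print(wide)
```

With the narrow bins the two clusters are visible; with the wide bin they disappear, which is why bin size deserves a second look.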

2
Q

How are strip charts better than histograms?

A

Better for comparing multiple data series

3
Q

What is the second step in sizing up data?

A
Calculate numerical descriptors:
Mean
Median
Mode
Quantiles
Variance
Standard deviation
Min and max
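
A minimal sketch of these descriptors using Python's standard `statistics` module. The sample values are made up, and `statistics.quantiles` uses its default interpolation method, which may differ slightly from the one used in class.

```python
import statistics

# Made-up sample, for illustration only
data = [2, 4, 4, 5, 7, 9, 11]

summary = {
    "mean": statistics.mean(data),
    "median": statistics.median(data),
    "mode": statistics.mode(data),
    "quartiles": statistics.quantiles(data, n=4),  # three cut points
    "variance": statistics.variance(data),         # sample variance, divides by n - 1
    "sd": statistics.stdev(data),                  # square root of the variance
    "min": min(data),
    "max": max(data),
}
for name, value in summary.items():
    print(name, value)
```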
4
Q

What are boxplots?

A

Graphical form of the quantiles

5
Q

What does the line inside a box in a boxplot represent?

A

Line indicates the median

6
Q

What does the box hold in a boxplot?

A

Box holds 50% of points

7
Q

What are the whiskers in a boxplot?

A

The whiskers hold remaining points

8
Q

What is a null hypothesis?

A

A conservative statement that there is no effect

9
Q

What is a p-value?

A

A measure of the strength of the evidence against a null hypothesis

10
Q

How do we find the confidence level?

A

(1 - p) x 100%, where p is the significance level (for example, p = 0.05 gives 95% confidence)

11
Q

What is sums of squares and how is it measured?

A

SSY is how we measure variability

Subtract the grand mean from each value, square each difference, then add them up: SSY = sum of (y - grand mean)^2
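
The calculation can be sketched in a few lines. The function name `sum_of_squares` and the data are assumptions made for this example.

```python
def sum_of_squares(values):
    """SSY: sum of squared deviations of each value from the grand mean."""
    grand_mean = sum(values) / len(values)
    return sum((y - grand_mean) ** 2 for y in values)

data = [3, 5, 7, 9]   # made-up values with grand mean 6
ssy = sum_of_squares(data)
print(ssy)
```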

12
Q

What is the relationship between SSY and n?

A

SSY increases as n grows, because each extra data point adds a squared deviation that cannot be negative

13
Q

How do you find the variance?

A

SSY / (n - 1)
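
A short sketch of the variance formula, cross-checked against the standard library. The data are made up; the last step also takes the square root to get the standard deviation.

```python
import math
import statistics

data = [3, 5, 7, 9]   # made-up values
n = len(data)
mean = sum(data) / n
ssy = sum((y - mean) ** 2 for y in data)

variance = ssy / (n - 1)   # sample variance: SSY / (n - 1)
sd = math.sqrt(variance)   # standard deviation: square root of the variance

# Cross-check against the standard library's sample variance
assert math.isclose(variance, statistics.variance(data))
print(variance, sd)
```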

14
Q

What is the equation for standard deviation?

A

Square root of variance

15
Q

What are the three kinds of variability arising in a data set?

A

Variability of the population (sigma)
Variability of the sample (s)
Variability of the estimated mean (standard error of the mean)

16
Q

Why is the standard error of the mean important?

A

SEM can give us confidence intervals for our estimate of the population mean

17
Q

How do you find confidence intervals?

A

Mean ± t_crit x (SD / square root of sample size)
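
A worked sketch of this interval. The sample is made up, and the critical value 2.262 is the two-sided 95% t value for 9 degrees of freedom read from a standard t table; both are assumptions for illustration.

```python
import math
import statistics

# Made-up sample of n = 10 measurements
data = [4.1, 5.0, 5.3, 4.7, 5.9, 5.2, 4.8, 5.5, 5.1, 4.4]
n = len(data)
mean = statistics.mean(data)
sd = statistics.stdev(data)     # sample standard deviation
sem = sd / math.sqrt(n)         # standard error of the mean

# Two-sided 95% critical t value for n - 1 = 9 degrees of freedom
# (assumption: looked up in a t table, 95% is the level wanted)
t_crit = 2.262

lower = mean - t_crit * sem
upper = mean + t_crit * sem
print(f"mean = {mean:.2f}, 95% CI = ({lower:.2f}, {upper:.2f})")
```

Choosing a larger t_crit (a higher confidence level) widens the interval, which matches the next card.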

18
Q

What is the relationship between estimate range and confidence level?

A

A wider estimate range gives you a higher confidence level

19
Q

What is ANOVA?

A

Analyze the difference among group means. Compare differences in values between treatments to the variation within a treatment group
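
The comparison of between-treatment and within-treatment variation can be sketched as an F ratio. This is a minimal one-way version; the function name `one_way_f` and the three groups are made up for illustration.

```python
def one_way_f(groups):
    """Return the F ratio for a one-way ANOVA on a list of groups."""
    all_values = [y for g in groups for y in g]
    grand_mean = sum(all_values) / len(all_values)

    # Between-treatment sum of squares: group means vs. the grand mean
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-treatment (error) sum of squares: values vs. their own group mean
    ss_within = sum((y - sum(g) / len(g)) ** 2 for g in groups for y in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    # F is the between-treatment mean square over the error mean square
    return (ss_between / df_between) / (ss_within / df_within)

groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]   # made-up treatment groups
f_ratio = one_way_f(groups)
print(f_ratio)
```

A large F means the group means differ by much more than the scatter within each group would suggest.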

20
Q

What is the response variable?

A

A continuous variable that is being influenced

21
Q

What is an explanatory variable?

A

A categorical or continuous variable that influences the response variable

22
Q

In ANOVA, how do you find the total mean square?

23
Q

What is linear regression?

A

Asks whether the value of the response variable (y) can be predicted by the explanatory variable (x)
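
A minimal least-squares sketch of that prediction. The function name `fit_line` and the data are assumptions for illustration; statistical software would also report p-values, which this sketch leaves out.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

x = [1, 2, 3, 4, 5]               # explanatory variable (made up)
y = [2.1, 3.9, 6.2, 7.8, 10.0]    # response variable (made up)
slope, intercept = fit_line(x, y)
predicted = intercept + slope * 6   # predict y at a new x value
print(slope, intercept, predicted)
```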

24
Q

Differences between ANOVA and regression?

A

ANOVA: discrete x values, values are names, values are unordered
Regression: continuously varying, values have number meaning, values are ordered

25
What is statistical elimination?
Including the second explanatory variable in the model allowed us to eliminate its influence from the rest of our model
26
What are the four principles of experimental design?
Replication; randomization; blocking; orthogonality
27
What is replication?
Multiple measures of the same thing; appears in the number of error degrees of freedom (residuals); have at least 10 df for error
28
What is randomization?
Treatments need to be applied to experimental units randomly; use uniformly distributed random numbers
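
One possible way to randomize with equal replication, sketched with Python's `random` module. The function name `assign_treatments`, the plot labels, and the treatment names are made up for illustration.

```python
import random

def assign_treatments(units, treatments, seed=None):
    """Randomly assign treatments to units with equal replication."""
    if len(units) % len(treatments) != 0:
        raise ValueError("units must divide evenly among treatments")
    reps = len(units) // len(treatments)
    labels = treatments * reps       # each treatment repeated `reps` times
    rng = random.Random(seed)        # seed only to make the plan reproducible
    rng.shuffle(labels)              # uniform random permutation of the labels
    return dict(zip(units, labels))

plots = [f"plot{i}" for i in range(1, 13)]
plan = assign_treatments(plots, ["control", "low", "high"], seed=1)
for plot, treatment in plan.items():
    print(plot, treatment)
```

Shuffling a fixed list of labels keeps the replication balanced (four plots per treatment here) while leaving the assignment itself to chance.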
29
What cardinal sins does randomization avoid?
Systematic design: similarity between plots that undermines replication; unconscious bias in assigning treatment groups; using a haphazard vs. random design
30
What is blocking?
Tool to minimize error variation; distribute individual data points into different "blocks" to minimize biases due to known common features of subsets of the points; acts as another explanatory variable
31
What are the rules for block design?
Blocks are used to account for a factor that could influence the response; blocks should be as internally homogeneous as possible; if possible, all treatments should be included in all blocks
32
What is Latin square design?
Two-way blocking: blocking so that each treatment appears exactly once in each row and each column
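
A small sketch of one way to build such a layout, using a simple cyclic shift. The function name `latin_square` and the treatment letters are assumptions; a real design would typically also randomize the order of rows and columns.

```python
def latin_square(treatments):
    """Build a cyclic Latin square: row r is the list shifted left by r."""
    n = len(treatments)
    return [[treatments[(r + c) % n] for c in range(n)] for r in range(n)]

square = latin_square(["A", "B", "C", "D"])
for row in square:
    print(" ".join(row))
```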
33
What is orthogonality?
Knowing the value of one explanatory variable tells you nothing about the value of the other
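
As a rough numerical check of this idea, the sketch below codes two factors as numbers and sums the products of their centered values; a sum of zero means neither factor carries information about the other. The function name `cross_product_sum` and the numeric coding of the levels are assumptions made for this example.

```python
from itertools import product

def cross_product_sum(design):
    """Sum of products of centered factor codes; zero for orthogonal factors."""
    a = [row[0] for row in design]
    b = [row[1] for row in design]
    a_bar = sum(a) / len(a)
    b_bar = sum(b) / len(b)
    return sum((ai - a_bar) * (bi - b_bar) for ai, bi in zip(a, b))

balanced = list(product([0, 1], [0, 1, 2]))   # every combination exactly once
unbalanced = balanced[:-1]                    # one combination is missing

print(cross_product_sum(balanced))
print(cross_product_sum(unbalanced))
```

The balanced design gives zero, while dropping a single combination makes the two factors partly confounded.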
34
What is the benefit of orthogonal design?
There is no statistical elimination between orthogonal explanatory variables
35
For which variables is orthogonality easier to achieve?
Easier for categorical variables than for continuous variables
36
What is the first step in sizing up data?
Make a graph
37
With continuous explanatory variables, what do the p-values in the ANOVA table represent?
They test the null hypothesis that each explanatory variable has no influence on the response variable
38
With continuous variables, what do the p-values represent in the coefficients table?
They test the null hypothesis that each specific coefficient value equals zero
39
With continuous variables, what does the overall p-value mean?
It tests the null hypothesis that none of the explanatory variables can be used to predict the response variable
40
With categorical variables, what does the p-value mean in the ANOVA table?
It tests the null hypothesis that each explanatory variable has no influence on the response variable
41
With categorical variables, what does the p-value mean in the coefficients table?
It tests the null hypothesis that each specific coefficient value equals 0
42
With categorical variables, what does the overall p-value mean in the coefficients table?
It tests the null hypothesis that none of the variables can be used to predict the response variable
43
How does blocking affect residuals?
Blocking helps by reducing the size of the residuals, which increases F and lowers p
44
What is an interaction?
Two x-variables interact if the effect of one x-variable on y depends on the level of the other
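
For a 2x2 design this can be sketched as a "difference of differences" on the cell means. The function name `interaction_contrast` and both tables of means are made up for illustration.

```python
def interaction_contrast(cell_means):
    """Difference of differences for a 2x2 table of cell means.

    cell_means[a][b] is the mean response at level a of factor A
    and level b of factor B. Zero means the lines are parallel.
    """
    effect_of_b_at_a0 = cell_means[0][1] - cell_means[0][0]
    effect_of_b_at_a1 = cell_means[1][1] - cell_means[1][0]
    return effect_of_b_at_a1 - effect_of_b_at_a0

additive = [[10, 14], [12, 16]]      # B adds 4 at both levels of A
interacting = [[10, 14], [12, 22]]   # B adds 4 at A0 but 10 at A1

print(interaction_contrast(additive))
print(interaction_contrast(interacting))
```

A contrast of zero corresponds to parallel lines on an interaction plot; a nonzero contrast corresponds to non-parallel lines.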
45
Regarding interactions, what do non-parallel lines indicate?
There is an interaction
46
Regarding interactions, what do two parallel lines indicate?
There are no interactions
47
How do R-squared values compare with and without interactions?
A model without interactions has a lower R-squared value, while a model with interactions has a higher R-squared value