exam 2 Flashcards
(67 cards)
explain mean / pros+cons of it
mean = average
pro = quick idea about the data
con = sensitive to extreme values
explain median / pro
median = middle value of distribution
pro = immune to extreme values
explain mode / pro
mode = the most frequently occurring value
pro = shows which answer is most popular in the dataset
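A minimal sketch of the three measures above, using Python's standard `statistics` module. The ratings are made-up numbers for illustration only; the second list swaps in one extreme value to show why the mean is sensitive to it and the median is not.

```python
import statistics

# Hypothetical 1-7 ratings (invented for illustration).
ratings = [2, 3, 3, 3, 4, 7]

mean_rating = statistics.mean(ratings)      # arithmetic average
median_rating = statistics.median(ratings)  # middle value of the sorted data
mode_rating = statistics.mode(ratings)      # most frequently occurring value

# Same data with one extreme value: the mean moves a lot, the median does not.
with_outlier = [2, 3, 3, 3, 4, 70]
mean_outlier = statistics.mean(with_outlier)
median_outlier = statistics.median(with_outlier)

print("mean:", mean_rating, "->", mean_outlier)
print("median:", median_rating, "->", median_outlier)
print("mode:", mode_rating)
```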
standard deviation / what will get you a bigger standard deviation
measures how much, on average, each value in the data set differs from the mean
if responses are far from the mean, the standard deviation will be bigger
calculate range / what does a large range show about the data set
max value in data set - min value
a large range shows that responses include both very low and very high values
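A small sketch of the two spread measures above. The helper name `describe_spread` and both data sets are hypothetical, chosen only so that one set sits close to its mean and the other sits far from it.

```python
import statistics

def describe_spread(data):
    """Return (sample standard deviation, range) for a list of numbers."""
    sd = statistics.stdev(data)           # sample SD, divides by n - 1
    value_range = max(data) - min(data)   # max value minus min value
    return sd, value_range

# Hypothetical 1-7 responses (invented for illustration).
tight = [4, 4, 5, 5, 4, 5]    # everyone close to the mean
spread = [1, 7, 2, 6, 1, 7]   # responses far from the mean

tight_sd, tight_range = describe_spread(tight)
spread_sd, spread_range = describe_spread(spread)

print("tight: ", round(tight_sd, 2), tight_range)
print("spread:", round(spread_sd, 2), spread_range)
```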
what type of hypothesis testing should you use in this example: On a scale from 1-7 how much do you like the brand?
one sample t-test
why would you use a one sample t-test?
when you are dealing with only one variable; it gives you insight into that single variable
how do you analyze results from a one sample t-test?
by comparing the mean of people's responses with the midpoint of the scale (ex: 4 on a 1-7 scale) to see whether responses differ significantly from the midpoint
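To make the comparison with the midpoint concrete, here is a hedged sketch of the one sample t statistic, t = (mean - test value) / (SD / sqrt(n)), using only the standard library. The function name `one_sample_t` and the ratings are hypothetical. Statistics software turns t and the degrees of freedom into the p value; this sketch stops at t.

```python
import math
import statistics

def one_sample_t(values, test_value):
    """Return (t statistic, degrees of freedom) for a one sample t-test."""
    n = len(values)
    mean = statistics.mean(values)
    standard_error = statistics.stdev(values) / math.sqrt(n)
    return (mean - test_value) / standard_error, n - 1

# Hypothetical answers to "On a scale from 1-7 how much do you like the brand?"
ratings = [5, 6, 4, 7, 5, 6, 5, 6]
midpoint = 4  # midpoint of a 1-7 scale

t_stat, df = one_sample_t(ratings, midpoint)
print("mean:", statistics.mean(ratings), "t:", round(t_stat, 3), "df:", df)
```

A positive t here means the mean response sits above the midpoint; whether that gap is significant still depends on the p value reported by the software.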
what types of hypothesis testing techniques do we use if we are dealing with 2 variables in cause and effect relationships?
bivariate techniques
what type of hypothesis testing should you use in this example: Are students or faculty more likely to use the internet?
what are the dependent and independent variable(s)?
independent sample t-test
independent variable = students or faculty
dependent variable = how much they use the internet
when would we use an independent sample t-test?
use when comparing two groups of responses to each other
what type of hypothesis testing should you use in this example:
Hypothesis: in communities where air pollution is a problem (vs not a problem), consumers are more likely to purchase an EV.
what are the independent variable(s)? dependent variable(s)?
independent sample t-test
independent = air pollution presence (categorical because the answer is just yes or no)
dependent = likelihood of purchasing an EV (continuous because it could be measured on a 1-7 scale)
when running an independent sample t-test, how should i analyze the results? step by step
- look at the “two sided p value” section. is it less than 0.05? if so, we can conclude that there is a significant difference between the two groups
- compare the two group means. which one is higher? if the p value was significant, the group with the higher mean scores higher on the dependent variable (ex: is more likely to do whatever it is), assuming higher scale values mean "more"
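The steps above can be sketched as follows. This is a minimal illustration of the independent sample t statistic with a pooled variance (it assumes roughly equal variances in the two groups); the function name `independent_t` and the EV purchase ratings are made up. The two sided p value itself comes from statistics software.

```python
import math
import statistics

def independent_t(group_a, group_b):
    """Return (t statistic, degrees of freedom) using a pooled variance."""
    n_a, n_b = len(group_a), len(group_b)
    mean_a, mean_b = statistics.mean(group_a), statistics.mean(group_b)
    var_a, var_b = statistics.variance(group_a), statistics.variance(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    standard_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / standard_error, df

# Hypothetical 1-7 likelihood of purchasing an EV, split by the categorical
# independent variable (air pollution a problem: yes / no).
pollution_yes = [6, 5, 7, 6, 6]
pollution_no = [3, 4, 2, 4, 2]

t_stat, df = independent_t(pollution_yes, pollution_no)
higher_group = "yes" if statistics.mean(pollution_yes) > statistics.mean(pollution_no) else "no"
print("t:", round(t_stat, 3), "df:", df, "higher mean:", higher_group)
```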
what type of hypothesis testing should you use in this example:
how much do you like strawberry ice cream? how much do you like vanilla ice cream?
paired sample t-test
because we are comparing strawberry and vanilla on the basis of two continuous scales (each question SHOULD be on a 1-7 scale)
when should you use an independent sample t-test?
when we compare 2 groups of responses to each other
when do we use a paired sample t-test?
When we need to compare means of two scores
when we need to compare two variables in the data which should be measured on a continuous scale
what type of hypothesis testing should you use in this example:
let's predict that people purchase an EV to help the environment more than to save money on gas.
what questions should we ask?
paired sample t-test
questions: 1. one that measures whether people purchase an EV to save on gas 2. one that measures whether people purchase an EV to help the environment
how do we conduct a paired sample t-test analysis? step by step
- look at whether the mean of one variable is higher than the mean of the other
- confirm the difference by looking at the p value. is it less than 0.05?
- if so, we can conclude that the variable with the higher mean = people are more likely to do that thing (assuming higher scale values mean "more")
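A hedged sketch of the paired analysis above: a paired sample t-test works on the difference between each person's two scores. The function name `paired_t` and the EV ratings are hypothetical; the p value would come from statistics software.

```python
import math
import statistics

def paired_t(scores_a, scores_b):
    """Return (t statistic, degrees of freedom) for paired scores."""
    # Each respondent answers both questions, so subtract within respondent.
    differences = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(differences)
    standard_error = statistics.stdev(differences) / math.sqrt(n)
    return statistics.mean(differences) / standard_error, n - 1

# Hypothetical 1-7 agreement ratings from the same six respondents.
help_environment = [6, 5, 7, 6, 5, 6]  # "I would buy an EV to help the environment"
save_on_gas = [4, 5, 5, 4, 4, 5]       # "I would buy an EV to save money on gas"

t_stat, df = paired_t(help_environment, save_on_gas)
print("means:", statistics.mean(help_environment), statistics.mean(save_on_gas))
print("t:", round(t_stat, 3), "df:", df)
```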
when do you use an ANOVA? what types of variables do the X and Y need to be?
use when you compare two or more groups. your independent variable (X) has to be categorical and your dependent variable (Y) needs to be continuous
what type of hypothesis testing should you use in this example:
different age groups and their frequency of using Disney Plus subscriptions
why?
ANOVA because you are comparing a continuous dependent variable (frequency of use) across a categorical independent variable (age group)
how do you analyze an ANOVA table to get results? step by step
- look at the means. are there any categories that have higher means than others?
- look at significance. is it below 0.05? if so, we see that there is a difference between the groups.
- if there is a difference between group means, you need to find out which groups differ.
- look at follow up post hoc tests (ex: Scheffe)
- find the “significance” section under the multiple comparisons table. are there any significance values below 0.05? if so, we can conclude that there is a significant difference between those two groups. if the sig value is over 0.05, then there is no sig difference between those two groups
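As a rough illustration of what the ANOVA table summarizes, the sketch below computes the F statistic by hand: variation between group means divided by variation within groups. The function name `one_way_anova_f`, the age group labels, and the usage numbers are all hypothetical. It does not compute the significance value or the post hoc comparisons; those come from statistics software.

```python
import statistics

def one_way_anova_f(groups):
    """Return (F statistic, df between, df within) for a list of groups."""
    all_values = [value for group in groups for value in group]
    grand_mean = statistics.mean(all_values)
    # Between groups: how far each group mean sits from the grand mean.
    ss_between = sum(
        len(group) * (statistics.mean(group) - grand_mean) ** 2 for group in groups
    )
    # Within groups: how far each value sits from its own group mean.
    ss_within = sum(
        (value - statistics.mean(group)) ** 2 for group in groups for value in group
    )
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical streaming frequency (days per week) by age group.
usage_by_age = {
    "18-29": [6, 7, 6, 5],
    "30-49": [4, 5, 4, 3],
    "50+": [2, 3, 2, 1],
}

f_stat, df_between, df_within = one_way_anova_f(list(usage_by_age.values()))
for label, values in usage_by_age.items():
    print(label, "mean:", statistics.mean(values))
print("F:", round(f_stat, 2), "df:", df_between, df_within)
```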
explain linear relationships
association between two variables wherein the strength and nature of the relationship remains the same over the range of both variables
explain curvilinear relationships
a relationship between two variables wherein the strength/direction of the relationship changes over the range of both variables
when would we use a correlation?
when we are examining the relationship between two non-categorical variables (interval or ratio variables)
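A minimal sketch of a Pearson correlation between two interval or ratio variables, computed by hand with the standard library. The function name `pearson_r` and both data sets are invented for illustration. The second example is an inverted U shape: Pearson r only captures linear association, so a curvilinear relationship can produce an r near zero even though the variables are clearly related.

```python
import math
import statistics

def pearson_r(x, y):
    """Return the Pearson correlation coefficient between x and y."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    co_deviation = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return co_deviation / math.sqrt(ss_x * ss_y)

# Hypothetical data: hours of use per week vs. a 1-7 satisfaction rating.
hours = [1, 2, 3, 4, 5]
satisfaction = [2, 4, 5, 4, 5]
r_linear = pearson_r(hours, satisfaction)

# Hypothetical curvilinear (inverted U) pattern over the same x values.
inverted_u = [1, 4, 5, 4, 1]
r_curved = pearson_r(hours, inverted_u)

print("roughly linear:", round(r_linear, 3))
print("inverted U:", round(r_curved, 3))
```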