250A Midterm Flashcards
advantages and disadvantages of the mode
+ actually occurs in data
+ only thing that makes sense for nominal data
+ not affected by outliers
- cannot be manipulated mathematically bc there is no formula for it
advantages and disadvantages of the median
+ not affected by outliers
+ does not require interval assumptions
+ makes absolute error as small as possible
- does not enter into equations nicely
- cannot be decomposed
- poor estimator of the population value
advantages and disadvantages of the mean
+ unbiased (best estimate of pop mean)
+ makes average squared errors as small as possible
+ has a mathematical formula so can be manipulated
- affected by outliers
- need interval scale
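The two "makes error small" claims above are easy to check numerically. A quick Python sketch with made-up numbers (the dataset and the helper names `sse`/`sae` are just for illustration):

```python
data = [1, 2, 3, 4, 100]              # made-up scores with one outlier
mean = sum(data) / len(data)          # 22.0
median = sorted(data)[len(data) // 2] # 3 (odd n, so the middle value)

def sse(c):
    """Sum of squared errors around a candidate center c."""
    return sum((x - c) ** 2 for x in data)

def sae(c):
    """Sum of absolute errors around a candidate center c."""
    return sum(abs(x - c) for x in data)

# the mean wins on squared error, the median wins on absolute error
```

Note how the outlier drags the mean (22.0) far from the bulk of the data while the median (3) stays put.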
+ and - of IQR
+ good for boxplots
+ does not assume normality of data
- throws away too much data
what does variance mean?
variance tells us the average squared deviation from the mean – how much, on average, each observation differs from the mean in squared units.
proportional to average squared difference between all pairs of observations so it summarizes both how different scores are from each other and how different they are from the mean!
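Both readings of the variance can be verified on a small made-up dataset; the pairwise version is exactly sum_ij (xi - xj)^2 / (2 n^2) when variance is computed with a divisor of n:

```python
data = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up scores; mean = 5
n = len(data)
mean = sum(data) / n

# variance as the average squared deviation from the mean (dividing by n here)
var = sum((x - mean) ** 2 for x in data) / n   # 4.0

# same number from all pairwise differences
pairwise = sum((xi - xj) ** 2 for xi in data for xj in data) / (2 * n * n)
```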
what is standard deviation?
SD is the square root of the variance – roughly the average deviation from the mean, i.e., how much a typical observation differs from the mean, in the original units of the variable
why do we use variance and sd instead of mean deviation and mean absolute deviation?
sum of mean deviations is always 0 so this doesn’t tell us anything
var and sd are useful mathematically because you can partition them.
mean absolute deviation is biased and inconsistent.
difference between trimmed and windsorized means/variances
trimmed = chop off top x% and bottom x% of data and recompute mean and var
winsorized: same as trimmed, but instead of dropping the extreme values, replace them with the most extreme values that remain (the new lowest and highest)
(if these procedures don’t change your statistics, your stats are robust. cool)
why would we want trimmed/windsorized stuff?
mean and var are especially sensitive to outliers in small samples, which can ruin statistical tests. so we may want indices that are more robust (i.e., that vary little from one sample to another)
decrease influence of extreme values
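A minimal plain-Python sketch of both procedures (not a library implementation; the 10% trimming proportion and function names are arbitrary choices for illustration):

```python
def trimmed_mean(data, prop=0.1):
    """Drop the lowest and highest prop of observations, then average the rest."""
    xs = sorted(data)
    k = int(len(xs) * prop)
    kept = xs[k:len(xs) - k] if k > 0 else xs
    return sum(kept) / len(kept)

def winsorized_mean(data, prop=0.1):
    """Replace the lowest/highest prop with the nearest values that remain, then average."""
    xs = sorted(data)
    k = int(len(xs) * prop)
    if k > 0:
        xs = [xs[k]] * k + xs[k:len(xs) - k] + [xs[-k - 1]] * k
    return sum(xs) / len(xs)

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 500]   # one wild outlier
# ordinary mean = 54.5; both robust means stay near the bulk of the data (5.5)
```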
why divide by n-1 when computing sample variance?
because dividing by n leads to a biased estimate of variance – the long run average will be too small
also, because we lose one degree of freedom in estimating xbar from the sample (now that we have estimated xbar, not all data points are free to vary - one is fixed)
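The bias is easy to demonstrate exactly: with a tiny made-up population you can enumerate every equally likely sample (with replacement) instead of simulating, and compare the long-run averages of the two variance formulas:

```python
from itertools import product

pop = [1, 2, 3, 4]                   # tiny made-up population
mu = sum(pop) / len(pop)
pop_var = sum((x - mu) ** 2 for x in pop) / len(pop)   # 1.25

n = 2
vars_n, vars_n1 = [], []
for sample in product(pop, repeat=n):    # every equally likely size-2 sample
    xbar = sum(sample) / n
    ss = sum((x - xbar) ** 2 for x in sample)
    vars_n.append(ss / n)                # divide by n
    vars_n1.append(ss / (n - 1))         # divide by n-1

avg_n = sum(vars_n) / len(vars_n)        # long-run average: too small (biased)
avg_n1 = sum(vars_n1) / len(vars_n1)     # long-run average: exactly pop_var (unbiased)
```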
chief problem in interpreting variance?
it’s in squared units of the original variable, which have no direct interpretation on the original scale
how is standard deviation interpreted?
it’s in the units of your variable – average difference between an observation and the mean
what is expected value?
long run average of a statistic – if you resample infinitely many times and compute the statistic each time, the average of those values converges to the statistic’s expected value
what is an unbiased estimator?
when the expected value of the sample statistic is the population parameter
true/false: if a statistic is an unbiased estimator of a parameter, the statistic must have a symmetric sampling distribution.
False
how would you go about empirically (without equations) determining the bias and efficiency of sample mean and median in terms of estimating the population mean?
take a bunch of samples from your population and take the mean, median, and sd of each one. then construct a sampling distribution from this iterative procedure. if the mean of the sampling distribution = the population mean, it’s unbiased. if the standard error of the sampling distribution is small, it’s efficient.
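That procedure can be sketched as a small Monte Carlo simulation. This is a sketch under assumed settings (normal population with made-up mean/sd, sample size 25, 4000 resamples):

```python
import random
import statistics

random.seed(0)                             # reproducible
MU, SIGMA, N, REPS = 100, 15, 25, 4000     # made-up population and sample size

means, medians = [], []
for _ in range(REPS):                      # build empirical sampling distributions
    sample = [random.gauss(MU, SIGMA) for _ in range(N)]
    means.append(statistics.mean(sample))
    medians.append(statistics.median(sample))

bias_mean = statistics.mean(means) - MU       # ~0: unbiased
bias_median = statistics.mean(medians) - MU   # ~0 too (symmetric population)
se_mean = statistics.stdev(means)             # smaller...
se_median = statistics.stdev(medians)         # ...than this: the mean is more efficient
```

For a normal population both estimators are unbiased, but the mean’s sampling distribution has the smaller standard error.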
define degrees of freedom
degrees of freedom is how many values in your sample are free to vary. e.g., if you calculate the mean, you lose a degree of freedom because now all your observations are not free to vary. one is fixed.
define linear transformations
t = a*x + b
adding or subtracting a constant or multiplying or dividing by a constant
effect of linear transformation on mean, sd, variance, relative ordering, and statistical tests
adding/subtracting: add/subtract same amount from mean (just shifts distribution left or right)
multiplying/dividing: multiplies/divides mean by the constant, variance by the square of the constant, sd by the constant
DO NOT AFFECT relative ordering or results of statistical tests
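The three rules above, checked on made-up numbers:

```python
import statistics

x = [2, 4, 6, 8, 10]              # mean 6, sample variance 10
a, b = 3, 7                       # the linear transformation t = a*x + b
t = [a * xi + b for xi in x]      # [13, 19, 25, 31, 37]

mean_x, mean_t = statistics.mean(x), statistics.mean(t)
var_x, var_t = statistics.variance(x), statistics.variance(t)
sd_x, sd_t = statistics.stdev(x), statistics.stdev(t)
# mean_t = a*mean_x + b;  var_t = a^2 * var_x;  sd_t = |a| * sd_x
```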
describe standardization transformation
z = (x - xbar)/s
standardization gives you a distribution with mean = 0 and sd = 1
so tells you how many deviations each value is from mean
to calculate probabilities from this distribution, need normality of data
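The mean = 0, sd = 1 property is by construction, as a quick sketch with arbitrary data shows:

```python
import statistics

x = [10, 20, 30, 40, 50]             # made-up scores
xbar, s = statistics.mean(x), statistics.stdev(x)
z = [(xi - xbar) / s for xi in x]    # z-scores: deviations from the mean in sd units

mean_z = statistics.mean(z)          # 0 (up to floating-point rounding)
sd_z = statistics.stdev(z)           # 1 (up to floating-point rounding)
```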
goal of box-cox (power transformations)
optimize normality of predictors
box-cox considers a family of power transformations and computes the likelihood of the data under a normal distribution…then finds the exponent (lambda) that makes the transformed data most likely
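A rough sketch of the idea, not a proper implementation: a simplified Box-Cox profile log-likelihood with a grid search standing in for a real optimizer (function names and the example data are made up; real tools like scipy.stats.boxcox do the optimization for you):

```python
import math

def boxcox_loglik(x, lam):
    """Profile log-likelihood of the data under normality after the power transform
    y = (x^lam - 1)/lam (log for lam = 0). Constant terms dropped; higher is better."""
    n = len(x)
    if abs(lam) < 1e-12:
        y = [math.log(v) for v in x]
    else:
        y = [(v ** lam - 1) / lam for v in x]
    ybar = math.fsum(y) / n
    var = math.fsum((yi - ybar) ** 2 for yi in y) / n
    return -n / 2 * math.log(var) + (lam - 1) * math.fsum(math.log(v) for v in x)

def best_lambda(x):
    grid = [i / 10 for i in range(-20, 21)]   # candidate exponents from -2 to 2
    return max(grid, key=lambda lam: boxcox_loglik(x, lam))

skewed = [math.exp(v / 10) for v in range(1, 40)]  # strongly right-skewed, all positive
# for exponential-looking data like this, the search lands near lambda = 0 (the log transform)
```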
are all transformations monotonic (order preserving)?
yes, for the ones we typically use – linear, log, and power/box-cox transformations (on positive data) are all monotonic, so even the nonlinear ones preserve order
what do nonlinear transformations preserve/not preserve
preserve order (bc monotonic) but not shape -- they change the relative spacing of data points, so results of statistical tests may not be preserved
nominal scale properties
scale that classifies cases into categories
no numeric/cardinal ordering
classifications mutually exclusive
ordinal scale properties
scale that conveys order but no equal distance
interval scale properties
difference between scores has the same meaning throughout the scale
no true 0
minimum requirement for most statistics
ratio scale properties
difference between scores has the same meaning, and now we have a true 0, so we can talk about ratios of scores
when linear transformations are performed on interval scales, do we maintain same ratios?
ratios aren’t meaningful on an interval scale to begin with. on a ratio scale, multiplying by a constant preserves ratios, but adding a constant shifts the zero point and changes them
Why is the normal distribution so important in psychological research?
Because it allows us to compute probabilities of observing a score or test statistic
How can we explore the degree to which sample data are normally distributed?
Graph your data and look at it
Impose a normal density over your data and look at it
Examine mean, sd, skewness, and kurtosis
normal has skew = 0 and kurtosis = 3 (i.e., excess kurtosis = 0)
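A sketch of the moment-based formulas behind that check (the helper name is made up; a symmetric dataset gives skew = 0, and a flat, uniform-looking one gives kurtosis below 3):

```python
def skew_kurt(x):
    """Moment-based skewness and (non-excess) kurtosis."""
    n = len(x)
    m = sum(x) / n
    sd = (sum((v - m) ** 2 for v in x) / n) ** 0.5
    skew = sum(((v - m) / sd) ** 3 for v in x) / n
    kurt = sum(((v - m) / sd) ** 4 for v in x) / n
    return skew, kurt

s, k = skew_kurt([1, 2, 3, 4, 5])   # symmetric: skew = 0; flat: kurtosis < 3
```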
How are areas under normal curve linked to probabilities?
p(selecting a case in some range) corresponds to area under the curve between those values
True/false: if a sampling distribution is unbiased, then it must be symmetric and normal as well.
False
Why do we need to know theoretical probability distributions like the normal, chi-square, t, etc.?
because important things like test statistics follow these distributions and we want to know the probability of obtaining a particular test statistic under different assumptions
How does one find the area under a curve?
area under curve from x to y: integrate density function from x to y or use software
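A sketch of the software route, using a simple midpoint rule as a stand-in for the integral (function names are made up):

```python
import math

def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-z * z / 2) / math.sqrt(2 * math.pi)

def area(lo, hi, steps=100_000):
    """Approximate the area under the standard normal curve between lo and hi
    by summing thin midpoint rectangles."""
    h = (hi - lo) / steps
    return h * sum(normal_pdf(lo + (i + 0.5) * h) for i in range(steps))

p_within_1sd = area(-1.0, 1.0)    # ~0.6827: the familiar 68% within one sd
```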
Why can’t we compute the exact probability of a sample result instead of a probability for a range of values?
In continuous distributions, the probability of obtaining any one value is 0 so you must look at ranges
What do tabled values of the standard normal distribution actually tell you?
Probability (area under the curve) to the left of whatever the given value is
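The quantity a z-table gives you is the standard normal CDF, which can be computed directly from the error function:

```python
import math

def phi(z):
    """P(Z <= z) for a standard normal – exactly what a z-table lists,
    written via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# familiar table entries: phi(0) = 0.5, phi(1.96) ~ 0.975, phi(-1.96) ~ 0.025
```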
Under what conditions can we use a z-score table to compute probabilities of a sample result?
If the variable itself is normally distributed (so the z-score is exactly standard normal), or – for the sample mean – if n is large enough (rule of thumb: n > 30) that the CLT makes the sampling distribution approximately normal
Distinguish between measures of absolute standing and relative standing.
absolute standing: where a score sits in the original units of the variable (the raw score itself)
relative standing: where a score sits compared to the rest of the distribution – e.g., percentile rank or z-score