Final Exam Flashcards

Question

Experimental Data

Answer 1

Data collected from a randomized experiment

Answer 2

Data collected about naturally occurring events.

Answer 3

is a statistic that summarizes the relationship between two variables, X and Y, with a number denoted as cor(X,Y) in mathematical notation. It summarizes the direction and strength of the linear association between the two variables

Answer 4

confounding variable is a variable that affects both (i) the likelihood to receive the treatment X and (ii) the outcome Y

Answer 5

is the proportion of its occurrence among infinitely many identical trials ► Example: probability of heads when flipping a coin

Answer 6

probabilities represent one’s subjective beliefs about the relative likelihood of events ► Example: probability of rain in the afternoon

Answer 7

probability distribution of a binary variable

Answer 8

approximation for many non-binary variables

Answer 9

Refers to the fact that the value of a statistic varies from one sample to another because each sample contains a different set of observations drawn from the target population

Answer 10

As the sample size increases, the sample mean of X approximates the population mean of X

Answer 11

As the sample size increases, the standardized sample mean of X can be approximated by the standard normal distribution

Answer 12

Methodology based on proof by contradiction: We start by assuming the contrary of what we would like to prove and show how this assumption leads to a logical contradiction

Answer 13

what you are trying to disprove

Answer 14

what you are trying to provide evidence for

Answer 15

The P stands for probability and measures how likely it is that any observed difference between groups is due to chance.

Answer 16

the result is statistically significant at the 5% level when it is distinguishable from zero using 5% as the rejection threshold

Answer 17

a result is scientifically significant when it is large enough to be consequentiall

Answer 18

test statistic whose distribution under the null hypothesis is the standard normal distribution

Answer 19

determines the rejection threshold of the test and characterizes the probability of false rejection of the null hypothesis.

Answer 20

provides the range of values that is likely to include the true value of the parameter

Answer 21

defined as half the width of the estimator's confidence interval

Answer 22

The estimation error is the difference between the estimate and the true value of the parameter.

Answer 23

A function of observed data that can be used to test the null hypothesis

Answer 24

estimator for which the average estimation error over multiple samples is zero; estimator that provides, on average, accurate results

Answer 25

The standard normal distribution is the normal distribution with mean 0 and variance 1.

Answer 26

events that do not share any outcomes

Answer 27

Omega; the set of all possible outcomes produced by a trial; considered an event in itself

Answer 28

action or set of actions that produces outcomes of interest

Answer 29

The result of a trial

Answer 30

A set of outcomes; an event is said to occur if any one of the possible outcomes included in the event is realized

Answer 31

Variables affected by the treatment: X ----> post-treatment variable

Answer 32

ranges from 0 to 1 and measures the proportion of the variation of the outcome variable explained by the model.

Answer 33

This is a statistical model used to predict the value of one dependent variable based on two or more independent variables.

Answer 34

B (the Greek letter beta) is the slope coefficient

Answer 35

Y is the outcome for observation i

Answer 36

variable that we use as the basis for our predictions; predictors are also known as independent variables

Answer 37

variable that we are trying to predict based on the values of the predictor(s); outcome variables are also known as dependent variables

Answer 38

predicts the value of one dependent variable based on only one independent variable

Answer 39

measures how far our prediction is from the observed value; it is the difference between the observed outcome and the predicted outcome.

Answer 40

is the graphical representation of the relationship between two variables, where one variable is plotted along the x-axis, and the other is plotted along the y-axis.

Answer 41

shows the values the variable takes and the number of times each value appears in the variable.

Answer 42

shows the proportion of observations that take each combination of values of two specified variables.

Answer 43

histogram that uses densities instead of frequencies as the height of the bins, where densities are defined as the proportion of the observations in the bin divided by the width of the bin.

Answer 44

is defined as the average outcome for the treatment group minus the average outcome for the control group

Answer 45

A nonbinary variable can take more than two values, such as distonce={1.452, 2.345, 0.298} and dice_roll={2, 4, 6}.

Answer 46

refers to the cause-and-effect connection between two variables in which a change in one variable systematically produces a change in the other

Answer 47

cut-off point of the test statistic used to determine whether to reject the null hypothesis

Answer 48

observations that received the treatment

Answer 49

observations that did not receive the treatment.

Answer 50

variable whose change may produce a change in the outcome variable

Answer 51

X treatment / predictor

Answer 52

A binary variable can take only two values; we define binary variables as taking only 1 s and 0s

Answer 53

The internal validity of a study refers to the extent to which its causal conclusions are valid for the sample of observations in the study.

Answer 54

shows the number of observations that take each combination of values of two specified variables.

Answer 55

25th percentile / 75th percentile

Answer 56

Y / Outcome/ Effect

Answer 57

This is a simpler model that predicts the value of one dependent variable based on only one independent variable.

Final Exam Flashcards

Terms (82 cards)