stats midterm Flashcards

Question

Normative arguments include words like

Answer 1

“should” or “ought to.”

Answer 2

points at time in which your data changes

Answer 3

a sequence of data points collected or recorded at successive points in time, typically at equally spaced intervals, that represents how a particular variable or set of variables changes over time

Answer 4

the level at which your data changes

Answer 5

data that is structured in multiple nested levels, where observations are grouped within higher-level units

Answer 6

geographic locations in which your data changes

Answer 7

data collected at a single point in time from multiple units, such as states or countries, to analyze variations across those units

Answer 8

a variable that influences the strength or direction of the relationship between an independent and a dependent variable in a study.

Answer 9

a variable that explains the process or mechanism through which an independent variable affects a dependent variable, acting as an intermediary in the relationship11

Answer 10

a framework that uses mathematical models and logical structures to rigorously analyze and predict the behavior of complex systems or phenomena

Answer 11

individuals make decisions by systematically evaluating the costs and benefits to maximize their personal utility or advantage

Answer 12

the sum of all benefits of an action minus the sum of all costs from that action

Answer 13

an individual who seeks to make choices that yield the highest possible level of benefit based on their preferences and available options

Answer 14

the overall anticipated satisfaction or benefit (utility) derived from a particular choice or outcome

Answer 15

a branch of formal modeling that focuses on analyzing strategic interactions between rational decision-makers, where the outcome for each participant depends not only on their own choices but also on the choices of others

Answer 16

a classic game theory scenario where two individuals, who cannot communicate, face a choice between cooperating with each other or betraying one another

Answer 17

a domain within formal modeling that examines how individual preferences can be aggregated to make collective decisions

Answer 18

a preference structure that violates the transitivity condition. For example, an individual might prefer option A over option B, option B over option C, but still prefer option C over option A (A > B, B > C, but C > A).

Answer 19

a specialized form of formal modeling that incorporate spatial or geographic dimensions into the analysis of strategic interactions.

Answer 20

a formal modeling approach used to analyze how voters' preferences and spatial positioning influence electoral outcomes

Answer 21

voters and candidates are positioned on a spatial map (often a one-dimensional or two-dimensional continuum) based on their ideological or policy preferences

Answer 22

candidates choose positions or policies to maximize their votes, typically moving towards the median voter or the center of voter preferences to appeal to the largest segment of the electorate

Answer 23

The model identifies equilibrium points, where candidates' positions stabilize because any deviation would result in fewer votes. The most common equilibrium is the median voter theorem, where candidates converge to the preferences of the median voter

Answer 24

a connection between two variables where one variable directly influences or determines the outcome of the other

Answer 25

a variable that influences both the independent and dependent variables, potentially leading to a misleading or spurious association between them.

Answer 26

a false or misleading association between two variables that is actually caused by a third, confounding variable, rather than a direct causal link between the two

Answer 27

a variable or condition that is held constant or regulated in an experiment or study to isolate the effect of the independent variable on the dependent variable, ensuring that the results are not influenced by extraneous factors

Answer 28

a connection between two variables where one variable's value is precisely determined by the value of the other, with no randomness or uncertainty involved

Answer 29

a connection between two variables where changes in one variable are associated with changes in the likelihood or probability of different outcomes in the other variable, but the relationship is not perfectly predictable

Answer 30

information collected from real-world observations or measurements without conducting experiments

Answer 31

information collected from experiments where variables are systematically manipulated to observe their effects on other variables, allowing for causal inferences

Answer 32

experimental studies where participants are randomly assigned to either a treatment group or a control group to evaluate the effectiveness of an intervention while minimizing biases

Answer 33

a group of participants in a study that receives the treatment or intervention being tested, allowing researchers to assess its effects compared to a control group

Answer 34

the process of randomly allocating participants to control and treatment groups in a study to ensure that each group is comparable and to eliminate selection bias

Answer 35

when the sample of participants in a study is not representative of the population being studied, leading to distorted or unrepresentative results

Answer 36

four causal hurdles.

Answer 37

external validity

Answer 38

the degree to which one can be confident that the results of an analysis apply to the broader population

Answer 39

experiments that leverage naturally occurring random variations or events to investigate causal effects, without direct manipulation of the independent variable by the researcher

Answer 40

internal validity

Answer 41

studies that compare the effects of an intervention or treatment between pre-selected groups that are not randomly assigned, aiming to assess causal relationships while controlling for confounding variables

Answer 42

research designs that aim to evaluate interventions or treatments without full randomization, often using pre-existing groups or natural conditions to infer causal relationships

Answer 43

research designs in which the researcher does not have control over values of the independent variable because the independent variable occurs naturally

Answer 44

a specific question or statement in a survey designed to gather data on a particular aspect of a respondent's attitudes, opinions, or behaviors

Answer 45

items that allow respondents to provide their answers in their own words

Answer 46

item that asks respondents to rank a list of choices according to their preferences or importance

Answer 47

response options that allow respondents to rate their level of agreement or disagreement with a series of statements on an interval scale, typically ranging from "strongly disagree" to "strongly agree

Answer 48

a type of response with only two choices

Answer 49

multiple questions or items that measure a single underlying construct

Answer 50

the process of assessing whether a multi-item scale accurately and reliably captures the construct it is intended to measure, ensuring that it reflects the intended attributes and performs consistently across different contexts and populations

Answer 51

data collected about respondents' characteristics, such as age, gender, education level, income, and ethnicity

Answer 52

the entire group of individuals or units from which a sample is drawn and to whom the survey findings are intended to generalize

Answer 53

a subset of individuals or units selected from a larger population for the purpose of conducting a survey or study to draw conclusions about the entire population

Answer 54

to select and examine a subset of a population or data set to draw conclusions or make inferences about the larger population

Answer 55

the number of individual units or observations selected from a population for a study, used to ensure the results are statistically reliable and representative of the larger group

Answer 56

the probability that a statistical test will correctly reject a false null hypothesis, thereby detecting an effect or relationship if one truly exist

Answer 57

a subset of a population that accurately reflects the characteristics and diversity of the larger group, allowing the results to be generalized to the entire population

Answer 58

when each member of the population has a known, non- zero chance of being selected for the sample, allowing for statistical inference and generalization to the population

Answer 59

when members of the sample are not selected at random, making it difficult to determine the likelihood of any member being chosen and limiting the ability to generalize the findings

Answer 60

a type of non-probability sample where participants are selected based on their easy availability and proximity to the researcher, rather than through random sampling, which can lead to biases and limited generalizability

Answer 61

a method of inquiry that focuses on collecting and analyzing numerical data to identify patterns, test hypotheses, and make generalizations about a population

Answer 62

forming a precise definition for and clear understanding of the concepts being studied

Answer 63

a broad, abstract idea or general notion that provides a foundational understanding

Answer 64

a specific, measurable version of a concept used in research to operationalize and test theoretical ideas

Answer 65

the extent to which a measurement tool appears to measure what it is supposed to measure, based on casual inspection

Answer 66

the extent to which a variable or measurement is related to other measures that theory suggests should be related

Answer 67

the extent to which a variable or measurement accurately represents all of the elements that define the concept it is intended to measure

Answer 68

the consistency and stability of a measurement tool across repeated applications

Answer 69

when only the entities that have "survived" a particular process are considered, leading to a skewed understanding or conclusion.

Answer 70

a method of inquiry that focuses on understanding and interpreting the meanings, experiences, and perspectives of individuals or groups through non-numerical data, such as interviews, observations, and texts

Answer 71

represent categories or groups and do not have a numeric value

Answer 72

categorical variables with no inherent order or ranking among the categories.

Answer 73

categorical variables that have a meaningful order or ranking, but the intervals between the categories are not necessarily equal.

Answer 74

represent quantities and can be measured on a numeric scale

Answer 75

can take any value within a range and can be subdivided into finer increments with equal unit distances

Answer 76

can only take specific, distinct values, often counts or integers

Answer 77

a class of statistics used to describe the variation of continuous variables based on their ranking from lowest to highest values

Answer 78

a statistical term that divides a dataset into four equal parts, with each quartile containing 25% of the data

Answer 79

a graphical representation of data that displays the median, quartiles, and potential outliers, using a box to show the interquartile range and "whiskers" to indicate the range of the data

Answer 80

numerical measures derived from the data values themselves and their positions relative to the mean or origin

Answer 81

if you subtract the mean of a dataset from each data point, the sum of these deviations will always be zero

Answer 82

expected value because it is the value you would most expect the variable to take.

Answer 83

a measure of the dispersion of a variable around its mean

Answer 84

another measure of the dispersion of a variable around its mean.

Answer 85

a visual depiction of the distribution of a single variable based on a smoothed calculation of the density of cases across the range of values

Answer 86

a measure that indicates the symmetry of the variable’s distribution around the mean

Answer 87

a measure that indicates the steepness of the distribution of a variable

Answer 88

lots of nonrespondents.

Answer 89

a sample such that each member of the underlying population does NOT necessarily has an equal probability of being selected.

Answer 90

the process of using what we know about a sample to make probabilistic statements about the broader population.

Answer 91

parameters are numerical values that describe certain characteristics or features of a sample or an entire population, such as the mean, variance, or proportion.

Answer 92

a fundamental result from statistics indicating that if one were to collect an infinite number of random samples and plot the resulting sample means, those sample means would be distributed normally around the true population mean

Answer 93

a mathematical function that describes the probabilities of different outcomes in a random variable or set of data

Answer 94

the underlying mechanism or model that describes how data is produced and collected

Answer 95

an outcome whose occurrence is not influenced by the outcome of another event.

Answer 96

a bell-shaped statistical distribution that can be entirely characterized by its mean and standard deviation.

Answer 97

* One standard deviation in each direction captures 68.3% of the area under the curve. * Two standard deviations in each direction captures 95.5% of the area under the curve. * Three standard deviations in each direction captures 99.7% of the area under the curve.

Answer 98

the standard deviation of the sampling distribution means. -It is the measure of the variability or dispersion of sample means around the population mean

Answer 99

a probabilistic statement about the likely value of a population characteristic based on the observations in a sample.

Answer 100

a testable statement predicting a relationship or effect between variables, often framed as an expectation of what will happen

Answer 101

a specific type of hypothesis that assumes no effect or no difference between variables and serves as a baseline to test against

Answer 102

an alternative scenario or condition that contrasts with the proposed effect or relationship in the hypothesis, effectively serving as the null hypothesis which assumes no effect or difference

Answer 103

a predetermined threshold derived from a particular statistical distribution used to conduct a statistical test

Answer 104

the probability of rejecting the null hypothesis when its actually true, representing the threshold for statistical significance.

Answer 105

a value calculated by: * identifying the sample statistic (e.g., the mean), * determining its standard error (e.g. standard error of the mean), and * using a specific formula to assess how far the sample result deviates from the null hypothesis

Answer 106

the probability of obtaining a test statistic as extreme as, or more extreme than, the one observed, assuming the null hypothesis is true

Answer 107

an indication that an observed effect or relationship in the data is unlikely to have occurred by random chance alone. (assuming the null hypothesis is true and the study is repeated an infinite number of times by drawing random samples from the same population, less than 5% of these results will be more extreme than the current result.)

Answer 108

the alternative hypothesis is proven to be true. It just means you can reject the null hypothesis

Answer 109

a statistical test that evaluates whether observed categorical data align with the expected frequencies based on a specific hypothesis

Answer 110

a matrix that displays the frequency distribution of two categorical variables, showing how their values intersect

Answer 111

the number of independent values or quantities that can vary in a statistical calculation, typically indicating the number of values that are free to vary after certain constraints are applied

Answer 112

degrees of freedom

stats midterm Flashcards

(139 cards)