Quantitative Methods Flashcards

1
Q

Alpha Coefficient

A

The alpha coefficient (also referred to as Cronbach’s alpha) is a measure of internal consistency reliability (how closely related a set of items are as a group). It is used in survey design. Alpha is concerned with the degree to which different items appear to be tapping a single underlying construct (as suggested by the correlation/covariance between items on a scale). Cronbach’s alpha can be written as a function of the number of test items and the average inter-correlation among the items. Therefore, to increase alpha you can 1) add more items to the measure or 2) increase the homogeneity of items on the scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Alpha Level

A

The level of significance chosen before running a statistical test that denotes the probability of rejecting the null hypothesis (no relationship between two measured phenomena) when it is actually true. This probability should be small, because we do not want to reject the null hypothesis when it is true (type 1 error). Traditional values used for alpha are .05, .01, and .001. (Eg., When alpha is set at .05 it means that 5% of all possible outcomes will falsely reject the null hypothesis when it is actually true.) The alpha-level is chosen according to the researcher’s willingness to take that particular risk of claiming to find a significant “effect” when there is none.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Attributable Risk (Risk Difference)

A

It is the amount of risk of disease attributable to exposure in an epidemiological study. This measure is derived by subtracting the incidence rate of the outcome among the unexposed from the rate of the outcome among the exposed individuals (thereby measuring the difference between the disease risk in exposed and unexposed populations). Two assumptions must be made: 1) everyone can be categorized as either exposed or non-exposed; 2) a cause-effect relationship exists between the exposure and disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Autocorrelation

A

Autocorrelation (also known as serial correlation) is the correlation of of a variable with itself over successive observations. For example, if we are predicting the growth of stock dividends, an overestimate in one year is likely to lead to overestimates in succeeding years. In panel studies, the error terms are autocorrelated when the current values of a variable (e.g., political attitude) are correlated with previous values (e.g., religion). Autocorrelation can appear with time-series data (e.g., repeated measures) and with cluster sampling (e.g., surveying multiple individuals in a family/household). We hope for a situation in which, for any two observations, the error terms are uncorrelated with each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

BLUE

A

In using survey / sample data in regression, we want to look for items whose population parameters are the best, linear, and unbiased in order to increase the accuracy of the overall regression. Best—the property (of a sample estimate) of having the smallest deviation from the population parameter. Linear—the slope is constant. Unbiased—an estimator of the population parameter is unbiased if its mean value over all possible random samples is equal to the parameter being estimated. Estimator—an estimated value (calculated from sample data) of some population parameter of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Bounded Recall

A

A survey method used to improve recall and reduce telescoping error (tendency of respondents to report events as happening earlier or later than they actually occurred) during the retrieval stage of response formation. Respondents are given a specific point of reference (generally the last survey response) to help jog their memories and prevent the compression of time in their responses. It is a primary device to reduce over-reporting of health events in continuous panel surveys (longitudinal studies of the same sample of people at different times) However, it does not necessarily help to reduce the other kind of recall error: omission/forgetting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Case-control Study

A

A study design in epidemiology most often used when an outcome or disease is rare. Cases are selected based on their disease (or other outcome) status; controls are those without the outcome variable, but who resemble the cases in other factors (should be selected from the same source population as the cases). The point of this type of study is to look back in time to understand what the risk or exposure was that caused the cases to develop the disease in comparison to the controls (in order to generate a measure of association between disease and exposure status). Used to estimate the odds ratio (cannot determine incidence, prevalence, or (generally) the probability of disease given exposure or non-exposure).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Causal Inference

A

The art and science of making a causal claim about the relationship between two factors. If an association between two factors/variables is observed, then causal inference can answer the question: is one causing the other? However, association does not necessarily imply causation. David Hume (early 1700s) was the first to make a systematic statement of cause and effect. His lasting contribution was to raise doubts about the possibility of proving causation, arguing that there is no certain logic that can prove universality of causal claims. Thus, our study results will often be inconclusive, and the best we can expect to do is to reach the most reasonable explanation based on the evidence at hand.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Causality

A

According to Selltiz et al. (1959), three conditions must be met in order to infer the existence of a causal relationship between two variables 1) temporal sequencing (cause precedes effect), 2) empirical correlation between independent and dependent variables (cause and effect covary with one another), and 3) no alternative plausible explanations for the effect other than the cause (eliminating confounding factors).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Censored Observation

A

Observations that are incomplete in some way, as when certain values are unknown or ignored, or when an observation is discontinued before the event of interest is observed to occur, resulting in incomplete follow up of the individual. For example, in a study of the effects of a cancer treatment on survival time, study participants who remain alive at the end of the study are considered censored observations because their full survival time is still unmeasured, and thus cannot be included in the analysis of the results. Their records become censored observations. This situation occurs in survival analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Cohort

A

Types include prospective, retrospective and ambispective studies. The analytic method of epidemiologic study in which subsets of a defined population can be identified who are, have been, or in the future may be exposed or not exposed to a factor (or factors) hypothesized to influence the probability of occurrence of a given disease or other outcome. The basic feature is observation of a population (classified on the basis of exposure) for a sufficient number of person-years to generate reliable estimates of outcome incidence in the population subsets. A cohort study establishes temporal sequence between exposure and disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Period Effect

A

A period effect leads to cross-sectional changes in the observed incidence of a disease because of factors uniformly affecting all age groups and birth cohorts during a specific period of time, such as the effect from exposure to something in the air or drinking water. Often a period effect refers to an artifactual change in the reported disease rate, vs. a true change in the burden of that disease in the population. Ex.: the increased use of PSA tests artificially inflated the rates of prostate cancer (because diagnoses became more common, while actual prostate cancer did not). Such a shift in the sensitivity of techniques for detecting an outcome affects multiple age groups (those screened/observed, anyway), regardless of birth cohort.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cohort Effect

A

A cohort is a group of people who share a common experience within a defined time period. Thus, a cohort effect will be concentrated in the group sharing a relationship to a particular event. Example: we might expect to observe a cohort effect of the WTC disaster on the incidence of clinical depression or alcoholism among firefighters working in the NYC area during fall of 2001.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Cohort Effect vs. Period Effect

A

Both terms describe the temporal effects on societal patterns of disease. Period effect refers to the effects of contemporary societal change in trends of disease due to factors uniformly affecting all age groups and birth cohorts during a specific period of time, such as the effect from exposure to something in the air or drinking water. A period effect changes a disease rate for a limited period of time around the time of its occurrence. (ex: Higher disease rates in the 1930s due to the Great Depression). Cohort effect will be concentrated in the group sharing a relationship to a particular event. For example, we might expect to observe a cohort effect of the WTC disaster on the incidence of clinical depression or alcoholism among firefighters working in the NYC area during fall of 2001.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Confidence Intervals

A

The range of values (bounded interval) within which a population parameter is estimated to lie. This depends on the point estimate, its variability, and the sample size. CI’s can be thought of as an enhancement of the point estimate because rather than just calculating a single number that’s intended to estimate the value of a population parameter, we can also calculate a range of values within which the parameter is likely to fall. Therefore, the CI reflects our best guess of a parameter (the point estimate) as well as the precision of this guess. Ideally, you want the CI to be reasonably narrow. A CI at 95% indicates that if you sample the same population repeatedly over time, 95% of the the CIs for the point estimate would include the point estimate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Confounding Factors

A

A confounding factor is an extraneous variable that covaries with both the independent and dependent variable of interest. It is a plausable alternative explanation for the relationship between the variables being studied. Unlike a mediating variable, it lies outside of the causal pathway between IV and DV, threatening internal validity. You want to control for confounders, but not for mediators. For example, smoking, positively associated with both alcohol intake and cardiovascular disease, can confound the association between drinking and CVD. Thus, we would want to control for smoking in any equation used to model the relationship between drinking alcohol and risk for cardiovascular disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Moderating vs. Mediating Factors

A

A moderator variable is one that influences the strength of a relationship between two other variables, and a mediator variable is one that explains the relationship between the two other variables. Whereas moderator variables affect the direction and/or strength between an IV and DV, mediators speak to how or why such effects occur. As an example, let’s consider the relation between social class (SES) and frequency of breast self-exams (BSE). Age might be a moderator variable, in that the relation between SES and BSE could be stronger for older women and less strong or nonexistent for younger women. Education might be a mediator variable in that it explains why there is a relation between SES and BSE. When you remove the effect of education, the relation between SES and BSE disappears.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Content Validity

A

The validity of the instrument itself. In survey research, content validity relates to how accurately the scale operationalizes the latent construct(s). Poor content validity means that you are measuring something other than what you had intended to measure. Is the instrument really measuring the concept or idea indicated? Construct validity is directly concerned with the theoretical relationship of a variable (e.g. one that is measured by a scale) with other variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Contamination

A

The polluting of one’s experimental groups with outside influences – often unexpected and uncontrolled for. The term contamination often refers to a diffusion of the treatment of interest from experimental to control groups. Contamination is especially likely to occur where one treatment is more desirable than another treatment or than the control state, and the desired treatment can be obtained through sources other than the investigators or their service delivery staff.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Correlation vs. Causation

A

Correlation refers to the degree to which variables change (covary) together. Researchers have determined that correlation is not synonymous with causation. ‘A’ can only be said to cause ‘B’ if 1) A is prior to B; 2) change in A is correlated with change in B; and 3) this correlation is not itself the consequence of both A and B being correlated with some prior variable C.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Cost-benefits Analysis

A

A type of efficiency analysis in program evaluation that assesses the relationship between (direct and indirect) program costs and their (direct and indirect) benefits in monetary terms. Example: an anti-smoking campaign saved $1000 in healthcare costs for every $100 in project costs. The major issue with CBA is the difficulty (methodologically and philosophically) in placing a dollar value on non-monetary program benefits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Cost Effectiveness Analysis

A

A type of efficiency analysis in program evaluation that assesses the non-monetary outcomes of an intervention in relation to a program’s input costs. It is expressed as a ratio of cost per unit of impact, or in other words, the cost of achieving a specific result. Example: an anti-smoking campaign caused one person to quit smoking for every $1,000 in project costs. This type of analysis is particularly helpful when comparing multiple programs that seek to achieve the same outcomes, or when monetizing program outcomes (for cost-benefit analyses) would be too difficult.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Counterfactual Condition

A

A counterfactual is something that is contrary to fact. In an experiment, we observe what did happen when people received a treatment. The counterfactual is knowledge of what would have happened to those same people if they simultaneously had not received treatment. An effect is the difference between what did happen and what would have happened. We cannot actually observe a counterfactual since it is impossible for respondents to both have and not have the causal condition simultaneously. Therefore, a central task of cause-probing research is to create reasonable approximations to the physically impossible counterfactual. (Shadish, Cook, & Campbell)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Criterion Validity

A

The extent to which a survey measure or scale predicts or agrees with some criterion or “gold standard” of the measure. Includes predictive validity (correspondence between the measure and a future criterion), concurrent validity (correspondence between the measure and a current criterion), and postdictive validity (correspondence between the measure a previously established criterion). Ex.: SATs demonstrate predictive validity to the extent that SAT scores correlate positively with college GPA, while patient reports of STI exposure demonstrate concurrent validity to the extent that these reports match up with records of lab tests of their disease status.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Cronbach’s Alpha

A

A measure of internal consistency reliability for a scale. It is defined as the proportion of a scale’s total variance that is attributable to a common source, presumably the true score of a latent variable underlying the items. It examines the homogeneity of items through examining inter-item covariances/correlations. It measures to what proportion items are measuring the same thing. It is based on all possible ways of splitting and comparing sets of questions used to tap into a particular construct. The widely accepted cutoff for alpha is 0.70. 0.80 or better represents good reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Discriminant Validity

A

The degree to which the operationalization of a target construct diverges from operationalizations of other constructs to which it is conceptually similar or empirically associated. It is an aspect of construct validity. It flows from the notion that a measure of A can be discriminated from a measure of B when B is thought to be different than A. For example, if one is measuring a neighborhood’s collective efficacy, it would strengthen the study to show that the measurement is tapping something different than what a social capital measure would tap.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Double-barrel Question

A

A question that groups two different topics or constructs into a single question, making the question multidimensional. For example, “Do you want to be rich and famous?” or “How difficult is it for adolescents to get and use birth control?” These types of questions violate the rule that closed-ended questions should be unidimensional (i.e., they ask about only one topic at a time). Double-barreled questions produce ambiguous answers and contribute to measurement error because the researcher cannot disentangle which question the respondent actually answered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Ecological Fallacy

A

A problem of inference when a relationship at the aggregate is assumed to hold at the individual level. Example: Party affiliation for a state used to determine individual voting behavior. It assumes that individuals in the study group will have the average characteristics of that group. The problem lies in the false assumption that a correlation between group characteristics (e.g., religion in a geographically defined population and incidence of suicide in that population) will be reproduced at the level of the individual (e.g., individual Protestants will be more likely to commit suicide than Catholics). This is a problem that can arise when one uses inappropriate units of analysis (with respect to the units of observation). For example, you cannot safely draw conclusions about individual voters based on data regarding their precincts; to do so would be to perpetuate an ecological fallacy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Effect Size

A

Effect size is a name given to a family of indices that measure the magnitude of a treatment effect. This includes the standardized mean difference statistic, the odds ratio, the correlation coefficient, the rate difference, and the rate ratio. Unlike significance tests, these indices are independent of sample size. ES measures are the common currency of meta-analysis studies that summarize the findings from a specific area of research.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Endogeneity

A

Variables influenced by other variables in a model. In path diagrams all endogenous variables have error terms associated with them because there are almost always causes of an observed score on an endogenous variable other than the exogenous causes modeled, including random measurement error, and omitted causes not included in the study. We can think of endogenous variables as dependent variables, but it is important to remember that an endogenous variable can also predict other variables in the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Expectation

A

The mean of a statistic based on repeated sampling. The expected value of a random variable, denoted as E(X). For a random variable, it is the integral of the random variable with respect to its probability measure. Intuitively, it is the long-run average: if a test could be repeated many times, expectation is the mean of all the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

External Validity

A

Concerns whether inferences/conclusions hold over variations in persons, settings, treatments, and outcomes (generalizability). This is one of Cook and Campbell’s 4 “types” of validity, and it is the highest in the hierarchy. This means that it should be considered as a final objective, after statistical conclusion, internal, and construct validity have already been reasonably well established.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Formative Evaluation

A

Evaluation activities that assess the conduct of programs during their early stages. These activities are undertaken to furnish information that will guide program improvement, and/or to begin developing the measures and instruments that will permit ongoing evaluation of the program to be implemented. Thus, formative evaluation can shape or form the program to improve its performance, and is usually desired by program evaluators who will be called upon later to demonstrate that a program has met its goals and objectives. Formative evaluation can be focused on program development, targeting and structural issues, or can be conducted like mini-impact evaluations. It may include testing/assessing a program at certain sites, or with a small sample of targets, prior to full implementation. This often allows opportunities to pretest evaluation procedures, as well as the intervention itself. (Rossi & Freeman)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Goodness-of-fit Statistic

A

Describes how well a statistical model fits a set of observations. Common GOF statistics are Pearson’s statistic and the Likelihood-Ratio Statistic. The GOF statistic summarizes the discrepancy between observed values and the values expected under the model in question. It indicates the variance explained by the chosen model. In other words, it estimates how well the observed data fit the pattern predicted by the explanatory model or how well does the model predict what it is supposed to predict.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Heterogeneity

A

Refers to variance of responses, respondents, conditions, or treatments. When there is heterogeneity of units within conditions of an outcome variable, the standard deviations on that variable, and any others correlated with it, will be greater. Heterogeneity is a threat to statistical conclusion validity because it can obscure systematic covariation between treatment and outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Heteroscedasticity

A

In statistics, a group of random variables is heteroscedastic when there are sub-populations with different variances. One of the Ordinary Least Squares regression assumptions is that the error term has a constant variance, or is homoscedastic. Heteroscedasticity can result from things such as measurement error in the dependent variable and from interaction between included and excluded independent variables. When graphed, heteroscedasticity generally looks like a megaphone around the regression line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Human Subjects

A

A living individual about whom an investigator conducting research obtains 1) data through intervention or interaction with the individual, or 2) identifiable private information (US Federal Guidelines). Research involving human subjects must gain approval from an Institutional Review Board, which considers the basic ethical principles of respect for persons, beneficence, and justice. These principles are applied through the use of informed consent procedures (voluntary participation), assessment of risks and benefits, ability of participant to choose to be excluded at any time, participants being informed of study design (Belmont Report).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Impact Evaluation

A

An evaluative study that answers questions about program outcomes and impact on the social conditions it is intended to ameliorate. Also known as impact assessment or outcome evaluation. Impact evaluation requires methods for separating the treatment effect (the effect of interest) from confounding effects. Evaluation of impact is a part of the evaluation process that attempts to attribute the desired outcomes of program to the program itself. The objective of impact evaluations is to determine the net impact of a program on the outcome intended by the program.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Indirect Effect

A

In path analysis and causal modeling, the relationship between X and Y is said to be indirect if X causes Z which in turn causes Y. A predictor variable has an indirect effect on the outcome if there is another variable on the causal path between the first predictor variable and the outcome of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Informed Consent

A

The process of giving prospective study participants the information they need to decide whether or not they want to participate in a study given its risks and benefits. U.S. federal guidelines indicate that there are a number of pieces of information that a human subject must be given in order to provide informed consent including limits of confidentiality, a statement that participation is voluntary and can be stopped at any time, a description of the purpose of the research and the procedures to be followed, and a description of any foreseeable risks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Instrumental Variables

A

In statistics, econometrics, epidemiology and related disciplines, the method of instrumental variables (IV) is used to estimate causal relationships when controlled experiments are not feasible or when a treatment is not successfully delivered to every unit in a randomized experiment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Intention to Treat

A

A method of analysis for randomized trials in which all patients randomly assigned to one of the treatments are analyzed together, regardless of whether or not they completed or received that treatment. This analysis preserves the benefits of random assignment for causal inference but yields an unbiased estimate only about the effects of being assigned to treatment, not of actually receiving treatment. The inference yielded by the intent to treat analysis is often of great policy interest because if a treatment is implemented widely as a matter of policy, imperfect treatment implementation will occur. So the intent to treat analysis gives an idea of the likely effects of the treatment-as-implemented in policy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Interaction

A

In experiments, when the effects of treatment vary over levels of another variable. It describes a situation where the effects of two variables on a third variable are not additive. Other names for interaction are effect-measure modification or moderation. Causal partners interact with one another. One example of interaction is how, among oral contraceptive users, the effect of smoking on cardiovascular & cerebrovascular disease varies by age (age is the effect-measure modifier that interacts with smoking).

44
Q

Internal Validity

A

The validity of inferences about whether the relationship between two variables is causal. Is concerned with determining to what extent we can be sure the effects on a DV (outcome) are due to the IV (predictor). To demonstrate internal validity, one must show that the IV preceded the DV and that no other explanations for the relationship are plausible (i.e., rule out confounding). Some threats to internal validity include: ambiguous temporal precedence, systematic differences in selection, attrition, repeated testing, and maturation.

45
Q

Inter-rater Reliability

A

A measure of the equivalence of information obtained by different data gatherers on the same (or comparable) groups of respondents (Aday). In general, a correlation of answers among raters of .80 or higher is desirable. Inter-rater reliability can be computed through a number of statistics including a Pearson, Spearman, Kappa, and Intra-class correlation coefficient.

46
Q

Intervening vs. Spurious Factors

A

Another way to talk about mediator vs. confounder. An intervening variable is a link in the causal chain (lies in the causal path), it mediates the relationship between IV and DV and helps explain the relationship between the two. A spurious variable does not belong in the causal chain, it is a confounder and distorts the relationship between variables to which it is also related.

47
Q

Intraclass Correlation (Coefficients)

A

A measure of inter-rater reliability. Compares the variability of different ratings of the same subject to the total variation across all ratings and all subjects. It accounts for the proportion of total variance accounted for by within-subject variation. Pros: it’s a flexible measure that can be used in many complex research designs. It can measure agreement when there are more than two data gatherers, can assess the reliability of both single and mean ratings, and indicates both the pattern and level of agreement. Cons: not very suitable for comparing across populations. The same instrument could be judged reliable or unreliable depending on the population in which it is being used.

48
Q

Kappa

A

A measure of inter-rater reliability for categorical variables. Kappa takes chance agreement into consideration, which makes it a stronger measure than just percentage agreement. It measures the agreement between two raters only. Ranges from 1 (perfect agreement) to -1 (complete disagreement), with 0 meaning no agreement above what is expected by chance. Kappa does not account for different types of agreement.

49
Q

Latent Variable

A

A variable that is not directly observed but is inferred or estimated from observed variables. The latent variable is the underlying phenomenon or construct that a scale is intended to reflect. Ex.: Intelligence is a latent variable that might be observed through various survey measures or tests.

50
Q

Likert Scale

A

A measurement format for survey questions that consists of a fairly strong statement that a respondent rates using an ordinal response scale. The item is presented as a declarative sentence, followed by response options that indicate varying degrees of agreement with or endorsement of the statement. It generally includes a 3-7 point ordinal response scale ranging from strongly agree to strongly disagree and is typically used to measure subjective or attitudinal phenomena. A good scale should have meaningful differences between options and avoid weak statements that elicit too much agreement.

51
Q

Meta-analysis

A

The quantitative review of many experiments that address a common question. In its simplest form, it is the process of using statistical methods to combine the results of different studies, thus obtaining a more powerful, stable estimate of effect magnitude. Meta-analyses are often components of a systematic review procedure. One must watch for publication bias, how things are weighted, and the samples used/attrition. Example: combining the results of various studies about the relationship between hormonal birth control and breast cancer.

52
Q

Moderating vs. Mediating Variables

A

Mediating variable = a third variable that comes between a cause and effect and also transmits the causal influence from the cause to the effect. Without it the outcome would not occur how and when it does. Moderating variable = a variable that influences the effects of the exposure (IV). It interacts with the IV, and could change its strength and/or direction (interaction) Ex.: Relationship between athletic prowess and popularity depends on gender (gender is the moderator).

53
Q

Moderator Variable Example

A

The relationship between college GPA and starting salary is moderated by job interview skills (Vogt & Johnson 2011).

54
Q

Mediator Variable Example

A

Parents transmit their social status to their children through education (the mediating variable) (Vogt & Johnson 2011).

55
Q

Multicollinearity

A

In multiple regression analysis, multicollinearity exists when two or more independent variables are highly correlated; this makes it difficult to determine their separate effects on the dependent variable. The tolerance and the VIF are two methods of detecting multicollinearity.

56
Q

Multitrait-Multimethod Matrix

A

An approach to assessing the construct validity of a set of measures in a study that was originally proposed by Campbell and Fiske. It is a correlation matrix used to examine the convergent and discriminant validity of a construct. The matrix contains correlations among two or more constructs (traits) measured in two or more ways.

57
Q

Network/Multiplicity Sampling

A

A sampling strategy in which respondents are asked to report both for themselves and also for others linked to them in clearly specified ways. Commonly used linkages are familial relationships. Linkages to neighbors have also been used. This is helpful for studying rare populations or conditions. The probability of being sampled is associated with the size of the network. Generally increases response error, because respondents often cannot report on others as accurately as they can report on themselves.

58
Q

Nonrecursive Causal Model

A

A causal model that postulates that a variable can, at different times, be both a cause and an effect; that this, there is a reciprocal relationship between two or more variables. For example, if you believed that education increases knowledge and that knowledge increases individuals’ tendency to seek more education, you would be postulating a nonrecursive cause model (Vogt & Johnson 2011).

59
Q

Odds Ratio

A

The ratio of two odds. In epidemiology, the disease odds ratio is expressed as the odds of disease given exposure, divided by the odds of disease given no exposure. Odds ratios are often used in case control studies, where risk or rate differences are usually not obtainable, but attributable proportion is.

60
Q

P Value

A

Probability of the observed value of the test statistic if the null hypothesis were true. Researchers reject the null hypothesis when the p value is very small (alpha level of .05 or less). When the p value is small, the researcher believes that the result was not due merely to chance. For example, p=.001 means that the observed statistic would occur, in the long run, only once every thousand times.

61
Q

Panel Study

A

A longitudinal study of the same group (or “panel”) of subjects. Surveys the same individuals at different times, whereas a cohort study usually uses independent samples from the same group/cohort at different times. The panel study is thought of as a true longitudinal study; a cohort study is an approximation (Vogt & Johnson 2011).

62
Q

Partial Correlation Coefficient

A

A measure of the linear relationship between x and y when all other variables in the model are controlled for. The opposite of a partial correlation is a simple relation, that is, one uncomplicated by considering other variables.

63
Q

Predictive Validity

A

The extent to which a test, scale, or other measurement predicts subsequent performance of behavior. Also called criterion-related validity. Example: how well SAT scores predict students’ academic success in college. A high correlation between SAT scores and future GPA scores would provide evidence for predictive validity.

64
Q

Primary Sampling Unit

A

When multi-stage sampling is used, this is the general name for the first sampling unit chosen. For example, you might first sample blocks, then sample households within the block. The blocks are PSUs. Sampling units are also called clusters (as in cluster sampling). In a single-stage sampling design, the PSU is the only sampling unit, which is also equivalent to the “element” or “observation unit.”

65
Q

Probability Sample

A

A sample in which each case that could be chosen has a known probability of being included in the sample. Probalility sampling relies on laws of chance for units to be selected into the sample. Random selection is part of the process of every probability sample.

66
Q

Purposive Sample

A

A sample composed of subjects selected deliberately (on purpose) by researchers, usually because they think certain characteristics are typical or representative of the population. This is not based on probability. The investigator selects a sample of people, events, or settings that she believes will yield the most comprehensive understanding of her subject of study. Most often used in qualitative studies. May introduce bias.

67
Q

Process Evaluation

A

A form of program evaluation designed to determine whether the program is delivered as intended to the target recipients. Also known as implementation assessment. Process evaluation assesses 1) extent to which the program is reaching the appropriate population(s), 2) the degree to which service delivery is consistent with design specifications, and 3) the expenditure of resources in the process of conducting the program.

68
Q

Program Impact

A

The portion of an outcome change that can be attributed uniquely to a program. That is, the impact of the program with the influence of other sources controlled or removed. Also termed program effect.

69
Q

Prospective Study (vs. Case Control Study)

A

In a prospective study, the investigator first specifies a population and identifies the exposure status for each subject, and then divides the population on the basis of the presence or absence of the exposure. This population is followed over a period of time and the cases (of disease or other outcome) are collected. This is a longitudinal, or panel, study. In a case-control study, those with and without the disease are identified at the beginning of the study and exposure status is determined later. A case-control study permits calculation of the Odds Ratio: (Odds of exposure among diseased)/(Odds of exposure among non-diseased).

70
Q

Quasi-experimental Design

A

A type of research design for conducting studies in field or real-life situations where the researchers may be able to manipulate some independent variables but cannot randomly assign subjects to control and experimental groups. This happens either because the researcher has no control over the conditions (e.g., a natural disaster), or it would be unethical to assign a group to a particular condition (e.g., lead exposure). More feasible than experimental studies, but may sacrifice internal validity.

71
Q

Random Assignment

A

Putting subjects into experimental and control groups in such a way that each individual in each group is assigned entirely by chance. Each subject has an equal probability of being placed in each group. Reduces likelihood of bias.

72
Q

Recursive Causal Model

A

A causal model in which all the causal influences are assumed to work in one direction only (asymmetric). The arrow points from A to B (not both ways). For example, looking at age and math achievement, age might influence students’ math achievement, but math achievement won’t influence age. On the other hand, your model of the relationship between achievement and time spent studying would be non-recursive, since they could influence one another.

73
Q

Relative Risk

A

The ratio of the risk of disease in the exposed group to the risk of disease in the unexposed group. It is calculated in cohort or prospective studies. An RR=20 means that the exposed are 20 times more likely of developing the disease, as compared to the unexposed.

74
Q

Residual vs. Error Term

A

Two closely related and easily confused measures of the deviation of an observation from its “theoretical value.” Error = the deviation of an observation from the (unobservable) true function value. Residual = the difference between an observation and the value predicted by a model (Vogt & Johnson 2011). Both terms refer to degree of “inaccuracy” rather than a mistake.

75
Q

Respondent Driven Sample

A

A statistically advanced version of snowball-sampling that is used to find members of hard-to-reach populations through their social networks. What makes it advanced is that it applies a complicated mathematical model that weights the sample to compensate for the fact that it was not obtained through simple random sampling. This enables the researcher to make unbiased population estimates while using a method that has traditionally been regarded as merely a method of convenience sampling.

76
Q

Selection Bias

A

Errors due to systematic differences between those who take part in the study and those who do not. Selection bias results from study procedures that give a result among participants that is different from the result that would occur among individuals who are eligible but not included in the study.

77
Q

Selection and Participation Bias in Epi Research

A

Errors due to systematic differences between those who take part in the study and those who do not. Selection bias results from study procedures that give a result among participants that is different from the result that would occur among individuals who are eligible but not included in the study. Participation bias is an error that occurs when those who choose to provide particular data are systematically different from those who do not (i.e, answering a question about HIV or sexuality).

78
Q

Sample Selection Bias

A

Any selection procedure that causes comparison groups to be different. This is a problem because the researcher wants the groups to differ on only the manipulated variable, not on any other extraneous variable. If they differ on both, it will not be clear which one produced the observed effect.

79
Q

Sensitivity vs. Specificity

A

Sensitivity is the ability of a diagnostic test to identify the presence of a disease or condition (ability of a test to avoid false positives). Specificity is the ability of a test to judge that subjects do not have a disease or condition (ability of a test to avoid false negatives).

80
Q

Simple Random Sample vs. Cluster Sample

A

Simple random sample = Each unit in a population has an equal likelihood of being surveyed (taking names out of a hat). Gold standard of probability sampling. Need to know the boundaries of the population of interest, to know the probability of choosing a unit from it. Cluster sample = A population is divided into groups (e.g., city blocks) and a random sample of these groups is selected. All of the units or a sample of the units within these groups are then surveyed. Will have wider confidence intervals than simple random sampling. Way to retain the diversity of the sample while cutting costs (such as travel).

81
Q

Snowball Sample

A

A non-probability sampling method in which a group of individuals serve as initial contacts, and these subjects provide the name of a fixed number of other individuals who fulfill research criteria. The researcher approaches these persons, asks them to participate, and each subject who agrees is then asked to provide additional names. The researcher continues this process for as many stages as desired. This method is most useful in studies of hidden populations, social networks, and small, bounded populations (appropriate to field research).

82
Q

Specification Error

A

Refers to mistakes in specifying an appropriate theoretical structure/model for one’s data. The most common errors include leaving out an important variable and/or including an irrelevant variable. In other words, specification error is a failure to account properly for the actual variables influencing an outcome of interest.

83
Q

Spurious Correlation

A

A correlation between any two variables that does not result from a direct relation between them but from their relation to other variables. In path analysis, products of the paths linking the two variables to any third variables that cause them both (i.e., confounding).

84
Q

Spurious vs. Intervening Test Variable

A

A spurious variable (confounder) exerts an effect on both the independent and dependent variable. An intervening variable (mediator) comes between the two and only exerts influence on the dependent variable. Whereas an intervening variable is a link in the causal chain, a spurious variable does not belong in the causal chain (instead, it distorts the relationship between variables to which it is also related).

85
Q

Spuriousness

A

The incorrect inference of a causal relationship between two variables where the relationship is either accidental or the result of confounding. Researchers attempt to identify and eliminate spuriousness by the use of random assignment in an experimental design or through the use of control variables in the manipulation of data during analysis.

86
Q

Standard Error

A

The term “standard error” is used to refer to the standard deviation of various sample statistics such as the mean or median. For example, the “standard error of the mean” refers to the standard deviation of the distribution of sample means taken from a population. The smaller the standard error, the more representative the sample will be of the overall population. The standard error is also inversely proportional to the sample size - the larger the sample size, the smaller the standard error because the statistic will approach the actual value.

87
Q

Standardized Effect

A

Metric-free measures of effect. Standard effect size measures are typically used when the metrics of variables being studied do not have intrinsic meaning, when results from multiple studies that used different metrics are being compared, or when a meta-analysis is being performed. Examples: r, Cohen’s d, odds ratio)

88
Q

Standardized Regression Coefficient

A

A statistic that provides a way to compare the relative importance of different variables in a multi-regression analysis. Comparison is possible because the regression coefficients are expressed as z scores. Often called the beta coefficient.

89
Q

Statistical Conclusion (vs. Internal Validity)

A

Statistical conclusion refers to the accuracy of conclusions about covariation of variables made on the basis of statistical evidence. In other words, it refers to the appropriate use of statistics to infer whether the presumed independent and dependent variables covary. Internal validity refers to whether the covariation between variables resulted from a causal relationship. Establishing internal validity includes demonstrating that the IV preceded the DV and ruling out alternative explanations for their association. Both of these kinds of validity pertain only to the relationships between observables (variables as measured) within the study sample and context.

90
Q

Statistical Power

A

The power of a statistical test is the probability that the test will reject the null hypothesis when the alternative hypothesis is true (ie, that it will not make a type II error). In other words, it is the ability to correctly identify an actual effect or association. Power is affected by the magnitude of the effect of interest in the population and the sample size used to detect the effect.

91
Q

Stratified Random Sample

A

A type of probability sampling in which the sampling frame is divided into strata, along a characteristic that is likely to be meaningful to the issue under study (e.g. gender, class, race). Then a random sample is drawn from each cell. Good for ensuring adequate representation of subgroups. Can be proportionate (the strata are sampled in proportion that they appear in the population) or disproportionate (certain strata are over-represented in the sample, typically to get enough power to do sub-group analyses).

92
Q

Summative Evaluation

A

Evaluation research conducted in the latter stages of a program to assess its impact or to determine how well it has met is goals. Often undertaken to help policy makers decide whether to continue, expand, reduce, or terminate a program’s funding.

93
Q

Suppressor/Suppression Effect

A

A suppressor is a variable that conceals or reduces a relationship between other variables. It is a particular type of confounder that masks a causal relationship between IV and DV. For example, education tends to increase people’s liberalism. But it also tends to increase people’s incomes, and increased incomes tends to reduce people’s liberalism. In this case, income is the suppressor variable (Vogt & Johnson 2011).

94
Q

Test of Significance vs. Confidence Interval

A

A test of significance measures the discrepancy between a sample statistic and the value of the population parameter specified in the null hypothesis, taking into account sampling error. Correct interpretation: “assuming that the null is correct, the probability is less than p that the observed results will also be observed in the population.” A CI is an interval estimate of a population parameter that is used to indicate the reliability of an estimate. It gives a range of plausible values for a true population parameter Correct interpretation (using 95% confidence level as example): “if the statistical model is correct, then if the study was repeated many times in the same population, it would deliver a CI that included the true value of the parameter 95% of the time.”

95
Q

Difference between b and Beta Coefficients

A

Beta = the standardized regression coefficient. b = the unstandardized regression coefficient. Standardized beta coefficients are considered more appropriate for comparing the importance of different X’s in predicting an outcome. The unstandardized coefficient (in the original metric of the variable) may be preferable for making comparisons across populations/datasets.

96
Q

Three Conditions for Causal Relationship

A
  1. cause precedes effect, 2. the cause is related to the effect, and 3. alternative explanations can be ruled out. This classic analysis was formalized by John Stuart Mill. Randomized experiments are particulairly well suitied for exploring these three conditions because radomization enables 1, 2, and 3. Same with prospective. But with case-control, establishing temporarilty is difficult.
97
Q

Time Series Analysis

A

Time series = a sequence of data points, measured typically at successive times spaced at uniform intervals. Example of a time series: Dow Jones Index. Time series analysis = methods for analyzing time series data in order to extract meaningful statistics from the data. Time series data have a natural temporal ordering. Data points closer together are typically more closely related. Statistical methods for time series analysis are classified into either the time domain or the frequency domain.

98
Q

True Score

A

The score that would be obtained if there were no errors in the measurement. The expected number-correct score over an infinite number of independent administrations of a test. Unfortunately, test users never observe a person’s true score, only an observed score, which is the true score plus some error.

99
Q

Two Assumptions of Ordinary Least Squares (OLS)

A

OLS is a statistical estimation method of determining a regression equation, that is an equation that best represents the relationship between the dependent and independent variables. Three assumptions of OLS include: 1) The equation is linear and additive, and 2) there does not exist perfect multicollinearity between variables, and 3) variance of the error term must be constant (homoscedasticity).

100
Q

Type 1 Error

A

An error made by wrongly rejecting a true null hypothesis (false positive). This might involve incorrectly concluding that two variables are related when they are not, or wrongly deciding that a sample statistic exceeds the value that would be expected by chance.

101
Q

Type II Errors

A

An error made by wrongly accepting a false null hypothesis (false negative). You can decrease the probability of making a Type II error by increasing the sample size.

102
Q

Unbiased Estimator

A

A sample statistic that is free from systematic error. Over repeated sampling, the average value of an unbiased estimator will be equal to the true population parameter. Alternatively stated, the expected value of an unbiased estimator is equal to the true population parameter. Produce unbiased estimates, which contain only random error.

103
Q

Underidentification

A

In path analysis, if the number of unknown parameters is greater than the number of known pieces of information, the model is underidentified. In other words, there are fewer linearly independent equations than unknowns. In this situation, an infinite number of possible solutions exist, making causal inference unsupportable. It is not possible to estimate all of the model’s parameters.

104
Q

Unstandardized Regression Coefficient

A

Estimates resulting from a regression analysis carried out on variables that use their original scaling (as opposed to standardized variables that all have a variance of 1). It can be interpreted as the predicted change in Y given a one unit change in X. One pro of unstandardized coefficients is that they can be immediately translated into an impact, but a con to using them is that it is more difficult to compare the impact of different IVs.

105
Q

Z Score

A

In statistics, a measure of how many standard deviations an observation is above or below the mean. It is a dimensionless quantity derived by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation. It is only defined if one knows the population parameters.