Statistics and Research Design - Flash Cards
(45 cards)
Effect Size
An effect size is a measure of the magnitude of the relationship between independent and dependent variables and is useful for interpreting the relationship’s clinical or practical significance (e.g., for comparing the clinical effectiveness of two or more treatments). Several methods are used to calculate an effect size, including Cohen’s d (which indicates the difference between two groups in terms of standard deviation units) and eta squared (which indicates the percent of variance in the dependent variable that is accounted for by variance in the independent variable).
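A minimal Python sketch (with hypothetical treatment and control scores) of how Cohen’s d can be computed as the difference between two group means expressed in pooled standard deviation units:

```python
import numpy as np

def cohens_d(group1, group2):
    """Cohen's d: mean difference expressed in pooled standard deviation units."""
    g1, g2 = np.asarray(group1, dtype=float), np.asarray(group2, dtype=float)
    n1, n2 = len(g1), len(g2)
    # Pooled standard deviation (each group's variance computed with n - 1)
    pooled_sd = np.sqrt(((n1 - 1) * g1.var(ddof=1) + (n2 - 1) * g2.var(ddof=1)) / (n1 + n2 - 2))
    return (g1.mean() - g2.mean()) / pooled_sd

# Hypothetical treatment and control scores
treatment = [24, 27, 30, 29, 26, 31]
control = [21, 23, 25, 22, 24, 26]
print(round(cohens_d(treatment, control), 2))  # difference in standard deviation units
```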
Cluster Analysis
Cluster analysis is a multivariate technique that is used to group people or objects into a smaller number of mutually exclusive and exhaustive subgroups (clusters) based on their similarities - i.e., to group people or objects so that the identified subgroups have within-group homogeneity and between-group heterogeneity.
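As an illustration, one common clustering algorithm (k-means, here via scikit-learn, applied to hypothetical data) assigns each case to exactly one cluster so that within-cluster variability is minimized:

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical scores for 6 people on two measures
X = np.array([[1.0, 2.0], [1.2, 1.8], [0.9, 2.1],
              [8.0, 9.0], [8.3, 8.7], [7.9, 9.2]])

# k-means groups cases into k mutually exclusive clusters by minimizing
# within-cluster (within-group) variability
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(labels)  # cluster membership for each case, e.g., [0 0 0 1 1 1] or [1 1 1 0 0 0]
```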
Experimentwise Error Rate
The experimentwise error rate (also known as the familywise error rate) is the probability of making at least one Type I error across all of the statistical comparisons conducted in a study. As the number of statistical comparisons increases, the experimentwise error rate increases.
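For roughly independent comparisons each tested at the same per-comparison alpha, the experimentwise error rate is often approximated as 1 - (1 - alpha)^c, where c is the number of comparisons; a quick calculation shows how it grows:

```python
# Approximate experimentwise (familywise) error rate for c independent
# comparisons, each tested at the same per-comparison alpha:
# P(at least one Type I error) = 1 - (1 - alpha)^c
alpha = 0.05
for c in (1, 3, 5, 10):
    print(c, round(1 - (1 - alpha) ** c, 3))
# 1 0.05, 3 0.143, 5 0.226, 10 0.401 -- the rate grows with the number of comparisons
```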
Standard Deviation
The standard deviation is a measure of dispersion (variability) of scores around the mean of the distribution. It is the square root of the variance and is calculated by dividing the sum of the squared deviation scores by N (or N - 1) and taking the square root of the result.
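A short worked example in Python (hypothetical scores) showing the calculation with N in the denominator and with N - 1 (the unbiased sample estimate):

```python
import math

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical scores
n = len(scores)
mean = sum(scores) / n

sum_sq_dev = sum((x - mean) ** 2 for x in scores)
population_sd = math.sqrt(sum_sq_dev / n)        # divide by N
sample_sd = math.sqrt(sum_sq_dev / (n - 1))      # divide by N - 1 (unbiased estimate)
print(population_sd, sample_sd)                  # 2.0 and about 2.14
```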
Random Error
Random error is error that is unpredictable and varies from one observation to the next. Sampling error and measurement error are types of random error.
Random Assignment
Random assignment involves randomly assigning subjects to treatment groups and is sometimes referred to as “randomization.” It is considered the “hallmark” of true experimental research because it enables an investigator to conclude that any observed effect of an IV on the DV is due to the IV rather than to error. (Random assignment must not be confused with random selection, which refers to randomly selecting subjects from the population.)
One-Way ANOVA
The one-way ANOVA is a parametric statistical test used to compare the means of two or more groups when a study includes one IV and one DV that is measured on an interval or ratio scale. The one-way ANOVA yields an F-ratio that indicates whether any of the group means differ significantly. The F-ratio represents a measure of treatment effects plus error divided by a measure of error only (MSB/MSW). When the treatment has had an effect, the F-ratio is larger than 1.0.
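A minimal sketch using scipy.stats.f_oneway (hypothetical scores for three groups) that returns the F-ratio and its p-value:

```python
from scipy import stats

# Hypothetical DV scores for three treatment groups
group1 = [10, 12, 11, 13, 12]
group2 = [14, 15, 13, 16, 15]
group3 = [9, 8, 10, 9, 11]

f_ratio, p_value = stats.f_oneway(group1, group2, group3)
print(f_ratio, p_value)  # F = MSB/MSW; a small p suggests at least one mean differs
```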
MANOVA (Multivariate Analysis of Variance)
The MANOVA is a form of the ANOVA that is used when a study includes one or more IVs and two or more DVs that are each measured on an interval or ratio scale. Use of the MANOVA helps reduce the experimentwise error rate and increase power by simultaneously analyzing the effects of the IV(s) on all of the DVs.
Factorial ANOVA
The factorial ANOVA is the appropriate statistical test when a study includes two or more IVs (i.e., when the study has used a factorial design) and a single DV that is measured on an interval or ratio scale. It is also referred to as a two-way ANOVA, three-way ANOVA, etc., with the words “two” and “three” referring to the number of IVs.
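One way to run a factorial (two-way) ANOVA in Python is with statsmodels; the sketch below uses hypothetical data with two IVs (drug, therapy) and one continuous DV, and requests both main effects and the interaction:

```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

# Hypothetical balanced 2 x 2 factorial data: two IVs (drug, therapy), one continuous DV
df = pd.DataFrame({
    "drug":    ["yes", "yes", "no", "no"] * 4,
    "therapy": ["cbt", "none"] * 8,
    "dv":      [12, 9, 8, 6, 13, 10, 7, 5, 11, 9, 8, 7, 14, 8, 6, 6],
})

# 'C(drug) * C(therapy)' requests both main effects and the interaction
model = ols("dv ~ C(drug) * C(therapy)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))
```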
Alpha
Alpha is the probability of rejecting the null hypothesis when it is true; i.e., the probability of making a Type I error. The value of alpha is set by the experimenter prior to collecting or analyzing the data. In psychological research, alpha is commonly set at .01 or .05.
Normal Curve/Areas Under The Normal Curve
A normal curve is a symmetrical bell-shaped distribution that is defined by a specific mathematical formula. When scores on a variable are normally distributed, a predictable proportion of observations falls within the areas of the normal curve that are defined by the standard deviation: In a normal distribution, about 68% of observations fall between the scores that are plus and minus one standard deviation from the mean, about 95% between the scores that are plus and minus two standard deviations from the mean, and about 99.7% between the scores that are plus and minus three standard deviations from the mean.
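These proportions can be verified from the standard normal cumulative distribution function, e.g., with scipy.stats.norm:

```python
from scipy.stats import norm

# Proportion of a normal distribution falling within k standard deviations of the mean
for k in (1, 2, 3):
    proportion = norm.cdf(k) - norm.cdf(-k)
    print(k, round(proportion, 4))
# 1 0.6827, 2 0.9545, 3 0.9973
```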
Randomized Block ANOVA
The randomized block ANOVA is the appropriate statistical test when blocking has been used as a method for controlling an extraneous variable (i.e., when the extraneous variable is treated as an independent variable). It allows an investigator to statistically analyze the main and interaction effects of the extraneous variable.
Null And Alternative Hypotheses
In experimental research, an investigator tests a verbal research hypothesis by simultaneously testing two competing statistical hypotheses. The first of these, the null hypothesis, is stated in a way that implies that the independent variable does not have an effect on the dependent variable. The second statistical hypothesis, the alternative hypothesis, states the opposite of the null hypothesis and is expressed in a way that implies that the independent variable does have an effect.
LISREL
LISREL is a structural equation (causal) modeling technique that is used to verify a predefined causal model or theory. It is more complex than path analysis: it allows two-way (non-recursive) paths and takes into account observed variables, the latent traits they are believed to measure, and the effects of measurement error.
Cross-Validation/Shrinkage
Cross-validation refers to validating a correlation coefficient (e.g., a criterion-related validity coefficient) on a new sample. Because the same chance factors operating in the original sample are not operating in the subsequent sample, the correlation coefficient tends to “shrink” on cross-validation. In terms of the multiple correlation coefficient (R), shrinkage is greatest when the original sample is small and the number of predictors is large.
Chi-Square Test (Single-Sample And Multiple-Sample)
The chi-square test is a nonparametric statistical test that is used with nominal data (or data that are being treated as nominal data) - i.e., when the data to be compared are frequencies in each category. The single-sample chi-square test is used when the study includes one variable; the multiple-sample chi-square test is used when it includes two or more variables. (When counting variables for the chi-square test, independent and dependent variables are both included.)
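A brief sketch (hypothetical frequencies) of both versions using scipy.stats: chisquare for the single-sample (goodness-of-fit) test and chi2_contingency for the multiple-sample test:

```python
from scipy.stats import chisquare, chi2_contingency

# Single-sample (goodness-of-fit) test: observed frequencies in one variable's categories
observed = [18, 22, 20, 40]           # e.g., preference counts for four brands
print(chisquare(observed))            # default expectation: equal frequencies per category

# Multiple-sample test: a 2 x 3 contingency table of frequencies for two variables
table = [[30, 20, 10],
         [15, 25, 20]]
chi2, p, dof, expected = chi2_contingency(table)
print(chi2, p, dof)
```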
Multiple Regression/Multicollinearity
Multiple regression is a multivariate technique that is used for predicting a score on a continuous criterion based on performance on two or more continuous and/or discrete predictors. The output of multiple regression is a multiple correlation coefficient (R) and a multiple regression equation. Ideally, predictors included in a multiple regression equation will have low correlations with each other and high correlations with the criterion. A high correlation between predictors is referred to as multicollinearity.
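A minimal sketch (hypothetical data, ordinary least squares via numpy) showing a two-predictor regression equation, the multiple correlation R as the correlation between predicted and observed criterion scores, and a simple multicollinearity check:

```python
import numpy as np
from numpy.linalg import lstsq

# Hypothetical data: two predictors (X1, X2) and a continuous criterion (y)
X1 = np.array([2.0, 4.0, 6.0, 8.0, 10.0, 12.0])
X2 = np.array([1.0, 3.0, 2.0, 5.0, 4.0, 6.0])
y  = np.array([5.0, 9.0, 11.0, 16.0, 17.0, 22.0])

# Fit y = b0 + b1*X1 + b2*X2 by ordinary least squares
X = np.column_stack([np.ones_like(X1), X1, X2])
coefs, *_ = lstsq(X, y, rcond=None)
print(coefs)  # intercept and regression weights

# Multiple correlation R: correlation between predicted and observed criterion scores
print(np.corrcoef(X @ coefs, y)[0, 1])

# Multicollinearity check: correlation between the predictors themselves
print(np.corrcoef(X1, X2)[0, 1])  # a high value signals multicollinearity
```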
Experimental Research (True and Quasi-Experimental)
Experimental research involves conducting an empirical study to test hypotheses about the relationships between independent and dependent variables. A true experimental study permits greater control over experimental conditions, and its “hallmark” is random assignment to groups. A quasi-experimental study permits less control and does not include random assignment (e.g., because it uses pre-existing or intact groups).
Sampling Distribution of the Mean/Standard Error of the Mean
The sampling distribution of the mean is the distribution of sample means that would be obtained if an infinite number of equal-size samples were randomly selected from the population and the mean for each sample was calculated. The sampling distribution is approximately normal in shape (increasingly so as the sample size increases), its mean is equal to the population mean, and its standard deviation (the standard error of the mean) is equal to the population standard deviation divided by the square root of the sample size. The sampling distribution is used in inferential statistics to determine how likely it is to obtain a particular sample mean given the population mean, the population standard deviation, the sample size, and the level of significance.
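A short simulation (hypothetical population values) comparing the theoretical standard error of the mean with the standard deviation of many sample means:

```python
import numpy as np

rng = np.random.default_rng(0)
pop_mean, pop_sd, n = 100, 15, 25

# Theoretical standard error of the mean: population SD divided by the square root of n
print(pop_sd / np.sqrt(n))  # 3.0

# Empirical check: means of many equal-size random samples from the population
sample_means = [rng.normal(pop_mean, pop_sd, n).mean() for _ in range(10_000)]
print(np.mean(sample_means), np.std(sample_means))  # close to 100 and 3.0
```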
Skewed Distributions
Skewed distributions are asymmetrical distributions in which the majority of scores are located on one side of the distribution. In a positively skewed distribution, most scores are on the low side of the distribution but a few scores are on the high (positive) side, and the mean is greater than the median, which, in turn, is greater than the mode. In a negatively skewed distribution, the majority of scores are on the high side of the distribution but a few are on the low (negative) side, and the mode is greater than the median, which is greater than the mean.
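A tiny numeric illustration with a hypothetical positively skewed set of scores, where the mean exceeds the median, which exceeds the mode:

```python
from statistics import mean, median, mode

# Hypothetical positively skewed scores: most scores are low, a few are high
scores = [1, 2, 2, 2, 3, 3, 4, 10]
print(mean(scores), median(scores), mode(scores))  # 3.375 > 2.5 > 2
```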
Mixed Designs
Mixed designs are a type of factorial design in which at least one IV is a between-groups variable and at least one IV is a within-subjects variable.
Probability Sampling
When using probability sampling, each element in the target population has a known chance of being selected for inclusion in the sample. Methods of probability sampling include simple random sampling, stratified random sampling, and cluster sampling. In contrast to simple random sampling and stratified random sampling (which involve selecting individuals from the population), cluster sampling involves selecting units or groups of individuals from the population (e.g., schools, hospitals, clinics).
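A simplified Python sketch (hypothetical population of 100 people across 5 schools) contrasting the three methods: simple random and stratified sampling select individuals, whereas cluster sampling selects whole schools:

```python
import random

random.seed(0)
population = [{"id": i, "school": f"school_{i % 5}"} for i in range(100)]

# Simple random sampling: every individual has an equal chance of selection
simple = random.sample(population, 10)

# Stratified random sampling: randomly sample individuals within each stratum (school)
strata = {}
for person in population:
    strata.setdefault(person["school"], []).append(person)
stratified = [p for members in strata.values() for p in random.sample(members, 2)]

# Cluster sampling: randomly select whole units (schools), then include their members
chosen_schools = random.sample(sorted(strata), 2)
cluster = [p for s in chosen_schools for p in strata[s]]

print(len(simple), len(stratified), len(cluster))  # 10, 10, 40
```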
Within-Subjects Designs
Within-subjects designs are experimental research designs in which each subject receives, at different times, each level of the IV (or combinations of the IVs) so that comparisons on the DV are made within subjects rather than between groups. The single-group time-series design is a type of within-subjects design.