Statistics Flashcards
Cluster sampling
involves selecting units or groups of individuals from the population (e.g., schools, hospitals, clinics).
exists in contrast to simple random sampling and stratified random sampling (which involve selecting individuals from the population)
Probability Sampling
When using probability sampling, each element in the target population has a known chance of being selected for inclusion in the sample.
Methods of probability sampling include (see the sketch below):
- simple random sampling,
- stratified random sampling, and
- cluster sampling.
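As a rough illustration, the sketch below draws each type of sample from a hypothetical pandas DataFrame; the population of 1,000 people, the school and stratum columns, and the sample sizes are made-up values for demonstration.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Hypothetical population: 1,000 people nested in 20 schools, two strata.
population = pd.DataFrame({
    "person_id": np.arange(1000),
    "school": rng.integers(0, 20, size=1000),              # cluster membership
    "stratum": rng.choice(["urban", "rural"], size=1000),
})

# Simple random sampling: each individual has an equal, known chance of selection.
srs = population.sample(n=100, random_state=0)

# Stratified random sampling: individuals are sampled within each stratum.
stratified = population.groupby("stratum").sample(frac=0.1, random_state=0)

# Cluster sampling: whole schools are selected, then everyone in those schools is included.
chosen_schools = rng.choice(population["school"].unique(), size=5, replace=False)
cluster = population[population["school"].isin(chosen_schools)]
```

Note that in cluster sampling the unit of selection is the group (the school), not the individual, which is the contrast drawn in the cluster sampling card above.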
Non-Parametric Tests
Nonparametric tests are inferential statistical tests used to analyze nominal or ordinal data (or interval or ratio data when the assumptions for a parametric test have not been met). They include (see the sketch below):
- chi-square test
- Mann-Whitney U test
- Wilcoxon matched-pairs test
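Each of these tests is available in scipy.stats; the sketch below applies them to made-up data (a 2x2 frequency table for the chi-square test, two independent groups for the Mann-Whitney U test, and paired scores for the Wilcoxon test).

```python
import numpy as np
from scipy import stats

# Chi-square test of independence on a 2x2 frequency table (nominal data).
observed = np.array([[30, 10],
                     [20, 40]])
chi2, p_chi2, dof, expected = stats.chi2_contingency(observed)

# Mann-Whitney U test: two independent groups, ranked (ordinal) data.
group_a = [3, 5, 4, 6, 7, 2]
group_b = [8, 9, 7, 6, 10, 9]
u_stat, p_u = stats.mannwhitneyu(group_a, group_b)

# Wilcoxon matched-pairs signed-rank test: two related (paired) measurements.
pre  = [10, 12, 9, 15, 11, 13]
post = [11, 14, 12, 19, 16, 7]
w_stat, p_w = stats.wilcoxon(pre, post)
```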
Benefits of Parametric Tests
An advantage of parametric tests is that they are more statistically “powerful” than nonparametric tests (i.e., more likely to detect a true effect when one exists).
They include the Student’s t-test and the analysis of variance.
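A minimal sketch of both tests using scipy.stats, with made-up scores:

```python
from scipy import stats

# Student's t-test: compare the means of two independent groups.
group_1 = [23, 25, 28, 30, 27, 26]
group_2 = [31, 29, 33, 35, 30, 32]
t_stat, p_t = stats.ttest_ind(group_1, group_2)

# One-way analysis of variance: compare the means of three (or more) groups.
group_3 = [40, 38, 42, 41, 39, 43]
f_stat, p_f = stats.f_oneway(group_1, group_2, group_3)
```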
Parametric Tests
Parametric tests are inferential statistical tests that are used when the data to be analyzed represent an interval or ratio scale and when certain assumptions about the population distribution(s) have been met - i.e., when scores on the variable of interest are normally distributed and when there is homoscedasticity (population variances are equal).
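These two assumptions are commonly screened with a normality test and a test of equal variances; one reasonable approach (not the only one) is sketched below using the Shapiro-Wilk and Levene tests from scipy.stats on made-up data.

```python
from scipy import stats

group_1 = [23, 25, 28, 30, 27, 26, 24, 29]
group_2 = [31, 29, 33, 35, 30, 32, 34, 28]

# Normality: Shapiro-Wilk test on each group's scores.
p_norm_1 = stats.shapiro(group_1).pvalue
p_norm_2 = stats.shapiro(group_2).pvalue

# Homoscedasticity: Levene's test for equal population variances.
p_equal_var = stats.levene(group_1, group_2).pvalue

# Non-significant results (e.g., p > .05) are consistent with the assumptions being met.
```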
Normal Curve/Areas Under The Normal Curve
In a normal distribution,
- about 68% of observations fall between the scores that are plus and minus one standard deviation from the mean,
- about 95% between the scores that are plus and minus two standard deviations from the mean, and
- about 99.7% between the scores that are plus and minus three standard deviations from the mean.
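These areas can be recovered from the standard normal cumulative distribution function; a quick check with scipy:

```python
from scipy.stats import norm

# Area under the standard normal curve within ±1, ±2, and ±3 standard deviations.
for k in (1, 2, 3):
    area = norm.cdf(k) - norm.cdf(-k)
    print(f"within ±{k} SD: {area:.4f}")  # ~0.6827, ~0.9545, ~0.9973
```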
Experimentwise Error Rate
The experimentwise error rate (also known as the familywise error rate) is the probability of making a Type I error (rejecting the null hypothesis when it is actually true, i.e., claiming an “effect” when there is no effect).
As the number of statistical comparisons in a study increases, the experimentwise error rate increases.
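Assuming the comparisons are independent and each is tested at the same per-comparison alpha, the experimentwise error rate is 1 - (1 - alpha)^c for c comparisons; a quick illustration:

```python
alpha = 0.05  # per-comparison Type I error rate

# Experimentwise (familywise) error rate grows with the number of comparisons.
for c in (1, 3, 5, 10):
    experimentwise = 1 - (1 - alpha) ** c
    print(f"{c} comparisons: {experimentwise:.3f}")  # 0.050, 0.143, 0.226, 0.401
```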
Mixed (Split Plot) ANOVA
The mixed ANOVA is a type of factorial ANOVA that is used when a study includes at least one between-groups independent variable and one within-subjects independent variable.
Cross-Validation/Shrinkage
Cross-validation refers to validating a correlation coefficient (e.g., a criterion-related validity coefficient) on a new sample. Because the same chance factors operating in the original sample are not operating in the subsequent sample, the correlation coefficient tends to “shrink” on cross-validation. In terms of the multiple correlation coefficient (R), shrinkage is greatest when the original sample is small and the number of predictors is large.
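Shrinkage can also be estimated without collecting a new sample by using the adjusted R-squared formula, a common formula-based alternative to empirical cross-validation; the sample sizes and predictor counts below are illustrative.

```python
def adjusted_r_squared(r_squared, n, k):
    # Adjust R^2 for sample size (n) and number of predictors (k).
    return 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

# Shrinkage is greatest when the sample is small and the number of predictors is large.
print(adjusted_r_squared(0.50, n=200, k=3))   # ~0.49 (little shrinkage)
print(adjusted_r_squared(0.50, n=20,  k=10))  # ~-0.06 (severe shrinkage)
```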
One-Way ANOVA F Ratio
The one-way ANOVA yields an F-ratio that indicates whether any group means differ significantly. The F-ratio represents a measure of treatment effects plus error divided by a measure of error only (MSB/MSW, the mean square between groups divided by the mean square within groups). When the treatment has had an effect, the F-ratio is larger than 1.0.
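A worked sketch of the computation on made-up scores, cross-checked against scipy:

```python
import numpy as np
from scipy import stats

groups = [np.array([4, 5, 6, 5]),
          np.array([7, 8, 9, 8]),
          np.array([10, 11, 12, 11])]

all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
k, n = len(groups), len(all_scores)

# Mean square between groups: treatment effects plus error.
ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ms_between = ss_between / (k - 1)

# Mean square within groups: error only.
ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
ms_within = ss_within / (n - k)

f_ratio = ms_between / ms_within            # well above 1.0 when the treatment has an effect
f_check, p_value = stats.f_oneway(*groups)  # same F-ratio
```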
One-Way ANOVA
The one-way ANOVA is a parametric statistical test used to compare the means of two or more groups when a study includes one IV and one DV that is measured on an interval or ratio scale.
Trend Analysis
Trend analysis is a type of analysis of variance that is used to assess linear and nonlinear trends when the independent variable is quantitative.
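A minimal sketch using orthogonal polynomial contrast coefficients for three equally spaced levels of the IV; the group means are made up, and a full trend analysis would also test each contrast against the ANOVA error term.

```python
import numpy as np

# Group means of the DV at three equally spaced levels of a quantitative IV
# (e.g., low, medium, and high dose) -- illustrative numbers only.
means = np.array([10.0, 14.0, 15.0])

# Orthogonal polynomial contrast coefficients for three levels.
linear    = np.array([-1, 0, 1])
quadratic = np.array([1, -2, 1])

# Larger (absolute) contrast values suggest stronger linear or quadratic trends.
print("linear trend:   ", linear @ means)     # 5.0
print("quadratic trend:", quadratic @ means)  # -3.0
```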
Sampling Distribution
How is it Used
The sampling distribution is used in inferential statistics to determine how likely it is to obtain a particular sample mean given the following (a worked example appears below):
- the population mean,
- the population standard deviation,
- the sample size, and
- the level of significance.
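A worked example with illustrative numbers (population mean 100, population standard deviation 15, sample size 36, alpha = .05):

```python
import math
from scipy.stats import norm

mu, sigma, n, alpha = 100, 15, 36, 0.05

sem = sigma / math.sqrt(n)   # standard error of the mean (2.5)
z = (105 - mu) / sem         # where a sample mean of 105 falls in the sampling distribution (2.0)
p = 1 - norm.cdf(z)          # probability of a sample mean this high or higher (~0.0228)

print(p < alpha)             # True: unlikely at the .05 level of significance
```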
Standard Error of the Mean
The standard error of the mean is the standard deviation of the sampling distribution of the mean. It is equal to the population standard deviation divided by the square root of the sample size.
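In symbols, where sigma is the population standard deviation and n is the sample size:

```latex
\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}
```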
Sampling Distribution
Shape and Mean
- The sampling distribution is normally shaped, and
- its mean is equal to the population mean.
Sampling Distribution of the Mean
Definition
The sampling distribution of the mean is the distribution of sample means that would be obtained if an infinite number of equal-size samples were randomly selected from the population and the mean for each sample was calculated.
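This definition can be approximated by simulation; in the sketch below, the (deliberately non-normal) population, the sample size, and the number of samples are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical, positively skewed population of 100,000 scores.
population = rng.exponential(scale=10, size=100_000)
mu, sigma = population.mean(), population.std()

# Draw many equal-size samples and record each sample's mean.
n = 50
sample_means = np.array([rng.choice(population, size=n).mean() for _ in range(10_000)])

print(sample_means.mean(), mu)                 # close to the population mean
print(sample_means.std(), sigma / np.sqrt(n))  # close to the standard error of the mean
```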
Dependent Variables
The dependent variable (DV) is the variable that is believed to be affected by the independent variable and is observed and measured.
Independent Variables
The independent variable (IV) is the variable that is believed to have an effect on the dependent variable and is varied or manipulated by the researcher in an experimental research study.
Each independent variable in a study must have at least two levels.
Scales Of Measurement
- nominal
- ordinal
- interval
- ratio
A nominal scale yields “frequency data” (the frequency of observations in each nominal category). Ordinal, interval, and ratio scales provide scale values or scores.
negatively skewed distribution
In a negatively skewed distribution, the majority of scores are on the high side of the distribution but a few are on the low (negative) side, and the mode is greater than the median, which is greater than the mean.
positively skewed distribution
In a positively skewed distribution, most scores are on the low side of the distribution but a few are on the high (positive) side, and the mean is greater than the median, which, in turn, is greater than the mode.
Skewed Distributions
Skewed distributions are asymmetrical distributions in which the majority of scores are located on one side of the distribution.
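A quick simulation check of the mean-median ordering in a positively skewed distribution (the exponential population is an arbitrary choice; the mode is omitted because it is not well defined for continuous simulated scores):

```python
import numpy as np

rng = np.random.default_rng(1)

# Positively skewed scores with a long right tail (e.g., reaction times or incomes).
scores = rng.exponential(scale=10, size=10_000)

print(np.mean(scores) > np.median(scores))  # True: the mean is pulled toward the high tail
```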
Random Assignment
Random assignment involves randomly assigning subjects to treatment groups and is sometimes referred to as “randomization.”
It is considered the “hallmark” of true experimental research because it enables an investigator to conclude that any observed effect of an IV on the DV is due to the IV rather than to pre-existing differences between the groups or other sources of systematic error.
(Random assignment must not be confused with random selection, which refers to randomly selecting subjects from the population.)
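A minimal sketch of the procedure, assuming a hypothetical pool of 20 already-selected subjects split into two equal-size groups:

```python
import numpy as np

rng = np.random.default_rng(7)

# Subjects already selected for the study (random selection is a separate step).
subjects = [f"S{i:02d}" for i in range(1, 21)]

# Random assignment: shuffle the pool, then split it into treatment and control groups.
shuffled = rng.permutation(subjects)
treatment_group, control_group = shuffled[:10], shuffled[10:]
```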
Mode
The mode is the most frequently occurring score or category, and it is used as a measure of central tendency for nominal variables or variables that are being treated as nominal variables.