Flashcards in Research Deck (82)
What are the benefits of using a one-way ANOVA?
An advantage of the one-way ANOVA (or any ANOVA) is that it helps control the "experimentwise error rate" (i.e., the probability of making a Type I error). If alpha is set at .05 for the study, for instance, the probability of making a Type I error is held at 5%. In contrast, if individual t-tests were conducted instead, each at the .05 level, the overall probability of making a Type I error would be much higher.
What will allow you to control the effects of an extraneous variable?
The analysis of covariance is useful for this purpose.
Will adding 12 points increase the mean and other measures of central tendency by 12 points? Will this have any effect on the standard deviation or other measures of variability?
Yes - the mean and other measures of central tendency each increase by 12 points. There is no effect on the SD or any other measure of variability.
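A quick pure-Python check of this card, using hypothetical test scores:

```python
import statistics

# Hypothetical test scores; add the 12-point constant to every score
scores = [70, 75, 80, 85, 90]
shifted = [s + 12 for s in scores]

# Central tendency shifts up by exactly 12 points...
mean_shift = statistics.mean(shifted) - statistics.mean(scores)

# ...but the deviations from the mean are unchanged, so the SD is too
same_spread = statistics.stdev(shifted) == statistics.stdev(scores)
```

Adding a constant moves every score (and the mean) by the same amount, so the deviations around the mean, and hence every measure of variability, stay the same.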
1. The _____ product moment correlation coefficient is the appropriate correlation coefficient when both variables are measured on a continuous scale.
2. The __________ is used to correlate two variables that are measured in terms of ranks.
3. The _______ is used to determine the correlation between two dichotomous variables.
4. The ___________ is appropriate when one variable is continuous and the other is a true dichotomy.
1. Pearson r
2. Spearman rho (rho = ranks), e.g., correlation between height and shoe size
3. Phi coefficient (e.g., living or dead)
4. Point biserial correlation coefficient (e.g., time and dead/alive)
Decreasing the level of significance from .05 to .01 makes it more difficult to reject the null hypothesis and, therefore, also _______ power.
Decreasing the susceptibility of the dependent variable measure to measurement (random) error would ________ power by ensuring that the measure is able to accurately detect the effects of the independent variable.
When appropriately used, a one-tailed test is ______ powerful than a two-tailed test since it places the entire rejection region in only one tail of the sampling distribution rather than splitting it up between the two tails.
What are some bivariate techniques? And why are they used?
What bivariate technique summarizes the degree of association between variables with a single number?
How do you determine the amount of shared variability or coefficient of determination?
When should a bivariate coefficient be squared?
When is a correlation NOT squared, and how should it be interpreted instead?
What are some correlation coefficients, and what types of variables does each require?
Which coefficient is used when the relationship between variables is non-linear?
Bivariate techniques describe or summarize the degree of association between two variables; they include the correlation coefficients listed below.
The correlation coefficient is squared to determine the amount of shared variability (the coefficient of determination): in other words, the squared correlation coefficient indicates the proportion of variability in Y that is explained or accounted for by the variability in X.
A bivariate correlation coefficient should be squared to obtain a measure of shared variability only when it indicates the degree of association between two different variables.
A correlation coefficient is not squared when it is a reliability coefficient, which is a correlation of a measure with itself.
-A reliability coefficient is interpreted directly as a measure of "true score variability."
-Pearson r (interval or ratio; interval or ratio)
-Spearman rho (rank-order; rank-order)
-Contingency (nominal; nominal)
-Point biserial (true dichotomy; interval or ratio)
-Biserial (artificial dichotomy; interval or ratio)
A factorial research design is any design that includes ________ "factors" (independent variables)?
two or more
Pearson r and other correlation coefficients range in value from ____. The magnitude of the coefficient indicates the relationship's strength; the sign (+ or -) indicates the relationship's __________. With a _____ correlation, the value of Y increases as the value of X increases. With a _______ correlation, the value of Y decreases as the value of X __________.
-1.0 to + 1.0
What are the three assumptions that must be met in order to produce an accurate correlation?
-Linearity: the relationship between X and Y can be described by a straight line
-Unrestricted range: data are collected from people who are heterogeneous with regard to the characteristics measured by X and Y
-Homoscedasticity: the variability of Y scores is about the same at all values of X
________ is used to predict Y from a single X, and an assumption is that the relationship between X and Y is ______ (i.e., the relationship can be described by a straight line, the regression line or "line of best fit"). What technique is used to locate the regression line in a scatterplot?
Least squares criterion, which locates the line so that the amount of error in prediction is minimized.
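A minimal pure-Python sketch of the least-squares criterion; the x/y values below are hypothetical:

```python
# Hypothetical paired observations
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8]

mean_x = sum(xs) / len(xs)
mean_y = sum(ys) / len(ys)

# The least-squares slope and intercept minimize the sum of
# squared vertical distances between the points and the line
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
         / sum((x - mean_x) ** 2 for x in xs))
intercept = mean_y - slope * mean_x

# Predicted Y for any X falls on the regression line
def predict(x):
    return slope * x + intercept
```

No other line through the scatterplot produces a smaller sum of squared prediction errors than this one.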
Statistical power refers to the ability to reject a false null hypothesis and is affected by the size of the ______.
In other words, when a statistical test enables an investigator to reject a false null hypothesis, it is said to have statistical power.
What are the methods to increase statistical power?
-Decreasing the susceptibility of the dependent variable measure to measurement (random) error would increase power by ensuring that the measure is able to accurately detect the effects of the independent variable. (minimize error)
-Increase alpha (e.g., set alpha at .05 rather than .01).
-When appropriately used, a one-tailed test is more powerful than a two-tailed test since it places the entire rejection region in only one tail of the sampling distribution rather than splitting it up between the two tails.
-Maximizing differences between treatment groups does help increase power.
-use a parametric test
-increase sample size
The value of the Pearson product-moment correlation coefficient ranges from -1.0 to +1.0, and the corresponding proportion of variance (which is referred to as the coefficient of determination) is computed by ______ the correlation coefficient.
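A pure-Python sketch of computing Pearson r and squaring it to get the coefficient of determination; the scores are hypothetical:

```python
import math

# Hypothetical paired scores on two continuous variables
xs = [2.0, 4.0, 6.0, 8.0, 10.0]
ys = [1.0, 3.0, 5.0, 9.0, 12.0]

mx = sum(xs) / len(xs)
my = sum(ys) / len(ys)
sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
sxx = sum((x - mx) ** 2 for x in xs)
syy = sum((y - my) ** 2 for y in ys)

# Pearson r always falls between -1.0 and +1.0
r = sxy / math.sqrt(sxx * syy)

# Coefficient of determination: proportion of Y's variability explained by X
r_squared = r ** 2
```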
In addition to the Pearson r, what are some other coefficients?
What determines statistical significance?
-Spearman rho (rank order / rank order)
-Phi (true dichotomy / true dichotomy)
-Tetrachoric (artificial dichotomy / artificial dichotomy)
-Point biserial (true dichotomy / interval or ratio)
-Biserial (artificial dichotomy / interval or ratio)
-Eta (interval or ratio / interval or ratio; used for nonlinear relationships)
-Compare the coefficient to the appropriate critical value, which is determined by alpha and sample size.
-With a small sample, a larger coefficient is needed to reach significance.
How do you interpret a Correlation Coefficient?
-Directly, in terms of degree of association (from -1.0 to +1.0). Correlations indicate association, not causation.
-When a correlation represents the degree of association between two different variables, it can be squared to represent a coefficient of determination which provides a measure of shared variability.
What does a regression (Regression Analysis, Multiple Regression) do?
Name some types of multiple regressions.
When is an analysis of variance used? What is the advantage of using an ANOVA?
When is an ANOVA better than a t-test?
Name some other ANOVAs
What analysis of variance is used for a within-subjects design, when different levels of the IV (or combinations of the levels of two or more IVs) are sequentially administered (over time) to each subject?
How is an F-ratio calculated using a one-way ANOVA?
Multiple regression is a multivariate technique used when two or more continuous or discrete predictors will be used to predict status on a single continuous criterion.
ANOVA: used to compare two or more means.
Advantage: it compares group means while holding the probability of making a Type I error at the level of significance set by the investigator -- it helps to control the experimentwise error rate.
The t-test and ANOVA are comparable when comparing two means, but the ANOVA is the statistic of choice when three or more means are compared. Further, the ANOVA is more complex and analyzes the variability around the means.
-Factorial ANOVA: two or more IVs
-Randomized block ANOVA: treats an extraneous variable as an IV
-Analysis of covariance (ANCOVA): combines analysis of variance and regression analysis; statistically removes the effect of an extraneous variable (the covariate) from the DV
-Repeated measures ANOVA: within-subjects design; each level (or combination of levels) of the IV is administered to each subject
-Mixed (split-plot) ANOVA: mixed design; at least one IV is between-groups and at least one is within-subjects
-Trend analysis: evaluates the shape or form of the relationship (statistically significant linear or nonlinear trend)
-Multivariate analysis of variance (MANOVA): 1+ IVs and 2+ DVs
-Repeated measures ANOVA
When using a one-way ANOVA to determine whether an independent variable has had a significant effect on a dependent variable, an F-ratio is calculated by dividing the mean square between (MSB) by the mean square within (MSW).
Divide MSB by MSW (manage social behavior by a Masters of Social work)
--MSB provides an estimate of variability due to treatment plus error, while MSW provides an estimate of variability due to error only. When the independent variable has had an effect, MSB will be larger than MSW and the F-ratio will be larger than 1.0; the larger the F-ratio, the more likely that the effect of the independent variable is statistically significant.
( the bigger the behavior MSB the larger the F-ratio and the more statistically significant the "issue")
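The MSB/MSW arithmetic can be sketched in pure Python; the three treatment groups below are hypothetical:

```python
# Hypothetical scores for three treatment groups
groups = [[4, 5, 6], [7, 8, 9], [10, 11, 12]]

all_scores = [s for g in groups for s in g]
grand_mean = sum(all_scores) / len(all_scores)
k = len(groups)                 # number of groups
n = len(all_scores)             # total number of subjects

def mean(g):
    return sum(g) / len(g)

# MSB estimates variability due to treatment plus error
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
ms_between = ss_between / (k - 1)

# MSW estimates variability due to error only
ss_within = sum((s - mean(g)) ** 2 for g in groups for s in g)
ms_within = ss_within / (n - k)

# F well above 1.0 suggests the independent variable had an effect
f_ratio = ms_between / ms_within
```

Here the group means differ sharply while within-group spread is small, so MSB dwarfs MSW and the F-ratio is large.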
Cohen's d and eta squared are commonly used for what?
Which is used to measure the difference between two groups (experimental and control) in terms of SDs?
And which indicates the percent of variance in the outcome variable that is accounted for by variance in the treatment?
Both are measures of effect size. Cohen's d measures the difference between two group means in SD units; eta squared indicates the percent of variance in the outcome variable accounted for by variance in the treatment.
How should significant main and interaction effects be interpreted when using a factorial ANOVA to assess the effects of two independent variables on a dependent variable?
What type of design is used when the effects of different levels of an IV are assessed by administering each level to a different group of subjects and comparing the status or performance of the groups on the DV?
What design administers all levels of the IV sequentially to all subjects?
Which design combines the between-groups and within-subjects approaches by including at least one between-groups IV and at least one within-subjects IV, often measuring the DV across trials/time, where trial/time is an additional, within-subjects IV?
-Between-groups design
-Within-subjects (repeated measures) design
-Mixed (split-plot) design
What is main and interaction effect?
Main effects refer to the effects of one independent variable on the dependent variable when considered alone, while interaction effects refer to the effects of one independent variable at different levels of another independent variable.
When the interaction is significant, the effects of one independent variable differ at different levels of another independent variable. Thus, it is not possible to conclude that an independent variable has consistent main effects, and the main effects must be interpreted with caution.
Main effect: the effect of one IV on the DV, disregarding the effects of all other IVs
Interaction: the effects of two or more IVs considered together; occurs when the effects of one IV differ at different levels of another IV
Measures of Central Tendency in Skewed Distribution
Positive skew: Mode < Median < Mean (the mean is the highest)
Negative skew: Mean < Median < Mode (the mode is greater than the median, which is greater than the mean)
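A small pure-Python illustration of the ordering in a positively skewed distribution, using hypothetical scores:

```python
import statistics

# Hypothetical positively skewed scores: the high outlier pulls the mean up
scores = [1, 2, 2, 2, 3, 3, 4, 10]

mode = statistics.mode(scores)      # most frequent score
median = statistics.median(scores)  # middle score
mean = statistics.mean(scores)      # pulled toward the long right tail

# With a positive skew: mode < median < mean
```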
Name the measures of variability or spread?
How is the variance calculated?
How is the Standard Deviation Calculated?
Variance: the average squared deviation from the mean (the SD squared)
Standard deviation: the square root of the variance
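In pure Python, with a hypothetical score list:

```python
import statistics

scores = [2, 4, 4, 4, 5, 5, 7, 9]
m = statistics.mean(scores)

# Variance: the average squared deviation from the mean (population formula)
variance = sum((s - m) ** 2 for s in scores) / len(scores)

# Standard deviation: the square root of the variance
sd = variance ** 0.5
```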
What design describes a study in which each participant receives only one level of Variable A but all levels of Variable B?
Split-plot (mixed) design
What do multivariate techniques do?
These techniques are categorized as dependence methods and interdependence methods. What is the difference?
What techniques predict status on a variable?
What multivariate techniques test a causal model or theory?
What multivariate techniques are used for the purpose of data reduction, and why is this important?
They investigate the relationships among three or more variables. Multivariate data analysis is a set of statistical models that examine patterns in multidimensional data by considering several variables at once. It is an expansion of bivariate data analysis, which considers only two variables in its models. Because multivariate models consider more variables, they can examine more complex phenomena and find data patterns that more accurately represent the real world.
-Dependence methods have distinct independent and dependent variables (predictor and criterion); they relate to cause-effect situations and try to see whether one set of variables can describe or predict the values of another.
-Interdependence methods do not distinguish independent from dependent variables; they include several data-reduction techniques and aim to understand the underlying structural intercorrelations and patterns of the data.
-Prediction: multiple regression (2+ discrete or continuous IVs (predictors) and 1 continuous DV (criterion))
-Canonical correlation (2+ IVs and 2+ DVs (criteria))
-Discriminant Function Analysis (2+ continuous predictors and 1 Nominal criterion) A discriminant analysis (also known as discriminant function analysis) involves using scores on two or more predictors to predict an individual's membership in a criterion group - i.e., it is used when the criterion is measured on a nominal scale.
-Logistic regression: an extension of discriminant analysis, but it assumes the relationships are curvilinear
Test a theory: Causal modeling
Consider as an example the regression model -- a method to analyze correlations in data. The non-multivariate case of regression is the analysis between two variables, and it is called a __________ regression. It could be used, for instance, to see how the height of a swimmer correlates with their speed. By doing this type of regression, the analyst could find that taller swimmers tend to swim faster. Although this is correct, we know that height is not the only thing influencing speed, so the bivariate model hardly explains the complete phenomenon of swimming.
In contrast, a _________ regression -- also called multiple regression -- could take into account many more variables: weight, age, carbohydrate intake, protein intake, number of training hours, number of resting hours, and many others. In theory, the more variables included, the more accurately the regression can represent the phenomenon of swimming, to the point where it could pinpoint the speed of a new swimmer with little error.
________ data variables are always numeric and represent information that can be measured on some scale. Examples include age (20 years), temperature (25 ºC), and profit (US$ 2000). The number specifies the magnitude of the value on a given scale.
_______ variables categorize the data but do not specify its magnitude. Examples include an operating system (Windows, Linux, macOS) and house size (small, medium, large). The list of options that a non-metric variable can assume is called its levels or categories. Even when the levels have an inherent order (e.g., a large house is bigger than a small house), it is still a non-metric variable because there is no magnitude attached (the variable doesn't say how much bigger the house is). Note that non-metric variables can also be numeric when they are not attached to any scale, such as a variable holding the ID numbers of objects.
Most multivariate techniques perform computations that need numbers as inputs, so how can a technique work with non-metric data?
The answer is that a non-metric variable can be converted into dichotomous metric variables. In this conversion, each level becomes a new metric variable that can only take the values 0 (false) or 1 (true). For instance, consider a non-metric variable that classifies the color of a product with the levels black, white, and gray. The variable can be replaced with two new ones: isColorBlack and isColorWhite. If a product is black, they take the values 1 and 0 respectively; if it is white, the values 0 and 1. There is no need for a variable for gray products because they can take the values 0 and 0: if a product is neither black nor white, it can only be gray.
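The color example can be sketched directly; the function name below is illustrative, and the variable names come from the example above:

```python
# Convert the non-metric Color variable (black / white / gray) into
# two dichotomous 0/1 metric variables; gray is the baseline (0, 0)
def dummy_code(color):
    is_color_black = 1 if color == "black" else 0
    is_color_white = 1 if color == "white" else 0
    return is_color_black, is_color_white

# black -> (1, 0), white -> (0, 1), gray -> (0, 0)
codes = {c: dummy_code(c) for c in ("black", "white", "gray")}
```

With three levels, two indicator variables are enough: the omitted level is identified by both indicators being 0.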
In the dependence techniques of multivariate analysis, the analyst feeds a model with input data, specifying which variables are independent and which are dependent. The ___________ variables are the ones the model will try to predict or explain (e.g., swimmer speed). The _______ variables (e.g., swimmer height) are the ones the analyst studies to see how much they affect the dependent ones.
The goal of all dependence techniques is to establish a cause-effect relationship. The most notable differences between them are the number of independent variables they support and the nature of the variables involved.
What multivariate technique is used to predict the sales performance of different stores based on their attributes (e.g., number of vendors, number of hours open)? Such an analysis would lead to a deeper understanding of what makes each store sell more, which could drive administrative changes in the most important attributes toward values that give higher profit.
Multiple regression is an option when the analyst stipulates only one dependent variable, which is metric. The result of applying a multiple regression is the degree of impact that each independent variable has on the dependent one. That result also leads to an estimation function, which accepts values for the independent variables and returns the expected value of the dependent one.
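A minimal pure-Python sketch of multiple regression via the normal equations, using hypothetical store data (vendors and hours open as predictors of sales; the data are constructed so that sales = 5 + 2*vendors + 1*hours):

```python
# Hypothetical store data: (number of vendors, hours open) -> sales
predictors = [(3, 8), (5, 10), (4, 12), (6, 9)]
sales = [19.0, 25.0, 25.0, 26.0]

# Design matrix with an intercept column; solve (X^T X) b = X^T y
X = [[1.0, v, h] for v, h in predictors]
A = [[sum(r[i] * r[j] for r in X) for j in range(3)] for i in range(3)]
b = [sum(r[i] * y for r, y in zip(X, sales)) for i in range(3)]

# Gauss-Jordan elimination on the 3x3 normal equations
for i in range(3):
    pivot = A[i][i]
    A[i] = [v / pivot for v in A[i]]
    b[i] /= pivot
    for j in range(3):
        if j != i:
            factor = A[j][i]
            A[j] = [a - factor * c for a, c in zip(A[j], A[i])]
            b[j] -= factor * b[i]

intercept, b_vendors, b_hours = b

# The estimation function: expected sales for new predictor values
def predict(vendors, hours):
    return intercept + b_vendors * vendors + b_hours * hours
```

The fitted coefficients show each predictor's impact on the criterion, and `predict` is the resulting estimation function.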
The classic example is classification. After processing the data, the model can classify future entries that don't have labels. For instance, a model could analyze characteristics of music fragments (the independent variables), where each piece is assigned to a musical genre (the dependent variable). If the analyst builds a successful model, it can classify the genre of fragments it has never seen before. What type of multivariate technique is being used?
(Dependent variable: one non-metric variable; independent variables: metric)
-Multiple discriminant analysis is very similar to machine learning classifiers. It is an option when there is only one dependent variable, which is non-metric -- also called the "class" or "label". The goal is to understand the characteristics of the data that pertain to each class.
-- A discriminant analysis (also known as discriminant function analysis) involves using scores on two or more predictors to predict an individual's membership in a criterion group - i.e., it is used when the criterion is measured on a nominal scale.
A team of aerodynamics engineers is designing a new aircraft and wants to measure whether several combinations of engines and wings affect the magnitude of the forces on airplanes (e.g., thrust, drag, lift, weight).
In a simulation environment, the engineers choose three types of engines (E1, E2, E3) and three types of wings (W1, W2, W3) -- both the engine type and the wing type are independent variables. They develop airplanes for all of the engine-wing combinations and launch them in many virtual spaces to collect as much force data as possible (the dependent variables). What type of multivariate technique would work best in this study? How can the researchers fine-tune these results?
MANOVA: requires one or more independent variables and two or more dependent measures
The application of MANOVA to the collected data could reveal that the combination E1-W2 is significantly worse, while E3-W1 is significantly better. The engineers can see how each engine, each wing, and each combination impacts each of the forces. It is not an easy technique to conduct or interpret, but it is a rewarding and powerful one.
-The multivariate analysis of covariance (MANCOVA) can fine-tune the results and reinforce the study's validity by removing the effects of possible unobserved variables (for example, whether it was raining in the simulations). Thus, even if these factors affect the dependent variables, MANCOVA reduces their impact to isolate the effect of the treatments as much as possible.
What is the difference between internal validity, face validity, construct validity, and external validity?
Internal validity focuses on the causal relationship between independent and dependent variables.
Face validity focuses on whether a test looks like it measures what it is intended to measure.
Construct validity is established when a test measures the intended hypothetical trait.
External validity focuses on the generalizability of one study to other conditions, individuals, etc.
An investigator uses a factorial ANOVA to assess the effects of two independent variables on a dependent variable and obtains significant main and interaction effects. When interpreting the results of her study, should the investigator interpret the main effects with caution because the interaction is significant, or interpret the interaction with caution because the main effects are significant?
Interpret the main effects with caution, since the interaction is significant.
When the interaction is significant, this means that the effects of one independent variable differ for different levels of another independent variable. Thus, it is not possible to conclude that the independent variable has consistent main effects. For example, a study might find that, overall, Teaching Method #1 is superior to Teaching Method #2 (i.e., there is a main effect of teaching method). However, there might also be an interaction between teaching method and level of self-esteem - for example, Teaching Method #1 might be more effective for students with high and moderate self-esteem, while Teaching Method #2 is more effective for students with low self-esteem. In this situation, the main effect of teaching method would have to be interpreted with caution.