2.4 - A Statistical Primer Flashcards

1
Q

Descriptive Statistics

A

a set of techniques used to organization, summarize, and interpret data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Statistics used to describe and understand the data:

A

Frequency, central tendency, variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Distribution

A

1) whether some scores occurred more often than others

2) whether all the scores were clumped in the middle or more evenly spaced across the whole range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Histogram

A

Bar graph

*vertical axis shows the frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Frequency

A

the number of observations that fall within a certain category or range of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Normal Distribution (Bell Curve)

A

a symmetrical distribution with values clustered around a central, mean value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Negatively Skewed Distribution

A

a distribution in which the curve has an extended tail to the left of the cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Positively Skewed Distribution

A

a distribution in which the curve has an extended tail to the right of the cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Skews occur because?

A

there is an upper or lower limit to the data

(ex. person cannot take less than 0 mins on a quiz, curve of quiz time cannot continue indefinitely to the left, beyond the 0 point)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Central Tendency

A

a measure of the central point of distribution

*measured usually by the mean, but there are exceptions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Three different measures of Central Tendency

A

mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mean

A

the arithmetic average of set numbers

ex. class averages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Median

A

the 50th percentile - the point on the horizontal axis at which 50% of all observations are lower, and 50% of all observations are higher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mode

A

the category with the highest frequency (category w/ most observations)

*measure that is used least

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Which to use to calculate central tendency when mean, median, and mode are equal?

A

Normally distributed data - Mean

Mode = measure that is used least, provides less info than other two, used when dealing w/ categories of data (ex. when you vote for a candidate, the mode = candidate w/ most votes)

Skewed Data (Positively/Negatively) - median (extreme values have a large effect on mean but will not affect the median

*the longer the tail, the more the mean is pulled away from the centre of the curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Variability

A

the degree to which scores are dispersed in a distribution
(some are spread out, some are clustered)

Higher Variability = larger # of cases that are closer to the extreme ends of the continuum for that set data
(ex. lots of excellent AND poor students in one class)

Lower Variability = most scores are similar
(ex. call filled with all “B” students)

*can be caused by measurement errors, imperfect measurement tools, differences between participants in the study, characteristics of participants on that given day (ex. mood, fatigue levels)

17
Q

Standard Deviation

A

a measure of variability around the mean (estimate of the average distance from the mean)

*links central tendency and variability

18
Q

The ______ always marks the 50th percentile of the distribution.

A

median

19
Q

The ______ is a measure of variability around the mean of a distribution.

A

standard deviation

20
Q

A histogram is created that presents data on the number of mistakes made on a memory test by participants in a research study. The vertical axis indicates?

A

the frequency of errors made

21
Q

In a survey of recent graduates, your university reports that the mean salaries of the former students are positively skewed. What are the consequences of choosing the mean rather than the median or the mode in this case?

A

The mean is likely to provide a number that is higher than the largest cluster of scores

22
Q

Hypothesis Test

A

a statistical method of evaluating whether differences among groups are meaningful, or could have been arrived at by chance alone

23
Q

Statistical Significance

A

the means of the groups are father apart than you would expect them to be by random chance alone

  • proposed by Ronald Fisher
  • not used for limited numbers of potential participants
24
Q

Null Hypothesis & Experimental Hypothesis

A

Null = any differences between groups (or conditions) are due to chance

Experimental = assumes that any differences are due to a variable controlled by the experimenter

25
Q

P-value

A

the probability of the results being due to chance

lower p-value = decreased likelihood that results were a fluke, and therefor, an increased likelihood that it was a good experiment

26
Q

Ronald FIsher

A
  • presented idea of significance testing, rejected null hypothesis
  • p-value cut-off point = p
27
Q

Paul Meehl

A
  • rejects significance testing

- more testing = more chance of fluke, p-value standard must be decreased as # of tests increase

28
Q

Jacob Cohen

A
  • developed power analysis
  • goal to calculate effect sizes = whether difference is statistically small or large
  • effect sizes allow researchers to adjust how much they believe that their hypothesis is true
29
Q

A hypothesis test is conducted after an experiment to?

A) determine whether the two groups in the study are exactly the same
B) determine how well the two groups are correlated
C) see if the groups are significantly different, as opposed to being different due to chance
D) summarize the distribution using a single score

A

see if the groups are significantly different, as opposed to being different due to chance

30
Q

Imagine an experiment where the mean of the experimental group is 50 and the mean of the control group is 40. Given that the two means are obviously different, is it still possible for a researcher to say that the two groups are not significantly different?

A) Yes, the two groups could overlap so much that the difference was not significant
B)Yes, if the difference was not predicted by the hypothesis
C) No, because the two groups are so far apart that the difference must be significant
D) No, in statistics a difference of 10 points is just enough to be significant

A

Yes, the two groups could overlap so much that the difference was not significant