Data: Descriptive Statistics Flashcards

1
Q

What are the various measures of central tendency

A
  • Mode
  • Median
  • Mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you decide which measure of central tendency to use

A

Based on the levels of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

which measure of central tendency you’d use if the data was nominal

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

which measure of central tendency you’d use if the data was ordinal

A
  • the median
  • then the mode if calculating the median isn’t possible
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

which measure of central tendency you’d use if the data was interval/ratio

A
  • preferably the mean
  • however, if the data is skewed…
  • you need to calculate the median
  • then the mode if calculating the median isn’t possible
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the strength of using the mean

A
  • takes all the scores into account so it is the most sensitive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the weaknesses of using the mean

A
  • impacted by extreme values (i.e. outliers) - it will be skewed (artificially inflated or deflated)
  • not useful when a decimal point is not an option for the data (e.g. 2.4 cola bottles)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the strength of using the mode

A
  • Not impacted by extreme values (i.e. outliers) — it will not be skewed (artificially inflated or deflated)
  • useful for nominal data as it is the only method
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the weakness of using the mode

A
  • Doesn’t take all the scores into account so it is not as sensitive
  • there may be several or none
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the strength of using the median

A
  • not impacted by extreme values (i.e. outliers) - it will not be skewed (artificially inflated or deflated)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the weaknesses of using the median

A
  • Doesn’t take all the scores into account so it is not as sensitive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

EXAMPLE:
In an unethical experiment 3 groups of 8 lab rats were given a maze to complete and times were recorded in seconds.
Group 1) Rats given brain lesions - 35, 27, 26, 27, 28, 79, 27, 30
Group 2) Rats with tails cut off - 15, 10, 18, 22, 8, 49, 16, 22
Group 3) Rats with eyes damaged - 33, 33, 32, 28, 67, 45, 24, 29
Which measure of central tendency should be used + why?

A
  • the data was ratio, however there were outliers in the groups
  • therefore calculating the median is the most appropriate measure of central tendency
  • because we can’t calculate the mean because there are outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe the characteristics of a Bar chart

A

used for nominal (category/not continuous data):
- frequency = Y-axis
- categories = X-axis
- gaps between each bar represents the lack of continuity
For experiments: IV = X-axis, DV = Y-axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe the characteristics of a Histogram

A

Used for continuous data:
- frequency = Y-axis (it must start at 0)
- continuous data = X-axis
- no gap between each bar, represents continuity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe the characteristics of a Line graph

A
  • frequency = Y-axis (must start at 0)
  • used for continuous data = displayed on X-axis
  • each dot is connected by a line
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe the characteristics of a Pie chart

A
  • suitable for nominal data
  • each slice represents a portion of
  • each slice is calculated by a portion of 360’
17
Q

Describe the characteristics of a Scatter diagram

A
  • each covariable is plotted on y and x axis
  • suitable for continuous data (displayed on both x and y axis)
  • each dot represents 2 scores
  • the scatter of the dots indicates the relationship between the covariable
18
Q

What is the definition of range

A

The distance between the top and bottom values in the data set

19
Q

How do you calculate the range in psychology

A

Subtract the lowest from the highest, then add one

20
Q

Why is it helpful to calculate the range

A
  • useful to describe data
  • help explain differences in data, not visible by only calculating the mean
21
Q

What does a high standard deviation score mean

A

Scores are spread widely from the mean

22
Q

What does a low standard deviation score mean

A

Scores are clustered near the mean

23
Q

What does it mean if the standard deviation score is 0

A

All the values in the data set are the same

24
Q

How do you calculate a standard deviation

A
25
Q

What is meant by standard deviation

A

A measure of dispersion, shows how the data is spread around the mean

26
Q

Explain the steps you would follow to calculate a standard deviation

A

1) calc the mean of the scores in the data set
2) take mean away from each score in data set
3) square each difference
4) add the sum of all the squared differences
5) divide this by the number of scores minus one
6) Calc the square root of the divided data ——> this is the standard deviation

27
Q

What is the strength of using a range

A

Easy to calculate

28
Q

What is the disadvantage of using a range

A
  • impacted by extreme values (e.g. outliers), it will be skewed (artificially inflated or deflated)
  • Fails to take into account the distribution of scores around the mean (don’t know if the scores are close together or far apart)
29
Q

What is the strength of using standard deviation

A
  • more precise and informative measure of dispersion than range (because it takes all the values into account)
  • Highlights if the mean is an appropriate measure of central tendency
  • Used in further statistical analysis, such as computing skewness
  • less affected by anomalous results than range scores
30
Q

What is the weakness of using standard deviation

A
  • can only be used if the data set is normally distributed and not skewed
  • more difficult to calculate than the range score
  • can only be used when data collected is ordinal level or above
  • Can only be used where an IV is plotted against frequency