Descriptive statistics Flashcards

(31 cards)

1
Q

Define Categorical data

A

Has 2 or more categories with no ordering to them (usually binary or nominal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define discrete data

A

Has a fixed value with a logical order (usually ordinal, ratio, or interval variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define continuous data

A

Can take any fractional value (usually ratio or interval variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can frequencies be presented?

A

As its raw frequency or as a percentage frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Mode

A

The score occurring most often in a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define Median

A

The middle score in a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define Mean

A

Sum of data point divided by number of data points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define Central tendency

A

Where the centre of our frequency distribution lies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which bar on a bar chart would be the mode?

A

The highest bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How would you work out the median in an odd value dataset?

A

(n+1)/ 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How would you work out the median in an even value dataset?

A

Middle two values/ 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are some pros of the median?

A

Insensitive to outliers, often gives a real data value and is useful for ordinal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some cons of the median?

A

Ignores a lot of the data, difficult to calculate without a computer and can’t be used with nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some pros of the mean?

A

Uses all of the data, is most effective for normally distributed datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are some cons of the mean?

A

Sensitive to outliers, values are not always meaningful and is only useful for ratio and interval data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the measures of spread for the mode?

A

There are no measures of spread

17
Q

What are the measures of spread for the median?

A

‘distance-based’ measures e.g. range and interquartile range

18
Q

What are the measures of spread for the mean?

A

‘centre-based’ measures of spread e.g. variance and standard deviation

19
Q

Define the interquartile range

A

Ignores most extreme values and is the range of scores within the middle 50% of scores

20
Q

Define the Lower quartile range

A

Median of lower half of data

21
Q

Define the Upper quartile range

A

Median of upper half of the data

22
Q

What are the pros and cons identical to?

A

The median pros and cons

23
Q

Define Deviance

A

Take each score and subtract it from the mean

24
Q

Define squared errors

A

Take each deviance score and square it

25
Define the variance
Average squared errors
26
What are the pros of variance?
Uses all of the data and forms the basis of several other tests
27
What are the cons of variance?
Requires a normal distribution and is sensitive to outliers
28
Define Standard deviation
A measure of spread that's equal to the unit of measurement of the dependent variable
29
How would you calculate the Standard deviation?
Using the square root of variance
30
What does the Standard deviation allow us to do?
Get an unbiased estimate of the population's standard deviation if we only have access to a sample of data
31
What can Standard deviance estimate?
Population based on a sample