Lecture 4 Flashcards

Summarizing Data Collected in the Sample (37 cards)

1
Q

What are the 4 types of variables?

A
  • dichotomous
  • ordinal
  • categorical
  • continuous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two pieces of information in every frequency distribution table for any type of variable?

A

frequency and relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate relative frequency (%)?

A

RF = (frequency / total # of responses) x 100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What two additional pieces of data is calculated for ordinal variables?

A

cumulative and cumulative relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two types of measurements of continuous variables?

A

central tendencies and variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the three central tendency measurements?

A

mean, median, and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the four variability measurements?

A

range, IQR, variance, and standard deviation (SD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If data is missing, is that individual still counted in the total # of persons?

A

No, only participants who provided a value are accounted for in the denominator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is mean denoted as?

A

x-bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the advantages to median?

A
  • not impacted by outliers
  • good index of what is “typical” if distribution is skewed
  • some categorical data can have the median applied
  • good for ordinal data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main disadvantage of median?

A

does NOT take the data values themself into account (only a position indicator)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the advantages of mode?

A
  • can be applied to any variable type (even nominal)
  • reflects and actual value from the data (easy to understand)
  • useful when there are multiple common values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the disadvantages to mode?

A
  • ignores majority of other info regrading distribution
  • tends to vary between samples
  • some samples do not have a mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In a normal distribution, where are all the central tendencies found?

A

in the center of the bell-curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In skewed distributions, where are the central tendencies found?

A

positive skew (tail to right)- mode > median > mean
negative skew (tail to left) - mean < median < mode
- in both the mean is what is off centered towards the tail end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Where are the outliers found in skewed curves?

A

on the lower tail ends

17
Q

What are the two types of variability?

A

heterogeneous (high variability)
homogeneous (low variability)

18
Q

What percentage of the variation does range represent?

19
Q

What is the advantage to range?

A

it’s easily understood

20
Q

What are the disadvantages to range?

A
  • only dependent on 2 scores (not all info taken into account)
  • sensitive to outliers
  • tends to vary between samples
  • influenced by sample size
21
Q

What percentage of the variation does the IQR represent?

22
Q

What are the advantages to the IQR?

A
  • reduced influence of outliers (only the middle 50% is looked at)
  • uses more information than range
  • helpful for evaluating/identifying outliers
  • appropriate for ordinal measures
23
Q

What are the disadvantages of the IQR?

A
  • not easy to compute
  • not well understood
  • doesn’t take all values into account
24
Q

What is deviance?

A

the spread of the scores (how different each score is from the center of a distribution)

25
What is the deviance equation?
deviance = xᵢ - X̄
26
What is variance?
indicates the total dispersion of scores from the MEAN
27
What is the variance equation?
variance = (sum of (devience)^2) / N - 1
28
What needs to be done to the variance in order to obtain the standard deviation?
square root the value
29
What is the interpretation of the standard deviation?
- high result = values are FAR off from the mean - low results = values are not that far off (close to the mean)
30
In a normal distribution, past what point are value considered outliers?
anything beyond +/- 1.96 standard deviations
31
When outliers are NOT present, which central tendency and variability values are appropriate?
sample mean and the standard deviation
32
When outliers ARE present, which central tendency and variability values are appropriate?
median and interquartile range
33
What is the equation for calculating the upper and lower limits to determine mild outliers?
lower limit = Q1 - (1.5*IQR) upper limit = Q3 - (1.5*IQR)
34
What is the equation for calculating the upper and lower limits to determine extreme outliers?
lower limit = Q1 - (3*IQR) upper limit = Q3 - (3*IQR)
35
What type of graph is used to represent dichotomous or categorical variables?
bar charts (frequencies and relative frequencies)
36
What type of graph is used to represent ordinal variables?
histograms (frequencies, relative frequencies, cumulative frequencies, and cumulative relative frequencies)
37
What type of graph is used to represent continuous variables?
boxplot (central tendencies and variabilities/ranges)