Module 14: Descriptive statistics Flashcards

Question 1

Q

2 most common ways to sumamrize data

Answer

A

measure of central tendency
Measure of variability

Question 2

Q

Measure of central tendency (4)

what it is + represented by

Answer

A

A measure of the typical value in a collection of numbers or a data set
- measured by mean, median and mode

Question 3

Q

Mean (2)

+ how to find?

Answer

A

The average
Sum of all the scores divided by the total number of scores

Question 4

Q

Population mean

Question 5

Q

Sample mean

Question 6

Q

Median (2)

How to find?

Answer

A

The value that lies in the middle of the data when the data set is ordered
- First rank the data, then the position of the median is equal to the number of enteries plus one divided by 2

Question 7

Q

Odd number of entries when caculating median:

Answer

A

median is the middle data entry

Question 8

Q

Even number of entries when calculating median:

Answer

A

Median is the mean of the 2 middle data entries

Question 9

Q

Mode

Answer

A

The most frequent value

Question 10

Q

If no data set is repeated then the data has no

Question 11

Q

If two entries occur with the same greatest frequency each entry is a — and is called

Answer

A

mode
bimodal

Question 12

Q

Finding the mode

Answer

A

finding the greatest frequency

Question 13

Q

Advantage of using the mean (2)

Answer

A

most common statistic
Takes into account every entry of a data set

Question 14

Q

Disadvantage of using the mean (2)

Answer

A

greatly affgected by extreme scores (outliers)
Knowledge about individual cases is lost with averages

Question 15

Q

Advantages of using the median (2)

Answer

A

Little influence by extreme scores
Reasonable estimate of what most people mean by the center of a distribution

Question 16

Q

Disadvantage of using the median

Answer

A

may not be good to ignore extreme values

Question 17

Q

Advanatges of using the mode (2)

Answer

A

the most frequently obtained score
not influenced by extreme score

Question 18

Q

Disadvanatge of using the mode (2)

Answer

A

may not represent a large proportion of the scores
ignores extreme values

Question 19

Q

Variability

Answer

A

numbers which describe how spread out a set of data is

Question 20

Q

Examples of variability meausres (4)

Answer

A

range (interquartile range)
deviation
variance
standard deviation

Question 21

Q

Range+ formula (2)

Answer

A

length of the smallest interval that contains all the data

range= largest value - smallest value

Question 22

Q

range is sensitive to

Answer

A

sample size: small samples= less range (less respresentative range)
extreme scores (tells you smallest and largest but not bulk)

Question 23

Q

Interquartile range (2)

+ formula

Answer

A

Measure of distance between first and third quartiles
- IQR= Q3-Q1

Question 24

Q

second quartile is the

Question 25

Q

benefits of IQR (2)

Answer

A

less affected by extreme values
helpful for identifying outliers

Question 26

Q

Quartile (2)

What it is+median

Answer

A

positions in a range of values representing multiples of 25%
50% of scores fall below median, 50% scores above

Question 27

Q

First quartile (Q1)

Answer

A

25% of scores fall below Q1, 75% above

Question 28

Q

Third quartile (Q3)

Answer

A

75% of scores fall below Q3, 25% above

Question 29

Q

deviation

Answer

A

The diference between each score and the mean of the data set

How far you are from the mean

Question 30

Q

deviation formula

Question 31

Q

deviation scores always sum to

Question 32

Q

Difference between deviation and IQR/boxplots

Answer

A

Deviation scores show dispersion around the mean, IQR and boxplot show dispersion around the median

Question 33

Q

Variance

Answer

A

single number representing the average amount of variation in a set of scores/ how spread out the scores are

Question 34

Q

Steps for finding the sample variance (5)

Question 35

Q

Standard variation

Answer

A

Measure of the spread of scores out from the mean of the sample

Question 36

Q

How to cauclate standard deviation

Answer

A

calculate the variance
find the square root

Question 37

Q

Population standard deviation formula

Question 38

Q

Standard deviation is a measure of the typical amount an entry deviates from the mean, thus the more entries are spread out, the

Answer

A

greater the standard deviation

Question 39

Q

Descriptive statistics (2)

Answer

A

cannot make predictions or generalizations
only drawing conclusions about current sample and not extrapolating or going beyond

Question 40

Q

inferential statistics (2)

Answer

A

can make predictions or generalizations
allow conclusions about the population based on data from a sample

Question 41

Q

Data matrices

Answer

A

a table or worksheet that organizes the data together with all the variables of interest

Question 42

Q

Frequency distributions

Answer

A

A table indicating the frequency of each value in a data set

Question 43

Q

Histogram (3)

What it is+ illustrates+can help identify

Answer

A

A graphical representation of the frequency of a variable
illustrates the distribution of scores
can help identify outliers or violations of normal distribution assumptions

Question 44

Q

Answer

A

symmetrical

Question 45

Q

Answer

A

Negative skew or left skew

Question 46

Q

Answer

A

Positive skew/right skew

Question 47

Q

Central tendency

Answer

A

helps identify the typical or most common value in data

Question 48

Q

Measures of central tendency

Answer

A

Mean
median
mode

Question 49

Q

measure of central tendency for symmetrical distribution/ skewed

Question 50

Q

If the average is 100 and the standard deviation is 10, then there is

Answer

A

2/3 of the data that falls between 90 and 110

Question 51

Q

for data that is skewed or has outliers, —- may be better choice to describe the centre of the distribution

Question 52

Q

Q position

Answer

A

Qposition= [(Q#)(n+1)]/4

Q#= number of quartile your trying to find

Question 53

Q

Round Q position to

Answer

A

the median

Question 54

Q

How to find outlieers with IQR

Question 55

Q

Scatterplots

Answer

A

visualize the form, direction and strength of 2 variable relationships

Question 56

Q

correlation coefficients

Answer

A

indicate the degree of covariance between variables: how much one variable changes in relation to another

Question 57

Q

Data points that are more closely positioned around the best fit line represent

Answer

A

a stronger relationship than when data points are further from the lines

Brainscape's Knowledge GenomeTM

Module 14: Descriptive statistics Flashcards

Brainscape's Knowledge Genome^TM