statistics Flashcards

(43 cards)

1
Q

criteria for using parametric tests

A

populations drawn should be normally distributed
variances of population should be approxamiately equal
at least interval level data
no extreme scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the rule for a test to be significant

A

if the test has the letter “r” in it, for significance the observed value needs to be greater than critical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is nominal data

A

data that can be put it into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

strength of nominal data

A

Easy to generate from closed questions; large amounts of questions can be collected quickly; increasing reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

weakness of nominal data

A

only mode can be used as a measure of spread
doesn’t provide any explanations for data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is ordinal data

A

results are able to be ranked in order
but the difference between each score is not known and does not have to be equal
e.g. test scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

strength of ordinal data

A

can find median and mode
provides more info than nominal as indicates values on a scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

weakness of ordinal data

A

gaps between points are unknown so is hard to compare and can be interpreted differently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is interval data

A

most precise level of data
scale of equal or known units with equal distances
e.g. temperature, time in seconds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

strength of interval data

A

most informative
points are linear with equal gaps so mean, modian and mode can all be calculated
standard deviation can be calculated to analyse dispersion
most reliable and scientific

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

weakness of interval data

A

do not gain any answers on why , lack of explanation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how to calculate mean
strengths and wekanesses

A

add all data togheter then divide by number of pieces of data
strength- includes all pieces of data so is an accurate representation
weakness- extreme scores can distort the value, so is not an appropriate measure of central tendency if data set is skewed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how to calculate median
strenghts and weaknesses

A

all numbers in order, is middle number, if there are 2 numbers in middle add together and divide by 2.

one strength is extreme values do not distort the value
is a strong measure of central tendency even if data set is skewed
one weakness is it could be time consuming and difficult to calcuate with a large set, is less representative as does not depend on all pieces of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how to calculate mode
strenghts and weaknesses

A

mode is the one that appears the most

strenght- can use when data is quantatitive or qualitative, so always allows for analysis for most frequent category

weakness- does not accurately reflect the data set
only works out most popular, so is useless

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is a measure of central tendency

A

single data point which summarises whole data srt
for example mode, median, mean
is a summary measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is a measure of dispersion

A

measures level of spread or variability in data set
for example range, interquartile range and standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is discrete data

A

can be counted
distinct and speerate pieces of data
for example number of CD’s sold

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is continous data

A

has infinite value
e.g. height, time, age

19
Q

characterisitcs of bar chart

A

discrete data
not touching colums
used for comparing categoies
dependent variable on Y axis
independent variable on X axis
has equal intervals

20
Q

characterisitcs for histograms
including equation

A

continious data
columns touch
areas of columns show frequency
frequency density is on Yaxis
continious scale on X axis

frequency density= frequency/class width

21
Q

characterisitcs for pie charts

A

discrete data
shows contribution to overall total
shown as %
represented as an angle

22
Q

characterisitcs for line graphs

A

continious data
compares 2 or more conditions
continious scale on X axis
frequency on Y axis
data points connected with straight line

23
Q

characterisitcs of scatter diagrams

A

continious data
measures relationship between 2 variables
one variable on X axis, the other on Y axis
shows correlation

24
Q

what is range
strenghts and weaknnesses

A

range is highest answer take lowest

is easy and quick to calculate
but
affected by extreme values
doesn;t take into account distribution around mean

25
what is variance how to work it out strengths and weaknesses
calculates difference between each piece of data and mean, the larger the standard deviation, the more spread out data is. find the mean, find the difference between piece of data and mean, square each difference, sum the squares, and divide by the number of data points minus one. strengths-takes all scores into account, more representative weaknesses- may hide some characterisitcs of data, less useful on its own, often need standard deviation
26
what is standard deviation strenghts and weaknesses
tells us average number that differs from mean is the square root of variance strenghts- most precise measure of dispersion, all values considered returns to units same figure as mean making it easier to judge the smaller the SD, the more similar it is to mean
27
draw negatively skewed normal skewed and positively skewed labelling mean, median, mode
28
when do normal distribution occur
when variables distributed so most of scores are clustered around mean, median and mode 50% of scores to left, 50% to right of mean
29
when does positively skewed distribution occur
most scores bunched to left mode is to left of mean as mean is affected by anamolies
30
when does negatively skewed distribution occur
most scores are bunched to the right mode is to right of mean
31
type one error definition
says results are significant when theyre not
32
type two error defintion
says results are not significant when they are
33
when would you use chi square test
independent measures nominal data test of difference
34
when would you use mann whitney test
independent measures ordinal/interval data test of difference
35
when would you use wilcoxon test
ordinal/ interval repeated measures test of differences
36
when would use binomial test
repeated measures nominal test test of differences
37
when would you use spearmans rho
at least ordinal data exploring relatioship between co variables correlational design
38
steps for mann whitney
rank both sets of data together place them back into table find overall ranking for each category either r1 or r2 use smallest total (r1 r r2) in the equation u1=r1-n(n+1)/2 n= number of particpants to be significant. observed U value has to be equal or less than critical value at p 0.05 significance level
39
steps for wilcoxon
find differences between data sets rank the differences, if difference is 0 ignore count how many positive and negative there are add together the differences of less frequent sign n= number of differences (not including 0s) to be significant oberved value has to be equal or less than crtical at p 0.05 significance
40
steps for chi square
(piece of data already in cell is observed) find totals for each column find expected frequencies for each cell using row total x column total/ overall total use formula (observed-expected)2/expected for each cell finally add all numbers for each cell together calculate degree of freedom using (number of rows - 1) x (number of columns - 1) observed chi square value needs to be more than critical value to be significant
41
steps for binomial test
label flow of direction if same answer in both categories ignore count number of each direction of flow smallest number of direction of flow is observed n= number of pps whos scored was use (don't include the ignore) to be significant, observed value has be smaller or equal to critical binomial
42
steps of spearmans rho
rank each column individally find differences between ranks square differences this is sgima d 2 google equation cant inset it lol x n= number of participants do not ignore zeros score will be between 0 and 1 for significance, observed value needs to be more than spearmans rho crtical value
43