Statistics Flashcards

NOTE TO SELF SORT OUT STATS FLOW CHART

1
Q

What is the mean?

A

Add all the values together and divide by the number of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the median?

A

The middle value (Average middle 2 if odd number of values)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the mode?

A

The most frequently observed value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the standard deviation?

A

Estimate of the average variability (spread) of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the 95% confidence interval?

A

The range of values around that statistics (so 95% of values will fit into between the range of values)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is standard error?

A

The standard deviation of the sampling distribution of a given statistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the range?

A

The range in values (Maximum and minimum)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the basic statistics?

A

The mean, mode, range, median, 95% confidence interval and standard deviatio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you find the basic statistics using SPSS?

A

1) Enter data into SPSS
2) Analyse
3) Descriptive statistics
4) Explore
5) Add the variable into the dependant list
6) OK

YOU MUST HAVE ALL DATA IN SEPARATE COLUMNS (i.e. do not stack)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is P?

A

The p value is the probability of falsely rejecting the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the null hypothesis?

A

The null hypothesis is that there is no difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why do we use statistics?

A

To test our data against the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does p < 0.05 mean

A

p < 0.05 is described as significant.

Essentially if its less than 0.05 you are 99.95% sure there is a significant difference!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Example of hypothesis and null hypothesis?

A

Hypothesis: Paracetamol will reduce symptoms of a headache

Null hypothesis: Paracetamol may reduce symptoms of a headache

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is normal distribution?

A

Normal distribution is:

  • Ensures that the data is representative of a “normal population”
  • Important for quantitative data
  • Helps determine how we present data
  • Helps determine what statistical test we use to analyse data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the tests used for normal distribution?

A

To see whether data is normally distributed use;

  • Kolmogorov - Smirnov test

OR

  • Shapiro - Wilk test
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

When do you use parametric or non parametric test?

A

P value is < 0.05 = NOT normally distributed = non parametric test

P value is > 0.05 = normally distributed = parametric test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How can you test for normality i.e a Kolmogorov Smirnov test or Shapiro Wilko test?

A

You can do this at the same time as basic statistics in SPSS. YOU MUST HAVE ALL DATA IN SEPERATE COLUMNS (i.e. do not stack)

1) Enter data into SPSS
2) Analyse
3) Descriptive Statistics
4) Explore - Add the variable into the dependant list
5) Click plots
6) Click normality plot with tests
7) Histograms
8) Continue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What would happen if one was significant and the other wasn’t?

A

It is not normally distributed they both need to be normally distributed in order to use the parametric test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a type 1 error?

A

Occurs when we believe that there is a genuine effect in our population, when in fact there isn’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a type 2 error?

A

Occurs when we believe there is no effect in the population, when in reality there is.

22
Q

What is one tailed?

A

A test of directional hypothesis i.e. x will predict y or x is different from y

23
Q

What is two tailed?

A

A test of non directional hypothesis i.e. x may or may not predict y or x is maybe different from y

24
Q

What do we do with parametric data?

A
  • Use when data is normally distributed

When comparing 2 groups:

  • Independent T test - two different experimental conditions/ groups
  • Dependant T test - two groups that are dependant on each other

Comparing 3 or more groups:

One way ANOVA - You have 3 or more groups

25
How can you determine if your data is dependant or independent?
Independent = Compare participants in different group E.G. Different people receiving different treatments. Dependant = Compare participants in the same group E.G. Same people tested twice receiving the same treatment
26
What to do with 2 groups and data that is normally distributed?
2 groups and data that is normally distributed Dependant on each other OR Independent groups Dependant T test Independent T test
27
How do you do a dependant T test?
1) Analyse 2) Compare means 3) Paired samples t test Then add the data from test 1 under variable 1 and data for test 2 under variable 2 then press ok
28
How do you do independent T test?
Stack variables in one column. Then add group numbers (e.g. 1 and 2) into the next column and define groups 1) Analyse 2) compare means 3) Independant samples T test Then add the column of stacked data into the test variable box and the groups column into the grouping variable box. Define groups and press ok Also check levene's test, are equal variances assumed?
29
What does sig .000 mean?
P value < 0.0001
30
What is levene's test?
Tests whether groups are similar
31
What does ANOVA tell us?
ANOVA (Analysis of variance) which tell us if there is a difference between all groups But an ANOVA DOES NOT tell us where the difference is between groups, e.g. is group A different from group B and C or just group B We use Post - hoc tests to compare each group to work out which is different from each other
32
What do we for more then 3 groups?
If 3+ groups and data is normally distributed? Enter data into SPSS - Analyse - compare means - One Way ANOVA - Select your post hoc test (LSD and Tamhane's) - click options - choose homogeneity of variance test Check ANOVA sig/ P value is it < 0.05 Yes | No Check homogeneity of variance test is that significant? | Just report ANOVA, no need for further analysis No = Use LSD Post Hoc results Yes = Use Tamhane Post Hoc results MUST GROUP DATA E.G. ADD VARIABLE 1 TO TEST VARIABLE BOX AND GROUPS/VARIABLE 2 INTO GROUPING VARIABLE BOX. CHECK DESCRIPTIVES TO GET BASIC STATS TOO
33
How does ANOVA work?
ANOVA is < 0.05, check homogeneity of variance test We use LSD post hoc as homogeneity of variance test > 0.05 if (<0.05 use tamhanes)
34
What is ANCOVA
Compares several means but adjusts for the effects of one or more other variables
35
What is repeated measures ANOVA
ANOVA conducted when any independent variable or variables have been measured from the same participants in all conditions
36
What is factorial ANOVA?
2 or more independent variables or predictors
37
Tell me about non - parametric data?
- Use when data is NOT normally distributed Comparing 2 groups: Mann Whitney test - 2 different experimental conditions/groups, a non - parametric version of an independent test Wilcoxon Signed rank test - 2 groups that are dependent on each other, a non parametric version of a dependent t test. Comparing 3 or more groups: - Kruskal Wallis Test - You have three or more groups , a non - parametric version of a One - way ANOVA
38
How to do a Wilcoxon signed rank t test?
1) Analyse 2) non parametric tests 3) related samples Under fields tab transfer your parameters to the test field box Go to settings and choose Wilcoxon matched pair signed rank test (2 samples)" Press run
39
How to do a Mann Whitney T test?
Stack variables in one column. Then add group numbers into the next column e.g. 1 and 2. They define groups 1) Analyse 2) Non parametric test 3) Independent samples 4) Automatically compare distribution across groups. Then add the column of stacked data into the Test Variable box and the groups column into the grouping variable box Settings - Customize tests - Select Mann Whitney U (2 samples) - Run
40
How to do a Kruskal Wallis test?
3 + groups and data is NOT normally distributed 1) Enter data into SPSS 2) Rearrange data like you would for One - way ANOVA 3) Analyse 4) Non parametric test 5) Independent samples 6) Automatically compare distributions across groups 7) Add BMD/variable 1 into test variable box and groups/variable 2 into groups box Go to settings customize tests and Kruskal Wallis and run Check sig p value, is it <0.05 If yes, Run multiple Mann Whitney tests between each group and report the p values and determine which group shows significance If no, just report the p value no need for further analysis.
41
How to run a post hoc analysis?
To run a post hoc analysis (a bit like we did with a one way ANOVA) we need to run a individual Mann Whittney Tests which is a bit time consuming N.B. To do this you only need to run 2 groups at a time it will not work otherwise
42
What is correlation?
Correlation is measuring the relationship between 2 variables. Positive = one increases the other increases Negative - As one increases the other decreases. There are 2 types of tests: For parametric data = Pearson's correlation coefficient For non parametric data = Spearman's correlation coefficient
43
What is correlation coefficient score?
A correlation coefficient score has to lie between + 1 and - 1 A + 1 = perfect positive a -1 = perfect negative 0 = no relationship
44
Correlation grading
0.0 - 0.3 = Poor correlation 0.3 - 0.5 = Low correlation 0.5 - 0.7 = Moderate correlation 0.7 - 0.9 = Strong correlation 0.9 - 1 = Very strong correlation can refer to both positive and negative
45
How to do pearson?
Is your data normally distributed Yes = pearson Parametric data = normally distributed Enter data next to each other Analyse - correlate - bivariate - tick pearson for parametric data
46
How to do spearmans?
Is your data normally distributed No = Spearman Non - parametric - not normally distributed Enter data next to each other Analyse - correlate - Bivariate - Tick spearman for non parametric data
47
How do you report the correlation?
You should report the Pearson correlation coefficient as r and the p value For example: r = 0.796 (p<0.001) indicates a strong positive correlation You should report the spearman correlation coefficient as p and the p value For example: p = 0.796 (p<0.001) indicates a strong correlation
48
What is regression?
Simple regression = Predict an outcome variable from one predictor variable Multiple regression = Predict an outcome from several types of predictor variables Binary logistic regression = Predict a categorial outcome from several types of predictor variables.
49
Simple regression?
Dependent = what you are trying to predict Independent = What you are using to try and predict the variable Add data into SPSS next to each other - Analyse - Regression - Linear R = your Pearson's correlation R square = Look a it telling you how much one variable accounts for the other e.g. 0.948 = 94.8% Sig level = <0.001 = does significantly predict Multiple regression = aim for n = 5 - 10 people It tells us which variable predict the outcome
50
What is nominal data?
Nominal ( Categories wiht no obvious order) : Gender, smoker or non - smoker, eye colour
51
What is ordinal data?
Ordinal (Categories that can be ranked in order) : Shoe size, Age categories
52
Why do we use Pearson's chi squared test?
To look at the relationship between two categorical variables Chi squared will test the null hypothesis, i.e. that they are similar