Year 1 Flashcards

(35 cards)

1
Q

What are the four scales of measurement?

A

Nominal, ordinal, interval, ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What distinguishes ordinal and interval?

A

Ordinal has a natural ordering to the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What distinguishes interval and ratio?

A

Interval does not have an absolute zero point but ratio data does

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 4 movements of a distribution?

A
  1. Central tendency
  2. Dispersion
  3. Skewness
  4. Kurtosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Negative skew?

A

Mode > mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Positive skew?

A

Mode < mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

platykurtic, mesokurtic, leptokurtic?

A

Platykurtic (k<3)
Mesokurtic (k~3)
Leptokurtic (k>3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Parametric data analysis requirements?

A
  1. continuous data
  2. n>30
  3. normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Non-parametric data analysis requirements?

A
  1. not continuous
  2. n<30
  3. non-normal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What process allows us to begin conducting arithmetic on parametric data?

A

Normalisation or standardisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What happens to the mean and SD after you carry out normalisation on a distribution?

A
Mean = 0
SD = 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What way is a null hypothesis always phrased?

A

Negatively

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

who decides the level of significance associated with hypothesis testing?

A

user based on opinion and consideration of distribution characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When is something classed as not statistically significant?

A

If the significance value falls outside the significance confidence threshold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the 2 ways of determining whether a distribution is normally distributed?

A

Q-Q plot

K-S test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does a q-q plot work?

A

Points should lie as closely along the line (representing a normal distribution) and be evenly distributed either side.

17
Q

How does a K-S test work?

A

Hypothesis testing - if the value returned lies outside the significance threshold then there is NOT a statistically significant difference between a normal distribution and the investigated normal distribution i.e. it is normally distributed

18
Q

What are inferential statistics?

A

Tests of difference between either samples and populations

19
Q

What are the 3 parametric inferential statistics?

A

On-sample t-test = sample and population
Two-sample t-test = sample and sample
ANOVA = 2+ samples

20
Q

What are relational statistics?

A

Testing for a relationship between variables i.e. correlation.

21
Q

What is the parametric relational statistic test?

A

Pearson’s correlation coefficient

22
Q

What are the 4 non-parametric statistical tests?

A

one-way chi = sample and population
two-way chi = 2+ sample
MWU = comparison of sample means
Kruskal Wallis = ANOVA

23
Q

What is the non-parametric relational statistic test?

A

Spearman’s Rank

24
Q

What is the ‘least squares criterion’?

A

the principle that the total difference between the points and the regression line is as small and identical either side.

25
What is the f-ratio?
the ratio of explained variance to unexplained variance
26
What is the coefficient of explanation for linear regression?
R squared
27
What are two sources of regression error?
Standard error = error associated with how difficult it is to represent data that is naturally very tricky sampling error = concerned with the regression line characteristics being incorrect or poor at representing data
28
What is homoscedascity?
When residuals from the regression line are consistently spaced either side of the regression line.
29
Why is homoscedascity important?
because that forms one of the assumptions held regarding analysis - that there are even residuals either side of the regression line
30
How do we test for homoscedascity/heteroscedascity in spss?
1. scatter plot needs to be well scattered with no clear patterns 2. P-plot needs to have similar amount of points either side of line and be well tied to the line 3. histogram needs to be normal in style (Gaussian)
31
What is autocorrelation?
When each correlation between x and y values are not independent i.e. the correlation is affected by something else
32
What is wrong with autocorrelation?
It is assumed to not occur in our parametric tests
33
What test do we use for testing for autocorrelation?
durbin watson
34
What is the range of values for significant positive autocorrelation, no autocorrelation and significant negative autocorrelation?
positive autocorrelation = 0-1.475 no autocorrelation = 1.566-2.434 negative autocorrelation = 2.525 - 4
35
what is the difference between autocorrelation and multicollinearity?
autocorrelation involves the correlation between one predictor and y being affected by something else whereas multicollinearity is when different predictors are linked so that distinguishing their individual impact on y is difficult to determine